• Title/Summary/Keyword: speechTool

Human Voice, This Mystery

  • Horiuchi, Terumichi
    • Proceedings of the KSPS conference / 1996.10a / pp.378-378 / 1996
  • Human beings and chimpanzees are very much alike, and scientists say there is only a 1% difference between them. Contrary to our expectations, the difference lies not in the brain but in the trachea (windpipe). The human trachea is bigger and longer than that of the chimpanzee. This means more air is inspired and expired as breath. There are interesting descriptions of breath in the Bible. Genesis says God made a man out of soil and breathed life-giving breath into his nostrils, and the man began to live. Another part says life exists between incoming breath and outgoing breath. Thus breath plays a key role in our life. In Hebrew and Greek, breath and spirit are the same word: in Hebrew it is ‘Luahf’ and in Greek, ‘Pneuma’. With breath and the organs of the mouth human beings produce voice, and with heritage and through learning we train our voice to reach the level of language, which conveys our culture. My contention is that we should recognize the gift of voice and train it so that it can function properly as a tool for conveying our thought and culture. This is a kind of speech practice, and it may be called speechology. It includes the following practical methods: 1. Try to read aloud. 2. Encourage recitation. 3. Practice public speaking as much as possible. 4. Learn the theories of phonetics, such as pronunciation, accent, intonation, prominence, assimilation and so on.

Implementation of Information Access Embedded System for the Blind People (시각 장애인을 위한 정보접근 임베디드 시스템의 구현)

  • Kim, Si-Woo;Lee, Jae-Kyun;Lee, Chae-Wook
    • The Journal of Korean Institute of Communications and Information Sciences / v.33 no.2C / pp.167-172 / 2008
  • Since a 2-dimensional (2D) bar code can retrieve data and information quickly, it is widely used and recognized as a useful tool for many industrial applications. However, the information capacity of the 2D bar code is still limited. Recently the analog-digital code (AD code), which has the largest storage capacity yet contained in a code, has been developed; because it overcomes this limitation on data capacity, it expands the bar code's application range. In this paper, we present the AD code and implement an effective embedded system that transforms text information into voice using the 2D AD code and Text To Speech (TTS). By capturing the AD code printed on paper or in books, this voice information can be delivered to blind people as well as older people.

Mieko Han and her Works on Korean Phonetics (Mieko Han의 한국어 음성학 연구)

  • Ko, Do-Heung
    • Speech Sciences / v.1 / pp.213-223 / 1997
  • This paper gives a general review of Mieko S. Han, who made a significant contribution to the study of Korean phonetics during the 1960s and early 1970s. As both a single and joint author, Dr. Han published papers that are important in both quantity and quality and that Korean phoneticians still cite today. Before the work of Dr. Han, a professor in the Department of East Asian Languages & Cultures at USC, there were only a few phonetics-related publications in Korea, most of which were papers or books based on a non-experimental, traditional approach. Traditionalism and structuralism are known to have coexisted in the field of Korean linguistics. It was, however, fortunate that we had two important phoneticians (M. Han and Chin-W Kim) abroad at that time. Mieko Han's concern was to investigate the experimental characteristics of the system of Korean vowels and consonants using a spectrograph, which was the single most important tool for analysing phonetic data at that time. Dr. Han conducted her experimental studies on Korean phonetics, mostly funded by the Office of Naval Research, in terms of duration, fundamental frequency, Voice Onset Time (VOT), intensity, and so on. This paper aims to re-appreciate Dr. Han's specific contribution to the study of Korean phonetics, since she played an important role as a pioneer of early Korean phonetics. Further, Dr. Han's works are highly recommended as a first step for graduate students who seriously wish to specialize in Korean phonetics.

A Study on Voice User Interface for Domestic Appliance (가전제품의 음성 인터페이스 디자인 적용에 대한 연구)

  • Hong, Ji-Young;Jeon, Myoung-Hoon;Han, Kwang-Hee;Chae, Haeng-Suk
    • Science of Emotion and Sensibility / v.10 no.1 / pp.55-68 / 2007
  • This paper describes a Voice User Interface (VUI) method and a design guideline tool that support studies of domestic appliances. It covers the specification of user requirements and the selection of an appropriate VUI to represent speech generation. The criterion for this paper is interaction design that enhances user engagement. The studies were carried out to evaluate prototypes of domestic appliances such as a refrigerator, a washing machine, a Gimchi refrigerator, an oven range, a dishwasher and an air conditioner. This paper presents a study of user preferences and suitability. The implications of these findings for voice interface design are discussed, and it is suggested that a VUI guideline and optimal prototyping can provide useful application tools in the design process.

Usefulness of Cepstral Peak Prominence (CPP) in Unilateral Vocal Fold Paralysis Dysphonia Evaluation (일측성 성대마비 환자 평가에서 Cepstral Peak Prominence의 유용성)

  • Lee, Chang-Yoon;Jeong, Hee Seok;Son, Hee Young
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.28 no.2 / pp.84-88 / 2017
  • Background and Objectives: The purpose of this study was to compare the usefulness of cepstral peak prominence (CPP) with that of Multi-Dimensional Voice Program (MDVP) parameters in evaluating patients with unilateral vocal fold paralysis and subjective voice impairment. Materials and Methods: From July 2014 to August 2016, 37 patients who had been diagnosed with unilateral vocal fold paralysis and had received two or more voice tests before and after the diagnosis were evaluated for maximum phonation time (MPT), MDVP, and CPP. Voice tests were performed with the short vowel /a/ and paragraph reading. Results: CPP-a (CPP with vowel /a/) and CPP-s (CPP with paragraph reading) were statistically negatively correlated with G, R, B, and A before voice therapy. Jitter, Shimmer, and NHR of the MDVP were positively correlated with G, R, and B, and were significantly correlated with the cepstral indices. G, B, and A showed a statistically significant negative correlation with CPP-a and CPP-s, with somewhat higher correlation coefficients between 0.5 and 0.78. Among the MDVP indices, on the other hand, only Jitter showed a positive correlation with G and B, with a coefficient of 0.4. Conclusion: CPP can be an important tool for evaluating speech in unilateral vocal cord paralysis when speech energy changes or the cycle is not constant during speech.

Pilot Study on the Classification for Sasangin by the Voice Analysis (음성분석에 의한 체질진단에 관한 연구)

  • Lee Eui-Ju;Song Kwang-Bin;Choi Hwan-Soo;Yoo Jung-Hee;Kwak Chang-Kyu;Sohn Eun-Hae;Koh Byung-Hee
    • The Journal of Korean Medicine / v.26 no.1 s.61 / pp.93-102 / 2005
  • Objective: This research was conducted to evaluate the method of sasangin classification by voice analysis. Two pilot tests were designed to answer the following questions: 'What are the conditions for classifying sasangin by voice analysis?' and 'What are the important variances of the /a/ parameters?' Methods: 122 disease-free, healthy volunteers were diagnosed for sasangin type using the QSCC II. First, they said /a/ three times for 2 seconds in their usual voice. Second, they said /a/ for 2 seconds in three different ways: high tone, mid tone, and low tone. The sounds were collected with a recording program (Cool Edit 2000) through a Sony microphone (ecm-26l). We analyzed the voices with MATLAB, a simulation tool. Results: When one said /a/ three times for 2 seconds in the usual voice, there were no differences and there were correlations. When one said /a/ three times for 2 seconds in the high, usual, and low voices, some parameters were correlated and the others were not. We evaluated the value of the sasangin classification method using only /a/ voice analysis. The average hit ratio was 66.3%: soyangin 67.9%, taeumin 68.0%, soeumin 63.9%. Conclusion: The conditions for using the sasangin classification method by voice analysis must be established. Sasangin classification using only /a/ voice analysis achieved a hit ratio of 66.3%.

Implementation of Real-time Vowel Recognition Mouse based on Smartphone (스마트폰 기반의 실시간 모음 인식 마우스 구현)

  • Jang, Taeung;Kim, Hyeonyong;Kim, Byeongman;Chung, Hae
    • KIISE Transactions on Computing Practices / v.21 no.8 / pp.531-536 / 2015
  • Speech recognition is an active research area in human-computer interfaces (HCI), where one objective is to control digital devices by voice. The mouse, meanwhile, is a computer peripheral that is widely used in graphical user interface (GUI) computing environments. In this paper, we propose a method of controlling the mouse with the real-time speech recognition function of a smartphone. The processing steps are: extracting the core voice signal after receiving a voice input of suitable length in real time; extracting features as mel-frequency cepstral coefficients (MFCC) and quantizing them with a learned codebook; and finally recognizing the corresponding vowel with a hidden Markov model (HMM). A virtual mouse is then operated by mapping each vowel to a mouse command. Finally, we show various mouse operations on a desktop PC display using the implemented smartphone application.
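
The quantize-then-recognize steps described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the two-dimensional feature vectors stand in for real MFCC frames, and the codebook, the two toy HMMs, and the vowel-to-command table (`COMMANDS`) are all hypothetical values chosen for the example.

```python
import math

def quantize(frames, codebook):
    """Map each feature vector to the index of its nearest codeword (Euclidean)."""
    symbols = []
    for frame in frames:
        distances = [math.dist(frame, codeword) for codeword in codebook]
        symbols.append(distances.index(min(distances)))
    return symbols

def forward_likelihood(symbols, start, trans, emit):
    """Likelihood of a symbol sequence under a discrete HMM (forward algorithm)."""
    n_states = len(start)
    alpha = [start[s] * emit[s][symbols[0]] for s in range(n_states)]
    for sym in symbols[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in range(n_states)) * emit[s][sym]
            for s in range(n_states)
        ]
    return sum(alpha)

def recognize(symbols, models):
    """Return the vowel whose HMM assigns the highest likelihood to the symbols."""
    return max(models, key=lambda v: forward_likelihood(symbols, *models[v]))

# Hypothetical two-codeword codebook (real systems learn it from MFCC data).
CODEBOOK = [(0.0, 0.0), (1.0, 1.0)]

# Hypothetical two-state HMMs: (start probs, transition matrix, emission matrix).
MODELS = {
    "a": ([1.0, 0.0], [[0.5, 0.5], [0.0, 1.0]], [[0.9, 0.1], [0.8, 0.2]]),
    "i": ([1.0, 0.0], [[0.5, 0.5], [0.0, 1.0]], [[0.1, 0.9], [0.2, 0.8]]),
}

# Hypothetical mapping from recognized vowel to a virtual-mouse command.
COMMANDS = {"a": "move_left", "i": "move_right"}

def vowel_to_command(frames):
    """Run the quantize -> HMM scoring -> command lookup chain on one utterance."""
    symbols = quantize(frames, CODEBOOK)
    return COMMANDS[recognize(symbols, MODELS)]
```

Calling `vowel_to_command` on frames that lie near the first codeword selects the "a" model and returns its command. A real system would use higher-dimensional MFCC frames, a much larger codebook, and HMMs trained on recorded vowels.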

MPEG-D USAC: Unified Speech and Audio Coding Technology (MPEG-D USAC: 통합 음성 오디오 부호화 기술)

  • Lee, Tae-Jin;Kang, Kyeong-Ok;Kim, Whan-Woo
    • The Journal of the Acoustical Society of Korea / v.28 no.7 / pp.589-598 / 2009
  • As mobile devices become multi-functional and converge into a single platform, there is a strong need for a codec that can provide consistent quality for both speech and music content. MPEG-D USAC standardization activities started at the 82nd MPEG meeting with a CfP, and WD3 was approved at the 88th MPEG meeting. MPEG-D USAC is a converged technology of AMR-WB+ and HE-AAC V2. Specifically, USAC utilizes three core codecs (AAC, ACELP, and TCX) for low-frequency regions, SBR for high-frequency regions, and the MPEG Surround tool for stereo information. USAC can provide consistent sound quality for both speech and music content and can be applied to various applications such as multimedia download to mobile devices, digital radio, mobile TV, and audio books.

Effectiveness of "Village Image Construction Tool Kit" in the Residents Workshop of a Housing Improvement Area (주거지 정비지역 주민 워크샵을 통한 마을이미지 맵 제작도구의 효용성 연구)

  • Lee, Yeun-Sook;Kim, Ju-Suck;Jung, Eun-Jung
    • Journal of the Korean housing association / v.21 no.1 / pp.67-77 / 2010
  • Citizen participation in local redevelopment has recently been regarded as essential, since progress in democracy and diversified public interests have placed more importance on citizen participation in the implementation of public policies. While the importance of resident participation has been increasingly emphasized in principle, in reality more effort is still required in its application. We need to develop practical strategies for collecting community opinion and reflecting it in public policy if we are to achieve a resident- and citizen-centered society. First, this study aims to develop an image map construction tool that can be applied to the "Maul-Mandulgi" projects as a visual method to facilitate the exchange of opinions and work toward agreement. The tool is intended to assist public discussion by visualizing policies and plans and reducing the possibility of misunderstanding, so that residents can properly respond to the plans. Second, this study verifies the effectiveness of the tool when applied to local community workshops. The main research methods are participant observation and field study. The major findings are as follows. First, every resident who had participated in previous workshops gathered together, used the tool, and expressed his or her opinion more than once, which was unusual. Each resident tried to make sure that the other participants properly understood his or her opinion. The workshop finished when all participants agreed and produced a consensus, and it took much less time than previous workshops, in which collecting opinions had taken significantly longer. Second, the study showed that residents in the redevelopment area can reach a broad agreement by themselves on a method and direction for residential improvement. In previous workshops, conflicts between residents developed over the choice between two methods: local improvement, and total demolition prior to multi-housing construction. In this study, residents' opinions were not limited to those two methods, because a win-win solution was found. Third, the image map tool kit helped inactive residents develop their own opinions on the direction and orientation of the residential improvement process. In addition, for those who have little or no understanding of residential improvement projects, the tool can provide access to information and knowledge. This study concludes that the developed tool, which visualizes the redevelopment project like a design game rather than relying on text and speech, can be useful in collecting opinions and forming an agreed opinion for forthcoming residential improvement plans.

Pediatric Voice Handicap Index-Korean(pVHI-K) : A Pilot Study for Standardization (한국어판 소아음성장애지수(pVHI-K : Pediatric Voice Handicap Index-Korean) : 표준화를 위한 예비연구)

  • Park, Sung-Shin;Choi, Seong-Hee;Hong, Young-Hye;Jeong, Nyun-Gi;Sung, Myung-Whun;Kim, Kwang-Hyun;Kwon, Tack-Kyun
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.22 no.2 / pp.137-142 / 2011
  • Background and Objectives: The aim of this study is to introduce the Korean version of the pediatric VHI and, as a preliminary study before the pVHI-K is developed, to compare pVHI-K scores between children with dysphonia and children without voice problems. Additionally, the relationship between pVHI and acoustic measures was investigated. Materials and Methods: pVHI-K scores in the normal group were obtained from 15 parents whose children had no present or past history of a voice disorder, hearing loss, or related disability that can affect their voice or speech. The dysphonia group consisted of 15 parents whose children had bilateral vocal fold nodules and were seen at the Department of Otolaryngology, Seoul National University Hospital (SNUH). pVHI-K and acoustic parameters were measured in the two groups. Results: The mean pVHI scores (total, functional, physical, emotional) in the normal group were 2.33 (T), 0.80 (F), 1.33 (P) and 0.27 (E), respectively, whereas those in the group of children with dysphonia were 23.13 (T), 11.07 (F), 5.73 (P) and 6.13 (E), respectively. Significant differences were found in the total pVHI score as well as in all of the pVHI sub-scores. Moreover, significant correlations between pVHI-K parameters (T, F, P) and an acoustic measure [Shimmer (%)] were found in the children in the dysphonia group. Conclusion: The pVHI-K, as reported by parents, can be useful as a supplementary clinical tool for diagnosing dysphonia and measuring treatment effectiveness in young children.
