Search | Korea Science

Matlab Implementation of Real-time Speech Analysis Tool (실시간 음성분석도구의 MatLab 구현)

Bak Il-suh;Kim Dae-hyun;Jo Cheol-woo
- MALSORI
- /
- no.44
- /
- pp.93-104
- /
- 2002
There are many speech analysis tools available. Among them real-time analysis tool is very useful for interactive experiments. A real-time speech analysis tool was implemented using Matlab. Matlab is a very widely used general purpose signal processing tool. In general, its computational speed is relatively lower than that of the codes from conventional programming languages. Especially, real-time analysis including input of signal and output of the result was not possible in the past. However, due to the improvement of computing power of PCs and inclusion of real-time I/O toolboxes in Matlab, real-time analysis is now possible in some extent by Matlab only. In this experiment, we tried to implement a real-time speech analysis tool using Matlab. Pitch and spectral information is computed in real-time. From the result it is shown that such real-time applications can be implemented easily using Matlab.
PDF

Digital enhancement of pronunciation assessment: Automated speech recognition and human raters

Miran Kim
- Phonetics and Speech Sciences
- /
- v.15 no.2
- /
- pp.13-20
- /
- 2023
This study explores the potential of automated speech recognition (ASR) in assessing English learners' pronunciation. We employed ASR technology, acknowledged for its impartiality and consistent results, to analyze speech audio files, including synthesized speech, both native-like English and Korean-accented English, and speech recordings from a native English speaker. Through this analysis, we establish baseline values for the word error rate (WER). These were then compared with those obtained for human raters in perception experiments that assessed the speech productions of 30 first-year college students before and after taking a pronunciation course. Our sub-group analyses revealed positive training effects for Whisper, an ASR tool, and human raters, and identified distinct human rater strategies in different assessment aspects, such as proficiency, intelligibility, accuracy, and comprehensibility, that were not observed in ASR. Despite such challenges as recognizing accented speech traits, our findings suggest that digital tools such as ASR can streamline the pronunciation assessment process. With ongoing advancements in ASR technology, its potential as not only an assessment aid but also a self-directed learning tool for pronunciation feedback merits further exploration.
https://doi.org/10.13064/KSSS.2023.15.2.013 인용 PDF

Users' Preferences and Efficient Performances of Tool-like Service Robots Comparing Speech Interface with Non-speech Audio: with Emphasis on Korean Elderly Subjects (언어 및 비언어 인터페이스의 비교를 통한 서비스 로봇의 사용자 선호도 및 수행도에 관한 연구 - 한국 노인을 대상으로)

Kwak, So-Nya;Kim, Myung-Suk
- Proceedings of the Korea Society of Design Studies Conference
- /
- 2005.10a
- /
- pp.36-37
- /
- 2005
PDF

Developing the speech screening test for 4-year-old children and application of Korean speech sound analysis tool (KSAT) (4세 말소리발달 선별검사 개발과 한국어말소리분석도구(Korean Speech Sound Analysis Tool, KSAT)의 활용)

Soo-Jin Kim;Ki-Wan Jang;Moon-Soo Chang
- Phonetics and Speech Sciences
- /
- v.16 no.1
- /
- pp.49-55
- /
- 2024
This study aims to develop a three-sentence speech screening test to evaluate speech development in 4-year-old children and provide standards for comparison with peers. Screening tests were conducted on 24 children each in the first and second halves of 4 years old. The screening test results showed a correlation of .7 with the existing speech disorder evaluation test results. We compared whether there was a difference between the two groups of 4-year-old in the phonological development indicators and error patterns obtained through the screening test. The developmental indicators of the children in the second half were high, but there were no statistically significant differences. The Korean Speech Sound Analysis Tool (KSAT) was used for all analyses, and the automatic analysis results and contents of the clinician's manual analysis were compared. The degree of agreement between the automatic and manual error pattern analyses was 93.63%. The significance of this study is that the standard of speech of a 4-year-old child of the speech screening test according to three sentences at the level of elicited sentences, and the applicability of the KSAT were reviewed in both clinical and research fields.
https://doi.org/10.13064/KSSS.2024.16.1.049 인용 PDF

MPEG-4 TTS (Text-to-Speech)

한민수
- Proceedings of the IEEK Conference
- /
- 1999.06a
- /
- pp.699-707
- /
- 1999
It cannot be argued that speech is the most natural interfacing tool between men and machines. In order to realize acceptable speech interfaces, highly advanced speech recognizers and synthesizers are inevitable. Text-to-Speech(TTS) technology has been attracting a lot of interest among speech engineers because of its own benefits. Namely, the possible application areas of talking computers, emergency alarming systems in speech, speech output devices fur speech-impaired, and so on. Hence, many researchers have made significant progresses in the speech synthesis techniques in the sense of their own languages and as a result, the quality of currently available speech synthesizers are believed to be acceptable to normal users. These are partly why the MPEG group had decided to include the TTS technology as one of its MPEG-4 functionalities. ETRI has made major contributions to the current MPEG-4 TTS among various MPEG-4 functionalities. They are; 1) use of original prosody for synthesized speech output, 2) trick mode functions fer general users without breaking synthesized speech prosody, 3) interoperability with Facial Animation(FA) tools, and 4) dubbing a moving/animated picture with lib-shape pattern information.
PDF

The Effect of Visual Feedback Intervention on Voice Pitch of Adult with Hearing Impairment (선천성 청각장애성인의 시각적피드백 이용 음도치료 효과)

Euh, Su-Ji;Yoon, Mi-Sun
- Speech Sciences
- /
- v.12 no.4
- /
- pp.215-226
- /
- 2005
This study is an attempt to investigate effect of pitch treatment program using visual feedback for profound deaf adults. Dr. Speech program was applied as a training tool. The subjects of this study were 3 profound deaf adults. Speech samples for evaluation were vowel prolongations and connected speech. Analysis was performed under the principle of single subject research design. As results of this study, all subjects showed the treatment effects which were represented by lowering fundamental frequency and speaking fundamental frequency.
PDF

Acoustic Analysis of Speech Disorder Associated with Motor Aphasia - A Case Report -

Ko, Myung-Hwan;Kim, Hyun-Ki;Kim, Yun-Hee
- Speech Sciences
- /
- v.7 no.1
- /
- pp.97-107
- /
- 2000
Motor aphasia is an affection frequently caused by insult of the left middle cerebral artery and usually accompanied by a large lesion involving the Broca's area and the adjacent motor and premotor areas. Therefore, a patient with motor aphasia commonly shows articulatory disturbances due to failure of the motor programing of speech sound. Objective assessment and treatment of phonologic programing is one of the important aspects of speech therapy in aphasic patients. We analyzed the speech disorders acompanied with motor aphasia in a 45-year-old man using a computerized sound spectrograph, Visi-$Pitch{\circledR}$, and Multi-Dimensional Voice $Program{\circledR}$. We concluded that a computerized speech analysis system is a useful tool to visualize and quantitatively analyse the severity and progression of dysarthria, and the effect of speech therapy.
PDF

Implementation of Voice Source Simulator Using Simulink (Simulink를 이용한 음원모델 시뮬레이터 구현)

Jo, Cheol-Woo;Kim, Jae-Hee
- Phonetics and Speech Sciences
- /
- v.3 no.2
- /
- pp.89-96
- /
- 2011
In this paper, details of the design and implementation of a voice source simulator using Simulink and Matlab are discussed. This simulator is an implementation by model-based design concept. Voice sources can be analyzed and manipulated through various factors by choosing options from GUI input and selecting pre-defined blocks or user created ones. This kind of simulation tool can simplify the procedure of analyzing speech signals for various purposes such as voice quality analysis, pathological voice analysis, and speech coding. Also, basic analysis functions are supported to compare the original signal and the manipulated ones.
PDF

A CART-based diagnostic model using speech technology for evaluating mental fatigue caused by monotonous work (단순작업으로 인한 정신피로도 측정을 위한 음성기술을 이용한 CART 기반 진단모델)

Kwon, Chul Hong
- Phonetics and Speech Sciences
- /
- v.8 no.4
- /
- pp.97-101
- /
- 2016
This paper presents a CART(Classification and Regression Tree)-based model to diagnose mental fatigue using speech technology. The parameters used in the model are the significant speech parameters highly correlated to the fatigue and questionnaire responses obtained before and after imposing the fatigue. It is shown from the experiments that the proposed model achieves classification accuracies of 96.67% and 98.33% using the speech parameters and questionnaire responses, respectively. This implies that the proposed model can be used as a tool to diagnose the mental fatigue, and that speech technology is useful to diagnose the fatigue.
https://doi.org/10.13064/KSSS.2016.8.4.097 인용 PDF KSCI

Analysis of Mobile Application Trends for Speech and Language Therapy of Children with Disabilities in Korea (국내 장애 아동을 위한 언어치료용 모바일 어플리케이션 현황 분석)

Lee, Youngmee;Lee, Soobok;Sung, Minkyoung
- Phonetics and Speech Sciences
- /
- v.7 no.3
- /
- pp.153-163
- /
- 2015
This study investigated the trends of mobile applications which were developed for prompting speech and language skills for children with disabilities, and analyzed the function and contents of these applications as a tool of speech and language therapy. For this analysis, twenty applications among 71 ones were selected according to the exclusion criteria. These applications were classified by the 8 using types of contents and analyzed the function of mobile applications by the revised mobile contents evaluation standard (ease of use, value of education, interest level, and interactivity). As a results, applications for augmentative and alternative communication were developed much more than any other types. And the ease of use got the highest score whereas the interest level got the lowest score in whole evaluation analysis. The result of this study would suggest way to evaluate applications for speech language therapy and to contribute to developing the contents and function of mobile applications aims to help children with disabilities improving their speech and language skills.
https://doi.org/10.13064/KSSS.2015.7.3.153 인용 PDF KSCI

Search Result 155, Processing Time 0.021 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)