• 제목/요약/키워드: speechTool

검색결과 155건 처리시간 0.022초

실시간 음성분석도구의 MatLab 구현 (Matlab Implementation of Real-time Speech Analysis Tool)

  • 박일서;김대현;조철우
    • 대한음성학회지:말소리
    • /
    • 제44호
    • /
    • pp.93-104
    • /
    • 2002
  • There are many speech analysis tools available. Among them real-time analysis tool is very useful for interactive experiments. A real-time speech analysis tool was implemented using Matlab. Matlab is a very widely used general purpose signal processing tool. In general, its computational speed is relatively lower than that of the codes from conventional programming languages. Especially, real-time analysis including input of signal and output of the result was not possible in the past. However, due to the improvement of computing power of PCs and inclusion of real-time I/O toolboxes in Matlab, real-time analysis is now possible in some extent by Matlab only. In this experiment, we tried to implement a real-time speech analysis tool using Matlab. Pitch and spectral information is computed in real-time. From the result it is shown that such real-time applications can be implemented easily using Matlab.

  • PDF

Digital enhancement of pronunciation assessment: Automated speech recognition and human raters

  • Miran Kim
    • 말소리와 음성과학
    • /
    • 제15권2호
    • /
    • pp.13-20
    • /
    • 2023
  • This study explores the potential of automated speech recognition (ASR) in assessing English learners' pronunciation. We employed ASR technology, acknowledged for its impartiality and consistent results, to analyze speech audio files, including synthesized speech, both native-like English and Korean-accented English, and speech recordings from a native English speaker. Through this analysis, we establish baseline values for the word error rate (WER). These were then compared with those obtained for human raters in perception experiments that assessed the speech productions of 30 first-year college students before and after taking a pronunciation course. Our sub-group analyses revealed positive training effects for Whisper, an ASR tool, and human raters, and identified distinct human rater strategies in different assessment aspects, such as proficiency, intelligibility, accuracy, and comprehensibility, that were not observed in ASR. Despite such challenges as recognizing accented speech traits, our findings suggest that digital tools such as ASR can streamline the pronunciation assessment process. With ongoing advancements in ASR technology, its potential as not only an assessment aid but also a self-directed learning tool for pronunciation feedback merits further exploration.

4세 말소리발달 선별검사 개발과 한국어말소리분석도구(Korean Speech Sound Analysis Tool, KSAT)의 활용 (Developing the speech screening test for 4-year-old children and application of Korean speech sound analysis tool (KSAT))

  • 김수진;장기완;장문수
    • 말소리와 음성과학
    • /
    • 제16권1호
    • /
    • pp.49-55
    • /
    • 2024
  • 본 연구는 4세 아동에 대한 말소리발달 평가를 위해 세 문장 따라말하기 선별검사를 개발하고 또래와 비교할 수 있는 규준을 제공하기 위한 것이다. 이를 위해 4세 전반과 후반 각각 24명씩 총 48명의 아동에게 선별검사를 실시하였다. 선별검사 결과는 기존의 말소리장애 평가 검사 결과와 .7의 상관을 보였다. 선별검사를 통해 구한 음운발달 지표와 오류패턴에서 4세 전반과 후반으로 나눈 두 집단에 차이가 있는지 비교하였다. 후반 아동의 발달지표가 높은 것으로 나왔지만 통계적으로 유의한 차이는 없었다. 모든 분석은 한국어말소리분석도구(Korean Speech Sound Analysis Tool, KSAT)를 사용하였으며, 자동분석 결과와 임상가의 수동분석 내용을 비교하였다. 자동분석과 수동분석의 오류패턴분석 일치도는 93.63%였다. 본 연구의 의의는 유도 문장수준에서 세 문장 따라말하기 선별검사의 4세 아동의 말소리 규준을 제시했다는 것과 KSAT의 임상과 연구 현장에서 적용 가능성을 검토하였다는 것이다.

MPEG-4 TTS (Text-to-Speech)

  • 한민수
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 1999년도 하계종합학술대회 논문집
    • /
    • pp.699-707
    • /
    • 1999
  • It cannot be argued that speech is the most natural interfacing tool between men and machines. In order to realize acceptable speech interfaces, highly advanced speech recognizers and synthesizers are inevitable. Text-to-Speech(TTS) technology has been attracting a lot of interest among speech engineers because of its own benefits. Namely, the possible application areas of talking computers, emergency alarming systems in speech, speech output devices fur speech-impaired, and so on. Hence, many researchers have made significant progresses in the speech synthesis techniques in the sense of their own languages and as a result, the quality of currently available speech synthesizers are believed to be acceptable to normal users. These are partly why the MPEG group had decided to include the TTS technology as one of its MPEG-4 functionalities. ETRI has made major contributions to the current MPEG-4 TTS among various MPEG-4 functionalities. They are; 1) use of original prosody for synthesized speech output, 2) trick mode functions fer general users without breaking synthesized speech prosody, 3) interoperability with Facial Animation(FA) tools, and 4) dubbing a moving/animated picture with lib-shape pattern information.

  • PDF

선천성 청각장애성인의 시각적피드백 이용 음도치료 효과 (The Effect of Visual Feedback Intervention on Voice Pitch of Adult with Hearing Impairment)

  • 어수지;윤미선
    • 음성과학
    • /
    • 제12권4호
    • /
    • pp.215-226
    • /
    • 2005
  • This study is an attempt to investigate effect of pitch treatment program using visual feedback for profound deaf adults. Dr. Speech program was applied as a training tool. The subjects of this study were 3 profound deaf adults. Speech samples for evaluation were vowel prolongations and connected speech. Analysis was performed under the principle of single subject research design. As results of this study, all subjects showed the treatment effects which were represented by lowering fundamental frequency and speaking fundamental frequency.

  • PDF

Acoustic Analysis of Speech Disorder Associated with Motor Aphasia - A Case Report -

  • Ko, Myung-Hwan;Kim, Hyun-Ki;Kim, Yun-Hee
    • 음성과학
    • /
    • 제7권1호
    • /
    • pp.97-107
    • /
    • 2000
  • Motor aphasia is an affection frequently caused by insult of the left middle cerebral artery and usually accompanied by a large lesion involving the Broca's area and the adjacent motor and premotor areas. Therefore, a patient with motor aphasia commonly shows articulatory disturbances due to failure of the motor programing of speech sound. Objective assessment and treatment of phonologic programing is one of the important aspects of speech therapy in aphasic patients. We analyzed the speech disorders acompanied with motor aphasia in a 45-year-old man using a computerized sound spectrograph, Visi-$Pitch{\circledR}$, and Multi-Dimensional Voice $Program{\circledR}$. We concluded that a computerized speech analysis system is a useful tool to visualize and quantitatively analyse the severity and progression of dysarthria, and the effect of speech therapy.

  • PDF

Simulink를 이용한 음원모델 시뮬레이터 구현 (Implementation of Voice Source Simulator Using Simulink)

  • 조철우;김재희
    • 말소리와 음성과학
    • /
    • 제3권2호
    • /
    • pp.89-96
    • /
    • 2011
  • In this paper, details of the design and implementation of a voice source simulator using Simulink and Matlab are discussed. This simulator is an implementation by model-based design concept. Voice sources can be analyzed and manipulated through various factors by choosing options from GUI input and selecting pre-defined blocks or user created ones. This kind of simulation tool can simplify the procedure of analyzing speech signals for various purposes such as voice quality analysis, pathological voice analysis, and speech coding. Also, basic analysis functions are supported to compare the original signal and the manipulated ones.

  • PDF

단순작업으로 인한 정신피로도 측정을 위한 음성기술을 이용한 CART 기반 진단모델 (A CART-based diagnostic model using speech technology for evaluating mental fatigue caused by monotonous work)

  • 권철홍
    • 말소리와 음성과학
    • /
    • 제8권4호
    • /
    • pp.97-101
    • /
    • 2016
  • This paper presents a CART(Classification and Regression Tree)-based model to diagnose mental fatigue using speech technology. The parameters used in the model are the significant speech parameters highly correlated to the fatigue and questionnaire responses obtained before and after imposing the fatigue. It is shown from the experiments that the proposed model achieves classification accuracies of 96.67% and 98.33% using the speech parameters and questionnaire responses, respectively. This implies that the proposed model can be used as a tool to diagnose the mental fatigue, and that speech technology is useful to diagnose the fatigue.

국내 장애 아동을 위한 언어치료용 모바일 어플리케이션 현황 분석 (Analysis of Mobile Application Trends for Speech and Language Therapy of Children with Disabilities in Korea)

  • 이영미;이수복;성민경
    • 말소리와 음성과학
    • /
    • 제7권3호
    • /
    • pp.153-163
    • /
    • 2015
  • This study investigated the trends of mobile applications which were developed for prompting speech and language skills for children with disabilities, and analyzed the function and contents of these applications as a tool of speech and language therapy. For this analysis, twenty applications among 71 ones were selected according to the exclusion criteria. These applications were classified by the 8 using types of contents and analyzed the function of mobile applications by the revised mobile contents evaluation standard (ease of use, value of education, interest level, and interactivity). As a results, applications for augmentative and alternative communication were developed much more than any other types. And the ease of use got the highest score whereas the interest level got the lowest score in whole evaluation analysis. The result of this study would suggest way to evaluate applications for speech language therapy and to contribute to developing the contents and function of mobile applications aims to help children with disabilities improving their speech and language skills.