• Title/Summary/Keyword: valid speech- sound

Search Result 4, Processing Time 0.02 seconds

A Merging Algorithm with the Discrete Wavelet Transform to Extract Valid Speech-Sounds (이산 웨이브렛 변환을 이용한 유효 음성 추출을 위한 머징 알고리즘)

  • Kim, Jin-Ok;Hwang, Dae-Jun;Paek, Han-Wook;Chung, Chin-Hyun
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.8 no.3
    • /
    • pp.289-294
    • /
    • 2002
  • A valid speech-sound block can be classified to provide important information for speech recognition. The classification of the speech-sound block comes from the MRA(multi-resolution analysis) property of the DWT(discrete wavelet transform), which is used to reduce the computational time for the pre-processing of speech recognition. The merging algorithm is proposed to extract valid speech-sounds in terms of position and frequency range. It needs some numerical methods for an adaptive DWT implementation and performs unvoiced/voiced classification and denoising. Since the merging algorithm can decide the processing parameters relating to voices only and is independent of system noises, it is useful for extracting valid speech-sounds. The merging algorithm has an adaptive feature for arbitrary system noises and an excellent denoising SNR(signal-to-nolle ratio).

A Study on Extracting Valid Speech Sounds by the Discrete Wavelet Transform (이산 웨이브렛 변환을 이용한 유효 음성 추출에 관한 연구)

  • Kim, Jin-Ok;Hwang, Dae-Jun;Baek, Han-Uk;Jeong, Jin-Hyeon
    • The KIPS Transactions:PartB
    • /
    • v.9B no.2
    • /
    • pp.231-236
    • /
    • 2002
  • The classification of the speech-sound block comes from the multi-resolution analysis property of the discrete wavelet transform, which is used to reduce the computational time for the pre-processing of speech recognition. The merging algorithm is proposed to extract vapid speech-sounds in terms of position and frequency range. It performs unvoiced/voiced classification and denoising. Since the merging algorithm can decide the processing parameters relating to voices only and is independent of system noises, it is useful for extracting valid speech-sounds. The merging algorithm has an adaptive feature for arbitrary system noises and an excellent denoising signal-to-noise ratio and a useful system tuning for the system implementation.

Production and perception of Korean word-initial stops from a sound change perspective (음 변화 관점에서 바라본 한국어 어두 폐쇄음의 발화 및 지각)

  • Kim, Jin-Woo
    • Phonetics and Speech Sciences
    • /
    • v.13 no.3
    • /
    • pp.39-51
    • /
    • 2021
  • Based on spontaneous speech data collected in 2020, this study examined the production and perception of Korean lenis, aspirated, and fortis stops. Unlike the controlled experiments of previous studies, lenis and aspirated stops of males in their 30s were not distinguished by voice onset time (VOT) in spontaneous speech. Perceptual experiments were conducted on young females, the leaders of language change. F0 was found to serve as the primary cue for the perception of lenis stops, and then VOT distinguished the aspirated and fortis stops. The fact that the sounds were always perceived as lenis stops when F0 was low, irrespective of whether VOT was short or long, showed that F0 plays an absolute role in the perception of lenis stops. However, in some cases the aspirated and lenis stops were distinguished only by VOT, which does not happen in production. In terms of sound change, disagreement between production and perception systems occurs when sound change is in progress. In particular, when production change precedes perception change, it indicates that the sound change is in its latter stages. Young females still maintain the previous system in perception because the distinction of lenis and aspirated stops by VOT was valid in their parents' generation. In other words, VOT is still used for perception to communicate with other groups.

Development of Differential Diagnosis Scale Items for Adductor Spasmodic Dysphonia and Evaluation of Clinical Availability (내전형 연축성 발성장애 감별진단 문항 개발과 임상적 유용성 평가)

  • Cho, Jae Kyung;Choi, Seong Hee;Lee, Sang Hyuk;Jin, Sung Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.30 no.2
    • /
    • pp.112-117
    • /
    • 2019
  • Background and Objectives The purpose of this study was to develop the differential diagnosis scale containing items from adductor spasmodic dysphonia (ADSD) to muscle tension dysphonia (MTD) and the determine clinical utility of newly developed items. Materials and Method The four parts of pitch, redirected phonation, automatic speech and voiced sound were selected for analyzing the characteristics of ADSD in the literature. One part of tense voiceless sound was developed according to the Korean manner of articulation. The content validity was evaluated based on 5 scales (1-5 point) analysis from 30 experts. One hundred patients (50 ADSD and 50 MTD) were recorded in reading a sentence and sustained phonation. The two speech language pathologist evaluated recorded voices through a blind test using 4 scales (0-3 point) for newly developed items. Results As a result of verifying the content validity of items with experts, it was identified that the differentiated items were valid with 4.2 out of 5. Through the differential diagnosis between two groups according to the items, the correlation between sub-domains and total scores was shown as higher than 0.710. The result of analyzing the reliability on each diagnosis domain was 0.840-0.893, which showed the internal consistency of items was great. Newly developed five parts of ADSD were significantly higher than those of MTD with strong correlation (p<0.01). The reliability among the evaluators was analyzed as high with 0.892. Conclusion In this study, the differential diagnosis scale of ADSD was revealed as having validity and reliability. It is considered that it will be useful for differentiating ADSD and MTD in the clinical field.