• Title/Abstract/Keywords: speech sound

Search results: 625 items; processing time: 0.026 s

말소리장애 아동의 말명료도와 음향학적 측정치 간 상관관계 (The Correlation between Speech Intelligibility and Acoustic Measurements in Children with Speech Sound Disorders)

  • 강은영
    • 대한통합의학회지
    • /
    • Vol. 6, No. 4
    • /
    • pp.191-206
    • /
    • 2018
  • Purpose: This study investigated the correlation between speech intelligibility and acoustic measurements of speech sounds produced by children with speech sound disorders and children without any diagnosed speech sound disorder. Methods: A total of 60 children with and without speech sound disorders participated in the study. Speech samples were obtained by having the subjects speak meaningful words. Acoustic measurements were analyzed on a spectrogram using the Multi-speech 3700 program. Speech intelligibility was determined according to a listener's perceptual judgment. Results: Children with speech sound disorders had significantly lower speech intelligibility than those without speech sound disorders. The intensity of the vowel /u/, the duration of the vowel /${\omega}$/, and the second formant of the vowel /${\omega}$/ differed significantly between the two groups. There was no difference in voice onset time between the groups. Acoustic measurements were correlated with speech intelligibility. Conclusion: The results showed that the speech intelligibility of children with speech sound disorders was affected by intensity, word duration, and formant frequency. In clinical settings, acoustic measurements are needed to complement the evaluation of speech intelligibility.

웨이브렛 변환을 이용한 음성신호의 유성음/무성음/묵음 분류 (Voiced/Unvoiced/Silence Classification of Speech Signal Using Wavelet Transform)

  • 손영호;배건성
    • 음성과학
    • /
    • Vol. 4, No. 2
    • /
    • pp.41-54
    • /
    • 1998
  • Speech signals are classified, according to the characteristics of their waveform, as voiced sound, unvoiced sound, or silence. Voiced sound, produced by an air flow generated by the vibration of the vocal cords, is quasi-periodic, while unvoiced sound, produced by a turbulent air flow passing through a constriction in the vocal tract, is noise-like. Silence represents the ambient noise signal during the absence of speech. Many speech analysis systems need to decide whether a given segment of a speech waveform should be classified as voiced, unvoiced, or silence. This paper proposes a voiced/unvoiced/silence classification algorithm that uses spectral change in the wavelet-transformed signal, and presents and discusses experimental results.
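The three-way decision described in this abstract can be illustrated with a much simpler, classical heuristic. The sketch below is not the wavelet-based algorithm of the paper: it substitutes frame energy and zero-crossing rate, and the function names and both thresholds are assumptions chosen only for illustration.

```python
import math

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def classify_frame(frame, silence_energy=1e-4, zcr_threshold=0.25):
    """Label one frame as 'silence', 'unvoiced', or 'voiced'.

    Both thresholds are illustrative assumptions, not values from the paper.
    """
    energy = sum(x * x for x in frame) / len(frame)  # mean power of the frame
    if energy < silence_energy:
        return "silence"
    # Noise-like (unvoiced) frames change sign far more often than
    # quasi-periodic (voiced) frames.
    if zero_crossing_rate(frame) > zcr_threshold:
        return "unvoiced"
    return "voiced"

# Synthetic frames: a slow sine (voiced-like), a rapidly alternating
# signal (noise-like), and digital silence.
voiced_like = [0.5 * math.sin(2 * math.pi * n / 40) for n in range(400)]
noise_like = [0.3 if n % 2 == 0 else -0.3 for n in range(400)]
silent = [0.0] * 400
```

In practice the thresholds would have to be tuned to the recording level and background noise of the data at hand.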

회의실 유리창 진동음의 음성 명료도 분석 (Speech Intelligibility Analysis on the Vibration Sound of the Glass Window of a Conference Room)

  • 김희동;김윤호;김석현
    • 한국소음진동공학회논문집
    • /
    • Vol. 17, No. 4
    • /
    • pp.363-369
    • /
    • 2007
  • The purpose of the study is to obtain acoustical information for preventing eavesdropping through a glass window. Speech intelligibility was investigated for the vibration sound detected from the glass window of a conference room. An objective test using the speech transmission index (STI) was performed to estimate the speech intelligibility quantitatively. STI was determined from the modulation transfer function (MTF) of the room-glass window system. Using a Maximum Length Sequence (MLS) signal as a sound source, impulse responses of the glass window and the MTF were determined from the signals of accelerometers and a laser Doppler vibrometer. Finally, the speech intelligibility of the interior sound and of the window vibration was compared under different sound pressure levels and amplifier gains to confirm the effect of the measurement conditions on speech intelligibility.
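As a rough illustration of how an STI value follows from an MTF, the sketch below applies the standard mapping from each modulation index m to an apparent signal-to-noise ratio, clipped to ±15 dB and rescaled to a transmission index between 0 and 1. It is a simplified sketch, not the procedure used in the paper: the function names are hypothetical, the octave bands are weighted equally by default, and the masking and redundancy corrections of the full standard are omitted.

```python
import math

def transmission_index(m):
    """Map one modulation index m (0 < m < 1) to a transmission index in [0, 1]."""
    m = min(max(m, 1e-6), 1.0 - 1e-6)          # keep the logarithm finite
    snr_db = 10.0 * math.log10(m / (1.0 - m))  # apparent signal-to-noise ratio
    snr_db = max(-15.0, min(15.0, snr_db))     # clip to the +/-15 dB range
    return (snr_db + 15.0) / 30.0

def sti_from_mtf(mtf, band_weights=None):
    """Estimate STI from an MTF matrix.

    mtf: one list per octave band, each holding the modulation indices
         measured at the modulation frequencies of that band.
    band_weights: optional per-band weights; equal weighting is an
         assumption made here for simplicity.
    """
    # Modulation transfer index per band: mean transmission index.
    mti = [sum(transmission_index(m) for m in band) / len(band) for band in mtf]
    if band_weights is None:
        band_weights = [1.0 / len(mti)] * len(mti)
    return sum(w * x for w, x in zip(band_weights, mti))

# Example: 7 octave bands x 14 modulation frequencies, all with m = 0.5.
example_sti = sti_from_mtf([[0.5] * 14 for _ in range(7)])
```

A modulation index of 0.5 corresponds to an apparent signal-to-noise ratio of 0 dB, so the example evaluates to an STI of about 0.5.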

Computerized Sound Dictionary of Korean and English

  • Kim, Jong-Mi
    • 음성과학
    • /
    • Vol. 8, No. 1
    • /
    • pp.33-52
    • /
    • 2001
  • A bilingual sound dictionary in Korean and English has been created as a broad sound reference for cross-linguistic, dialectal, native language (L1)-transferred, biological, and allophonic variations. The paper demonstrates that the pronunciation dictionary of the lexicon is inadequate for sound reference due to the preponderance of unmarked sounds. The audio registry consists of a three-way comparison of 1) English speech from native English speakers, 2) Korean speech from Korean speakers, and 3) English speech from Korean speakers. Several sub-dictionaries have been created as foundation research for independent development: 1) a pronunciation dictionary of the Korean lexicon in a keyboard-compatible phonetic transcription, 2) a sound dictionary of L1-interfered language, and 3) an audible dictionary of Korean sounds. The dictionary was designed to facilitate the exchange of the speech signal and its corresponding text data on various media, particularly CD-ROM. The methodology and findings of the construction are discussed.

교란파가 유리창 진동음의 음성명료도에 미치는 영향 (The Effect of the Disturbing Wave on the Speech Intelligibility of the Eavesdropping Sound of a Window Glass)

  • 김석현;김희동;허욱
    • 한국소음진동공학회논문집
    • /
    • Vol. 17, No. 9
    • /
    • pp.888-894
    • /
    • 2007
  • Speech sound can be detected by measuring the vibration of a window glass. In this study, we investigate the effect of disturbing waves, produced by background noise and by window shaker excitation, on the speech intelligibility of the detected sound. Based on the Modulation Transfer Function (MTF), the speech intelligibility of the sound is objectively estimated by the Speech Transmission Index (STI). The variation of the speech intelligibility is examined as the level of the disturbing wave varies. The experimental results reveal how STI is influenced by the level and frequency characteristics of the disturbing wave. Using a customized window shaker to generate the disturbing sound, we evaluate the efficiency and the frequency characteristics of the anti-eavesdropping system. The purpose of the study is to provide useful information for preventing eavesdropping through the window glass.

한국어 음성합성기의 성능 향상을 위한 합성 단위의 유무성음 분리 (Separation of Voiced Sounds and Unvoiced Sounds for Corpus-based Korean Text-To-Speech)

  • 홍문기;신지영;강선미
    • 음성과학
    • /
    • Vol. 10, No. 2
    • /
    • pp.7-25
    • /
    • 2003
  • Predicting the right prosodic elements is a key factor in improving the quality of synthesized speech. Prosodic elements include break, pitch, duration, and loudness. Pitch, which is realized by the fundamental frequency (F0), is the most important element relating to the quality of synthesized speech. However, the previous method for predicting F0 has some problems. If voiced and unvoiced sounds are not correctly classified, the result is incorrect pitch prediction, incorrect triphone units when synthesizing the voiced and unvoiced sounds, and clicking or vibrating sounds. These artifacts commonly occur at transitions from a voiced sound to an unvoiced sound or from an unvoiced sound to a voiced sound. The problem is not resolved by grammar-based methods, and it strongly affects the synthesized sound. Therefore, to obtain correct pitch values reliably, this paper proposes a new model for predicting and classifying voiced and unvoiced sounds using the CART tool.

3-5세 일반아동의 말소리에 대한 융합적 분석: 단어와 자발화를 중심으로 (Convergent Analysis on the Speech Sound of Typically Developing Children Aged 3 to 5 : Focused on Word Level and Connected Speech Level)

  • 김윤주;박현주
    • 한국융합학회논문지
    • /
    • Vol. 9, No. 6
    • /
    • pp.125-132
    • /
    • 2018
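  • This study examined the speech sound production characteristics of preschool children, and the related assessment patterns, through word-level and connected speech assessments. To this end, the Assessment of Phonology and Articulation for Children (APAC) was administered to 72 typically developing children aged 3 to 5 (24 per age group). The analysis covered differences in consonant accuracy and intelligibility by age and gender, the correlation between consonant accuracy and intelligibility, and speech sound error patterns by consonant position and manner of articulation. Consonant accuracy and intelligibility increased with age but did not differ by gender, the correlation was statistically significant in the 5-year-old group, and the speech sound error patterns differed between the two assessments. These results show that, because children's speech sound production varies with the linguistic unit, connected speech assessment must accompany word-level assessment to gauge their speech sound ability adequately. This suggests that the current criterion, which determines the grade of language disability from consonant accuracy on words alone, needs to be re-examined and studied further.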

자동차 주행 환경에서의 음성 전달 명료도와 음성 인식 성능 비교 (Comparison of Speech Intelligibility & Performance of Speech Recognition in Real Driving Environments)

  • 이광현;최대림;김영일;김봉완;이용주
    • 대한음성학회지:말소리
    • /
    • No. 50
    • /
    • pp.99-110
    • /
    • 2004
  • Normal sound transmission characteristics are difficult to obtain in a moving car because of various noises and structural factors. The resulting channel distortion of the source sound recorded by microphones seriously degrades the performance of speech recognition in real driving environments. In this paper we analyze the degree of intelligibility under various channel distortion conditions according to driving speed, using the speech transmission index (STI), and compare the STI with speech recognition rates. We examine the correlation between intelligibility measures, which depend on sound pick-up patterns, and speech recognition performance, and thereby consider the optimal location of a microphone in a single-channel environment. The experiments show a high correlation between STI and speech recognition rates.

회의실 유리창 진동음의 명료도 분석 (Speech Intelligibility Analysis on the Vibration Sound of the Window Glass of a Conference Room)

  • 김윤호;김희동;김석현
    • 한국소음진동공학회:학술대회논문집
    • /
    • 한국소음진동공학회 2006 Autumn Conference Proceedings
    • /
    • pp.150-155
    • /
    • 2006
  • Speech intelligibility is investigated for a coupled conference room-window glass system. Using an MLS (Maximum Length Sequence) signal as a sound source, the acceleration and velocity responses of the window glass are measured by an accelerometer and a laser Doppler vibrometer. The MTF (Modulation Transfer Function) is used to identify the speech transmission characteristics of the room and window system. The STI (Speech Transmission Index) is calculated from the MTF, and the speech intelligibility of the room and the window glass is estimated. The speech intelligibility obtained from the acceleration signal is compared with that obtained from the velocity signal, and the possibility of wiretapping is investigated. Finally, the intelligibility of the conversation sound is examined by a subjective test.

음성신호 적응분할방법에 의한 특징분석 (Features Analysis of Speech Signal by Adaptive Dividing Method)

  • 장승관;최성연;김창석
    • 음성과학
    • /
    • Vol. 5, No. 1
    • /
    • pp.63-80
    • /
    • 1999
  • In this paper, an adaptive method is proposed for dividing a speech signal into the initial, medial, and final sounds of an utterance by evaluating the extreme values of the short-term energy and autocorrelation functions. The method was applied to speech signals composed of a consonant, a vowel, and a consonant; each signal was divided into an initial, a medial, and a final sound, and the features of the samples were analyzed by LPC. Spectrum analysis of each period showed the spectral features of a consonant in the initial period, of a vowel in the medial period, and of both in the final sound. Also, when various words were adaptively divided into three periods using the proposed method, initial sounds with the same consonant and medial sounds with the same vowel showed the same spectral characteristics, respectively, whereas the final sound showed different spectral characteristics even when it had the same consonant as the initial sound.
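The two frame-level quantities named in this abstract, short-term energy and the autocorrelation function, can be computed as in the sketch below. It only shows the feature computation under assumed frame sizes. The function names are hypothetical, and the paper's actual rule for placing the initial/medial/final boundaries is not given in the abstract, so it is not reproduced here.

```python
import math

def split_frames(signal, frame_len, hop):
    """Cut a signal into overlapping frames of frame_len samples."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]

def short_term_energy(frame):
    """Sum of squared samples in one frame."""
    return sum(x * x for x in frame)

def normalized_autocorr(frame, lag):
    """Autocorrelation at the given lag, divided by the frame energy."""
    energy = short_term_energy(frame)
    if energy == 0:
        return 0.0
    acc = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
    return acc / energy

# Synthetic check: a sine with a period of 20 samples correlates strongly
# with itself at a lag of one full period and negatively at half a period.
sine = [math.sin(2 * math.pi * n / 20) for n in range(200)]
r_full = normalized_autocorr(sine, 20)
r_half = normalized_autocorr(sine, 10)
```

Vowel-like (periodic) frames give a high autocorrelation peak at the pitch period together with high energy, while consonant-like frames typically give lower values of both, which is what makes these two functions usable as segmentation cues.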
