• 제목/요약/키워드: Speech characteristics

검색결과 969건 처리시간 0.028초

발화속도와 한국어 분절음의 음향학적 특성 (Speech Rate and the Acoustic Features of Korean Segments)

  • 이숙향;고현주
    • 한국음향학회지
    • /
    • 제23권2호
    • /
    • pp.162-172
    • /
    • 2004
  • 본 연구에서는 산출실험을 통해 발화속도와 한국어의 분절음의 지속시간 및 포만트 특성과의 관계와 모음의 지속시간과 포만트 간의 상관관계를 살펴보았다. 빠른 발화일수록 음절 및 자음과 모음의 지속시간은 짧게 나타났으며 대부분의 화자에서 폐쇄음의 폐쇄구간 대 기식구간의 비율이나 한 음절 내의 모음 대 자음 지속시간의 비율은 발화속도의 영향을 받지 않는 반면 일부 화자들은 발화속도의 영향을 받는 것으로 나타났다. 발화속도의 영향을 받는 화자들에서 폐쇄음의 경우 폐쇄구간이 기식구간보다 영향을 더 받으며 음절의 경우 모음이 자음보다 더 영향을 받는 것으로 나타났다. 발화속도와 모음의 포만트값과의 관계 분석 결과 발화속도가 모음약화에 영향을 미치는 정도가 화자간에 차이를 보였으며 이는 화자마다 모음의 포만트값 구현에 관하여 다른 발화기재를 이용하고 있다는 것을 간접적으로 시사해주는 것이라고 할 수 있다. 즉, 발화속도의 증가에 따라 조음기관의 움직임의 속도를 증가시키는 화자가 있는 반면 발화속도의 변화에 관계없이 일정한 속도를 유지하는 화자가 있다는 것을 의미한다.

경직형 뇌성마비 아동의 하위그룹별 말속도와 쉼의 특성 및 말명료도와의 관계 (Characteristics of speech rate and pause in children with spastic cerebral palsy and their relationships with speech intelligibility)

  • 정필연;심현섭
    • 말소리와 음성과학
    • /
    • 제12권3호
    • /
    • pp.95-103
    • /
    • 2020
  • 본 연구의 목적은 경직형 뇌성마비 아동의 하위그룹별로 말속도와 쉼에서 차이가 있는지 살펴보고, 말명료도와의 관련성에 대해서 알아보고자 하였다. 연구대상은 경직형 뇌성마비 아동 26명이 참여하였다. 말문제와 언어문제가 없는 NSMI-LCT 4명, 말문제는 없지만 언어문제가 있는 NSMI-LCI 그룹 6명, 말문제가 있지만 언어문제는 없는 SMI-LCT 6명, 말과 언어문제를 모두 동반하는 SMI-LCI 그룹 10명이 참여하였다. 연구과제는 문장 따라말하기였고, Praat을 통해 말속도, 조음속도, 쉼 시간의 비율, 평균 쉼 횟수, 평균 쉼 시간을 측정하였다. 연구결과, 첫째, 말속도와 조음속도는 언어문제의 유무와 관계없이 NSMI와 SMI 그룹 간에 유의한 차이가 나타났다. 둘째, NSMI에 비해 SMI 그룹에서에서 쉼 시간의 비율은 더 높고, 쉼 횟수는 더 빈번하였으며 쉼 시간은 더 길게 나타났다. 셋째, 말속도와 조음속도는 말명료도와 유의한 상관을 나타내었다. 본 연구의 결과는 느린 말속도가 SMI 그룹의 말산출 과정에서 나타나는 주요한 특성이고, 말명료도에 있어서 조음속도와 말속도가 중요한 역할을 함을 시사한다.

음성신호개선을 위한 임계대역 웨이블렛 패킷 기반의 스펙트럼 차감법 (Critical Banded Wavelet Packet-Based Spectral Subtractions for Speech Enhancement)

  • Chang, Sung-Wook;Yang, Sung-Il
    • The Journal of the Acoustical Society of Korea
    • /
    • 제23권4E호
    • /
    • pp.125-133
    • /
    • 2004
  • In this paper, we propose a critical banded wavelet packet-based spectral subtraction for speech enhancement. Critical banded wavelet packet, which reflects the human auditory system, may lead to minimization of intelligibility loss and quality improvement of the enhanced speech in the spectral domain, when combined with an appropriate spectral subtraction gain function. The proposed method shows better performance than the conventional one in comparative assessments. We also show that, for effective evaluation of enhanced speech, it is essential to consider the characteristics of speech quality measures.

A Study on Intonation Patterns of Speech Produced by Cochlear Implanted Children

  • Park, Sang-Hee;Jang, Tae-Yeoub;Lee, Sang-Heun;Jeong, Ok-Ran;Seok, Dong-Il
    • 음성과학
    • /
    • 제9권1호
    • /
    • pp.27-38
    • /
    • 2002
  • The purpose of the study is to examine intonation patterns of cochlear implanted children compared with those of normal hearing children. The data tokens of three normal and five cochlear implanted children were collected and investigated. Their intonation patterns were analyzed using the speech analysis tool, Praat. The characteristics of the two utterance types, interrogative and declarative, were investigated. No significant difference in intonation patterns between the two subject groups was found. However, the general pitch of cochlear implanted children was higher than that of normal hearing children. In addition, cochlear implanted children showed frequent pitch breaks.

  • PDF

수정된 MAP 적응 기법을 이용한 음성 데이터 자동 군집화 (Automatic Clustering of Speech Data Using Modified MAP Adaptation Technique)

  • 반성민;강병옥;김형순
    • 말소리와 음성과학
    • /
    • 제6권1호
    • /
    • pp.77-83
    • /
    • 2014
  • This paper proposes a speaker and environment clustering method in order to overcome the degradation of the speech recognition performance caused by various noise and speaker characteristics. In this paper, instead of using the distance between Gaussian mixture model (GMM) weight vectors as in the Google's approach, the distance between the adapted mean vectors based on the modified maximum a posteriori (MAP) adaptation is used as a distance measure for vector quantization (VQ) clustering. According to our experiments on the simulation data generated by adding noise to clean speech, the proposed clustering method yields error rate reduction of 10.6% compared with baseline speaker-independent (SI) model, which is slightly better performance than the Google's approach.

청각장애 성인의 일음절 낱말대조 명료도 특성 (Phonetic Contrasts of One-syllable Words and Speech Intelligibility in Adults with Hearing Impairments)

  • 김수진;도연지
    • 대한음성학회지:말소리
    • /
    • 제56호
    • /
    • pp.1-13
    • /
    • 2005
  • This study examined the speech intelligibility of one-syllable words with phonetic contrasts and analyzed segmental factors that can predict the overall speech intelligibility in hearing-impaired adults. To identify the speech error characteristics, a Korean word list was audio-recorded by 7 hearing-impaired adults, and 35 listeners selected the heard word out of 5 choices. Based in part on previous studies of speech of the hearing impaired, the word list consisted of monosyllabic consonant-vowel-consonant (CVC) real word pairs. Stimulus words included 77 phonetic contrast pairs. The results showed that the percentage of errors in final position (coda) contrast was higher than in any other position in syllable. And the intelligibility deficit factors of phonetic contrast in the hearing-impaired were analyzed through stepwise regression analysis. The overall intelligibility was predicted by the error rate of manner contrast at coda, voicing contrast (homorganic triplets) at onset and high-low contrast at nucleus.

  • PDF

다중칼만필터를 이용한 음성향상 (Speech Enhancement Using Multiple Kalman Filter)

  • 이기용
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1998년도 제15회 음성통신 및 신호처리 워크샵(KSCSP 98 15권1호)
    • /
    • pp.225-230
    • /
    • 1998
  • In this paper, a Kalman filter approach for enhancing speech signals degraded by statistically independent additive nonstationary noise is developed. The autoregressive hidden markov model is used for modeling the statistical characteristics of both the clean speech signal and the nonstationary noise process. In this case, the speech enhancement comprises a weighted sum of conditional mean estimators for the composite states of the models for the speech and noise, where the weights equal to the posterior probabilities of the composite states, given the noisy speech. The conditional mean estimators use a smoothing spproach based on two Kalmean filters with Markovian switching coefficients, where one of the filters propagates in the forward-time direction with one frame. The proposed method is tested against the noisy speech signals degraded by Gaussian colored noise or nonstationary noise at various input signal-to-noise ratios. An app개ximate improvement of 4.7-5.2 dB is SNR is achieved at input SNR 10 and 15 dB. Also, in a comparison of conventional and the proposed methods, an improvement of the about 0.3 dB in SNR is obtained with our proposed method.

  • PDF

웨이브렛 변환을 이용한 음성신호의 유성음/무성음/묵음 분류 (Voiced/Unvoiced/Silence Classification웨 of Speech Signal Using Wavelet Transform)

  • 손영호;배건성
    • 음성과학
    • /
    • 제4권2호
    • /
    • pp.41-54
    • /
    • 1998
  • Speech signals are, depending on the characteristics of waveform, classified as voiced sound, unvoiced sound, and silence. Voiced sound, produced by an air flow generated by the vibration of the vocal cords, is quasi-periodic, while unvoiced sound, produced by a turbulent air flow passed through some constriction in the vocal tract, is noise-like. Silence represents the ambient noise signal during the absence of speech. The need for deciding whether a given segment of a speech waveform should be classified as voiced, unvoiced, or silence has arisen in many speech analysis systems. In this paper, a voiced/unvoiced/silence classification algorithm using spectral change in the wavelet transformed signal is proposed and then, experimental results are demonstrated with our discussions.

  • PDF

회의실내 유리창 진동의 도청에 대한 연구 (A Study on the Eavesdropping of the Glass Window Vibration in a Conference Room)

  • 김석현;김윤호;허욱
    • 산업기술연구
    • /
    • 제27권A호
    • /
    • pp.55-60
    • /
    • 2007
  • Possibility of the eavesdropping is investigated on a conference room-glass window coupled system. Speech intelligibility analysis is performed on the eavesdropping sound of the glass window. Using MLS(Maximum Length Sequency) signal as a sound source, acceleration and velocity responses of the glass window are measured by accelerometer and laser doppler vibrometer. MTF(Modulation Transfer Function) is used to identify the speech transmission characteristics of the room and window system. STI(Speech Transmission Index) is calculated by using MTF and speech intelligibility of the vibration sound is estimated. Speech intelligibilities by the acceleration signal and the velocity signal are compared.

  • PDF

음향 파라미터에 의한 정서적 음성의 음질 분석 (Analysis of the Voice Quality in Emotional Speech Using Acoustical Parameters)

  • 조철우;리타오
    • 대한음성학회지:말소리
    • /
    • 제55권
    • /
    • pp.119-130
    • /
    • 2005
  • The aim of this paper is to investigate some acoustical characteristics of the voice quality features from the emotional speech database. Six different parameters are measured and compared for 6 different emotions (normal, happiness, sadness, fear, anger, boredom) and from 6 different speakers. Inter-speaker variability and intra-speaker variability are measured. Some intra-speaker consistency of the parameter change across the emotions are observed, but inter-speaker consistency are not observed.

  • PDF