• 제목/요약/키워드: Formant Analysis

검색결과 191건 처리시간 0.032초

피치 검출을 위한 스펙트럼 평탄화 기법 (Flattening Techniques for Pitch Detection)

  • 김종국;조왕래;배명진
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 하계종합학술대회 논문집(4)
    • /
    • pp.381-384
    • /
    • 2002
  • In speech signal processing, it Is very important to detect the pitch exactly in speech recognition, synthesis and analysis. but, it is very difficult to pitch detection from speech signal because of formant and transition amplitude affect. therefore, in this paper, we proposed a pitch detection using the spectrum flattening techniques. Spectrum flattening is to eliminate the formant and transition amplitude affect. In time domain, positive center clipping is process in order to emphasize pitch period with a glottal component of removed vocal tract characteristic. And rough formant envelope is computed through peak-fitting spectrum of original speech signal in frequency domain. As a results, well get the flattened harmonics waveform with the algebra difference between spectrum of original speech signal and smoothed formant envelope. After all, we obtain residual signal which is removed vocal tract element The performance was compared with LPC and Cepstrum, ACF 0wing to this algorithm, we have obtained the pitch information improved the accuracy of pitch detection and gross error rate is reduced in voice speech region and in transition region of changing the phoneme.

  • PDF

최고도이상의 청력손실을 가진 아동의 모음음형대 분석 (An Acoustic Analysis of Vowels for Severe-profound Hearing Impaired Children)

  • 허명진
    • 음성과학
    • /
    • 제14권2호
    • /
    • pp.65-71
    • /
    • 2007
  • The severe-profound hearing impaired children have various disorders in everday communication due to the lack of hearing feedback. Especially, their speech produced unstable voice, omission and distortion of articulation, pitch break, cul-de-sac voice, and so on so that they were difficult to accurately deliver an intended message. This study attempts to analyze the acoustic characteristics of 4 vowel sounds produced by 35 severe-profound hearing impaired children using CSL(Computerized Speech Lab, Model 4300b). The formant data were obtained from the spectrogram and analyzed data by 12 formant filter and auto-correlation among the formants. Results showed that the hearing impaired children's formant values came out very high. They produced the vowels at the mode of hypertension with unstable voice. In order to improve their speech, they would need some adequate auditory feedback.

  • PDF

NOISE ROBUST FORMANT FREQUENCY ESTIMATION BASED ON COMPLEX AUTOCORRELATION FUNCTION

  • Diankha, Ousmane;Shimamura, Tetsuya
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 ITC-CSCC -3
    • /
    • pp.1799-1802
    • /
    • 2002
  • This paper proposes an improved method for formant frequencies estimation based on the complex autocorrelation function of the speech signal. Instead of using the incoming signal as an input fur the LPC analysis, the analytic signal of the autocorrelation function of the speech signal is computed and itself used as an input for the LPC analysis. Due to the properties of the analytic signal, which occupies half of the bandwidth of the original signal, the required model order for the LPC analysis is halved. The accuracy of the proposed method in noisy environments is examined on five natural vowels. The effectiveness of the proposed method is shown by the estimated spectral shapes and the estimation errors of the formant frequencies.

  • PDF

돼지의 수.포유 행동 I. 수유 행동에서 모돈(랜드레이스$\times$요크셔) 발성음의 특성 (Nursing and Suckling Behaviour in Domestic Pigs 1. Characteristics of the Grunting Sound of the Sow(Landrace $\times$ Yorkshire) during Nursing Behaviour)

  • 장홍희;연성찬
    • 한국임상수의학회지
    • /
    • 제19권2호
    • /
    • pp.191-194
    • /
    • 2002
  • The nursing vocalization of domestic pigs(Landrace$\times$Yorkshire) was investigated with respect to common features. All vocalizations uttered during nursings in 5 sows at 5 days after farrowing were recorded and 305 grunts were processed in a spectrograph. The sow's repeated grunting during nursing can be regarded as a contact call and a signal of the mother to start and synchronize the suckling behavior of the piglets. Analysis in the time domain revealed the gross structure of the call, whereas in the frequency domain the fine structure of single grunts was investigated. Nursing interval, duration of nursing behavior, duration of grunt, grunt rate per 10 seconds, fundamental frequency, 1 formant, 2 formant, 3 formant, 4 formant and spectrum were investigated. The results showed that mean interval between the nursing following one another was 25, 4.6 min and duration of nursing behavior was 3.2 $\pm$ 0.7 min. Average duration of grunt was 203.9 $\pm$ 63.6 ms. The formant contours could be identified. The nursing behavior might be disturbed by the grunts of alien sow.

한국어 비음의 음향학적 구분을 위한 장구간 스펙트럼(LTAS) 분석 (Long Term Average Spectral Analysis for Acoustical Discrimination of Korean Nasal Consonants)

  • 최순애;성철재
    • 대한음성학회지:말소리
    • /
    • 제60호
    • /
    • pp.67-84
    • /
    • 2006
  • The purpose of this study is to find some acoustic parameters on frequency domain to distinguish the Korean nasals, $/m,\;n,\;{\eta}/$ from each other. The new parameters are devised on the basis of LTAS (Long Term Average Spectrum). The maximum peak amplitude and the relevant formant frequency are measured in low and high frequency range, respectively. The frequency of spectral valley and its energy level are also obtained in the specific frequency range of the spectrum. Spectral slope, total energy value in specific frequency range, statistical distribution of spectral energy like centroid, skewness, and kurtosis are suggested as new parameters as well. The parameters that show statistically significant differences across nasals are summerized as follows. 1) in syllable initial positions: the total energy value from 1,500 to 2,200 Hz(zeroENG); 2) in syllable final positions: the peak amplitude of the first formant(peak1_a), the formant frequency with maximum peak amplitude from 4,000 to 8,000 Hz(peak2_f), the maximum peak amplitude of the formant frequency from 4,000 to 8,000 Hz(peak2_a), and the total energy value from 1,500 to 2,200 Hz(zeroENG).

  • PDF

음성신호 분석을 적용한 이침요법(耳針療法)에 따른 심장 기능 향상 측정 (Measurement of Cardiac Function Improvement by Auricular Acupuncture Applying Speech Signal Analysis)

  • 김봉현;조동욱;한길성
    • 한국산학기술학회논문지
    • /
    • 제12권12호
    • /
    • pp.5588-5593
    • /
    • 2011
  • 본 논문에서는 심장에 해당하는 이(耳)혈 상응점을 자극하여 심장과 관련된 음성분석 요소의 변화를 측정하였다. 이를 위해 심장에 이상이 없는 피실험자 10명을 선정하고 심장에 해당하는 이혈 상응점을 자극하기 전과 후의 음성을 수집하였다. 실험은 음성분석 요소 중 심장과 관련된 Jitter와 2 Formant Frequency Bandwidths를 적용하여 심장 이혈 자극 전과 후의 변화를 측정, 분석하였다. 실험 결과 90%의 피실험자가 Jitter와 2 Formant Frequency Bandwidths 값이 감소하는 현상을 보였으며 이를 통해 이혈 자극에 따른 심장과 음성의 상관성을 분석할 수 있었다. 끝으로 실험에 의해 제안한 방법의 유용성을 입증하고자 한다.

편도적출술로 음성변화가 올 수 있는 편도 상태에 관한 연구 (The Study of Tonsil Affected Voice Quality after Tonsillectomy)

  • 안철민;정덕희
    • 대한후두음성언어의학회지
    • /
    • 제9권1호
    • /
    • pp.32-37
    • /
    • 1998
  • Tonsillectomy is the one of operation that is performed the most commonly in otolaryngology field. Many changes that include range of voice, tone, voice quality and resonance were made by tonsillectomy. Sometimes, any patients taken tonsillectomy has suffer from these voice problem after tonsillectomy. However there are less study for these problems until now. Then, we studied to find the anatomical findings that affected the voice quality when tonsillectomy was performed. We evaluated the voice in 2 groups, one is the group showed the normal pharyngeal space by using the transnasal fiberscopy, the other is group showed medially bulging tonsil at pharyngeal cavity by using same method, with perceptual evaluation, nasalance score, nasality, oral formant and nasal formant. We used the computerized speech analysis system, the nasometer and the spectrogram in the CSL program. We could not find any differences in perceptual evaluation between two groups. But objective measures were provided. Nasalance score and nasality on the nasometric analysis were increased significantly and oral formant on the spectrogram was changed singnificantly after tonsillectomy in Group 2. Authors thought medially bulging tonsil in the pharynx is able to affect the voice quality after tonsillectomy when we evaluted through the nasal cavity by the using of fiberscopy and this evaluation would be important especially in singers.

  • PDF

벅아이 코퍼스에서의 젊은 성인 남성의 모음 포먼트 분석 (An Analysis of the Vowel Formants of the Young Males in the Buckeye Corpus)

  • 윤규철;노혜욱
    • 말소리와 음성과학
    • /
    • 제4권2호
    • /
    • pp.41-49
    • /
    • 2012
  • The purpose of this paper is to extract the vowel formants of the ten young male speakers from the Buckeye Corpus of Conversational Speech [1] and to analyze them in comparison to earlier works in terms of various phonetic factors that are expected to affect the realization of the formant distribution. The first two formant frequency values were automatically extracted with a Praat script along with such factors as the place of articulation, the content versus function word information, syllabic stress information, the location in a word, location in utterance, speech rate of three consecutive words, and the word frequency in the corpus. The results indicated that the formant patterns from the corpus were very different from those of earlier works although the overall pattern was similar and that the factors were strongly responsible for the realization of the two formants. The purpose of this paper is to extract the vowel formants of the ten young male speakers from the Buckeye Corpus of Conversational Speech [1] and to analyze them in comparison to earlier works in terms of various phonetic factors that are expected to affect the realization of the formant distribution. The first two formant frequency values were automatically extracted with a Praat script along with such factors as the place of articulation, the content versus function word information, the syllabic stress information, the location in a word, the location in an utterance, the speech rate of the three consecutive words, and the word frequency in the corpus. The result indicated that the formant patterns from the corpus were very different from those of earlier works although the overall pattern was similar and that the factors were strongly responsible for the realization of the two formants.

외부 자극에 따른 부비동과 포먼트주파수와의 상관성 분석 (Correlation Analysis of Between Paranasal Sinuses and Formant Frequency According to External Stimulation)

  • 김봉현
    • 한국정보통신학회논문지
    • /
    • 제17권8호
    • /
    • pp.1955-1961
    • /
    • 2013
  • 부비동은 얼굴에서 뼈 속에 존재하는 공기로 가득 찬 빈 공간이다. 그러나 부비동에 지속적으로 염증이 생기고 고름이 차면 축농증으로 발병하여 두통과 무기력증을 호소하고 음성의 변화를 가져온다. 따라서 본 논문에서는 외부 자극을 통해 부비동의 변화를 음성분석 요소로 측정하여 부비동 관련 질환을 예측하는 연구와 전두동, 사골동, 상악동, 접형동으로 구성된 부비동의 영역별 기능을 분석하는 연구를 수행하였다. 이를 위해 부비동 영역에 냉찜질 자극을 시행하고 자극 전과 후의 음성에 대한 포먼트주파수를 측정하여 상호간의 상관성 분석을 통해 외부 자극이 부비동에 미치는 영향을 분석하였다.

Neural Spike Train Decoding에 기반한 인공와우 어음처리방식 성능평가 (Performance Evaluation of Cochlear Implants Speech Processing Strategy Using Neural Spike Train Decoding)

  • 김두희;김진호;김경환
    • 대한의용생체공학회:의공학회지
    • /
    • 제28권2호
    • /
    • pp.271-279
    • /
    • 2007
  • We suggest a novel method for the evaluation of cochlear implant (CI) speech processing strategy based on neural spike train decoding. From formant trajectories of input speech and auditory nerve responses responding to the electrical pulse trains generated from a specific CI speech processing strategy, optimal linear decoding filter was obtained, and used to estimate formant trajectory of incoming speech. Performance of a specific strategy is evaluated by comparing true and estimated formant trajectories. We compared a newly-developed strategy rooted from a closer mimicking of auditory periphery using nonlinear time-varying filter, with a conventional linear-filter-based strategy. It was shown that the formant trajectories could be estimated more exactly in the case of the nonlinear time-varying strategy. The superiority was more prominent when background noise level is high, and the spectral characteristic of the background noise was close to that of speech signals. This confirms the superiority observed from other evaluation methods, such as acoustic simulation and spectral analysis.