• Title/Summary/Keyword: LPC analysis

Search Result 95, Processing Time 0.026 seconds

Acoustic Analysis of Singing Voice (성악도의 두성구와 흉성구 발성에 대한 음향학적 분석)

  • 진성민
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.13 no.1
    • /
    • pp.52-58
    • /
    • 2002
  • The pitch range of the human voice is variable, extending from chest register to falsetto. Although numerous studies have investigated after laryngeal mechanism description of registers, systematic and objective studies were lack. The purpose of this study was to analyze and compare head register with chest register of singers acoustically. Fifteen healthy tenor major students were selected. Fifteen healthy untrained adults were the control group for this study. Long term average(LTA) power spectrum using the Fast Fourier transform(FFT) algorithm and Linear predictive coding (LPC) filter response were made during /a/ sustained in both head(G4, 392Hz) md chest registers (C3, 131Hz). Statistical analysis was performed using Mann-Whitney test. In the LTA power spectrum, head register of singer has increased level(energy gain) in the frequency band of 2.2-3.4kHz(p<0.01), and 7.5-8.4kHz(p<0.01, p<0.05). Chest register of singer has increased level in the frequency band of 2.2-3.1kHz(p<0.01), 7.8-8.4kHz(p<0.05) and around 9.6kHz(p<0.01). LTA power spectrum reveals a peak of acoustic energy around 2500Hz known as the singer's formant and another peak of acoustic energy around 8000Hz in singer's voice.

  • PDF

Highband Coding Method Using Matching Pusuit Estimation and CELP Coding for Wideband Speech Coder (광대역 음성부호화기를 위한 매칭퍼슈잇 알고리즘과 CELP 방법을 이용한 고대역 부호화 방법)

  • Jeong Gyu-Hyeok;Ahn Yeong-Uk;Kim Jong-Hark;Shin Jae-Hyun;Seo Sang-Won;Hwang In-Kwan;Lee In-Sung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.25 no.1
    • /
    • pp.21-29
    • /
    • 2006
  • In this Paper a split bandwidth wideband speech coder and its highband coding method are Proposed. The coder uses a split-band approach. where the wideband input speech signal is split into two equal frequency bands from 0-4kHz and 4-8kHz. The lowband and the highband are coded respectively by the 11.8kb/s G.729 Annex E and the proposed coding method. After the LPC analysis, the highband is divided by two modes according to the properties of signals. In stationary mode. the highband signals are compressed by the mixture excitation model; CELP algorithm and W (Matching Pursuit) algorithm. The others are coded by the only CELP algorithm. We compare the performance of the new wideband speech coder with that of G.722 48kbps SB-ADPCM and G.722.2 12.85kbps in a subjective method. The simulation results show that the Performance of the proposed wideband speech coder has better than that of 48kbps G.722 and no better than that of 12.85kbps G.722.2.

A Study on the Technique of Spectrum Flattening for Improved Pitch Detection (개선된 피치검출을 위한 스펙트럼 평탄화 기법에 관한 연구)

  • 강은영;배명진;민소연
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.3
    • /
    • pp.310-314
    • /
    • 2002
  • The exact pitch (fundamental frequency) extraction is important in speech signal processing like speech recognition, speech analysis and synthesis. However the exact pitch extraction from speech signal is very difficult due to the effect of formant and transitional amplitude. So in this paper, the pitch is detected after the elimination of formant ingredients by flattening the spectrum in frequency region. The effect of the transition and change of phoneme is low in frequency region. In this paper we proposed the new flattening method of log spectrum and the performance was compared with LPC method and Cepstrum method. The results show the proposed method is better than conventional method.

Long Term Average Spectrum Characteristics of Head and Chest Register Sounds of Western Operatic Singers - Possibility of a Second Singer's Formant-

  • Jin, Sung-Min;Kwon, Young-Kyung;Song, Yun-Kyung
    • Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.99-109
    • /
    • 2003
  • The purpose of this study was to analyze and compare head register with chest register of singers acoustically. Fifteen healthy tenor major students were participated. Fifteen healthy untrained adults were chosen as the control group for this study. Long term average (LTA) power spectrum using the Fast Fourier transform (FFT) algorithm and Linear predictive coding (LPC) filter response were made with /a/ sustained in both head (G4, 392 Hz) and chest registers (C3, 131 Hz). Statistical analysis was performed using the Mann-Whitney test. In the LTA power spectrum, head register of singers increased in the level of energy gain within the frequency of 2.2-3.4 kHz (p<0.01), and 7.5-8.4 kHz (p<0.01, p<0.05). Chest register of singers increased in the frequency of 2.2-3.1 kHz (p<0.01), 7.8-8.4 kHz (p<0.05) and around 9.6 kHz (p<0.01). The LTA power spectrum revealed a peak of acoustic energy around 2,500 Hz, known as the singer's formant and another peak of acoustic energy around 8,000 Hz in the singer's voice.

  • PDF

Perceptual Boundary on a Synthesized Korean Vowel /o/-/u/ Continuum by Chinese Learners of Korean Language (/오/-/우/ 합성모음 연속체에 대한 중국인 한국어 학습자의 청지각적 경계)

  • Yun, Jihyeon;Kim, EunKyung;Seong, Cheoljae
    • Phonetics and Speech Sciences
    • /
    • v.7 no.4
    • /
    • pp.111-121
    • /
    • 2015
  • The present study examines the auditory boundary between Korean /o/ and /u/ on a synthesized vowel continuum by Chinese learners of Korean language. Preceding researches reported that the Chinese learners have difficulty pronouncing Korean monophthongs /o/ and /u/. In this experiment, a nine-step continuum was resynthesized using Praat from a vowel token from a recording of a male announcer who produced it in isolated form. F1 and F2 were synchronously shifted in equal steps in qtone (quarter tone), while F3 and F4 values were held constant for the entire stimuli. A forced choice identification task was performed by the advanced learners who speak Mandarin Chinese as their native language. Their experiment data were compared to a Korean native group. ROC (Receiver Operating Characteristic) analysis and logistic regression were performed to estimate the perceptual boundary. The result indicated the learner group has a different auditory criterion on the continuum from the Korean native group. This suggests that more importance should be placed on hearing and listening training in order to acquire the phoneme categories of the two vowels.

Noise Source Identification and Countermeasure for the Noise of LPG Injector (LPC 인젝터의 소음원 규명 및 소음저감 대책)

  • Kim, Won-Jin;Park, Chong-Hyun;Kim, Sung-Dae;Lee, Byung-Ho
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.19 no.3
    • /
    • pp.144-151
    • /
    • 2002
  • This work focuses on finding out the noise source and the method of reducing the noise level of LPG(liquefied petroleum gas) fuel injector. The noise of LPG injector in operating condition is due to the impact between valve and valve seat. This study shows that if the revolution of engine is increased, the noise of LPG injector will be more serious but it is not nearly affected by the increment of fuel pressure. The source and transmission paths of noise are identified through the analysis of noise generation mechanism and noise spectrum. The sound absorbing material is tested to verify its efficiency of sound absorption thor the LPG injector. The effect of noise reduction of absorbing material is remarkable when the engine speed is high. Consequently two methods of reducing the noise level are suggested from the identified results. The one is to equip the absorbing material on the outer side of injector and the other is to coat with a soft material or equip a soft ring on the surface of impact.

Analysis of the Time Delayed Effect for Speech Feature (음성 특징에 대한 시간 지연 효과 분석)

  • Ahn, Young-Mok
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.1
    • /
    • pp.100-103
    • /
    • 1997
  • In this paper, we analyze the time delayed effect of speech feature. Here, the time delayed effect means that the current feature vector of speech is under the influence of the previous feature vectors. In this paper, we use a set of LPC driven cepstal coefficients and evaluate the time delayed effect of cepstrum with the performance of the speech recognition system. For the experiments, we used the speech database consisting of 22 words which uttered by 50 male speakers. The speech database uttered by 25 male speakers was used for training, and the other set was used for testing. The experimental results show that the time delayed effect is large in the lower orders of feature vector but small in the higher orders.

  • PDF

A Study on A Multi-Pulse Linear Predictive Filtering And Likelihood Ratio Test with Adaptive Threshold (멀티 펄스에 의한 선형 예측 필터링과 적응 임계값을 갖는 LRT의 연구)

  • Lee, Ki-Yong;Lee, Joo-Hun;Song, Iick-Ho;Ann, Sou-Guil
    • The Journal of the Acoustical Society of Korea
    • /
    • v.10 no.1
    • /
    • pp.20-29
    • /
    • 1991
  • A fundamental assumption in conventional linear predictive coding (LPC) analysis procedure is that the input to an all-pole vocal tract filter is white process. In the case of periodic inputs, however, a pitch bias error is introduced into the conventional LP coefficient. Multi-pulse (MP) LP analysis can reduce this bias, provided that an estimate of the excitation is available. Since the prediction error of conventional LP analysis can be modeled as the sum of an MP excitation sequence and a random noise sequence, we can view extracting MP sequences from the prediction error as a classical detection and estimation problem. In this paper, we propose an algorithm in which the locations and amplitudes of the MP sequences are first obtained by applying a likelihood ratio test (LRT) to the prediction error, and LP coefficients free of pitch bias are then obtained from the MP sequences. To verify the performance enhancement, we iterate the above procedure with adaptive threshold at each step.

  • PDF

Characterization of Aerosol Concentration during Severe Asian Dust Period at Busan, Korea in 20 March 2010 (2010년 3월 20일 부산지역에 발생한 극심한 황사의 에어로솔 농도 분포 특성)

  • Jung, Woon-Seon;Park, Sung-Hwa;Lee, Dong-In;Kang, Deok-Du;Kim, Dongchul
    • Journal of Environmental Science International
    • /
    • v.23 no.2
    • /
    • pp.275-289
    • /
    • 2014
  • Asian dust (or yellow sand) occurring mainly in spring in East Asia is affected by the distribution of weather systems. This study was performed to investigate the characteristics of suspended particulate for Asian dust at Busan, Korea in 20 March 2010, which was one of the extreme case for the last 10 years. There was used the data of weather chart, satellite, automatic weather system (AWS), $PM_{10}$, laser particle counter (LPC), and backward trajectories model. In synoptically, the high pressure was located in the northwestern part and low pressure was located in the northeastern part of Korea. The strong westerly winds from surface to upper layer makes it possible to move air masses rapidly. Air masses passing through Gobi Desert in Mongolia and Inner Mongolia plateau covered the entire Korean peninsula. As the results of aerosol analysis, $PM_{10}$ concentration at Gudeok mountain in Busan was recorded $2,344{\mu}g/m^3$ in 2300 LST 20 March 2010 and their concentration was markedly increased at coarse mode particle. In surface condition, westerly wind about 3 ~ 5 m/s was dominant and small particles of $0.3{\sim}0.5{\mu}m$ were distributed on the whole. In heavy metal components analysis, the elements from the land was predominated.

Design of a 4kb/s ACELP Codec Using the Generalized AbS Principle (Generalized AbS 구조를 이용한 4kb/s ACELP 음성 부호화기의 설계)

  • 성호상;강상원
    • The Journal of the Acoustical Society of Korea
    • /
    • v.18 no.7
    • /
    • pp.33-38
    • /
    • 1999
  • In this paper, we combine a generalized analysis-by-synthesis (AbS) structure and an algebraic excitation scheme to propose a new 4kb/s speech codec. This codec partly uses the structure of G.729. We design a line spectrum pair (LSP) quantizer, an adaptive codebook, and an excitation codebook to fit the 4 kb/s bit rate. The codec has a 25㎳ algorithmic delay, which corresponds to a 20㎳ frame size and a 5㎳ lookahead. At the bit rates below 4kb/s, most CELP speech codecs using the AbS principle have a drawback that results a rapid degradation of speech quality. To overcome this drawback we use the generalized AbS structure which is efficient for the low bit rate speech codec. LP coefficients are converted to LSP and quantized using a predictive 2-stage VQ. A low complexity algebraic codebook which uses shifting method is used for the fixed codebook excitation, and gains of the adaptive codebook and the fixed codebook are quantized using the VQ. To evaluate the performance of the proposed codec A-B preference tests are done with the fixed rate 8kb/s QCELP. As the result of the test, the performance of the codec is similar to that of the fixed rate 8kb/s QCELP.

  • PDF