• Title/Summary/Keyword: LPC analysis

Search Result 95, Processing Time 0.019 seconds

A Study on the Consonant Classification Using Fuzzy Inference (퍼지추론을 이용한 한국어 자음분류에 관한 연구)

  • 박경식
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1992.06a
    • /
    • pp.71-75
    • /
    • 1992
  • This paper proposes algorithm in order to classify Korean consonant phonemes same as polosives, fricatives affricates into la sounds, glottalized sounds, aspirated sounds. This three kinds of sounds are one of distinctive characters of the Korean language which don't eist in language same as English. This is thesis on classfication of 14 Korean consonants(k, t, p, s, c, k', t', p', s', c', kh, ph, ch) as a previous stage for Korean phone recognition. As feature sets for classification, LPC cepstral analysis. The eperiments are two stages. First, using short-time speech signal analysis and Mahalanobis distance, consonant segments are detected from original speech signal, then the consonants are classified by fuzzy inference. As the results of computer simulations, the classification rate of the speech data was come to 93.75%.

  • PDF

Implementation and Performance Analysis of a Speaker Verification System (화자 확인 시스템의 설계 제작 및 성능 분석)

  • 권석규;이병기
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.30B no.3
    • /
    • pp.1-9
    • /
    • 1993
  • This paper discusses issues on the disign and implementation of real-time automatic speaker verification system, as well as the performance analysis of the implemented system. The system employs TI's TMS320C25 digital signal processor TMS320C25 and high speed SRAMs. The system is designed to be used stand-alone as well as via hand-shaking with IBM-PC. The speech parameters used for speaker verification are PARCOR and LPC-cepstrum coefficients, and the employed decision logics are those based on the generalized weighted distance comcept. The implemented system showed the performance of 5.3% error rate for the PARCOR coefficient, and 4.7% error rate for the LPG-cepstrum coefficient.

  • PDF

Compression of LSP Coefficents Using Principal Component Analysis (Principal component analysis를 이용한 LSP 계수의 압축기법)

  • Ahn Haeyong;Lee Chulhee
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.85-88
    • /
    • 2001
  • Line spectrum pair(LSP) 계수는 양자화 오류에 강하고. 선형 릴간에 효율적이며, 필터의 안정성 판정이 용이하므로 LPC를 대신하여 음성 부호화에 널리 사용되고 있다. 일반적으로 LSP 계수간에는 일정한 상관관계가 나타나고, 이 특성을 이용하면 LSP 계수의 부호량을 줄일 수 있는 가능성이 있나. 본 논문에서는 LSP 계수를 압축하기 위해 principal component analysis(PCA)를 사용한 방법을 제안한다. 제안된 방법에서는 LSP 계수를 Karhunen-Loeve(KL) 변환해 에너지가 집중되는 고유치(eigenvalue)와 고유벡터(eigenvector)를 찾고 값을 양자화 한다. 성능 평가를 위해 2.4kbps MELP(mixed excitation linear prediction)와 8kbps QCELP(qualcumn code excited linear prediction) 음성 부호화기를 사용해 결과 값을 비교했고, 압축률이 증가하는 것을 확인했다.

  • PDF

A Study of the Predictive Effectiveness of Stem and Root Extracts of Cannabis sativa L. Through Network Pharmacological Analysis (네트워크 분석기반을 통한 대마 줄기 및 뿌리 추출물의 약리효능 예측연구)

  • Myung-Ja Shin;Min-Ho Cha
    • Journal of Life Science
    • /
    • v.34 no.3
    • /
    • pp.179-190
    • /
    • 2024
  • Cannabis sativa is a plant widely cultivated worldwide and has been used as a material for food, medicine, building materials and cosmetics. In this study, we assessed the functional effects of C. sativa stem and root extracts using network pharmacology and confirmed their novel functions. The components in stem and root ethanol extracts were identified by gas chromatography-mass spectrometry analysis, and networks between the components and proteins were constructed using the STICHI database. Functional annotation of the proteins was performed using the KEGG pathway. The effects of the extracts were confirmed in lysophosphatidylcholine-induced THP-1 cells using real-time PCR. A total of 21 and 32 components were identified in stem and root extracts, respectively, and 147 and 184 proteins were linked to stem and root components, respectively. KEGG pathway analysis showed that 69 pathways, including the MAPK signaling pathway, were commonly affected by the extracts. Further investigation using pathway networks revealed that terpenoid backbone biosynthesis was likely affected by the extracts, and the expression of the MVK and MVD genes, key proteins in terpenoid backbone biosynthesis, was decreased in LPC-induced THP-1 cells. Therefore, this study determined the diverse function of C. sativa extracts, providing information for predicting and researching the effects of C. sativa.

An Extensive Analysis of High-density Electroencephalogram during Semantic Decision of Visually Presented Words

  • Kim, Kyung-Hwan;Kim, Ja-Hyun
    • Journal of Biomedical Engineering Research
    • /
    • v.27 no.4
    • /
    • pp.170-179
    • /
    • 2006
  • The purpose of this study was to investigate the spatiotemporal cortical activation pattern and functional connectivity during visual perception of words. 61 channel recordings of electroencephalogram were obtained from 15 subjects while they were judging the meaning of Korean, English, and Chinese words with concrete meanings. We examined event-related potentials (ERP) and applied independent component analysis (ICA) to find and separate simultaneously activated neural sources. Spectral analysis was also performed to investigate the gamma-band activity (GBA, 30-50 Hz) which is known to reflect feature binding. Five significant ERP components were identified and left hemispheric dominance was observed for most sites. Meaningful differences of amplitudes and latencies among languages were observed. It seemed that familiarity with each language and orthographic characteristics affected the characteristics of ERP components. ICA helped confirm several prominent sources corresponding to some ERP components. The results of spectral and time-frequency analyses showed distinct GBAs at prefrontal, frontal, and temporal sites. The GBAs at prefrontal and temporal sites were significantly correlated with the LPC amplitude and response time. The differences in spatiotemporal patterns of GBA among languages were not prominent compared to the inter-individual differences. The gamma-band coherence revealed short-range connectivity within frontal region and long-range connectivity between frontal, posterior, and temporal sites.

The implementation of Korean adult's optimal formant setting by Praat scripting (성인 포먼트 측정에서의 최적 세팅 구현: Praat software와 관련하여)

  • Park, Jiyeon;Seong, Cheoljae
    • Phonetics and Speech Sciences
    • /
    • v.11 no.4
    • /
    • pp.97-108
    • /
    • 2019
  • An automated Praat script was implemented to measure optimal formant frequencies for adults. Optimal formant analysis could be interpreted to show that the deviation of formant frequency that resulted from the two variously combined setting parameters (maximum formant and number of formants) was minimal. To increase the reliability of formant analysis, LPC order should be set differently, based on the gender or vowel type. Praat recommends 5,000 Hz and 5,500 Hz as maximum formant settings and, at the same time, recommends 5 as the number of formants for males and females. However, verification is needed to determine whether these recommended settings are valid for Korean vowels. Statistical analysis showed that formant frequencies significantly varied across the adapted scripts, especially with respect to the data on females. Formant plots and statistical results showed that linear_script and qtone_script are much more reliable in formant measurements. Among four kinds of scripts, the linear and qtone_scripts proved to be more stable and reliable. While the linear_script was designed to have a linearly increased formant step in for-loop, the increment of formant step in the qtone_script was arranged by quarter tone scale (base frequency×common ratio ($\sqrt[24]{2}$)). When looking at the tendency of the formant setting drawn by the two referred algorithms in the context of front vowel [i, e], the maximum formant was set higher; and the number of formants set at a lower value than recommended by Praat. The back vowel [o, u], on the contrary, has a lower maximum formant and a higher number of formants than the standard setting.

Speech training aids for deafs (청각 장애자용 발음 훈련 기기의 개발)

  • 김동준;윤태성;박상희
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1991.10a
    • /
    • pp.746-751
    • /
    • 1991
  • Deafs train articulation by observing mouth of a tutor. sensing tactually the notions of the vocal organs, or using speech training aids. Present speech training aids for deafs can measure only single speech ter, or display only frequency spectra in histogrm or pseudo-color. In this study, a speech training aids that can display subject's articulation in the form of a cross section of the vocal organs and other speech parameters together in a single system Is aimed to develop and this system makes a subject to know where to correct. For our objective, first, speech production mechanism is assumed to be AR model in order to estimate articulatory notions of the vocal tract from speech signal. Next, a vocal tract profile mode using LPC analysis is made up. And using this model, articulatory notions for Korean vowels are estimated and displayed in the vocal tract profile graphics.

  • PDF

A Study on the Effect of the Vibration and Particle Generation of a Spin Coater on Thin Film Coating (회전박막제조기의 진동 및 입자발생이 박막제조에 미치는 영향에 관한 연구)

  • 허진욱;권태종;정진태;한창수;안강호
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.11 no.4
    • /
    • pp.31-36
    • /
    • 2001
  • A spin coater is a machine to coat wafer or LCD display with thin film. Vibration in the spin coater may be one of main troubles in the coating process. In this paper, we focus on the difference between two spin coaters. Vibration sources are identified by experimental approach and are compared to find the difference between the two spin coaters. Also, the particle concentration is observed by laser particle counter (LPC) for the two spin coaters, when the spin coaxers are working. It is also considered whether the defect rate is proportional to the particle concentration. The result shows that particle generation in the coating process is related to excessive vibration of the spin coater shaft and the particles influence the defect rate of the thin film product.

  • PDF

Introduction to the Spectrum and Spectrogram (스팩트럼과 스팩트로그램의 이해)

  • Jin, Sung-Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.19 no.2
    • /
    • pp.101-106
    • /
    • 2008
  • The speech signal has been put into a form suitable for storage and analysis by computer, several different operation can be performed. Filtering, sampling and quantization are the basic operation in digiting a speech signal. The waveform can be displayed, measured and even edited, and spectra can be computed using methods such as the Fast Fourier Transform (FFT), Linear predictive Coding (LPC), Cepstrum and filtering. The digitized signal also can be used to generate spectrograms. The spectrograph provide major advantages to the study of speech. So, author introduces the basic techniques for the acoustic recording, digital signal processing and the principles of spectrum and spectrogram.

  • PDF

A Study on the Phonemic Segmentation of an Initial Affricate (초성파찰음의 음소분류에 관한 연구)

  • Kim, Ki-Woon;Lee, Ki-Young;Bae, Chul-Soo;Choi, Kap-Seok
    • Proceedings of the KIEE Conference
    • /
    • 1988.07a
    • /
    • pp.33-36
    • /
    • 1988
  • In this paper, the starting point of affricate is detected from the first predictor coefficient of a 12-pole linear predictive coding (LPC) analysis and phonemic segmentation is done through measuring short time energy and zero crossing rate. By this segmentation method, the duration of an aspirate can be mearsured in order to detect an aspirate or not.

  • PDF