Search | Korea Science

A Study on the Algorithm Development for Speech Recognition of Korean and Japanese (한국어와 일본어의 음성 인식을 위한 알고리즘 개발에 관한 연구)

Lee, Sung-Hwa;Kim, Hyung-Lae
- Journal of IKEEE / v.2 no.1 s.2 / pp.61-67 / 1998
In this study, speaker recognition experiments were performed using a multilayer feedforward neural network (MFNN) model with Korean and Japanese digits. Five adult males and five adult females pronounced the digits 0 to 9 in Korean and Japanese seven times each. Feature coefficients were then extracted through a pitch deletion algorithm, LPC analysis, and LPC cepstral analysis to generate the input patterns for the MFNN. Five of the seven utterances were used to train the neural network, and two were used to measure its performance. For both Korean and Japanese, the pitch coefficients performed about 4% better than the LPC or LPC cepstral coefficients.
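The LPC and LPC cepstral analysis steps named in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the common autocorrelation method with the Levinson-Durbin recursion and the standard LPC-to-cepstrum recursion, and the function names are hypothetical.

```python
def autocorrelation(frame, max_lag):
    """Autocorrelation r[0..max_lag] of one (already windowed) frame."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LPC normal equations.

    Returns (a, err) with a[0] == 1, so that A(z) = 1 + sum_k a[k] z^-k
    and err is the final prediction error energy.
    """
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k
    return a, err


def lpc_to_cepstrum(a, n_ceps):
    """Cepstrum c[1..n_ceps] of the all-pole model 1/A(z)."""
    p = len(a) - 1
    c = [0.0] * (n_ceps + 1)
    for n in range(1, n_ceps + 1):
        acc = sum(k * c[k] * a[n - k] for k in range(max(1, n - p), n))
        c[n] = -(a[n] if n <= p else 0.0) - acc / n
    return c[1:]
```

In a recognizer of the kind described, the resulting coefficient vector for each frame would form part of the network's input pattern; the model order and number of cepstral terms are left open here because the abstract does not state them.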

A new Implementation of Perceptual LPC Cepstrum and its Application to Speech Recognition (인지 LPC cepstrum의 새로운 구현 및 음성인식에의 적용)

Kim, Jin-Young;Choi, Seong-Ho
- The Journal of the Acoustical Society of Korea / v.15 no.5 / pp.61-64 / 1996
To improve the performance of a recognition system, namely the recognition rate, we propose a new implementation of perceptual distance using the LPC cepstrum (perceptual cepstrum, PLC). The PLC is calculated by convolving a conventional LPC cepstrum with a perceptual lifter (PL). To calculate the PL, we define a new weighting function in the linear frequency domain that takes the characteristics of the Bark frequency scale into account. The PL is the inverse Fourier transform of the exponents of the weighting function. We verified our method through speech recognition experiments. The performance of the PLC was compared with that of the raised sine liftering method.
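For context on the baseline mentioned in this abstract, the following is a minimal sketch of raised sine liftering in its commonly cited form, w(n) = 1 + (L/2) sin(pi n / L) for n = 1..L. The lifter length and the function names are assumptions for illustration; the abstract does not give the exact configuration used in the comparison, and the proposed perceptual lifter itself is not reproduced here.

```python
import math


def raised_sine_lifter(length):
    """Weights w(1..length) of the raised sine lifter."""
    return [1.0 + (length / 2.0) * math.sin(math.pi * n / length)
            for n in range(1, length + 1)]


def apply_lifter(cepstrum, lifter):
    """Weight cepstral coefficients c(1..L) element by element."""
    return [c * w for c, w in zip(cepstrum, lifter)]
```

The lifter de-emphasizes both the lowest and the highest cepstral indices, which is why it serves as a common reference point for weighted cepstral distance measures.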

Choline-Lipid Release from Normal and Transformed Cells

Hong, Seong-Tshool;Jang, Yong-Suk;Park, Kie-In
- BMB Reports / v.29 no.1 / pp.73-80 / 1996
The effect of albumin on phosphatidylcholine (PC) metabolism in Hep-G2, 3T3-H.ras, and 3T3 cells pre-labelled with [Me-$^3H$]choline was studied. The [$^3H$]choline was more efficiently taken up and incorporated into cellular phospholipids in 3T3-H.ras cells than in Hep-G2 and 3T3 cells. In each of the three cell lines, most of the [$^3H$]choline metabolized into the phospholipids was incorporated into PC and only a minor fraction was incorporated into lysophosphatidylcholine (LPC). Bovine serum albumin stimulated the release of [$^3H$]LPC and [$^3H$]PC from each of the three cell lines pre-labelled with [$^3H$]choline. [$^3H$]PC was also released in the absence of albumin but [$^3H$]LPC was not. The efficiency of LPC secretion, expressed as the proportion of medium [$^3H$]LPC to cellular [$^3H$]choline lipid during a chase period, is approximately 9 to 14 times greater in 3T3 cells than in the transformed 3T3-H.ras and Hep-G2 cells. A similar comparison of published data for rat hepatocytes with Hep-G2 shows secretion to be 35~75 times greater from the rat hepatocytes than from Hep-G2. Also, PC secretion from 3T3 cells was 1.6 times more effective than from 3T3-H.ras, whereas rat hepatocytes secrete PC 2.8~3.8 times more effectively than Hep-G2 does. The specific radioactivity of cellular PC in pre-labelled 3T3 cells was found to be similar to that of the secreted PC. However, the specific radioactivity of secreted LPC was markedly lower than that of the cellular PC, which suggests that LPC is being secreted from a PC pool distinct from that used for PC secretion.

Quantization of LPC Coefficients Using a Multi-frame AR-model (Multi-frame AR model을 이용한 LPC 계수 양자화)

Jung, Won-Jin;Kim, Moo-Young
- The Journal of the Acoustical Society of Korea / v.31 no.2 / pp.93-99 / 2012
For speech coding, the vocal tract is modeled using Linear Predictive Coding (LPC) coefficients. The LPC coefficients are typically transformed to Line Spectral Frequency (LSF) parameters, which are advantageous for linear interpolation and quantization. If multidimensional LSF data are quantized directly using Vector Quantization (VQ), high rate-distortion performance can be obtained by fully utilizing intra-frame correlation. In practice, this direct VQ system cannot be used because of its high computational complexity and memory requirement, so Split VQ (SVQ) is used, in which a multidimensional vector is split into multiple sub-vectors for quantization. The LSF parameters also have high inter-frame correlation, and thus Predictive SVQ (PSVQ) is utilized. PSVQ provides better rate-distortion performance than SVQ. In this paper, to implement the optimal predictors in PSVQ for voice storage devices, we propose Multi-Frame AR-model based SVQ (MF-AR-SVQ), which considers the inter-frame correlations with multiple previous frames. Compared with conventional PSVQ, the proposed MF-AR-SVQ provides a 1-bit gain in terms of spectral distortion without a significant increase in complexity or memory requirement.
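The split VQ idea described in this abstract can be sketched as follows. This is only an illustration of plain SVQ with nearest-neighbour search, not the proposed MF-AR-SVQ; the function names and the toy codebooks in the usage note are hypothetical.

```python
def nearest(codebook, sub):
    """Index of the codeword with the smallest squared error to sub."""
    return min(range(len(codebook)),
               key=lambda i: sum((c - s) ** 2
                                 for c, s in zip(codebook[i], sub)))


def split_vq_encode(vec, codebooks):
    """Quantize each sub-vector of vec against its own codebook.

    The split sizes are taken from the codeword dimension of each codebook.
    """
    indices, start = [], 0
    for cb in codebooks:
        dim = len(cb[0])
        indices.append(nearest(cb, vec[start:start + dim]))
        start += dim
    return indices


def split_vq_decode(indices, codebooks):
    """Concatenate the selected codewords back into one vector."""
    out = []
    for idx, cb in zip(indices, codebooks):
        out.extend(cb[idx])
    return out
```

With two toy 2-dimensional codebooks, `split_vq_encode([0.9, 0.2, 0.4, 0.6], [[(0.0, 1.0), (1.0, 0.0)], [(0.5, 0.5), (0.0, 0.0)]])` searches two small codebooks instead of one joint 4-dimensional codebook, which is the complexity and memory saving the abstract refers to. A predictive variant would quantize the residual left after predicting the current frame from previous ones.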
https://doi.org/10.7776/ASK.2012.31.2.093

Word Recognition using Fuzzy Inference based on LPC (선형예측계수에 기초한 퍼지추론 단어 인식)

Choi, Seung-Ho;Kim, Hyeong-Geun
- The Journal of the Acoustical Society of Korea / v.13 no.1 / pp.32-41 / 1994
To handle the frequency variation of speech patterns that consist of LPC sequences, a new membership function is proposed, derived from the LPC, the spectrum, and the relation between the LPC order and the spectrum. To handle the time variation, a multi-section equi-segmentation method, which divides the speech section equally into several sections, is applied. False recognition mainly occurs when the same syllable is placed in the same utterance. To reduce this error, fuzzy inference is executed using the proposed membership function, weights are assigned to the sectional certainty, and a decision method is then applied to the recognized sections up to the third candidate. To validate this method, we ran a recognition test on 28 DDD area names. The recognition rate of fuzzy inference with the triangular membership function is 92%. That of the combined fuzzy inference and decision method is 92.9%, and that of fuzzy inference with the proposed membership function is 93.8%.

Voice Personality Transformation Using a Probabilistic Method (확률적 방법을 이용한 음성 개성 변환)

Lee, Ki-Seung
- The Journal of the Acoustical Society of Korea / v.24 no.3 / pp.150-159 / 2005
This paper addresses a voice personality transformation algorithm that makes one person's voice sound like another person's. In the proposed method, a person's voice is represented by the LPC cepstrum, pitch period, and speaking rate, and appropriate transformation rules are constructed for each parameter. A Gaussian Mixture Model (GMM) is used to model one speaker's LPC cepstra, and a conditional probability is used to model the relationship between the two speakers' LPC cepstra. To obtain the parameters of each probabilistic model, a Maximum Likelihood (ML) estimation method is employed. The transformed LPC cepstra are obtained using a Minimum Mean Square Error (MMSE) criterion. Pitch period and speaking rate are used as the parameters for prosody transformation, which is implemented using the ratio of the average values. The proposed method outperforms the previous VQ-based method in objective measures, including the average cepstrum distance reduction ratio and the likelihood increasing ratio. In the subjective test, we obtained almost the same correct identification ratio as the previous method, and we also confirmed that high-quality transformed speech is obtained, owing to spectral contours that evolve smoothly over time.

Performance Improvement of Double Talk Detection before Convergence of the Echo Canceller by Using Linear Predictive Coding Filter Gain of the Primary Input Signal (주입력신호의 LPC 필터 이득을 이용한 반향제거기의 수렴전 동시통화검출 성능 개선)

Yoo, Jae-Ha
- Journal of the Korean Institute of Intelligent Systems / v.24 no.6 / pp.628-633 / 2014
This paper proposes a method for improving the performance of the conventional double talk detection method, which can operate before convergence of the echo canceller. The proposed method estimates the coefficients of the linear predictive coding (LPC) filter from the primary input signal. The time-varying threshold for double talk detection is determined based on the LPC filter gain of the primary input signal level. The proposed method can reduce not only the false detection rate, that is, the rate at which single talk is wrongly detected as double talk, but also the double talk detection delay. Computer simulations were performed using long-term real speech signals. The results show that the proposed method improves on the conventional method by lowering the false detection rate and shortening the detection delay.
https://doi.org/10.5391/JKIIS.2014.24.6.628

Development of Process Technology for Low Pressure Vacuum Carburizing (저압식 진공 침탄(LPC) 열처리 공정 기술 개발)

Dong, Sang-Keun;Yang, Jae-Bok
- Proceedings of the Korean Society of Combustion Conference (한국연소학회:학술대회논문집) / 2004.11a / pp.231-237 / 2004
Vacuum carburizing continues to gain acceptance as an alternative to atmosphere carburizing, particularly in the car industry. The advantages of low-pressure carburization over atmospheric gas carburization are not only a surface entirely free of oxide and the environmentally friendly nature of the method, but also improved deformation behaviour achieved by combining carburization with gas quenching, shorter batch times through a higher carburization temperature, low gas and energy consumption, and the prevention of soot to a large extent. In the present study, an improved vacuum carburizing method is provided that is effective in depositing carbon in the surface of materials and in reducing cycle time. An LPC process simulator was also built to optimize process control parameters such as the pulse/pause cycles of the pressure pattern, temperature, carburizing time, and diffusion time. The carburizing process was simulated by a diffusion calculation program whose model parameters were derived with the help of the experimental results, which allows control of the carburizing process in good agreement with practical results. It can thus be concluded that the LPC process control method based on theoretical simulation and experimental data appears to provide a reasonable tool for a prototype LPC system.

The Assessment on the Sound Quality of Reduced Frequency Selectivity of Hearing Impaired People (난청인의 주파수 선택도 둔화현상이 음질에 미치는 영향 평가)

An, Hong-Sub;Park, Gyu-Seok;Jeon, Yu-Yong;Song, Young-Rok;Lee, Sang-Min
- The Transactions of The Korean Institute of Electrical Engineers / v.60 no.6 / pp.1196-1203 / 2011
Reduced frequency selectivity is a typical phenomenon of sensorineural hearing loss. In this paper, we compared two methods for modeling the reduced frequency selectivity of hearing impaired people. The two models were built using an LPC (linear prediction coding) algorithm and a bandwidth control algorithm based on the ERB (equivalent rectangular bandwidth) of the auditory filter, respectively. To compare the effectiveness of the two models, we compared PESQ (perceptual evaluation of speech quality) and LLR (log likelihood ratio) results using 36 two-syllable Korean words. To verify the effect under noisy conditions, we mixed white and babble noise with the speech words at 0 dB and -3 dB SNR. As a result, the PESQ score of the bandwidth control algorithm was higher than that of the LPC algorithm, whereas the LLR score of the LPC algorithm was lower than that of the bandwidth control algorithm. This means that both the non-linearity and the widened auditory filter characteristics caused by reduced frequency selectivity could be better reflected in the bandwidth control algorithm than in the LPC algorithm.
https://doi.org/10.5370/KIEE.2011.60.6.1196

Enhanced Ex Vivo Buccal Transport of Propranolol: Evaluation of Phospholipids as Permeation Enhancers

Lee, Jae-Hwi;Choi, Young-Wook
- Archives of Pharmacal Research / v.26 no.5 / pp.421-425 / 2003
The aim of the present study was to evaluate the effects of two phospholipid permeation enhancers, lysophosphatidylcholine (LPC) and didecanoylphosphatidylcholine (DDPC), along with a fusidic acid derivative, sodium taurodihydrofusidate (STDHF), and ethanol (EtOH) on the buccal transport of propranolol hydrochloride (PPL) using an ex vivo buccal diffusion model. The permeation rate of [$^3$H]PPL, as measured by steady-state fluxes, increased with increasing EtOH concentration. A significant flux enhancement (P < 0.05) was achieved by EtOH at 20 and 30 %v/v concentrations. At a 0.5 %w/v permeation enhancer concentration, the buccal permeation of [$^3$H]PPL was significantly enhanced by all the enhancers studied (i.e., LPC, DDPC and STDHF) compared to the control (phosphate-buffered saline pH 7.4, PBS). LPC and DDPC displayed a greater degree of permeation enhancement than STDHF and EtOH-PBS mixtures, with enhancement ratios of 3.2 and 2.9 for LPC and DDPC, respectively, compared with 2.0 and 1.5 for STDHF and the EtOH:PBS 30:70 %v/v mixture, respectively. There was no significant difference between LPC and DDPC in the flux values and apparent permeability coefficients of [$^3$H]PPL. These results suggest that phospholipids are suitable as permeation enhancers for the buccal delivery of drugs.

Search Result 384, Processing Time 0.141 seconds
