Search | Korea Science

New Data Extraction Method using the Difference in Speaker Recognition (화자인식에서 차분을 이용한 새로운 데이터 추출 방법)

Seo, Chang-Woo;Ko, Hee-Ae;Lim, Yong-Hwan;Choi, Min-Jung;Lee, Youn-Jeong
- Speech Sciences
- /
- v.15 no.3
- /
- pp.7-15
- /
- 2008
This paper proposes the method to extract new feature vectors using the difference between the cepstrum for static characteristics and delta cepstrum for dynamic characteristics in speaker recognition (SR). The difference vector (DV) which it proposes from this paper is containing the static and the dynamic characteristics simultaneously at the intermediate characteristic vector which uses the deference between the static and the dynamic characteristics and as the characteristic vector which is new there is a possibility of doing. Compared to the conventional method, the proposed method can achieve new feature vector without increasing of new parameter, but only need the calculation process for the difference between the cepstrum and delta cepstrum. Experimental results show that the proposed method has a good performance more than 2.03%, on average, compared with conventional method in speaker identification (SI).
PDF

Modality-Based Sentence-Final Intonation Prediction for Korean Conversational-Style Text-to-Speech Systems

Oh, Seung-Shin;Kim, Sang-Hun
- ETRI Journal
- /
- v.28 no.6
- /
- pp.807-810
- /
- 2006
This letter presents a prediction model for sentence-final intonations for Korean conversational-style text-to-speech systems in which we introduce the linguistic feature of 'modality' as a new parameter. Based on their function and meaning, we classify tonal forms in speech data into tone types meaningful for speech synthesis and use the result of this classification to build our prediction model using a tree structured classification algorithm. In order to show that modality is more effective for the prediction model than features such as sentence type or speech act, an experiment is performed on a test set of 970 utterances with a training set of 3,883 utterances. The results show that modality makes a higher contribution to the determination of sentence-final intonation than sentence type or speech act, and that prediction accuracy improves up to 25% when the feature of modality is introduced.
PDF

Lightweight CNN-based Expression Recognition on Humanoid Robot

Zhao, Guangzhe;Yang, Hanting;Tao, Yong;Zhang, Lei;Zhao, Chunxiao
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.14 no.3
- /
- pp.1188-1203
- /
- 2020
The human expression contains a lot of information that can be used to detect complex conditions such as pain and fatigue. After deep learning became the mainstream method, the traditional feature extraction method no longer has advantages. However, in order to achieve higher accuracy, researchers continue to stack the number of layers of the neural network, which makes the real-time performance of the model weak. Therefore, this paper proposed an expression recognition framework based on densely concatenated convolutional neural networks to balance accuracy and latency and apply it to humanoid robots. The techniques of feature reuse and parameter compression in the framework improved the learning ability of the model and greatly reduced the parameters. Experiments showed that the proposed model can reduce tens of times the parameters at the expense of little accuracy.
https://doi.org/10.3837/tiis.2020.03.015 인용 PDF KSCI HTML

Harmonic Peak Picking-based MVF Estimation for Improvement of HMM-based Speech Synthesis System Using TBE Model (TBE 모델을 사용하는 HMM 기반 음성합성기 성능 향상을 위한 하모닉 선택에 기반한 MVF 예측 방법)

Park, Jihoon;Hahn, Minsoo
- Phonetics and Speech Sciences
- /
- v.4 no.4
- /
- pp.79-86
- /
- 2012
In the two-band excitation (TBE) model, maximum voiced frequency (MVF) is the most important feature of the excitation parameter because the synthetic speech quality depends on MVF. Thus, this paper proposes an enhanced MVF estimation scheme based on the peak picking method. In the proposed scheme, the local peak and the peak lobe are picked from the spectrum of a linear predictive residual signal. The normalized distance between neighboring peak lobes is calculated and utilized as a feature to estimate MVF. Experimental results of both objective and subjective tests show that the proposed scheme improves synthetic speech quality compared with that of the conventional one.
https://doi.org/10.13064/KSSS.2012.4.4.079 인용 PDF

A Study on the Diagnosis of Cutting Tool States Using Cutting Conditions and Cutting Force Parameters(l) - Signal Processing and Feature Extraction - (절삭조건과 절삭력 파라메타를 이용한 공구상태 진단에 관한 연구(I) - 신호처리 및 특징추출 -)

Cheong, C.Y.;Yu, K.H.;Suh, N.S.
- Journal of the Korean Society for Precision Engineering
- /
- v.14 no.10
- /
- pp.135-140
- /
- 1997
The detection of cutting tool states in machining is important for the automation. The information of cutting tool states in metal cutting process is uncertain. Hence a industry needs the system which can detect the cutting tool states in real time and control the feed motion. Cutting signal features must be sifted before the classification. In this paper the Fisher's linear discriminant function was applied to the pattern recognition of the cutting tool states successfully. Cutting conditions and cutting force para- meters have shown to be sensitive to tool states, so these cutting conditions and cutting force paramenters can be used as features for tool state detection.
PDF

Parameters Comparison in the speaker Identification under the Noisy Environments (화자식별을 위한 파라미터의 잡음환경에서의 성능비교)

Choi, Hong-Sub
- Speech Sciences
- /
- v.7 no.3
- /
- pp.185-195
- /
- 2000
This paper seeks to compare the feature parameters used in speaker identification systems under noisy environments. The feature parameters compared are LP cepstrum (LPCC), Cepstral mean subtraction(CMS), Pole-filtered CMS(PFCMS), Adaptive component weighted cepstrum(ACW) and Postfilter cepstrum(PF). The GMM-based text independent speaker identification system is designed for this target. Some series of experiments show that the LPCC parameter is adequate for modelling the speaker in the matched environments between train and test stages. But in the mismatched training and testing conditions, modified parameters are preferable the LPCC. Especially CMS and PFCMS parameters are more effective for the microphone mismatching conditions while the ACW and PF parameters are good for more noisy mismatches.
PDF

Enhanced Maximum Voiced Frequency Estimation Scheme for HTS Using Two-Band Excitation Model

Park, Jihoon;Hahn, Minsoo
- ETRI Journal
- /
- v.37 no.6
- /
- pp.1211-1219
- /
- 2015
In a hidden Markov model-based speech synthesis system using a two-band excitation model, a maximum voiced frequency (MVF) is the most important feature as an excitation parameter because the synthetic speech quality depends on the MVF. This paper proposes an enhanced MVF estimation scheme based on a peak picking method. In the proposed scheme, both local peaks and peak lobes are picked from the spectrum of a linear predictive residual signal. The average of the normalized distances of local peaks and peak lobes is calculated and utilized as a feature to estimate an MVF. Experimental results of both objective and subjective tests show that the proposed scheme improves the synthetic speech quality compared with that of a conventional one in a mobile device as well as a PC environment.
https://doi.org/10.4218/etrij.15.0115.0124 인용 PDF KSCI

A Scheme Tracking a Moving Object for Biped Robot (이족로봇을 이용한 이동물체 추적 기법)

Park, Sang-Bum;Lee, Boo-Hyung;Han, Young-Joon;Hahn, Hern-Soo
- Proceedings of the IEEK Conference
- /
- 2006.06a
- /
- pp.839-840
- /
- 2006
Our paper proposes a novel moving object tracking scheme for biped robot using a single camera. For walking control of a biped robot we analyze the dynamics of a three-dimensional inverted pendulum model. This analysis leads us a simple linear dynamics. And, the control parameter of the biped robot is derived from the feedback signal which converges the position of a image feature to the feature position of a desired image and the feedforward signal which compensates the motion component due to the moving object.
PDF

Combining genetic algorithms and support vector machines for bankruptcy prediction

Min, Sung-Hwan;Lee, Ju-Min;Han, In-Goo
- Proceedings of the Korea Inteligent Information System Society Conference
- /
- 2004.11a
- /
- pp.179-188
- /
- 2004
Bankruptcy prediction is an important and widely studied topic since it can have significant impact on bank lending decisions and profitability. Recently, support vector machine (SVM) has been applied to the problem of bankruptcy prediction. The SVM-based method has been compared with other methods such as neural network, logistic regression and has shown good results. Genetic algorithm (GA) has been increasingly applied in conjunction with other AI techniques such as neural network, CBR. However, few studies have dealt with integration of GA and SVM, though there is a great potential for useful applications in this area. This study proposes the methods for improving SVM performance in two aspects: feature subset selection and parameter optimization. GA is used to optimize both feature subset and parameters of SVM simultaneously for bankruptcy prediction.
PDF

Discrimination of Cancer Cell by Fuzzy Logic in Medical Images

Na Cheol-Hun
- Journal of information and communication convergence engineering
- /
- v.4 no.1
- /
- pp.36-40
- /
- 2006
A new method of digital image analysis technique for medical images of cancer cell is presented. This paper deals with the cancer cell discrimination. The object images were the Thyroid Gland cell images that were diagnosed as normal and abnormal. This paper proposes a new discrimination method based on fuzzy logic algorithm. The focus of this paper is an automatic discrimination of cells into normal and abnormal of medical images by dominant feature parameters method with fuzzy algorithm. As a consequence of using fuzzy logic algorithm, the nucleus were successfully diagnosed as normal and abnormal. As for the experimental result, average recognition rate of 64.66% was obtained by applying single parameter of 16 feature parameters at a time. The discrimination rate of 93.08% was obtained by proposed method.
PDF KSCI

Search Result 528, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)