Search | Korea Science

Feature Extraction Based on DBN-SVM for Tone Recognition

Chao, Hao;Song, Cheng;Lu, Bao-Yun;Liu, Yong-Li
- Journal of Information Processing Systems / v.15 no.1 / pp.91-99 / 2019
This paper proposes a tone modeling framework based on deep neural networks for tone recognition. In the framework, prosodic features and articulatory features are first extracted as the raw input data. A five-layer deep belief network is then used to obtain high-level tone features. Finally, a support vector machine is trained to recognize tones. Experiments on the 863-data corpus show that the proposed method significantly improves recognition accuracy for all tone patterns. The average tone recognition rate reached 83.03%, which is 8.61% higher than that of the original method.
https://doi.org/10.3745/JIPS.04.0101

The Effect of Listening to Music for the Children's Development of Tone Recognition & Sense of Rhythm (음악감상활동이 유아의 음정감과 리듬감 발달에 미치는 영향)

Ohm Jung-ae;Kim Kyungnam
- Journal of the Korean Home Economics Association / v.41 no.10 s.188 / pp.75-84 / 2003
The purpose of this study was to examine the effect of listening to music during musical activities on children's development of tone recognition and sense of rhythm. The subjects were a total of sixty 4-year-olds from two classes of thirty. The children were divided into two groups, experimental and control. Before the experimental procedures, a pre-test was given to evaluate the children's level of tone recognition and sense of rhythm. Gordon's 'Audie' was used to measure differences in tone recognition and sense of rhythm. The music listening activity was then applied to the experimental group for ten weeks. For the experimental group, the musical activity was selected based on life themes related to the weekly and yearly teaching plans. On the other hand, no musical activity was provided for the control group. After the experiment, a post-test was carried out using the same methodology as the pre-test. Data were analysed by ANCOVA. Results showed a statistically significant difference in the development of tone recognition and sense of rhythm between the experimental group and the control group.

Emotion Recognition Using Tone and Tempo Based on Voice for IoT (IoT를 위한 음성신호 기반의 톤, 템포 특징벡터를 이용한 감정인식)

Byun, Sung-Woo;Lee, Seok-Pil
- The Transactions of The Korean Institute of Electrical Engineers / v.65 no.1 / pp.116-121 / 2016
In the Internet of Things (IoT) area, research on recognizing human emotion has been increasing recently. Generally, multi-modal features such as facial images, bio-signals, and voice signals are used for emotion recognition. Among these multi-modal features, voice signals are the most convenient to acquire. This paper proposes an emotion recognition method using tone and tempo based on voice. For this, we build voice databases from broadcast media content. Emotion recognition tests are carried out using tone and tempo features extracted from the voice databases. The results show a noticeable improvement in accuracy compared with conventional methods that use only pitch.
https://doi.org/10.5370/KIEE.2016.65.1.116

Efficient and Automatic Face Detection Using Skin-tone and Shape (Skin-tone과 특징형태를 적용한 효율적인 얼굴영역 자동검출 기법의 구현)

김광희;김성환;최옥매;이배호
- Proceedings of the IEEK Conference / 1999.06a / pp.575-578 / 1999
The principal features of a face are skin tone, symmetry, and requisites such as an elliptical shape, eyes, nose, and mouth. Faces also vary in size, shape, and position. When face recognition and detection are applied without preprocessing, performance decreases. Further factors include the face itself, complex backgrounds, and image quality. Previous face recognition methods have therefore been implemented under specific constraints on the face image. In this paper, we propose an efficient and automatic face detection algorithm that minimizes the influence of factors such as complex backgrounds and image quality. The technique consists of skin-tone extraction, candidate face region extraction, and face region extraction.

Analyzing the element of emotion recognition from speech (음성으로부터 감성인식 요소분석)

심귀보;박창현
- Journal of the Korean Institute of Intelligent Systems / v.11 no.6 / pp.510-515 / 2001
Generally, the elements for emotion recognition from a speech signal include (1) words used in conversation, (2) tone, (3) pitch, (4) formant frequency, and (5) speech speed. For human beings, tone, voice quality, and speed of words are naturally easier cues than frequency for perceiving another person's feelings. The former are therefore important elements for classifying feelings. Previous methods have mainly used the former, but formants are well suited to machine implementation. Thus, the final goal of this research is to implement an emotion recognition system based on pitch, formant, speech speed, and similar features of the speech signal. In this paper, as a first stage, we found specific features of anger in a man's words when he became angry.

Prosodic Break Index Estimation using LDA and Tri-tone Model (LDA와 tri-tone 모델을 이용한 운율경계강도 예측)

강평수;엄기완;김진영
- The Journal of the Acoustical Society of Korea / v.18 no.7 / pp.17-22 / 1999
In this paper, we propose a new mixed method combining LDA and a tri-tone model to predict Korean prosodic break indices (PBI) for a given utterance. PBI can be used as an important cue to syntactic discontinuity in continuous speech recognition (CSR). The model consists of three steps. In the first step, PBI is predicted from syllable and pause duration information through linear discriminant analysis (LDA). In the second step, syllable tone information is used to estimate PBI: vector quantization (VQ) is used to code the syllable tones, and PBI is estimated by the tri-tone model. In the last step, the two PBI predictors are integrated by a weight factor. The proposed method was tested on 200 literal style spoken sentences. The experimental results showed 72% accuracy.
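The final integration step in this abstract can be illustrated with a minimal Python sketch. It assumes that each predictor outputs one score per break-index class and that the weight factor linearly interpolates the two score lists; the function name, the score format, and the default weight are hypothetical and are not taken from the paper.

```python
def combine_pbi_scores(lda_scores, tritone_scores, w=0.6):
    """Interpolate two per-class score lists with weight w.

    Returns (best_class_index, combined_scores).
    Hypothetical illustration: the paper's actual integration rule
    and weight value are not given in the abstract.
    """
    if len(lda_scores) != len(tritone_scores):
        raise ValueError("score lists must have the same length")
    if not 0.0 <= w <= 1.0:
        raise ValueError("w must lie in [0, 1]")
    # Weighted sum: w favours the LDA predictor, (1 - w) the tri-tone predictor.
    combined = [w * a + (1.0 - w) * b
                for a, b in zip(lda_scores, tritone_scores)]
    # Pick the break-index class with the highest combined score.
    best = max(range(len(combined)), key=combined.__getitem__)
    return best, combined
```

With `w=1.0` the result depends only on the LDA scores and with `w=0.0` only on the tri-tone scores, so the weight can be tuned on held-out data between those extremes.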

Speech Recognition Performance Improvement using Gamma-tone Feature Extraction Acoustic Model (감마톤 특징 추출 음향 모델을 이용한 음성 인식 성능 향상)

Ahn, Chan-Shik;Choi, Ki-Ho
- Journal of Digital Convergence / v.11 no.7 / pp.209-214 / 2013
To improve the recognition performance of speech recognition systems, methods that incorporate human listening ability into the system have been used. In noisy environments, such methods separate the speech signal from the noise and select the desired speech signal. In practice, however, several factors limit the performance of speech recognition systems: as the recognition environment changes due to noise, speech detection becomes inaccurate and the learning model no longer matches. To improve speech recognition, this paper proposes feature extraction using gamma-tone and a learning model using an acoustic model. In the proposed method, feature extraction using auditory scene analysis, which reflects human auditory perception, is applied in the process of learning models for recognition. For performance evaluation in noisy environments, noise was removed from signals at -10 dB and -5 dB, and SNR improvements of 3.12 dB and 2.04 dB were confirmed.
https://doi.org/10.14400/JDPM.2013.11.7.209

Automatic Face Identification System Using Adaptive Face Region Detection and Facial Feature Vector Classification

Kim, Jung-Hoon;Do, Kyeong-Hoon;Lee, Eung-Joo
- Proceedings of the IEEK Conference / 2002.07b / pp.1252-1255 / 2002
In this paper, a face recognition algorithm is proposed that uses skin color information in the HSI color coordinates collected from face images, an elliptical mask, facial features including the eyes, nose, and mouth, and geometrical feature vectors of the face and facial angles. The proposed algorithm improves face region extraction efficacy by using HSI information, which is relatively similar to the human visual system, along with color tone information about facial skin colors, the elliptical mask, and intensity information. Moreover, it improves face recognition efficacy by using feature information of the eyes, nose, and mouth, and Θ1 (ACRED), Θ2 (AMRED), and Θ3 (ANRED), which are geometrical angles of the face. The proposed algorithm enables exact face reading by using color tone information, the elliptical mask, brightness information, and structural characteristic angles together, unlike existing algorithms that use only brightness information. Moreover, it uses structural relation values of characteristics together with certain vectors for recognition.

Recognition of License Plate with Brightness and Tone of Color Data (명암과 색상 정보를 이용한 번호판 인식)

Lee, Seung-Su;Lee, Kee-Seong
- Proceedings of the KIEE Conference / 2003.11c / pp.528-531 / 2003
License plate recognition has become a key issue in many traffic-related applications such as road traffic monitoring and parking lot access control. In this paper, the brightness, YIQ, and HSI methods were used to locate a license plate. After the characters in the license plate were extracted, a template matching method was applied for character recognition. To test the performance of the proposed algorithm, images of seventy vehicles were tested. The success rates for license plate and character recognition were approximately 99% and 96%, respectively.
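The template matching step mentioned in this abstract can be sketched in a few lines of Python. This is a toy illustration under stated assumptions: characters are small binary bitmaps given as lists of strings, and similarity is the fraction of agreeing pixels. The function names, the 3x3 templates, and the similarity measure are hypothetical; the abstract does not say which measure the authors used.

```python
def match_score(glyph, template):
    """Fraction of pixels that agree between two equal-sized binary bitmaps."""
    if len(glyph) != len(template) or any(
        len(g) != len(t) for g, t in zip(glyph, template)
    ):
        raise ValueError("bitmaps must have identical dimensions")
    total = sum(len(row) for row in glyph)
    if total == 0:
        raise ValueError("bitmaps must not be empty")
    # Count positions where the glyph pixel equals the template pixel.
    same = sum(
        g == t
        for grow, trow in zip(glyph, template)
        for g, t in zip(grow, trow)
    )
    return same / total


def recognize(glyph, templates):
    """Return the label of the template with the highest match score."""
    return max(templates, key=lambda label: match_score(glyph, templates[label]))


# Hypothetical 3x3 templates for two characters.
TEMPLATES = {
    "1": ["010", "010", "010"],
    "7": ["111", "001", "001"],
}

# A "1" with one noisy pixel in the bottom-right corner.
noisy_glyph = ["010", "010", "011"]
label = recognize(noisy_glyph, TEMPLATES)
```

A real system would first normalize each extracted character to the template size, and might use a correlation-based score instead of simple pixel agreement.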

A Study on Motion Control of the Pet-Robot using Voice-Recognition (음성인식을 이용한 반려 로봇의 모션제어에 대한 연구)

Ye-Jin, Cho;Hyun-Seok, Kim;Tae-Sung, Bae;Su-Haeng, Lee;Jin-Hyean, Kim;Jae-Wook, Kim
- The Journal of the Korea institute of electronic communication sciences / v.17 no.6 / pp.1089-1094 / 2022
In this paper, a human coexistence-type companion robot that can communicate with people in daily life and alleviate the gap in care personnel was studied. Based on a voice recognition module, servo motors, and an Arduino board, a prototype companion robot was built and tested with a robot arm control function using voice recognition, a position movement function using an RC car, and a voice recognition function. In the experiments, speech recognition by distance showed the optimal recognition rate at a distance of 5 to 30 cm, and speech recognition by gender showed a higher recognition rate in the first tone, monotonous tone. The results of these motion experiments confirmed that a companion robot could be made.
https://doi.org/10.13067/JKIECS.2022.17.6.1089

Search Result 73, Processing Time 0.02 seconds
