통합 검색 | Korea Science

Feature Extraction Based on DBN-SVM for Tone Recognition

Chao, Hao;Song, Cheng;Lu, Bao-Yun;Liu, Yong-Li
- Journal of Information Processing Systems
- /
- 제15권1호
- /
- pp.91-99
- /
- 2019
An innovative tone modeling framework based on deep neural networks in tone recognition was proposed in this paper. In the framework, both the prosodic features and the articulatory features were firstly extracted as the raw input data. Then, a 5-layer-deep deep belief network was presented to obtain high-level tone features. Finally, support vector machine was trained to recognize tones. The 863-data corpus had been applied in experiments, and the results show that the proposed method helped improve the recognition accuracy significantly for all tone patterns. Meanwhile, the average tone recognition rate reached 83.03%, which is 8.61% higher than that of the original method.
https://doi.org/10.3745/JIPS.04.0101 인용 PDF KSCI HTML

음악감상활동이 유아의 음정감과 리듬감 발달에 미치는 영향 (The Effect of Listening to Music for the Children's Development of Tone Recognition & Sense of Rhythm)

엄정애;김경남
- 대한가정학회지
- /
- 제41권10호
- /
- pp.75-84
- /
- 2003
The purpose of this study was to examine the effect of listening to music during musical activities on children's development of tone recognition and sense of rhythm. The subjects were total sixty 4-years-olds from two classes of thirty. The children were divided into two groups, experimental and control. Before the experimental procedures, a pre-test was taken to evaluate the level of tone recognition and sense of rhythm of the children. Cordon's 'Audie' was employed and used to measure the difference of tone recognition and sense of rhythm. Then, the activity of listening to music was applied to the experimental group for ten weeks. For the experimental group, the musical activity was selected based on the themes of our tfe which was related to the weekly and yearly teaching plan. One the other hand, no musical activity was provided for the control group. After the experiment, a post-test was carried out using the same methodology of pre-test. Data were analysed by ANCOVA test. Results showed that there was a statistically significant difference in the development of tone recognition and sense of rhythm between the experimental group and the control group.
PDF KSCI

IoT를 위한 음성신호 기반의 톤, 템포 특징벡터를 이용한 감정인식 (Emotion Recognition Using Tone and Tempo Based on Voice for IoT)

변성우;이석필
- 전기학회논문지
- /
- 제65권1호
- /
- pp.116-121
- /
- 2016
In Internet of things (IoT) area, researches on recognizing human emotion are increasing recently. Generally, multi-modal features like facial images, bio-signals and voice signals are used for the emotion recognition. Among the multi-modal features, voice signals are the most convenient for acquisition. This paper proposes an emotion recognition method using tone and tempo based on voice. For this, we make voice databases from broadcasting media contents. Emotion recognition tests are carried out by extracted tone and tempo features from the voice databases. The result shows noticeable improvement of accuracy in comparison to conventional methods using only pitch.
https://doi.org/10.5370/KIEE.2016.65.1.116 인용 PDF KSCI

Skin-tone과 특징형태를 적용한 효율적인 얼굴영역 자동검출 기법의 구현 (Efficient and Automatic Face Detection Using Skin-tone and Shape)

김광희;김성환;최옥매;이배호
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 1999년도 하계종합학술대회 논문집
- /
- pp.575-578
- /
- 1999
The principal features of a face are as follows : skin-tone, symmetry, and requisites such as shape of ellipse, eyes, nose, mouth. Also, faces have different size, various shape and position. In case of application of face recognition and detection without preprocessing, efficiency of the performance is decreased. In addition, face itself, complex background, image quality, etc. are included. Therefore, previous face recognition methods are implemented on the base of specific constraints of the face image. In this paper, we propose the efficient and automatic face detection algorithm for minimizing influence such as complex background, image quality, etc. This face detection technique consists of skin-tone, candidate face region and face region extractions.
PDF

음성으로부터 감성인식 요소분석 (Analyzing the element of emotion recognition from speech)

심귀보;박창현
- 한국지능시스템학회논문지
- /
- 제11권6호
- /
- pp.510-515
- /
- 2001
일반적으로 음성신호로부터 사람의 감정을 인식할 수 있는 요소는(1)대화의 내용에 사용한 단어, (2)톤 (tore), (3)음성신호의 피치(Pitch), (4)포만트 주파수(Formant Frequencey)그리고 (5)말의 빠르기(Speech Speed)(6)음질(Voice Quality)등이다. 사람의 경우는주파수 같은 분석요소 보다 톤과 단어 빠르기, 음질로 감정을 받아들이게 되는것이 자연스러운 방법이므로 당연히 후자의 요소들이 감정을 분류하는데 중요한 인자로쓰일 수있다. 그리고, 종래는 주로 후자의 효소들을 이용하였는데, 기계로써 구현하기 위해서는 포만트 주파수를 사용할 수있게 되는것이 도움이 된다. 그러므로, 본 연구는 음성 신호로부터 피치와 포만트, 그리고 말의 빠르기 등을 이용하여 감성인식시스템을 구현하는것을 목표로 연구를 진행하고 있으며, 그 1단계 연구로서 본 논문에서는 화가 나서 내뱉는 말을 기반으로 하여 화난 감정의 독특한 특성을 찾아내었다.
PDF

LDA와 tri-tone 모델을 이용한 운율경계강도 예측 (Prosodic Break Index Estimation using LDA and Tri-tone Model)

강평수;엄기완;김진영
- 한국음향학회지
- /
- 제18권7호
- /
- pp.17-22
- /
- 1999
본 논문에서는 발화된 문장으로부터 운율 경계 강도를 효과적으로 예측하기 위해 LDA와 tri-tone 모델을 혼합한 방법을 제안하였다. 이 방법은 기존의 LDA 방법을 사용하여 음절과 휴지기의 길이 정보를 운율경계강도 예측에 적용하고 피치정보를 벡터양자화에 적용하여 tri-tone이란 개념을 도입한 혼합형 모형이다. 제안된 방법은 주어진 200문장의 운율경계 강도를 예측하는 실험에서 72%의 정확성을 나타내었다.
PDF

감마톤 특징 추출 음향 모델을 이용한 음성 인식 성능 향상 (Speech Recognition Performance Improvement using Gamma-tone Feature Extraction Acoustic Model)

안찬식;최기호
- 디지털융복합연구
- /
- 제11권7호
- /
- pp.209-214
- /
- 2013
음성 인식 시스템에서는 인식 성능 향상을 위한 방법으로 인간의 청취 능력을 인식 시스템에 접목하였으며 잡음 환경에서 음성 신호와 잡음을 분리하여 원하는 음성 신호만을 선택할 수 있도록 구성되었다. 하지만 실용적 측면에서 음성 인식 시스템의 성능 저하 요인으로 인식 환경 변화에 따른 잡음으로 인한 음성 검출이 정확하지 못하여 일어나는 것과 학습 모델이 일치하지 않는 것을 들 수 있다. 따라서 본 논문에서는 음성 인식 향상을 위해 감마톤을 이용하여 특징을 추출하고 음향 모델을 이용한 학습 모델을 제안하였다. 제안한 방법은 청각 장면 분석을 이용한 특징을 추출을 통해 인간의 청각 인지 능력을 반영하였으며 인식을 위한 학습 모델 과정에서 음향 모델을 이용하여 인식 성능을 향상시켰다. 성능 평가를 위해 잡음 환경의 -10dB, -5dB 신호에서 잡음 제거를 수행하여 SNR을 측정한 결과 3.12dB, 2.04dB의 성능이 향상됨을 확인하였다.
https://doi.org/10.14400/JDPM.2013.11.7.209 인용 PDF

Automatic Face Identification System Using Adaptive Face Region Detection and Facial Feature Vector Classification

Kim, Jung-Hoon;Do, Kyeong-Hoon;Lee, Eung-Joo
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2002년도 ITC-CSCC -2
- /
- pp.1252-1255
- /
- 2002
In this paper, face recognition algorithm, by using skin color information of HSI color coordinate collected from face images, elliptical mask, fratures of face including eyes, nose and mouth, and geometrical feature vectors of face and facial angles, is proposed. The proposed algorithm improved face region extraction efficacy by using HSI information relatively similar to human's visual system along with color tone information about skin colors of face, elliptical mask and intensity information. Moreover, it improved face recognition efficacy with using feature information of eyes, nose and mouth, and Θ1(ACRED), Θ2(AMRED) and Θ 3(ANRED), which are geometrical face angles of face. In the proposed algorithm, it enables exact face reading by using color tone information, elliptical mask, brightness information and structural characteristic angle together, not like using only brightness information in existing algorithm. Moreover, it uses structural related value of characteristics and certain vectors together for the recognition method.
PDF

명암과 색상 정보를 이용한 번호판 인식 (Recognition of License Plate with Brightness and Tone of Color Data)

이승수;이기성
- 대한전기학회:학술대회논문집
- /
- 대한전기학회 2003년도 학술회의 논문집 정보 및 제어부문 B
- /
- pp.528-531
- /
- 2003
Recognition of licence plate becomes a key issue to many traffic related application such as road traffic monitoring or parking lots access control. In this paper, the brightness, YIQ and HSI methods were used to locate a license. After the characters in license plate were extracted, template matching method was applied for character recognitions. To test the performance of the proposed algorithm, images of seventy vehicle were tested. The success rates for license plate and character recognition were approximately 99%, and 96%, respectively
PDF

음성인식을 이용한 반려 로봇의 모션제어에 대한 연구 (A Study on Motion Control of the Pet-Robot using Voice-Recognition)

조예진;김현석;배태성;이수행;김진현;김재욱
- 한국전자통신학회논문지
- /
- 제17권6호
- /
- pp.1089-1094
- /
- 2022
본 논문에서는 일상생활에서 사람과 교감하며 돌봄 인력의 공백을 완화할 수 있는 인간 공존형 반려 로봇을 연구하였다. 음성인식 모듈과 서보모터, 아두이노 보드를 기반으로 음성인식을 이용한 로봇팔 제어기능, RC카를 활용한 위치 이동 기능, 음성인식 기능을 활용하여 도트 매트릭스를 이용한 반려 로봇의 감정표현 기능을 탑재한 반려 로봇을 실험, 제작하였다. 실험 결과로 거리에 따른 음성인식 실험 결과 5~30cm의 거리에서 최적의 인식률을 보였고 성별에 따른 음성인식 실험 결과에서는 남성의 인식률이 더 높은 것을 확인할 수 있었으며 성조에 따른 음성인식 실험 결과 첫 번째 성조인 단조로운 톤에서 높은 인식률을 보였다. 이러한 동작 실험에 대한 평가 결과를 통해 반려 로봇을 만들 수 있음을 확인할 수 있었다.
https://doi.org/10.13067/JKIECS.2022.17.6.1089 인용 PDF KSCI

검색결과 73건 처리시간 0.024초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)