Search | Korea Science

A study on the algorithm for speech recognition (음성인식을 위한 알고리즘에 관한 연구)

Kim, Sun-Chul;Lee, Jung-Woo;Cho, Kyu-Ok;Park, Jae-Gyun;Oh, Yong Taek
- Proceedings of the KIEE Conference
- /
- 2008.07a
- /
- pp.2255-2256
- /
- 2008
음성인식 시스템을 설계함에 있어서는 대표적으로 사람의 성도 특성을 모방한 LPC(Linear Predict Cording)방식과 청각 특성을 고려한 MFCC(Mel-Frequency Cepstral Coefficients)방식이 있다. 본 논문에서는 MFCC를 통해 특징파라미터를 추출하고 해당 영역에서의 수행된 작업을 매틀랩 알고리즘을 이용하여 그래프로 시현하였다. MFCC 방식의 추출과정은 최초의 음성신호로부터 전처리과정을 통해 아날로그 신호를 디지털 신호로 변환하고, 잡음부분을 최소화하며, 음성 부분을 강조한다. 이 신호는 다시 Windowing을 통해 음성의 불연속을 제거해 주고, FFT를 통해 시간의 영역을 주파수의 영역으로 변환한다. 이 변환된 신호는 Filter Bank를 거쳐 다수의 복잡한 신호를 몇 개의 간단한 신호로 간소화 할 수 있으며, 마지막으로 Mel-cepstrum을 통해 최종적으로 특징 파라미터를 얻고자 하였다.
PDF

Feature Selection of Training set for Supervised Classification of Satellite Imagery (위성영상의 감독분류를 위한 훈련집합의 특징 선택에 관한 연구)

곽장호;이황재;이준환
- Korean Journal of Remote Sensing
- /
- v.15 no.1
- /
- pp.39-50
- /
- 1999
It is complicate and time-consuming process to classify a multi-band satellite imagery according to the application. In addition, classification rate sensitively depends on the selection of training data set and features in a supervised classification process. This paper introduced a classification network adopting a fuzzy-based $\gamma$-model in order to select a training data set and to extract feature which highly contribute to an actual classification. The features used in the classification were gray-level histogram, textures, and NDVI(Normalized Difference Vegetation Index) of target imagery. Moreover, in order to minimize the errors in the classification network, the Gradient Descent method was used in the training process for the $\gamma$-parameters at each code used. The trained parameters made it possible to know the connectivity of each node and to delete the void features from all the possible input features.
https://doi.org/10.7780/kjrs.1999.15.1.39 인용 PDF

Generation of Korean Intonation using Vector Quantization (벡터 양자화를 이용한 한국어 억양 곡선 생성)

An, Hye-Sun;Kim, Hyung-Soon
- Annual Conference on Human and Language Technology
- /
- 2001.10d
- /
- pp.209-212
- /
- 2001
본 논문에서는 text-to-speech 시스템에서 사용할 억양 모델을 위해 벡터 양자화(vector quantization) 방식을 이용한다. 어절 경계강도(break index)는 세단계로 분류하였고, CART(Classification And Regression Tree)를 사용하여 어절 경계강도의 예측 규칙을 생성하였다. 예측된 어절 경계강도를 바탕으로 운율구를 예측하였으며 운율구는 다섯 개의 억양 패턴으로 분류하였다. 하나의 운율구는 정점(peak)의 시간축, 주파수축 값과 이를 기준으로 한 앞, 뒤 기울기를 추출하여 네 개의 파라미터로 단순화하였다. 운율구에 대해서 먼저 운율구가 문장의 끝일 경우와 아닐 경우로 분류하고, 억양 패턴 다섯 개로 분류하여. 모두 10개의 운율구 set으로 나누었다. 그리고 네 개의 파라미터를 가지고 있는 운율구의 억양 패턴을 벡터 양자화 방식을 이용하여 분류(clusteing)하였다 운율의 변화가 두드러지는 조사와 어미는 12 point의 기본주파수 값을 추출하고 벡터 양자화하였다. 운율구와 조사 어미의 codebook index는 문장에 대한 특징 변수 값을 추출하고 CART를 사용하여 예측하였다. 합성할 때에는 입력 tort에 대해서 운율구의 억양 파라미터를 추정한 다음, 조사와 어미의 12 point 기본주파수 값을 추정하여 전체 억양 곡선을 생성하였고 본 연구실에서 제작한 음성합성기를 통해 합성하였다.
PDF

A Variable Parameter Model based on SSMS for an On-line Speech and Character Combined Recognition System (음성 문자 공용인식기를 위한 SSMS 기반 가변 파라미터 모델)

석수영;정호열;정현열
- The Journal of the Acoustical Society of Korea
- /
- v.22 no.7
- /
- pp.528-538
- /
- 2003
A SCCRS (Speech and Character Combined Recognition System) is developed for working on mobile devices such as PDA (Personal Digital Assistants). In SCCRS, the feature extraction is separately carried out for speech and for hand-written character, but the recognition is performed in a common engine. The recognition engine employs essentially CHMM (Continuous Hidden Markov Model), which consists of variable parameter topology in order to minimize the number of model parameters and to reduce recognition time. For generating contort independent variable parameter model, we propose the SSMS(Successive State and Mixture Splitting), which gives appropriate numbers of mixture and of states through splitting in mixture domain and in time domain. The recognition results show that the proposed SSMS method can reduce the total number of GOPDD (Gaussian Output Probability Density Distribution) up to 40.0% compared to the conventional method with fixed parameter model, at the same recognition performance in speech recognition system.
PDF KSCI

Fuzzy Scheme for Extracting Linear Features (선형적 특징을 추출하기 위한 퍼지 후프 방법)

주문원;최영미
- Journal of Korea Multimedia Society
- /
- v.2 no.2
- /
- pp.129-136
- /
- 1999
A linear feature often provide sufficient information for image understanding and coding. An objective of the research reported in this paper is to develop and analyze the reliable methods of extracting lines in gray scale images. The Hough Transform is known as one of the optimal paradigms to detect or identify the linear features by transforming edges in images into peaks in parameter space. The scheme proposed here uses the fuzzy gradient direction model and weights the gradient magnitudes for deciding the voting values to be accumulated in parameter space. This leads to significant computational savings by restricting the transform to within some support region of the observed gradient direction which can be considered as a fuzzy variable and produces robust results.
PDF

A study on the Caricature Generation using Face Features (얼굴의 특징을 이용한 캐리커쳐 생성에 관한 연구)

Oh, S.H.;Lim, H.;Park, S.Y.;Kim, I.S.;Park, H.S.
- Proceedings of the IEEK Conference
- /
- 2000.09a
- /
- pp.623-626
- /
- 2000
본 논문에서는 얼굴의 특징 추출을 이용해서 캐리커쳐를 자동으로 생성하는 알고리즘을 제안한다. 제안된 방법은 사진이나 카메라를 이용해서 입력된 영상으로부터 색상정보를 이용하여 얼굴영역을 검출하고 얼굴의 기하학적인 구조를 이용해서 유전자 알고리즘의 추정 파라미터를 설정하여 최적의 특징 점의 위치를 검출한다. 검출된 특징 점 위치를 이용하여 눈, 코, 입, 눈썹, 머리카락 등 얼굴의 특징이 되는 구성요소를 추출한다. 마지막으로 얼굴의 윤곽선을 구한 다음 추출된 얼굴의 구성요소들을 합성하여 간단하면서도 개인의 특징을 잘 반영할 수 있는 캐리커쳐를 생성한다.
PDF

A Real-Time Automatic Diagnosis System for Semiconductor Process (반도체 공정 실시간 자동 진단 시스템)

권오범;한혜정;김계영
- Proceedings of the Korean Information Science Society Conference
- /
- 2003.04c
- /
- pp.241-243
- /
- 2003
일반적으로 사용되는 반도체 공정에 대한 진단 기법은 한 공정을 진행하기 전에 테스트 공정을 수행하여 공정의 진행 여부를 결정하고, 한 공정의 진행을 완료한 후에 다시 테스트 공정을 수행하여 공정의 결과를 진단하는 방법이다. 본 논문에서 제안하는 실시간 자동 진단 시스템은 기존 방법의 문제점인 자원의 낭비를 막고, 실시간으로 진단함으로써 시간의 낭비를 막는 진단 시스템을 제안한다. 실시간 자동 진단 시스템은 크게 시스템 초기화 단계, 학습 단계 그리고 예측 단계로 나누어진다. 초기화 단계는 진단할 공정에 대한 사전 입력값을 받아 시스템을 초기화하는 과정으로 공정장비 파라미터별 중요도 자동 설정 과정과 초기화 클러스터링으로 이루어진다. 학습 단계는 실시간으로 저장된 공정장치별 데이터와 계측기로부터 획득된 데이터를 이용하여 최적의 유사 클래스를 결정하는 단계와 결정된 유사 클래스를 이용하여 가중치를 학습하는 단계로 나누어진다. 예측 단계는 공정 진행 중 획득된 실시간 데이터를 학습 단계에서 결정된 파라미터별 가중치를 사용하여 공정에 대한 진단을 한다. 본 시스템에서 사용하는 클러스터링 알고리즘은 DTW(Dynamic Time Warping)를 이용하여 파라미터 데이터에 대한 특징을 추출하고 LBG(Linde, Buzo and Gray) 알고리즘을 사용하여 데이터를 군집화 한다.
PDF

Acoustic parameters for induced emotion categorizing and dimensional approach (자연스러운 정서 반응의 범주 및 차원 분류에 적합한 음성 파라미터)

Park, Ji-Eun;Park, Jeong-Sik;Sohn, Jin-Hun
- Science of Emotion and Sensibility
- /
- v.16 no.1
- /
- pp.117-124
- /
- 2013
This study examined that how precisely MFCC, LPC, energy, and pitch related parameters of the speech data, which have been used mainly for voice recognition system could predict the vocal emotion categories as well as dimensions of vocal emotion. 110 college students participated in this experiment. For more realistic emotional response, we used well defined emotion-inducing stimuli. This study analyzed the relationship between the parameters of MFCC, LPC, energy, and pitch of the speech data and four emotional dimensions (valence, arousal, intensity, and potency). Because dimensional approach is more useful for realistic emotion classification. It results in the best vocal cue parameters for predicting each of dimensions by stepwise multiple regression analysis. Emotion categorizing accuracy analyzed by LDA is 62.7%, and four dimension regression models are statistically significant, p<.001. Consequently, this result showed the possibility that the parameters could also be applied to spontaneous vocal emotion recognition.
PDF

Pattern Classification of Hard Disk Defect Distribution Using Gaussian Mixture Model (가우시안 혼합 모델을 이용한 하드 디스크 결함 분포의 패턴 분류)

Jun, Jae-Young;Kim, Jeong-Heon;Moon, Un-Chul;Choi, Kwang-Nam
- Proceedings of the Korean Information Science Society Conference
- /
- 2008.06c
- /
- pp.482-486
- /
- 2008
본 논문에서는 하드 디스크 드라이브(Hard Disk Drive, HDD) 생산 공정 과정에서 발생할 수 있는 불량 HDD의 결함 분포에 대해서 패턴을 자동으로 분류해주는 기법을 제시한다. 이를 위해서 표준 패턴 클래스로 분류되어 있는 불량 HDD의 각 클래스의 확률 모델을 GMM(Gaussian Mixture Model)로 가정한다. 실험은 전문가에 의해 분류된 실제 HDD 결함 분포로부터 5가지의 특징 값들을 추출한 후, 결함 분포의 클래스를 표현할 수 있는 GMM의 파라미터(Parameter)를 학습한다. 각 모델의 파라미터를 추정하기 위해 EM(Expectation Maximization) 알고리즘을 사용한다. 학습된 GMM의 분류 테스트는 학습에 사용되지 않은 HDD 결함 분포에서 5가지의 특징 값을 입력 값으로 추정된 모델들의 파라미터 값에 의해 사후 확률을 구한다. 계산된 확률 값 중 가장 큰 값을 갖는 모델의 클래스를 표준 패턴 클래스로 분류한다. 그 결과 제시된 GMM을 이용한 HDD의 패턴 분류의 결과 96.1%의 정답률을 보여준다.
PDF

Fingerprint Feature Extraction Using the Convex Structure (컨벡스(Convex) 구조를 이용한지문의 특징점 추출)

김두현;박래홍
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.40 no.6
- /
- pp.1-9
- /
- 2003
In this paper, we propose a new fingerprint feature extraction method using the convex structure. A fingerprint minutiae flows along the uniform direction and is regarded as a sinusoidal signal across the normal direction. Local maxima of the signal represent coarse thinned one-pixel-wide ridges in which the convex region of the signal correspond to ridges. The proposed fingerprint feature extraction method detects the convex structure and local maxima. Finally fingerprint features are extracted from one-pixel-wide ridges. Because it has no parameter, it is efficient for various fingerprint identification systems.
PDF KSCI

Search Result 225, Processing Time 0.031 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)