Search | Korea Science

A Query-by-Speech Scheme for Photo Albuming (음성 질의 기반 디지털 사진 검색 기법)

Kim Tae-Sung;Suh Young-Joo;Lee Yong-Ju;Kim Hoi-Rin
- MALSORI
- /
- no.57
- /
- pp.99-112
- /
- 2006
In this paper, we introduce two retrieval methods for photos with speech documents. We compare the pattern of speech query with those of speech documents recorded in digital cameras, and measure the similarities, and retrieve photos corresponding to the speech documents which have high similarity scores. As the first approach, a phoneme recognition scheme is used as the pre-processor for the pattern matching, and in the second one, the vector quantization (VQ) and the dynamic time warping (DTW) are applied to match the speech query with the documents in signal domain itself. Experimental results show that the performance of the first approach is highly dependent on that of phoneme recognition while the processing time is short. The second method provides a great improvement of performance. While the processing time is longer than that of the first method due to DTW, but we can reduce it by taking approximated methods.
PDF

Enhancement of ST-segment Features in ECG Signals by Warping Transformation (워핑 변환을 이용한 심전도 신호의 ST 분절 특징 값 강화)

Shin, Seung-Won;Kim, Kyeong-Seop
- The Transactions of The Korean Institute of Electrical Engineers
- /
- v.59 no.6
- /
- pp.1143-1149
- /
- 2010
In this study, we propose a novel method to detect and enhance the feature of ST-segment which offers the crucial information for the diagnosis of myocardial infarction and ischemia. With this aim, PQRST features of Electrocardiogram initially are detected and subsequently ST-segment are estimated. And Dynamic Time Warping(DTW) transformation is applied recursively to minimize the difference in time between ST-segments and calculate the minimum cumulative distance that decides the degree of similarity among ST-segments. As of the results, the inherent characteristic of ST-segment can be emphasized in terms of time parameter and thus the diagnostic features of a ST-segment can be revealed further.
https://doi.org/10.5370/KIEE.2010.59.6.1143 인용 PDF KSCI

Pattern Recognition of Monitored Waveforms from Power Supplies Feeding High-Speed Rail Systems

Gu, Wei;Zhang, Shuai;Yuan, Xiaodong;Chen, Bing;Bai, Jingjing
- Journal of Electrical Engineering and Technology
- /
- v.11 no.1
- /
- pp.55-64
- /
- 2016
The development of high-speed rail (HSR) has had a major impact on the power supply grid. Based on the monitored waveforms of HSR, a pattern recognition approach is proposed for the first time in this paper to identify the operating conditions. To reduce the data dimensions for monitored waveforms, the principal component analysis (PCA) algorithm was used to extract the characteristics and their waveforms from the monitored waveforms data. The dynamic time wrapping (DTW) algorithm was then used to identify the operating conditions of the HSR. Cases studies show that the proposed approach is effective and feasible, and that it is possible to identify the real-time operating conditions based on the monitored waveforms.
https://doi.org/10.5370/JEET.2016.11.1.055 인용 PDF KSCI KPUBS

Implementation of the Auditory Sense for the Smart Robot: Speaker/Speech Recognition (로봇 시스템에의 적용을 위한 음성 및 화자인식 알고리즘)

Jo, Hyun;Kim, Gyeong-Ho;Park, Young-Jin
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference
- /
- 2007.05a
- /
- pp.1074-1079
- /
- 2007
We will introduce speech/speaker recognition algorithm for the isolated word. In general case of speaker verification, Gaussian Mixture Model (GMM) is used to model the feature vectors of reference speech signals. On the other hand, Dynamic Time Warping (DTW) based template matching technique was proposed for the isolated word recognition in several years ago. We combine these two different concepts in a single method and then implement in a real time speaker/speech recognition system. Using our proposed method, it is guaranteed that a small number of reference speeches (5 or 6 times training) are enough to make reference model to satisfy 90% of recognition performance.
PDF

Development of Audio Feature Sequence Data Indexing Method for Query by Singing and Humming (허밍 기반 음원 검색을 위한 오디오 특징 시퀀스 데이터 색인 기법 개발)

Song, Chai-Jong;Lim, Tea-Buem
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2013.06a
- /
- pp.381-384
- /
- 2013
본 논문에서는 허밍기반 음원 검색 시스템을 위한 오디오 특징 시퀀스 데이터 색인 기법을 제안한다. 우선 Query-by-Singing/Humming (QbSH) 시스템의 특징 데이터베이스를 생성하기 위하여 MP3 와 같은 다성음원에서 주요 멜로디를 추출하여 시퀀스데이터를 생성하고, 고속 검색을 지원하기 위한 시퀀스데이터를 색인화한다. 본 논문에서는 최소 Dynamic Time Warping (DTW) 거리 기법, 시퀀스 추상화 기법, 상한 값 기반 DTW 기법과 같이 세 가지의 시퀀스 데이터의 색인화 기술을 제시하고 각각에 대한 문제점을 파악하고, 성능을 평가한다. 이를 통하여 향상된 검색 시간과 검색 정확도를 얻을 수 있다.
PDF

A Study on Design and Implementation of Embedded System for speech Recognition Process

Kim, Jung-Hoon;Kang, Sung-In;Ryu, Hong-Suk;Lee, Sang-Bae
- Journal of the Korean Institute of Intelligent Systems
- /
- v.14 no.2
- /
- pp.201-206
- /
- 2004
This study attempted to develop a speech recognition module applied to a wheelchair for the physically handicapped. In the proposed speech recognition module, TMS320C32 was used as a main processor and Mel-Cepstrum 12 Order was applied to the pro-processor step to increase the recognition rate in a noisy environment. DTW (Dynamic Time Warping) was used and proven to be excellent output for the speaker-dependent recognition part. In order to utilize this algorithm more effectively, the reference data was compressed to 1/12 using vector quantization so as to decrease memory. In this paper, the necessary diverse technology (End-point detection, DMA processing, etc.) was managed so as to utilize the speech recognition system in real time
https://doi.org/10.5391/JKIIS.2004.14.2.201 인용 PDF KSCI

Enhancement of Ship's Wheel Order Recognition System using Speaker's Intention Predictive Parameters (화자의도예측 파라미터를 이용한 조타명령 음성인식 시스템의 개선)

Moon, Serng-Bae
- Journal of Advanced Marine Engineering and Technology
- /
- v.32 no.5
- /
- pp.791-797
- /
- 2008
The officer of the deck(OOD) may sometimes have to carry out lookout as well as handling of auto pilot without a quartermaster at sea. The purpose of this paper is to develop the ship's auto pilot control module using speech recognition in order to reduce the potential risk of one man bridge system. The feature parameters predicting the OOD's intention was extracted from the sample wheel orders written in SMCP(IMO Standard Marine Communication Phrases). We designed a pre-recognition procedure which could make some candidate words using DTW(Dynamic Time Warping) algorithm, a post-recognition procedure which made a final decision from the candidate words using the feature parameters. To evaluate the effectiveness of these procedures the experiment was conducted with 500 wheel orders.
https://doi.org/10.5916/jkosme.2008.32.5.791 인용 PDF KSCI

Speech Feature Extraction Based on the Human Hearing Model

Chung, Kwang-Woo;Kim, Paul;Hong, Kwang-Seok
- Proceedings of the KSPS conference
- /
- 1996.10a
- /
- pp.435-447
- /
- 1996
In this paper, we propose the method that extracts the speech feature using the hearing model through signal processing techniques. The proposed method includes the following procedure ; normalization of the short-time speech block by its maximum value, multi-resolution analysis using the discrete wavelet transformation and re-synthesize using the discrete inverse wavelet transformation, differentiation after analysis and synthesis, full wave rectification and integration. In order to verify the performance of the proposed speech feature in the speech recognition task, korean digit recognition experiments were carried out using both the DTW and the VQ-HMM. The results showed that, in the case of using DTW, the recognition rates were 99.79% and 90.33% for speaker-dependent and speaker-independent task respectively and, in the case of using VQ-HMM, the rate were 96.5% and 81.5% respectively. And it indicates that the proposed speech feature has the potential for use as a simple and efficient feature for recognition task
PDF

A Study on the Efficient Speech Recognition System using Database Grouping (어휘 그룹화를 이용한 음성인식시스템의 성능향상에 관한 연구)

우상욱;권승호;한수양;이동규;이두수
- Proceedings of the IEEK Conference
- /
- 2003.07e
- /
- pp.2455-2458
- /
- 2003
In this paper, the Classification of Energy Labeling has been Proposed. Energy Parameters of input signal which is extracted from each phoneme is labelled. And groups of labelling according to detected energies of input signals are detected. Next, DTW processes in a selected group of labeling. This leads to DTW processing faster than a previous algorithm. In this Method, because an accurate detection of parameters is necessary on the assumption in steps of a detection of speeching duration and a detection of energy parameters, variable windows which are decided by pitch period is used. Extract algorithms don't search for exact frame energy, because 256 frame window-sizes is fixed. For this reason, a new energy extraction method has been proposed. A pitch period is detected firstly; next window scale is decided between 200 frames and 300 frames. The proposed method make it possible to cancel an influence of windows.
PDF

A Basic Study on Automation of the Subjective Evaluation using Speech Recognition (음성인식을 이용한 주관평가의 자동화에 관한 기초연구)

한화영;고한우;윤용현;조택동
- Proceedings of the Korean Society for Emotion and Sensibility Conference
- /
- 2000.11a
- /
- pp.113-117
- /
- 2000
수작업으로 이루어지고 있는 환경의 영향이나 작업의 영향에 따른 정신피로나 신체피로의 주관적인 평가를 자동화하기 위한 방법에 대하여 논하였다. 사람의 가장 자연스러운 의사소통인 평가어를 척도로 하여 평가가 이루어지는 음성인식기술을 응용한 주관평가법에 대하여 연구하였다. 주관평가의 자동화를 위하여 우선, 평가어에 대한 음성 인식을 한 후 인식된 평가 결과 데이터를 이용하여 설문지를 자동 생성시킴과 동시에 파일 형태로 저장시켰다. 음성 인식 알고리즘으로는 DTW(Dynamic Time Warping)인식 알고리즘을 사용하였고. 설문지 질의 내용은 집중도 평가를 이용하였다. 인식실험은 설문에 대한 응답에 필요한 평가어를 대상으로 하였다.
PDF

Search Result 225, Processing Time 0.021 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)