Search | Korea Science

Vector Quantizer Based Speaker Normalization for Continuos Speech Recognition (연속음성 인식기를 위한 벡터양자화기 기반의 화자정규화)

Shin Ok-keun
- The Journal of the Acoustical Society of Korea
- /
- v.23 no.8
- /
- pp.583-589
- /
- 2004
Proposed is a speaker normalization method based on vector quantizer for continuous speech recognition (CSR) system in which no acoustic information is made use of. The proposed method, which is an improvement of the previously reported speaker normalization scheme for a simple digit recognizer, builds up a canonical codebook by iteratively training the codebook while the size of codebook is increased after each iteration from a relatively small initial size. Once the codebook established, the warp factors of speakers are estimated by comparing exhaustively the warped versions of each speaker's utterance with the codebook. Two sets of phones are used to estimate the warp factors: one, a set of vowels only. and the other, a set composed of all the Phonemes. A Piecewise linear warping function which corresponds to the estimated warp factor is adopted to warp the power spectrum of the utterance. Then the warped feature vectors are extracted to be used to train and to test the speech recognizer. The effectiveness of the proposed method is investigated by a set of recognition experiments using the TIMIT corpus and HTK speech recognition tool kit. The experimental results showed comparable recognition rate improvement with the formant based warping method.
PDF KSCI

A Study on the Improvement of DTW with Speech Silence Detection (음성의 묵음구간 검출을 통한 DTW의 성능개선에 관한 연구)

Kim, Jong-Kuk;Jo, Wang-Rae;Bae, Myung-Jin
- Speech Sciences
- /
- v.10 no.4
- /
- pp.117-124
- /
- 2003
Speaker recognition is the technology that confirms the identification of speaker by using the characteristic of speech. Such technique is classified into speaker identification and speaker verification: The first method discriminates the speaker from the preregistered group and recognize the word, the second verifies the speaker who claims the identification. This method that extracts the information of speaker from the speech and confirms the individual identification becomes one of the most efficient technology as the service via telephone network is popularized. Some problems, however, must be solved for the real application as follows; The first thing is concerning that the safe method is necessary to reject the imposter because the recognition is not performed for the only preregistered customer. The second thing is about the fact that the characteristic of speech is changed as time goes by, So this fact causes the severe degradation of recognition rate and the inconvenience of users as the number of times to utter the text increases. The last thing is relating to the fact that the common characteristic among speakers causes the wrong recognition result. The silence parts being included the center of speech cause that identification rate is decreased. In this paper, to make improvement, We proposed identification rate can be improved by removing silence part before processing identification algorithm. The methods detecting speech area are zero crossing rate, energy of signal detect end point and starting point of the speech and process DTW algorithm by using two methods in this paper. As a result, the proposed method is obtained about 3% of improved recognition rate compare with the conventional methods.
PDF

A Study on the Improvement of Regulations for AMO Global Recognition System of International Civil Aviation Organization (정비조직인증 국제인정체계 대응을 위한 규정 개선 연구)

Choe, Yunseon;Lee, Sunkyung;Lee, Chaeyoung
- Journal of Aerospace System Engineering
- /
- v.14 no.3
- /
- pp.32-41
- /
- 2020
The International Civil Aviation Organization (ICAO) in 2015 proposed a road-map for the global recognition system of the Approved Maintenance Organization (AMO) fto mitigate the redundant work and regulatory burdens of the aviation industry and authorities on the certification and oversight activities of the State of Registry. Since then, the ICAO standards and guidelines have been revised accordingly with the goal of implementing the system in 2024. Korea should actively prepare for this AMO global recognition system to cope with the ICAO road-map appropriately as well as to develop the Maintenance Repair Overhaul (MRO) industry. Thus, this paper focused on the ratings and limitations system, a key element of the AMO, and proposes the improvement of domestic regulatory/administrative rules necessary for the global recognition system, through the review of newly established ICAO standards/guidelines and the comparative analysis of leading aviation countries' and Korean system/requirements.
https://doi.org/10.20910/JASE.2020.14.3.32 인용 PDF KSCI

The Teachers' Recognition and a Plan for the Improvement of the System on Selection of Gifted Students in Science Using Teachers' Observation and Nomination (과학 영재 관찰.추천 선발 방식에 대한 교사의 인식 조사 및 개선 방안)

Bang, Mi Seon;Kim, Yong Gwon
- Journal of Korean Elementary Science Education
- /
- v.32 no.2
- /
- pp.169-184
- /
- 2013
The purpose of this study is to investigate teachers' recognition and to suggest an improvement in the system of teacher's observation and nomination used to selecting gifted and talented students in Science in the Busan Metropolitan School District in 2013 by investigating teachers' recognition of the system and their expressed needs. The results are as follows. First, it was observed that teachers are of the opinion that it is difficult to determine the science gifted students by observation due to their lack of expertise in giftedness and gifted education, the lack of a check list to use, and the difficulty of ensuring the objectivity of the results of the determination. Second, the absence of objective screening tools used for the selection, the selection of gifted students based on their subjective judgment, and the possibility to select students based only on visible manifestations of ability may cause parents to mistrust the system. Thus, institutional support is required to address the concerns of teachers and parents. Third, the teachers who are in charge of observation, nomination, selection and determination need to be trained. After that, at least one of these teachers should be assigned in each school and training should operate continuously and systematically. Lastly, while these things are occurring, the process of observation and nomination of by teachers, which is the basis of pooling gifted students at the level of Busan Metropolitan School District, should be continued.
https://doi.org/10.15267/keses.2013.32.2.169 인용 PDF

Emotion Recognition Using Tone and Tempo Based on Voice for IoT (IoT를 위한 음성신호 기반의 톤, 템포 특징벡터를 이용한 감정인식)

Byun, Sung-Woo;Lee, Seok-Pil
- The Transactions of The Korean Institute of Electrical Engineers
- /
- v.65 no.1
- /
- pp.116-121
- /
- 2016
In Internet of things (IoT) area, researches on recognizing human emotion are increasing recently. Generally, multi-modal features like facial images, bio-signals and voice signals are used for the emotion recognition. Among the multi-modal features, voice signals are the most convenient for acquisition. This paper proposes an emotion recognition method using tone and tempo based on voice. For this, we make voice databases from broadcasting media contents. Emotion recognition tests are carried out by extracted tone and tempo features from the voice databases. The result shows noticeable improvement of accuracy in comparison to conventional methods using only pitch.
https://doi.org/10.5370/KIEE.2016.65.1.116 인용 PDF KSCI

Neural Network Recognition of Scanning Electron Microscope Image for Plasma Diagnosis (플라즈마 진단을 위한 Scanning Electron Microscope Image의 신경망 인식 모델)

Ko, Woo-Ram;Kim, Byung-Whan
- Proceedings of the KIEE Conference
- /
- 2006.04a
- /
- pp.132-134
- /
- 2006
To improve equipment throughput and device yield, a malfunction in plasma equipment should be accurately diagnosed. A recognition model for plasma diagnosis was constructed by applying neural network to scanning electron microscope (SEM) image of plasma-etched patterns. The experimental data were collected from a plasma etching of tungsten thin films. Faults in plasma were generated by simulating a variation in process parameters. Feature vectors were obtained by applying direct and wavelet techniques to SEM Images. The wavelet techniques generated three feature vectors composed of detailed components. The diagnosis models constructed were evaluated in terms of the recognition accuracy. The direct technique yielded much smaller recognition accuracy with respect to the wavelet technique. The improvement was about 82%. This demonstrates that the direct method is more effective in constructing a neural network model of SEM profile information.
PDF

Feature Combination and Selection Using Genetic Algorithm for Character Recognition (유전 알고리즘을 이용한 특징 결합과 선택)

Lee Jin-Seon
- The Journal of the Korea Contents Association
- /
- v.5 no.5
- /
- pp.152-158
- /
- 2005
By using a combination of different feature sets extracted from input character patterns, we can improve the character recognition system performance. To reduce the dimensionality of the combined feature vector, we conduct the feature selection. This paper proposes a general framework for the feature combination and selection for character recognition problems. It also presents a specific design for the handwritten numeral recognition. Tn the design, DDD and AGD feature sets are extracted from handwritten numeral patterns, and a genetic algorithm is used for the feature selection. Experimental result showed a significant accuracy improvement by about 0.7% for the CENPARMI handwrittennumeral database.
PDF

Performance Improvement of Microphone Array Speech Recognition Using Features Weighted Mahalanobis Distance (가중특징 Mahalanobis거리를 이용한 마이크 어레이 음석인식의 성능향상)

Nguyen, Dinh Cuong;Chung, Hyun-Yeol
- The Journal of the Acoustical Society of Korea
- /
- v.29 no.1E
- /
- pp.45-53
- /
- 2010
In this paper, we present the use of the Features Weighted Mahalanobis Distance (FWMD) in improving the performance of Likelihood Maximizing Beamforming (Limabeam) algorithm in speech recognition for microphone array. The proposed approach is based on the replacement of the traditional distance measure in a Gaussian classifier with adding weight for different features in the Mahalanobis distance according to their distances after the variance normalization. By using Features Weighted Mahalanobis Distance for Limabeam algorithm (FWMD-Limabeam), we obtained correct word recognition rate of 90.26% for calibrate Limabeam and 87.23% for unsupervised Limabeam, resulting in a higher rate of 3% and 6% respectively than those produced by the original Limabearn. By implementing a HM-Net speech recognition strategy alternatively, we could save memory and reduce computation complexity.
PDF KSCI

Named Entity Boundary Recognition Using Hidden Markov Model and Hierarchical Information (은닉 마르코프 모델과 계층 정보를 이용한 개체명 경계 인식)

Lim, Heui-Seok
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.7 no.2
- /
- pp.182-187
- /
- 2006
This paper proposes a method for boundary recognition of named entity using hidden markov model and ontology information of biological named entity. We uses smoothing method using 31 feature information of word and hierarchical information to alleviate sparse data problem in HMM. The GENIA corpus version 2.1 was used to train and to experiment the proposed boundary recognition system. The experimental results show that the proposed system outperform the previous system which did not use ontology information of hierarchical information and smoothing technique. Also the system shows improvement of execution time of boundary recognition.
PDF

Performance analysis of shape recognition in Senzimir mill control systems (젠지미어 압연기 제어시스템에서 형상인식에 관한 성능분석)

Lee, M.H.;Shin, J.M.;Han, S.I.;Kim, J.S.
- Journal of Power System Engineering
- /
- v.15 no.5
- /
- pp.83-90
- /
- 2011
In general, 20-high Sendzimir mills(ZRM) use small diameter work rolls to provide massive rolling force. Because of small diameter of work rolls, steel strip has a complex shape mixed with quarter, edge and center waves. Especially when the shape of the strip is controlled automatically, the actuator saturation occurs. These problems affect the productivity and quality of products. In this paper, the problems in automatic shape control of ZRM were analyzed. In order to evaluate the problems for the automatic shape control in ZRM, recognition performance was analyzed by comparing the measured shape and the recognized shape. The actuator positions by the shape recognition and the manual operation were compared. From the analysis results, the necessity of the improvement of recognition performance in ZRM is suggested.
https://doi.org/10.9726/kspse.2011.15.5.083 인용 PDF KSCI

Search Result 1,496, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)