Search | Korea Science

Pronunciation Network Construction of Speech Recognizer for Mispronunciation Detection of Foreign Language (한국인의 외국어 발화오류 검출을 위한 음성인식기의 발음 네트워크 구성)

Lee Sang-Pil;Kwon Chul-Hong
- MALSORI / no.49 / pp.123-134 / 2004
An automatic pronunciation correction system provides learners with correction guidelines for each mispronunciation. In this paper, we propose an HMM-based speech recognizer that automatically classifies pronunciation errors made by Koreans speaking Japanese. We also propose two pronunciation networks for automatic detection of mispronunciation. We evaluate the networks by computing the correlation between human ratings and the machine scores obtained from the speech recognizer.
PDF
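
The evaluation described in this abstract, correlating human ratings with machine scores, can be sketched as follows. This is a minimal illustration assuming Pearson correlation; the abstract does not say which correlation measure was used, and the rating and score values below are hypothetical, not data from the paper.

```python
import math


def pearson_correlation(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length (at least 2 items)")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-deviations and squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical example: one human rating and one recognizer score per utterance.
human_ratings = [1, 2, 3, 4, 5]
machine_scores = [0.2, 0.4, 0.5, 0.8, 0.9]
r = pearson_correlation(human_ratings, machine_scores)
print(f"correlation between human and machine scores: {r:.3f}")
```

A value of `r` close to 1 would indicate that the machine scores rank utterances much as the human raters do.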

The recognition of two-dimensional objects using the inverse histogram (인버스 히스토그램을 이용한 다수의 이차원 물체 인식)

박성혁;고명삼
- 제어로봇시스템학회:학술대회논문집 / 1986.10a / pp.331-336 / 1986
Because thresholding based on the intensity histogram is the most attractive segmentation technique in terms of fast image processing, this paper defines a new function, the inverse histogram of intensity, and uses it to find a threshold. Segmentation errors are removed by adjusting the scan size of blob coloring. The blob-coloring algorithm presented in [6] was improved for good performance, i.e., so that the features of blobs do not change after blob coloring. The rate of successful recognition was about 85 percent.
PDF

The Recognition and Pedagogy of Chinese Tones (중국어 성조의 인지와 교육)

Shim So Hee
- MALSORI / no.40 / pp.65-78 / 2000
Korean learners of Chinese have difficulties pronouncing Chinese tones, which distinguish the meanings of words, because the Korean language has no such tones. This makes Chinese hard for Koreans to acquire. In this paper, I present the following. First, I examine the characteristics of the tones pronounced by Korean speakers, using the methods of modern experimental phonetics. Second, I present a pedagogy of Chinese tones that takes into account the typical errors shown by the experiments on Korean speakers. The pedagogy presented in this paper, which is based on the results of the experiments, is not perfect. However, I expect this paper to serve as an instrumental tool to help Korean speakers improve their command of Chinese.
PDF

A Recognition of Word Spacing Errors Using By Syllable (음절 bigram 특성을 이용한 띄어쓰기 오류의 인식)

강승식
- Proceedings of the Korean Society for Cognitive Science Conference / 2000.06a / pp.85-88 / 2000
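We extracted co-occurrence frequency information between adjacent syllables from a large corpus and investigated the syllable bigram characteristics of Hangul. Syllable bigram characteristics are expected to be useful in a variety of applications, such as automatic word spacing for documents in which spacing has been ignored, determining whether a given word contains a spacing error, and correcting misspelled words in a spelling checker. In this paper, we conducted experiments that apply Hangul syllable bigram characteristics to automatic word spacing and to determining whether an input word contains a spacing error. The experimental results confirmed that syllable bigram characteristics can be used very effectively.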
PDF

A Recognition of Word Spacing Errors Using By Syllable Bigram (음절 bigram 특성을 이용한 띄어쓰기 오류의 인식)

Kang, Seung-Shik
- Annual Conference on Human and Language Technology / 2000.10d / pp.85-88 / 2000
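We extracted co-occurrence frequency information between adjacent syllables from a large corpus and investigated the syllable bigram characteristics of Hangul. Syllable bigram characteristics are expected to be useful in a variety of applications, such as automatic word spacing for documents in which spacing has been ignored, determining whether a given word contains a spacing error, and correcting misspelled words in a spelling checker. In this paper, we conducted experiments that apply Hangul syllable bigram characteristics to automatic word spacing and to determining whether an input word contains a spacing error. The experimental results confirmed that syllable bigram characteristics can be used very effectively.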
PDF
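
The idea in this abstract, using co-occurrence counts of adjacent syllables to decide where spaces belong, can be illustrated with a small sketch. Everything here is an assumption made for illustration: the function names, the toy corpus, the 0.5 threshold, and the simple ratio used as a "space probability" are hypothetical and are not taken from the paper.

```python
from collections import Counter


def train_spacing_model(sentences):
    """Count, for each adjacent syllable pair, how often it occurs
    with and without a space in between (toy stand-in for a large corpus)."""
    joined = Counter()  # pair seen inside one word, no space between
    spaced = Counter()  # pair seen across a word boundary
    for sentence in sentences:
        words = sentence.split()
        for i, word in enumerate(words):
            for a, b in zip(word, word[1:]):
                joined[(a, b)] += 1
            if i + 1 < len(words):
                spaced[(word[-1], words[i + 1][0])] += 1
    return joined, spaced


def space_probability(model, a, b):
    """Fraction of occurrences of the pair (a, b) that had a space between;
    None if the pair was never seen."""
    joined, spaced = model
    total = joined[(a, b)] + spaced[(a, b)]
    if total == 0:
        return None
    return spaced[(a, b)] / total


def auto_space(model, text, threshold=0.5):
    """Re-insert spaces into text, ignoring any spacing it already has."""
    chars = text.replace(" ", "")
    if not chars:
        return ""
    out = [chars[0]]
    for a, b in zip(chars, chars[1:]):
        p = space_probability(model, a, b)
        # Unseen pairs get no space in this sketch.
        if p is not None and p > threshold:
            out.append(" ")
        out.append(b)
    return "".join(out)


def has_spacing_error(model, text, threshold=0.5):
    """Flag text whose spacing differs from what the bigram model predicts."""
    normalized = " ".join(text.split())
    return auto_space(model, normalized, threshold) != normalized


# Hypothetical two-sentence corpus; a real system would use a large one.
model = train_spacing_model(["나는 학교에 간다", "나는 집에 간다"])
print(auto_space(model, "나는학교에간다"))
print(has_spacing_error(model, "나 는 학교에 간다"))
```

With such a tiny corpus the model only handles syllable pairs it has seen; a realistic version would need smoothing for unseen pairs.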

A STUDY ON MODIFIED MEMBERSHIP FUNCTION BASED ON FREQUENCY VARIATION OF LPC

Choi, Seung-Ho;Kim, Hyoung-Guen
- Proceedings of the Acoustical Society of Korea Conference / 1994.06a / pp.1092-1097 / 1994
To address the frequency variation of speech patterns that consist of LPC sequences, this paper proposes a new membership function derived from the relation between LPC order and spectrum. To reduce errors, fuzzy inference is performed using the proposed membership function. Computer simulation shows its effectiveness for word recognition.
PDF

Constrained High Accuracy Stereo Reconstruction Method for Surgical Instruments Positioning

Wang, Chenhao;Shen, Yi;Zhang, Wenbin;Liu, Yuncai
- KSII Transactions on Internet and Information Systems (TIIS) / v.6 no.10 / pp.2679-2691 / 2012
In this paper, a high-accuracy stereo reconstruction method for surgical instrument positioning is proposed. Usually, the reconstruction of surgical instruments is treated as a basic computer vision task: estimating the 3-D position of each marker on a surgical instrument from three pairs of image points. However, existing methods reconstruct the points separately and thus ignore the structure information. Meanwhile, errors from light variation, imaging noise, and quantization still affect the reconstruction accuracy. This paper proposes a method that takes the structure information of surgical instruments as constraints and reconstructs all the markers on one surgical instrument together. First, we calibrate the instruments before navigation to obtain the structure parameters, which consist of the number of markers, the distances between markers, and a linearity sign for each instrument. Then, the structure constraints are added to the stereo reconstruction. Finally, a weighted filter is used to reduce jitter. Experiments conducted on a surgical navigation system showed that our method not only improves accuracy effectively but also greatly reduces the jitter of the surgical instrument.
PDF KSCI

Optimizing Multiple Pronunciation Dictionary Based on a Confusability Measure for Non-native Speech Recognition (타언어권 화자 음성 인식을 위한 혼잡도에 기반한 다중발음사전의 최적화 기법)

Kim, Min-A;Oh, Yoo-Rhee;Kim, Hong-Kook;Lee, Yeon-Woo;Cho, Sung-Eui;Lee, Seong-Ro
- MALSORI / no.65 / pp.93-103 / 2008
In this paper, we propose a method for optimizing a multiple pronunciation dictionary used for modeling pronunciation variations of non-native speech. The proposed method removes some confusable pronunciation variants in the dictionary, resulting in a reduced dictionary size and less decoding time for automatic speech recognition (ASR). To this end, a confusability measure is first defined based on the Levenshtein distance between two different pronunciation variants. Then, the number of phonemes for each pronunciation variant is incorporated into the confusability measure to compensate for ASR errors due to words of a shorter length. We investigate the effect of the proposed method on ASR performance, where Korean is selected as the target language and Korean utterances spoken by Chinese native speakers are considered as non-native speech. It is shown from the experiments that an ASR system using the multiple pronunciation dictionary optimized by the proposed method can provide a relative average word error rate reduction of 6.25%, with 11.67% less ASR decoding time, as compared with that using a multiple pronunciation dictionary without the optimization.
PDF
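
The Levenshtein distance between two pronunciation variants, which this abstract names as the basis of its confusability measure, can be sketched as below. The abstract does not give the formula for the measure itself, so the length normalization shown here is only an assumed stand-in for "incorporating the number of phonemes", and the phoneme sequences are hypothetical.

```python
def levenshtein(a, b):
    """Edit distance between two sequences (insert, delete, substitute = 1)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        curr = [i]
        for j, y in enumerate(b, 1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def length_normalized_distance(a, b):
    """Assumed normalization (not the paper's formula): divide the edit
    distance by the longer variant's phoneme count, so that one differing
    phoneme weighs more in short variants than in long ones."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 0.0
    return levenshtein(a, b) / longest


# Hypothetical phoneme sequences for two pronunciation variants of one word.
variant_a = ["h", "a", "k", "k", "y", "o"]
variant_b = ["h", "a", "g", "k", "y", "o"]
print(levenshtein(variant_a, variant_b))
print(length_normalized_distance(variant_a, variant_b))
```

Under this assumed normalization, a pruning step could drop variants whose distance to another variant falls below some threshold; how the paper actually selects variants for removal is not stated in the abstract.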

Improvement of Activity Recognition Based on Learning Model of AI and Wearable Motion Sensors (웨어러블 동작센서와 인공지능 학습모델 기반에서 행동인지의 개선)

Ahn, Junguk;Kang, Un Gu;Lee, Young Ho;Lee, Byung Mun
- Journal of Korea Multimedia Society / v.21 no.8 / pp.982-990 / 2018
In recent years, many life-care wearable devices and mobile apps have been developed, along with services that measure movement during walking and show the amount of exercise. However, they do not measure walking in detail, so the total calorie consumption may contain errors. If the user's behavior is measured by a multi-axis sensor and learned by a machine learning algorithm to recognize the kind of behavior, the detailed motions of walking can be distinguished automatically and the total calorie consumption can be calculated more accurately than with the conventional method. To verify this, we measured activities and created a model using a machine learning algorithm. The comparison experiment confirmed that the average accuracy was at least 12.5% higher than that of the conventional method. Also, in measuring the amount of exercise, the calorie consumption accuracy was more than 49.53% higher than that of the conventional method. Performing activity recognition with a wearable device and a machine learning algorithm can thus improve both the recognition accuracy and the accuracy of the energy consumption calculation.
https://doi.org/10.9717/kmms.2018.21.8.982 Cite PDF KSCI

Efficient distribution support of Shipping Request Data based on Digitalized Character Recognition (디지털 문자 판독기술을 적용한 선적요청서 데이터의 효율적인 유통지원)

Park, Joon-Hyuk;Goh, Hyun-Woo
- Journal of Korean Society of Industrial and Systems Engineering / v.31 no.2 / pp.112-121 / 2008
Nowadays, supply chain competitiveness is emphasized more and more over a company's own competitiveness. One of the most important processes in import and export is the issuance of the Bill of Lading (B/L). In issuing those bills, the S/R (shipping request) and the check B/L for review are circulated among consignors, forwarders, shipping companies, and airlines by fax and e-mail. Otherwise, an expensive one-to-one system such as EDI is needed. Each party has to re-enter the S/R data into its own system and check it several times. In the process, the S/R data are repeatedly converted from digital to analog and from analog to digital for checking. As the process goes on, it can produce not only data entry errors but also wasted time and cost. ECR (electronic character recognition) is a technology that can solve this problem. Taking into account the data structure of documents used in many systems, ECR extracts S/R data from documents written in digital form. However, ECR alone is not enough. To make the real-world N-to-N arrangement more efficient, we built a document hub on the web, reengineering the existing process into a one-to-one relation. The ECR document hub system has shown beneficial effects in a field test lasting over a year.
PDF KSCI

Search Result 353, Processing Time 0.027 seconds
