통합 검색 | Korea Science

Korean LVCSR for Broadcast News Speech

Lee, Gang-Seong
- The Journal of the Acoustical Society of Korea
- /
- 제20권2E호
- /
- pp.3-8
- /
- 2001
In this paper, we will examine a Korean large vocabulary continuous speech recognition (LVCSR) system for broadcast news speech. The combined vowel and implosive unit is included in a phone set together with other short phone units in order to obtain a longer unit acoustic model. The effect of this unit is compared with conventional phone units. The dictionary units for language processing are automatically extracted from eojeols appearing in transcriptions. Triphone models are used for acoustic modeling and a trigram model is used for language modeling. Among three major speaker groups in news broadcasts-anchors, journalists and people (those other than anchors or journalists, who are being interviewed), the speech of anchors and journalists, which has a lot of noise, was used for testing and recognition.
PDF

C++ 언어와 Standard Library 를 이용한 음성인식기 개발 (Development of a Speech Recognition System uSing e++ Language and Standard library)

황규웅
- 한국음향학회:학술대회논문집
- /
- 한국음향학회 1998년도 제15회 음성통신 및 신호처리 워크샵(KSCSP 98 15권1호)
- /
- pp.74-77
- /
- 1998
우리는 C++를 이용하여 음성인식기를 구현하여 기존의 C를 이용한 경우에 비하여 30% 수준의 소스로 표현하였고 인식기의 공동개발, 확장 및 개선, 기술 전수 등이 용이하게 되었으며 이를 음성인식 엔진 및 음성인식 연구를 위한 툴로 사용할 수 있게 되었다. 이 인식기의 특징으로는 연속 음성 및 대화체 음성을 인식할 수 있으며 trigram 언어 모델을 사용하였고 문맥 종속 음소 모델링에서는 기존의 triphone 보다 넓은 문맥을 고려한 n-phone context modeling을 사용하였으며 모델의 선정에는 음성학적 지식을 기반으로 한 질문을 사용한 decision tree를 사용하여 훈련에 나타나지 않은 단어나 문맥인 경우라도 가장 가까운 모델을 선정할 수 있게 하였다. 또, tree lexicon을 사용하여 속도를 개선하였으며 state 단위의 모델 공유를 통해 제한된 데이터를 이용하여 더 많은 모델을 훈련할 수 있어 성능을 개선하였다. 상용화를 염두에 두고 pc에서 구현하였다.
PDF

미등록어 추정을 이용한 TAKTAG의 개선 (Improvement of TAKTAG using unknown-word handling)

차정원;이원일;이근배;이종혁
- 한국정보과학회 언어공학연구회:학술대회논문집(한글 및 한국어 정보처리)
- /
- 한국정보과학회언어공학연구회 1996년도 제8회 한글 및 한국어 정보처리 학술대회
- /
- pp.203-206
- /
- 1996
본 논문에서는 음소단위의 bigram과 trigram 정보를 이용하여 어절내에서의 위치와 개수에 관계없이 미등록어를 추정하고, 미등록어용 형태소 패턴 사전을 도입하여 마치 등록어처럼 미등록어를 처리할 수 있는 방법을 제안한다. 제안된 미등록어 추정 모텔은 조사나 어미와 같은 기능어에 의한 간접적인 추정방법이 아닌 미등록어 자체의 추정과 접속정보를 이용한 검사를 동시에 하여 정확도를 높였다. 본 미등록어 추정방법은 기존의 한국어 품사태깅모델인 TAKTAG에 적용하여 미등록어가 포함된 어절에 대해서 83.72%의 성능을 보였다.
PDF

비교사 분할 및 병합으로 구한 의사형태소 음성인식 단위의 성능 (Performance of Pseudomorpheme-Based Speech Recognition Units Obtained by Unsupervised Segmentation and Merging)

방정욱;권오욱
- 말소리와 음성과학
- /
- 제6권3호
- /
- pp.155-164
- /
- 2014
This paper proposes a new method to determine the recognition units for large vocabulary continuous speech recognition (LVCSR) in Korean by applying unsupervised segmentation and merging. In the proposed method, a text sentence is segmented into morphemes and position information is added to morphemes. Then submorpheme units are obtained by splitting the morpheme units through the maximization of posterior probability terms. The posterior probability terms are computed from the morpheme frequency distribution, the morpheme length distribution, and the morpheme frequency-of-frequency distribution. Finally, the recognition units are obtained by sequentially merging the submorpheme pair with the highest frequency. Computer experiments are conducted using a Korean LVCSR with a 100k word vocabulary and a trigram language model obtained by a 300 million eojeol (word phrase) corpus. The proposed method is shown to reduce the out-of-vocabulary rate to 1.8% and reduce the syllable error rate relatively by 14.0%.
https://doi.org/10.13064/KSSS.2014.6.3.155 인용 PDF KSCI

자연어 처리 및 기계학습을 활용한 제조업 현장의 품질 불량 예측 방법론 (A Method for Prediction of Quality Defects in Manufacturing Using Natural Language Processing and Machine Learning)

노정민;김용성
- Journal of Platform Technology
- /
- 제9권3호
- /
- pp.52-62
- /
- 2021
제조업 현장에서 제작 공정 수행 전 품질 불량 위험 공정을 예측하여 사전품질관리를 수행하는 것은 매우 중요한 일이다. 하지만 기존 엔지니어의 역량에 의존하는 방법은 그 제작공정의 종류와 수가 다양할수록 인적, 물리적 한계에 부딪힌다. 특히 원자력 주요기기 제작과 같이 제작공정이 매우 광범위한 도메인 영역에서는 그 한계가 더욱 명확하다. 본 논문은 제조업 현장에서 자연어 처리 및 기계학습을 활용하여 품질 불량 위험 공정을 예측하는 방법을 제시하였다. 이를 위해 실제 원자력발전소에 설치되는 주기기를 제작하는 공장에서 6년 동안 수집된 제작 기록의 텍스트 데이터를 활용하였다. 텍스트 데이터의 전처리 단계에서는 도메인 지식이 잘 반영될 수 있도록 단어사전에 Mapping 하는 방식을 적용하였고, 문장 벡터화 과정에서는 N-gram, TF-IDF, SVD를 결합한 하이브리드 알고리즘을 구성하였다. 다음으로 품질 불량 위험 공정을 분류해내는 실험에서는 k-fold 교차 검증을 적용하고 Unigram에서 누적 Trigram까지 여러 케이스로 나누어 데이터셋에 대한 객관성을 확보하였다. 또한, 분류 알고리즘으로 나이브 베이즈(NB)와 서포트 벡터 머신(SVM)을 사용하여 유의미한 결과를 확보하였다. 실험결과 최대 accuracy와 F1-score가 각각 0.7685와 0.8641로서 상당히 유효한 수준으로 나타났다. 또한, 수행해본 적이 없는 새로운 공정을 예측하여 현장 엔지니어들의 투표와의 비교를 통해서 실제 현장에 자연스럽게 적용할 수 있음을 보여주었다.
PDF KSCI

CT 조사를 통한 청화백자투각연당 초팔괘문연적의 3차원적 구조와 제작방법에 대한 고찰 (Computed tomography investigation of the three-dimensional structure and production method of White Porcelain Water Dropper with Openwork Lotus Scroll Design and Eight Trigram Design in Cobalt-blue Underglaze)

나아영;황현성
- 박물관보존과학
- /
- 제25권
- /
- pp.1-8
- /
- 2021
국립중앙박물관 소장품 청화백자투각연당초팔괘문연적(수정147)을 대상으로 CT 조사를 실시하고 복제품을 제작하여 구조와 제작방법에 대해 살펴보았다. CT 조사를 실시한 결과, 접합선이나 기공이 없는 것으로 보아 하부 동체부 틀을 사용하여 한 번에 찍어 빼낸 후 상부 틀로 찍어 뽑아낸 상부 뚜껑을 서로 접합하였음을 알게 되었다. 특히 하부 동체부의 내기 상면과 연접한 뚜껑 하단면이 서로 접합이 잘 되도록 내면 가운데를 대나무 칼로 거칠게 돌려 깎아 접합면이 누수 되지 않도록 처리하였다. 처음 제작할 당시 물을 담는 연적의 내기(內器)는 도량형 규격에 맞고 틀을 뽑아내기에도 용이한 원통형(圓筒形)으로 만들었을 것으로 짐작되나 상부면과 동체부를 붙이는 과정에서 형태가 사다리꼴로 변형되었을 것으로 여겨진다. 또한 실리콘 복제를 이용하여 원통형 내기로 다시 제작한 후 내기의 용량을 비교 측정한 결과, 3D프린팅을 이용해 복제한 유물 내기의 용량이 152.5㎖인데 반해 원통형 내기의 용량은 대략 168.6㎖로 조선시대 도량형 기준인 '량(量)'의 단위로 3홉(약 174㎖)과 유사하다는 것을 확인 할 수 있었다. 원통형 내기의 용량이 조선 후기 도량형 기준과 부합하므로 실제 도공이 팔괘문연적을 제작할 당시 원통형내기를 가진 연적으로 제작하였을 것으로 생각된다.
https://doi.org/10.22790/conservation.2021.25.0001 인용 PDF KSCI

한역(漢易) 괘기설(卦氣說)의 학술적 배경에 대한 연구 (A Study on the Academic Background of Gwae(卦氣) Theory of Yiology in Han(漢) Dynasty)

은석민
- 대한한의학원전학회지
- /
- 제21권3호
- /
- pp.69-81
- /
- 2008
Gwae(卦氣) theory was one of the main theoretical foundation of yiology in the Han(漢)-dynasty. It was based on the concept that the trigram or hexagram of the book of change corresponds to the seasonal point such as 24 solar terms in one year, so there was so much influence from astronomy and divination system of that time in the development of theoretical principle of Gwae(卦氣) theory. Since Han(漢) Dynasty, the theoretical method such as Gwae(卦氣) theory that correlates the astronomy and divination system with the book of change, had become one of the main academic thoughts throughout the entire history in China, and it was also like that in medicine. Nevertheless there still exists the skeptical sights that Gwae(卦氣) theory was not a part of orthodox yiology, that had been developed by Confucian scholar and had also been recognized as the right path to the study of the book of change. Nowadays because of the new opportunity such as the excavation of the ancient silk script, this kind of controversy has moved on its another step. With regard to this problem, this article will treat the current thoughts about the Gwae(卦氣) theory and think about the substantial basis of each point of view.
PDF

한국어 음성인식 플랫폼(ECHOS)의 개선 및 평가 (Improvement and Evaluation of the Korean Large Vocabulary Continuous Speech Recognition Platform (ECHOS))

권석봉;윤성락;장규철;김용래;김봉완;김회린;유창동;이용주;권오욱
- 대한음성학회지:말소리
- /
- 제59호
- /
- pp.53-68
- /
- 2006
We report the evaluation results of the Korean speech recognition platform called ECHOS. The platform has an object-oriented and reusable architecture so that researchers can easily evaluate their own algorithms. The platform has all intrinsic modules to build a large vocabulary speech recognizer: Noise reduction, end-point detection, feature extraction, hidden Markov model (HMM)-based acoustic modeling, cross-word modeling, n-gram language modeling, n-best search, word graph generation, and Korean-specific language processing. The platform supports both lexical search trees and finite-state networks. It performs word-dependent n-best search with bigram in the forward search stage, and rescores the lattice with trigram in the backward stage. In an 8000-word continuous speech recognition task, the platform with a lexical tree increases 40% of word errors but decreases 50% of recognition time compared to the HTK platform with flat lexicon. ECHOS reduces 40% of recognition errors through incorporation of cross-word modeling. With the number of Gaussian mixtures increasing to 16, it yields word accuracy comparable to the previous lexical tree-based platform, Julius.
PDF

악관절의 기능과 이상에 관한 역학적(易學的) 해석 (A Changeological Interpretation on the Function and Malfunction of the Oromaxillary Structure)

지규용;이영준
- 턱관절균형의학회지
- /
- 제7권1호
- /
- pp.11-16
- /
- 2017
In order to understand Changeologically on the meaning of FCST's TMJ (temporo-mandibular joint) treatment procedure, Yi, Shike, Bi, Gen trigrams concerning the jaw and change by treatment were analyzed from the viewpoint of semiotic context of hexagon and holistic interpretation on disease. Yi is meant by jaw but actually indicates mouth made by maxilla and mandible, and it's characters are related with nourishing by aliment and words. But when we eat and speak in the daily life, jaw does not nourish properly it's own body by bad habit or postures. For the treatment of this ill state, there needs punishment and correction symbolized with Shike. Shike has fourth nine meaning obstacles between the two strong lines in the upper and lower end, and so it has the function of mastication and get rid of the fourth nine metaphorically indicating subluxation of axis using CBA and auxiliary measures of four movement or laughing methods. Bi expresses the achievement and effects of consecutive mastication process implicating normalized manifestation of jaw and its linked spinal function. Gen symbolizes removing selfish motive or partiality in advance and reaches the best state of the saint righteously self-nourishable human being.
PDF

어절별 중의성 해소 정보를 이용한 품사 태깅의 성능 향상 (Improving Part-of-speech Tagging by using Resolution Information for Individual Ambiguous Word)

박희근;서영훈
- 한국정보과학회 언어공학연구회:학술대회논문집(한글 및 한국어 정보처리)
- /
- 한국정보과학회언어공학연구회 2007년도 제19회 한글 및 한국어 정보처리 학술대회
- /
- pp.134-139
- /
- 2007
품사 태깅 시스템에서 규칙 정보와 통계 정보는 상호보완적으로 사용되어 품사 태깅의 성능을 향상시킨다. 하지만, 두 가지 정보로는 품사 태깅의 성능을 향상시키기에는 한계가 있다. 이에 본 논문에서는 어절별 중의성 해소 정보를 이용하여 품사 태깅 시스템의 정확률을 향상시키는 방법에 대해서 기술한다. 통계 정보는 21세기 세종계획의 천만 어절 균형 말뭉치와 태그 부착 말뭉치에서 추출한 trigram 형태의 중의성 어절 및 품사 태그열 출현 빈도 정보를 이용하여 구축하였고, 규칙 정보는 보조용언, 숙어, 관용적 표현 등을 이용하여 구축하였다. 어절별 중의성 해소 정보는 세종 천만 어절 균형 말뭉치의 중의성 어절에서 고빈도 상위 50%에 해당하는 어절을 대상으로 해당 어절의 의미정보와 문맥정보를 고려하여 구축되었고, 이것은 통계 정보를 이용한 품사 태깅 전에 적용되어 분석 후보를 줄여준다. 또한, 학습을 통하여 어절별 중의성 해소 정보를 수정 및 보강하여 잘못된 품사 태깅 결과를 보정해준다. 이와 같이 통계 정보와 규칙 정보를 이용한 품사 태깅 시스템에 고빈도 중의성 어절에 대한 어절별 중의성 해소 정보를 이용함으로써 품사 태깅의 성능을 향상시킬 수 있었다.
PDF

검색결과 39건 처리시간 0.027초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)