Search | Korea Science

A Study on the Multiple Pronunciation Dictionary for Spontaneous Speech Recognition (대화체 연속음성인식을 위한 확장 다중발음 사전에 관한 연구)

Kang ByungOk
- Proceedings of the KSPS conference
- /
- 2003.10a
- /
- pp.65-68
- /
- 2003
본 논문에서는 대화체 연속음성인식 과정에서 사용되는 다중발음사전의 개념을 확장하여 대화체 발화에 빈번하게 나타나는 불규칙한 발음변이 현상을 포용하도록 한 확장된 발음사전의 방법을 적용하여 대화체 연속음성인식에서 인식성능의 향상을 가져오게 됨을 실험을 통해 보여준다. 대화체 음성에서 빈번하게 나타나는 음운축약 및 음운탈락, 전형적인 오발화, 양성음의 음성음화 등의 발음변이는 언어모델의 효율성을 떨어뜨리고 어휘 수를 증가시켜 음성인식의 성능을 저하시키고, 또한 음성인식 결과로 나타나는 출력형태가 정형화되지 못하는 단점을 가지고 있다. 이에 이러한 발음변이들을 발음사전에 수용할 때 각각의 대표어휘에 대한 변이발음으로 처리하고, 언어모델과 어휘사전은 대표어휘만을 이용해 구성하도록 한다. 그리고, 음성인식기의 탐색부에서는 각각의 변이발음의 발음열도 탐색하되 대표어휘로 언어모델을 참조하도록 하고, 인식결과를 출력하도록 하여 결과적으로 인식성능을 향상시키고, 정형화된 출력패턴을 얻도록 한다. 본 연구에서는 어절단위 뿐 아니라 의사형태소[2] 단위의 발음사전에도 발음변이를 포용하도록 하여 실험을 하였다. 실험을 통해 어절단위의 다중발음사전 구성을 통해 ERR 10.9％, 의사형태소 단위의 다중발음 사전의 구성을 통해 ERR 4.3％의 성능향상을 보였다.
PDF

Analysis of Korean Spontaneous Speech Characteristics for Spoken Dialogue Recognition (대화체 연속음성 인식을 위한 한국어 대화음성 특성 분석)

박영희;정민화
- The Journal of the Acoustical Society of Korea
- /
- v.21 no.3
- /
- pp.330-338
- /
- 2002
Spontaneous speech is ungrammatical as well as serious phonological variations, which make recognition extremely difficult, compared with read speech. In this paper, for conversational speech recognition, we analyze the transcriptions of the real conversational speech, and then classify the characteristics of conversational speech in the speech recognition aspect. Reflecting these features, we obtain the baseline system for conversational speech recognition. The classification consists of long duration of silence, disfluencies and phonological variations; each of them is classified with similar features. To deal with these characteristics, first, we update silence model and append a filled pause model, a garbage model; second, we append multiple phonetic transcriptions to lexicon for most frequent phonological variations. In our experiments, our baseline morpheme error rate (WER) is 31.65%; we obtain MER reductions such as 2.08% for silence and garbage model, 0.73% for filled pause model, and 0.73% for phonological variations. Finally, we obtain 27.92% MER for conversational speech recognition, which will be used as a baseline for further study.
PDF KSCI

Large Vocabulary Continuous Speech Recognition using Stochastic Pronunciatioin Lexicon Modeling (확률 발음사전을 이용한 대어휘 연속음성인식)

윤성진
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.08a
- /
- pp.315-319
- /
- 1998
대어휘 연속음성인식을 위한 확률 발음사전 모델에 대해서 제안하였다. 제안된 확률 발음 사전은 연속음성과 같은 자연스런 발성에서 자주 발생되는 단어의 변이를 확률적인 subword-state로 이루어진 HMM으로 모델화 함으로써 단어의 발음 변이를 효과적으로 표현할 수 있으며, 단위 인식 시스템의 성능을 보다 높일 수 있도록 구성되었다. 확률 발음사전의 생성은 음성 자료와 음소 모델을 이용하여 단어 단위의 분할과 학습을 통해서 자동으로 생성되게 됨 음소와 같은 언어학적인 단위뿐만 아니라 PLU 이나 비언어학적인 인식 모델을 이용한 연속음성인식기에도 적용이 가능하다.연속음성인식실험결과 확률 발음사전을 사용함으로써 표준 발음 표기를 사용하는 인식 시스템에 비해 단어 오류율은 39.8%, 문장 오류율은 24.4%의 큰 폭으로 오류율을 감소시킬 수 있었다.
PDF

Occurrence and identification of genetic variation and variation continuity in strawberry tissue culture caused by benzyladenine treatment (딸기 조직배양 시 BA (benzyladenine) 처리에 따른 변이 발생 및 변이 연속성 검정)

Kim, Hye Jin;Choi, Mi Ja;Lee, Jong Nam;Suh, Jong Taek;Kim, Ki Deog;Kim, Yul Ho;Hong, Su Young;Kim, Su Jeong;Sohn, Hwang Bae;Nam, Jeong Hwan
- Journal of Plant Biotechnology
- /
- v.47 no.1
- /
- pp.46-52
- /
- 2020
This experiment study aimed to identify the continuous genetic variation caused by benzyladenine (BA) treatment in strawberry tissue culture. The 'Goha' cultivar was used and treated with different concentrations of BA (0.0, 0.5, 1.0, 2.0 mg·L^-1). Morphological and genetic variation tests were performed, and genetic continuity tests were performed for three years. The morphological variation induced by BA was distinctively high (10.5 ~ 20.0%) and the genetic variation was 7.0 ~ 15.0%, 1.8 ~ 10.0%, and 5.0% in the first, second, and third year of cultivation, respectively. The rate of genetic variation decreased with increasing cultivation years. In addition, genetic variation caused by BA 1.0 mg·L^-1 and BA 2.0 mg·L^-1 occurred in the first and second years of cultivation, whereas only BA 2.0 mg·L^-1 caused genetic variation in the third year of cultivation. Therefore, a concentration of less than 1.0 mg·L^-1 BA was used for the propagation of strawberry tissue culture plants, and it was necessary to identify their variation.
https://doi.org/10.5010/JPB.2020.47.1.046 인용 PDF KSCI

Continuum constitutive equation of Shape Memory Alloy based on plasticity model (소성모델에 기초한 형상기억합금의 연속체 구성방정식)

Ryu, Jung-Hyun;Kim, Sang-Huan;Cho, Mang-Hyo
- Proceedings of the Computational Structural Engineering Institute Conference
- /
- 2009.04a
- /
- pp.30-33
- /
- 2009
본 논문에서는 형상기억합금의 특징적인 거동을 모사하기위한 구성방정식을 제안한다. 제안되는 구성방정식은 기존의 소성모델을 기초로 하는 현상학적인 모델로, 소성 경화이론에서 사용되는 항복곡면에 대응되는 상변이 곡면을 정의하여 형상기억합금의 비선형 거동을 모사한다. 단, 상변이 곡면이 1개만 존재하는 소성모델과는 다르게, 오스테나이트에서 마르텐사이트로의 정방향 상변이와 마르텐사이트에서 오스테나이트로의 역방향 상변이를 각각 해석하기위해 독립적인 2개의 상변이 곡면을 정의해주게 된다. 기계적 하중만이 아닌 열적 하중의 변화에도 비선형 거동을 보이는 형상기억합금의 특성을 반영하기위해 상변이 곡면은 응력과 온도의 함수로 정의되며, 이렇게 정의된 상변이 곡면을 바탕으로 리턴 매핑 알고리즘을 적용하여 열적하중과 기계적하중의 변화에 따른 형상기억합금의 거동을 모사하는 구성방정식을 제안하였다.
PDF

Object-based Stereo Sequence Coding using Disparity and Motion Vector Relationship (변이-움직임 벡터의 상관관계를 이용한 객체기반 스테레오 동영상 부호화)

박찬희;손광훈
- Journal of Broadcast Engineering
- /
- v.7 no.3
- /
- pp.238-247
- /
- 2002
In this paper, we propose an object-based stereo sequence compression technique using disparity-motion vector relationship. The proposed method uses the coherence of motion vectors and disparity vectors in the left and right Image sequences. After two motion vectors and one disparity vector ate computed using FBMA(Fixed Block Matching Algorithm), the disparity vector of the current stereoscopic pall is computed by disparity-motion vector relationship with vectors which are previously estimated. Moreover, a vector regularization technique is applied in order to obtain reliable vectors. For an object-based coding. the object is defined and coded in terms of layers of VOP such as in MPEG-4. we present a method using disparity and motion vector relationship for extending two-frame compensation into three-frame compensation method for prediction coding of B-VOP. The proposed algorithm shows a high performance when comparing with a conventional method.
PDF KSCI

A Study on 7-Connected Digits Speech Recognition using SCHMM (SCHMM 기반 7연속 숫자음 인식에 관한 연구)

Kim Se Yong;Jung Hui Seok;Kang Chul Ho
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.127-130
- /
- 2002
본 연구에서는 우리말 연속 숫자음 인식에서 본래의 숫자음을 변이 시키는 주된 요인인 연음현상에 대한 인식을 높이기 위해 별도의 연음부분의 레퍼런스를 작성하여 매칭 시키는 방식을 제안한다 또한 단모음으로 이루어진 /2/와 /5/의 연속된 음에 대하여도 레퍼런스를 작성하였다. 제안한 방식에 의하여 전체적으로 $1.4\%$정도 인식률이 상승됨을 볼 수 있다. 특히 발성 목록중 /82/, /62/, /31/, /15/, /75/ 등의 연음과 /226/, /755/등과 같이 모음의 연속된 발성이 포함된 숫자 열에서 제안된 방식이 인식률에 영향을 미치는 것을 볼 수가 있었다. 이는 연음에서 발생하는 오류가 연속 숫자음에 많은 영향을 미치는 것을 알 수 있다. 그 외에 /22/, /55/등과 같이 단모음으로 이루어진 숫자음의 연속 발성 또한 인식률을 저하시키는데 한 요인으로 작용함으로서 이에 대한 레퍼런스도 작성하여 인식률이 상승되는 것을 볼 수 있었다.
PDF

The Effects of Various Factors on Milk Yield and Variation in Milk Yield Between Milking, Milk Components, Milking Duration, and Milking Flow Rate in Holstein Dairy Cattle (착유우의 연속유량, 유량변이, 유성분, 체세포수, 비유지속시간, 비유속도에 대한 산차, 착유시간, 유기 및 착유간격의 효과)

Ahn, B.S.;Jeon, B.S.;Baek, K.S.;Park, S.J.;Lee, H.J.;Lee, W.S.;Kim, S.B.;Park, S.B.;Kim, H.S.;Ju, J.C.;Khan, M. A.
- Journal of Animal Science and Technology
- /
- v.47 no.6
- /
- pp.919-924
- /
- 2005
This study was carried out to estimate the effects of parity, milking time, milking interval and days in milk(DIM) on variation in milk yield between consecutive milkings(am to pm to am), morning and evening milk yield and its components, somatic cell counts(SCS), milking duration, milk flow rate and peak milk flow in Holstein dairy cattle. Records from one hundred and twenty two heads of Holstein cattle at National Livestock Research Institute, Korea were used for this study from July 1 to August 8, 2005. The experimental herd had average 1.6$\pm$0.9 parities, 199.8$\pm$109.1 DIM and 12.26$\pm$4.06kg milk yields at each milking. Milking yield, percent milk fat and SNF, milking duration and average milk flow were significantly varied by parity, milking time and DIM. Percent milk protein and lactose were varied by parity and DIM, however SCS and average milk flow were affected by parity and milking time. Milking interval significantly affected the consecutive, morning and evening milk yield and average milk flow. However, MUN was not affected by parity, milking time, DIM and milking interval. Milk yield was decreased with increasing parity. Milk yield in the morning was higher than that of in the evening. Milk yield between consecutive milking was not affected by parity, however, affected by milking time. Percent milk Fat, SNF and SCS were higher at in evening milk than those of in morning milk. Milk protein, lactose, SNF, SCS, milking duration and peak milk flow rate were influenced by parity. This study suggested that milk yield variation between consecutive milking, milking flow rate, and milking duration could be important traits for enhancing Holstein cattle productivity however, and more study is needed to estimate genetic parameters for such traits.
https://doi.org/10.5187/JAST.2005.47.6.919 인용 PDF KSCI

Korean Continuous Speech Recognition using Phone Models for Function words (기능어용 음소 모델을 적용한 한국어 연속음성 인식)

명주현;정민화
- Proceedings of the Korean Information Science Society Conference
- /
- 2000.04b
- /
- pp.354-356
- /
- 2000
의사형태소를 디코딩 단위로 한국어 연속 음성 인식에서의 조사, 어미, 접사 및 짧은 용언의 어간등의 단어가 상당수의 인식 오류를 발생시킨다. 이러한 단어들은 발화 지속시간이 매우 짧고 생략이 빈번하며 결합되는 다른 형태소의 형태에 따라서 매우 심한 발음상의 변이를 보인다. 본 논문에서는 이러한 단어들은 한국어 기능어라 정의하고 실제 의사형태소 단위의 인식 실험을 통하여 기능어 집합 1, 2를 규정하였다. 그리고 한국어 기능어에 기능어용 음소를 독립적으로 적용하는 방법을 제안했다. 또한 기능어용 음소가 분리되어 생기는 음향학적 변이들을 처리하기 위해 Gaussian Mixture 수를 증가시켜 보다 견고한 학습을 수행했고, 기능어들의 음향 모델 스코어가 높아짐에 따른 인식에서의 삽입 오류 증가를 낮추기 위해 언어 모델에 fixed penalty를 부여하였다. 기능어 집합1에 대한 음소 모델을 적용한 경우 전체 문장 인식률은 0.8% 향상되었고 기능어 집합2에 대한 기능어 음소 모델을 적용하였을 때 전체 문장 인식률은 1.4% 증가하였다. 위의 실험 결과를 통하여 한국어 기능어에 대해 새로운 음소를 적용하여 독립적으로 학습하여 인식을 수행하는 것이 효과적임을 확인하였다.
PDF

Stochastic Pronunciation Lexicon Modeling for Large Vocabulary Continous Speech Recognition (확률 발음사전을 이용한 대어휘 연속음성인식)

Yun, Seong-Jin;Choi, Hwan-Jin;Oh, Yung-Hwan
- The Journal of the Acoustical Society of Korea
- /
- v.16 no.2
- /
- pp.49-57
- /
- 1997
In this paper, we propose the stochastic pronunciation lexicon model for large vocabulary continuous speech recognition system. We can regard stochastic lexicon as HMM. This HMM is a stochastic finite state automata consisting of a Markov chain of subword states and each subword state in the baseform has a probability distribution of subword units. In this method, an acoustic representation of a word can be derived automatically from sample sentence utterances and subword unit models. Additionally, the stochastic lexicon is further optimized to the subword model and recognizer. From the experimental result on 3000 word continuous speech recognition, the proposed method reduces word error rate by 23.6% and sentence error rate by 10% compare to methods based on standard phonetic representations of words.
PDF

Search Result 164, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)