Integrated Search | Korea Science

Introduction of ETRI Broadcast News Speech Recognition System

박준
- 대한음성학회:학술대회논문집 / 대한음성학회 2006 Spring Conference Proceedings / pp.89-93 / 2006
This paper presents the ETRI broadcast news speech recognition system. Broadcast news speech recognition raises two major issues: 1) real-time processing and 2) out-of-vocabulary handling. For real-time processing, we devised a dual decoder architecture. The input speech signal is segmented at long pauses between utterances, and the two decoders process the speech segments alternately. One decoder can start recognizing the current speech segment without waiting for the other decoder to finish recognizing the previous one, so processing delay does not accumulate. For out-of-vocabulary handling, we updated both the vocabulary and the language model based on recent news articles on the internet. Updating the language model as well as the vocabulary improves performance by up to 17.2% ERR.
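The alternating hand-off between two decoders can be illustrated with a minimal sketch. It is not the ETRI implementation: `fake_decode`, `decoder_worker`, and `recognize_stream` are hypothetical names, the "recognizer" just upper-cases text after a short sleep, and the segments are assumed to have already been split at long pauses.

```python
import queue
import threading
import time


def fake_decode(segment):
    """Stand-in for a real recognizer (hypothetical); sleeps to mimic decoding cost."""
    time.sleep(0.001 * len(segment))
    return segment.upper()


def decoder_worker(inbox, results):
    """Decode segments from this decoder's own queue until a None sentinel arrives."""
    while True:
        item = inbox.get()
        if item is None:
            break
        index, segment = item
        # Each index is written by exactly one worker, so the writes never collide.
        results[index] = fake_decode(segment)


def recognize_stream(segments, num_decoders=2):
    """Hand pause-delimited segments to the decoders in alternation."""
    results = {}
    inboxes = [queue.Queue() for _ in range(num_decoders)]
    workers = [
        threading.Thread(target=decoder_worker, args=(inbox, results))
        for inbox in inboxes
    ]
    for worker in workers:
        worker.start()
    for index, segment in enumerate(segments):
        # Segment 0 goes to decoder 0, segment 1 to decoder 1, segment 2 to decoder 0, ...
        inboxes[index % num_decoders].put((index, segment))
    for inbox in inboxes:
        inbox.put(None)  # tell each decoder that no more segments are coming
    for worker in workers:
        worker.join()
    # Reassemble the transcripts in the original segment order.
    return [results[i] for i in range(len(segments))]


if __name__ == "__main__":
    print(recognize_stream(["good evening", "top story", "weather"]))
```

Because each decoder has its own queue, a slow segment on one decoder does not block the next segment from starting on the other, which is the property the abstract attributes to the dual decoder design.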

Effects of phonological awareness and phonological processing on language skills in 4- to 6-year-old children with and without language delay

김신영;손진경;임동선
- 말소리와 음성과학 / Vol. 12, No. 1 / pp.51-63 / 2020
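Phonological awareness is a metalinguistic awareness skill in the phonological domain and is known to predict language skills such as reading and vocabulary. This study examined the relationships among phonological awareness, other phonological processing skills, and language skills by comparing typically developing children with children with language delay. To assess phonological awareness, syllable counting, syllable deletion, and syllable discrimination tasks were administered to 4- to 6-year-old children with language delay (n=15) and typically developing children (n=18). We also analyzed how phonological awareness correlated with two phonological processing tasks (nonword repetition and backward digit recall) and with receptive vocabulary, expressive vocabulary, and grammaticality judgment tasks, and examined which phonological awareness subtasks predict language skills. Among the phonological awareness subtasks, group differences were significant for syllable deletion and syllable discrimination, but not for syllable counting. In the typically developing group, significant correlations were found between syllable deletion and backward digit recall, and between syllable discrimination and receptive vocabulary; in the language delay group, syllable counting correlated significantly with backward digit recall, receptive vocabulary, expressive vocabulary, and grammaticality judgment. Stepwise multiple regression showed that syllable discrimination significantly predicted receptive vocabulary and grammaticality judgment in the typically developing group, whereas syllable counting significantly predicted receptive vocabulary, expressive vocabulary, and grammaticality judgment in the language delay group. The language delay group performed worse than the typically developing group on the syllable-level phonological awareness tasks other than syllable counting, and this pattern was reflected in the correlation and regression results. The finding that phonological awareness performance significantly predicted language skills in each group points to the importance of metalinguistic awareness in the phonological domain.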
https://doi.org/10.13064/KSSS.2020.12.1.051

An Energy-Efficient Matching Accelerator Using Matching Prediction for Mobile Object Recognition

Choi, Seongrim;Lee, Hwanyong;Nam, Byeong-Gyu
- JSTS:Journal of Semiconductor Technology and Science / Vol. 16, No. 2 / pp.251-254 / 2016
An energy-efficient object matching accelerator based on a matching prediction scheme is proposed for mobile object recognition. Conventionally, a vocabulary tree has been used to save external memory bandwidth in the object matching process, but it involves massive internal memory transactions to examine each object in a database. In this paper, a novel object matching accelerator is proposed that uses matching predictions to reduce unnecessary internal memory transactions by mitigating non-target object examinations, thereby improving energy efficiency. Experimental results show a 26% reduction in power-delay product compared to the prior art.
https://doi.org/10.5573/JSTS.2016.16.2.251

A Study on the Effect of Cooking Activity as a Language Intervention on the Language Development of Language-Delayed Infants

서의정;김윤희
- 한국산학기술학회논문지 / Vol. 17, No. 10 / pp.109-118 / 2016
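This study aims to examine how language intervention through cooking activities improves the language skills of young children with language delay, and to inform effective teaching, learning, and environmental design in the field. The participants were three 3-year-old children (two boys, one girl) enrolled at E Child Development Center in Seoul; a topic was selected for each in consideration of development at each age, and language intervention through cooking activities was carried out. The intervention was conducted once a week for 50 minutes per session, for a total of 25 sessions, and the vocabulary was chosen so that cooking verbs and nouns were evenly distributed. For data analysis, the picture vocabulary test (PPVT-R) and the Preschool Receptive-Expressive Language Scale (PRES), which measures receptive language (RLA) and expressive language (ELA) development, were administered before and after the intervention. The results are summarized as follows. After the intervention, the children's vocabulary, receptive language, expressive language, and combined language all reached the level of typical language development. These results suggest that language intervention through cooking activities has a positive effect on 3-year-old children with language delay. Cooking activities can therefore elicit children's active participation and interest, and can enhance language skills through varied experiences.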
https://doi.org/10.5762/KAIS.2016.17.10.109

Implementation of HMM-Based Speech Recognizer Using TMS320C6711 DSP

Bae Hyojoon;Jung Sungyun;Bae Keunsung
- 대한음성학회지:말소리 / No. 52 / pp.111-120 / 2004
This paper focuses on the DSP implementation of an HMM-based speech recognizer that can handle a vocabulary of several hundred words as well as speaker independence. First, we develop an HMM-based speech recognition system on the PC that operates on a frame basis, with parallel processing of feature extraction and Viterbi decoding to make the processing delay as small as possible. Techniques such as linear discriminant analysis, state-based Gaussian selection, and the phonetic tied mixture model are employed to reduce the computational burden and memory size. The system is then optimized and compiled on the TMS320C6711 DSP for real-time operation. The implemented system uses 486 kbytes of memory for data and acoustic models, and 24.5 kbytes for program code. A maximum required time of 29.2 ms for processing a 32 ms frame of speech validates real-time operation of the implemented system.
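The real-time claim can be checked with a short calculation. The sketch below assumes that one frame's worth of new speech arrives every 32 ms (the abstract does not state the frame shift) and that 29.2 ms is the worst-case processing time per frame; the function names are illustrative only.

```python
def real_time_factor(processing_ms, frame_ms):
    """Ratio of processing time to audio duration; values at or below 1.0 keep up with the input."""
    return processing_ms / frame_ms


def is_real_time(processing_ms, frame_ms):
    """True when the worst-case frame is processed before the next frame is due."""
    return real_time_factor(processing_ms, frame_ms) <= 1.0


if __name__ == "__main__":
    worst_case_ms = 29.2  # maximum processing time reported in the abstract
    frame_ms = 32.0       # frame duration reported in the abstract
    print(f"real-time factor: {real_time_factor(worst_case_ms, frame_ms):.4f}")
    print(f"headroom per frame: {frame_ms - worst_case_ms:.1f} ms")
    print(f"real time: {is_real_time(worst_case_ms, frame_ms)}")
```

Under these assumptions the real-time factor is 29.2 / 32 ≈ 0.91, leaving about 2.8 ms of headroom per frame.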

Development and effects of Nanta program using speech rhythm for children with limited speech sound production

박영혜;최성희
- 말소리와 음성과학 / Vol. 13, No. 2 / pp.67-76 / 2021
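Nanta means "beating" on percussion instruments such as drums, and is a rhythm of samulnori, traditional Korean music. A Nanta program was developed and applied for children with limited speech sound production. This study also provides evidence for the effectiveness of a Nanta program that uses speech rhythm. The Nanta speech rhythm intervention program was developed using speech rhythm. It provided auditory stimulation, various sounds, beats, and rhythm, and consisted of three stages performed with rhythm: respiration, phonation, and articulation. Six children with limited speech sound inventories participated in the study. The children were encouraged to explore sounds and beats and to express them freely. They were also encouraged to produce a variety of speech sounds by imitating words with rhythm and by lengthening syllables in the imitated words. A total of 15 sessions were conducted, twice a week for 40 minutes per session. To examine the effect of the intervention, scores on the Preschool Receptive-Expressive Language Scale (PRES) and the Receptive and Expressive Vocabulary Test (REVT) were compared before and after treatment. A Wilcoxon rank test showed that, after the intervention, receptive language (p=.027) and expressive language (p=.024) scores on the PRES, and receptive vocabulary (p=.028) and expressive vocabulary (p=.028) scores, improved significantly. The Nanta rhythm control program had a substantial positive effect on receptive and expressive vocabulary and on language development. These findings suggest that a rhythm control program may be useful for improving language development and vocabulary in children with limited speech sound production.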
https://doi.org/10.13064/KSSS.2021.13.2.067

A Study on the Diphone Recognition of Korean Connected Words and Eojeol Reconstruction

김경선;정홍
- 한국음향학회지 / Vol. 14, No. 4 / pp.46-63 / 1995
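This paper describes an unlimited-vocabulary Korean connected-word recognition system using time-delay neural networks. The recognition unit is the diphone, which includes the transition between two adjacent phonemes; there are 329 diphones. The Korean connected-word recognition process is divided into three stages: feature extraction from the speech signal, diphone recognition, and post-processing. In the feature extraction stage, the diphone segments of the input speech are separated and 16th-order filter-bank coefficients are computed. Diphone recognition is organized as a three-level hierarchy and uses a total of 30 time-delay neural networks to recognize diphones. In particular, the time-delay neural networks used here modify the conventional time-delay neural network structure to raise the recognition rate. The post-processing stage consists of correcting diphone misrecognitions using phoneme transition probabilities and phoneme confusion probabilities, and combining the recognized diphones to form eojeols.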
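The final combining step can be pictured with a small sketch. It covers only the chaining of overlapping diphones into a phoneme sequence and leaves out the probability-based error correction the abstract mentions. Representing a diphone as a `(left, right)` tuple and the name `merge_diphones` are assumptions made for illustration, not details from the paper.

```python
def merge_diphones(diphones):
    """Chain (left, right) phoneme pairs whose neighbours overlap into one phoneme list."""
    if not diphones:
        return []
    phonemes = list(diphones[0])
    for left, right in diphones[1:]:
        # Adjacent diphones must share a phoneme: the right half of one
        # is the left half of the next.
        if left != phonemes[-1]:
            raise ValueError(f"diphone ({left}, {right}) does not continue {phonemes[-1]!r}")
        phonemes.append(right)
    return phonemes


if __name__ == "__main__":
    # Romanized phonemes are used here purely for readability.
    print(merge_diphones([("h", "a"), ("a", "n"), ("n", "g")]))
```

A mismatch between neighbouring diphones raises an error in this sketch; a real system would more plausibly treat it as a misrecognition to be corrected.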

Korean Speech Recognition Based on Syllable

이영호;정홍
- 전자공학회논문지B / Vol. 31B, No. 1 / pp.11-22 / 1994
For conventional word-based systems, it is very difficult to enlarge the vocabulary. To cope with this problem, we must use more fundamental units of speech, such as syllables and phonemes. Korean speech consists of initial consonants, middle vowels, and final consonants, and has the characteristic that syllables can be obtained from speech easily. In this paper, we present a speech recognition system that takes advantage of the syllable characteristics peculiar to Korean speech. The recognition algorithm is the Time Delay Neural Network. To recognize many recognition units, the system consists of neural networks for recognizing initial consonants, middle vowels, and final consonants. The system first recognizes initial consonants, middle vowels, and final consonants, and then uses these results to recognize isolated words. In experiments, we obtained recognition rates of 85.12% for 2735 initial-consonant samples, 86.95% for 3110 middle-vowel samples, 90.58% for 1615 final-consonant samples, and 71.2% for 250 isolated-word samples.
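The initial/middle/final structure of a Korean syllable is also how Unicode encodes Hangul, which gives a compact way to see how three separate recognition results could be joined into one syllable. The sketch below uses the standard Unicode composition formula; it is not taken from the paper, and `compose_syllable` is a hypothetical helper name.

```python
# Jamo tables in Unicode order: 19 initial consonants, 21 middle vowels,
# and 28 final slots (the first slot means "no final consonant").
INITIALS = list("ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ")
MEDIALS = list("ㅏㅐㅑㅒㅓㅔㅕㅖㅗㅘㅙㅚㅛㅜㅝㅞㅟㅠㅡㅢㅣ")
FINALS = [""] + list("ㄱㄲㄳㄴㄵㄶㄷㄹㄺㄻㄼㄽㄾㄿㅀㅁㅂㅄㅅㅆㅇㅈㅊㅋㅌㅍㅎ")

HANGUL_BASE = 0xAC00  # code point of the first precomposed syllable, "가"


def compose_syllable(initial, medial, final=""):
    """Join an initial consonant, a middle vowel, and an optional final consonant."""
    i = INITIALS.index(initial)
    m = MEDIALS.index(medial)
    f = FINALS.index(final)
    # Unicode lays syllables out as a 19 x 21 x 28 grid starting at HANGUL_BASE.
    return chr(HANGUL_BASE + (i * len(MEDIALS) + m) * len(FINALS) + f)


if __name__ == "__main__":
    print(compose_syllable("ㅎ", "ㅏ", "ㄴ"))
    print(compose_syllable("ㄱ", "ㅏ"))
```

With three independent classifiers, as in the abstract, each output would index one of these tables; an unknown jamo raises `ValueError` from `list.index`.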

Implementation of HMM-Based Speech Recognizer Using TMS320C6711 DSP

Bae Hyojoon;Jung Sungyun;Son Jongmok;Kwon Hongseok;Kim Siho;Bae Keunsung
- 대한전자공학회:학술대회논문집 / 대한전자공학회 2004 ICEIC The International Conference on Electronics Informations and Communications / pp.391-394 / 2004
This paper focuses on the DSP implementation of an HMM-based speech recognizer that can handle a vocabulary of several hundred words as well as speaker independence. First, we develop an HMM-based speech recognition system on the PC that operates on a frame basis, with parallel processing of feature extraction and Viterbi decoding to make the processing delay as small as possible. Techniques such as linear discriminant analysis, state-based Gaussian selection, and the phonetic tied mixture model are employed to reduce the computational burden and memory size. The system is then optimized and compiled on the TMS320C6711 DSP for real-time operation. The implemented system uses 486 kbytes of memory for data and acoustic models, and 24.5 kbytes for program code. A maximum required time of 29.2 ms for processing a 32 ms frame of speech validates real-time operation of the implemented system.
