Search | Korea Science

Current Status of Speech Database at ETRI (ETRI의 음성데이타베이스 구축현황)

이영직
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1995.06a
- /
- pp.265-271
- /
- 1995
한국전자통신연구소의 음성 데이터베이스 구축 현황을 기술한다. 현재 한국전자통신연구소에서는 음성인식 연구를 위해 단어음성, 정형 문장 음성 등의 데이터베이스를 구축, 보유하고 있다. 음성인식용 데이터베이스는 정해진 단어, 분장을 20명 내지 100명이 발성한 것으로, 일부는 음소 단위까지 레이블링이 되어 있다. 또 음성합성 연구를 위해 합성단위 및 운율데이타베이스를 가지고 있는데 이는 한 명 혹은 남녀 각각 3명이 발성한 것으로, 일부는 피치 등이 수록되어 있다. 문장 데이터베이스는 언어 정보처리를 위해 교재, 문학, 경제, 과학 분야의 문장을 총 480만 어절 가지고 있으며, 이 중 일부에 품사 정보를 추가하였다. 한국전자통신연구소는 국내 음성 연구의 발전에 기여하고자 음성 연구의 기반 자료가 되는 음성 데이터베이스를 국내 대학 및 산업체에 배포하고 있다 음성 데이터베이스는 음성 연구의 기반이 되는 자료임에도 불구하고 많은 비용과 노력이 들어 일반 대학에서는 쉽게 만들 수 없었다. 이에 ETRI는 한국통신이 출연한 "자동통역 요소기술개발" 과제으 LQNTKSANF인 여러 종류의 음성 데이터베이스와 관련 프로그램을 공급하여 국내 음성 연구의 기반 확립에 기여하고자 한다. 기여하고자 한다.
PDF

The Extraction of Effective Index Database from Voice Database and Information Retrieval (음성 데이터베이스로부터의 효율적인 색인데이터베이스 구축과 정보검색)

Park Mi-Sung
- Journal of Korean Library and Information Science Society
- /
- v.35 no.3
- /
- pp.271-291
- /
- 2004
Such information services source like digital library has been asked information services of atypical multimedia database like image, voice, VOD/AOD. Examined in this study are suggestions such as word-phrase generator, syllable recoverer, morphological analyzer, corrector for voice processing. Suggested voice processing technique transform voice database into tort database, then extract index database from text database. On top of this, the study suggest a information retrieval model to use in extracted index database, voice full-text information retrieval.
PDF

On the Present Construction Status of Speech Databases at KAIST Communications Research Laboratory (KAIST 통신연구실의 음성 데이터베이스 구축 현황)

최인정
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1995.06a
- /
- pp.272-275
- /
- 1995
한국과학기술원 통신연구실에서 진행중인 한국어 음성 데이터베이스의 개발 현황에 관하여 기술한다. 음성데이타베이스의 구축을 위하여 사용된 절차와 환경, 및 데이터베이스의 음성학적, 언어학적 성질들이 상세히 기술된다. 데이터베이스는 damtjddlstlr 알고리듬의 개발 및 평가를 위하여 사용되도록 고안되었다. 데이터베이스는 5종류의 음성 데이터, 즉 3천단어 규모의 무역관련 연속음성, 가변길이 연결 숫자음, phonembalanced 75 고립단어, 지역명 관련 500 고립단어, 한국어 아-세트로 구성되어 있다.
PDF

음성 데이터베이스의 연구동향

Jeong, Yu-Hyeon
- Electronics and Telecommunications Trends
- /
- v.7 no.3
- /
- pp.1-11
- /
- 1992
음성정보처리 연구에 있어서 기본적인 연구 도구인 동시에 개발 내용의 객관적인 평가 기준이 되는 음성 데이터베이스에 관하여 음성 데이터베이스의 개요, 국내외 연구동향, 연구 과제 등에 관하여 기술하였다.
https://doi.org/10.22648/ETRI.1992.J.070301 인용 PDF

A Korean Speech Database for Use in Automatic Translation (자동통역용 한국어 음성 데이터베이스)

최인정
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1994.06c
- /
- pp.287-290
- /
- 1994
음성 인식 시스템의 개발을 위해서는 음성 데이터베이스구축이 중요한 과제의 하나로서, 많은 시간과 노력이 요구된다. 개별적인 음성데이타베이스 구축에 따른 중복 투자를 줄이고 다양한 인식 알고리듬의 성능 비교와 국내 음성 인식 기술의 발전을 위해서는 벤치마크 시험을 위한 공통의 음성 데이터베이스가 필수적이다. 본 논문에서는 한국과학기술원 통신연구실에서 제작한 한국어 음성 데이터베이스에 관하여 기술한다. KAIST 음성데이타베이스는 자동통ㅇ역을 N이한 무역 상담과 관련되 3,000 단어 규모의 연속어를 비롯하여, 가변 길이 연결 숫자음, phoneme-balanced 75 고립단어, 지역명 관련 500 고립단어, 한국어 아-세트로 구성되어 있다. 이 음성 데이터베이스의 구축을 위하여 사용된 태스크선정 절차, 녹음 방법, 규격, 및 기대효과 등 세부사항을 기술한다.
PDF

연구 및 평가용 음성 데이터베이스의 개발동향과 제안

Lee, Yong-Ju;Kim, Gyeong-Tae
- ETRI Journal
- /
- v.8 no.3
- /
- pp.24-28
- /
- 1986
음성인식 기술개발을 위해서는 음성 데이터베이스가 필요하다. 본고에서는 음성 입출력기술 표준화의 일환으로서의 공통음성 데이터 제정에 관한 각국의 현황을 소개하였고, 우리말을 대상으로 한 음운특성 연구용의 데이터베이스를 확보하기 위한 고려사항을 검토하였다.
PDF

Comparison of Adult and Child's Speech Recognition of Korean (한국어에서의 성인과 유아의 음성 인식 비교)

Yoo, Jae-Kwon;Lee, Kyoung-Mi
- The Journal of the Korea Contents Association
- /
- v.11 no.5
- /
- pp.138-147
- /
- 2011
While most Korean speech databases are developed for adults' speech, not for children's speech, there are various children's speech databases based on other languages. Because there are wide differences between children's and adults' speech in acoustic and linguistic characteristics, the children's speech database needs to be developed. In this paper, to find the differences between them in Korean, we built speech recognizers using HMM and tested them according to gender, age, and the presence of VTLN(Vocal Tract Length Normalization). This paper shows the speech recognizer made by children's speech has a much higher recognition rate than that made by adults' speech and using VTLN helps to improve the recognition rate in Korean.
https://doi.org/10.5392/JKCA.2011.11.5.138 인용 PDF KSCI

Dialogic Male Voice Triphone DB Construction (남성 음성 triphone DB 구축에 관한 연구)

Kim, Yu-Jin;Baek, Sang-Hoon;Han, Min-Soo;Chung, Jae-Ho
- The Journal of the Acoustical Society of Korea
- /
- v.15 no.2
- /
- pp.61-71
- /
- 1996
In this paper, dialogic triphone data base construction for triphone synthesis system is discussed. Particularly, in this work, dialogic speech data is collected from the broadcast media, and three different transcription steps are taken. Total 10 hours of speech data are collected. Among them, six hours of speech data are used for the triphone data base construction, and the rest four hours of data are reserved. Dialogic speech data base construction is far different from the reciting speech data base construction. This paper describes various steps that necessary for the dialogic triphone data base construction from collecting speech data to triphone unit labeling.
PDF

Speech Data Collection for korean Speech Recognition (한국어 음성인식을 위한 음성 데이터 수집)

Park, Jong-Ryeal;Kwon, Oh-Wook;Kim, Do-Yeong;Choi, In-Jeong;Jeong, Ho-Young;Un, Chong-Kwan
- The Journal of the Acoustical Society of Korea
- /
- v.14 no.4
- /
- pp.74-81
- /
- 1995
This paper describes the development of speech databases for the Korean language which were constructed at Communications Research Laboratory in KAIST. The procedure and environment to construct the speech database are presented in detail, and the phonetic and linguistic properties of the databases are presented. the databases were intended for use in designing and evaluating speech recognition algorithms. The databases consist of five different sets of speech contents : trade-related continuous speech with 3,000 words, variable-length connected digits, phoneme-balanced 75 isolated words, 500 isolated Korean provincial names, and Korean A-set words.
PDF

Speech Data Base Construction at LG Electronics (LG 전자의 음성 DB 구축 현황)

김락용
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1995.06a
- /
- pp.280-282
- /
- 1995
음성인식시스템 개발을 위해서는 음성 데이터베이스 구축이 필요하며 이를 위해 LG 전자에서 구축한 두 가지 데이터베이스에 관해서 기술한다. LG 전자에서 보유한 음성 데이터 베이스는 차량 및 전화 선로상에 존재하는 잡음이 포함된 상태에서 수집한 숫자음과 제어 단어로 이루어져 있으며 마이크와 핸드셋을 통과한 음서잉 사용되었으며, 화자 독립 음성 인식을 위한 400-500 명분의 화자로 구성되어 있다.
PDF

Search Result 270, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)