Search | Korea Science

Text-to-Speech System Using Logatom (Logatom을 사용한 문서음성변환 시스템)

Cho Kwansun;Lee Chulhee
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.7-10
- /
- 1999
본 논문에서는 logatom 기반 무제한 한국어 TTS 시스템 구현을 제안한다. 이를 위하여 한국어를 대표할 만한 문서코퍼스를 선택하여 분석하고 이를 바탕으로 합성에 필요한 logatom을 설계한다. 일반적으로 음성코퍼스를 통해 음성세그먼트를 추출하여 접속에 기반한 TTS 시스템에서는 음성세그먼트를 의미있는 단어 또 는 어절로부터 추출한다. 하지만 음성세그먼트 추출시 고려되는 사항은 합성단위에 기초한 음소간의 결합형태이므로 본 논문에서는 음성세그먼트 추출을 위하여 무의미한 음소열인 logatom을 설계한다. Logatom은 문장 세그먼트의 어절내 위치와 문서코퍼스 분석 결과 얻어진 음소간의 결합형태를 기반으로 설계된다. 제안된 시스템의 합성음질을 평가하기 위하여 CVC 기반 logatom을 사용하여 임의의 문장을 합성해 본 결과 대부분의 음성세그먼트 접속이 자음에서 이루어지고 어절의 위치를 고려한 logatom 설계로 인하여 어절 내에서는 비교적 자연스러운 합성음을 얻을 수 있었다.
PDF

The Control of Impedance System By A Cone Type Loudspeaker (콘형 스피커의 임피던스 제어시스템)

Ryu Sung-Ho;Lee Baek-Lyeol;Kim Jung-Hwa;Kim Chun-Duck
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.229-232
- /
- 2001
콘형 스피커를 사용한 임피던스 흡음 제어시스템에서는 진동판의 진동속도와 음압 모두를 궤환하는데 이는 궤환 이득이 큰 반면에 궤환 루프상의 요소들 때문에 안정한 동작이 곤란하였다. 이 연구에서는 전기단 접속형 제어시스템과 속도 궤환형 제어시스템의 적용에 대해 그 가능성을 확인하였다.
PDF

A fast POS tagging method for speech synthesis (음성합성을 위한 품사태깅시스템의 속도 개선)

Kim Jeong-se;Park Jun
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.159-162
- /
- 2002
본 논문에서는 음성합성을 위한 의사형태소 품사 태깅 시스템의 속도를 개선하는 방법으로 정확률을 다소 희생하더라도 속도개선이 될 수 있는 방법을 제안하고자 한다. 형태소 해석 시에는 종성으로 올 수 있는 자모를 제외한 나머지에 대해서는 음절단위로 구성하는 변형된 Tabular 파싱법으로 해석하는데, 여기에다 일반적으로 적용 가능한 몇 가지의 규칙을 추가함으로써 해석 가능한 노드들을 줄였다. 태깅 시에는 한국어의 특성상 어절 하나씩을 품사 태깅하였을 경우에도 상당히 정확하다는 점을 이용하여 어절 내부에서는 full search 를 하고 그 다음 어절은 이전 어절의 제일 높은 값을 가지는 품사열 정보를 활용하는 방법을 제안한다. 제안한 시스템은 32 개 품사 태그셋에 2 만 형태소 사전을 이용해 실험한 결과, 기존의 시스템보다 약 $60\%$이상의 속도 개선을 보였으며, 정확률은 약 $1\%$ 정도 떨어졌다.
PDF

Shallow Water Acoustic Communication Channel Characteristic Analysis Using PN Sequence with 25 kHz Carrier at the Shore of Geojea Island (25 kHz 대역에서 PN 신호열을 이용한 거제 천해역 수중음향통신 채널 특성 분석)

Kim, Jae-Gap;Kim, Sea-Moon;Lim, Young-Kon
- The Journal of the Acoustical Society of Korea
- /
- v.26 no.8
- /
- pp.381-389
- /
- 2007
In this paper, the measuring method of underwater acoustic communication channel characteristics in the shallow water using the autocorrelation characteristic of PN sequence and the undorwater communication channel analysis results from the received signal sample data are described. For measuring the underwater acoustic communication channel characteristics, two PN sequences are used as a transmitted data of I-channel and Q-channel of QPSK symbol and QPSK signal is transmitted with symbol rate of 5 kHz and carrier frequency of 25 kHz. In the receiver the received signal, which pass through 675 m and 1492 m, is sampled and then stored. Using the stored sample data, the scattering function, coherent time, delay power profile, spaced-tone autocorrelation function, delay spread, and coherent bandwidth of each propagation distance cases are analyzed. Based on the analysis results, several guidelines are suggested for the design and implementation of underwater transmission system.
https://doi.org/10.7776/ASK.2007.26.8.381 인용 PDF KSCI

Speaker Adaptation for Voice Dialing (음성 다이얼링을 위한 화자적응)

;Chin-Hui Lee
- The Journal of the Acoustical Society of Korea
- /
- v.21 no.5
- /
- pp.455-461
- /
- 2002
This paper presents a method that improves the performance of the personal voice dialling system in which speaker independent phoneme HMM's are used. Since the speaker independent phoneme HMM based voice dialing system uses only the phone transcription of the input sentence, the storage space could be reduced greatly. However, the performance of the system is worse than that of the system which uses the speaker dependent models due to the phone recognition errors generated when the speaker independent models are used. In order to solve this problem, a new method that jointly estimates transformation vectors for the speaker adaptation and transcriptions from training utterances is presented. The biases and transcriptions are estimated iteratively from the training data of each user with maximum likelihood approach to the stochastic matching using speaker-independent phone models. Experimental result shows that the proposed method is superior to the conventional method which used transcriptions only.
PDF KSCI

Measurements of Acoustic Properties of Materials by Spectral Analysis of Ultrasonic Pulses (초음파 펄스의 주파수해석에 의한 재료의 음향특성 측정)

Ha, Kang-Lyeol;Kim, Moo-Joon;Lee, Jong-Kyu;Kim, Sung-Boo;Noriyoshi, Chubachi
- The Journal of the Acoustical Society of Korea
- /
- v.14 no.6
- /
- pp.40-47
- /
- 1995
A system for measurement of ultrasonic velocity, attenuation and complex modulus of materials by using the spectral analysis method of pulses has been constructed and its performances are estimated. The system has a mechanical scanning part of an acoustic microscope with a ZnO plane wave transducer of the resonant frequency of 85MHz. Ultrasonic velocity has been obtained by the intervals of maxima (or minima) on the power spectrum of a pulse train reflected from the surface and bottom of a specimen, and attenuation has been obtained by the power spectra of three pulses reflected from the surface and the bottom of a specimen and the surface of a standard specimen. The measured results for materials such as fused quartz, polyester show that the system has very high accuracy.
PDF

Early Detective Warning System of Fire in the Tunnel Road (도로터널 내 차량사고 화재조기감지 예고 시스템)

Yoon, Sungwook;Kim, Hyenki
- Proceedings of the Korea Contents Association Conference
- /
- 2012.05a
- /
- pp.291-292
- /
- 2012
본 연구는 여러 가지 센서를 이용하여 자동차 전용 도로터널의 차량 사고시의 음향을 인식하여 사고인식률을 높이는 화재 예고 시스템에 관한 연구이다. 현행의 CCTV나 자동화재탐재설비에서 감지하는 열센서나 영상전송자료를 파악하기에 앞서, 이차적 재해 가능성을 유의미한 수준에서 미리 예고하고 대응할 수 있는 사전예고시스템을 구성하였다. 유선설치기반의 센서로 대부분 구성된 도로터널 내에서 비교적 설비가 저렴한 무선센서를 사용함으로서 기존 터널에서의 적용성을 증대시켰다.
PDF

Iterative Polynomial Fitting Technique for the Nonlinear Array Shape Estimation (비선형 선배열 형상 추정을 위한 반복 다항 근사화 기법)

조요한;조치영;서희선
- The Journal of the Acoustical Society of Korea
- /
- v.20 no.8
- /
- pp.74-80
- /
- 2001
Because of ocean waves, swell, steering corrections, etc, the hydrophones of a towed array will not live along a straight line. However the degradation of bearing estimation performance occurs when beamforming is carried out on the hydrophone outputs of an acoustic towed array which is not straight. So it is required to estimate the shape of the array for the improved beamformer output. In this paper, an iterative array shape estimation technique is presented, which is based on the use of the least squares polynomial fitting to the data from heading sensors. The estimation error and the influence of deformations on the performance of the conventional beamformer output are investigated. Finally, the suggested method is applied to the real system in order to investigate the applicability.
PDF

A Study on the Prosody Generation of Korean Sentences using Neural Networks (신경망을 이용한 한국어 운율 발생에 관한 연구)

Lee Il-Goo;Min Kyoung-Joong;Kang Chan-Koo;Lim Un-Cheon
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.65-69
- /
- 1999
합성단위, 합성기, 합성방식 등에 따라 여러 가지 다양한 음성합성시스템이 있으나 순수한 법칙합성 시스템이 아니고 기본 합성단위를 연결하여 합성음을 발생시키는 연결합성 시스템은 연결단위사이의 매끄러운 합성계수의 변화를 구현하지 못해 자연감이 떨어지는 실정이다. 자연음에 존재하는 운율법칙을 정확히 구현하면 합성음의 자연감을 높일 수 있으나 존재하는 모든 운율법칙을 추출하기 위해서는 방대한 분량의 언어자료 구축이 필요하다. 일반 의미 문장으로부터 운율법칙을 추출하는 것이 바람직하겠으나, 모든 운율 현상이 포함된 언어자료는 그 문장 수가 극히 방대하여 처리하기 힘들기 때문에 가능하면 문장 수를 줄이면서 다양한 운율 현상을 포함하는 문장 군을 구축하는 것이 중요하다. 본 논문에서는 음성학적으로 균형 잡힌 고립단어 412 단어를 기반으로 의미문장들을 만들었다. 이들 단어를 각 그룹으로 구분하여 각 그룹에서 추출한 단어들을 조합시켜 의미 문장을 만들도록 하였다. 의미 문장을 만들기 위해 단어 목록에 없는 단어를 첨가하였다. 단어의 문장 내에서의 상대위치에 따른 운율 변화를 살펴보기위해 각 문장의 변형을 만들어 언어자료에 포함시켰다. 자연감을 높이기 위해 구축된 언어자료를 바탕으로 음성데이타베이스를 작성하여 운율분석을 통해 신경망을 훈련시키기 위한 목표패턴을 작성하였다 문장의 음소열을 입력으로 하고 특정음소의 운율정보를 발생시키는 신경망을 구성하여 언어자료를 기반으로 작성한 목표패턴을 이용해 신경망을 훈련시켰다. 신경망의 입력패턴은 문장의 음소열 중 11개 음소열로 구성된다. 이 중 가운데 음소의 운율정보가 출력으로 나타난다. 분절요인에 의한 영향을 고려해주기 위해 전후 5음소를 동시에 입력시키고 문장내에서의 구문론적인 영향을 고려해주기 위해 해당 음소의 문장내에서의 위치, 운율구에 관한 정보등을 신경망의 입력 패턴으로 구성하였다. 특정화자로 하여금 언어자료를 발성하게 한 음성시료의 운율정보를 추출하여 신경망을 훈련시킨 결과 자연음의 운율과 유사한 합성음의 운율을 발생시켰다.
PDF

Prediction of time-series underwater noise data using long short term memory model (Long short term memory 모델을 이용한 시계열 수중 소음 데이터 예측)

Hyesun Lee;Wooyoung Hong;Kookhyun Kim;Keunhwa Lee
- The Journal of the Acoustical Society of Korea
- /
- v.42 no.4
- /
- pp.313-319
- /
- 2023
In this paper, a time series machine learning model, Long Short Term Memory (LSTM), is applied into the bubble flow noise data and the underwater projectile launch noise data to predict missing values of time-series underwater noise data. The former is mixed with bubble noise, flow noise, and fluid-induced interaction noise measured in a pipe and can be classified into three types. The latter is the noise generated when an underwater projectile is ejected from a launch tube and has a characteristic of instantaenous noise. For such types of noise, a data-driven model can be more useful than an analytical model. We constructed an LSTM model with given data and evaluated the model's performance based on the number of hidden units, the number of input sequences, and the decimation factor of signal. It is shown that the optimal LSTM model works well for new data of the same type.
https://doi.org/10.7776/ASK.2023.42.4.313 인용 PDF

Search Result 116, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)