Search | Korea Science

Comparison of Feature Extraction Methods for the Telephone Speech Recognition (전화 음성 인식을 위한 특징 추출 방법 비교)

전원석;신원호;김원구;이충용;윤대희
- The Journal of the Acoustical Society of Korea
- /
- v.17 no.7
- /
- pp.42-49
- /
- 1998
본 논문에서는 전화망 환경에서 음성 인식 성능을 개선하기 위한 특징 벡터 추출 단계에서의 처리 방법들을 연구하였다. 먼저, 고립 단어 인식 시스템에서 채널 왜곡 보상 방 법들을 단어 모델과 문맥 독립 음소 모델에 대하여 인식 실험을 하였다. 켑스트럼 평균 차 감법, RASTA 처리, 켑스트럼-시간 행렬을 실험하였으며, 인식 모델에 따른 각 알고리즘의 성능을 비교하였다. 둘째로, 문맥 독립 음소 모델을 이용한 인식 시스템의 성능 향상을 위하 여 정적 특징 벡터에 대하여 주성분 분석 방법(principal component analysis)과 선형 판별 분석(linear discriminant analysis)과 같은 선형 변환 방법을 적용하여 분별력이 높은 벡터 공간으로 변환함으로써 인식 성능을 향상시켰다. 또한 선형 변환 방법을 켑스트럼 평균 차 감법과 결합하여 더욱 뛰어난 성능을 보여주었다.
PDF

The Comparison of Speaker Adaptation Methods (화자 적응 방법들의 비교)

황영수
- The Journal of the Acoustical Society of Korea
- /
- v.18 no.1
- /
- pp.61-66
- /
- 1999
In this paper, we proposed various speaker adaptation methods and studied the performance of these methods. Methods which were studied in this paper are MAPE(Maximum A Posteriori Probability Estimation), Linear Spectral Estimating, Multi-Layer Perceptron and ARTMAP. In order to evaluate the performance of these methods, we used Korean isolated digits as the experimental data, the hybrid speaker adaptation method, which unified MAPE, linear spectral estimating and output probability of SCHMM, showed the better recognition result than those which performed other methods. And the method using ARTMAP showed the similar result to above hybrid method.
PDF

Reconsideration on the Diffsuse Sound Field (확산음장에 관한 고찰)

강현주
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.06c
- /
- pp.331.2-336
- /
- 1998
In this paper, the validity for the application of the diffuse sound field theoty to the real sound field, especially on the bounding surfaces of the rooms, was reconsidered. The analytical result for directivity pattern on the bounding surfaces of the room was compared with the result of numerical simulations using ray tracing technique. Comparison results show that the distribution of the incident sound energy vs incident angles is approximated to Gaussian distribution, not to the uniform distribution.
PDF

A Study on a comparison and analysis of Speaking rate estimation for adaptive bit rate on CELP vocoder (가변전송률 CELP 부호화기 설계를 위한 발성률 비교 분석에 관한 연구)

Jang KyungA;Min SoYeon;Bae MyungJin
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.105-108
- /
- 2004
음성 부호화 기술은 전송률과 복잡도를 줄이고 음질을 향상시키는 방향으로 진행되고 있다. 현재 상용화되고 있는 CELP형 보코더는 낮은 전송률에 비해 우수한 음질을 제공한다. 본 논문에서는 기존의 방식과 다르게 보코더 단에 입력 음성이 들어가기 앞서 전처리 기법을 수행하는 전처리단을 부가하여 전송률을 낮추는 방법을 소개하고, 소개된 방법들을 각기 비교하고 분석하고자 한다. 전처리기법들을 음성 인식이나 합성에서 사용되는 파라미터들을 적용시켰으며, 처리시간이나 계산시간에 있어 기존의 방식에서 많은 영향을 미치지 않은 간단한 알고리즘으로 구현하였다. 소개하는 전처리단에서는 기존의 코딩방식에서 사용하지 않은 파라미터들, 발성율, 지속시간, PSOLA 방식들을 이용하였다.
PDF

A comparison of commercial software for the sound quality analysis (음질 분석용 상용 소프트웨어 비교)

Shin Sung-Hwan;Ih Jeong-Guon
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.215-218
- /
- 2004
제품 발생 소음의 음질 (sound qualify)에 대한 관심이 높아지면서, 음질 제어가 제품 경쟁력 향상에 중요한 고려 요소가 되었다. 이러한 음질에 대한 객관적인 분석을 위하여 다수의 상용 프로그램이 개발되고 있지만 라우드니스 (loudness)를 제외한 음질인자 (sound quality metrics)들은 아직 표준화 작업이 이루어지지 않았기 때문에 프로그램에 따라 차이가 발생하고 있다. 본 연구에서는 음질 분석을 위한 4 개의 상용 프로그램을 이용하여 일정 신호의 라우드니스, 샤프니스 (sharpness), 러프니스(roughness), 변동강도 (fluctuation strength)를 계산하고, 그 결과를 비교, 분석하였고, 이미 표준화 된 라우드니스를 포함한 음질인자들의 계산 결과는 프로그램에 따라 무시할 수 없는 차이가 나타남을 확인하였다. 이는 제품의 음질 평가 시 사용된 분석 프로그램 따라 그 결과가 다를 수 있음을 의미한다. 본 연구를 통해서 얻은 결과는 향후 음질 인자의 표준화 및 음질 지수 개발에 중요한 자료로 사용 될 수 있다.
PDF

Performance Comparison of Speech Recognition Using Body-conducted Signals in Noisy Environment (소음 환경에서 body-conducted 신호를 이용한 음성인식 성능 비교)

Choi Dae-Lim;Lee Kwang-Hyun;Lee Yong-Ju;Kim Chong-Kyo
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.57-60
- /
- 2004
본 논문에서는 음성정보기술산업지원센터(SiTEC)에서 현재 배포중인 고소음 환경 음성 DB를 이용하여 air-conducted 음성과 body-conducted 음성의 인식 성능을 비교 실험하였다. 소음 환경에서 일반적인 마이크로폰으로부터 수집된 air-conducted 음성은 잡음의 영향을 받기 쉬우며 이는 인식률을 저하시킨다. 반면에 진동 픽업 마이크로폰에서 수집된 body-conducted 음성은 소음에 보다 강인한 특성을 보인다. 이러한 특성에 근거하여 소음 환경에서 일반 다이나믹 마이크로폰 음성에 음질 개선 방법과 채널 보상 방법을 적용한 인식 결과와 3종류의 진동 픽업 마이크로폰에서 수집된 음성과의 인식 성능을 비교 분석하여 body-conducted 음성 인식 시스템의 환용 가능성을 살펴보았다.
PDF

Comparison of the Speech Recognition Performance based upon the Recurrent Structure of the Multilayered Recurrent Neural Network (다층회귀신경망의 회귀구조에 따른 음성인식성능 비교)

어태경
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.06e
- /
- pp.357-360
- /
- 1998
4층구조인 다층퍼셉트론으로부터 입력층을 제외한 각 측의 출력성분을 하위은닉층으로 귀환하는 3모델의 다층회귀신경망을 구성하고, 각 모델별 망의 크기에 따른 음성인식성능을 분석 비교한다. 과거의 입력신호를 출력층에서 예측하여 오차신호를 계산하고, 이 오차신호가 최소화하는 방향으로 연결세기를 조정한다. 실험결과 3회귀모델중 상위은닉층의 회귀연결방식이 가장 양호한 인식율을 나타내었으며, 각 망 공히 상, 히위은닉층의 뉴런수 10, 15개, 예측차수 3, 4차 일 때 인식성능이 양호하였다. 그리고 회귀신경망이 비회귀신경망에 비해 인식율이 크게 향상된다는 것을 확인 할 수 있었다.
PDF

Performance Comparison of Acoustic Modeling Technique (음소 모델링 방식들의 성능 비교)

송명규
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.06e
- /
- pp.377-380
- /
- 1998
HMM 기반의 음성 인식기를 구현하는데 있어서 모델의 복잡도와 제한된 훈련 데이터 사이의 균형을 유지하는 것은 중요한 문제이다. 중간규모 또는 대용량 어휘 인식 시스템은 정교한 모델을 얻기 위해서 문맥종속 음소 모델링이 필수적이다. 그러나, 제한된 훈련 데이터로는 발생 가능한 모든 context를 포함하기가 어렵고, 더구나 훈련 데이터에서 관찰된 context중에서도 그 관찰빈도가 낮은 것이 많아서 신뢰성 있는 문맥종속 모델들을 얻기에는 여전히 어려움이 따른다. 또한 경우에 따라서는 계산량의 감축을 위하여 모델 규모를 축소시킬 필요도 생긴다. 이러한 문제를 해결하기 위해 본 논문에서는 unit reduction 방법들과 state tying을 이용한 방법들의 성능을 실험을 통해 비교한다. 고립단어 인식 실험결과 state tying을 이용한 방법이 unit reduction에 비하여 우수함을 확인 할 수 있었다.
PDF

SEM Observation for the Damage of Inner Hair Cell Stereocilia of Guinea Pig Cochlea after Loud Tone Exposure (격음노출 후 기니픽 달팽이관 내유모세포 부동섬모에 관한 SEM(전자투사식현미경) 관측)

Jarng Soon Suck
- The Journal of the Acoustical Society of Korea
- /
- v.24 no.1E
- /
- pp.1-6
- /
- 2005
The inner hair cell stereocilia of the guinea pig cochlea was examined under a scanning electron microscope (SEM) after loud tone exposure onto the ear drum of the animal. Before and after guinea pigs were exposed to intensive and continuous tone such as 106 dB SPL in intensity, the functioning of the cochlea was monitored by N1-N2 audiograms. The structural damage of the stereocilia of inner hair cells (IHCs) and outer hair cells (OHCs) was examined using the SEM in x 1500 magnification. The comparison between the functional change of the cochlea and the structural damage of the IHC stereocilia is done by means of photographic observation. It can be shown that the functional change might be related to the structural damage of the IHC stereocilia after intensive acoustic trauma.
PDF KSCI

A Study on the Elastic Wave Velocity of Magnetostrictive Materials (자왜 재료의 탄성파 속도에 관한 연구)

강국진;노용래
- The Journal of the Acoustical Society of Korea
- /
- v.20 no.4
- /
- pp.54-61
- /
- 2001
Magnetostrictive materials have nonlinear elasto-magnetic properties. However the constitutive equations to describe the nonlinear properties are not available, yet. In this study we develope the equation in magnetostrictive materials by use of piezomagnetic constitutive equation which is quasi-linearized. With the wave equation, we determine the propagation velocity inside the magnetostrictive materials when a plane wave propagates along a given magnetic field. Validity of the calculated velocity is verified through comparison with experimental velocity measurement results for the most representative magnetostrictive materials. Terfenol-D.
PDF

Search Result 255, Processing Time 0.019 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)