Search | Korea Science

Collection, Analysis and Classification of Pathological Voice from ARS using Neural Network (ARS와 신경회로망을 이용한 장애음성의 수집, 분석 및 식별에 관한 연구)

김광인;조철우;김대현;왕수건;전계록;안시훈;김기련;김용주
- Proceedings of the IEEK Conference
- /
- 2000.09a
- /
- pp.955-958
- /
- 2000
본 논문은 음성신호를 이용해 성대의 질환이 있는 환자를 진단하고 병명을 판별하게끔 유도하는 자동 진단 시스템을 개발하기 위한 연구의 일부로, 그중 ARS를 이용하여 환자의 음성을 수집, 분석, 식별의 실험에 대한 연구이다. 본 연구 팀에서는 이미 CSL을 이용한 장애음성 데이터의 수집과 식별에 관한 연구 결과를 발표한바 있다. 하지만 선행연구에서는 방음실에서 디지털 녹음기를 이용하여 수집한 음성을 사용했기 때문에, ARS를 통하여 녹음한 음성과는 샘플링 주파수나 대역폭, 잡음성분등의 데이터의 특성이 상당한 차이가 있다. 이러한 이유로 ARS를 통하여 녹음한 음성에 보다 적합한 파라미터 분석프로그램을 작성하여 파라미터를 구하였다. 이 파라미터들은 Kay사의 MDVP를 기초로하여 작성하였고, 대부분 80%정도의 신뢰성을 가졌다. 수집한 음성의 식별은 정상음성과 양성음성의 두가지 경우로 분리하였다. 식별기법으로는 신경망을 이용하였고, 식별파라미터는 구한 파라미터중 6개의 파라미터를 선별하여 식별한 결과 약 90%정도의 식별율을 가졌다.
PDF

Classification of Pathological Speech Signals Using Wavelet Transform and Neural Network (Wavelet 변환과 신경회로망을 이용한 후두의 양성종양의 식별에 관한 연구)

김대현
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.06e
- /
- pp.395-398
- /
- 1998
본 논문에서는 웨이브렛 변환에서 구해진 파라미터와 신경회로망을 이용하여 후두의 양성종양과 정상상태를 구분하는 실험을 행하였다. 식별 파라미터로는 웨이브렛변환으로부터 도출된 ECS 파라미터와 jitter, shimmer를 이용하였으며 신경회로망은 한 개의 은닉층을 갖는 다층구조 신경망을 이용하였다. 신경망의 입력으로는 세가지 파라미터의 조합을 두 개 또는 세 개를 입력하여 각각의 경우의 식별율을 조사하였다. 실험결과 75%에서 93%에 이르는 식별율을 얻었다.
PDF

Performance Comparison by Characteristic Parameter of Speaker Identification System using Neural Networks (신경회로망을 이용한 화자식별 시스템의 특징 파라미터에 따른 성능비교)

정재룡;유재훈;배현;전병희;김성신
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 2002.12a
- /
- pp.345-348
- /
- 2002
음성인식 기술은 크게 음성인식과 화자인식 기술의 두 가지로 분류된다. 현재는 음성인식 기술이 널리 연구되고 있지만 점차 화자인식 기술의 중요성이 대두되고 있다. 본 논문에서는 화자인식 기술의 한 가지 분류로 임의 화자를 식별하기 위한 화자식별 기술을 연구 대상으로 하고 있으며, 신경회로망을 이용한 화자식별 시스템의 특징 추출 방법을 제시하고 그에 따른 성능을 비교하고 있다. 식별 단계에서 26명의 78개의 음성 샘플을 신경회로망의 역전파 알고리듬을 이용하여 학습하고, 테스트용으로 한 화자의 음성샘플이 사용되어 식별된다. 신경회로망의 입력 변수는 특징 파라미터로 선형예측계수, Mel-주파수 켑스트럼계수와 웨이블릿을 이용한 켑스트럼 계수를 사용하였다. 그 결과로써 화자식별 시스템의 신경회로망 모델2의 입력으로 혼합된 특징 파라미터를 사용한 경우가 다른 파라미터들을 사용한 경우와 비교하여 8.46~21.53%의 차를 가지고 가장 좋은 성능을 나타내었다.

Discrimination Between Natural and Artificial Seismic Sounds by Using 20 MSVQ Algorithm (20 MSVQ 알고리즘을 이용한 자연 및 인공 지진음 식별)

Yoon, Sang-Hoon;Song, Young-Hwan;Bae, Myung-Jin
- The Journal of the Acoustical Society of Korea
- /
- v.28 no.3
- /
- pp.251-259
- /
- 2009
This paper proposes an identification technique to discriminate natural and artificial seismic sounds by using the 20 MSVQ algorithm with the data measured by using a hydrophone. Spectrum band energy and MFCC were used as representative parameters for sake of discriminating natural and artificial seismic sounds, and the orders of characterized parameters were determined through experiments. As a result of using 20 MSVQ algorithm with the 2 characterized parameters, MFCC had 99.9% and the spectrum energy parameter had 83.9% percent of success. It was verified that it is extremely accurate when seismic sounds were discriminated by using the method suggested by this paper.
https://doi.org/10.7776/ASK.2009.28.3.251 인용 PDF KSCI

On the Classification of the Pathological Speech (장애음성의 분류방법에 관한 연구)

김대현
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.08a
- /
- pp.388-391
- /
- 1998
jitter, shimmer 및 켑스트럼 방식의 음원분석에 의한 파라미터를 이용하여 장애음성을 진단, 식별하는 방법을 제안한다. 먼저 통계적 처리결과르 바탕으로 식별에 유효한 파라미터들을 선택하고 이들 파라미터들을 이용하여 최종 진단한다. 식별방법으로는 신경회로망을 이용한다. 입력파라미터로는 jitter, shimmer, HNRR을 사용한다. 신경회로망은 1 은닉층을 갖는 3- layer 신경회로망을 사용한다. 실험결과 효과적으로 정상음성과 장애음성의구분이 가능해졌다.
PDF

Estimation of Speeker Recognition Parameter using Lyapunov Dimension (Lyapunov 차원을 이용한 화자식별 파라미터 추정)

Yoo, Byong-Wook;Kim, Chang-Seok
- The Journal of the Acoustical Society of Korea
- /
- v.16 no.4
- /
- pp.42-48
- /
- 1997
This paper has apparaised ability of speaker recognition and speech recognition using correlation dimension and Lyapunov dimension. In this method, speech was regarded the cahos that the random signal is appeared in determinisitic raising system. we deduced exact correlation dimension and Lyapunov dimension with searching important orbit from AR model power spectrum when reconstruct strange attractor using Taken's embedding theory. We considered a usefulness of speech recognition and speaker recognition using correlation dimension and Lyapunov dimension that characterized reconstruction attractor. As a result of consideration, which were of use more the speaker recognition than speech recognition, and in case of speaker recognition using Lyapunov dimension were much recognition rate more than speaker recognitions using correlation dimension.
PDF

System Identification of ARMAX Model using the Genetic Algorithm (유전자 알고리즘을 이용한 ARMAX 모델의 시스템 식별)

정경권;권성훈;이정훈;엄기환
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 1998.11a
- /
- pp.146-150
- /
- 1998
In this paper, we propose a nonlinear system identification method using the genetic algorithm. We represent the nonlinear system as a parameter vector and a measurement vector of ARMAX model. In order to identify the nonlinear system, we find the parameter vector using the genetic algorithm. The parameter vector is regarded as a chromosome of gene. The error between the desired output and estimated output every sampling period is used to calculate the fitness of one gene. The simulation results showed the effectiveness of using the genetic algorithm in the nonlinear system identification.
PDF

System Identification of a Small Unmanned Air Vehicle Using Neural Networks (신경회로망을 이용한 소형 무인항공기 시스템 식별)

Song, Yong-Kyu;Jeon, Byung-Ho
- Journal of the Korean Society for Aeronautical & Space Sciences
- /
- v.35 no.10
- /
- pp.912-917
- /
- 2007
In this paper system identification of a small UAV via neural networks is tried and the estimated parameters are then compared to those obtained by Fourier Transform Regression and Maximum Likelihood Estimation Techniques. With the estimated parameters a linear system is constructed and simulated to compare to the flight data. The results show that parameter identification using neural networks is comparable to the existing techniques
https://doi.org/10.5139/JKSAS.2007.35.10.912 인용 PDF KSCI

Analysis of Acoustical Characteristics of Pathological Voice Using Source Analysis (음원분석을 통한 장애음성의 음향적 특성분석에 관한 연구)

조철우
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.06c
- /
- pp.163-166
- /
- 1998
본 논문에서는 장애음성들의 분석을 위하여 기존의 파라미터들인 jitter, shimmer 및 NHR과 함께 음원의 추정에 의한 파라미터를 이용하여 장애음성의 음향적 특성분석을 위한 실험을 행하고 정상음성과 장애음성을 이들 파라미터에 의해 식별하고자 한다.
PDF

Analysis and Recognition of Korean Fricatives and Affricates (한국어 마찰음 및 파찰음의 분석과 인식)

정석재;정현열;이무영
- The Journal of the Acoustical Society of Korea
- /
- v.10 no.5
- /
- pp.27-35
- /
- 1991
음소를 인식의 기본 단위로 하는 소규모 음성 인식 시스템을 구현하기 위한 기초 연구로서 마 찰음(/ㅅ, ㅆ, ㅎ/) 과 파찰음(/ㅈ, ㅉ, ㅊ/) 에 대하여 지속시간, 평균패턴, 분산비를 이용하여 각 음소 의 특징을 분석하고 각 음소군 내에서의 식별에 유효한 parameter들을 추출하여 인식 실험을 실시하 였다. 지속시간의 분포, 평균패턴의 분포, 분산비의 분포를 이용하여 분석한 결과 6차원 정도의 cepstrum 계수만으로 마찰음 및 파찰음의 식별이 가능하고, 시간 방향의 정보는 음성의 시단으로부터 14 frame 정도의 특징을 인식 파라미터로 할 경우가 최적임을 알 수 있었다. 이를 이용한 인식실험 결과에서는 조음방법별로 분류된 음소군내의 각 음소에 대한 인식실험의 인식률 보다는 발음방법별 인식실험시의 인식률이 높게 나타나 동일 음소군 내에서의 각 음소에 대한 식별이 더 어려움을 알 수 있었고, 특징 파라미터의 길이를 음성의 시단으로부터 14 frame 정도로 했을 때 조음방법별 인식률은 평균 81.1%, 발음방법별 인식률은 평균 97.9%로 최고의 인식률을 나타내었다. 특징 파라미터의 길이 를 14 frame 이상으로 증가시켜도 인식률은 큰 변화가 없어 분석 결과를 잘 설명하고 있음을 알 수 있었다.
PDF

Search Result 131, Processing Time 0.03 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)