Search | Korea Science

Study on the Improvement of Speech Recognizer by Using Time Scale Modification (시간축 변환을 이용한 음성 인식기의 성능 향상에 관한 연구)

이기승
- The Journal of the Acoustical Society of Korea
- /
- v.23 no.6
- /
- pp.462-472
- /
- 2004
In this paper a method for compensating for thp performance degradation or automatic speech recognition (ASR) is proposed. which is mainly caused by speaking rate variation. Before the new method is proposed. quantitative analysis of the performance of an HMM-based ASR system according to speaking rate is first performed. From this analysis, significant performance degradation was often observed in the rapidly speaking speech signals. A quantitative measure is then introduced, which is able to represent speaking rate. Time scale modification (TSM) is employed to compensate the speaking rate difference between input speech signals and training speech signals. Finally, a method for compensating the performance degradation caused by speaking rate variation is proposed, in which TSM is selectively employed according to speaking rate. By the results from the ASR experiments devised for the 10-digits mobile phone number, it is confirmed that the error rate was reduced by 15.5% when the proposed method is applied to the high speaking rate speech signals.
PDF KSCI

Error Detection and Concealment of Transmission Error Using Watermark (워터마크를 이용한 전송 채널 에러의 검출 및 은닉)

박운기;전병우
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.29 no.2C
- /
- pp.262-271
- /
- 2004
There are channel errors when video data are transmitted between encoder and decoder. These channel errors would make decoded image incorrect, so it is very important to detect and recover channel errors. This paper proposes a method of error detection and recovery by hiding specific information into video bitstream using fragile watermark and checking it later. The proposed method requires no additional bits into compressed bitstream since it embeds a user-specific data pattern in the least significant bits of LEVELs in VLC codewords. The decoder can extract the information to check whether the received bitstream has an error or not. We also propose to use this method to embed essential data such as motion vectors that can be used for error recovery. The proposed method can detect corrupted MBs that usually escape the conventional syntax-based error detection scheme. This proposed method is quite simple and of low complexity. So the method can be applied to multimedia communication system in low bitrate wireless channel.
PDF KSCI

Rainfall Forecasting Using Satellite Information and Integrated Flood Runoff and Inundation Analysis (I): Theory and Development of Model (위성정보에 의한 강우예측과 홍수유출 및 범람 연계 해석 (I): 이론 및 모형의 개발)

Choi, Hyuk Joon;Han, Kun Yeun;Kim, Gwangseob
- KSCE Journal of Civil and Environmental Engineering Research
- /
- v.26 no.6B
- /
- pp.597-603
- /
- 2006
The purpose of this study is to improve the short term rainfall forecast skill using neural network model that can deal with the non-linear behavior between satellite data and ground observation, and minimize the flood damage. To overcome the geographical limitation of Korean peninsula and get the long forecast lead time of 3 to 6 hour, the developed rainfall forecast model took satellite imageries and wide range AWS data. The architecture of neural network model is a multi-layer neural network which consists of one input layer, one hidden layer, and one output layer. Neural network is trained using a momentum back propagation algorithm. Flood was estimated using rainfall forecasts. We developed a dynamic flood inundation model which is associated with 1-dimensional flood routing model. Therefore the model can forecast flood aspect in a protected lowland by levee failure of river. In the case of multiple levee breaks at main stream and tributaries, the developed flood inundation model can estimate flood level in a river and inundation level and area in a protected lowland simultaneously.
https://doi.org/10.12652/Ksce.2006.26.6B.597 인용 PDF

Robust Speech Recognition Using Missing Data Theory (손실 데이터 이론을 이용한 강인한 음성 인식)

김락용;조훈영;오영환
- The Journal of the Acoustical Society of Korea
- /
- v.20 no.3
- /
- pp.56-62
- /
- 2001
In this paper, we adopt a missing data theory to speech recognition. It can be used in order to maintain high performance of speech recognizer when the missing data occurs. In general, hidden Markov model (HMM) is used as a stochastic classifier for speech recognition task. Acoustic events are represented by continuous probability density function in continuous density HMM(CDHMM). The missing data theory has an advantage that can be easily applicable to this CDHMM. A marginalization method is used for processing missing data because it has small complexity and is easy to apply to automatic speech recognition (ASR). Also, a spectral subtraction is used for detecting missing data. If the difference between the energy of speech and that of background noise is below given threshold value, we determine that missing has occurred. We propose a new method that examines the reliability of detected missing data using voicing probability. The voicing probability is used to find voiced frames. It is used to process the missing data in voiced region that has more redundant information than consonants. The experimental results showed that our method improves performance than baseline system that uses spectral subtraction method only. In 452 words isolated word recognition experiment, the proposed method using the voicing probability reduced the average word error rate by 12％ in a typical noise situation.
PDF

Performance Improvement of Cardiac Disorder Classification Based on Automatic Segmentation and Extreme Learning Machine (자동 분할과 ELM을 이용한 심장질환 분류 성능 개선)

Kwak, Chul;Kwon, Oh-Wook
- The Journal of the Acoustical Society of Korea
- /
- v.28 no.1
- /
- pp.32-43
- /
- 2009
In this paper, we improve the performance of cardiac disorder classification by continuous heart sound signals using automatic segmentation and extreme learning machine (ELM). The accuracy of the conventional cardiac disorder classification systems degrades because murmurs and click sounds contained in the abnormal heart sound signals cause incorrect or missing starting points of the first (S1) and the second heart pulses (S2) in the automatic segmentation stage, In order to reduce the performance degradation due to segmentation errors, we find the positions of the S1 and S2 pulses, modify them using the time difference of S1 or S2, and extract a single period of heart sound signals. We then obtain a feature vector consisting of the mel-scaled filter bank energy coefficients and the envelope of uniform-sized sub-segments from the single-period heart sound signals. To classify the heart disorders, we use ELM with a single hidden layer. In cardiac disorder classification experiments with 9 cardiac disorder categories, the proposed method shows the classification accuracy of 81.6% and achieves the highest classification accuracy among ELM, multi-layer perceptron (MLP), support vector machine (SVM), and hidden Markov model (HMM).
https://doi.org/10.7776/ASK.2009.28.1.032 인용 PDF KSCI

A Neural Networks Model for Flow Forecasting in Nakdong River Basin (낙동강 유역에서의 유량 예측 신경망 모형에 관한 연구)

Han, Kun-Yeun;Kim, Dong-Il;Choi, Hyun-Gu;Yoon, Young-Sam
- Proceedings of the Korea Water Resources Association Conference
- /
- 2008.05a
- /
- pp.1727-1731
- /
- 2008
수자원의 효율적인 관리를 위해서는 신뢰성 있는 유량자료의 획득이 대단히 중요하다. 우리나라는 양질의 유량자료를 획득하기 위해 매년 많은 시간과 돈을 투자하고 있으나 자료의 질적인 면에서 만족할 만한 성과를 얻지 못하고 있다. 현재까지 우리나라의 유량자료는 댐의 수문자료와 수량관리 부처인 건교부에서 운영하는 수위표 지점의 수위-유량곡선에서 산출된 자료에 의존하고 있다. 그러나 수위-유량 관계식을 보정하기 위한 유량측정사업이 지속적이지 못하며, 이 관계식은 유량이 적은 저수기 및 갈수기에는 부정확하다는 한계가 있다. 또한, 국립환경과학원 낙동강물환경연구소에서 오염총량관리를 위한 낙동강수계 유량측정사업을 실시하고 있지만, 목적은 낙동강수계의 오염총량관리 단위유역 말단 47개 지점에서 유량측정을 효율적으로 실시하여 수질정책의 기초자료를 제공하는데 있다. 이 자료 역시 오염총량관리를 위하여 유량측정을 실시하여 수자원의 효율적인 관리를 위한 일 유량을 알 수가 없는 한계점을 가지고 있다. 따라서 저수기 및 갈수기에 수질정책의 기초자료를 제공하기 위해서 하천을 포함한 유역의 정확한 강우-유출특성의 파악이 필요하다. 그러나 강우-유출특성 또한 유역 내 강우의 시 공간적 분포가 다르며 그 자가 비선형성이 강하고 여러 변동성을 포함하므로, 강우로부터 하천의 유출량의 정확한 해석이 불가능하다. 그러나 최근 인공지능 분야에서 신호처리, 지능제어 및 패턴인식 등의 수단으로 사용되고 있는 신경망은 학습이라는 최적화 과정을 통해 입력과 출력으로 구성되는 하나의 시스템을 비선형적으로 구축할 수 있으며 이러한 이점을 활용하여 수자원 분야에서 다양하게 적용되고 있다. 본 연구의 목적은 강우-유출자료 및 댐 방류량 자료의 비선형적인 특정을 가장 잘 반영할 수 있는 신경망모형을 적용하여 수질정책의 기초자료를 제공하기 위하여 신뢰성 있는 유량자료를 산정하는 모형을 개발하는 것이다. 이를 위해서 낙동강물환경연구소에서 오염총량관리를 위한 낙동강수계 유량측정 지점 상류의 댐 방류량의 일 방류량자료와 강우자료를 입력 자료로 하여 유량을 예측할 수 있는 유량예측 신경망 모형 FFBN(Flow Forecasting By Neural)을 개발하였다. 그리고 입력 자료로서 장기유출모형인 SWAT의 모의결과를 입력 자료로 추가한 FFBNS(Flow Forecasting By Neural and SWAT)을 개발하였다. 신경망 모형의 구조는 입력층과 출력층 사이에 하나의 은닉층이 존재하는 다층 신경망으로 구성하였으며, 학습단계에서는 오류 역전파 알고리듬 학습방법 중 모멘텀법을 사용하였다. 예측된 유출량을 실측치와의 비교를 위하여 낙본D지점과 낙본 E지점에 대하여 $2005{\sim}2006$년까지의 모의 결과를 낙동 수위측정지점과 구미 수위측정지점의 실측치 통하여 복잡한 비선형성을 가지는 유출 시계열 자료에 대한 효과적인 최적의 신경망모델을 개발하여 유량을 예측하고 적용 가능성을 검토하고자 한다. 모의 결과는 수질정책의 기초자료 제공에 기여할 수 있을 것으로 판단된다.
PDF

Search Result 26, Processing Time 0.023 seconds

Study on the Improvement of Speech Recognizer by Using Time Scale Modification (시간축 변환을 이용한 음성 인식기의 성능 향상에 관한 연구)

Error Detection and Concealment of Transmission Error Using Watermark (워터마크를 이용한 전송 채널 에러의 검출 및 은닉)

Rainfall Forecasting Using Satellite Information and Integrated Flood Runoff and Inundation Analysis (I): Theory and Development of Model (위성정보에 의한 강우예측과 홍수유출 및 범람 연계 해석 (I): 이론 및 모형의 개발)

Robust Speech Recognition Using Missing Data Theory (손실 데이터 이론을 이용한 강인한 음성 인식)

Performance Improvement of Cardiac Disorder Classification Based on Automatic Segmentation and Extreme Learning Machine (자동 분할과 ELM을 이용한 심장질환 분류 성능 개선)

A Neural Networks Model for Flow Forecasting in Nakdong River Basin (낙동강 유역에서의 유량 예측 신경망 모형에 관한 연구)

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)