Search | Korea Science

An Endpoint Detection Algorithm for Noise Speech using Band Energy (대역에너지를 이용한 잡음음성의 끝점검출 알고리즘)

Park Ki-Sang;Suk Su-Young;Jung Ho-Youl;Chung Hyun-Yeol
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.91-94
- /
- 2002
음성인식 시스템의 실용화를 위해서 우선적으로 해결되어야 될 문제중 하나로 잡음환경하에서의 끝점검출을 들 수 있다. 잡음이 존재하지 않는 환경에서는 기존의 에너지 파라미터만으로도 어느정도 신뢰성있는 끝점 구간을 검출할 수 있으나 도심 소음과 같은 실제 잡음환경하에서는 대부분 좋지 않은 결과를 보인다. 본 논문에서는 도심환경의 배경잡음을 제거하는 방법으로 입력되는 음성에 대하여 주변소음에 의해 손상된 음성스펙트럼의 크기 성분만을 제거하는 전처리 기법인 Bark scale에 기반한 스펙트럼 차감법을 사용하고, 인간의 청각특성을 고려하여 음성의 주파수 대역을 3개의 대역으로 분리한 후, 대역별로 세밀한 에너지 문턱치값을 설정하여 음성의 끝점을 탐색하는 방법을 제안한다. 제안한 방법의 유효성을 확인하기 위해 실제 사무실 및 지하철역 등의 잡음환경하에서 녹음된 데이터베이스를 이용하여 끝점검출을 수행한 결과 기존의 에너지와 영교차율을 이용한 방법에 비해 평균 $46\%$의 오차율 감소와 대역에너지만을 사용한 경우에 비해 평균 $17\%$의 오차율 감소를 나타내어 제안한 방법의 유효성을 확인할 수 있었다.
PDF

Dynamic Redundant Audio Transmission for Packet Loss Recovery in VoIP Systems (인터넷 전화에서 손실 패킷 복원을 위한 동적인 부가 정보 전송 기법)

권철홍;김무중
- The Journal of the Acoustical Society of Korea
- /
- v.21 no.4
- /
- pp.349-360
- /
- 2002
In ITU H.323 teleconference system, the RTP/RTCP protocol is offered to transfer real-time multimedia stream. Both sender and receiver hate experience in packet loss and jitter which result from network congestion over Internet. Audio quality over Internet depends on the number of lost packets and on jitter between successive packets. The goal of our study is to improve the speech quality over Internet by checking the packet loss characteristics of the network and adopting the but for control management mechanism at the receiver. We suggest a dynamic redundant audio transmission mechanism which examines the packet loss rate and uses the feedback information through RTCP.
PDF KSCI

A Study on Feature Extraction of Transformers Aging Signal using Discrete Wavelet Transform Technique (이산 웨이블렛 변환 기법을 이용한 변압기 열화신호의 특정추출에 관한 연구)

Park, Jae-Jun;Kim, Meyoun-Soo;Oh, Seung-Heon;Kim, Sung-Hong;Kweon, Dong-Jin;Song, Young-Chul;Ahn, Chang-Beom
- Proceedings of the Korean Institute of Electrical and Electronic Material Engineers Conference
- /
- 2000.05a
- /
- pp.5-12
- /
- 2000
본 연구에서, Daubechies'Mother Wavelet를 이용한 이산 웨이블렛 변환(Discrete Wavelet Transform)에 기초한 새롭고 효과적인 특정추출방법을 제안하였다. 특정추출을 이용하여 응용방향을 설명하고 또는 통계적 파라메터의 평가를 행하였다. 본 연구에서는 다음과 같은 몇 가지 사실을 알 수 있었다. 1. 시스템에서 발생된 (인가전압이 0[V]) 노이즈라 볼 수 가있는 렌덤노이즈(Random Noise)를 디지털필터인 FIR(Finite Impulse Response)필터를 통하여 상당한 노이즈를 억제할 수가 있었다. 2. 이산 웨이블렛 변환 시 레벨 1~4까지 변환한 결과 최적의 변환상태 Level-3을 기준으로 하였다. 3. 특정추출 파라메터는 음향방출신호의 최대값, 평균값, 분산, 왜도, 첨쇄도를 특정추출파라메터로 이용하였다. 4. 특정추출 결과를 이용하여 전체 열화시간 중 대표적 음향방출신호 중 초기열화신호, 중기열화신호, 말기열화신호를 얻을 수 있었다. 이런 특정추출을 통하여 변압기열화상태를 진단할 수 있는 가능성을 확인 할 수가 있었다.
PDF

Side scan sonar image super-resolution using an improved initialization structure (향상된 초기화 구조를 이용한 측면주사소나 영상 초해상도 영상복원)

Lee, Junyeop;Ku, Bon-hwa;Kim, Wan-Jin;Ko, Hanseok
- The Journal of the Acoustical Society of Korea
- /
- v.40 no.2
- /
- pp.121-129
- /
- 2021
This paper deals with a super-resolution that improves the resolution of side scan sonar images using learning-based compressive sensing. Learning-based compressive sensing combined with deep learning and compressive sensing takes a structure of a feed-forward network and parameters are set automatically through learning. In particular, we propose a method that can effectively extract additional information required in the super-resolution process through various initialization methods. Representative experimental results show that the proposed method provides improved performance in terms of Peak Signal-to-Noise Ratio (PSNR) and Structure Similarity Index Measure (SSIM) than conventional methods.
https://doi.org/10.7776/ASK.2021.40.2.121 인용 PDF KSCI

A quantitative analysis of aerodynamic noise by sound sources from a nozzle inflow (노즐 내부 유동 소음원에 의한 공력 소음의 정량적 분석)

Kwongi, Lee;Cheolung, Cheong;Kyeonghun, Park
- The Journal of the Acoustical Society of Korea
- /
- v.41 no.6
- /
- pp.698-704
- /
- 2022
In this paper, the radiated aerodynamic noise generated from sound sources of a nozzle inflow is quantitatively investigated and compared with experimental results of externally radiated noise. A high-resolution unsteady compressible Large Eddy Simulation (LES) technique is used to accurately predict the internal and external flow of three types of nozzle shape. Through using the vortex sound source for sound sources, the geometry of nozzle neck is identified as most significant aerodynamic noise sources. For validation of quantitative analysis, the vortex sound source intensity of internal nozzle flow is compared with results of external radiated noise of calculation and experiment.
https://doi.org/10.7776/ASK.2022.41.6.698 인용 PDF KSCI

A study on the localization of incipient propeller cavitation applying sparse Bayesian learning (희소 베이지안 학습 기법을 적용한 초생 프로펠러 캐비테이션 위치추정 연구)

Ha-Min Choi;Haesang Yang;Sock-Kyu Lee;Woojae Seong
- The Journal of the Acoustical Society of Korea
- /
- v.42 no.6
- /
- pp.529-535
- /
- 2023
Noise originating from incipient propeller cavitation is assumed to come from a limited number of sources emitting a broadband signal. Conventional methods for cavitation localization have limitations because they cannot distinguish adjacent sound sources effectively due to low accuracy and resolution. On the other hand, sparse Bayesian learning technique demonstrates high-resolution restoration performance for sparse signals and offers greater resolution compared to conventional cavitation localization methods. In this paper, an incipient propeller cavitation localization method using sparse Bayesian learning is proposed and shown to be superior to the conventional method in terms of accuracy and resolution through experimental data from a model ship.
https://doi.org/10.7776/ASK.2023.42.6.529 인용 PDF

A Study on Robust Emotion Classification Structure Between Heterogeneous Speech Databases (이종 음성 DB 환경에 강인한 감성 분류 체계에 대한 연구)

Yoon, Won-Jung;Park, Kyu-Sik
- The Journal of the Acoustical Society of Korea
- /
- v.28 no.5
- /
- pp.477-482
- /
- 2009
The emotion recognition system in commercial environments such as call-center undergoes severe system performance degradation and instability due to the speech characteristic differences between the system training database and the input speech of unspecified customers. In order to alleviate these problems, this paper extends traditional method of emotion recognition of neutral/anger into two-step hierarchical structure by using emotional characteristic changes and differences of male and female. The experimental results indicate that the proposed method provides very stable and successful emotional classification performance about 25% over the traditional method of emotion recognition.
https://doi.org/10.7776/ASK.2009.28.5.477 인용 PDF KSCI

Study on the Performance of Spectral Contrast MFCC for Musical Genre Classification (스펙트럼 대비 MFCC 특징의 음악 장르 분류 성능 분석)

Seo, Jin-Soo
- The Journal of the Acoustical Society of Korea
- /
- v.29 no.4
- /
- pp.265-269
- /
- 2010
This paper proposes a novel spectral audio feature, spectral contrast MFCC (SCMFCC), and studies its performance on the musical genre classification. For a successful musical genre classifier, extracting features that allow direct access to the relevant genre-specific information is crucial. In this regard, the features based on the spectral contrast, which represents the relative distribution of the harmonic and non-harmonic components, have received increased attention. The proposed SCMFCC feature utilizes the spectral contrst on the mel-frequency cepstrum and thus conforms the conventional MFCC in a way more relevant for musical genre classification. By performing classification test on the widely used music DB, we compare the performance of the proposed feature with that of the previous ones.
https://doi.org/10.7776/ASK.2010.29.4.265 인용 PDF KSCI

Application of Spectral Element Method for the Vibration Analysis of Passive Constrained Layer Damping Beams (수동감쇠 적층보의 진동해석을 위한 스펙트럴요소법의 적용)

Song, Jee-Hun;Hong, Suk-Yoon
- The Journal of the Acoustical Society of Korea
- /
- v.28 no.1
- /
- pp.25-31
- /
- 2009
This paper introduces a spectrally formulated element method (SEM) for the beams treated with passive constrained layer damping (PCLD). The viscoelastic core of the beams has a complex modulus that varies with frequency. The SEM is formulated in the frequency domain using dynamic shape functions based on the exact displacement solutions from progressive wave methods, which implicitly account for the frequency-dependent complex modulus of the viscoelastic core. The frequency response function and dynamic responses obtained by the SEM and the conventional finite element method (CFEM) are compared to evaluate the validity and accuracy of the present spectral PCLD beam element model. The spectral PCLD beam element model is found to provide very reliable results when compared with the conventional finite element model.
https://doi.org/10.7776/ASK.2009.28.1.025 인용 PDF KSCI

Functional beamforming for high-resolution ultrasound imaging in the air with random sparse array transducer (고해상도 공기중 초음파 영상을 위한 기능성 빔형성법 적용)

Choon-Su Park
- The Journal of the Acoustical Society of Korea
- /
- v.43 no.3
- /
- pp.361-367
- /
- 2024
Ultrasound in the air is widely used in industry as a measurement technique to prevent abnormalities in the machinery. Recently, the use of airborne ultrasound imaging techniques, which can find the location of abnormalities using an array transducers, is increasing. A beamforming method that uses the phase difference for each sensor is used to visualize the location of the ultrasonic sound source. We exploit a random sparse ultrasonic array and obtain beamforming power distribution on the source in a certain distance away from the array. Conventional beamforming methods inevitably have limited spatial resolution depending on the number of sensors used and the aperture size. A high-resolution ultrasound imaging technique was implemented by applying functional beamforming as a method to overcome the geometric constraints of the array. The functional beamforming method can be expressed as a generalized beam forming method mathematically, and has the advantage of being able to obtain high-resolution imaging by reducing main-lobe width and side lobes. As a result of observation through computer simulation, it was verified that the resolution of the ultrasonic source in the air was successfully increased by functional beamforming using the ultrasonic sparse array.
https://doi.org/10.7776/ASK.2024.43.3.361 인용 PDF

Search Result 502, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)