Search | Korea Science

Mask Estimation Based on Band-Independent Bayesian Classifler for Missing-Feature Reconstruction (Missing-Feature 복구를 위한 대역 독립 방식의 베이시안 분류기 기반 마스크 예측 기법)

Kim Wooil;Stern Richard M.;Ko Hanseok
- The Journal of the Acoustical Society of Korea
- /
- v.25 no.2
- /
- pp.78-87
- /
- 2006
In this paper. we propose an effective mask estimation scheme for missing-feature reconstruction in order to achieve robust speech recognition under unknown noise environments. In the previous work. colored noise is used for training the mask classifer, which is generated from the entire frequency Partitioned signals. However it gives a limited performance under the restricted number of training database. To reflect the spectral events of more various background noise and improve the performance simultaneously. a new Bayesian classifier for mask estimation is proposed, which works independent of other frequency bands. In the proposed method, we employ the colored noise which is obtained by combining colored noises generated from each frequency band in order to reflect more various noise environments and mitigate the 'sparse' database problem. Combined with the cluster-based missing-feature reconstruction. the performance of the proposed method is evaluated on a task of noisy speech recognition. The results show that the proposed method has improved performance compared to the Previous method under white noise. car noise and background music conditions.
https://doi.org/10.7776/ASK.2006.25.2.078 인용 PDF KSCI

Speech Recognition Using Noise Robust Features and Spectral Subtraction (잡음에 강한 특징 벡터 및 스펙트럼 차감법을 이용한 음성 인식)

Shin, Won-Ho;Yang, Tae-Young;Kim, Weon-Goo;Youn, Dae-Hee;Seo, Young-Joo
- The Journal of the Acoustical Society of Korea
- /
- v.15 no.5
- /
- pp.38-43
- /
- 1996
This paper compares the recognition performances of feature vectors known to be robust to the environmental noise. And, the speech subtraction technique is combined with the noise robust feature to get more performance enhancement. The experiments using SMC(Short time Modified Coherence) analysis, root cepstral analysis, LDA(Linear Discriminant Analysis), PLP(Perceptual Linear Prediction), RASTA(RelAtive SpecTrAl) processing are carried out. An isolated word recognition system is composed using semi-continuous HMM. Noisy environment experiments usign two types of noises:exhibition hall, computer room are carried out at 0, 10, 20dB SNRs. The experimental result shows that SMC and root based mel cepstrum(root_mel cepstrum) show 9.86% and 12.68% recognition enhancement at 10dB in compare to the LPCC(Linear Prediction Cepstral Coefficient). And when combined with spectral subtraction, mel cepstrum and root_mel cepstrum show 16.7% and 8.4% enhanced recognition rate of 94.91% and 94.28% at 10dB.
PDF

Noise Control of Plate Structures with Optimal Design of Multiple Piezoelectric Actuators (복수 압전 가진기의 최적 설계를 통한 판구조물의 소음제어)

김재환
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference
- /
- 1996.04a
- /
- pp.263-270
- /
- 1996
Noise control of a plate structure with multiple disk shaped piezoelectric actuators is studied. The plate is excited by an acoustic pressure field produced by a noise source located below the plate. Finite element modeling is used for the plate structure that supports a combination of three dimensional solid, flat shell and transition elements. The objective function, in the optimization procedure, is to minimize the sound energy radiated onto a hemispherical surface of given radius and the design parameters are the locations and sizes of the piezoelectric actuators as well as the amplitudes of the voltages applied to them. Automatic mesh generation is addressed as part of the modeling procedure. Numerical results for both resonance and off resonance frequencies show remarkable noise reduction and the optimal locations of the actuators are found to be close to the edges of the plate structure. The optimized result is robust such that when the acoustic pressure pattern is changed, reduction of radiated sound is still maintained. The robustness of an optimally designed structure is also tested by changing the frequency of the noise source using only the actuator voltages as design parameters.
PDF

Suboptimal Decision Fusion in Wireless Sensor Networks under Non-Gaussian Noise Channels (비가우시안 잡음 채널을 갖는 무선 센서 네트워크의 준 최적화 결정 융합에 관한 연구)

Park, Jin-Tae;Koo, In-Soo;Kim, Ki-Seon
- Journal of Internet Computing and Services
- /
- v.8 no.4
- /
- pp.1-9
- /
- 2007
Decision fusion in wireless sensor networks under non-Gaussian noise channels is studied. To consider the tail behavior noise distributions, we use a exponentially-tailed distribution as a wide class of noise distributions. Based on a canonical parallel fusion model with fading and noise channels, the likelihood ratio(LR) based fusion rule is considered as an optimal fusion rule under Neyman-Pearson criterion. With both high and low signal-to-noise ratio (SNR) approximation to the optimal rule, we obtain several suboptimal fusion rules. and we propose a simple fusion rule that provides robust detection performance with a minimum prior information, Performance evaluation for several fusion rules is peformed through simulation. Simulation results show the robustness of the Proposed simple fusion rule.
PDF

Noise-robust Hand Region Segmentation In RGB Color-based Real-time Image (RGB 색상 기반의 실시간 영상에서 잡음에 강인한 손영역 분할)

Yang, Hyuk Jin;Kim, Dong Hyun;Seo, Yeong Geon
- Journal of Digital Contents Society
- /
- v.18 no.8
- /
- pp.1603-1613
- /
- 2017
This paper proposes a method for effectively segmenting the hand region using a widely popular RGB color-based webcam. This performs the empirical preprocessing method four times to remove the noise. First, we use Gaussian smoothing to remove the overall image noise. Next, the RGB image is converted into the HSV and the YCbCr color model, and global fixed binarization is performed based on the statistical value for each color model, and the noise is removed by the bitwise-OR operation. Then, RDP and flood fill algorithms are used to perform contour approximation and inner area fill operations to remove noise. Finally, ROI (hand region) is selected by eliminating noise through morphological operation and determining a threshold value proportional to the image size. This study focuses on the noise reduction and can be used as a base technology of gesture recognition application.
https://doi.org/10.9728/dcs.2017.18.8.1603 인용 PDF KSCI

A Noise-Robust Measuring Algorithm for Small Tubes Based on an Iterative Statistical Method (통계적 반복법에 기반한 노이즈에 강한 소형튜브 측정 알고리즘 개발)

Kim, Hyoung-Seok;Naranbaatar, Erdenesuren;Lee, Byung-Ryong
- Transactions of the Korean Society of Mechanical Engineers A
- /
- v.35 no.2
- /
- pp.175-181
- /
- 2011
We propose a novel algorithm for measuring the radius of tubes. This proposed algorithm is capable of effectively removing added noise and measuring the radius of tubes within allowable precision. The noise is removed by using a candidate true center that minimizes the standard deviation with respect to the radius. Further, the disconnection in data points resulting from noise removal is solved by using a connection algorithm. The final step of the process is repeated until the value of the standard deviation decreases to a small predefined value. Experiments were performed using circle geometries with added noise and a real tube with complex noise and that is used in the braking units of automobiles. It was concluded that the measurement carried out using the algorithm was accurate within 1.4%, even with 15% added noise.
https://doi.org/10.3795/KSME-A.2011.35.2.175 인용 PDF KSCI

Recent Research Trends in Touchscreen Readout Systems (최근 터치스크린 Readout 시스템의 연구 경향)

Jun-Min Lee;Ju-Won Ham;Woo-Seok Jang;Ha-Min Lee;Sang-Mo Koo;Jong-Min Oh;Seung-Hoon Ko
- Journal of the Korean Institute of Electrical and Electronic Material Engineers
- /
- v.36 no.5
- /
- pp.423-432
- /
- 2023
With the increasing demand for mobile devices featuring multi-touch operation, extensive research is being conducted on touch screen panel (TSP) Readout ICs (ROICs) that should possess low power consumption, compact chip size, and immunity to external noise. Therefore, this paper discusses capacitive touch sensors and their readout circuits, and it introduces research trends in various circuit designs that are robust against external noise sources. The recent state-of-the-art TSP ROICs have primarily focused on minimizing the impact of parasitic capacitance (Cp) caused by thin panel thickness. The large Cp can be effectively compensated using an area-efficient current compensator and Current Conveyor (CC), while a display noise reduction scheme utilizing a noise-antenna (NA) electrode significantly improves the signal-to-noise ratio (SNR). Based on these achievements, it is expected that future TSP ROICs will be capable of stable operation with thinner and flexible Touch Screen Panels (TSPs).
https://doi.org/10.4313/JKEM.2023.36.5.1 인용 PDF

Study on the Vulnerabilities of Automatic Speech Recognition Models in Military Environments (군사적 환경에서 음성인식 모델의 취약성에 관한 연구)

Elim Won;Seongjung Na;Youngjin Ko
- Convergence Security Journal
- /
- v.24 no.2
- /
- pp.201-207
- /
- 2024
Voice is a critical element of human communication, and the development of speech recognition models is one of the significant achievements in artificial intelligence, which has recently been applied in various aspects of human life. The application of speech recognition models in the military field is also inevitable. However, before artificial intelligence models can be applied in the military, it is necessary to research their vulnerabilities. In this study, we evaluates the military applicability of the multilingual speech recognition model "Whisper" by examining its vulnerabilities to battlefield noise, white noise, and adversarial attacks. In experiments involving battlefield noise, Whisper showed significant performance degradation with an average Character Error Rate (CER) of 72.4%, indicating difficulties in military applications. In experiments with white noise, Whisper was robust to low-intensity noise but showed performance degradation under high-intensity noise. Adversarial attack experiments revealed vulnerabilities at specific epsilon values. Therefore, the Whisper model requires improvements through fine-tuning, adversarial training, and other methods.
https://doi.org/10.33778/kcsa.2024.24.2.201 인용 PDF HTML

The Noise Robust Algorithm to Detect the Starting Point of Music for Content Based Music Retrieval System (노이즈에 강인한 음악 시작점 검출 알고리즘)

Kim, Jung-Soo;Sung, Bo-Kyung;Koo, Kwang-Hyo;Ko, Il-Ju
- Journal of the Korea Society of Computer and Information
- /
- v.14 no.9
- /
- pp.95-104
- /
- 2009
This paper proposes the noise robust algorithm to detect the starting point of music. Detection of starting point of music is necessary to solve computational-waste problem and retrieval-comparison problem with inconsistent input data in music content based retrieval system. In particular, such detection is even more necessary in time sequential retrieval method that compares data in the sequential order of time in contents based music retrieval system. Whereas it has the long point that the retrieval is fast since it executes simple comparison in the order of time, time sequential retrieval method has the short point that data starting time to be compared should be the same. However, digitalized music cannot guarantee the equity of starting time by bit rate conversion. Therefore, this paper ensured that recognition rate shall not decrease even while executing high speed retrieval by applying time sequential retrieval method through detection of music starting point in the pre-processing stage of retrieval. Starting point detection used minimum wave model that can detect effective sound, and for strength against noise, the noises existing in mute sound were swapped. The proposed algorithm was confirmed to produce about 38% more excellent performance than the results to which starting point detection was not applied, and was verified for the strength against noise.
https://doi.org/10.9708/jksci.2009.14.9.095 인용 PDF

A Robust Backpropagation Algorithm and It's Application (문자인식을 위한 로버스트 역전파 알고리즘)

Oh, Kwang-Sik;Kim, Sang-Min;Lee, Dong-No
- Journal of the Korean Data and Information Science Society
- /
- v.8 no.2
- /
- pp.163-171
- /
- 1997
Function approximation from a set of input-output pairs has numerous applications in scientific and engineering areas. Multilayer feedforward neural networks have been proposed as a good approximator of nonlinear function. The back propagation(BP) algorithm allows multilayer feedforward neural networks to learn input-output mappings from training samples. It iteratively adjusts the network parameters(weights) to minimize the sum of squared approximation errors using a gradient descent technique. However, the mapping acquired through the BP algorithm may be corrupt when errorneous training data we employed. When errorneous traning data are employed, the learned mapping can oscillate badly between data points. In this paper we propose a robust BP learning algorithm that is resistant to the errorneous data and is capable of rejecting gross errors during the approximation process, that is stable under small noise perturbation and robust against gross errors.
PDF

Search Result 1,308, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)