• Title/Summary/Keyword: 잡음 검출

Search Result 1,342, Processing Time 0.029 seconds

Triplet loss based domain adversarial training for robust wake-up word detection in noisy environments (잡음 환경에 강인한 기동어 검출을 위한 삼중항 손실 기반 도메인 적대적 훈련)

  • Lim, Hyungjun;Jung, Myunghun;Kim, Hoirin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.39 no.5
    • /
    • pp.468-475
    • /
    • 2020
  • A good acoustic word embedding that can well express the characteristics of word plays an important role in wake-up word detection (WWD). However, the representation ability of acoustic word embedding may be weakened due to various types of environmental noise occurred in the place where WWD works, causing performance degradation. In this paper, we proposed triplet loss based Domain Adversarial Training (tDAT) mitigating environmental factors that can affect acoustic word embedding. Through experiments in noisy environments, we verified that the proposed method effectively improves the conventional DAT approach, and checked its scalability by combining with other method proposed for robust WWD.

Reduction Algorithm of Environmental Noise by Multi-band Filter (멀티밴드필터에 의한 환경잡음억압 알고리즘)

  • Choi, Jae-Seung
    • Journal of the Korea Society of Computer and Information
    • /
    • v.17 no.8
    • /
    • pp.91-97
    • /
    • 2012
  • This paper first proposes the speech recognition algorithm by detection of the speech and noise sections at each frame, then proposes the reduction algorithm of environmental noise by multi-band filter which removes the background noises at each frame according to detection of the speech and noise sections. The proposed algorithm reduces the background noises using filter bank sub-band domain after extracting the features from the speech data. In this experiment, experimental results of the proposed noise reduction algorithm by the multi-band filter demonstrate using the speech and noise data, at each frame. Based on measuring the spectral distortion, experiments confirm that the proposed algorithm is effective for the speech by corrupted the noise.

Image noise reduction algorithms using nonparametric method (비모수 방법을 사용한 영상 잡음 제거 알고리즘)

  • Woo, Ho-young;Kim, Yeong-hwa
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.5
    • /
    • pp.721-740
    • /
    • 2019
  • Noise reduction is an important field in image processing and requires a statistical approach. However, it is difficult to assume a specific distribution of noise, and a spatial filter that reflects regional characteristics is a small sample and cannot be accessed in a parametric manner. The first order image differential and the second order image differential show a clear difference according to the noise level included in the image and can be more clearly understood using the canyon edge detector. The Fligner-Killeen test was performed and the bootstrap method was used to statistically check the noise level. The estimated noise level was set between 0 and 1 using the cumulative distribution function of the beta distribution. In this paper, we propose a nonparametric noise reduction algorithm that accounts for the noise level included in the image.

Efficient Edge Detection in Noisy Images using Robust Rank-Order Test (잡음영상에서 로버스트 순위-순서 검정을 이용한 효과적인 에지검출)

  • Lim, Dong-Hoon
    • The Korean Journal of Applied Statistics
    • /
    • v.20 no.1
    • /
    • pp.147-157
    • /
    • 2007
  • Edge detection has been widely used in computer vision and image processing. We describe a new edge detector based on the robust rank-order test which is a useful alternative to Wilcoxon test. Our method is based on detecting pixel intensity changes between two neighborhoods with a $r{\times}r$ window using an edge-height model to perform effectively on noisy images. Some experiments of our robust rank-order detector with several existing edge detectors are carried out on both synthetic images and real images with and without noise.

Robust Speech Endpoint Detection in Noisy Environments for HRI (Human-Robot Interface) (인간로봇 상호작용을 위한 잡음환경에 강인한 음성 끝점 검출 기법)

  • Park, Jin-Soo;Ko, Han-Seok
    • The Journal of the Acoustical Society of Korea
    • /
    • v.32 no.2
    • /
    • pp.147-156
    • /
    • 2013
  • In this paper, a new speech endpoint detection method in noisy environments for moving robot platforms is proposed. In the conventional method, the endpoint of speech is obtained by applying an edge detection filter that finds abrupt changes in the feature domain. However, since the feature of the frame energy is unstable in such noisy environments, it is difficult to accurately find the endpoint of speech. Therefore, a novel feature extraction method based on the twice-iterated fast fourier transform (TIFFT) and statistical models of speech is proposed. The proposed feature extraction method was applied to an edge detection filter for effective detection of the endpoint of speech. Representative experiments claim that there was a substantial improvement over the conventional method.

An Edge Detection Method using Modified Mask in Impulse Noise and AWGN Environments (임펄스 잡음 및 AWGN 환경에서 변형된 마스크를 이용한 에지 검출 방법)

  • Lee, Chang-Young;Kim, Nam-Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2013.10a
    • /
    • pp.265-267
    • /
    • 2013
  • Edges include various important informations of the objects. These edges are being applied in numerous areas and there is a detection method using mask in existing edge detection methods. These existing edge detection methods are simple to realize. However, because the fixed mask is used, edge detection characteristics in complicated noise environments are somewhat unsatisfactory. Therefore, to compensate for the weakness in the existing detection methods, edge detection algorithm which uses the standard deviation of local mask and noise elimination was proposed.

  • PDF

An Algorithm on Edge Detection using Local Mask in Salt & Pepper Noise Environments (Salt & Pepper 잡음 환경에서 국부 마스크를 이용한 에지 검출에 관한 연구)

  • Lee, Chang-Young;Kim, Nam-Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2014.05a
    • /
    • pp.787-789
    • /
    • 2014
  • Image processing is presently used in various areas such as smart phone, smart TV, and portable PC. Likewise, edge detection plays an important role in most of the applications. As such, studies for the detection of edge are continually underway. Roberts, Laplacian and LoG(lapacian of Gaussian) are the representative edge detection methods, but these methods do not offer optimal edge detection characteristic in case of the image that is damaged by Salt & Pepper noises. As such, this study presented algorithm with superior edge detection characteristic by utilizing the elements of local mask in Salt & Pepper noise environment.

  • PDF

A Study on Edge Detection Algorithm using Modified Mask in Salt and Pepper Noise Images (Salt and Pepper 잡음 영상에서 변형된 마스크를 이용한 에지 검출 알고리즘에 관한 연구)

  • Lee, Chang-Young;Kim, Nam-Ho
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.18 no.1
    • /
    • pp.210-216
    • /
    • 2014
  • The edge in the image is a part which the brightness changes rapidly between the object and the object or objects and background, and includes information of the features such as size, position, orientation, and texture of the object. The edge detection is the technique that acquires these information of the images, and now the researches to detect edges are making steady progress. Typical conventional edge detection methods are Sobel, Prewitt, Roberts using the first derivative operator and Laplacian method using the second derivative operator and so on. These methods is more or less insufficient that the characteristics of the edge detection in the image added salt and pepper noise. therefore, in this paper, an edge detection algorithm using modified mask that applies different size mask according to noise density of local mask is proposed.

Detection of the First and Second Heart Sound Using Three-order Shannon Energy Difference (3차 샤논 에너지 변화량을 이용한 제 1심음과 제 2심음 검출 알고리듬)

  • Lee, G.H.;Kim, P.U.;Lee, Y.J.;Kim, M.N.
    • Journal of Korea Multimedia Society
    • /
    • v.14 no.7
    • /
    • pp.884-894
    • /
    • 2011
  • We proposed a new algorithm for detection of first(S1) and second heart sound(S2). Many researches for detecting primary components and those algorithms have good performance at normal heart sound, but the performance is degraded at abnormal heart sound which is contain murmurs generated by heart disease. Therefore we proposed the S1, S2 detection algorithm using three-order Shannon energy difference. Using S1, S2's character which has large energy difference than murmurs, it is reduced noise and detected S1, S2. According to simulation results, not only normal heart sound but also abnormal heart sound, the proposed algorithm has better performance than former study at abnormal heart sound.

A Dual Noise-Predictive Partial Response Decision-Feedback Equalizer for Perpendicular Magnetic Recording Channels (수직 자기기록 채널을 위한 쌍 잡음 예측 부분 응답 결정 궤환 등화기)

  • 우중재;조한규;이영일;홍대식
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.28 no.9C
    • /
    • pp.891-897
    • /
    • 2003
  • Partial response maxim likelihood (PRML) is a powerful and indispensable detection scheme for perpendicular magnetic recording channels. The performance of PRML can be improved by incorporating a noise prediction scheme into branch metric computations of Viterbi algorithm (VA). However, the systems constructed by VA have shortcomings in the form of high complexity and cost. In this connection, a new simple detection scheme is proposed by exploiting the minimum run-length parameter d=1 of RLL code. The proposed detection scheme have a slicer instead of Viterbi detector and a noise predictor as a feedback filter. Therefore, to improve BER performance, the proposed detection scheme is extended to dual detection scheme for improving the BER performance. Simulation results show that the proposed scheme has a comparable performance to noise-predictive maximum likelihood (NPML) detector with less complexity when the partial response (PR) target is (1,2,1).