• Title/Summary/Keyword: Wavelet Band

A Fast Motion Estimation Algorithm using Adaptive Search According to Importance of Search Ranges (탐색영역의 중요도에 따라 적응적인 탐색을 이용한 고속 움직임 예측 알고리즘)

  • Kim, Tae Hwan;Kim, Jong Nam;Jeong, Shin Il
    • Journal of Korea Multimedia Society / v.18 no.4 / pp.437-442 / 2015
  • Voice activity detection is an important process that separates voice activity from a noisy speech signal for speech enhancement. Over the past few years, many studies have addressed voice activity detection, but performance remains poor in low signal-to-noise ratio environments or under fluctuating noise such as car noise. This paper proposes a new voice activity detection algorithm that uses ensemble variance based on wavelet band entropy together with a soft thresholding method. We evaluated the proposed algorithm on car noise across a range of signal-to-noise ratios and confirmed its performance.
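
The abstract names two building blocks, soft thresholding and wavelet band entropy, without giving their formulas. The sketch below shows the textbook definitions only, as an illustration of the terms; it is not the authors' algorithm, and the function names `soft_threshold` and `band_entropy` are hypothetical.

```python
import math


def soft_threshold(coeffs, threshold):
    """Shrink each coefficient toward zero by `threshold`.

    Coefficients whose magnitude does not exceed the threshold become 0.0.
    """
    out = []
    for c in coeffs:
        magnitude = abs(c) - threshold
        out.append(math.copysign(magnitude, c) if magnitude > 0 else 0.0)
    return out


def band_entropy(band_energies):
    """Shannon entropy (in bits) of the normalized energy across bands.

    Assumption: each entry is the non-negative energy of one wavelet band.
    """
    total = sum(band_energies)
    if total == 0:
        return 0.0
    probs = [e / total for e in band_energies if e > 0]
    return -sum(p * math.log2(p) for p in probs)
```

Under these definitions, energy spread evenly over the bands gives high entropy and energy concentrated in one band gives zero entropy, which is one way such a measure can separate noise-like frames from speech-like ones.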

A Color Image Watermarking Method for Embedding Audio Signal

  • Kim Sang Jin;Kim Chung Hwa
    • Proceedings of the IEEK Conference / 2004.08c / pp.631-635 / 2004
  • The rapid development of digital media and communication networks creates an urgent need for data certification technology to protect intellectual property rights (IPR). This paper proposes a new watermarking method that embeds the content owner's audio signal in order to protect the IPR of a color image. Because the method extends the existing static model and embeds a large-volume audio signal, it has the advantage of being able to restore a signal that has been altered by attacks. The watermarking has three basic stages: 1) encode the owner's analogue ID audio signal using PCM and create a new 3D audio watermark; 2) interleave the 3D audio watermark by linear bit expansion; and 3) apply a wavelet transform to the Y signal of the color image and embed the interleaved audio watermark in the low-frequency band of the transform domain. The results show that the proposed method improves robustness against lossy JPEG compression, standard image compression, and image cropping and rotation, which remove part of the image.

Thai Phoneme Segmentation using Dual-Band Energy Contour

  • Ratsameewichai, S.;Theera-Umpon, N.;Vilasdechanon, J.;Uatrongjit, S.;Likit-Anurucks, K.
    • Proceedings of the IEEK Conference / 2002.07a / pp.110-112 / 2002
  • In this paper, a new technique for phoneme segmentation of isolated Thai speech is proposed. Based on the features of Thai speech, the isolated speech is first divided into low- and high-frequency components by wavelet decomposition. The energy contour of each decomposed signal is then computed and used to locate phoneme boundaries. To verify the proposed scheme, experiments were performed on 1,000 syllables recorded from 10 speakers. The accuracy rates are 96.0, 89.9, 92.7 and 98.9% for initial consonant, vowel, final consonant and silence, respectively.
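
The two-step idea in this abstract (split into low and high bands, then compute an energy contour per band) can be sketched as follows. This is a minimal illustration that assumes a one-level Haar wavelet and non-overlapping frames; the abstract does not state which wavelet, frame length, or boundary rule the authors used, and the names `haar_split`, `energy_contour`, and `dominant_band` are hypothetical.

```python
import math


def haar_split(signal):
    """One-level Haar decomposition into (low, high) half-rate bands."""
    n = len(signal) - len(signal) % 2  # drop a trailing odd sample
    s = math.sqrt(2.0)
    low = [(signal[i] + signal[i + 1]) / s for i in range(0, n, 2)]
    high = [(signal[i] - signal[i + 1]) / s for i in range(0, n, 2)]
    return low, high


def energy_contour(band, frame_len):
    """Sum of squared samples over consecutive non-overlapping frames."""
    return [
        sum(x * x for x in band[i:i + frame_len])
        for i in range(0, len(band) - frame_len + 1, frame_len)
    ]


def dominant_band(low_energy, high_energy):
    """Label each frame by the band that carries more energy."""
    return ["low" if lo >= hi else "high"
            for lo, hi in zip(low_energy, high_energy)]


# Toy signal: a constant (low-frequency) half followed by an
# alternating (high-frequency) half.
signal = [1.0] * 8 + [1.0, -1.0] * 4
low, high = haar_split(signal)
labels = dominant_band(energy_contour(low, 4), energy_contour(high, 4))
```

In this toy case the label changes from one frame to the next exactly where the signal changes character, which is the kind of cue an energy-contour method can use as a candidate boundary.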

Time Delay Estimation using Third-order Statistics and Subband Adaptive Filtering (3차 통계기법과 서브밴드 적응 필터링을 이용한 시간 지연 추정)

  • 박현석;남상원
    • Proceedings of the IEEK Conference / 2001.09a / pp.907-910 / 2001
  • In this paper, we present a new time delay estimation method that uses third-order statistics and subband adaptive filtering to improve the accuracy of target detection for acoustic backscattered signals in a noisy, interference-prone environment. The reference and primary signals are each decorrelated within the multiresolution analysis framework through an M-band discrete wavelet transform (M-DWT), which reduces the effect of noise. Time delays are then estimated iteratively in each subband using two different adaptation mechanisms that minimize the mean squared error (MSE) between the reference and primary signals. More specifically, third-order cumulants and the projection cross-correlation (PCC) criterion are used to achieve an effective SNR improvement for the time delay estimation.

Sound Diffusion Control for the Localized Sound Image Using Time Delay (방향 정위된 음원에 시간지연을 이용한 확산감 제어에 관한 연구)

  • 김익형;정의필
    • Proceedings of the IEEK Conference / 2001.06d / pp.135-138 / 2001
  • Many researchers have developed techniques for efficient 3-D sound systems based on the psychoacoustics of spatial hearing, for use in multimedia or virtual reality. In this paper, we propose an improved 3-D sound system that uses conventional stereo headphones to obtain better sound diffusion from a mono sound recorded in an anechoic chamber. We use the HRTF (Head Related Transfer Function) for sound localization and a wavelet filter bank with time delay for sound diffusion. We investigate how the 3-D sound is affected by the length of the time delay in the lowest frequency band. The correlation coefficient between the left-channel and right-channel signals is also measured to quantify the sound diffusion.
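
The inter-channel measurement mentioned at the end of this abstract can be illustrated with the standard Pearson correlation coefficient. This is a sketch under that assumption: the abstract does not say which correlation definition was used, and `correlation_coefficient` and `delay` are hypothetical helper names.

```python
import math


def delay(signal, samples):
    """Delay a signal by `samples` samples, padding the start with zeros."""
    return [0.0] * samples + signal[:len(signal) - samples]


def correlation_coefficient(left, right):
    """Pearson correlation between two channels (0.0 if either is constant)."""
    n = min(len(left), len(right))
    l, r = left[:n], right[:n]
    mean_l, mean_r = sum(l) / n, sum(r) / n
    cov = sum((a - mean_l) * (b - mean_r) for a, b in zip(l, r))
    var_l = sum((a - mean_l) ** 2 for a in l)
    var_r = sum((b - mean_r) ** 2 for b in r)
    if var_l == 0 or var_r == 0:
        return 0.0
    return cov / math.sqrt(var_l * var_r)


# Toy example: the right channel is a delayed copy of the left one.
left = [0.0, 1.0, 0.0, -1.0] * 4
right = delay(left, 2)
rho = correlation_coefficient(left, right)
```

Identical channels give a coefficient of 1, while delaying one channel moves the coefficient away from 1, which is why this quantity is a natural proxy for how diffuse the stereo image sounds.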

Analysis of Partial Discharge Signals Using Statistical and Pattern Recognition Technique (통계처리와 패턴 인식 기법에 의한 부분방전 해석)

  • Byun, Doo-Gyoon;Hong, Jin-Woong
    • Journal of Institute of Control, Robotics and Systems / v.12 no.12 / pp.1231-1234 / 2006
  • In this study, we detected electromagnetic waves generated in an enclosed switchgear and applied various statistical methods to the detected signals. We calculated a range of statistical factors using the appropriate statistical methods. We then used these statistics to recognize the characteristics of each pattern by identifying the partial discharge in the normal, proceeding, and abnormal states. The characteristics of the electromagnetic wave patterns that occurred in the various states at electric power facilities were used as an output variable for more efficient diagnosis. We confirmed that the pattern of the partial discharge signal can serve as one of the factors for analyzing the insulation state, and as a consideration when diagnosing insulation states through intelligent recognition of the signal pattern. We will use the proposed diagnosis method to determine insulation degradation states.

Design of an EBCOT in JPEG2000 for a Web Camera Server (웹 카메라 서버용 JPEG2000 의 EBCOT 설계에 관한 연구)

  • Park, Ju-Hyun;Kim, Young-Chul;Hong, Sung-Hoon;Lee, Myung-Ok
    • Proceedings of the Korea Information Processing Society Conference / 2001.10a / pp.151-154 / 2001
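  • This study concerns the design and implementation of EBCOT (Embedded Block Coding with Optimized Truncation), a key block of JPEG2000, for use in a web camera. The EBCOT block consists of a T1 (Tier 1) block and a T2 (Tier 2) block. The T1 block extracts a context by examining the correlation between neighboring pixel values and the values of higher bit-planes in each sub-band produced by the wavelet transform, and then performs entropy coding using that context. The T2 block assembles the bit-stream. In this paper, the T1 block of EBCOT, which determines the overall compression performance of JPEG2000, was designed and implemented using Synopsys tools.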

Multi-Description Image Compression Coding Algorithm Based on Depth Learning

  • Yong Zhang;Guoteng Hui;Lei Zhang
    • Journal of Information Processing Systems / v.19 no.2 / pp.232-239 / 2023
  • To address the poor compression quality of traditional image compression coding (ICC) algorithms, this study proposes a multi-description ICC algorithm based on deep learning. First, an image compression algorithm was designed on the basis of multi-description coding theory. Image compression samples were collected, and the measurement matrix was calculated. The multi-description ICC sample set was then processed using the convolutional self-coding neural system in deep learning. The multi-description ICC sequence was obtained by compressing the wavelet coefficients after coding and synthesizing the multi-description image band sparse matrix. Finally, compression coding of multi-description images was realized by averaging the multi-description image coding data according to the position of the effective single point. Experimental results show that the designed algorithm takes less time to compress images and delivers better compression quality and better image reconstruction.

Robust Face Recognition Against Illumination Change Using Visible and Infrared Images (가시광선 영상과 적외선 영상의 융합을 이용한 조명변화에 강인한 얼굴 인식)

  • Kim, Sa-Mun;Lee, Dea-Jong;Song, Chang-Kyu;Chun, Myung-Geun
    • Journal of the Korean Institute of Intelligent Systems / v.24 no.4 / pp.343-348 / 2014
  • A face recognition system has the advantage of recognizing a person automatically without causing discomfort during the detection process. However, unlike other biometric systems that use fingerprints or the iris, its performance degrades under illumination variation. This paper therefore proposes a face recognition method that is robust to illumination variation, using a selective fusion technique that combines visible and infrared face images based on fuzzy linear discriminant analysis (fuzzy-LDA). In the first step, both the visible image and the infrared image are divided into four bands using the wavelet transform. In the second step, the Euclidean distance is calculated in each subband. In the third step, the recognition rate of each subband is determined using the Euclidean distance calculated in the second step. Weights are then determined from the recognition rate of each band. Finally, fusion-based face recognition is performed and robust recognition results are obtained.
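
The last two steps of this abstract (deriving weights from per-subband recognition rates, then fusing) can be sketched as below. The weighting rule is an assumption: the abstract says only that weights are determined by considering the recognition rate of each band, so this sketch simply normalizes the rates to sum to one. The names `fusion_weights`, `fused_distance`, and `classify` are hypothetical, and the fuzzy-LDA feature extraction is not shown.

```python
def fusion_weights(recognition_rates):
    """Normalize per-subband recognition rates into weights that sum to 1.

    Assumption: weights proportional to recognition rate; falls back to
    equal weights if every rate is zero.
    """
    total = sum(recognition_rates)
    if total == 0:
        return [1.0 / len(recognition_rates)] * len(recognition_rates)
    return [r / total for r in recognition_rates]


def fused_distance(subband_distances, weights):
    """Weighted sum of the per-subband Euclidean distances."""
    return sum(w * d for w, d in zip(weights, subband_distances))


def classify(distances_by_candidate, weights):
    """Return the enrolled identity with the smallest fused distance."""
    return min(
        distances_by_candidate,
        key=lambda name: fused_distance(distances_by_candidate[name], weights),
    )


# Toy example with four subbands and made-up rates and distances.
weights = fusion_weights([0.9, 0.6, 0.3, 0.2])
candidates = {
    "alice": [1.0, 5.0, 5.0, 5.0],
    "bob": [3.0, 1.0, 1.0, 1.0],
}
best = classify(candidates, weights)
```

With these made-up numbers the fused distance favors "bob", whereas relying on the first subband alone would favor "alice"; the point of the weighting is that more reliable subbands contribute more without silencing the rest.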

Spatial - Frequency Analysis of time-varying Coherence using ERP signals for attentional visual stimulus (시각 자극의 집중에 따른 시간 변화에 대한 뇌 유발전위의 공간 - 주파수간 상관 변화 분석)

  • Lee, ByuckJin;Yoo, Sun-Kook
    • Science of Emotion and Sensibility / v.16 no.4 / pp.527-534 / 2013
  • In this study, we analyzed the spatial-frequency relationships of brain function over time during an attentional visual stimulus through coherence analysis. Experiments on ERP (event-related potential) data revealed changes in phase synchronization between different scalp locations in the $\theta$ and $\alpha$ bands. ERPs between the left and right frontal lobes and between the frontal and central lobes showed phase synchronization at P100 and N200, while ERPs between the frontal and occipital lobes showed phase synchronization at P300, which is related to visual stimulus information. Compared with the STFT, which uses a fixed-length window, the CWT allows multi-resolution analysis by adjusting the parameters of the mother wavelet. Coherence computed with the CWT was therefore found to be effective for analyzing time-varying spatial-frequency relationships in ERPs. No phase synchronization was observed for the inattentional visual stimulus.