• Title/Summary/Keyword: Filter-Bank

Search Result 355, Processing Time 0.028 seconds

Analysis of CRLB Performances with CAF under Multiple Emitters (CAF 이용 다중 발기하에서의 CRLB 성능 분석)

  • Lee, Young-kyu;Yang, Sung-hoon;Lee, Chang-bok;Park, Young-Mi;Lee, Moon-Seok
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.21 no.6
    • /
    • pp.589-594
    • /
    • 2015
  • In this paper, we described the Cramer-Rao Lower Bound (CLRB) performances of Time Difference of Arrival (TDOA) and Frequency Difference of Arrival (FDOA) methods when there are multiple emitters. The TDOA and FDOA values between two receivers can be simultaneously estimated by using the so-called Complex Ambiguity Function (CAF). In the case of multiple emitters, there exist Inter Symbol Interferences (ISIs) in the measurement data. Therefore, it is required to reduce the effect of ISI and provide a performance evaluation method of TDOA and FDOA estimations. In order to eliminate the ISIs, using of a filter bank before calculating CAF is proposed when the carrier frequencies of the emitters are different to one another. Angle of Arrival (AOA) or Received Signal Strength (RSS) methods before calculating CAF were proposed to reduce the ISIs when the carrier frequencies are the same. In order to evaluate the CRLB of TDOA and FDOA estimations, we employed the conditional probability distribution method and described the numerical comparison results.

Convolutional Neural Network based Audio Event Classification

  • Lim, Minkyu;Lee, Donghyun;Park, Hosung;Kang, Yoseb;Oh, Junseok;Park, Jeong-Sik;Jang, Gil-Jin;Kim, Ji-Hwan
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.6
    • /
    • pp.2748-2760
    • /
    • 2018
  • This paper proposes an audio event classification method based on convolutional neural networks (CNNs). CNN has great advantages of distinguishing complex shapes of image. Proposed system uses the features of audio sound as an input image of CNN. Mel scale filter bank features are extracted from each frame, then the features are concatenated over 40 consecutive frames and as a result, the concatenated frames are regarded as an input image. The output layer of CNN generates probabilities of audio event (e.g. dogs bark, siren, forest). The event probabilities for all images in an audio segment are accumulated, then the audio event having the highest accumulated probability is determined to be the classification result. This proposed method classified thirty audio events with the accuracy of 81.5% for the UrbanSound8K, BBC Sound FX, DCASE2016, and FREESOUND dataset.

A Study on the Electromagnetic Transients at Switching Capacitor Banks in a Electric Distribution Electric Power Distribution Substation (배전변전소에서 캐패시터 뱅크 투입시 일어나는 전자과도 현상에 관한 연구)

  • 김경철
    • Journal of the Korean Institute of Illuminating and Electrical Installation Engineers
    • /
    • v.16 no.1
    • /
    • pp.92-99
    • /
    • 2002
  • Transient in an electric distribution system are mainly generated by switching. This paper presents analysis of switching surge and means of limiting the voltage magnification transient for high voltage power systems by using the EDSA's EMTAP software package. One means of limiting the voltage magnification transient is to convert the end-user power factor contraction capacitor banks to harmonics filters. An inductance in series with the capacitor bank was used to decrease the transient voltage at the customer bus to acceptable levels. And also simulation results used the EDSA harmonics analysis program show the effect of harmonics reduction.

Real Time 1/3 Octave Band Control System for High Intensity Acoustic Chamber (음향 챔버 내부의 1/3 옥타브 스펙트럼 실시간 제어 시스템)

  • Kim, Young-Key;Kim, Hong-Bae;Moon, Sang-Mu;Woo, Sung-Hyun;Lee, Sang-Seol
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2002.11b
    • /
    • pp.881-885
    • /
    • 2002
  • This paper presents the performance and the algorithm of a 1/3-octave band spectrum control system. The system is developed to provide various spectrums in a high intensity acoustic chamber. The required spectrum, which usually comes from launch vehicle company, starts from 25Hz band and ends 10kHz band. Automatic spectrum control system is preferred since the system requires short settling time to guarantee the safety of test objects and to reduce the amount of operating gas. The developed system adapted a PCI data-acquisition/signal-generation board installed in a personal computer to implement whole control logic. The control software used three cascade digital Butterworth filters using software. The filers are designed following ANSI S1.11 standard to implement 1/3 octave band filter bank. The graphical user interface of the system guides the user to follow standard operation procedure. The averaged control spectrum showed less than 0.05 dB in every running 1/3-octave band.

  • PDF

Digital Watermarking for JPEG2000 (JPEG2000을 위한 디지털 워터마킹)

  • 서용석;주상현;정호열
    • Journal of Broadcast Engineering
    • /
    • v.6 no.1
    • /
    • pp.32-40
    • /
    • 2001
  • In this paper, we propose a DWT (discrete Wavelet Transform) based watermarking method, which can be conveniently Integrated In the up-coming JPEG2770 baseline system. Although Conventional DWT based watermarking techniques insert watermark signal Into wavelet coefficients after the transform, our proposed method embeds a watermark into wavelet coefficients obtained from the ongoing process of lifting for DWT. The proposed method allows us to selectively determine frequency characteristics of the coefficients where the watermark is embedded. so that the Inserted watermark cannot be removed or altered even when the filter-bank for DWT is known. Through the simulation, we show that the proposed method is more secure and more robust against various attacks than conventional DWT barred watermarking techniques.

  • PDF

Emotion recognition from speech using Gammatone auditory filterbank

  • Le, Ba-Vui;Lee, Young-Koo;Lee, Sung-Young
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2011.06a
    • /
    • pp.255-258
    • /
    • 2011
  • An application of Gammatone auditory filterbank for emotion recognition from speech is described in this paper. Gammatone filterbank is a bank of Gammatone filters which are used as a preprocessing stage before applying feature extraction methods to get the most relevant features for emotion recognition from speech. In the feature extraction step, the energy value of output signal of each filter is computed and combined with other of all filters to produce a feature vector for the learning step. A feature vector is estimated in a short time period of input speech signal to take the advantage of dependence on time domain. Finally, in the learning step, Hidden Markov Model (HMM) is used to create a model for each emotion class and recognize a particular input emotional speech. In the experiment, feature extraction based on Gammatone filterbank (GTF) shows the better outcomes in comparison with features based on Mel-Frequency Cepstral Coefficient (MFCC) which is a well-known feature extraction for speech recognition as well as emotion recognition from speech.

Control Strategy Based on Equivalent Fundamental and Odd Harmonic Resonators for Single-Phase DVRs

  • Teng, Guofei;Xiao, Guochun;Hu, Leilei;Lu, Yong;Kafle, Yuba Raj
    • Journal of Power Electronics
    • /
    • v.12 no.4
    • /
    • pp.654-663
    • /
    • 2012
  • In this paper, a digital control strategy based on equivalent fundamental and odd harmonic resonators is proposed for single-phase DVRs. By using a delay block, which can be equivalent to a bank of resonators, it rejects the fundamental and odd harmonic disturbances effectively. The structure of the single closed-loop control system consists of a delay block, a proportional gain and a set of zero phase notch filters. The principle of the controller design is discussed in detail to ensure the stability of the system. Both the supply voltage and the load current feedforwards are used to improve the response speed and the ability to eliminate disturbances. The proposed controller is simple in terms of its structure and implementation. It has good performances in harmonic compensation and dynamic response. Experimental results from a 2kW DVR prototype confirm the validity of the design procedure and the effectiveness of the control strategy.

Support Vector Machine Based Diagnostic System for Thyroid Cancer using Statistical Texture Features

  • Gopinath, B.;Shanthi, N.
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.14 no.1
    • /
    • pp.97-102
    • /
    • 2013
  • Objective: The aim of this study was to develop an automated computer-aided diagnostic system for diagnosis of thyroid cancer pattern in fine needle aspiration cytology (FNAC) microscopic images with high degree of sensitivity and specificity using statistical texture features and a Support Vector Machine classifier (SVM). Materials and Methods: A training set of 40 benign and 40 malignant FNAC images and a testing set of 10 benign and 20 malignant FNAC images were used to perform the diagnosis of thyroid cancer. Initially, segmentation of region of interest (ROI) was performed by region-based morphology segmentation. The developed diagnostic system utilized statistical texture features derived from the segmented images using a Gabor filter bank at various wavelengths and angles. Finally, the SVM was used as a machine learning algorithm to identify benign and malignant states of thyroid nodules. Results: The SVMachieved a diagnostic accuracy of 96.7% with sensitivity and specificity of 95% and 100%, respectively, at a wavelength of 4 and an angle of 45. Conclusion: The results show that the diagnosis of thyroid cancer in FNAC images can be effectively performed using statistical texture information derived with Gabor filters in association with an SVM.

An Efficient Computation of FFT for MPEG/Audio Psycho-Acoustic Model (MPEG 심리음향모델의 고속 구현을 위한 효율적 FFT 연산)

  • 송건호;이근섭;박영철;윤대희
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.41 no.6
    • /
    • pp.261-269
    • /
    • 2004
  • In this paper, an efficient algorithm for computing in the MPEG/audio Layer Ⅲ (MP3) encoder is proposed. The proposed algerian performs a full-band 1024-point FFT by computing 32-point FFT's of 32 subband outputs. To reduce the aliasing caused by the analysis filter bank, an aliasing cancellation butterfly is developed. A major benefit of the proposed algorithm is the computational saving. By using the proposed algorithm, it is possible to save 40~50% of computations for FFT, which results in about 20% reduction of the PAM-2 complexity.

Design of 64-Bit Guide Sensor for Automatic Guided Vehicle (무인운반차용 16비트 가이드 센서 설계)

  • Lee, Ju-Won;Cho, Su-Hyeon;Lee, Dong-Chang;Kang, Seong-In
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2015.05a
    • /
    • pp.915-916
    • /
    • 2015
  • The main sensor of AGV is the guide sensor in order to detect the path, and the sensor consists of 8 or 16-magneto resistance devices arranged by with 10mm. In controlling the AGV posture by using the sensor, AGV is occurred left/right shaking frequently. So, for driving stability of AGV, An accuracy of the sensor should be improved. Therefore, this study proposed sensor signal processing method to improve accuracy of guide sensor, and implemented. The accuracy of sensor in experimentation showed 2.84[mm]. In designing the sensor for controlling AGV posture, the proposed method will be effective.

  • PDF