• Title/Summary/Keyword: frequency features

Search Result 1,636, Processing Time 0.028 seconds

Image Retrieval Using the Color Feature and the Wavelet-Based Feature (색상특징과 웨이블렛 기반의 특징을 이용한 영상 검색)

  • 박종현;박순영;조완현
    • Proceedings of the IEEK Conference
    • /
    • 1999.11a
    • /
    • pp.487-490
    • /
    • 1999
  • In this paper we propose an efficient content-based image retrieval method using the color and wavelet based features. The color features are extracted from color histograms of the global image and the wavelet based features are extracted from the invariant moments of the high-pass band image through the spatial-frequency analysis of the wavelet transform. The proposed algorithm, called color and wavelet features based query(CWBQ), is composed of two-step query operations for efficient image retrieval: the coarse level filtering operation and the fine level matching operation. In the first filtering operation, the color histogram feature is used to filter out the dissimilar images quickly from a large image database. The second matching operation applies the wavelet based feature to the retained set of images to retrieve all relevant images successfully. The experimental results show that the proposed algorithm yields more improved retrieval accuracy with computationally efficiency than the previous methods.

  • PDF

PROSODY IN SPEECH TECHNOLOGY - National project and some of our related works -

  • Hirose Keikichi
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.15-18
    • /
    • 2002
  • Prosodic features of speech are known to play an important role in the transmission of linguistic information in human conversation. Their roles in the transmission of para- and non- linguistic information are even much more. In spite of their importance in human conversation, from engineering viewpoint, research focuses are mainly placed on segmental features, and not so much on prosodic features. With the aim of promoting research works on prosody, a research project 'Prosody and Speech Processing' is now going on. A rough sketch of the project is first given in the paper. Then, the paper introduces several prosody-related research works, which are going on in our laboratory. They include, corpus-based fundamental frequency contour generation, speech rate control for dialogue-like speech synthesis, analysis of prosodic features of emotional speech, reply speech generation in spoken dialogue systems, and language modeling with prosodic boundaries.

  • PDF

Statistical Extraction of Speech Features Using Independent Component Analysis and Its Application to Speaker Identification

  • 장길진;오영환
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.4
    • /
    • pp.156-156
    • /
    • 2002
  • We apply independent component analysis (ICA) for extracting an optimal basis to the problem of finding efficient features for representing speech signals of a given speaker The speech segments are assumed to be generated by a linear combination of the basis functions, thus the distribution of speech segments of a speaker is modeled by adapting the basis functions so that each source component is statistically independent. The learned basis functions are oriented and localized in both space and frequency, bearing a resemblance to Gabor wavelets. These features are speaker dependent characteristics and to assess their efficiency we performed speaker identification experiments and compared our results with the conventional Fourier-basis. Our results show that the proposed method is more efficient than the conventional Fourier-based features in that they can obtain a higher speaker identification rate.

A Method for Terrain Cover Classification Using DCT Features (DCT 특징을 이용한 지표면 분류 기법)

  • Lee, Seung-Youn;Kwak, Dong-Min;Sung, Gi-Yeul
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.13 no.4
    • /
    • pp.683-688
    • /
    • 2010
  • The ability to navigate autonomously in off-road terrain is the most critical technology needed for Unmanned Ground Vehicles(UGV). In this paper, we present a method for vision-based terrain cover classification using DCT features. To classify the terrain, we acquire image from a CCD sensor, then the image is divided into fixed size of blocks. And each block transformed into DCT image then extracts features which reflect frequency band characteristics. Neural network classifier is used to classify the features. The proposed method is validated and verified through many experiments and we compare it with wavelet feature based method. The results show that the proposed method is more efficiently classify the terrain-cover than wavelet feature based one.

A Single-Chip CMOS Digitally Synthesized 0-35 MHz Agile Function Generator

  • Meenakarn, C.;Thanachayanont, A.
    • Proceedings of the IEEK Conference
    • /
    • 2002.07c
    • /
    • pp.1984-1987
    • /
    • 2002
  • This paper describes the design and implementation of a single-chip digitally synthesized 0-35MHz agile function generator. The chip comprises an integrated direct digital synthesizer (DDS) with a 10-bit on- chip digital-to-analog converter (DAC) using an n-well single-poly triple-metal 0.5-$\mu\textrm{m}$ CMOS technology. The main features of the chip include maximum clock frequency of 100 MHz at 3.3-V supply voltage, 32-bit frequency tuning word resolution, 12-bit phase tuning word resolution, and an on-chip 10-bit DAC. The chip provides sinusoidal, ramp, saw-tooth, and random waveforms with phase and frequency modulation, and power-down function. At 100-MHz clock frequency, the chip covers a bandwidth from dc to 35 MHz in 0.0233-Hz frequency steps with 190-ns frequency switching speed. The complete chip occupies 12-mm$^2$die area and dissipates 0.4 W at 100-MHz clock frequency.

  • PDF

Method for Feature Extraction of Radar Full Pulses Based on EMD and Chaos Detection

  • Guo, Qiang;Nan, Pulong
    • Journal of Communications and Networks
    • /
    • v.16 no.1
    • /
    • pp.92-97
    • /
    • 2014
  • A novel method for extracting frequency slippage signal from radar full pulse sequence is presented. For the radar full pulse sequence received by radar interception receiver, radio frequency (RF) and time of arrival (TOA) of all pulses constitute a two-dimensional information sequence. In a complex and intensive electromagnetic environment, the TOA of pulses is distributed unevenly, randomly, and in a nonstationary manner, preventing existing methods from directly analyzing such time series and effectively extracting certain signal features. This work applies Gaussian noise insertion and structure function to the TOA-RF information sequence respectively such that the equalization of time intervals and correlation processing are accomplished. The components with different frequencies in structure function series are separated using empirical mode decomposition. Additionally, a chaos detection model based on the Duffing equation is introduced to determine the useful component and extract the changing features of RF. Experimental results indicate that the proposed methodology can successfully extract the slippage signal effectively in the case that multiple radar pulse sequences overlap.

Acoustic Characteristics of Vowels in Korean Distant-Talking Speech (한국어 원거리 음성의 모음의 음향적 특성)

  • Lee Sook-hyang;Kim Sunhee
    • MALSORI
    • /
    • v.55
    • /
    • pp.61-76
    • /
    • 2005
  • This paper aims to analyze the acoustic effects of vowels produced in a distant-talking environment. The analysis was performed using a statistical method. The influence of gender and speakers on the variation was also examined. The speech data used in this study consist of 500 distant-talking words and 500 normal words of 10 speakers (5 males and 5 females). Acoustic features selected for the analysis were the duration, the formants (Fl and F2), the fundamental frequency and the total energy. The results showed that the duration, F0, F1 and the total energy increased in the distant-talking speech compared to normal speech; female speakers showed higher increase in all features except for the total energy and the fundamental frequency. In addition, speaker differences were observed.

  • PDF

Video Segmentation and Key frame Extraction using Multi-resolution Analysis and Statistical Characteristic

  • Cho, Wan-Hyun;Park, Soon-Young;Park, Jong-Hyun
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.2
    • /
    • pp.457-469
    • /
    • 2003
  • In this paper, we have proposed the efficient algorithm that can segment the video scene change using a various statistical characteristics obtained from by applying the wavelet transformation for each frames. Our method firstly extracts the histogram features from low frequency subband of wavelet-transformed image and then uses these features to detect the abrupt scene change. Second, it extracts the edge information from applying the mesh method to the high frequency subband of transformed image. We quantify the extracted edge information as the values of variance characteristic of each pixel and use these values to detect the gradual scene change. And we have also proposed an algorithm how extract the proper key frame from segmented video scene. Experiment results show that the proposed method is both very efficient algorithm in segmenting video frames and also is to become the appropriate key frame extraction method.

Phoneme Recognition Using Frequency State Neural Network (주파수 상태 신경 회로망을 이용한 음소 인식)

  • Lee, Jun-Mo;Hwang, Yeong-Soo;Kim, Seong-Jong;Shin, In-Chul
    • The Journal of the Acoustical Society of Korea
    • /
    • v.13 no.4
    • /
    • pp.12-19
    • /
    • 1994
  • This paper reports a new structure for phoneme recognition neural network. The proposed neural network is able to deal with the structure of the frequency bands as well as the temporal structure of phonemic features which used in the conventional TSNN. We trained this neural network using the phonetics (아, 이, 오, ㅅ, ㅊ, ㅍ, ㄱ, ㅇ, ㄹ, ㅁ) and the phoneme recognition of this neural network was a little better than those of conventional TDNN and TSNN using only temporal structure of phonemic features.

  • PDF

Features Extraction of Tool Wear and its Detection using Neural Network (가공 재질에 따른 공구 마멸의 특성 추출과 신경회로망을 이용한 마멸 검출)

  • 이호영;조동우
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 1995.10a
    • /
    • pp.89-94
    • /
    • 1995
  • A16061, SB41 and SM45C was used for developing tool wear monitoring system in face milling. First of all, Neural networks of which input are 8 $_{th}$ order AR morel parameters, frequency band energies, cutting conditions was used to monitor tool wear for each material. Finally, A unified neural network, which has tensile strengths of each material as an additional input, was constructed to consider the effect three materials on the features of tool wear. It was verified that tensile strength is the one of properties of workpiece materials.s.

  • PDF