• Title/Summary/Keyword: Pitch detect

Search Result 74, Processing Time 0.022 seconds

A Study on Pitch Period Detection of Speech Signal Using Modified AMDF (변형된 AMDF를 이용한 음성 신호의 피치 주기 검출에 관한 연구)

  • Seo, Hyun-Soo;Bae, Sang-Bum;Kim, Nam-Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • v.9 no.1
    • /
    • pp.515-519
    • /
    • 2005
  • Pitch period that is a important factor in speech signal processing is used in various applications such as speech recognition, speaker identification, speech analysis and synthesis. So many pitch detection algoritms have been studied until now. AMDF which is one of pitch period detection algorithms chooses the time interval from valley point to valley point as pitch period. In selection of valley point to detect pitch period, complexity of the algoritm is increased. So in this paper we proposed the simple algorithm using modified AMDF that detects global minimum valley point as pitch period of speech signal and compared existing methods with it through simulation.

  • PDF

Wavelet-based Pitch Detector for 2.4 kbps Harmonic-CELP Coder (2.4 kbps 하모닉-CELP 코더를 위한 웨이블렛 피치 검출기)

  • 방상운;이인성;권오주
    • The Journal of the Acoustical Society of Korea
    • /
    • v.22 no.8
    • /
    • pp.717-726
    • /
    • 2003
  • This paper presents the methods that design the Wavelet-based pitch detector for 2,4 kbps Harmonic-CELP Coder, and that achieve the effective waveform interpolation by decision window shape of the transition region, Waveform interpolation coder operates by encoding one pitch-period-sized segment, a prototype segment, of speech for each frame, generate the smooth waveform interpolation between the prototype segments for voiced frame, But, harmonic synthesis of the prototype waveforms between previous frame and current frame occur not only waveform errors but also discontinuity at frame boundary on that case of pitch halving or doubling, In addtion, in transition region since waveform interpolation coder synthesizes the excitation waveform by using overlap-add with triangularity window, therefore, Harmonic-CELP fail to model the instantaneous increasing speech and synthesis waveform linearly increases, First of all, in order to detect the precise pitch period, we use the hybrid 1st pitch detector, and increse the precision by using 2nd ACF-pitch detector, Next, in order to modify excitation window, we detect the onset, offset of frame by GCI, As the result, pitch doubling is removed and pitch error rate is decreased 5.4% in comparison with ACF, and is decreased 2,66% in comparison with wavelet detector, MOS test improve 0.13 at transition region.

Measurement of Grating Pitch Standards using Optical Diffractometry and Uncertainty Analysis (광 회절계를 이용한 격자 피치 표준 시편의 측정 및 불확도 해석)

  • Kim Jong-Ahn;Kim Jae-Wan;Park Byong-Chon;Kang Chu-Shik;Eom Tae-Bong
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.23 no.8 s.185
    • /
    • pp.72-79
    • /
    • 2006
  • We measured grating pitch standards using optical diffractometry and analyzed measurement uncertainty. Grating pitch standards have been used widely as a magnification standard for a scanning electron microscope (SEM) and a scanning probe microscope (SPM). Thus, to establish the meter-traceability in nano-metrology using SPM and SEM, it is important to certify grating pitch standards accurately. The optical diffractometer consists of two laser sources, argon ion laser (488 nm) and He-Cd laser (325 nm), optics to make an incident beam, a precision rotary table and a quadrant photo-diode to detect the position of diffraction beam. The precision rotary table incorporates a calibrated angle encoder, enabling the precise and accurate measurement of diffraction angle. Applying the measured diffraction angle to the grating equation, the mean pitch of grating specimen can be obtained very accurately. The pitch and orthogonality of two-dimensional grating pitch standards were measured, and the measurement uncertainty was analyzed according to the Guide to the Expression of Uncertainty in Measurement. The expanded uncertainties (k = 2) in pitch measurement were less than 0.015 nm and 0.03 nm for the specimen with the nominal pitch of 300 nm and 1000 nm. In the case of orthogonality measurement, the expanded uncertainties were less than $0.006^{\circ}$. In the pitch measurement, the main uncertainty source was the variation of measured pitch values according to the diffraction order. The measurement results show that the optical diffractometry can be used as an effective calibration tool for grating pitch standards.

On the Center Pitch Estimation by using the Spectrum Leakage Phenomenon for the Noise Corrupted Speech Signals (배경 잡음하에서 스펙트럼 누설현상을 이용한 음성신호의 중심 피치 검출)

  • Kang, Dong-Kyu;Bae, Myung-Jin;Ann, Sou-Guil
    • The Journal of the Acoustical Society of Korea
    • /
    • v.10 no.1
    • /
    • pp.37-46
    • /
    • 1991
  • The pitch estimation algorithms witch have proposed until now are difficult to detect wide range pitches regardless of age or sex. A little deviation are observed with reference to the center pitch in the distribution diagram of pitches, since pitches are characterized by a physical limitation of the coarticulation mechanism. If the center pitches are refered to the accurate pitch extraction procedure, the algorithms will be not only simplified in procedure but also improved in accuracy. In this paper, we proposed an algorithm that the center pitches are accurately detected by using the spectrum leakage phenomenon for the noise speech signals.

  • PDF

Pitch Detection Using Wavelet Transform (웨이브렛 변환을 이용한 피치검출)

  • Seok, Jong-Won;Son, Young-Ho;Bae, Keun-Sung
    • Speech Sciences
    • /
    • v.5 no.1
    • /
    • pp.23-33
    • /
    • 1999
  • Mallat has shown that, with a proper choice of wavelet function, the local maxima of wavelet transformed signal indicate a sharp variation in the signal. Since the glottal closure causes sharp discontinuities in the speech signal, dyadic wavelet transform can be useful for detecting abrupt change in the voiced sounds, i.e., epochs. In this paper, we investigate the glottal closure instants obtained from the wavelet analysis of speech signal and compare them with those obtained from the EGG signal. Then, we detect pitch period of speech signal on the basis of these results. Experimental results demonstrated that local maxima of wavelet transformed signal give accurate estimation of epoch and pitch periods of voiced sound obtained by the proposed algorithm also correspond to those from EGG well.

  • PDF

On a Pitch Extraction of Speech Signal using Residual Signal of the Uniform Quantizer (균일양자화기의 잔여신호를 이용한 음성신호의 피치검출)

  • Bae, Myung-Jin;Han, Ki-Cheon;Cha, Jin-Jong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.2
    • /
    • pp.36-40
    • /
    • 1997
  • In speech signal processing, it is necessary and important to detect exactly the pitch. The algorithms of pitch extraction which have been proposed until now are difficult exactly pitches over wide range speech signals. In this paper, thus, we proposed a new pitch detection algorithm that finds the fundamental period of speech signal in the residual signal quantized by the uniform quantizer as PCM. The proposed method shows little gross error of average 0.25% for clean speech and average 3.39% for SNR of 0dB. It also achieves results of the pitch contours, improving the accuracy of pitch detection in transient phonemes and noise environments.

  • PDF

Pitch Period Detection Algorithm Using Rotation Transform of AMDF (AMDF의 회전변환을 이용한 피치 주기 검출 알고리즘)

  • Seo, Hyun-Soo;Bae, Sang-Bum;Kim, Nam-Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • v.9 no.2
    • /
    • pp.1019-1022
    • /
    • 2005
  • As recent information communication technology is rapidly developed, a lot of researches related to speech signal processing have been processed. So pitch period is applied as important factor to many application fields such as speech recognition, speaker identification, speech analysis and synthesis. Therefore, many algorithms related to pitch detection have been proposed in time domain and frequency domain and AMDF(average magnitude difference function) which is one of pitch detection algorithms in time domain chooses time interval from valley to valley as pitch period. But, in selection of valley point to detect pitch period, complexity of the algorithm is increased. So in this paper we proposed pitch detection algorithm using rotation transform of AMDF, that taking the global minimum valley point as pitch period and established a threshold about the phoneme in beginning portion, to exclude pitch period selection. and compared existing methods with proposed method through simulation.

  • PDF

A Study on the Robust Pitch Period Detection Algorithm in Noisy Environments (소음환경에 강인한 피치주기 검출 알고리즘에 관한 연구)

  • Seo Hyun-Soo;Bae Sang-Bum;Kim Nam-Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2006.05a
    • /
    • pp.481-484
    • /
    • 2006
  • Pitch period detection algorithms are applied to various speech signal processing fields such as speech recognition, speaker identification, speech analysis and synthesis. Furthermore, many pitch detection algorithms of time and frequency domain have been studied until now. AMDF(average magnitude difference function) ,which is one of pitch period detection algorithms, chooses a time interval from the valley point to the valley point as the pitch period. AMDF has a fast computation capacity, but in selection of valley point to detect pitch period, complexity of the algorithm is increased. In order to apply pitch period detection algorithms to the real world, they have robust prosperities against generated noise in the subway environment etc. In this paper we proposed the modified AMDF algorithm which detects the global minimum valley point as the pitch period of speech signals and used speech signals of noisy environments as test signals.

  • PDF

On A Reduction of Pitch Searching Time by Preprocessing in the CELP Vocoder (CELP 보코더에서 전처리에 의한 피치검색 시간의 단축)

  • Kim, Dae-Sik;Bae, Myeong-Jin;Kim, Jong-Jae;Byun, Kyung-Jin;Han, Ki-Chun;Yoo, Hah-Young
    • The Journal of the Acoustical Society of Korea
    • /
    • v.13 no.3
    • /
    • pp.33-40
    • /
    • 1994
  • Code Excited Linear Prediction(CELP) speech coders exhibit good performance at data rates below 4.8 kbps. This major drawback of CELP type coders is required much computation. In this paper, we propose a new pitch search method that preserves the quality of the CELP vocoder with reducing complexity. In the pitch searching, we detect the segments of high correlation by a simple preprocessing, and then carry out the pitch searching only for the segments obtained by the preprocessing. By using the proposed method, we can get approximately $77\%$ complexity reduction in the pitch search.

  • PDF

Multi-temporal Analysis of High-resolution Satellite Images for Detecting and Monitoring Canopy Decline by Pine Pitch Canker

  • Lee, Hwa-Seon;Lee, Kyu-Sung
    • Korean Journal of Remote Sensing
    • /
    • v.35 no.4
    • /
    • pp.545-560
    • /
    • 2019
  • Unlike other critical forest diseases, pine pitch canker in Korea has shown rather mild symptoms of partial loss of crown foliage and leaf discoloration. This study used high-resolution satellite images to detect and monitor canopy decline by pine pitch canker. To enhance the subtle change of canopy reflectance in pitch canker damaged tree crowns, multi-temporal analysis was applied to two KOMPSAT multispectral images obtained in 2011 and 2015. To assure the spectral consistency between the two images, radiometric corrections of atmospheric and shadow effects were applied prior to multi-temporal analysis. The normalized difference vegetation index (NDVI) of each image and the NDVI difference (${\Delta}NDVI=NDVI_{2015}-NDVI_{2011}$) between two images were derived. All negative ΔNDVI values were initially considered any pine stands, including both pitch canker damaged trees and other trees, that showed the decrease of crown foliage from 2011 to 2015. Next, $NDVI_{2015}$ was used to exclude the canopy decline unrelated to the pitch canker damage. Field survey data were used to find the spectral characteristics of the damaged canopy and to evaluate the detection accuracy from further analysis.Although the detection accuracy as assessed by limited number of field survey on 21 sites was 71%, there were also many false alarms that were spectrally very similar to the damaged canopy. The false alarms were mostly found at the mixed stands of pine and young deciduous trees, which might invade these sites after the pine canopy had already opened by any crown damages. Using both ${\Delta}NDVI$ and $NDVI_{2015}$ could be an effective way to narrow down the potential area of the pitch canker damage in Korea.