• Title/Summary/Keyword: pitch peak

Search Result 139, Processing Time 0.022 seconds

The Pitch Detection Using Variable LPF (Variable LPF에 의한 피치검출)

  • 백금란
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1993.06a
    • /
    • pp.88-92
    • /
    • 1993
  • In speech signal processing, it is necessary to detect exactly the pitch. The algorithms of pitch extraction which have been proposed until now are difficult to detect pitches over wide range speech signals. Thus we propose a new algorithm which uses the G-peak extraction to do it. It is the method that finds the most MZI(maximum zero-crossing interval) at each frame and convolve it with speech signal ; this is the same with passing speech signals to variable LPF. Finally we obtained the pitch, improve the accuracy of pitch detection and extract it with the high speed.

  • PDF

A Study on Korean, English and Japanese Speaker Recognitions Using the Peak and Valley Pitch Detection and the Fuzzy Theory (PVPF방법과 퍼지 이론을 이용한 한국어, 영어 및 일본어 화자 인식에 관한 연구)

  • Kim, Yeon-Suk
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.2
    • /
    • pp.522-533
    • /
    • 1999
  • This paper proposes speaker recognition algorithm which includes both the pitch parameter and the fuzzy inference. This study proposes a pitch detection method PVPF(peak and valley pitch detection fuction) by means of comparing spectra which utilizes the transform characteristics between time and frequency. In this paper, makes reference pattern using membership function and performs vocal tract recognition of common character using fuzzy pattern matching in order to include time variation width for non-linear utterance time.

  • PDF

Phoneme Separation and Establishment of Time-Frequency Discriminative Pattern on Korean Syllables (음절신호의 음소 분리와 시간-주파수 판별 패턴의 설정)

  • 류광열
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.16 no.12
    • /
    • pp.1324-1335
    • /
    • 1991
  • In this paper, a phoneme separation and an establishment of discriminative pattern of Korean phonemes are studied on experiment. The separation uses parameters such as pitch extraction, glottal peak pulse width of each pitch. speech duration. envelope and amplitude bias. The first pitch is extracted by deviations of glottal peak and width. energy and normalization on a bias on the top of vowel envelope. And then, it traces adjacent pitch to vowel in whole. On vewel, amethod to be reduced gliding pattern and the possible of vowel distinction to be used just second formant are proposed, and shrinking pitch waveform has nothing to do with pitch length is estimated. A pattern of envelope, spectrum, shrinking waveform, and a method of analysis by mutual relation among phonemes and manners of articulation on consonant are detected. As experimental results, 90% on vowel phoneme, 80% and 60% on initial and final consonant are discriminated.

  • PDF

Cross-linguistic Study of Perceptual Cues to F0 Variations (한·중 청자의 음높이 변화에 대한 지각 연구)

  • Yoon, Eunkyung;Cao, Wenkai
    • Journal of Korean language education
    • /
    • v.28 no.3
    • /
    • pp.25-51
    • /
    • 2017
  • This study aimed to identify the differences in pitch perception between tonal and non-tonal language listeners. A total of 60 Korean and Chinese listeners participated in the perception test. A two-syllable nonsense word /paba/ was manipulated in five steps. The pitch height or contour on the second syllable was raised or lowered. Both groups were asked to select which of the two syllables had the higher pitch. The findings showed that the majority of Korean listeners (GK) perceived decreased pitch as each peak of the syllable was lowered and perceived increased pitch as it was raised, which means the pitch height is a primary perceptual cue for GK. However, Chinese listeners (GC) perceived sensitive pitch movements as the pitch contour changed. GC's perception may presumably be affected by the L1's tone sandhi. We found it reasonable to assume that language experience has a significant effect on the cross-linguistic perceptual differences between tone and non-tonal language listeners.

Implementation of a Real-time SIFT Pitch Detector (실시간 SIFT 기본주파수 검출기의 구현)

  • Lee, Jong Seok;Lee, Sang Uk
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.23 no.1
    • /
    • pp.101-113
    • /
    • 1986
  • In this paper, a real-time pitch detector LPC vocoder as implemented on a high speed digital signal processor, NEC 7720, is described. The pitch detector was based mainly on the SIFT algorithm. The SIFT pitch detector consists primarily of a digital low pass filter, inverse filter, computation of autocorrelation, a peak picker, interpolation, V/UV defcision and a final pitch smoother. In our approach, modification, mainly on the V/UV decision and a final pitch smoother, was made to estimate more accurate pitches. An 16-bit fixed-point aithmatic was employed for all necessary computation and the simulated results were compared with the eye detected pitches obtained from real speech data. The pitch detector occupies 98.8% of the instruction ROM, 37% of the data ROM, and 94% of internal RAM and takes 15.2ms to estimate a pitch when an analysis frame is consisted of 128 sampled speech data. It is observed that the tested results were well agreed with the computer simulation results.

  • PDF

The continuous or categorical effects for HH vs. HL and HH vs. LH in lexical pitch accent contrasts of Korean

  • Kim, Jungsun
    • Phonetics and Speech Sciences
    • /
    • v.6 no.4
    • /
    • pp.53-65
    • /
    • 2014
  • The current research examines whether pitch contour shapes in North Kyungsang pitch accent contrasts provide a phonetic dimension for phonological discreteness in a mimicry task. Two pitch accent continua resynthesized were created for HH vs. HL and HH vs. LH. To confirm a phonetic dimension for accounting for pitch accent categories in North Kyungsang Korean, the mimicries of speakers of two dialects (i.e., North Kyungsang & South Cholla) were compared. One of the findings showed that, for North Kyungsang speakers, the range of mean f0 peak times was a phonetic dimension undergoing a continuous shift within a stimulus continuum for both HH vs. HL and HH vs. LH. On the other hand, for South Cholla speakers, there were no apparent shifts around categorical boundaries for either HH vs. HL or HH vs. LH. Regarding individual mimicries on f0 peak timing, there are many variations. For HH vs. LH, three North Kyungsang speakers showed a discrete pattern reflecting a shift in phonological categories, but for HH vs. HL, there was no such distinction showing a categorical shift, though there were statistically significant differences for two speakers. Interestingly, one of the North Kyungsang speakers showed a continuous phonetic dimension for both HH vs. HL and HH vs. LH. Lastly, the f0 valley timing did not exhibit a discrete or gradient phonetic dimension for speakers of either dialect. On the basis of these results, what is interesting is that the tonal target such as high tone in North Kyungsang pitch accent categories within the autosegmental-metrical (AM) theory may be realized within individual cognitive systems for representing the interaction of perception and production.

Litter Production and Decomposition in the Querces acutissima and Pinus rigida Forests (상수리나무림과 리기다소나무림의 낙엽 생산과 분해)

  • 문형태;주환택
    • The Korean Journal of Ecology
    • /
    • v.17 no.3
    • /
    • pp.345-353
    • /
    • 1994
  • Litter production and decomposition were investigated for 2 years in the oak, Quercus acutissima, and the pitch pine, Pinus rigida, stands in the vicinity of Kongju, Chungnam Province. Litter production was measured with litter trap at monthly basis. Litterbag method was used for the measurement of litter decomposition. Litter producion continued throughout the year, but showed a peak in autumn. Second peak in May or June was caused by falling of bud scales and reproductive organs. Average litter production in the oak and the pitch pine stands were $567.1g{\cdot}m^{-2}{\cdot}yr^{-1}\;and\;653.2g{\cdot}m^{-2}{\cdot}yr^{-1}$, respectively. Litter production in this study area were higher than those in other reports. Nutrient concentrations in litter were the highest in summer when the least litter production occurred, and the lowest in autumn when the greatest litter production occurred, except for calcium in the oak stand. Nutrient concentrations of the oak litter were higher than those in the pitch pine litter. After 1 year, % remaining mass of oak and pitch pine litter was 43.6% and 58%, respectively. After 21 months elapsed, % remaining mass of oak and pitch pine litter was 22.2% and 33.2%, respectively.

  • PDF

The High Speed Pitch Extraction of Speech Signals Using the Area Comparison Method (면적 비교법에 의한 고속 PITCH 추출)

  • 배명진;안수결
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.22 no.2
    • /
    • pp.13-17
    • /
    • 1985
  • In this paper, a new pitch extraction method, the area comparison method, is proposed. By the speech production model, the area of the first peak on a pitch interval of speech signals is emphasized. By using the above characteristics, this method have more advantages than the others for pitch extraction. The defective decision caused by an impulsive noise is minimized and the pre-filtering is not necessary for this method, because the intergration of signals takes place in the process.

  • PDF

Flattening Techniques for Pitch Detection (피치 검출을 위한 스펙트럼 평탄화 기법)

  • 김종국;조왕래;배명진
    • Proceedings of the IEEK Conference
    • /
    • 2002.06d
    • /
    • pp.381-384
    • /
    • 2002
  • In speech signal processing, it Is very important to detect the pitch exactly in speech recognition, synthesis and analysis. but, it is very difficult to pitch detection from speech signal because of formant and transition amplitude affect. therefore, in this paper, we proposed a pitch detection using the spectrum flattening techniques. Spectrum flattening is to eliminate the formant and transition amplitude affect. In time domain, positive center clipping is process in order to emphasize pitch period with a glottal component of removed vocal tract characteristic. And rough formant envelope is computed through peak-fitting spectrum of original speech signal in frequency domain. As a results, well get the flattened harmonics waveform with the algebra difference between spectrum of original speech signal and smoothed formant envelope. After all, we obtain residual signal which is removed vocal tract element The performance was compared with LPC and Cepstrum, ACF 0wing to this algorithm, we have obtained the pitch information improved the accuracy of pitch detection and gross error rate is reduced in voice speech region and in transition region of changing the phoneme.

  • PDF

Preparation of pitch-coated $TiO_2$ and their photocatalytic performance

  • Chen, Ming-Liang;Oh, Won-Chun
    • Journal of the Korean Crystal Growth and Crystal Technology
    • /
    • v.17 no.1
    • /
    • pp.23-29
    • /
    • 2007
  • Pitch-coated anatase $TiO_2$ typed was prepared by $CCl_4$ solvent mixing method with different mixing ratios. Since the carbon layers derived from pitch on the $TiO_2$ particles were porous, the pitch-coated $TiO_2$ sample series showed a good adsorptivity and photo decomposition activity. The BET surface area for the carbon layer in the sample increases to increasing with pitch contents. The SEM results present to the characterization of porous texture on the pitch-coated $TiO_2$ sample and pitch distributions on the surfaces for all the materials used. From XRD data a weak and broad carbon peak of graphene with pristine anatase peaks were observed in the X-ray diffraction patterns for the pitch-coated $TiO_2$. The EDX spectra show the presence of C, O and S with strong Ti peaks. Most of these samples are richer in carbon and major Ti metal than any other elements. Finally, the excellent photocatalytic activity of pitch-coated $TiO_2$ with slope relationship between relative concentration of MB ($c/c_o$) and t could be attributed to the homogeneous coated pitch on the external surface by $CCl_4$ solvent method.