• Title/Summary/Keyword: Pitch rate

Search Result 530, Processing Time 0.021 seconds

Korean & Native Speakers' High -low Range Differences in F0 and its Role in Pronunciation Assessment (한국인과 원어민 영어 발화의 F0 고저 범위 차이와 발음 평가에 있어서 그 역할)

  • Rhee, Seok-Chae;Cho, Chul-Hyun;Moon, Sun-Young
    • Speech Sciences
    • /
    • v.10 no.4
    • /
    • pp.93-103
    • /
    • 2003
  • This study examines the high-low pitch range differences between Korean and native English speakers' utterances of 36 English sentences, and explores the role of the pitch fluctuation range in the pronunciation assessment. Findings are: i) Korean speakers implement half as much pitch fluctuation as the native speakers with the achievement rate of, on an average, 47%. Especially, pitch fluctuation range is below the average in the sentence types of exclamation, yes-no & wh-questions, ii) the reason why Korean speakers' pitch fluctuation range is only half the range of the native speakers is that Korean speakers do not raise the pitch at certain sentence positions so much as the native speakers, and iii) the range of pitch fluctuation plays an important role in distinguishing the utterances of the native speakers from those of Korean speakers, but does not in rating the English utterances among Korean speakers.

  • PDF

A Research of Attitude Angle Control Structure for Thrust Vector Control of Missiles at Initial Boosting Phase (초기 추력편향제어를 위한 자세각제어 구조에 대한 연구)

  • Kim, Boo-Min;Whang, Ick-ho
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.66 no.10
    • /
    • pp.1523-1531
    • /
    • 2017
  • In general, the thrust vector control should be fast and stable in the initial launch phase. Two types of conventional controllers, one is for pitch angle control and the other is for pitch rate control, are designed based on the equation of motion without aerodynamics and are compared in the viewpoints of the stability margin and the time response performance. Also analyzed are the rejection capabilities to cope with high aerodynamic disturbances caused by high angle of attacks in initial booting phase. Additionally, time response features at actuator saturation are investigated. Based on those results, we suggest a controller structure which is more suitable for thrust vector control of missiles at initial booting phase.

A Fast Harmonic Estimation Method for Low Bit Rate Harmonic Speech Coders

  • Park, Yong-Soo;Youn, Dae-Hee;Kang, Tae-lk
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.4E
    • /
    • pp.24-30
    • /
    • 2001
  • This paper describes a fast harmonic estimation, referred to as Delta Adjustment (DA), using a low resolution pitch. The presented DA method is based on modification of the Generalized Dual Excitation (GDE) technique[1] which was proposed to improve speech enhancement performance. We introduce the GDE technique and modify it to be suitable for low bit rate harmonic coding that uses only an integer pitch estimate. Unlike the GDE, the DA matches a frequency-warped version of the original spectrum that conforms to a fixed pitch at all harmonic bands. In addition, complexity and performance of the presented method are described in comparison with those of the conventional Fractional Pitch (FP) based harmonic estimation. Experimental results showed that the DA algorithm significantly reduces the complexity of the FP method while maintaining the performance.

  • PDF

Study on Performance Prediction of Industrial Axial Flow Fan with Adjustable Pitch Blades (산업용 조정 피치형 축류송풍기의 성능예측에 관한 연구)

  • Koo, Jae-In;Kim, Chang-Soo;Chung, Jin-Teak;Kim, Kwang-Ho
    • 유체기계공업학회:학술대회논문집
    • /
    • 2001.11a
    • /
    • pp.30-34
    • /
    • 2001
  • In the present study, we studied the method of predicting the on-design and on-design point performance of axial flow fan with adjustable pitch blades. With the change of stagger angle of axial flow fan with adjustable pitch blade, flow rate and pressure can be changed. Because of this merit adjustable pitch fans are used in many industrial facility. When changing stagger angle or estimating the performance at a wide range of off-design condition, incidence angle changes greatly as the flow rate changes. Therefore, the deviation angle at the blade exit is estimated by the correlation considering the effects of blade design, incidence angle variation. In the loss model, we used known pressure loss model for blade boundary layer and wake, secondary flow, endwall boundary layer and tip leakage flow. The results of modified deviation angle model and experiment were compared for the usefulness of the modified model.

  • PDF

On a Study of the Reduction of Bit Rate by the Preprocessing of PSOLA Coding Technique in the G. 723.1 Vocoder (PSOLA 전처리과정을 이용한 G.723.1 보코더의 전송률 감소에 관한 연구)

  • 장경아;조성현;배명진
    • Proceedings of the IEEK Conference
    • /
    • 2002.06d
    • /
    • pp.401-404
    • /
    • 2002
  • In general, speech coding methods are classified into the following three categories: the waveform coding, the source coding and the hybrid coding. In this paper, First, the reference waveform is detected after searching the pitch period by NAMDF similarity and similarity between the reference waveform and the waveform each pitch period. It made a decision whether the waveform is compressed with the threshold of similarity. If the waveform is compressed only magnitude and pitch information is transmitted into the input of G.723.1 vocoder. Performing through the G.723.1 vocoder, the waveform is restored with the magnitude and pitch information by PSOLA synthesis method. The result of simulation with proposed algorithm has a 31% reduction of bit rate than the standard 5.3kbps G.723.1 ACELP vocoder.

  • PDF

A Study on the Pitch Alteration Technique by Subband Scaling in Speech Signal (서브밴드 스케일링에 의한 음성신호의 피치변경법에 관한 연구)

  • Kim, Young-Kyu;Bae, Myung-Jin
    • Speech Sciences
    • /
    • v.10 no.4
    • /
    • pp.137-147
    • /
    • 2003
  • Speech synthesis can classify by synthesis way, that is waveform coding, source coding and mixture coding. Specially, waveform coding is suitable for high quality synthesis. However, it is not desirable by synthesis techniques of syllable or phoneme unit because it do not separate and handles excitation and formant part. Therefore, there is a need for pitch alteration method applied in synthesis by the rule in waveform coding. This study propose about pitch alteration method that use spectrum scaling after do to flatten spectra by subband linear approximation to minimize spectrum distortion. This paper show evaluation whether show excellency of some measure compared with LPC, Cepstrum, lifter function and method that propose. estimation method seeks distribution of each flattened signal and measured degree of flattened spectra Signal flattened is normalized, So that highest point amounts to zero, and distribution of signal ,whose average is zero, is calculated. this show result that measure the spectrum distortion rate to estimate performance of method that propose. The average spectrum distortion rate was kept below the average 2.12%, so the method that propose is superiors than existent method.

  • PDF

A Study On the Disguised Voice - From a prosodic point of view - (위장발화에 대한 연구 - 운율적 특성을 중심으로 -)

  • Cho Minha;Nho Seogeun;Song Minkyu;Shin Jiyoung;Kang Sunmee
    • Proceedings of the KSPS conference
    • /
    • 2003.05a
    • /
    • pp.191-195
    • /
    • 2003
  • The aim of this paper is to analyze the phonetic features for disguised voice. In this paper we examined the features such as phonation types, pitch range, speech rate, intonation type and boundary tones etc. So the result of the analysis is as follows. : $\circled1$ Phonation types are very important manner of disguised voice for male subjects. $\circled2$ Pitch range and average of pitch value is very important cue for speaker verification. $\circled3$ pitch contour, speech rate and boundary tones can be a secondary cue for speaker verification.

  • PDF

A Study on the Pitch Detection of Speech Harmonics by the Peak-Fitting (음성 하모닉스 스펙트럼의 피크-피팅을 이용한 피치검출에 관한 연구)

  • Kim, Jong-Kuk;Jo, Wang-Rae;Bae, Myung-Jin
    • Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.85-95
    • /
    • 2003
  • In speech signal processing, it is very important to detect the pitch exactly in speech recognition, synthesis and analysis. If we exactly pitch detect in speech signal, in the analysis, we can use the pitch to obtain properly the vocal tract parameter. It can be used to easily change or to maintain the naturalness and intelligibility of quality in speech synthesis and to eliminate the personality for speaker-independence in speech recognition. In this paper, we proposed a new pitch detection algorithm. First, positive center clipping is process by using the incline of speech in order to emphasize pitch period with a glottal component of removed vocal tract characteristic in time domain. And rough formant envelope is computed through peak-fitting spectrum of original speech signal infrequence domain. Using the roughed formant envelope, obtain the smoothed formant envelope through calculate the linear interpolation. As well get the flattened harmonics waveform with the algebra difference between spectrum of original speech signal and smoothed formant envelope. Inverse fast fourier transform (IFFT) compute this flattened harmonics. After all, we obtain Residual signal which is removed vocal tract element. The performance was compared with LPC and Cepstrum, ACF. Owing to this algorithm, we have obtained the pitch information improved the accuracy of pitch detection and gross error rate is reduced in voice speech region and in transition region of changing the phoneme.

  • PDF

A Study on the Pitch Contour Generator with Neural Network in the Isolated Words (신경망을 이용한 고립단어에서의 피치변화곡선 발생기에 관한 연구)

  • Lim Unchun;Kwak Jingu;Chang Sokwang
    • Proceedings of the KSPS conference
    • /
    • 1996.02a
    • /
    • pp.137-155
    • /
    • 1996
  • The purpose of this paper is to generate a pitch contour which is affected by tile phonetic environment and the number of syllables in each Korean isolated word using a neural network. To do this, we analyzed a set of 513 Korean isolated words, consisting of 1-4 syllables and extracted the pitch contour and the duration of each phoneme in all the words. The total number of phonemes we analyzed is about 3800. After that we approximated the pitch contour with a 1st order polynominal by a regression analysis. We could get the slope, the initial pitch and the duration of each phoneme. We used these 3 parameters as the target pattern of the neural network and let the neural network learn the rule of the variation of the pitch and duration, which was affected by the phonetic environment of each phoneme. We used 7 consecutive phoneme strings as an input pattern for a neural network to make the network learn the effect of phonetic environment around the center phoneme. In the learning phase, we used 3545 items(463 words) as target patterns which contained the phonetic environment of front and rear 3 phonemes and the neural network showed the correctness rate of 98.43%, 98.59%, 97.7% in the estimation of the duration, the slope, the initial pitch. In the recall phase, we tested the performance of tile neural network with 251 items(50 words) which weren't need as learning data and we could get the good correctness rate of 97.34%, 95.45%, 96.3% in the generation of the duration, the slope, and the initial pitch of each phoneme.

  • PDF