• Title/Abstract/Keywords: pitch rate

530 search results (processing time: 0.024 s)

Korean & Native Speakers' High-low Range Differences in F0 and its Role in Pronunciation Assessment

  • 이석재;조철현;문선영
    • Speech Sciences / Vol. 10, No. 4 / pp. 93-103 / 2003
  • This study examines the differences in high-low pitch range between Korean and native English speakers' utterances of 36 English sentences, and explores the role of the pitch fluctuation range in pronunciation assessment. The findings are: i) Korean speakers produce about half as much pitch fluctuation as native speakers, with an average achievement rate of 47%, and their range falls below that average in exclamations, yes-no questions, and wh-questions; ii) Korean speakers' pitch fluctuation range is only half that of native speakers because they do not raise the pitch at certain sentence positions as much as native speakers do; and iii) the pitch fluctuation range plays an important role in distinguishing native speakers' utterances from those of Korean speakers, but not in rating English utterances among Korean speakers.

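One way to read the "achievement rate" in the abstract above is as a Korean speaker's high-low F0 range expressed as a percentage of a native speaker's range for the same sentence. The sketch below assumes that reading; the abstract does not define the measure, and the function names and F0 values are hypothetical.

```python
def f0_range(f0_hz):
    """Return the high-low F0 range in Hz, ignoring unvoiced frames (0 or None)."""
    voiced = [f for f in f0_hz if f]
    if not voiced:
        raise ValueError("no voiced frames in F0 track")
    return max(voiced) - min(voiced)


def achievement_rate(learner_f0, native_f0):
    """Learner's F0 range as a percentage of the native range (assumed definition)."""
    native_range = f0_range(native_f0)
    if native_range == 0:
        raise ValueError("native F0 range is zero")
    return 100.0 * f0_range(learner_f0) / native_range


# Hypothetical F0 tracks (Hz) for one sentence; 0 marks an unvoiced frame.
native = [0, 180, 260, 240, 150, 0, 140]
learner = [0, 170, 200, 190, 150, 0, 145]
rate = achievement_rate(learner, native)
print(f"achievement rate: {rate:.1f}%")
```

Averaging this percentage over sentences and speakers would give a single summary figure of the kind the abstract reports, although the study's actual averaging procedure is not stated.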

A Research of Attitude Angle Control Structure for Thrust Vector Control of Missiles at Initial Boosting Phase

  • 김부민;황익호
    • The Transactions of the Korean Institute of Electrical Engineers / Vol. 66, No. 10 / pp. 1523-1531 / 2017
  • In general, thrust vector control should be fast and stable in the initial launch phase. Two types of conventional controllers, one for pitch angle control and the other for pitch rate control, are designed based on the equation of motion without aerodynamics and are compared in terms of stability margin and time response performance. Their capability to reject the strong aerodynamic disturbances caused by high angles of attack in the initial boosting phase is also analyzed. Additionally, time response features under actuator saturation are investigated. Based on these results, we suggest a controller structure that is more suitable for thrust vector control of missiles in the initial boosting phase.

A Fast Harmonic Estimation Method for Low Bit Rate Harmonic Speech Coders

  • Park, Yong-Soo;Youn, Dae-Hee;Kang, Tae-Ik
    • The Journal of the Acoustical Society of Korea / Vol. 20, No. 4E / pp. 24-30 / 2001
  • This paper describes a fast harmonic estimation method, referred to as Delta Adjustment (DA), that uses a low-resolution pitch. The presented DA method is based on a modification of the Generalized Dual Excitation (GDE) technique[1], which was proposed to improve speech enhancement performance. We introduce the GDE technique and modify it to suit low bit rate harmonic coding that uses only an integer pitch estimate. Unlike the GDE, the DA matches a frequency-warped version of the original spectrum that conforms to a fixed pitch at all harmonic bands. In addition, the complexity and performance of the presented method are compared with those of conventional Fractional Pitch (FP) based harmonic estimation. Experimental results showed that the DA algorithm significantly reduces the complexity of the FP method while maintaining its performance.


Study on Performance Prediction of Industrial Axial Flow Fan with Adjustable Pitch Blades

  • 구재인;김창수;정진택;김광호
    • Korean Fluid Machinery Association: Conference Proceedings / Proceedings of the Korean Fluid Machinery Association 2001 Fluid Machinery R&D Conference / pp. 30-34 / 2001
  • In the present study, we studied a method for predicting the on-design and off-design point performance of an axial flow fan with adjustable pitch blades. Changing the stagger angle of such a fan changes its flow rate and pressure, and because of this advantage adjustable pitch fans are used in many industrial facilities. When the stagger angle is changed, or when performance is estimated over a wide range of off-design conditions, the incidence angle changes greatly as the flow rate changes. The deviation angle at the blade exit is therefore estimated with a correlation that accounts for the effects of blade design and incidence angle variation. For the loss model, we used known pressure loss models for the blade boundary layer and wake, secondary flow, endwall boundary layer, and tip leakage flow. The results of the modified deviation angle model were compared with experiment to assess the usefulness of the modified model.


On a Study of the Reduction of Bit Rate by the Preprocessing of PSOLA Coding Technique in the G.723.1 Vocoder

  • 장경아;조성현;배명진
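    • The Institute of Electronics Engineers of Korea: Conference Proceedings / Proceedings of the Institute of Electronics Engineers of Korea 2002 Summer Conference (4) / pp. 401-404 / 2002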
  • In general, speech coding methods are classified into three categories: waveform coding, source coding, and hybrid coding. In this paper, the reference waveform is first detected after searching for the pitch period using NAMDF similarity, and the similarity between the reference waveform and the waveform of each pitch period is computed. A similarity threshold decides whether a waveform is compressed. If it is, only its magnitude and pitch information are passed to the input of the G.723.1 vocoder. After passing through the G.723.1 vocoder, the waveform is restored from the magnitude and pitch information by the PSOLA synthesis method. In simulation, the proposed algorithm reduces the bit rate by 31% relative to the standard 5.3 kbps G.723.1 ACELP vocoder.

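The pitch period search mentioned in the abstract above can be illustrated with a plain average magnitude difference function (AMDF). This is a minimal sketch, not the paper's method: the abstract does not give the exact normalization behind "NAMDF", so the code simply averages over the overlapping samples, and the function names, lag range, and test signal are hypothetical.

```python
import math


def amdf(x, lag):
    """Average magnitude difference of x against itself shifted by `lag` samples."""
    n = len(x) - lag
    if n <= 0:
        raise ValueError("lag must be shorter than the frame")
    return sum(abs(x[i] - x[i + lag]) for i in range(n)) / n


def estimate_pitch_period(x, min_lag, max_lag):
    """Return the lag in [min_lag, max_lag] with the smallest AMDF value."""
    return min(range(min_lag, max_lag + 1), key=lambda lag: amdf(x, lag))


# Hypothetical test frame: a sinusoid with a 40-sample period.
frame = [math.sin(2 * math.pi * i / 40) for i in range(400)]
period = estimate_pitch_period(frame, 20, 60)
print("estimated pitch period (samples):", period)
```

The lag range is restricted to less than two periods here because the AMDF also dips at integer multiples of the true period; a practical detector needs some rule for choosing among those dips.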

A Study on the Pitch Alteration Technique by Subband Scaling in Speech Signal

  • 김영구;배명진
    • Speech Sciences / Vol. 10, No. 4 / pp. 137-147 / 2003
  • Speech synthesis can be classified by synthesis method into waveform coding, source coding, and hybrid coding. Waveform coding in particular is suitable for high-quality synthesis. However, it is not well suited to synthesis at the syllable or phoneme level because it does not separate the excitation and formant parts and handle them individually. A pitch alteration method is therefore needed to apply waveform coding to synthesis by rule. This study proposes a pitch alteration method that applies spectrum scaling after flattening the spectrum by subband linear approximation, in order to minimize spectral distortion. The proposed method is evaluated against LPC, cepstrum, and lifter-function methods by measuring how flat each flattened spectrum is. For this evaluation, each flattened signal is normalized so that its highest point is zero, and the distribution of the zero-mean signal is calculated. The spectrum distortion rate is then measured to estimate the performance of the proposed method. The average spectrum distortion rate stayed below 2.12%, so the proposed method is superior to the existing methods.


A Study On the Disguised Voice - From a prosodic point of view -

  • 조민하;노석은;송민규;신지영;강선미
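    • The Korean Society of Phonetic Sciences and Speech Technology: Conference Proceedings / Proceedings of the Korean Society of Phonetic Sciences and Speech Technology May 2003 Conference / pp. 191-195 / 2003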
  • The aim of this paper is to analyze the phonetic features of disguised voice. We examined features such as phonation type, pitch range, speech rate, intonation type, and boundary tones. The results of the analysis are as follows: (1) phonation type is a very important means of voice disguise for male subjects; (2) pitch range and average pitch value are very important cues for speaker verification; (3) pitch contour, speech rate, and boundary tones can be secondary cues for speaker verification.


A Study on the Pitch Detection of Speech Harmonics by the Peak-Fitting

  • 김종국;조왕래;배명진
    • Speech Sciences / Vol. 10, No. 2 / pp. 85-95 / 2003
  • In speech signal processing, accurate pitch detection is very important for speech recognition, synthesis, and analysis. In analysis, an accurately detected pitch makes it possible to obtain the vocal tract parameters properly. In synthesis, it can be used to change or maintain the naturalness and intelligibility of the speech, and in recognition, to remove speaker individuality for speaker independence. In this paper, we propose a new pitch detection algorithm. First, positive center clipping is performed using the slope of the speech signal, in order to emphasize the pitch period with a glottal component from which the vocal tract characteristic has been removed in the time domain. A rough formant envelope is then computed by peak-fitting the spectrum of the original speech signal in the frequency domain. A smoothed formant envelope is obtained from the rough envelope by linear interpolation. The flattened harmonics waveform is obtained as the algebraic difference between the spectrum of the original speech signal and the smoothed formant envelope. An inverse fast Fourier transform (IFFT) of the flattened harmonics then yields a residual signal from which the vocal tract component has been removed. Performance was compared with the LPC, cepstrum, and ACF methods. The algorithm improved the accuracy of pitch detection and reduced the gross error rate in voiced speech regions and in transition regions where the phoneme changes.

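The envelope-flattening step in the abstract above (peak-fitting, linear interpolation, then subtracting the envelope from the spectrum) might look like the sketch below. It is one plausible reading, not the authors' implementation: the local-maximum peak picker, the function names, and the toy log-magnitude values are all assumptions.

```python
def peak_envelope(spectrum):
    """Linearly interpolate between local maxima (and both end points) of a spectrum."""
    if len(spectrum) < 2:
        raise ValueError("spectrum needs at least two bins")
    last = len(spectrum) - 1
    # Anchor points: the first bin, every local maximum, and the last bin.
    peaks = [0]
    peaks += [i for i in range(1, last) if spectrum[i - 1] < spectrum[i] >= spectrum[i + 1]]
    peaks.append(last)
    envelope = []
    for a, b in zip(peaks, peaks[1:]):
        for i in range(a, b):
            w = (i - a) / (b - a)  # position between the two anchor bins
            envelope.append((1 - w) * spectrum[a] + w * spectrum[b])
    envelope.append(spectrum[last])
    return envelope


def flatten_spectrum(spectrum):
    """Subtract the interpolated peak envelope, leaving the harmonic fine structure."""
    return [s - e for s, e in zip(spectrum, peak_envelope(spectrum))]


# Toy log-magnitude spectrum (arbitrary units).
toy = [0, 2, 1, 4, 0]
flat = flatten_spectrum(toy)
print(flat)
```

In this reading every picked peak maps to zero and the valleys between peaks become negative, so the spectral tilt that carries the formant shape is removed before any inverse transform.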

A Study on the Pitch Contour Generator with Neural Network in the Isolated Words

  • 임운천;곽진구;장석왕
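    • The Korean Society of Phonetic Sciences and Speech Technology: Conference Proceedings / Proceedings of the Korean Society of Phonetic Sciences and Speech Technology February 1996 Conference / pp. 137-155 / 1996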
  • The purpose of this paper is to use a neural network to generate a pitch contour that is affected by the phonetic environment and the number of syllables in each Korean isolated word. To do this, we analyzed a set of 513 Korean isolated words of 1 to 4 syllables and extracted the pitch contour and the duration of each phoneme in every word. About 3800 phonemes were analyzed in total. We then approximated the pitch contour of each phoneme with a first-order polynomial by regression analysis, which gave the slope, the initial pitch, and the duration of each phoneme. We used these three parameters as the target pattern of the neural network and let the network learn the rule governing the variation of pitch and duration, which is affected by the phonetic environment of each phoneme. We used strings of 7 consecutive phonemes as the input pattern so that the network would learn the effect of the phonetic environment around the center phoneme. In the learning phase, we used 3545 items (463 words) as target patterns, each containing the phonetic environment of the 3 preceding and 3 following phonemes, and the neural network showed correctness rates of 98.43%, 98.59%, and 97.7% in estimating the duration, the slope, and the initial pitch. In the recall phase, we tested the performance of the neural network with 251 items (50 words) that were not used as training data, and obtained good correctness rates of 97.34%, 95.45%, and 96.3% in generating the duration, the slope, and the initial pitch of each phoneme.

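The first-order polynomial approximation described in the abstract above reduces each phoneme's pitch contour to three numbers: slope, initial pitch, and duration. A minimal least-squares sketch follows; the function name, the units (seconds and Hz), the choice to take the initial pitch from the fitted line at the first frame, and the sample values are assumptions, since the abstract does not specify them.

```python
def fit_phoneme_contour(times_s, f0_hz):
    """Fit f0 = intercept + slope * t by least squares; return (slope, initial_pitch, duration)."""
    n = len(times_s)
    if n < 2 or n != len(f0_hz):
        raise ValueError("need at least two matching time/F0 samples")
    mean_t = sum(times_s) / n
    mean_f = sum(f0_hz) / n
    var_t = sum((t - mean_t) ** 2 for t in times_s)
    if var_t == 0:
        raise ValueError("time stamps must not all be equal")
    slope = sum((t - mean_t) * (f - mean_f) for t, f in zip(times_s, f0_hz)) / var_t
    intercept = mean_f - slope * mean_t
    initial_pitch = intercept + slope * times_s[0]  # fitted value at the first frame
    duration = times_s[-1] - times_s[0]
    return slope, initial_pitch, duration


# Hypothetical F0 frames (Hz) for one phoneme, sampled every 10 ms.
times = [0.00, 0.01, 0.02, 0.03]
f0 = [100.0, 102.0, 104.0, 106.0]
slope, initial_pitch, duration = fit_phoneme_contour(times, f0)
print(slope, initial_pitch, duration)
```

Triples like this, one per phoneme, are the kind of target pattern the abstract describes feeding to the network alongside a 7-phoneme input window.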