• Title/Summary/Keyword: quality of pitch

Search Result 330, Processing Time 0.028 seconds

Separation of Voiced Sounds and Unvoiced Sounds for Corpus-based Korean Text-To-Speech (한국어 음성합성기의 성능 향상을 위한 합성 단위의 유무성음 분리)

  • Hong, Mun-Ki;Shin, Ji-Young;Kang, Sun-Mee
    • Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.7-25
    • /
    • 2003
  • Predicting the right prosodic elements is a key factor in improving the quality of synthesized speech. Prosodic elements include break, pitch, duration and loudness. Pitch, which is realized by Fundamental Frequency (F0), is the most important element relating to the quality of the synthesized speech. However, the previous method for predicting the F0 appears to reveal some problems. If voiced and unvoiced sounds are not correctly classified, it results in wrong prediction of pitch, wrong unit of triphone in synthesizing the voiced and unvoiced sounds, and the sound of click or vibration. This kind of feature is usual in the case of the transformation from the voiced sound to the unvoiced sound or from the unvoiced sound to the voiced sound. Such problem is not resolved by the method of grammar, and it much influences the synthesized sound. Therefore, to steadily acquire the correct value of pitch, in this paper we propose a new model for predicting and classifying the voiced and unvoiced sounds using the CART tool.

  • PDF

Development of Pitch Pine Glued Laminated Timber for Structural Use -Improvement of Bending Capacity of Pitch Pine Glulam by Using Domestic Larch Laminars- (리기다소나무의 구조용 집성재 이용기술 개발 -낙엽송 층재와의 혼합 구성을 통한 집성재의 휨성능 향상-)

  • Kim, Kwang-Mo;Shim, Kug-Bo;Park, Joo-Saeng;Kim, Wun-Sub;Lim, Jin-Ah;Yeo, Hwanmyeong
    • Journal of the Korean Wood Science and Technology
    • /
    • v.35 no.6
    • /
    • pp.13-22
    • /
    • 2007
  • This study was carried out to scrutinize possibility of manufacturing pitch pine (Pinus rigida) glued laminated timber in order to add values of pitch pine trees. Also, it was investigated to improve bending performance of pitch pine glulam. Pitch pine was imported as one of major plantation species in Korean peninsula. Machine stress rated grades of pitch pine lumber mostly ranged between E7 and E9. which grades were more or less inferior to producing high quality glulam. However, the adhesive properties between pitch pine and pitch pine, and between pitch pine and Japanese larch (Larix kaempferi Carr.), such as shear bond strength, wood failure rate and de-lamination rate of bonded layer submerged in cold and boiling water, were higher than Korean Standard criteria. These properties are essential for manufacturing glulam with single species or multiple species. The modulus of rupture (MOR) of pitch pine glulam exceeded the criterion of Korean Standard for glulam strength grade but modulus of elasticity (MOE) was lower than the criterion. On the other hand, the bending performances (MOR and MOE) were improved 20 percent by mixing with Japanese larch laminar. It is effective to arrange higher quality Japanese larch laminar at the outer layer of glulam for improving bending performances. In conclusion, it is possible to use low quality pitch pine as laminar of structural glulam for adding values of pitch pine.

A Correlation Study among Pitch, Nasalance, and Voice Quality (정상 성인의 음도, 비성도, 음질 간의 상관 연구)

  • Park, Sung-Jong;Yoo, Jae-Yeon
    • Phonetics and Speech Sciences
    • /
    • v.1 no.4
    • /
    • pp.159-163
    • /
    • 2009
  • The purpose of this study is to conduct a correlational analysis among pitch, nasalance, and acoustic quality parameters estimated by two speech analysis softwares NasalView(version 1.31), Dr. Speech 4.5(Tiger Electronics). Thirty females and 25 males with normal voice participated in the study. The Pearson correlation coefficient was determined through a statistical analysis. The results came out as follows; Firstly, there was a correlation between $F_0$ and voice quality parameters, however there was no correlation between $F_0$ and nasalance. Secondly, nasalance showed a correlation with voice quality parameters.

  • PDF

The phonetic realization of English unstressed vowels produced by Korean advanced learners : A comparative study of English words and English loanwords (한국인 상급 학습자의 영어 비강세 모음의 특징 -영어단어와 한국어에 외래어로 유입된 영어단어의 비교연구-)

  • Kang, Sun-Mi;Kang, Ji-Eun;Kim, Kee-Ho
    • Phonetics and Speech Sciences
    • /
    • v.4 no.1
    • /
    • pp.3-11
    • /
    • 2012
  • The aim of this paper is to examine the phonetic realizations of English unstressed vowels produced by advanced Korean learners (KLs) of English compared with English native speakers (NSs) focusing on the comparative study of English words and English loanwords. The result shows that KLs are usually not native-like in producing the English unstressed vowel /ə/ and loanword orthography affects the way the KLs produce /?/. The vowel quality of the unstressed vowels produced by the KLs is different from that of the NSs. In duration and pitch, KLs show significantly less difference between the stressed and unstressed vowels than do the NSs. The KLs usually have a high pitch in the stressed and the last syllable while the NSs usually produce peak F0 in the stressed syllable. When the KLs have a similar vowel quality with that of the NSs, they produce a shorter duration of the unstressed vowels. However, there is no correlation between the realization of the pitch and the vowel quality in KLs speech.

A Study on Multi-Pulse Speech Coding Method by Using Individual Pitch Information (개별 피치정보를 이용한 멀티펄스 음성부호화 방식에 관한 연구)

  • Lee, See-Woo
    • The Journal of the Korea Contents Association
    • /
    • v.6 no.2
    • /
    • pp.59-64
    • /
    • 2006
  • In this paper, 1 propose a new method of Multi-Pulse Coding(IP-MPC) use individual pitch pulses in order to accommodate the changes in each pitch interval and reduce pitch errors. The extraction rate of individual pitch pulses was $85\%$ for female voice and $96\%$ for male voice respectively, 1 evaluate the MPC by using pitch information of autocorrelation method and the IP-MPC by using individual pitch pulses. As a result, 1 knew that synthesis speech of the IP-MPC was better in speech quality than synthesis speech of the MPC.

  • PDF

Image Quality and Radiation Dose of High-Pitch Dual-Source Spiral Cardiothoracic Computed Tomography in Young Children with Congenital Heart Disease: Comparison of Non-Electrocardiography Synchronization and Prospective Electrocardiography Triggering

  • Goo, Hyun Woo
    • Korean Journal of Radiology
    • /
    • v.19 no.6
    • /
    • pp.1031-1041
    • /
    • 2018
  • Objective: To compare image quality and radiation dose of high-pitch dual-source spiral cardiothoracic computed tomography (CT) between non-electrocardiography (ECG)-synchronized and prospectively ECG-triggered data acquisitions in young children with congenital heart disease. Materials and Methods: Eighty-six children (${\leq}3$ years) with congenital heart disease who underwent high-pitch dual-source spiral cardiothoracic CT were included in this retrospective study. They were divided into two groups (n = 43 for each; group 1 with non-ECG-synchronization and group 2 with prospective ECG triggering). Patient-related parameters, radiation dose, and image quality were compared between the two groups. Results: There were no significant differences in patient-related parameters including age, cross-sectional area, body density, and water-equivalent area between the two groups (p > 0.05). Regarding radiation dose parameters, only volume CT dose index values were significantly different between group 1 ($1.13{\pm}0.09mGy$) and group 2 ($1.07{\pm}0.12mGy$, p < 0.02). Among image quality parameters, significantly higher image noise ($3.8{\pm}0.7$ Hounsfield units [HU] vs. $3.3{\pm}0.6HU$, p < 0.001), significantly lower signal-to-noise ratio ($105.0{\pm}28.9$ vs. $134.1{\pm}44.4$, p = 0.001) and contrast-to-noise ratio ($84.5{\pm}27.2$ vs. $110.1{\pm}43.2$, p = 0.002), and significantly less diaphragm motion artifacts ($3.8{\pm}0.5$ vs. $3.7{\pm}0.4$, p < 0.04) were found in group 1 compared with group 2. Image quality grades of cardiac structures, coronary arteries, ascending aorta, pulmonary trunk, lung markings, and chest wall showed no significant difference between groups (p > 0.05). Conclusion: In high-pitch dual-source spiral pediatric cardiothoracic CT, additional ECG triggering does not substantially reduce motion artifacts in young children with congenital heart disease.

Robust Pitch Detection Algorithm for Pathological Voice inducing Pitch Halving and Doubling (피치 반감 배가를 유발하는 병적인 음성 분석을 위한 강인한 피치 검출 알고리즘)

  • Jang, Seung-Jin;Choi, Seong-Hee;Kim, Hyo-Min;Choi, Hong-Shik;Yoon, Young-Ro
    • Proceedings of the KIEE Conference
    • /
    • 2007.07a
    • /
    • pp.1797-1798
    • /
    • 2007
  • In field of voice pathology, diverse statistics extracted form pitch estimation were commonly used to assess voice quality. In this study, we proposed robust pitch detection algorithm which can estimate pitch of pathological voices in benign vocal fold lesions. we also compared our proposed algorithm with three established pitch detection algorithms; autocorrelation, simplified inverse filtering technique, and nonlinear state-space embedding methods. In the database of total pathological voices of 99 and normal voices of 30, an analysis of errors related with pitch detection was evaluated between pathological and normal voices, or among the types of pathological voices. According to the results of pitch errors, gross pitch error showed some increases in cases of pathological voices; especially excessive increase in PDA based on nonlinear time-series. In an analysis of types of pathological voices classified by aperiodicity and the degree of chaos, the more voice has aperiodic and chaotic, the more growth of pitch errors increased. Consequently, it is required to survey the severity of tested voice in order to obtain accurate pitch estimates.

  • PDF

Analysis of Coiling Process and Quality Inspection of Filaments for Bulbs (전구용 필라멘트의 제조 공정 해석 및 품질 검사)

  • 정태은;표성배;전병희;장병수;김학준
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 2000.11a
    • /
    • pp.771-774
    • /
    • 2000
  • Coiling processes of filaments need precise work and standardization. It is important to maintain equal pitch of filaments. Uniform pitch of filaments is one of the dominant elements of life time and efficiency of bulbs. First coiling process of filament wires is modeled by nonlinear contact problem between filaments and mandrel. Analysis of coiling process using finite element method is conducted to consider manufacturing parameters and pitch distance is calculated under the given conditions. Also image detecting system is developed to inspect uniformity of pitch. This system will be used to inspect quality of filaments during coiling processes.

  • PDF

Comparative Analysis of Performance of Established Pitch Estimation Methods in Sustained Vowel of Benign Vocal Fold Lesions (양성후두 질환의 지속모음을 대상으로 한 기존 피치 추정 방법들의 성능 비교 분석)

  • Jang, Seung-Jin;Kim, Hyo-Min;Choi, Seong-Hee;Park, Young-Cheol;Choi, Hong-Shik;Yoon, Young-Ro
    • Speech Sciences
    • /
    • v.14 no.4
    • /
    • pp.179-200
    • /
    • 2007
  • In voice pathology, various measurements calculated from pitch values are proposed to show voice quality. However, those measurements frequently seem to be inaccurate and unreliable because they are based on some wrong pitch values determined from pathological voice data. In order to solve the problem, we compared several pitch estimation methods to propose a better one in pathological voices. From the database of 99 pathological voice and 30 normal voice data, errors derived from pitch estimation were analyzed and compared between pathological and normal voice data or among the vowels produced by patients with benign vocal fold lesions. Results showed that gross pitch errors were observed in the cases of pathological voice data. From the types of pathological voices classified by the degree of aperiodicity in the speech signals, we found that pitch errors were closely related to the number of aperiodic segments. Also, the autocorrelation approach was found to be the most robust pitch estimation in the pathological voice data. It is desirable to conduct further research on the more severely pathological voice data in order to reduce pitch estimation errors.

  • PDF

On a Pitch Alteration Method Compensated with the Spectrum for High Quality Speech Synthesis (스펙트럼 보상된 고음질 합성용 피치 변경법)

  • 문효정
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1995.06a
    • /
    • pp.123-126
    • /
    • 1995
  • The waveform coding are concerned with simply preserving the wave shape of speech signal through a redundancy reduction process. In the case of speech synthesis, the wave form coding with high quality are mainly used to the synthesis by analysis. However, because the parameters of this coding are not classified as either excitation and vocal tract parameters, it is difficult to applying the waveform coding to the synthesis by rule. In this paper, we proposed a new pitch alteration method that can change the pitch period in waveform coding by using scaling the time-axis and compensating the spectrum. This is a time-frequency domain method that is preserved in the phase components of the waveform and that has a little spectrum distortion with 2.5% and less for 50% pitch change.

  • PDF