• Title/Summary/Keyword: Formant

Search Result 414, Processing Time 0.04 seconds

Face Feature Extraction for Child Ocular Inspection and Diagnosis of Colics by Crying Analysis (소아 망진을 위한 얼굴 특징 추출 및 영아 산통 진단을 위한 울음소리 분석)

  • Cho Dong-Uk;Kim Bong-Hyun
    • The KIPS Transactions:PartB
    • /
    • v.13B no.2 s.105
    • /
    • pp.97-104
    • /
    • 2006
  • There is no method to control for the child efficiently when disease happens who cannot be able to express his symptoms. Therefore, doctor's diagnosis depends on inquiring from child's patients, that leads to wrong diagnosis result. For this, in this paper, we would like to develop child ocular inspection, auscultation diagnosis instruments, using Oriental medicine principle that living body signal of five organs and six hallow organs which reflects patients face and voice We would like to get more accurate diagnosis result for child's symptoms from doctor's intuition on the basis of diagnostic sight visualization, objectification, quantization itself. This paper develops color revision, YCbCr application, and face color selection and five sensory organs and nose or apex extraction method etc, in child ocular inspection by first work achievement sequence among the whole development systems. Also, in occasion of child auscultation, crying characteristics of colics through pitch, intensity and formant analysis is numerized and objectifies doctor's intuition through this. Finally, experiments are performed to verify the effectiveness of the proposed methods.

Acoustic Voice Analysis in Patients with Penetration/Aspiration Via Videofluoroscopic Swallowing Study (비디오투시조영검사를 통한 침습/흡인에 따른 음성의 음향적 분석)

  • Kang, Young Ae;Jee, Sung Ju;Koo, Bon Seok
    • Korean Journal of Otorhinolaryngology-Head and Neck Surgery
    • /
    • v.60 no.9
    • /
    • pp.454-462
    • /
    • 2017
  • Background and Objectives The present study aimed to investigate the effects of penetration/aspiration (P/A) on voice acoustic parameters. Subjects and Method Twenty-seven patients were analyzed with the videofluoroscopic swallowing study (VFSS) and then divided into two groups based on the modified Penetration and Aspiration Scale results. Ten patients (5 males and 5 females) were included in the Non-P/A group, and 17 patients (12 males and 5 females) in the P/A group. Stroke was the major cause of swallowing disorders. Three sustained /a/ vowels recorded in pre- and post-VFSS were analyzed. Mann-Whitney U-test was used to compare acoustic values before and after VFSS, and the receiver operating characteristics (ROC) curve with combination of significant parameters was also conducted. Results Among acoustic parameters, the length of analyzed sample (p=0.010), number of segments computed (p=0.018), total number detected pitch periods (p=0.017), and second formant (p=0.013) in pre- and post-VFSS were significantly different between Non-P/A and P/A groups. In the P/A group after VFSS, the means of these significant parameters decreased. According to ROC combined with four significant parameters, the probability of predicting P/A condition was 84% (p=0.005), the sensitivity was 80%, and the specificity was 80%. Conclusion Voice acoustic analysis can reflect voice changes by penetration/aspiration and the combination of significant parameters can also detect swallowing disorders. Therefore, voice analysis can be a reliable screening tool for patients with swallowing disorders.

Change Measurement of Voice Analysis Parameter by an Increase of Intake the Caffeine (카페인 섭취량 증가에 따른 음성 분석 요소의 변화 측정)

  • Seo, Kyoung-Won;Jang, Yong-Jo;Kang, Deok-Hyun;Bae, Jung-Su;Yean, Yong-Hem;Lim, Soon-Yong;Min, Ji-Seon;Kim, Bong-Hyun;Ka, Min-Kyoung;Cho, Dong-Uk
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2010.11a
    • /
    • pp.656-659
    • /
    • 2010
  • 오늘날 현대 사회에서 여가생활시간이 늘어나고 이에 따라 사람들은 잦은 커피타임을 갖고 있으며 이로 인해 커피의 섭취량이 날로 늘어나고 있다. 날로 늘어나는 커피 섭취량과 같이 커피의 주성분인 카페인 섭취량 또한 날로 증가 하고 있다. 따라서 본 논문에서는 카페인의 성분과 효능, 인체에 미치는 영향 정도와 음성에 관여하는 정도 등에 대한 결과를 추출하여 실질적으로 카페인이 음성에 미치는 요소를 분석하였다. 이를 위해 본 논문에서는 음성분석 프로그램인 Praat를 사용하였으며 성대의 변화량과 몸 속의 공명음인 Formant를 실험 요소로 적용하였다. 데이터 자료에서 유용성을 입증하고 문제점 해결에 대해 분석하였으며 실험에 의해 제안한 방법의 활용성을 입증하는 연구를 수행하였다.

The Effects of Secondhand Smoking on Articulators Based on Phonetic Analysis (음성학적 분석 기반의 간접흡연이 조음기관에 미치는 영향)

  • Seo, Kyoung-Won;Kang, Deok-Hyun;Bae, Jung-Su;Jang, Yong-Jo;Yean, Yong-Hem;Lim, Soon-Yong;Min, Ji-Seon;Kim, Bong-Hyun;Ka, Min-Kyoung;Cho, Dong-Uk
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2010.11a
    • /
    • pp.648-651
    • /
    • 2010
  • 웰빙의 바람을 타고 이제 자신의 건강을 관리하는 사람들이 많아지고, 흡연에 대한 좋지 않은 인식이 높아지면서 금연의 열풍이 강하게 불고 있다. 하지만 금연을 한다고 해도 주위의 담배연기는 우리 몸의 건강을 해치기 때문에 담배연기로부터 해방되기는 매우 어렵다. 실제로 흡연하는 배우자를 가진 사람은 그렇지 않은 사람에 비해 심장병 발생률은 40%, 폐암 발생률은 30%가 더 높다. 따라서 본 논문에서는 간접흡연이 인체의 조음기관에 미치는 영향을 분석하기 위해 간접흡연에 따른 음성의 변화를 측정하고 비교, 분석하는 실험을 수행하였다. 이를 위해 간접흡연 전과 후의 음성을 수집하여 음성분석학적 요소 기술 중 Pitch, Jitter, Shimmer 등의 성대 진동 요소를 적용하고 인체 내의 공명기관을 분석하는 Formant를 적용하여 실험을 수행하여 간접흡연이 음성에 미치는 영향을 연구하였다.

PATTERNS OF ASSIMILATION OF IGBO VOWELS : AN ACOUSTIC ACCOUNT

  • Clara I. Ikekeonwu
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.514-514
    • /
    • 1996
  • Igbo, a new Benue Congo language has a vowel harmony system which, like that of Akan, is based on the pharynx size or tongue root position. In this study we examine Igbo vowel harmony with particular reference to assimilatory patterns of vowels in different harmony sets. This is to gain some insight into the factors involved in Igbo vowel assimilation, and to establish to what extent reports on Akan vowel assimilation are validated in Igbo. Tokens of the eight phonemic vowels of Standard Igbo are recorded from three native speakers of Igbo. The vowels are acoustically investigated (using the LPC analysis of CSL) in individual lexical items and within carefully designed carrier phrases. The F1 and F2 values of the vowels are obtained as these formant values are generally useful in establishing the salient characteristics of vowels. Vowels from the harmony sets are juxtaposed in the carrier phrases to ascertain the extent of assimilation. Results of the investigation show that the F1 values, to a large extend, are enough to characterize these vowels. The (-Expanded) vowels have higher F1 values than their (+Expanded) counterpart. Where there is an overlap in F1 values for some vowels the F1 bandwidth values serve to distinguish between the vowels. The overlap often reported in Akan for /I/ and /e/ on the one hand and /${\mho}$/ and /o/ on the other is not validated in Igbo. While the F1 values for these pairs of vowels are quite similar for one of our speakers, there is an appreciable difference between the F1 values of these vowels for the other two speakers. There is however an overlap for /e/ and /o/ for one of the speakers. Assimilations are generally regressive across word boundaries. It is, however, necessary to point out that the general perceptual impression that one of the vowels completely assimilates to the other, is not borne out by our investigation. Most of our F1 and F2 values for the vowels in individual lexical items are altered in assimilations. This then suggests that assimilation involving these vowels is partial rather than complete. The emerging 'allophones' are acoustically similar to the (+Expanded) vowel involved in the assimilation, that is when vowels from different harmony sets are involved. We conclude that while assimilation of Igbo vowels involves some phonological considerations, phonetic factors appear to be permanent in deciding the final form of the vowels.

  • PDF

Spectral Modeling of Haegeum Using Cepstral Analysis (캡스트럼 분석을 이용한 해금의 스펙트럼 모델링)

  • Hong, Yeon-Woo;Kang, Myeong-Su;Cho, Sang-Jin;Kim, Jong-Myon;Lee, Jung-Chul;Chong, Ui-Pil
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.4
    • /
    • pp.243-250
    • /
    • 2010
  • This paper proposes a spectral modeling of Korean traditional instrument, Haegeum, using cepstral analysis to naturally describe Haegeum sounds varying with time. To get a precise result of cepstral analysis, we set the frame size to 3 periods of input signal and more cepstral coefficients are used to extract formants. The performance is enhanced by flexibly controlling the cutoff frequency of bandpass filter depending on the resonances in the synthesis process of sinusoidal components and the deleting peaks remained in the residual signal. To detect the change of pitch, we divide the input frames into silence, attack, and sustain region and determine which region the current frame is involved in. Then, the proposed method readjusts the frame size according to the fundamental frequency in the case of the current frame is in attack region and corrects the extraction errors of the fundamental frequency for the frames in sustain region. With these processes, the synthesized sounds are much more similar to the originals. The evaluation result through the listening test by a Haegeum player says that the synthesized sounds are almost similar to originals (96~100 % similar to the original sounds).

Vocal Analysis of Talking Rooster (말하는 닭의 발성 특성 분석)

  • Kyon, Doo-Heon;Bae, Myung-Jin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.2
    • /
    • pp.125-132
    • /
    • 2010
  • Since the ancient times, animals that can imitate the voices of human beings have been considered extremely special. There are very few such animals, and the parrot is an example of them. For a long time, there had been no reported case of a rooster being able to mimic the voice of a human being, but talking roosters were recently found in Korea and the Kyrgyz Republic, generating much talk. In this study, the vocal characteristics of such roosters were examined, and their pronunciation-related statistics and actual sound sources were analyzed. The analysis results showed that even though the roostets cannot converse with people, they can imitate the human voice, uttering the words "An-dwae," and "A-ni-ya" in Korean, which mean "No" in English, when someone tries to catch their wings. A similar situation 'occurred in the Kyrgyzstan. The results of the listening survey on these sounds made by the roosters showed that most people recognized the words uttered by the roosters and that nobody thought that the words sounded like "cock-a-doodle-doo." It can be said that such roosters can make the sounds of the human voice because of their innate vocal organ and characteristics, which are significantly different from those of the general roosters. Their vocal organ and characteristics cause the sounds that they make to change in their vocal cords due to their high tension when humans try to catch them.

Acoustic characteristics of speech-language pathologists related to their subjective vocal fatigue (언어재활사의 주관적 음성피로도와 관련된 음향적 특성)

  • Jeon, Hyewon;Kim, Jiyoun;Seong, Cheoljae
    • Phonetics and Speech Sciences
    • /
    • v.14 no.3
    • /
    • pp.87-101
    • /
    • 2022
  • In addition to administering a questionnaire (J-survey), which questions individuals on subjective vocal fatigue, voice samples were collected before and after speech-language pathology sessions from 50 female speech-language pathologists in their 20s and 30s in the Daejeon and Chungnam areas. We identified significant differences in Korean Vocal Fatigue Index scores between the fatigue and non-fatigue groups, with the most prominent differences in sections one and two. Regarding acoustic phonetic characteristics, both groups showed a pattern in which low-frequency band energy was relatively low, and high-frequency band energy was increased after the treatment sessions. This trend was well reflected in the low-to-high ratio of vowels, slope LTAS, energy in the third formant, and energy in the 4,000-8,000 Hz range. A difference between the groups was observed only in the vowel energy of the low-frequency band (0-4,000 Hz) before treatment, with the non-fatigue group having a higher value than the fatigue group. This characteristic could be interpreted as a result of voice abuse and higher muscle tonus caused by long-term voice work. The perturbation parameter and shimmer local was lowered in the non-fatigue group after treatment, and the noise-to-harmonics ratio (NHR) was lowered in both groups following treatment. The decrease in NHR and the fall of shimmer local could be attributed to vocal cord hypertension, but it could be concluded that the effective voice use of speech-language pathologists also contributed to this effect, especially in the non-fatigue group. In the case of the non-fatigue group, the rhamonics-to-noise ratio increased significantly after treatment, indicating that the harmonic structure was more stable after treatment.

THE STUDY OF PHONETIC CHANGE AFTER THE ORTHOGNATHIC SURGERY FOR THE PATIENT OF MANDIBULAR PROGNATHISM (하악전돌증환자(下顎前突症患者)의 악교정수술후(顎矯正手術後) 음성변화(音聲變化)에 관(關)한 연구(硏究))

  • Kim, Byung Ju;Kim, Yeo Gab
    • Maxillofacial Plastic and Reconstructive Surgery
    • /
    • v.15 no.4
    • /
    • pp.239-252
    • /
    • 1993
  • This study was done to analyze phonetic dysfunction and the effect of orthognathic surgery to phonation in the patients of mandibular prognathism. 20 persons were chosen as normal group and 20 patients of mandibular prognathism as abnormal for this study. 5 vowel sounds-'ㅏ(a)', 'ㅔ(e)', 'ㅣ(i)', 'ㅗ(o)', 'ㅜ(u)' and 14 consonant sounds-'ㄱ(g)', "ㄴ(n), 'ㄷ(d)', 'ㄹ(l)', 'ㅁ(m)', 'ㅂ(b)', 'ㅅ(s)', 'ㅇ(ng)', 'ㅈ(j)', 'ㅊ(ch)', 'ㅋ(k)', 'ㅌ(t)', 'ㅍ(p)', 'ㅎ(h)', were checked. We recorded these sounds in the period of preopration, postop. 12 months, postop. 24 months. A Formant ratio and a length of consonant time were studied with discriminant analysis. As a result of the study, the following conclusion were gained. 1. As a result of the analysis on vowel dysfunction patiensts of mandibular prognathism, more than 80% of men patients showed the dysfunction in prelingual sound 'ㅔ(e)' and 'ㅣ(i)'. More than 70% of women patients showed the dysfunction in all vowel. 2. One year later from the orthognathic surgery, men patients showed a marked improvement in 'ㅏ(a)'. The next were 'ㅗ(o)', 'ㅜ(u)' and 'ㅣ(i)'. Women patients showed a marked improvement in 'ㅜ(u)'. 3. Two year later from the orthognathic surgery, men patients showed a marked improvement in prelingual sound 'ㅔ(e)' and postlingual sound 'ㅗ(o)'. Women patients showed a marked improvement in 'ㅏ(a)'. More than 20% of patients showed the phonetic improvement compared with the condition of the postop. 12 months. 4. As a result of the analysis on consonant dysfunction patient of mandibular prognathism. more than 80% of men patients showed the dysfunction in lingual sound 'ㅅ(s)'. Most women patients showed the dysfunction in labial sound 'ㅁ(m)' and lingual sound 'ㄴ(n)'. More than 50% of patients showed the dysfunction in labial sound and lingual sound. 5. One year later from the orthognathic surgery. men patients showed a complete improvement in hard palatal sound 'ㅈ(j)'. The next were labial sound 'ㅂ(b)', lingual sound 'ㅅ(s)', soft palatal sound 'ㄱ(g)' and 'ㅋ(k)'. Women patients showed a marked improvement in soft palatal sound 'ㅇ(ng)' and 'ㄱ(g)'. 6. Two year later from the orthognathic surgery, all patients showed remarkable improvement in consonant sounds. except for labial sound 'ㅁ(m)', 'ㅍ(p)' and lingual sound 'ㄴ(n)'. The improvement ratio was increased as the time was going on compared with the condition of postop 12 months.

  • PDF

Clinical Acoustic Study of Acupuncture Therapy Effects on Post-Stroke Dysarthria (침치료가 뇌졸중으로 인한 구음장애에 미치는 음향적 특성에 대한 증례보고)

  • Lee, Min-Goo;Park, Sae-Wook;Lee, Sun-Woo;Ryu, Hyun-Hee;Lee, Seung-Eon;Kim, Yong-Jeong;Son, Ji-Woo;Rhim, Eun-Kyung;Kim, Sung-Nam;Lee, In;Moon, Byung-Soon;Yun, Jong-Min
    • The Journal of Internal Korean Medicine
    • /
    • v.26 no.3
    • /
    • pp.660-669
    • /
    • 2005
  • Objectives : The aim of this study is to find the acoustic characteristics of acupuncture therapy effects on post-stroke dysarthria. Methods : Acupuncture therapy was applied for four to six weeks by inserting needles into eight acupuncture points, CV23, CV24, bilateral 'Sheyu' and ipsilateral ST4, ST6 and contralateral LI4, ST36 on facial palsy side. All the speech samples were collected, pre-treatment and post-treatment, using Computerized Speech Lab. VOT and TD of each speech sample and vowel formant(F1&F2) were analyzed on spectrogram. Result : VOT and TD were decreased after treatment. F1 was decreased, and F2 was increased after treatment. Conclusions : This suggests that acupuncture therapy improves symptoms of post-stroke dysarthria by stimulating articulation organs such as tongue, lips, cheeks, larynx and pharynx.

  • PDF