• Title/Summary/Keyword: Speech analysis

Search Result 1,573, Processing Time 0.029 seconds

Design of Programmable SC Filter (프로그램 가능한 SC Filter의 설계)

  • 이병수;이종악
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.11 no.3
    • /
    • pp.172-178
    • /
    • 1986
  • The recent interest in the design of filters is motivatied by the fact that such filter can be fully integrated using standard metal-oxide-semiconductor processing technology. This is due to replacing all the resistors in the active RC filter network by the switched capacitors. The voltage gain of a SC filter depends only on the rations of capacitance and these ratios can be obtained and maintained to high accuracy. Therefore, it is known that a switched capacitor is much better than a resistor in temperature and linearity characteristics. This paper proposed a programmable SC filter and proved the fact that ${omega}_0$ Q and G of this circuit can be controlled by digital signal. Experiments show that SC filter remains the low sensitivities but it can't avoid little influence of parasitic capacitance. As the transfer characteristic of the SC filter is varied with sampling frequency and resistor array, SC filtering technigue can be applied for digital processing, speech analysis and synthesis and so on.

  • PDF

Analysis of Phonatory Aerodynamic & E.G.G. during Passaggio of the Trained Male Singers (남성성악가의 Vocal Register Transition(Passaggio)시 공기역학적 변화와 EGG의 변화 연구)

  • Nam, Do-Hyun;Choi, Seong-Hee;Choi, Jae-Nam;Choi, Hong-Shik
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.15 no.1
    • /
    • pp.21-26
    • /
    • 2004
  • Vocal Register Transition(Passaggio) is one of the most important vocal technique for classically trined male singers(tenor). Passaggio is that it bridges the chest register to head register without a noticeable voice break. Vocalist gest the feeling that voice is not locked a particular register. The purpose of this study was to clarify the difference between easy($B_3$) tone and non passaggio(F#_4$) & passaggio(F#_4$). We selected 6 trained singers(tenor), who had more than 12.6 years of experience and were well trained in passaggio technique. Simulataneous measurement was performed frequency(F0), mean flow rate(MFR), intensity(I), and subglottal pressure(Psub) using a phonatory function analyzer(Nagashima) and Closed Quotient(CQ), Jitter, Shimmer, NHR a Electro-glottography(EGG) of Lx. Speech Studio(Laryngogrph Lt, London, UK) and vocal efficiency was calculated by Carroll's method. For the tenor, target tone/a/was measured in three conditions : 1) easy phonation : $B_3$, 2) high tone without passaggio : F#_4$, 3) high tone with passaggio : F#_4$). The results revealed that F0 of the target tones between non-passaggio group and passaggio group were not significantly different though higher is F0, higher is subglottal pressure. And also CQ, MFR, Psub were increased in passagio than nonpssagio but these values were not statistically different. This study concluded that passaggio is the vocal technique to make the same quality of tone between chest register and head register in tenor.

  • PDF

Development of Neck-Type Electrolarynx Blueton and Acoustic Characteristic Analysis (경부형 전기인공후두 Blueton의 개발과 음향학적 성능 분석)

  • Choi, Seong-Hee;Park, Young-Jae;Park, Young-Kwan;Kim, Tae-Jung;Nam, Do-Hyun;Lim, Sung-Eun;Lee, Sung-Eun;Kim, Han-Soo;Choi, Hong-Shik;Kim, Kwang-Moon
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.15 no.1
    • /
    • pp.37-42
    • /
    • 2004
  • Electrolarynx(EL), battery operated vibrators which are held against the neck by on-off button, has been widely used as a verbal communication method among post-laryngectomized patients. EL speech can produce easily without need of any additional surgery or special training and be used with any other methods. This institute developed a neck-typed EL named "Blueton" in commperation with EL Company Linkus, which consists of 3 parts : Vibrator part, Control part, Battery part. In this study we evaluated the acoustic characteristics of the produced voices by Blueton compared with Servox-inton using MDVP. Three EL users (2 full time users, 1 part time user) were participated. The results revelaed that NHR higher in Servox than Blueton and intensity is higher in Blueton than Servox. The spectra for vowels produced by EL speakers are mixed signals combined with talkers' vocal output and electrolarynx noise. The spectra pattern is similar with two ELs. High, SPI index and vowel spectra from MDVP demonstrated characteristics of both electrolarynxes related to noise signal. This finding suggests that Blueton helps to provide one of useful rehabilitation options in the post laryngectomy patients.

  • PDF

Coarticulation and vowel reduction in the neutral tone of Beijing Mandarin

  • Lin Maocan
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.207-207
    • /
    • 1996
  • The neutral tone is one of the most important distinguishing features in Beijing Mandarin, but there are two completely different views on its linguistic function: a special tone(Xu, 1980) versus weak stress(Chao, 1968). In this paper, the acoustic manifestation of the neutral tone will be explored to show that it is closely related to weak stress. 122 disyllabic words in which the second syllable carries the neutral tone, including 22 stress pairs, were uttered by a native male speaker of Beijing dialect and analysed by Kay Digital Sonagraph 5500-1. The results of the acoustic analysis are presented as follows: 1) The first two formants of the medial and the syllabic vowel moves towards that of central vowel with a greater magnitude in the syllable with the neutral tone than in the syllable with any of the four normal tones. Also the vowel ending, and nasal coda /n/ and / / in the syllable with the neutral tone tends to be deleted. 2) In the syllables with the neutral tone, there are strong carryover coarticulations between the medial and syllabic vowel and the preceding unvoiced consonant. In general, the vowel is affected to move towards the position of the central vowel with more greater magnitude by coronal consonant than by labial or velar consonant. 3) In the syllable with the neutral tone, when and only when it precedes a syllable with tone-4, the high vowel following [f], [ts'], [s], [ts'], [s], [tc'] or [c] tends to be voiceless. 4) It can be seen from the acoustical results of 22 stress pairs that the duration of the syllable with the neutral tone is on the average reduced to 55% of that of the syllable with the four normal tones, and the duration of the final in the syllable with neutral tone is on the average reduced to 45% of that of the final in the syllable with the four normal tones(Lin & Yan 1980). 5) The FO contour of the neutral tone is highly dependent on the preceding normal tone(Lin & Yan 1993). For a number of languages it has been found that the vowel space is reduced as the level of stress placed upon the vowel is reduced(Nord 1986). Therefore we reach the conclusion that the syllable with neutral tone is related to weak stress(Lin & Yan 1990). The neutral tone is not a special tone because the preceding normal tone.

  • PDF

Matching Pursuit Sinusoidal Modeling with Damping Factor (Damping 요소를 첨가한 매칭 퍼슈잇 정현파 모델링)

  • Jeong, Gyu-Hyeok;Kim, Jong-Hark;Lim, Joung-Woo;Joo, Gi-Ho;Lee, In-Sung
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.44 no.1
    • /
    • pp.105-113
    • /
    • 2007
  • In this paper, we propose the matching pursuit with damping factors, a new sinusoidal model improving the matching pursuit, for the codecs based on sinusoidal model. The proposed model defines damping factors by using a correlativity of parameters between the current and adjacent frame, and estimates sinusoidal parameters more accurately in analysis frame by using the matching pursuit according to damping factor, and synthesizes the final signal. Then it is possible to model efficiently without interpolation schemes. The proposed sinusoidal model shows a better speech quality without an additional delay than the conventional sinusoidal model with interpolation methods. Through the SNR(signal to noise ratio), the MOS(Mean Opinion Score), LR(Itakura-Saito likelihood ratio), and CD(cepstral distance), we compare the performance of our model with that of matching pursuit using interpolation methods.

Convergence Characteristics of Ant Colony Optimization with Selective Evaluation in Feature Selection (특징 선택에서 선택적 평가를 사용하는 개미 군집 최적화의 수렴 특성)

  • Lee, Jin-Seon;Oh, Il-Seok
    • The Journal of the Korea Contents Association
    • /
    • v.11 no.10
    • /
    • pp.41-48
    • /
    • 2011
  • In feature selection, the selective evaluation scheme for Ant Colony Optimization(ACO) has recently been proposed, which reduces computational load by excluding unnecessary or less promising candidate solutions from the actual evaluation. Its superiority was supported by experimental results. However the experiment seems to be not statistically sufficient since it used only one dataset. The aim of this paper is to analyze convergence characteristics of the selective evaluation scheme and to make the conclusion more convincing. We chose three datasets related to handwriting, medical, and speech domains from UCI repository whose feature set size ranges from 256 to 617. For each of them, we executed 12 independent runs in order to obtain statistically stable data. Each run was given 72 hours to observe the long-time convergence. Based on analysis of experimental data, we describe a reason for the superiority and where the scheme can be applied.

Effects of Respiration and Oral Motor Training based on Musical Elements and Singing on Voice of Healthy Elderly (음악요소와 노래 부르기를 활용한 호흡 및 구강훈련이 정상노인의 음성에 미치는 영향)

  • Jun, Hee-Un;Kim, Soo-Ji
    • The Journal of the Korea Contents Association
    • /
    • v.11 no.10
    • /
    • pp.380-387
    • /
    • 2011
  • This study was to investigate the effects of music-combined respiration and oral motor training on the voice of healthy elderly. 27 women attending a senior center in Seoul participated and were randomly assigned to the experimental (n = 16) and the control group (n = 11). Subjects attended music program(25 minutes per session) once a week for 4 weeks. For both groups, Fundamental Frequency (F0), Maximum Phonation Time (MPT) and Sequential Motion Rates (SMR) were measured using the Praat speech analysis program before and after the training. The results showed statistical significance in scores of intensity, F0, MPT, and SMR in the experimental group while only intensity was statistically significant in the control group. Considering that, the increasing life expectancy and growing number of older adults, their quality of life has been important. So this study suggests that the respiration and oral motor training would be effectively incorporated into training and services for this population.

Designing and Evaluating an Audiobook Service Model on Android Platform for the Visually-Impaired (안드로이드 플랫폼 기반 시각장애인용 음성도서 서비스 모델 구축 및 평가)

  • Jang, Won-Hong;Oh, Sam-Gyun
    • Journal of the Korean Society for information Management
    • /
    • v.32 no.2
    • /
    • pp.221-236
    • /
    • 2015
  • This paper describes the process and methodology followed in developing the Android-based LG Sangnam Audiobook service and an evaluation of its usefulness to the public. The methods included a survey of user needs, analysis of usage statistics, and user interviews. The study found that visually impaired users: 1) were greatly interested and willing to use smartphones if there were no barrier in cost and access; 2) preferred downloads to streaming services; 3) did not mind performance differences between real and TTS (text-to-speech) voices; 4) showed marked differences in book preferences according to age, 5) made about 14,000 downloads in 2014; and 6) indicated bookmarking and moving between pages and tables of content as the most important functions in using audiobooks.

The Study on Intraoral Pressure, Closure Duration and VOT During Phonation of Korean Bilabial Stop Consonants (한국어 양순 파열음 발음시 구강내압과 폐쇄기, VOT에 대한 연구)

  • 표화영;최홍식
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.7 no.1
    • /
    • pp.50-55
    • /
    • 1996
  • Acoustic analysis study was performed on 20 normal subjects by speaking nonsense syllables composed of Korean bilabial stops$(/P, P^{\star}, P^{h}/)$ and their preceding and/or following vowel /a/ (that is, $[pa, p^{\star}a, p^{h}a, apa, ap^{\star}a, ap^{h}a]$) with an ultraminiature pressure, sensor. in their mouths. Speech materials were phonated twice, once with a moderate voice, another time with a loud voice. The acoustic signal and intraoral pressure were recorded simultaneously on computer. By these procedures, we were to measure the intraoral pressure, closure duration and VOT of Korean bilabial stops, and to compare the values one another according to the intensity of phonation and the position of the target consonants. Intraoral pressure was measured by the peak intraoral pressure value of Its wave closure duration by the time interval between the onset of intraoral pressure build-up and the burst meaning the release of closure ; Voice onset time(VOT) on by the time interval between the burst and the onset or glottal vibration. Heavily aspirated bilabial stop consonant /$p^h$/ showed the highest intraoral pressure value, unaspirated /$p^{\star}$/, the second, slightly aspirated /P/, the lowest. The syllable initial bilabial stops showed higher intraoral pressure than word initial stops, and the value of loudly phonated consonants were higher than moderate consonants. The longest closure duration period was that of /$p^{\star}$/ and the shortest, /P/, and the duration was longer in word initial position and in the moderate voice. In VOT, the order of the longest to shortest was $/{p^h}/, /p/, /{p^\star}/$, and the value was shorer when the consonant was in intervocalic position and when it was phonated with a loud voice.

  • PDF

Awareness of Stroke Warning Symptoms and Related Factors among Residents in a Province (일개 광역시 지역주민의 뇌졸중 조기증상 인식도와 관련요인)

  • Lee, Yu-Mi;Kim, Keon-Yeop;Kim, Ki-Su
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.15 no.8
    • /
    • pp.5116-5123
    • /
    • 2014
  • This study examined the awareness levels of stroke warning symptoms and the related factors among residents in a province and this paper presents the evidence for education and promotion strategies. The study subjects were 585 adults living in a province. The demographic factors and awareness levels of stroke symptoms were surveyed through a telephone interview. In the survey, the most frequently recognized warning symptoms were 'sudden speech disturbance (84.6%)', and 'sudden weakness of one side (73.9%)'. On the other hand, 'sudden dizziness (67.0%)', 'sudden visual impairment (55.4%)' and 'sudden severe headache (51.3%)' were less recognized. In a multiple regression analysis, male, young age, no familial history of stroke, no acquaintance history of stroke, low educational level, no exposure to promotional literature were significantly related to a low awareness level of the stroke warning symptoms. Providing customized programs will be helpful for enhancing the efficiency of promotion and education to the population with a low awareness level of stroke symptoms.