• Title/Summary/Keyword: fundamental frequency (F0)

Search Result 138, Processing Time 0.025 seconds

Acoustic parameter delta of an aspirated voice in stroke patients (뇌졸중 환자 대상 흡인 음성의 음향변수 변동)

  • Kang, Young Ae;Jee, Sung Ju;Koo, Bon Seok;Jo, Cheolwoo
    • Phonetics and Speech Sciences
    • /
    • v.9 no.3
    • /
    • pp.85-91
    • /
    • 2017
  • The present study aimed to investigate the changes of acoustic parameters of the aspirated voice in stroke patients. The eighty-eight subjects diagnosed with cerebro-vascular accident were divided into 32 penetration/aspiration (P/A) and 56 Non-P/A groups according to the videofluroscopic swallowing study (VFSS) results, and 26 control subjects participated. All subjects preformed VFSS and vowel /a/ was recorded three times pre- and post VFSS. Since the variation in the acoustic parameters within a single phonation has been observed, we proposed a delta formula for the acoustic parameters which can reflect the temporal changes of the each parameter in an utterance. We measured from the voice data eight acoustic parameters: fundamental frequency (F0), standard deviation of F0 (F0_SD), Jitter, relative average perturbation (RAP), Shimmer, amplitude perturbation quotient (APQ), harmonic to noise ration (HNR), noise to harmonic ratio (NHR). Then we found parameters which show the meaningful biggest temporal change in an utterance using the suggested delta parameter. Among them, the deltas of shimmer and APQ were significantly different pre- and post VFSS. These deltas of the P/A and the control group were increased after VFSS, while those of the Non-P/A group was descended. The variation patterns of the P/A and the control group were similar but the change width of the P/A group was larger. The large variations in an aspirated phonation of the P/A group are thought to be caused by irregular changes in air resistance due to residual food on the vocal cords.

Fabrication of a Subminiature 3 Dimensional Antenna for the Mobile Phone Handset (이동 통신 단말기용 초소형 3차원 안테나 제작)

  • Hong, Min-Gi;Son, Tae-Ho
    • The Journal of Korean Institute of Electromagnetic Engineering and Science
    • /
    • v.19 no.12
    • /
    • pp.1455-1461
    • /
    • 2008
  • We implemented a subminiature internal antenna that is around 1 cc volume for the mobile phone. The fundamental type of studied antenna is IFA(Inverted F Antenna), and this antenna is designed to be improved efficiency and gain due to minimum current cancellation by the avoidance of multiple bending pattern. For the implementation of multiple band, helix is applied to compensate for short antenna length for low frequency band, and a 3 dimensional pattern is used for high frequency band. We made two kinds of 3D structure antenna. One is a 1 cc volume antenna for GSM/DCS band on the bare board set, and the other is a 1.5 cc volume for the GSM/USPCS mobile phone set. Measurements showed good gain performance that average gain of two antenna on each band are $-3.46{\sim}-0.45\;dBi$ and $-4.80{\sim}-3.29\;dBi$ respectively.

Intervocalic Stop Voicing Revisited

  • Han, Jeong-Im
    • Speech Sciences
    • /
    • v.7 no.1
    • /
    • pp.203-216
    • /
    • 2000
  • The purpose of this study is to revisit the property of the Korean plain stops in intervocalic position. More specifically, focusing on a word-internal, intervocalic position, this study investigates 1) how often speakers pronounce intervocalic. stops as fully voiced, 2) in what amount each speaker voice the plain stops during the stop closure, 3) whether the preceding or the following vowel influences the voicing of target consonants, and 4) the fundamental frequency pattern at the vowel onset after the target consonant shows any consistent pattern, regardless of whether voicing is present during the closure. The results of this study give strong support for the phonetic account of the voicing distinction in Korean. (Jun 1995, 1996).

  • PDF

An Acoustic Study of Korean Phonation Types (한국어 발성 유형의 음향음성학적 연구)

  • Park, Han-Sang
    • The Journal of the Acoustical Society of Korea
    • /
    • v.24 no.6
    • /
    • pp.343-352
    • /
    • 2005
  • Phonation type index k (PTI In) presents a single and simplified measure of the spectral tilt. which is free from the effects of fundamental frequency and vowel qualify This study investigates PTI k with vowels /i . e. a. o, u/ obtained from 10 Korean male subjects. Specifically. this study tests the significance of differences in PTI k across Positions, Phonation types. vowels, and speakers, respectively The results showed that there was a significant difference in PTI k across positions, Phonation types, vowels. and speakers.

Phonation types of Korean fricatives and affricates

  • Lee, Goun
    • Phonetics and Speech Sciences
    • /
    • v.9 no.4
    • /
    • pp.51-57
    • /
    • 2017
  • The current study compared the acoustic features of the two phonation types for Korean fricatives (plain: /s/, fortis : /s'/) and the three types for affricates (aspirated : /$ts^h$/, lenis : /ts/, and fortis : /ts'/) in order to determine the phonetic status of the plain fricative /s/. Considering the different manners of articulation between fricatives and affricates, we examined four acoustic parameters (rise time, intensity, fundamental frequency, and Cepstral Peak Prominence (CPP) values) of the 20 Korean native speakers' productions. The results showed that unlike Korean affricates, F0 cannot distinguish two fricatives, and voice quality (CPP values) only distinguishes phonation types of Korean fricatives and affricates by grouping non-fortis sibilants together. Therefore, based on the similarity found in /$ts^h$/ and /ts/ and the idiosyncratic pattern found in /s/, this research concludes that non-fortis fricative /s/ cannot be categorized as belonging to either phonation type.

Closure Duration and Pitch as Phonetic Cues to Korean Stop Identity in AP Medial Position: Production Test

  • Kang, Hyun-Sook;Dilley, Laura
    • Speech Sciences
    • /
    • v.14 no.3
    • /
    • pp.7-19
    • /
    • 2007
  • The present study investigated some phonetic attributes which distinguish two Korean stop types $^-aspirated$ and $lax^-$ in a prosodic position which has previously received little attention, namely medial in an accentual phrase. The intonational pattern across syllables which are initial in an accentual phrase (Jun, 1993) is said to depend on the type of stop (aspirated or lax), while that of syllables which are medial in an accentual phrase are not. In Experiment 1, nine native Korean speakers read sentences with a controlled prosodic pattern in which aspirated or lax stops occurred in accentual phrase-medial position. Acoustic analysis revealed significant differences between aspirated and lax stops in closure duration, voice-onset time, and fundamental frequency (F0) values for post-stop vowels. The results indicate that a wider range of acoustic cues distinguish aspirated and lax Korean stops than previously demonstrated. Phonetic and phonological models of consonant-tone interactions for Korean will need to be revised to account for these results.

  • PDF

A Threshold Adaptation based Voice Query Transcription Scheme for Music Retrieval (음악검색을 위한 가변임계치 기반의 음성 질의 변환 기법)

  • Han, Byeong-Jun;Rho, Seung-Min;Hwang, Een-Jun
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.59 no.2
    • /
    • pp.445-451
    • /
    • 2010
  • This paper presents a threshold adaptation based voice query transcription scheme for music information retrieval. The proposed scheme analyzes monophonic voice signal and generates its transcription for diverse music retrieval applications. For accurate transcription, we propose several advanced features including (i) Energetic Feature eXtractor (EFX) for onset, peak, and transient area detection; (ii) Modified Windowed Average Energy (MWAE) for defining multiple small but coherent windows with local threshold values as offset detector; and finally (iii) Circular Average Magnitude Difference Function (CAMDF) for accurate acquisition of fundamental frequency (F0) of each frame. In order to evaluate the performance of our proposed scheme, we implemented a prototype music transcription system called AMT2 (Automatic Music Transcriber version 2) and carried out various experiments. In the experiment, we used QBSH corpus [1], adapted in MIREX 2006 contest data set. Experimental result shows that our proposed scheme can improve the transcription performance.

Variational autoencoder for prosody-based speaker recognition

  • Starlet Ben Alex;Leena Mary
    • ETRI Journal
    • /
    • v.45 no.4
    • /
    • pp.678-689
    • /
    • 2023
  • This paper describes a novel end-to-end deep generative model-based speaker recognition system using prosodic features. The usefulness of variational autoencoders (VAE) in learning the speaker-specific prosody representations for the speaker recognition task is examined herein for the first time. The speech signal is first automatically segmented into syllable-like units using vowel onset points (VOP) and energy valleys. Prosodic features, such as the dynamics of duration, energy, and fundamental frequency (F0), are then extracted at the syllable level and used to train/adapt a speaker-dependent VAE from a universal VAE. The initial comparative studies on VAEs and traditional autoencoders (AE) suggest that the former can efficiently learn speaker representations. Investigations on the impact of gender information in speaker recognition also point out that gender-dependent impostor banks lead to higher accuracies. Finally, the evaluation on the NIST SRE 2010 dataset demonstrates the usefulness of the proposed approach for speaker recognition.

Reliability of OperaVOXTM against Multi-Dimensional Voice Program to Assess Voice Quality before and after Laryngeal Microsurgery in Patient with Vocal Polyp (성대 용종 환자의 후두미세수술 전후 음성 평가에서 OperaVOXTM와 Multi-Dimensional Voice Program 간의 신뢰도 연구)

  • Kim, Sun Woo;Kim, So Yean;Cho, Jae Kyung;Jin, Sung Min;Lee, Sang Hyuk
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.31 no.2
    • /
    • pp.71-77
    • /
    • 2020
  • Background and Objectives OperaVOXTM (Oxford Wave Research Ltd.) is a portable voice analysis software package designed for use with iOS devices. As a relatively cheap, portable and easily accessible form of acoustic analysis, OperaVOXTM may be more clinically useful than laboratory-based software in many situations. The aim of this study was to evaluate the agreement between OperaVOXTM and Multi-Dimensional Voice Program (MDVP; Computerized Speech Lab) to assess voice quality before and after laryngeal microsurgery in patient with vocal polyp. Materials and Method Twenty patients who had undergone laryngeal microsurgery for vocal polyp were enrolled in this study. Preoperative and postoperative voices were assessed by acoustic analysis using MDVP and OperaVOXTM. A five-seconds recording of vowel /a/ was used to measure fundamental frequency (F0), jitter, shimmer and noise-to-harmonic ratio (NHR). Results Several acoustic parameters of MDVP and OperaVOXTM related to short-term variability showed significant improvement. While pre-operative value of F0, jitter, shimmer, NHR was 155.75 Hz (male: 125.37 Hz, female: 183.37 Hz), 2.20%, 6.28%, 0.16, post-operative values of these parameter was 164.34 Hz (male: 129.42 Hz, female: 199.26 Hz), 2.15%, 5.18%, 0.14 Hz in MDVP. While pre-operative value of F0, jitter, shimmer, NHR was 168.26 Hz (male: 135.16 Hz, female: 201.37 Hz), 2.27%, 6.95%, 0.26, post-operative values of these parameters was 162.72 Hz (male: 128.267 Hz, female: 197.18 Hz), 1.71%, 5.36%, 0.20 in OperaVOXTM. There was high intersoftware agreement for F0, jitter, shimmer with intraclass correlation coefficient. Conclusion Our results showed that the short-term variability of acoustic parameters in both MDVP and OperaVOXTM were useful for the objective assessment of voice quality in patients who received laryngeal microsurgery. OperaVOXTM is comparable to MDVP and has high intersoftware reliability with MDVP in measuring the F0, jitter, and shimmer

A Study for Acoustic Features of Benign Laryngeal Disease (양성 성대 점막 질환의 음향학적 특성에 관한 연구)

  • Lee, Jae Seok;Kim, Jin Pyeong;Park, Jeong Je;Kwon, Oh Jin;Woo, Seung Hoon
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.24 no.1
    • /
    • pp.47-50
    • /
    • 2013
  • Background and Objectives:The purpose of this study is to find features in acoustics and to learn useful features of parameters in order to distinguish laryngeal diseases through many acoustic variables. Materials and Methods:The subjects of this study were 125-male patients who had been diagnosed with vocal nodule, vocal polyp, vocal cyst, Reinke's edema, leukoplakia. To research the features of each disease in acoustics, they are measured 34 parameters by using MDVP. Results:It is clear that in order to see a meaning result when distinguishing laryngeal diseases, $F_0$, $MF_0$, $T_0$, Fhi, Flo, PER variables are significant (p<.05). It means that variables related to fundamental frequency are important to anticipate which group will be diagnosed with Reinke's edema and leukoplakia. vAm had an effect on getting a significant result in terms of amplitude perturbation parameters, which is useful to distinguish between laryngeal polyp/cyst and other laryngeal disease (p<.05). ATRI made a significant result in related to tremor parameters, which is useful to distinguish between laryngeal polyp and other laryngeal disease (p<.05). Conclusion:$F_0$, $MF_0$, $T_0$, Fhi, Flo, PER, vAm, ATRI might be meaningful parameters distinguishing pathologic from benign laryngeal diseases. Especially, the vAm and ATRI are an important factor when forecasting which group would be diagnosed with vocal polyp.

  • PDF