• Title/Summary/Keyword: acoustic features

Search Result 323, Processing Time 0.025 seconds

Phonation types of Korean fricatives and affricates

  • Lee, Goun
    • Phonetics and Speech Sciences
    • /
    • v.9 no.4
    • /
    • pp.51-57
    • /
    • 2017
  • The current study compared the acoustic features of the two phonation types for Korean fricatives (plain: /s/, fortis : /s'/) and the three types for affricates (aspirated : /$ts^h$/, lenis : /ts/, and fortis : /ts'/) in order to determine the phonetic status of the plain fricative /s/. Considering the different manners of articulation between fricatives and affricates, we examined four acoustic parameters (rise time, intensity, fundamental frequency, and Cepstral Peak Prominence (CPP) values) of the 20 Korean native speakers' productions. The results showed that unlike Korean affricates, F0 cannot distinguish two fricatives, and voice quality (CPP values) only distinguishes phonation types of Korean fricatives and affricates by grouping non-fortis sibilants together. Therefore, based on the similarity found in /$ts^h$/ and /ts/ and the idiosyncratic pattern found in /s/, this research concludes that non-fortis fricative /s/ cannot be categorized as belonging to either phonation type.

A Study on Feature Extraction of Transformers Aging Signal using discrete Wavelet Transform Technique (이산 웨이블렛 변환 기법을 이용한 변압기 열화신호의 특징추출에 관한 연구)

  • Park, Jae-Jun;Kwon, Dong-Jin;Song, Yeong-Cheol;Ahn, Chang-Beom
    • The Transactions of the Korean Institute of Electrical Engineers C
    • /
    • v.50 no.3
    • /
    • pp.121-129
    • /
    • 2001
  • In this paper, a new efficient feature extraction method based on Daubechies discrete wavelet transform is presented. This paper especially deals with the assessment of process statistical parameter using the features extracted from the wavelet coefficients of measured acoustic emission signals. Since the parameter assessment using all wavelet coefficients will often turn out leads to inefficient or inaccurate results, we selected that level-3 stage of multi decomposition in discrete wavelet transform. We make use of the feature extraction parameter namely, maximum value of acoustic emission signal, average value, dispersion, skewness, kurtosis, etc. The effectiveness of this new method has been verified on ability a diagnosis transformer go through feature extraction in stage of aging(the early period, the middle period, the last period)

  • PDF

Time-Frequency Domain Analysis of Acoustic Signatures Using Pseudo Wigner-Ville Distribution

  • Jeon, Jae-Jin
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06a
    • /
    • pp.674-679
    • /
    • 1994
  • Acoustic signal such as speech and scattered sound, are generally a nonstationary process whose frequency contents vary at any instant of time. For time-varying signal, whether a nonstationary or a deterministic transient signal, a traditional frequency domain representation does not reveal the contents of signal characteristics and may lead to erroneous results such as the loss of desired characteristics features or the mis-interpretation for a wrong conclusion. A time-frequency domain representation is needed to characterize such signatures. Pseudo Wigner-Ville distribution (PWVD) is ideally suited for portraying nonstationary signal time-frequency domain and carried out by adapting the fast Fourier transform algorithm. In this paper, the important properties of PWVD were investigated using both stationary and nonstationry signatures by numerical examples PWVD was applied to acoustic sigtnatures to demonstrate its application for time-ferquency domain analysis.

  • PDF

Korean speakers hyperarticulate vowels in polite speech

  • Oh, Eunhae;Winter, Bodo;Idemaru, Kaori
    • Phonetics and Speech Sciences
    • /
    • v.13 no.3
    • /
    • pp.15-20
    • /
    • 2021
  • In line with recent attention to the multimodal expression of politeness, the present study examined the association between polite speech and acoustic features through the analysis of vowels produced in casual and polite speech contexts in Korean. Fourteen adult native speakers of Seoul Korean produced the utterances in two social conditions to elicit polite (professor) and casual (friend) speech. Vowel duration and the first (F1) and second formants (F2) of seven sentence- and phrase-initial monophthongs were measured. The results showed that polite speech shares acoustic similarities with vowel production in clear speech: speakers showed greater vowel space expansion in polite than casual speech in an effort to enhance perceptual intelligibility. Especially, female speakers hyperarticulated (front) vowels for polite speech, independent of speech rate. The implications for the acoustic encoding of social stance in polite speech are further discussed.

Feasibility Study on Surface Microcrack Detection of the Steel Wire Rods Using Electromagnetic Acoustic Resonance (전자기 음향 공진을 이용한 강선의 표면 미세 결함 탐상 타당성 연구)

  • Heo, Taehoon;Cho, Seung Hyun;Ahn, Bongyoung;Lim, Zhong Soo
    • Journal of the Korean Society for Nondestructive Testing
    • /
    • v.33 no.1
    • /
    • pp.7-13
    • /
    • 2013
  • The surface microcrack over a few tens of micrometers is one of severe problems of a steel wire rod to lead to the failure of the final products, so the method to evaluate crack depth has been required to develop. This work investigates the feasibility of electromagnetic acoustic resonance (EMAR) for this problem. EMAR is the method for measurement of resonant features using electromagnetic acoustic transducer (EMAT). Generally, EMAR is sensitive to small variation of the structures and easy to apply it to the industrial field because of the feature of noncontact measurement. Through several EMAR experiments, the change of the resonant frequencies and attenuation in reverberation has been observed. The results confirms that the surface cracks of around 100 micrometer depth can be detected successfully with the present method.

SPEECH TRAINING TOOLS BASED ON VOWEL SWITCH/VOLUME CONTROL AND ITS VISUALIZATION

  • Ueda, Yuichi;Sakata, Tadashi
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2009.01a
    • /
    • pp.441-445
    • /
    • 2009
  • We have developed a real-time software tool to extract a speech feature vector whose time sequences consist of three groups of vector components; the phonetic/acoustic features such as formant frequencies, the phonemic features as outputs on neural networks, and some distances of Japanese phonemes. In those features, since the phoneme distances for Japanese five vowels are applicable to express vowel articulation, we have designed a switch, a volume control and a color representation which are operated by pronouncing vowel sounds. As examples of those vowel interface, we have developed some speech training tools to display a image character or a rolling color ball and to control a cursor's movement for aurally- or vocally-handicapped children. In this paper, we introduce the functions and the principle of those systems.

  • PDF

Characteristics of 2 to 4 year old Korean children's production of monophthongs and diphthongs (만 2-4세 한국 아동의 단모음과 이중모음 산출 특징)

  • Song, Inmi;Seong, Cheoljae
    • Phonetics and Speech Sciences
    • /
    • v.10 no.1
    • /
    • pp.65-74
    • /
    • 2018
  • The purpose of this study is to investigate age-specific features of 2;1- to 4;1-year -olds' production of monophthongs and diphthongs through both auditory perceptual analysis and acoustic analysis. Test material included {vowel+'da'} consisting of 7 monophthongs and 10 diphthongs and meaningful words beginning with vowels. The percentage of correct vowels was used for perceptual analysis and Praat(5.2.12) was used for acoustic analysis, analyzing variables related to monophthongs and diphthongs. The results of this study are as follows: First, perceptual analysis showed that children from an age group of 2;1 to 2;8 years showed significant difference in the accuracy level of both monophthongs and diphthongs as compared to those aged 2;9 to 3;4 years and those aged 3;5 to 4;1 years. Second, the results of acoustic analysis provided that formant (F1 and F2) of monophthong, in general, tended to decrease as age increased. In terms of F2 differentiation slope and regression slope, which were diphthong-related variables, the age group of 3;5 to 4;1 years showed a large general slope change.

Closed-Loop Power Control for Code Division Multiple Access in Time-Varying Underwater Acoustic Channel (시변 수중 음향 채널에서 코드 분할 다중 접속 방식의 폐루프 전력 제어 기법)

  • Seo, Bo-Min;Cho, Ho-Shin
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.52 no.12
    • /
    • pp.32-40
    • /
    • 2015
  • Code division multiple access (CDMA) is one of the promising medium access control scheme for underwater acoustic sensor networks due to its beneficial features such as robustness against frequency-selective fading and high frequency-reuse efficiency. In this paper, we design a closed-loop power control scheme for the underwater CDMA, to adapt time-varying acoustic channel. In the proposed scheme, sink node sends to sensor nodes the associated path loss which is acquired by uplink-channel analysis based on received packets from the sensor nodes. Then, sensor nodes adjust their transmission power in an adaptive manner to time-varying underwater acoustic channel, according to the informations sent by the sink node.

Computation of Laryngeal Flow and Sound through a Dynamic Model of the Vocal Folds (동적 성대 모델을 이용한 후두 내 유동 및 음향장에 대한 수치 연구)

  • Bae, Young-Min;Moon, Young-J.
    • 한국전산유체공학회:학술대회논문집
    • /
    • 2008.03b
    • /
    • pp.21-24
    • /
    • 2008
  • The present study numerically investigates the glottal airflow characteristics as well as acoustic features of phonation fully coupled with dynamic behavior of vocal folds. The vocal folds are described by a low-dimensional body-covered model characterized by bio-mechanical parameters such as glottal width, vocal folds stiffness, and subglottal pressure. The flow in the vocal tract is modeled as an incompressible, axisymmetric form of the Navier-Stokes equations (INS), while the acoustic field is predicted by the linearized perturbed compressible equations (LPCE). The computed result shows that a two-mass model of vocal folds is sufficient to reproduce temporal variations in oral airflow and glottis motion produced by female speakers. It is also found that i) the glottal width has a significant effect on the amplitude of glottal flow, and thus on the amplitude of acoustic wave in the vocal tract, ii) the vocal fold tension is the main control parameter for the fundamental frequency of phonation, iii) the subglottal pressure plays an appreciable role on reproduction of the self-sustained oscillation of vocal folds, and iv) the strength of pulsating airflow and vortical structures are primarily affected by glottal width and subglottal pressure, and are closely related to pitch, loudness, and voice quality. Finally, more comprehensive explanation about the difference between one- and two-mass models is presented with discussion of effectiveness of vocal folds oscillation and voice quality.

  • PDF

Acoustic Characteristics of 'Short Rushes of Speech' using Alternate Motion Rates in Patients with Parkinson's Disease (파킨슨병 환자의 교대운동속도 과제에서 관찰된 '말 뭉침'의 음향학적 특성)

  • Kim, Sun Woo;Yoon, Ji Hye;Lee, Seung Jin
    • Phonetics and Speech Sciences
    • /
    • v.7 no.2
    • /
    • pp.55-62
    • /
    • 2015
  • It is widely accepted that Parkinson's disease(PD) is the most common cause of hypokinetic dysarthria, and its characteristics of 'short rushes of speech' have become more evident along with the severity of motor disorders. Speech alternate motion rates (AMRs) are particularly useful for observing not only rate abnormalities but also deviant speech. However, relatively little is known about the characteristics of 'short rushes of speech' in terms of AMRs of PD except for the perceptual characteristics. The purpose of this study was to examine which acoustic features of 'short rushes of speech' in terms of AMRs are a robust indicator of Parkinsonian speech. Numbers of syllabic repetitions (/pə/, /tə/, /kə/) in AMR tasks were analyzed through acoustic methods observing a spectrogram of the Computerized Speech Lab in 9 patients with PD. Acoustically, we found three characteristics of 'short rushes of speech': 1) Vocalized consonants without closure duration(VC) 76.3%; 2) No consonant segmentation(NC) 18.6%; 3) No vowel formant frequency(NV) 5.1%. Based on these results, 'short rushes of speech' may affect the failure to reach and maintain the phonatory targets. In order to best achieve the therapeutic goals, and to make the treatment most efficacious, it is important to incorporate training methods which are based on both phonation and articulation.