• 제목/요약/키워드: formant characteristics

검색결과 128건 처리시간 0.023초

폐쇄자음군의 폐쇄구간 축소에 따른 위치성 지각에 대한 재해석 (Reinterpretation of the Perception of Place Cues in the Reduced Closure Duration of Stop Consonant Clusters)

  • 이석재
    • 대한음성학회지:말소리
    • /
    • 제45호
    • /
    • pp.1-14
    • /
    • 2003
  • This paper criticizes S. Kim (1992), claiming that the perception of place cues in the reduced stop consonant clusters ('reducing' means 'cutting off' the acoustic silence in stop clusters) largely depends on the acoustic characteristics such as formant transition and noise frequency distribution of stop burst, rather than the closure duration time as advocated by S. Kim (1992). The claim is based on the perception test conducted upon 111 stimuli over 10 subjects. The finding is that, when the closure duration is cut off up to the point where only one stop is perceived, place of the second stop, not the first one, in the cluster is in most cases perceived regardless of the places of the first and second stops. It is likely that the place cues of the stop in the prevocalic position mask those in the postvocalic position.

  • PDF

정상 모음과 비음에 대한 음성 음향학적 비교 (ACOUSTIC CHARACTERISTICS OF NORMAL AND NASAL VOWELS)

  • 김광현;성명훈;오승하;강명구;김진영
    • 대한기관식도과학회:학술대회논문집
    • /
    • 대한기관식도과학회 1991년도 제25차 학술대회 연제순서 및 초록
    • /
    • pp.21-21
    • /
    • 1991
  • 구개열이나 구개부전이 있는 경우, 흔히 비 정상적인 비음을 접할 수 있는데 이러한 구조이상과 구음이상간의 관계를 밝히는데 있어서 비음에 대한 객관적인 평가방법이 요구된다. 여러 객관적 지표로서의 가능성 중에 음향학적 분석이 많이 시행되었는데 이 방법도 개인별 공명구조의 차이에 기인한 변동이 심하므로 그 효용의 관계가 있었다. 저자들은 이러한 개인별 차이를 줄이기 위해 정상 성인 남자 20명에서 한국 모음 /a/, /i/, /u/, /e/, /o/에 대하여 각각 정상 발음, 인공적으로 일으킨 비폐색, 비개방음을 음향학적으로 분석하여 보았다. 결과, Formant의 변화를 관찰하였고 비음의 객관적 지표로써 이용 가능성을 알 수 있었다.

  • PDF

음성발생 모델로부터의 G-peak를 이용한 음성에너지 추출에 관한 연구 (A Study on the Energy Extraction Using G-peak from the Speech Production Model)

  • 배명진;임재열;안수길
    • 대한전자공학회논문지
    • /
    • 제24권3호
    • /
    • pp.381-386
    • /
    • 1987
  • By the speech production model, the first positive peak in a pitch interval of the voiced speech is mainly affected by the glottis and the first formant component, known as a typical energy source of the voiced speech. From these characteristics, the energy parameter can be replaced by the area of the area of the positve peak in a pitch interval, which parameter is generally used for classification of speech signals. In this method, the changed energy parameter is independent of window length applied for analysis, and the pitch can be extracted smultaneously. Furthermore, the energy can be extracted in the pitch period unit.

  • PDF

청각장애학생의 영어 발성 주파수별 특징 분석 (Feature analysis of deaf students' English language by frequency)

  • 이근민;박혜정
    • Journal of the Korean Data and Information Science Society
    • /
    • 제25권4호
    • /
    • pp.819-828
    • /
    • 2014
  • 본 논문에서는 청각장애학생들의 영어 발성의 특징을 분석하여 그 특징들을 반영할 수 있는 맞춤형 영어 학습 보조 도구를 개발하기 위한 기초자료를 제시하고자 한다. 본 논문에서는 청각장애학생들의 영어 발성의 특징을 분석하기 위해서 서울과 대구에 있는 청각특수학교의 학생들을 대상으로 직접 방문하여 녹음하였으며, 음성파일을 분석하기 위해 음성분석 전문 프로그램인 플라트 프로그램을 활용하였다. 청각장애학생들의 영어 발성의 특징은 플라트 프로그램을 통해 음성학에서 사용하는 음성의 특징 값들을 추출하여, 그 특징 값들을 이용하여 비장애학생의 영어 발성의 특징과 비교분석하였다.

음성합성시스템을 위한 음색제어규칙 연구 (A Study on Voice Color Control Rules for Speech Synthesis System)

  • 김진영;엄기완
    • 음성과학
    • /
    • 제2권
    • /
    • pp.25-44
    • /
    • 1997
  • When listening the various speech synthesis systems developed and being used in our country, we find that though the quality of these systems has improved, they lack naturalness. Moreover, since the voice color of these systems are limited to only one recorded speech DB, it is necessary to record another speech DB to create different voice colors. 'Voice Color' is an abstract concept that characterizes voice personality. So speech synthesis systems need a voice color control function to create various voices. The aim of this study is to examine several factors of voice color control rules for the text-to-speech system which makes natural and various voice types for the sounding of synthetic speech. In order to find such rules from natural speech, glottal source parameters and frequency characteristics of the vocal tract for several voice colors have been studied. In this paper voice colors were catalogued as: deep, sonorous, thick, soft, harsh, high tone, shrill, and weak. For the voice source model, the LF-model was used and for the frequency characteristics of vocal tract, the formant frequencies, bandwidths, and amplitudes were used. These acoustic parameters were tested through multiple regression analysis to achieve the general relation between these parameters and voice colors.

  • PDF

모음-자음-모음 연결에서 자음의 조음특성과 모음-모음 동시조음 (Consonantal Production and V-to-V Coarticulation in Korean VCV Sequences)

  • 신지영
    • 음성과학
    • /
    • 제1권
    • /
    • pp.55-81
    • /
    • 1997
  • In the present paper, V-to-V coarticulation in Korean VCV sequences is discussed, focusing on links between consonantal production and degree of V-to-V coarticulation. Temporal and spatial differences between three types of Korean alveolar stops (lax /t/. aspirated /$t^h$/ and thense /t'/) are examined from VCV sequences involving all possible combinations of three Korean unrounded vowels /a, i,/ based on spectrographic and electrographic data(two male speakers and one female speaker and one female speaker respectively). Closure duration and voice onset time (VOT) were measured from acoustic data. 'Total duration', which is defined as the sum of the closure duration and the VOT, was also calculated in order to see the temporal distance between two vowels in a VCV sequence. Differences in lingual-palatal contact pattern at the maximum contact (MC) point between the three types of stop were observed from EPG data. V-to-V coarticulation was investigated by measuring the offset or onset of the second formant (F2) of the target vowels from spectrograms. Two different dimensions of articulation, temporal and spatial, seem to playa role in determining the degree of V-to-V coarticulation. The degree of V-to-V anticipatory coarticulation is influenced by the spatial characteristics of the intervening consonant while the degree of carryover coarticulation is influenced by the temporal characteristics of the consonant.

  • PDF

인체 내부에서의 진동 전달특성 분석 (An Analysis of the Vibration Characteristics through the Human Body)

  • 전종원;진용옥
    • 한국음향학회지
    • /
    • 제19권7호
    • /
    • pp.59-65
    • /
    • 2000
  • 본 논문은 음성진단이나 치료를 위한 기초연구로서, 인체의 진동신호를 측정하여 그 특성을 분석한 것이다. 가진신호는 외부적인 힘이 아닌 자신의 음성이며, 진동과 공진 특성이 강한 모음 '아', '에', '이', '오', '우'를 적용하여 실험하였다. 실험장치로는 마이크로폰과 가속도계 그리고 증폭기를 이용하였으며 컴퓨터에 측정 데이터를 저장하였다. 마이크로폰으로 음성신호를 저장하면서 동시에 가속도계를 이용하여 인체 각 부위에서의 진동신호를 측정하였으며 측정 위치는 머리, 목, 몸체를 일정한 간격으로 나누어 총 63개의 위치로 정하였다. 진동 신호의 측정 위치와 횟수는 사용 목적에 따라 충분히 가변적일 수 있다. 진동 분석을 위한 파라미터는 진동 신호의 크기, 위상, 기본 진동수, 결집음폭대이며, 코히어런스 함수를 이용하여 인체의 진동신호와 음성과의 상관성을 알아보았다. 실험결과, 인체의 위치에 따라 독특한 특징들이 있음을 확인하였으며, 그 결과를 제시하였다.

  • PDF

여성 뉴스 앵커의 발성 특성 분석 (The Characteristics of the Vocalization of the Female News Anchors)

  • 견두헌;배명진
    • 한국음향학회지
    • /
    • 제30권7호
    • /
    • pp.390-395
    • /
    • 2011
  • 본 논문은 각 방송사의 메인 뉴스인 평일 저녁뉴스의 여성 앵커 음성 분석을 통하여 공통적인 음성 요소와 각 방송국별 상대적인 음성 및 음향에는 어떠한 차이가 있는지 연구하였다. 전반적인 음성 특성을 알아보기 위해 6가지 요소를 분석한 결과, 각 방송국별 아나운서는 발화속도를 제외하면 모든 영역에서 뚜렷한 음성 및 발성 특징을 가지고 있었으며 음향시스템적인 차이도 발견 되었다. 주요 분석 요소는 기본 피치 외에 제1포만트와 피치비율에 따른 음색과 피치대역폭을 통한 정감도 요소, 피치대역폭내의 평균피치위치를 통한 문장 끝맺음 양상, 평균 발화속도, 주파수 대역별 에너지 분포를 통한 음향적인 음색 분석이다. 분석된 수치 및 결과는 국내 여성 앵커의 발성 특징의 기준으로 참고 및 활용될 수 있다.

개교환자의 발성에 관한 언어 음성학적 연구 (A SPEECH-PHONETIC STUDY ON THE PRONUNCIATION OF THE OPENBITE PATIENTS)

  • 김기달;양원식
    • 대한치과교정학회지
    • /
    • 제21권2호
    • /
    • pp.287-307
    • /
    • 1991
  • This study aimed at examining speech defects of openbite patients, which were analized in terms of formant frequency for vowels and word pronunciation length for consonants. In addition, the upper and lower lip (perioral m.) activity was tested by the EMG. The tongue force was measured by the strain gauge, and the speech discrimination test was carried out. One experimental group and one control group were used for this study and they were respectively composed of six female openbite patients and six normal-occlusion females. Eight monophthongs, two fricatives and two affricatives were chosen for speech analysis. Speeches of the above-mentioned groups were recorded and then analized by the ILS/PC-1 software. Four hundred most frequently used monosyllables were also chosen for discrimination score. Openbite patients showed the following characteristics: 1. Abnormality in case of /a/, $/\varepsilon/$, /e/, /i/ $F_2$ and /e/, /a/ $F_1$. 2. Significantly elongated length in their pronunciation of /h/ and $/C^h/$ and somewhat elongated length also in their pronunciation of /s/ and /c/. 3. Significant upper lip activity according to the EMG test during pronunciation of the bilabial consonants. 4. Relatively weak tongue force according to the strain gauge measurement. 5. According to the speech discrimination test, high rate of misarticulation in case of (a) initial /p/ /s'/ and /ts'/, (b) /a/,$/\varepsilon/$,/e/,/je/,/o/, $/\phi/$,/jo/,/u/,/we/, and /i/ (c) final (equation omitted).

  • PDF

운동실조형 마비성구음장애에 적용되는 지각적, 음향학적, 생리학적 도구에 관하여 - 환자사례를 중심으로 - (Perceptual, Acoustical, and Physiological Tools in Ataxic Dysarthria Management: A Case Report)

  • 김향희
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 1996년도 2월 학술대회지
    • /
    • pp.9-22
    • /
    • 1996
  • Among the various dysarthric subtypes, diagnosis of ataxic dysarthria is rendered when the speech characteristics include imprecise and irregular articulatory breakdowns, marked degree of speech rate impairment, overall monopitch and monoloudness, and respiratory-articulatory incoordination. Traditionally, speech pathologists have relied only upon their ‘ears’ to describe and evaluate the dysarthric speech. A statement of percentage of correct words identified by a listener do not provide so much more than an index of severity. Within the same perceptual dimension, a carefully constructed speech intelligibility test can specify patterns of errors. The patterns can contain a diagnostic value as well as Provide strategies for remediation. The phonetically transcribed texts on single words and a standard passage, 'kail' produced by an ataxic dysarthria are presented in this report, with an emphasis of the articulatory error analysis. Furthermore,, acoustic tools [e.g., spectrography to measure formant transitions, segment durations, consonant spectra, etc.] are utilized to serve as basic measures that objectively document patients' speech intelligibility, Finally, the treatment methods [e.g., spectrography as a visual feedback, gestural reorganization using pacing method, DAF (Delayed Auditory Feedback)] to modify the dysarthric behaviors are presented.

  • PDF