• Title/Abstract/Keywords: 4 Formant Frequency

71 results (processing time 0.028 s)

Line Spectral Frequency와 음성신호의 주파수 분포에 관한 연구 (A Study on the Relation Between the LSF's and Spectral Distribution of Speech Signals)

  • 이동수;김영화
    • 대한전자공학회논문지 / Vol. 25, No. 4 / pp.430-436 / 1988
  • LSF (Line Spectral Frequency) parameters derived from LPC are known to be very useful transmission parameters for speech signals, because they have good linear interpolation characteristics and low spectral distortion in low-bit-rate coding. This paper shows that the formant frequencies of speech signals can be extracted directly from the LSF parameters, without applying an FFT algorithm, by comparing the distribution of the LSF parameters with the frequency distribution of the analysis filter. It also proposes an improved algorithm that speeds up convergence of the analytic solution method. In addition, for flexibility in the choice of parameters, a procedure for transforming LSF into LPC is presented.
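
The abstract above mentions an LSF-to-LPC transformation without giving details. The following is a minimal sketch of the textbook conversion for an even LPC order, not the paper's own procedure: the function names `poly_mul` and `lsf_to_lpc` are hypothetical, and the LSFs are assumed to be given in radians in the interval (0, pi).

```python
import math

def poly_mul(x, y):
    """Multiply two polynomials given as coefficient lists in powers of z^-1."""
    out = [0.0] * (len(x) + len(y) - 1)
    for i, xi in enumerate(x):
        for j, yj in enumerate(y):
            out[i + j] += xi * yj
    return out

def lsf_to_lpc(lsf):
    """Rebuild A(z) = 1 + a1 z^-1 + ... + ap z^-p from p line spectral frequencies.

    Assumes an even order p. The sorted LSFs alternate between the symmetric
    polynomial P(z) (trivial root at z = -1) and the antisymmetric polynomial
    Q(z) (trivial root at z = +1), and A(z) = (P(z) + Q(z)) / 2.
    """
    lsf = sorted(lsf)
    if len(lsf) % 2 != 0:
        raise ValueError("this sketch handles even LPC orders only")
    p_poly = [1.0, 1.0]   # factor (1 + z^-1)
    q_poly = [1.0, -1.0]  # factor (1 - z^-1)
    for k, w in enumerate(lsf):
        # Each LSF contributes a conjugate root pair on the unit circle.
        factor = [1.0, -2.0 * math.cos(w), 1.0]
        if k % 2 == 0:
            p_poly = poly_mul(p_poly, factor)
        else:
            q_poly = poly_mul(q_poly, factor)
    a = [0.5 * (p + q) for p, q in zip(p_poly, q_poly)]
    return a[:-1]  # the z^-(p+1) coefficient cancels to zero

# Second-order example: these two LSFs correspond to A(z) = 1 - z^-1 + 0.5 z^-2.
coeffs = lsf_to_lpc([math.acos(0.75), math.acos(0.25)])
```

The reverse direction (LPC to LSF) requires finding the roots of P(z) and Q(z), which is the step where a convergence-speed improvement of the kind the abstract describes would apply.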

4세, 5세, 6세 정상 아동의 한국어 단모음 발달 (Korean Monophthong Development in Normal 4-, 5-, and 6-Years-Olds)

  • 강은영
    • 대한통합의학회지 / Vol. 7, No. 4 / pp.89-104 / 2019
  • Purpose : The purpose of this study was to investigate the development of Korean vowels by acoustically analyzing whether children between the ages of 4 and 6 produce Korean vowels differently according to age and gender. Methods : A total of 104 children aged 4~6 years (56 males and 48 females) participated in this study. The participants were classified as 4, 5, or 6 years old. Vowel speech data were obtained by asking the subjects to pronounce meaningful words in which the target vowel was located in the first syllable. Speech analysis was performed using the Multi-speech 3700 program. Results : Age, gender, and vowel all had significant effects on intensity: intensity decreased significantly with increasing age and was significantly higher in male children than in female children. Neither age, gender, nor vowel affected the fundamental frequency. Age and vowel had significant effects on the first and second formants, which decreased significantly with age and showed no gender difference. Conclusion : The results of this study suggest that children aged 4~6 have similar anatomical structures, but that the maturity of the speech motor skills required to pronounce vowels is correlated with age. The results can be used to evaluate children's speech and to develop speech therapy programs.

구개상의 두께에 따른 한국어 자음의 발음 변화에 관한 컴퓨터 분석 - 치조음, 경구개음- (A COMPUTER ANALYSIS ON THE KOREAN CONSONANT SOUND DISTORTION IN RELATION TO THE PALATAL PLATE THICKNESS -Dentoalveolar and hard palatal consonant-)

  • 우이형;최대균;최부병;박남수
    • 대한치과보철학회지 / Vol. 25, No. 1 / pp.71-94 / 1987
  • This study was carried out to investigate sound distortion following alteration of the palatal plate thickness. Two healthy male subjects (24 years old) were selected. Both were born in Seoul and spoke the Seoul dialect. First, their utterances of /na(나)/, /da(다)/, /la(라)/, /ja(자)/, /cha(차)/, and /ta(타)/ were recorded without plates inserted, and then the same utterances were recorded with palatal plates of different thicknesses. Three types of plate were fabricated, named B-, C-, and D-type in that order: a palatal thickness of 1.0 mm, a palatal thickness of 2.5 mm, and a dentoalveolar portion of 2.5 mm with the remaining portion at 1.0 mm. A series of analyses was run on a 16-bit computer to analyze the sound distortions. The recordings were analyzed by LPC (without weighting, pre-weighting, post-weighting) of the consonant and vowel portions, by the formant frequencies of the vowels, and by the word duration of the consonants. The findings led to the following conclusions: 1. There was no correlation between the distortion rates of the 2 informants. 2. In general, vowels were not affected by palatal plate thickness in the formant analysis; however, more distortion was detected in the LPC analysis, especially with the C- and D-type plates. 3. Consonant distortion was more evident with the C- and D-type plates. 4. The second formant was the most disturbed and was reduced in all consonants when a palatal plate was inserted, especially the C- and D-type plates. 5. Word duration was shortened with a plate inserted (except for /ja/ and /cha/), especially the C- and D-type. 6. Dentoalveolar and hard palatal sounds were severely distorted with a plate inserted, and they were mainly affected by the thickness of the dentoalveolar portion. 7. There was a correlation between palatal thickness and consonant quality.

벅아이 코퍼스 오류 수정과 코퍼스 활용을 위한 프랏 스크립트 툴 (Error Correction and Praat Script Tools for the Buckeye Corpus of Conversational Speech)

  • 윤규철
    • 말소리와 음성과학 / Vol. 4, No. 1 / pp.29-47 / 2012
  • The purpose of this paper is to show how to convert the label files of the Buckeye Corpus of Spontaneous Speech [1] into Praat format and to introduce some of the Praat scripts that will enable linguists to study various aspects of spoken American English present in the corpus. During the conversion process, several types of errors were identified and corrected either manually or automatically by the use of scripts. The Praat script tools that have been developed can help extract from the corpus massive amounts of phonetic measures such as the VOT of plosives, the formants of vowels, word frequency information and speech rates that span several consecutive words. The script tools can extract additional information concerning the phonetic environment of the target words or allophones.

MRI에 의한 모음의 성도 단면적 측정 및 면적 변이에 따른 합성 연구 (Measurement of the vocal tract area of vowels By MRI and their synthesis by area variation)

  • 양병곤
    • 음성과학 / Vol. 4, No. 1 / pp.19-34 / 1998
  • The author collected and compared midsagittal, coronal, coronal oblique, and transversal images of Korean monophthongs /a, i, e, o, u, i, v/ produced by a healthy male speaker using 1.5 T MR, VISION. Area was measured by computer software after tracing the cross-section at different points along the tract. Results showed that the widths of the oral and pharyngeal cavities varied in a compensatory manner relative to each other on the midsagittal dimension. Formant frequency values estimated from the area functions of the seven vowels showed a strong correlation (r=0.978) with those analyzed from the spoken vowels. Moreover, almost all of the 35 students who listened to the vowels synthesized from the area data perceived them as equivalent to the spoken ones. Moving the constriction points of vowel /u/ with a wider lip opening made it sound like /i/ and led to slight changes in vowel quality. Jaw and tongue movement led to major volume variation within an anatomical limitation. Each corner vowel varied systematically from a somewhat constant volume of the average area. Thus, the author proposed that any simulation study of vocal tract area variation should reflect this constant volume. The results may help verify the exact measurement of the vocal tract area through vowel synthesis and a simulation study before any operation on the vocal tract.

후두 전적출술후 MR영상을 이용한 음성재활환자의 발성기전에 관한 연구 (Mechanism of Vowel Phonation in T-E Shunt Patient using MR Imaging after Total Laryngectomy)

  • 박병래
    • 대한방사선기술학회지:방사선기술과학 / Vol. 20, No. 1 / pp.21-27 / 1997
  • Total laryngectomy has become a usual treatment for advanced carcinoma of the larynx, but most patients who have undergone total laryngectomy show permanent disability in voice production. I compared the first three formant frequencies estimated from MRI with those measured directly from speech data of T-E shunt patients and normal subjects. The aim was to estimate the accuracy of MRI and to compare the vocal tract shape of normal subjects with that of T-E shunt patients. The results were as follows: 1. The midsagittal section of the MRI represents the vocal tract well during phonation. The vocal tract of the T-E shunt patients lacks the pharyngeal space and the space above the glottis. 2. The length of the normal subject's vocal tract is 17 cm. For the T-E shunt patients, the length from the lips to the shunt opening is 17.5 cm in case 1 and 18.5 cm in case 2. The length of the true resonance chamber is 13 cm and 13.5 cm, respectively. 3. The T-E shunt patients phonated with a strained voice. The intensity of the higher formant frequencies decreased, especially in /o/ and /u/. 4. The vocal tract is shortened during phonation by T-E shunt patients. For /e/ and /i/, the front cavities are constricted while the back cavities are shortened. 5. The pseudoglottis of the T-E shunt patients is located 14~15 cm below the lips.
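
To give a feel for how the tract lengths reported above relate to formant frequencies, here is a rough sketch using the uniform closed-open tube approximation, F_n = (2n - 1) c / (4 L). This is not the MRI-based estimation used in the paper: the function name `tube_formants` is hypothetical, a speed of sound of 350 m/s is assumed, and a real vocal tract is not a uniform tube.

```python
def tube_formants(length_m, n_formants=3, speed_of_sound=350.0):
    """Resonances (Hz) of a uniform tube closed at one end and open at the other.

    length_m: tube length in metres (assumed to stand in for vocal tract length).
    speed_of_sound: assumed value in m/s for warm, humid air.
    """
    return [(2 * n - 1) * speed_of_sound / (4.0 * length_m)
            for n in range(1, n_formants + 1)]

# Lengths taken from the abstract: 17 cm (normal) and 13 cm (resonance chamber, case 1).
normal = tube_formants(0.17)
shortened = tube_formants(0.13)
```

Under these assumptions the 17 cm tube resonates near 515, 1544, and 2574 Hz, and the 13 cm tube gives higher values for every resonance, which illustrates why a shortened resonance chamber shifts formants upward.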

비중격 성형술 및 하비잡개 절제술 후 비개존도 측정을 위한 Nasometer와 제1포만트 측정의 유용성 (Significance of Nasometer and First Formant for Nasal Patency After Septoplasty and Turbinoplasty)

  • 진성민;강현국;이경철;박상욱;이성채;이용배
    • 대한후두음성언어의학회지 / Vol. 8, No. 2 / pp.161-165 / 1997
  • Background : Rhinomanometry and acoustic rhinometry can assess the nasal passage dynamically and statically. Recently, analytic methods such as the nasometer and the sound spectrogram have been gaining wide attention as objective measures of nasality. Objectives : first, to determine whether there was a relationship between the new methods and nasal airway resistance, and second, to establish whether the measurement of nasalance and sound spectrum could be used as an alternative to rhinomanometry and acoustic rhinometry. Materials and Methods : Thirty-two patients who underwent septoplasty and turbinectomy for nasal obstruction were studied. Their ages ranged from 15 to 45 years, with an average of 26.1 years. Rhinomanometry, nasometer, and sound spectrogram measurements were performed preoperatively and 4 weeks postoperatively. Results : After the operation, subjective symptoms and rhinomanometric results were significantly improved, but the nasalance and slope of the nana, mama, and mamma passages showed no meaningful change. Significant changes were noted in the nasalance and first nasal formant frequency of the velar nasal consonant (angang). Conclusion : The nasometer and sound spectrogram had limitations for the measurement of nasal patency.

중국인 학습자의 한국어 발음 오류에 대한 음성 신호 파라미터들의 비교 연구 - 한국어의 /ㄹ/ 발음을 중심으로 (A Comparison Study on the Speech Signal Parameters for Chinese Leaners' Korean Pronunciation Errors - Focused on Korean /ㄹ/ Sound)

  • 이강희;유광복;임하영
    • 예술인문사회 융합 멀티미디어 논문지 / Vol. 7, No. 6 / pp.239-246 / 2017
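  • This paper compares the speech signal parameters of Chinese learners with those of native Korean speakers, focusing on the Korean /ㄹ/ sound, on which Chinese learners make many errors. By examining, from a linguistic perspective, the relationship between Korean /ㄹ/, which is realized as a lateral or a flap allophone, and similar sounds in Chinese, we identified why so many errors occur. For the phonetic comparison we used the signal energy, the time-domain waveform, the spectrogram (which allows analysis of frequency components), the pitch (F0) obtained with the autocorrelation function, and the formant frequencies (f1, f2, f3, and f4). The data consisted of prompt words selected through Korean-linguistic analysis, and these were simulated. The energy and spectrogram analyses show that Chinese learners differ considerably from Korean speakers in pronouncing Korean /ㄹ/. Differences also appear in the other speech signal parameters. The parameters compared in this paper are expected to help considerably reduce the errors Chinese speakers make when learning Korean.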
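
The abstract above names the autocorrelation function as the basis for the pitch (F0) estimate. A minimal sketch of that general technique follows. It is not the authors' implementation: the function name `autocorr_pitch`, the 60 to 400 Hz search range, and the synthetic test tone are all assumptions made for illustration.

```python
import math

def autocorr_pitch(signal, sample_rate, f_min=60.0, f_max=400.0):
    """Estimate F0 (Hz) as the lag that maximizes the autocorrelation.

    Only lags corresponding to the assumed range f_min..f_max are searched.
    Returns None if no lag with positive autocorrelation is found.
    """
    lag_min = int(sample_rate / f_max)
    lag_max = min(int(sample_rate / f_min), len(signal) - 1)
    mean = sum(signal) / len(signal)
    x = [s - mean for s in signal]  # remove DC offset before correlating
    best_lag, best_val = None, 0.0
    for lag in range(lag_min, lag_max + 1):
        r = sum(x[i] * x[i + lag] for i in range(len(x) - lag))
        if r > best_val:
            best_lag, best_val = lag, r
    return sample_rate / best_lag if best_lag else None

# Synthetic voiced-like frame: 200 Hz fundamental plus a weaker 400 Hz harmonic.
fs = 8000
frame = [math.sin(2 * math.pi * 200 * n / fs) + 0.5 * math.sin(2 * math.pi * 400 * n / fs)
         for n in range(800)]
f0 = autocorr_pitch(frame, fs)
```

On real speech this would be applied frame by frame to voiced segments only, and the search range would be adjusted to the speakers being analyzed.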

청각장애학생의 영어 발성 주파수별 특징 분석 (Feature analysis of deaf students' English language by frequency)

  • 이근민;박혜정
    • Journal of the Korean Data and Information Science Society / Vol. 25, No. 4 / pp.819-828 / 2014
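  • This paper presents baseline data for developing customized English-learning aids that reflect the characteristics of deaf students' English utterances. To analyze these characteristics, we visited schools for the deaf in Seoul and Daegu and recorded students there, and we analyzed the audio files with Praat, a specialized speech analysis program. Using Praat, we extracted the acoustic feature values used in phonetics and used them to compare the students' English utterances with those of students without disabilities.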

변복조 방식을 이용한 3-채널 EGG 시스템의 개발(I) (Development of 3-Ch EGG System Using Modulation and Demodulation Techniques(I))

  • 김종명;송철규;이명호
    • 대한의용생체공학회:학술대회논문집 / 대한의용생체공학회 1993 Spring Conference / pp.134-135 / 1993
  • The purpose of this research is to develop an EGG system for quantitative assessment of laryngeal function using speech and electroglottographic data. The designed EGG system is a 4-electrode system in which the excitation current is supplied from the 1st to the 4th electrode. The output signals from the 2nd and 3rd electrodes, which are driven at the frequency of the excitation current source, are air-pressure waveforms from the vocal folds. After demodulation, pitch signals are obtained from the waveforms modulated by the excitation current source by passing them through a differentiator that cuts off frequencies below 0.1 Hz. Conventional pitch extraction relies on software processing, but the proposed system is implemented in analog hardware in order to eliminate interference from the low formant frequencies of speech. We will construct a database that discriminates between pathological subjects and control groups for each case. Using the proposed 3-channel EGG system and the LMS algorithm, the distinctive characteristics of laryngeal function in voiced and other regions will be detected from EGG signals and LPC spectra.
