• 제목/요약/키워드: Korean numeric sounds

검색결과 7건 처리시간 0.018초

Praat를 이용한 숫자음의 음향적 분석법 (An acoustical analysis method of numeric sounds by Praat)

  • 양병곤
    • 음성과학
    • /
    • 제7권2호
    • /
    • pp.127-137
    • /
    • 2000
  • This paper presents a macro script to analyze numeric sounds by a speech analysis shareware, Praat, and analyzes those sounds produced by three students who were born and raised in Pusan. Recording was done in a quiet office. To make a meaningful comparison, dynamic time points in relation to the total duration of voicing segments were determined to measure acoustical values. Results showed that a strong correlation coefficient was found between the repetitive production of numeric sounds within and across the speakers. Very high coefficients among diphthongal numbers (0 and 6) which usually show wide formant variation were noticed. This supports that each speaker produced numbers quite coherently. Also, the frequency differences between the three subjects were within a perceptually similar range. To identify a speaker among others may require to find subtle individual differences within this range. Perceptual experiments by synthesized numeric sounds may lead to resolve the issue.

  • PDF

An Acoustical Study on the Syllable Structures of Korean Numeric Sounds

  • Yang, Byung-Gon
    • 음성과학
    • /
    • 제14권1호
    • /
    • pp.137-147
    • /
    • 2007
  • The purpose of this study was to examine the syllable structures of ten Korean numeric sounds produced by ten students. Each sound was normalized by its maximum intensity value and divided into onset, vowel, and coda sections after finding abrupt or visible changes in energy values or cumulative values of lower spectral energy at each pulse point using four Praat scripts. Then, segmental durations and cumulative intensity values of each syllable were obtained to find a statistical summary of the syllable structure. Intensity values at 100 proportional time points were also collected to compare the ten sounds. Results showed as follows: Firstly, there was not much deviation from the grand average duration and intensity for the majority of the sounds except the two diphthongal sounds on which their boundary points varied among the speakers. Secondly, the onset point for the CV or CVC category sounds and the boundary between the vowel and the nasal or lateral sound were easy to identify, which may be automatically traced later. Thirdly, there seems some tradeoff among the sections maintaining the same total duration per each syllable. Further studies on syllables with various onsets or codas would be desirable to make a general statement on the Korean syllable structure.

  • PDF

국어 숫자음의 음절구조에 대한 음향적 분석 (An Acoustical Study on the Syllable Structures of Korean Numeric Sounds)

  • 양병곤
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2007년도 한국음성과학회 공동학술대회 발표논문집
    • /
    • pp.170-172
    • /
    • 2007
  • The purpose of this study was to examine the syllable structures of ten Korean numeric sounds produced by ten subjects of the same age. Each sound was normalized and divided into onset, vowel, and coda sections. Then, acoustical measurements of each syllable were done to compare the ten sounds. Results showed that there was not much deviation from the grand average duration and intensity for the majority of the sounds except the two diphthongal sounds on which their boundary points varied among the speakers. Some syllable boundaries were quite obvious while others were ambiguous. There seemed some tradeoff among the syllable components depending on their acoustic features.

  • PDF

숫자음의 스펙트럼 차이값과 상관계수를 이용한 화자인증 파라미터 연구 (A Study on Speaker Identification Parameter Using Difference and Correlation Coeffieicent of Digit_sound Spectrum)

  • 이후동;강선미;장문수;양병곤
    • 음성과학
    • /
    • 제11권3호
    • /
    • pp.131-142
    • /
    • 2004
  • Speaker identification system basically functions by comparing spectral energy of an individual production model with that of an input signal. This study aimed to develop a new speaker identification system from two parameters from the spectral energy of numeric sounds: difference sum and correlation coefficient. A narrow-band spectrogram yielded more stable spectral energy across time than a wide-band one. In this paper, we collected empirical data from four male speakers and tested the speaker identification system. The subjects produced 18 combinations of three-digit numeric. sounds !en times each. Five productions of each three-digit number were statistically averaged to make a model for each speaker. Then, the remaining five productions were tested on the system. Results showed that when the threshold for the absolute difference sum was set to 1200, all the speakers could not pass the system while everybody could pass if set to 2800. The minimum correlation coefficient to allow all to pass was 0.82 while the coefficient of 0.95 rejected all. Thus, both threshold levels can be adjusted to the need of speaker identification system, which is desirable for further study.

  • PDF

대역별로 여과한 음성 강도의 차이값과 상관계수에 의한 화자확인 연구 (A Study on Speaker Identification by Difference Sum and Correlation Coefficient of Intensity Levels from Band-pass Filtered Sounds)

  • 양병곤
    • 음성과학
    • /
    • 제10권2호
    • /
    • pp.249-258
    • /
    • 2003
  • This study attempted to examine a speaker identification method using difference sum and correlation coefficient determined from a pair of intensity level matrices of band-pass-filtered numeric sounds produced by ten female speakers of similar age and height. Subjects recorded three digit numbers at a quiet room at a sampling rate of 22 kHz on a personal computer. Collected data were band-pass-filtered at five different band ranges. Then, matrices of five intensity levels at 100 proportional time points were obtained. Pearson correlation coefficients and the sum of absolute intensity differences between a pair of given matrices were determined within and across the speakers. Results showed that very high correlation coefficient and small difference sum generally occurred within each speaker but some individual variation was also observed. Thus, the matrix pair with a higher coefficient and a smaller difference sum was averaged to form each individual's model. Comparison among the speakers yielded generally low coefficients and large differences, which suggests successful speaker identification, but among them there were a few cases with very high coefficients and small differences. Future studies will focus on finer band ranges and additional spectral parameters at some peak points of the intensity contour at a low frequency band.

  • PDF

An Analysis of the English l Sound Produced by Korean Students

  • Yang, Byung-Gon
    • 음성과학
    • /
    • 제15권1호
    • /
    • pp.53-62
    • /
    • 2008
  • The purpose of this study was to examine the English l sound in an English short story produced by 16 Korean students in order to determine various allophones of the sound using acoustic visual displays and perceptual judgments. The subjects read the story in a quiet office at normal speed. Each word included the lateral sound in onset or coda positions and before a vowel of the following word. Results showed as follows: Firstly, there was a durational difference between the two major groups. Also the majority of the subjects produced the clear l regardless of the contexts. Some students produced the sound as the Korean flap or the English glide [r]. A few missing cases were also seen. The dark l was mostly produced by the subjects of English majors in coda position with a few cases before a vowel in a phrase. Visual displays using the computer analysis were very helpful in distinguishing lateral variants but sometimes perceptual process would be necessary to judge them in fast and weak production of the target word. Further studies would be desirable to test the discrepancies between the acoustical and perceptual decisions.

  • PDF

구개상의 두께가 한국어 단모음 발음에 미치는 영향에 관한 연구 -컴퓨터를 이용한 선형 예측 분석과 LOG AREA RATIO 분석- (A STUDY OF THE KOREAN SINGLE VOWEL SOUND DISTORTION IN RELATION TO THE PALATAL PLATE THICKNESS -LINEAR PREDICTION CORRELATION AND LOG AREA RATIO ANALYSES BY COMPUTER-)

  • 이정만;최대균;박남수;최부병
    • 대한치과보철학회지
    • /
    • 제26권1호
    • /
    • pp.31-49
    • /
    • 1988
  • This study was performed to investigate the sound distortion following the alternation of the palatal plate thickness, for this study, 3 subjects who were born in Seoul and spoke Seoul dialect were recruited from K university male student population. First, their sounds of /아(a)/, 어(e)/, 오(o)/, 우(u)/, 으($\.{+}$), 이(i)/,에(e)/ without inserting plate were recorded , and then the sounds with palatal plates of different thickness were recorded, respectively. The palatal plates was constructed to cover the alveolar & palatal surfaces of the maxilla with an approximate thickness of 1.0mm, 2.5mm, and thickness of 2.5mm over the alveolar ridge & 1.0mm elsewhere and, named B, C, D-type, in succession. Series of analysis were administered through Computer (16 bit IBM PC/AT) at analyze the sound distortions. These experiments were analyzed by the LPC, Log Area Ratio. The findings led to the following conclusions: 1. Sound distortions were relatively minute in each condition and informations, however, /이(i)/ was the most distorted vowel in all conditions. 2. By and large, sound distortion was large in C, D-types. However, there was no correlation of the distortion rate on the 3 informants, and all tested vowels. 3. It was similar to LPC, Log Area Ratio distortion rates. 4. It was found that the sound distortion wit]1 plate inserted was verified to the numeric value with LPC and Log Area Ratio method.

  • PDF