• Title/Abstract/Keywords: Speech sound


Variations in the perception of lexical pitch accents and the correlations with individuals' autistic traits

  • Lee, Hyunjung
    • Phonetics and Speech Sciences / v.9 no.2 / pp.53-59 / 2017
  • The present study examined whether individual listeners' perceptual variations were associated with their cognitive characteristics as indexed by the Autistic Spectrum Quotient (AQ). The study first investigated the perception of the lexical pitch accent contrast in Kyungsang Korean, which is currently undergoing a sound change, and then tested whether listeners' perceptual variations were correlated with their AQ scores. Eighteen Kyungsang listeners in their 20s participated in a perception experiment in which they identified two contrastive accent words for auditory stimuli that systematically varied in F0 scaling and timing properties; the participants then completed the AQ questionnaire. In the results, the acoustic parameters reported to show reduced phonetic differences across accent contrasts in the younger Kyungsang generation played a reliable role in distinguishing the HH word from HL, suggesting a discrepancy between perception and production in the context of sound change. The study also observed that individuals' perceptual variations were negatively correlated with their AQ subscores. The findings suggest that the sound change might proceed differently in production and perception, with different time courses, and that deviant percepts could be explained by individuals' cognitive measures.

Voice onset time in English and Korean stops with respect to a sound change

  • Kim, Mi-Ryoung
    • Phonetics and Speech Sciences / v.13 no.2 / pp.9-17 / 2021
  • Voice onset time (VOT) is known to be a primary acoustic cue that differentiates voiced from voiceless stops in the world's languages. While much attention has been given to the sound change of Korean stops, little attention has been given to that of English stops. This study examines the VOT of stop consonants produced by English speakers in comparison with Korean speakers to see whether there is any VOT change for English stops and how the effects of stop, place, gender, and individual on VOT differ cross-linguistically. A total of 24 native speakers (11 Americans and 13 Koreans) participated in the experiment. The results showed that, for Korean, the VOT merger of lax and aspirated stops was replicated, and, for English, voiced stops became initially devoiced and voiceless stops became heavily aspirated. English voiceless stops had longer VOT than their Korean counterparts. The results suggest that, like Korean stops, English stops may also be undergoing a sound change. Since this is the first study to report such a change, more convincing evidence is necessary.

The Effects of Air Conditioner Noise on Classroom Acoustics (교실 음향에 대한 에어컨 소음의 영향)

  • Kim, Su-Yeon;Jeon, Jin-Yong
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference / 2005.05a / pp.176-179 / 2005
  • A case study in classroom acoustics was conducted, and the effects of two types of air conditioner (system air conditioner and packaged air conditioner) were investigated. Acoustical measurements were made in two different classrooms. Each classroom has different acoustics, which is reflected in the sound quality of the air conditioner. A mental concentration test was conducted to evaluate the effects of air conditioner noise at different sound pressure levels (dBA). A speech intelligibility test using Korean phonetically balanced words was also planned.


A Study on Automatic Recognition of Korean Speech Using Formant Frequencies

  • 김순협;박규태
    • Proceedings of the Korean Institute of Communication Sciences Conference / 1983.04a / pp.16-17 / 1983
  • In speech signal processing, the ARMA spectral estimation method is used. It has been demonstrated that the ARMA model provides better spectral estimation than the more specialized AR and MA models. Dynamic programming is used to achieve time alignment. Speech sound similarity is defined to be proportional to the distance separating two sounds in a vector space defined by the ARMA model. As a result, a recognition rate of 97.3% is obtained for three speakers.
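
The time alignment by dynamic programming mentioned in this abstract is commonly realized as dynamic time warping (DTW). The following is a minimal sketch under that assumption, not the authors' implementation: the function name `dtw_distance` is hypothetical, and plain tuples stand in for the per-frame feature vectors (the paper derives its vector space from the ARMA model, which is not reproduced here).

```python
import math


def dtw_distance(a, b):
    """Accumulated distance of the best monotonic alignment of a and b.

    a, b: sequences of equal-dimension feature vectors (tuples of floats).
    Assumption: Euclidean frame distance and the standard three-way step.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated distance aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = d + min(
                cost[i - 1][j],      # a advances, b frame is repeated
                cost[i][j - 1],      # b advances, a frame is repeated
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


# Hypothetical one-dimensional "utterances" for illustration only.
template = [(0.0,), (1.0,), (2.0,)]
slow_copy = [(0.0,), (0.0,), (1.0,), (1.0,), (2.0,)]
reversed_copy = [(2.0,), (1.0,), (0.0,)]
print(dtw_distance(template, slow_copy))
print(dtw_distance(template, reversed_copy))
```

A slower rendition of the same pattern aligns at zero cost, while a different pattern accumulates a positive distance, which is what makes the measure usable for template matching across speaking rates.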


Research about auto-segmentation via SVM (SVM을 이용한 자동 음소분할에 관한 연구)

  • 권호민;한학용;김창근;허강인
    • Proceedings of the IEEK Conference / 2003.07e / pp.2220-2223 / 2003
  • In this paper, we used Support Vector Machines (SVMs), a recently proposed learning method classed with artificial neural networks, to divide continuous speech into phonemes (initial, medial, and final sounds), and then performed continuous speech recognition on the result. The decision boundary of each phoneme is determined by an algorithm based on the maximum frequency in a short interval. Recognition is performed with a Continuous Hidden Markov Model (CHMM), and the results were compared with those for phonemes segmented by visual inspection. The experiments confirmed that the proposed SVM-based method is more effective than Gaussian Mixture Models (GMMs) for initial sounds.


Speech Noise Cancellation using Time Adaptive Threshold Value in Wavelet Transform

  • Lee Chul-Hee;Lee Ki-Hoon;Hwang Hyang-Ja;Moon In-Seob;Kim Chong-Kyo
    • Proceedings of the IEEK Conference / summer / pp.244-248 / 2004
  • This paper proposes a new noise cancellation method for speech recognition in noisy environments. We determine a time-adaptive threshold value from the standard deviations of the wavelet coefficients obtained by applying the wavelet transform frame by frame. The threshold is set to the sum of the standard deviation of the cA3 coefficients and the weighted standard deviation of the cD1 coefficients. The cA3 coefficients represent voiced sounds with lower frequency components, and the cD1 coefficients represent unvoiced sounds with higher frequency components. In the experiments, we removed noise after adding white Gaussian noise and colored noise to the original speech. The proposed method improved SNR and MSE more than the wavelet transform and the wavelet packet transform did. In a speech recognition experiment using a noisy speech database, recognition performance improved by 2 to 4%.
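
The per-frame threshold described in this abstract can be sketched as follows. This is an illustration under stated assumptions rather than the authors' method: the abstract does not name the wavelet, the weight, or the thresholding rule, so a Haar wavelet, a weight of 0.5, and soft thresholding are assumed here, and the names `haar_step`, `frame_threshold`, and `soft_threshold` are hypothetical.

```python
import math
import statistics


def haar_step(x):
    """One level of the Haar DWT: returns (approximation, detail)."""
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x) - 1, 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x) - 1, 2)]
    return approx, detail


def frame_threshold(frame, weight=0.5):
    """Threshold for one frame: std(cA3) + weight * std(cD1).

    Assumption: len(frame) is a multiple of 8 so that three levels exist.
    """
    cA1, cD1 = haar_step(frame)   # level 1: cD1 holds the highest band
    cA2, _ = haar_step(cA1)       # level 2
    cA3, _ = haar_step(cA2)       # level 3: cA3 holds the lowest band
    return statistics.pstdev(cA3) + weight * statistics.pstdev(cD1)


def soft_threshold(coeffs, t):
    """Shrink each coefficient toward zero by t (assumed rule)."""
    return [math.copysign(max(abs(c) - t, 0.0), c) for c in coeffs]


# Hypothetical 16-sample frame for illustration only.
frame = [float(i % 5) for i in range(16)]
t = frame_threshold(frame)
_, cD1 = haar_step(frame)
print(t, soft_threshold(cD1, t))
```

Because the threshold is recomputed for every frame, it follows the local signal level over time, which is the "time adaptive" aspect the abstract refers to.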


THE EFFECT OF LINGUAL FRENECTOMY ON PHONATION & TONGUE MOVEMENT (설소대성형술이 발음 및 혀의 운동에 미치는 영향에 관한 연구)

  • Hwang, Sun-Yong;Lee, Sang-Chull;Ryu, Dong-Mok
    • Maxillofacial Plastic and Reconstructive Surgery / v.14 no.1_2 / pp.40-53 / 1992
  • This study aimed to examine the effect of lingual frenectomy on phonation and tongue movement. Almost all patients visiting the department of oral and maxillofacial surgery for treatment of tongue-tie complain of speech problems, and many operations have been performed for this reason, but objective evaluation of the resulting speech change has been lacking. The experimental group consisted of 25 adult males. Fourteen Korean consonants were combined with Korean vowels to produce seventy sounds for speech analysis. Before and after lingual frenectomy, the speech of this group was recorded and then analysed with the Speech Workstation computer software. The lingual frenum and the amount of tongue protrusion were also measured before and after the operation. The results were as follows: 1. The pre-operative length of the lingual frenum was inversely proportional to the pre-operative length of the protruded tongue. 2. The average difference between the pre- and post-operative length of the protruded tongue was about 23 mm. 3. In the comparison of consonant duration, the duration of the fricative consonants (r, s, h) increased post-operatively. 4. In the comparison of vowel formant frequencies, the "i" and "u" sounds changed reliably. 5. There were no reliable speech changes in the other sounds.


Executive function and Korean children's stop production

  • Eun Jong Kong;Hyunjung Lee;Jeffrey J. Holliday
    • Phonetics and Speech Sciences / v.15 no.3 / pp.45-52 / 2023
  • Previous studies have established a role for cognitive differences in explaining variability in speech processing across individuals. In the case of perceptual cue weighting in the context of a sound change, studies have produced conflicting results regarding the relationship between executive function and the use of redundant cues. The current study aimed to explore this relationship in acoustic cue weighting during speech production. Forty-one Korean-speaking children read a list of stop-initial words and completed two tests that assess executive function, i.e., Dimensional Change Card Sorting (DCCS) and digit n-back. Voice onset time (VOT) and fundamental frequency (F0) were measured in each word, and analyses were carried out to determine the extent to which children's executive function predicted their use of both informative and less informative cues to the three pairs comprising the Korean three-way stop laryngeal contrast. No evidence was found for a relationship between cognitive ability and acoustic cue weighting in production, which is at odds with previous, albeit conflicting, results for speech perception. While this result may be due to the lack of task demands in the production task used here, it nevertheless expands the empirical ground upon which future work in this area may proceed.

Coda Sounds Acquisition at Word Medial Position in Three and Four Year Old Children's Spontaneous Speech (자발화에 나타난 3-4세 아동의 어중종성 습득)

  • Woo, Hyekyeong;Kim, Soojin
    • Phonetics and Speech Sciences / v.5 no.3 / pp.73-81 / 2013
  • The coda in the word-medial position plays an important role in speech acquisition. Its accuracy is important as a diagnostic indicator because it is closely related to the degree of disorder. A word-medial coda appears only between two vowels, and the resulting sequence triggers diverse phonological processes. The production difficulty of a word-medial coda also differs according to the initial sound that follows it in the sequence. Accordingly, this study examines how three- and four-year-old children produce word-medial codas in spontaneous speech, taking optional phonological processes into account. Data were collected from 24 children (four groups by age) without speech and language delay. The results of the study are as follows: 1) In terms of manner of articulation, sonorant codas in the word-medial position showed a high production frequency, and in terms of place of articulation, alveolar codas did. When the word-medial coda was followed by an initial sound with the same place of articulation, it showed a high frequency of production. 2) Word-medial codas followed by an initial alveolar stop showed a high error rate. The error patterns were predominantly regressive assimilation. 3) The order of difficulty that children had in producing word-medial codas was $/k^{\neg}/$, $/p^{\neg}/$, /m/, /n/, /ŋ/ and /l/. These results suggest that, when targeting word-medial codas for evaluation, optional phonological processes as well as the following initial sound should be considered. Further studies are needed to determine which word-medial codas should be used for therapeutic purposes.

Perception of sentences varying with prosody pattern, sound intensity, and signal-to-noise ratio (운율 패턴, 강도, 신호대소음비에 따른 문장 지각 변화)

  • Chang, Son-A;Jang, Eunjoo;Jang, Jaejin
    • Phonetics and Speech Sciences / v.9 no.2 / pp.119-124 / 2017
  • This study investigates how the perception of easy sentences varies with prosody pattern, sound intensity, and signal-to-noise ratio (SNR) in young adults in their 20s with normal hearing. The results showed that the presence of a proper prosody pattern in the sentences increased the correct perception rate of the target sentences, and that the lower the intensity and SNR, the lower the sentence perception scores. The results also showed that SNR had a greater effect on the sentence perception scores than sound intensity. Perception scores decreased significantly starting at 15 dB and +3 SNR for the sentences with a prosody pattern, and starting at 18 dB and +6 SNR for the sentences without a prosody pattern, ending in very poor perception scores as sound intensity and SNR decreased further. There was a significant difference in the perception scores for the sentences with a prosody pattern between the 20-year-old group and the group aged 21 or older in several listening conditions of sound intensity and SNR.