• Title/Summary/Keyword: speech sound


A Study on the Sound Effect for Improving Customer's Speech Recognition in the TTS-based Shop Music Broadcasting Service (TTS를 이용한 매장음원방송에서 고객의 인지도 향상을 위한 음향효과 연구)

  • Kang, Sun-Mee;Kim, Hyun-Deuc;Chang, Moon-Soo
    • Phonetics and Speech Sciences / v.1 no.4 / pp.105-109 / 2009
  • This paper describes a method for delivering clear voice announcements using TTS (Text-To-Speech) technology in a shop music broadcasting service. Providing a high-quality TTS sound service for each shop is costly. According to a report on architectural acoustics, room acoustic indices such as reverberation time and early decay time are closely related to the subjective perception of acoustics. Building on this result, applying a sound effect to TTS-generated speech files should help customers recognize voice announcements better. A listening comprehension test showed improvement in almost all parameters when a reverb effect was applied to the TTS sound.
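As a rough illustration of what "applying a reverb effect" to a speech file can mean, the sketch below convolves a dry signal with a synthetic impulse response whose envelope falls by 60 dB over a chosen reverberation time. This is not the authors' procedure, which the abstract does not describe. The function names, the noise-based impulse response, and the `wet_gain` mixing parameter are all assumptions made for illustration.

```python
import math
import random


def synthetic_impulse_response(rt60_s, sample_rate, seed=0):
    """Hypothetical room response: noise under an exponential envelope.

    The envelope amplitude drops by a factor of 1000 (60 dB) at t = rt60_s.
    """
    rng = random.Random(seed)
    n = round(rt60_s * sample_rate)
    decay = math.log(1000.0) / rt60_s  # exp(-decay * rt60_s) == 1/1000
    return [
        rng.uniform(-1.0, 1.0) * math.exp(-decay * i / sample_rate)
        for i in range(n)
    ]


def convolve(signal, kernel):
    """Direct (time-domain) linear convolution; fine for short signals."""
    out = [0.0] * (len(signal) + len(kernel) - 1)
    for i, s in enumerate(signal):
        for j, k in enumerate(kernel):
            out[i + j] += s * k
    return out


def apply_reverb(dry, impulse_response, wet_gain=0.3):
    """Mix the dry signal with a scaled reverberant (convolved) copy."""
    wet = convolve(dry, impulse_response)
    mixed = [wet_gain * w for w in wet]
    for i, d in enumerate(dry):
        mixed[i] += d  # the dry signal occupies the first len(dry) samples
    return mixed
```

For real speech files, an FFT-based convolution would be the practical choice. The direct form is used here only to keep the sketch dependency-free.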


Segment and Word Duration Produced by Preschool Children (학령전기 아동의 분절음 및 단어 길이)

  • Kang, Eunyeong
    • Journal of The Korean Society of Integrative Medicine / v.8 no.4 / pp.291-305 / 2020
  • Purpose: The duration of speech segments reflects children's speech motor development. The purpose of this study was to determine whether segment and word duration vary by age among preschool children. Methods: A total of 60 children aged 4 to 5 years participated in this study. Participants took a picture-naming test to produce single-word speech data. The durations of the word-initial and word-final consonants, the voice onset time of plosives, the duration of the vowel following the initial consonant, and the duration of the word were measured. Results: As age increased, the duration of the initial consonant, the duration of the word, and the voice onset time decreased significantly. The main effects of age, manner of articulation, and place of articulation on the duration of the initial consonant were significant. The durations of nasals and plosives, and of bilabial and alveolar sounds, differed significantly between groups. The main effects of age and vocal type on voice onset time were significant. The main effects of age on the duration of the word-final consonant and on the duration of the vowel were not statistically significant. Conclusion: The results showed that segment and word durations were associated with speech development between 4 and 5 years of age. Accordingly, segment and word duration may serve as acoustic cues, as they reflect speech development and the maturity of speech motor control.

Experimental Phonetic Study of Yanjin Sino-Korean Dialect (연변 조선족 방언 음성의 실험적 연구)

  • Kim, Hyun-Gi
    • Phonetics and Speech Sciences / v.1 no.1 / pp.47-52 / 2009
  • Sino-Korean speech has evolved for geopolitical reasons since 1945. The aim of this study is to collect Yanji dialectal speech and compare it with South Korean dialectal speech. Twenty Yanbian University students participated as informants. Acoustic speech data were analyzed using Multi-Speech (Windows Vista version). Yanji Sino-Korean dialectal speech showed the posterior vowel /ɑ/ and neutralization of the mid vowel /o/ between /o/ and /ɔ/. Lenis stops showed a tendency toward glottalization, based on VOT values. Sibilants showed aspiration following the constriction, and the lateral /l/ was realized as the approximant /r/.


Developing the speech screening test for 4-year-old children and application of Korean speech sound analysis tool (KSAT) (4세 말소리발달 선별검사 개발과 한국어말소리분석도구(Korean Speech Sound Analysis Tool, KSAT)의 활용)

  • Soo-Jin Kim;Ki-Wan Jang;Moon-Soo Chang
    • Phonetics and Speech Sciences / v.16 no.1 / pp.49-55 / 2024
  • This study aims to develop a three-sentence speech screening test to evaluate speech development in 4-year-old children and to provide standards for comparison with peers. Screening tests were conducted on 24 children in each of the first and second halves of age 4. The screening test results showed a correlation of 0.7 with the results of an existing speech disorder evaluation test. We examined whether the two groups of 4-year-olds differed in the phonological development indicators and error patterns obtained through the screening test. The developmental indicators of the children in the second half were higher, but the differences were not statistically significant. The Korean Speech Sound Analysis Tool (KSAT) was used for all analyses, and its automatic analysis results were compared with the clinician's manual analysis. The agreement between the automatic and manual error pattern analyses was 93.63%. The significance of this study is that it reviewed both the speech norms for 4-year-old children on a three-sentence screening test at the elicited-sentence level and the applicability of the KSAT in clinical and research settings.
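One common way to report agreement between an automatic and a manual analysis is simple item-by-item percent agreement. The abstract does not say how its 93.63% figure was computed, so the sketch below is only an assumed interpretation. The function name and the error-pattern labels in the usage note are hypothetical.

```python
def percent_agreement(auto_labels, manual_labels):
    """Share of items (in percent) on which the two analyses give the same label."""
    if len(auto_labels) != len(manual_labels):
        raise ValueError("both analyses must cover the same items")
    if not auto_labels:
        raise ValueError("at least one item is required")
    matches = sum(a == m for a, m in zip(auto_labels, manual_labels))
    return 100.0 * matches / len(auto_labels)
```

For example, with the made-up labels `["stopping", "fronting", "none", "none"]` from the tool and `["stopping", "none", "none", "none"]` from the clinician, three of the four items match, so the function returns 75.0.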

Inter-speaker and intra-speaker variability on sound change in contemporary Korean

  • Kim, Mi-Ryoung
    • Phonetics and Speech Sciences / v.9 no.3 / pp.25-32 / 2017
  • Besides their effect on the f0 contour of the following vowel, Korean stops are undergoing a sound change in which a partial or complete consonantal merger in voice onset time (VOT) is taking place between aspirated and lax stops. Previous studies on sound change have mainly focused on group-normative effects, that is, effects that are representative of the population as a whole. Few systematic quantitative studies of change in adult individuals have been carried out. The current study examines whether the sound change holds for individual speakers. It focuses on inter-speaker and intra-speaker variability in sound change in contemporary Korean. Speech data were collected from thirteen Seoul Korean speakers studying abroad in America. In order to minimize possible effects on speech production, socio-phonetic factors such as age, gender, dialect, speech rate, and L2 exposure period were controlled when recruiting participants. The results showed that, for nine out of thirteen speakers, the consonantal merger is taking place between the aspirated and lax stops in terms of VOT. There were also intra-speaker variations on the merger in three aspects: First, is the consonantal (VOT) merger between the two stops in progress or not? Second, are VOTs for aspirated stops getting shorter or not (i.e., the aspirated-shortening process)? Third, are VOTs for lax stops getting longer or not (i.e., the lax-lengthening process)? The remarkable inter-speaker and intra-speaker variability indicates a synchronous speech sound change of the stop system in contemporary Korean. Some speakers are early adopters or active propagators of sound change whereas others are not. Further study is necessary to see whether the inter-speaker differences exceed intra-speaker differences in sound change.

Comparison of Sound Pressure Level and Speech Intelligibility of Emergency Broadcasting System at Longitudinal Corridor (장방향 복도 공간의 비상방송설비에 대한 음압 레벨과 음성 명료도 비교)

  • Jeong, Jeong-Ho;Lee, Sung-Chan
    • Fire Science and Engineering / v.32 no.4 / pp.42-49 / 2018
  • This study used architectural sound simulation to investigate whether the sound from an emergency broadcasting speaker is clearly transmitted to occupants when loudspeakers are installed at 25 m intervals, in accordance with NFSC 202, in a rectangular hallway. The sound pressure level and speech intelligibility index were analyzed according to changes in building finishing materials. With reflective finishing materials, the sound pressure level satisfied the standard while the speech intelligibility index was low. When sound-absorbing finishing materials were applied, clarity and the speech transmission index improved to a level that occupants could understand, whereas the sound pressure level delivered to occupants decreased in the same space.

A longitudinal study of phonological development in Korean late-talkers (말늦은 아동의 말소리 발달 종단 연구)

  • Kim, Soo-Jin;Lee, Suhyang;Hong, Gyung Hun
    • Phonetics and Speech Sciences / v.9 no.4 / pp.115-122 / 2017
  • This study examines the extent to which late talkers are at risk of delayed phonological development, in order to identify at-risk groups and to find factors affecting delayed phonological development. A group of 1,452 children (51% boys, 49% girls) was recruited from the nationwide Panel Study on Korean Children. The current study collected data from 418 children identified as late talkers (LT) at two time points: at an average age of three (Time 1: expressive vocabulary test) and three years later (Time 2: phonological test). Their phonological outcomes at Time 2 were analyzed and compared with those of a group of 1,056 children with typical language development at age three (NLT: non-late talkers) in terms of the number of incorrect consonants and the speech sound disorder rating scores. LT showed a lower articulation score than NLT, and boys showed a lower score than girls. These findings indicate that late onset of speech and the gender of young children could be potential risk factors for speech sound disorders.

Comparison of the Korean and Chinese Speech Intelligibility with Increasing Sound Absorption in a Classroom (강의실의 실내흡음력 증가에 따른 한국어 및 중국어의 음성요해도 비교)

  • Ding, Wei;Park, Chan-Jae;Haan, Chan-Hoon
    • The Journal of the Acoustical Society of Korea / v.31 no.3 / pp.129-141 / 2012
  • The present study investigates the effects of physical sound clarity (D50, STI) on the subjective speech intelligibility of both Korean and Chinese as sound absorption increases in a classroom. To this end, sound measurements were undertaken in a classroom with and without absorption materials. In addition, speech intelligibility tests were conducted with Korean and Chinese students using their native languages. Both sound clarity and speech intelligibility improved with increasing sound absorption. Chinese speech intelligibility improved more than Korean for the same added sound absorption, which was attributed to differences in the phonetic characteristics of the two languages. Correlation analysis between physical sound clarity and subjective speech intelligibility showed that D50 was highly correlated with Korean (0.696) and Chinese (0.707) intelligibility. STI was also highly correlated with Korean (0.651) and Chinese (0.665) intelligibility. Thus, it can be concluded that D50 and STI have significant correlations with speech intelligibility.
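For readers unfamiliar with D50 ("definition"), it is conventionally the fraction of a room impulse response's energy that arrives within the first 50 ms after the direct sound. The sketch below shows that ratio under the assumption that the impulse response has already been trimmed to start at the direct-sound arrival. The function name is illustrative, and this is not the measurement chain used in the study.

```python
def d50(impulse_response, sample_rate):
    """Early-to-total energy ratio with a 50 ms boundary (0.0 to 1.0).

    Assumes impulse_response[0] is the arrival of the direct sound.
    """
    split = round(0.050 * sample_rate)  # number of samples in the first 50 ms
    energy = [h * h for h in impulse_response]
    total = sum(energy)
    if total == 0.0:
        raise ValueError("impulse response has no energy")
    return sum(energy[:split]) / total
```

Adding absorption shortens the late reverberant tail, which reduces the energy after 50 ms and raises this ratio. That is consistent with the improvement in clarity reported above.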

The Acoustic Characteristics in Women Diver's Soombijil Sound (해녀의 숨비질소리에 대한 음향특징)

  • Han, Ji-Yeon;Park, Hyun-Ja;Jeong, Ok-Ran
    • Proceedings of the KSPS conference / 2007.05a / pp.176-179 / 2007
  • This study examined the acoustic characteristics of women divers' Soombijil sound. A total of 18 women divers participated in this study. Acoustic analysis was performed with Praat. Soombijil sounds were classified into three types according to pitch variation in the beginning, middle, and ending parts. Type I showed an increasing-decreasing-flat shape. Type II showed a flat-flat-increasing shape. Type III showed an increasing-decreasing-increasing shape. The mean duration of the Soombijil sound was 1.48 s. The frequency range was 1591.54 to 4477.13 Hz. FFT analysis showed that energy was concentrated at 500 to 2000 Hz. Types I and II showed two peaks, at 500 Hz and at 1500 to 2000 Hz. Type III had one peak below 500 Hz.
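The spectral peaks mentioned above come from the authors' FFT analysis in Praat. As a generic illustration of how a dominant frequency is read off a spectrum, the sketch below computes a magnitude spectrum with a naive discrete Fourier transform and returns the frequency of the strongest non-DC bin. The function names are hypothetical, and the direct DFT stands in for a real FFT routine only to avoid external dependencies.

```python
import cmath
import math


def magnitude_spectrum(samples):
    """Magnitudes of DFT bins 0..N/2 (naive O(N^2) transform)."""
    n = len(samples)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(samples)))
        for k in range(n // 2 + 1)
    ]


def peak_frequency(samples, sample_rate):
    """Frequency in Hz of the largest-magnitude bin, ignoring the DC bin."""
    mags = magnitude_spectrum(samples)
    k = max(range(1, len(mags)), key=mags.__getitem__)
    return k * sample_rate / len(samples)
```

The frequency resolution is `sample_rate / len(samples)`, so a 160-sample frame at 8000 Hz resolves peaks in 50 Hz steps.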


A Study on the Nature of Sound and the Hearing Mechanism (소리의 특성 및 청지각기능에 대한 고찰)

  • Lee, Jung-Hak;Kim, Jin-Sook
    • Speech Sciences / v.5 no.1 / pp.167-179 / 1999
  • The hearing mechanism is a complicated system. Sound is generated by a source that sends out air pressure or power, which produces sound waves. These waves reach the eardrum, or tympanic membrane, which vibrates at a rate and magnitude proportional to the nature of the sound waves. The tympanic membrane transforms this vibration into mechanical energy in the middle ear, which in turn converts it to hydraulic energy in the fluid of the inner ear. The hydraulic energy stimulates the sensory cells of the inner ear, which send neuroelectrical impulses to the central auditory nervous system. The passive perception of auditory information starts here. The listener attends to the speech sound, differentiates it from background noise, and integrates it with prior experience of similar sounds. The listener then puts all of these aspects of audition into the context of the moment to identify the nature of the sound. This process plays a major role in human communication. This paper provides an overview of the nature and characteristics of sound, the structure and function of the auditory system, and the way in which sound is processed by the auditory system.
