• Title/Abstract/Keywords: Auditory cues

41 search results (processing time: 0.023 s)

Development of a Reading Training Software offering Visual-Auditory Cue for Patients with Motor Speech Disorder

  • 방동혁;전유용;양동권;길세기;권미선;이상민
    • 대한의용생체공학회:의공학회지 / Vol. 29, No. 4 / pp. 307-315 / 2008
  • In this paper, we developed visual-auditory cue software for the reading training of patients with motor speech disorders. Patients can use the visual cues, the auditory cues, or both to practice reading and improve their symptoms. The software provides sentences with visual-auditory cues; the training sentences were composed for modulation training on the advice of speech therapy professionals. To support reading training, we developed two algorithms: the first automatically finds the starting time of the patient's speech, and the second removes the auditory cue that is recorded at the same time as the speech. The speech-onset search was tested with 10 sentences from each of 6 subjects in four kinds of noisy environments, giving an error of $7.042 \pm 8.99$ ms. The auditory-cue cancellation algorithm was tested on one-syllable speech from 6 subjects; the speech recognition rate improved by $25 \pm 9.547$% after the auditory cue was cancelled. The user satisfaction index of the developed program was rated as good.
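
The abstract does not say how the software locates the start of speech. As a rough illustration only, and not the authors' algorithm, one common approach is a short-time energy threshold: split the recording into frames and report the first frame whose RMS energy rises above a fraction of the loudest frame. The function name `find_speech_onset_ms`, the 10 ms frame length, and the 10% threshold are all assumptions chosen for this sketch.

```python
import math


def find_speech_onset_ms(samples, sample_rate, frame_ms=10.0, threshold_ratio=0.1):
    """Return the start time (in ms) of the first frame whose RMS energy
    exceeds threshold_ratio * (largest frame RMS), or None if there is none.

    Hypothetical helper for illustration; not taken from the paper.
    """
    frame_len = max(1, int(sample_rate * frame_ms / 1000.0))
    rms = []
    # Non-overlapping frames; a trailing partial frame is ignored.
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        rms.append(math.sqrt(sum(x * x for x in frame) / frame_len))
    if not rms or max(rms) == 0.0:
        return None  # empty or completely silent recording
    threshold = threshold_ratio * max(rms)
    for index, value in enumerate(rms):
        if value > threshold:
            return index * frame_len * 1000.0 / sample_rate
    return None


# Synthetic check: 50 ms of silence followed by 100 ms of a 200 Hz tone.
sample_rate = 8000
silence = [0.0] * 400
tone = [math.sin(2 * math.pi * 200 * n / sample_rate) for n in range(800)]
onset_ms = find_speech_onset_ms(silence + tone, sample_rate)
```

A fixed relative threshold like this is fragile in the noisy environments the study tested, which is presumably why a dedicated algorithm was developed; the sketch only shows the basic idea of onset detection.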

Effects of Auditory Cues on Gait Initiation in Patients With Parkinson's Disease: A Preliminary Study

  • Kim, Hyeong-Dong
    • 한국전문물리치료학회지 / Vol. 14, No. 4 / pp. 44-49 / 2007
  • The purpose of this study was to investigate the effects of auditory cues in the form of a metronome on gait initiation (GI) in Parkinson's disease (PD). Two patients (mean age: 54 yrs) with idiopathic PD participated in the study. Both patients (Hoehn and Yahr disability score of 2.0) were tested in the "on" state, approximately 1.5 hours after taking their PD medications and while fully responding to them. Subjects first initiated walking at self-selected speeds to determine their cadences. Then, subjects were asked to initiate gait along the walkway while keeping pace with a metronome. The metronome rate (in beats/min) was set at 85% (slow condition), 100% (normal condition) and 115% (fast condition) of each subject's gait cadence. Subjects were able to increase the speed of GI with the faster cadence, but the speed of GI for the slow condition was similar to that of the normal condition. Swing toe-off occurred at 578.3 ms for the fast condition, 709.4 ms for the normal condition and 736.2 ms for the slow condition. The respective times for swing heel-strike were 894.3 ms, 1110.2 ms and 1119.1 ms, and for stance toe-off were 1105.4 ms, 1338.5 ms, and 1343.1 ms. Except for stance unloading, ground reaction forces (GRFs) were greatest for the fast condition and smallest for the slow condition. It appears that PD patients were able to modulate GRFs and temporal events in response to auditory cues to achieve the peak acceleration force of the swing and stance limbs. The findings from this study provide preliminary data that could be used to investigate how PD patients modulate GRFs and temporal events during GI in response to tasks.
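
The three metronome conditions in this abstract are simple percentages of a subject's own cadence. The sketch below shows that arithmetic. The function names and the baseline cadence of 100 steps/min are hypothetical and are not values reported in the study.

```python
def metronome_rates(baseline_cadence, percentages=(85, 100, 115)):
    """Map each percentage to a metronome rate in beats/min.

    baseline_cadence is the subject's self-selected cadence in steps/min.
    """
    return {p: baseline_cadence * p / 100 for p in percentages}


def beat_interval_ms(rate_bpm):
    """Time between consecutive metronome beats, in milliseconds."""
    return 60000 / rate_bpm


# Hypothetical subject with a self-selected cadence of 100 steps/min.
rates = metronome_rates(100.0)
normal_interval = beat_interval_ms(rates[100])
```

With this hypothetical baseline, the slow, normal, and fast conditions correspond to 85, 100, and 115 beats/min.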

Familiarity of Sounds as a Cue of Auditory Distance Perception

  • Min, Yoon-Ki
    • The Journal of the Acoustical Society of Korea / Vol. 19, No. 3E / pp. 19-24 / 2000
  • The present research examined the contribution of sound familiarity to auditory distance perception, while attempting to control for the unavoidable differences in physical characteristics among sounds. Different vocal "styles" ("shouts", "whispers" and "a normal conversation") of a man and a woman were recorded digitally and presented from a stationary loudspeaker to blindfolded listeners in a semi-anechoic chamber. Playback levels were adjusted to remove extraneous sound level cues. The results showed that the shouting voice was judged as farthest, the whispering voice as closest, and the conversational voice as intermediate. The findings suggest that the perception of auditory distance may be affected by past experience (or familiarity).

Acoustic Cues in Spoken French for the Pronunciation Assessment Multimedia System

  • 이은영;송미영
    • 음성과학 / Vol. 12, No. 3 / pp. 185-200 / 2005
  • The objective of this study is to examine acoustic cues in spoken French for pronunciation assessment, which is needed to build the multimedia system. The corpus is composed of simple expressions that cover the French phonological system, including all phonemes. The experiment involved 4 native French speakers (male and female) and 20 Korean speakers, all university students who had studied French for more than two years. We analyzed the recorded data with a spectrograph and measured comparative features numerically. We first computed the mean and deviation for all phonemes, and then chose the features that had a high error frequency and large differences between French and Korean pronunciations. The selected data were simplified and compared. After judging whether each Korean speaker's pronunciation problems were utterance mistakes or mother-tongue interference, in articulatory and auditory terms, we tried to find acoustic features that were as simple as possible. From this experiment, we extracted acoustic cues for the construction of a French pronunciation training system.

A Reading Training Program offering Visual-Auditory Cue with Noise Cancellation Function

  • 방동혁;강현덕;길세기;이상민
    • 재활복지공학회논문지 / Vol. 2, No. 1 / pp. 35-43 / 2009
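  • This paper introduces a reading training program that offers visual-auditory cues and includes a noise cancellation function (hereafter, the program). The program provides training sentences with visual-auditory cues. People with motor speech disorders can use the visual cues and the auditory cues separately or together for reading training. To make the evaluation of training results more convenient, we developed a noise cancellation algorithm. The algorithm removes the noise and the auditory-cue sound that are recorded together with the subject's speech while the subject reads a sentence shown on the computer screen. We also implemented a function that detects the onset time of the first speech sound when the subject begins reading practice. Speech was recorded from 6 adults (3 men, 3 women) in four noise environments (indoor noise, white noise, car interior noise, and babble noise). We measured the error between the actual start time of the recorded speech and the time found by the program, before and after noise cancellation. The time error improved by $4.847 \pm 2.4235$ ms after noise cancellation. The developed program is expected to help with the training and symptom evaluation of people with motor speech disorders.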

SPATIAL EXPLANATIONS OF SPEECH PERCEPTION: A STUDY OF FRICATIVES

  • Choo, Won;Mark Huckvale
    • 대한음성학회:학술대회논문집 / 대한음성학회 conference proceedings, October 1996 / pp. 399-403 / 1996
  • This paper addresses issues of perceptual constancy in speech perception through the use of a spatial metaphor for speech sound identity as opposed to a more conventional characterisation with multiple interacting acoustic cues. This spatial representation leads to a correlation between phonetic, acoustic and auditory analyses of speech sounds which can serve as the basis for a model of speech perception based on the general auditory characteristics of sounds. The correlations between the phonetic, perceptual and auditory spaces of the set of English voiceless fricatives /f θ s ʃ h/ are investigated. The results show that the perception of fricative segments may be explained in terms of a 2-dimensional auditory space in which each segment occupies a region. The dimensions of the space were found to be the frequency of the main spectral peak and the 'peakiness' of spectra. These results support the view that perception of a segment is based on its occupancy of a multi-dimensional parameter space. In this way, final perceptual decisions on segments can be postponed until higher level constraints can also be met.

The Effect of Acoustic Correlates of Domain-initial Strengthening in Lexical Segmentation of English by Native Korean Listeners

  • Kim, Sa-Hyang;Cho, Tae-Hong
    • 말소리와 음성과학 / Vol. 2, No. 3 / pp. 115-124 / 2010
  • The current study investigated the role of acoustic correlates of domain-initial strengthening in lexical segmentation of a non-native language. In a series of cross-modal identity-priming experiments, native Korean listeners heard English auditory stimuli and made lexical decisions on visual targets (i.e., written words). The auditory stimuli contained critical two-word sequences which created temporary lexical ambiguity (e.g., 'mill#company', with the competitor 'milk'). There was either an Intonational Phrase (IP) boundary or a word (Wd) boundary between the two words in the critical sequences. The initial CV of the second word (e.g., [kʌ] in 'company') was spliced from another token of the sequence in IP- or Wd-initial position. The prime words were postboundary words (e.g., company) in Experiment 1, and preboundary words (e.g., mill) in Experiment 2. In both experiments, Korean listeners showed priming effects only in IP contexts, indicating that they can make use of IP boundary cues of English in lexical segmentation of English. The acoustic correlates of domain-initial strengthening were also exploited by Korean listeners, but significant effects were found only for the segmentation of postboundary words. The results therefore indicate that L2 listeners can make use of prosodically driven phonetic detail in lexical segmentation of L2, as long as the direction of those cues is similar in their L1 and L2. The exact use of the cues by Korean listeners was, however, different from that found with native English listeners in Cho, McQueen, and Cox (2007). The differential use of the prosodically driven phonetic cues by the native and non-native listeners is thus discussed.

Low Frequency Perception of Rhythm and Intonation Speech Patterns by Normal Hearing Adults

  • Kim, Young-Sun;Asp, Carl-W.
    • 음성과학 / Vol. 9, No. 1 / pp. 7-16 / 2002
  • This study tested normal-hearing adults' auditory perception of rhythm and intonation patterns using low-frequency speech energy. The results showed that the narrow-band low-frequency zones of 125, 250, or 500 Hz provided the same important rhythm and intonation cues as did the wide-band condition. This suggests that an auditory training strategy that uses low-frequency filters would be effective for structuring or re-structuring the perception of rhythm and intonation patterns. These filters force the client to focus on these patterns, because speech intelligibility is drastically reduced. This strategy can be used with both normal-hearing and hearing-impaired children and adults with poor listening skills, and possibly poor speech intelligibility.

The Effect of Visual Cues in the Identification of the English Consonants /b/ and /v/ by Native Korean Speakers

  • 김윤현;고성룡
    • 말소리와 음성과학 / Vol. 4, No. 3 / pp. 25-30 / 2012
  • This study investigated whether native Korean listeners could use visual cues for the identification of the English consonants /b/ and /v/. Both auditory and audiovisual tokens of word minimal pairs, in which the target phonemes were located in word-initial or word-medial position, were used. Participants were instructed to decide which consonant they heard in $2 \times 2$ conditions: cue (audio-only, audiovisual) and location (word-initial, word-medial). Mean identification scores were significantly higher for the audiovisual than the audio-only condition and for the word-initial than the word-medial condition. In addition, following signal detection theory, sensitivity (d') and response bias (c) were calculated from hit rates and false alarm rates. These measures showed that the higher identification rate in the audiovisual condition was related to an increase in sensitivity. There were no significant differences in response bias across conditions. This result suggests that native Korean speakers can use visual cues when identifying confusable non-native phonemic contrasts. Visual cues can enhance non-native speech perception.
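
The abstract reports d' and c without stating the formulas. Under the standard equal-variance Gaussian model of signal detection theory they are d' = z(H) - z(F) and c = -(z(H) + z(F)) / 2, where H is the hit rate, F the false alarm rate, and z the inverse of the standard normal CDF. The sketch below assumes that model; the function name and the example rates are hypothetical and are not data from the study.

```python
from statistics import NormalDist


def sensitivity_and_bias(hit_rate, false_alarm_rate):
    """Return (d_prime, criterion_c) under the equal-variance Gaussian model.

    Both rates must lie strictly between 0 and 1; NormalDist.inv_cdf raises
    StatisticsError otherwise, so rates of exactly 0 or 1 need a correction
    (not shown here) before calling this function.
    """
    z = NormalDist().inv_cdf  # inverse CDF of the standard normal
    z_hit = z(hit_rate)
    z_fa = z(false_alarm_rate)
    d_prime = z_hit - z_fa
    criterion_c = -0.5 * (z_hit + z_fa)
    return d_prime, criterion_c


# Hypothetical rates for illustration only.
d_prime, criterion_c = sensitivity_and_bias(0.8, 0.2)
```

For these symmetric example rates, d' is about 1.68 and c is essentially zero, meaning no bias toward either response. A higher d' with unchanged c is the pattern the abstract describes for the audiovisual condition.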

Development of Multiple-modality Psychophysical Scaling System for Evaluating Subjective User Perception of the Participatory Multimedia System

  • 나종관;박민용
    • 대한인간공학회지 / Vol. 23, No. 3 / pp. 89-99 / 2004
  • A comprehensive psychophysical scaling system, the multiple-modality magnitude estimation system (MMES), has been designed to measure subjective multidimensional human perception. Unlike paper-based magnitude estimation systems, the MMES has an additional auditory peripheral cue that varies with the corresponding visual magnitude. As the simplest, purely psychological case, bimodal divided-attention conditions were simulated to establish the superiority of the MMES. Subjects were given brief presentations of pairs of simultaneous stimuli consisting of visual line lengths and auditory white-noise levels. In the visual or auditory focused-attention conditions, subjects reported only the perceived line lengths or only the perceived noise levels, respectively. In the divided-attention conditions, they reported both the line lengths and the noise levels. There were no significant differences among the different attention conditions. Human performance was better when the proportions of magnitude in the stimulus pairs were identical. The additional auditory cues in the MMES improved the correlations between the magnitude of stimuli and MMES values in the divided-attention conditions.