• Title/Summary/Keyword: auditory word recognition

Search Result 21, Processing Time 0.025 seconds

The Phonological and Orthographic activation in Korean Word Recognition(II) (한국어 단어 재인에서의 음운정보와 철자정보의 활성화(II))

  • Choi Wonil;Nam Kichun
    • Proceedings of the KSPS conference
    • /
    • 2003.10a
    • /
    • pp.33-36
    • /
    • 2003
  • Two experiments were conducted to support the suggestion that the same information processing was used in both input modalities, visual and auditory modality in Wonil Choi & Kichun Nam(2003)'s paper. The primed lexical decision task was performed and pseudoword prime stimuli were used. The result was that priming effect did not occur in any experimental condition. This result might be interpreted visual facilitative information and phonological inhibitory information cancelled each other.

  • PDF

Fast Algorithm for Recognition of Korean Isolated Words (한국어 고립단어인식을 위한 고속 알고리즘)

  • 남명우;박규홍;정상국;노승용
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.1
    • /
    • pp.50-55
    • /
    • 2001
  • This paper presents a korean isolated words recognition algorithm which used new endpoint detection method, auditory model, 2D-DCT and new distance measure. Advantages of the proposed algorithm are simple hardware construction and fast recognition time than conventional algorithms. For comparison with conventional algorithm, we used DTW method. At result, we got similar recognition rate for speaker dependent korean isolated words and better it for speaker independent korean isolated words. And recognition time of proposed algorithm was 200 times faster than DTW algorithm. Proposed algorithm had a good result in noise environments too.

  • PDF

The Effect of Acoustic Correlates of Domain-initial Strengthening in Lexical Segmentation of English by Native Korean Listeners

  • Kim, Sa-Hyang;Cho, Tae-Hong
    • Phonetics and Speech Sciences
    • /
    • v.2 no.3
    • /
    • pp.115-124
    • /
    • 2010
  • The current study investigated the role of acoustic correlates of domain-initial strengthening in lexical segmentation of a non-native language. In a series of cross-modal identity-priming experiments, native Korean listeners heard English auditory stimuli and made lexical decision to visual targets (i.e., written words). The auditory stimuli contained critical two word sequences which created temporal lexical ambiguity (e.g., 'mill#company', with the competitor 'milk'). There was either an IP boundary or a word boundary between the two words in the critical sequences. The initial CV of the second word (e.g., [$k_{\Lambda}$] in 'company') was spliced from another token of the sequence in IP- or Wd-initial positions. The prime words were postboundary words (e.g., company) in Experiment 1, and preboundary words (e.g., mill) in Experiment 2. In both experiments, Korean listeners showed priming effects only in IP contexts, indicating that they can make use of IP boundary cues of English in lexical segmentation of English. The acoustic correlates of domain-initial strengthening were also exploited by Korean listeners, but significant effects were found only for the segmentation of postboundary words. The results therefore indicate that L2 listeners can make use of prosodically driven phonetic detail in lexical segmentation of L2, as long as the direction of those cues are similar in their L1 and L2. The exact use of the cues by Korean listeners was, however, different from that found with native English listeners in Cho, McQueen, and Cox (2007). The differential use of the prosodically driven phonetic cues by the native and non-native listeners are thus discussed.

  • PDF

Korean isolated word recognizer using new time alignment method of speech signal (새로운 시간축 정규화 방법을 이용한 한국어 고립단어 인식기)

  • Nam, Myeong-U;Park, Gyu-Hong;No, Seung-Yong
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.38 no.5
    • /
    • pp.567-575
    • /
    • 2001
  • This paper suggests new method to get fixed size parameter from different length of voice signals. The efficiency of speech recognizer is determined by how to compare the similarity(distance of each pattern) of the parameter from voice signal. But the variation of voice signal and the difference of speech speed make it difficult to extract the fixed size parameter from the voice signal. The method suggested in this paper is to normalize the parameter at fixed size by using the 2 dimension DCT(Discrete Cosine Transform) after representing the parameter by spectrogram. To prove validity of the suggested method, parameter extracted from 32 auditory filter-bank(it estimates auditory nerve firing probabilities) is used for the input of neural network after being processed by 2 dimension DCT. And to compare with conventional methods, we used one of conventional methods which solve time alignment problem. The result shows more efficient performance and faster recognition speed in the speaker dependent and independent isolated word recognition than conventional method.

  • PDF

STARTLE AND ORIENTING REFLEX COMPONENTS MODULATION BY ATTENTION TO TASK AND PERFORMANCE OF MENTAL TEST WITH NOISE FOREGROUND (백색소음하에서 단어암기 및 재인검사 수행시의 경악 및 정향반사 특성 : 스트레스/정서연구에의 시사점)

  • Estate Sokhadze
    • Proceedings of the Korean Society for Emotion and Sensibility Conference
    • /
    • 1999.03a
    • /
    • pp.139-145
    • /
    • 1999
  • In current study on 8 college students there was examined modulation of eyeblink (as measured by integrated EMG of m.orbicularis oculi) and skin conductance response (SCR) to an acoustic startle probe (85 dB[A] white noise) by attending to task presented in auditory modality (to memorize words for further recognition) and entire performance of the word recognition test. Both eyeblink magnitude and SCR amplitude and rise time to startle probes were modified (larger magnitude of EMG peak, lower amplitude and shorter rise time of SCR) during attending to task as compared to performance on test. Results are interpreted n terms of modification of electrodermal and eyeblink components of startle and orienting reflexes by task characteristics (passive versus active efforts), attentional demands and aversiveness of experimental situation. However, eyeblink startle response manifested potentiation during attending to task, while SCR demonstrated attenuation. There are discussed implications of startle modulatioas a potentially sensitive probe of situational demands in stress research and also are considered prospects for further studies.

  • PDF

Lexico-semantic interactions during the visual and spoken recognition of homonymous Korean Eojeols (한국어 시·청각 동음동철이의 어절 재인에 나타나는 어휘-의미 상호작용)

  • Kim, Joonwoo;Kang, Kathleen Gwi-Young;Yoo, Doyoung;Jeon, Inseo;Kim, Hyun Kyung;Nam, Hyeomin;Shin, Jiyoung;Nam, Kichun
    • Phonetics and Speech Sciences
    • /
    • v.13 no.1
    • /
    • pp.1-15
    • /
    • 2021
  • The present study investigated the mental representation and processing of an ambiguous word in the bimodal processing system by manipulating the lexical ambiguity of a visually or auditorily presented word. Homonyms (e.g., '물었다') with more than two meanings and control words (e.g., '고통을') with a single meaning were used in the experiments. The lemma frequency of words was manipulated while the relative frequency of multiple meanings of each homonym was balanced. In both experiments using the lexical decision task, a robust frequency effect and a critical interaction of word type by frequency were found. In Experiment 1, spoken homonyms yielded faster latencies relative to control words (i.e., ambiguity advantage) in the low frequency condition, while ambiguity disadvantage was found in the high frequency condition. A similar interactive pattern was found in visually presented homonyms in the subsequent Experiment 2. Taken together, the first key finding is that interdependent lexico-semantic processing can be found both in the visual and auditory processing system, which in turn suggests that semantic processing is not modality dependent, but rather takes place on the basis of general lexical knowledge. The second is that multiple semantic candidates provide facilitative feedback only when the lemma frequency of the word is relatively low.

A Study on the Spectrum Variation of Korean Speech (한국어 음성의 스펙트럼 변화에 관한 연구)

  • Lee Sou-Kil;Song Jeong-Young
    • Journal of Internet Computing and Services
    • /
    • v.6 no.6
    • /
    • pp.179-186
    • /
    • 2005
  • We can extract spectrum of the voices and analyze those, after employing features of frequency that voices have. In the spectrum of the voices monophthongs are thought to be stable, but when a consonant(s) meet a vowel(s) in a syllable or a word, there is a lot of changes. This becomes the biggest obstacle to phoneme speech recognition. In this study, using Mel Cepstrum and Mel Band that count Frequency Band and auditory information, we analyze the spectrums that each and every consonant and vowel has and the changes in the voices reftects auditory features and make it a system. Finally we are going to present the basis that can segment the voices by an unit of phoneme.

  • PDF

The role background noise intensity on Physiological activity during performance of mental task (인지과제 수행시 배경 소음의 크기에 따른 생리적 반응차)

  • Sohn Jin-Hun;Sokhadze Estate M.;Min Yoon-Ki;Lee Kyung-Hwa;Choi Sangsup
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.269-273
    • /
    • 1999
  • Combination of mental stress task with noise background is a traditional tool employed in psychophysiology. However, intensity of background noise is a factor affecting both performance on test and psychophysiological responses associated with stress evoked by mental load in noisy environment. In the current study on 7 subjects we analyzed the influence of white noise (WN) intensity (55, 70, and 85 dB[A] ) on psychophysiological responses during word recognition test performed on noise background. There were recorded following physiological variables: electrodermal activity (EDA) , namely, skin conductance level (SCL), skin conductance response (SCR) amplitude (SCR-A), rise time and total number of SCRs (N-SCR); cardiovascular activity, e.g., heart rate (HR), respiratory sinus arrhythmia (RSA) index, pulse transit time (PTT), finger pulse volume (PV), skin temperature (SKT) and respiratory activity, such as respiration rate (RESP-R) and inspiration wane amplitude (RESP-A) during baseline resting state and 40 s long performance on 3 similar Korean word recognition tests with different WN intensity (55, 70, and 85 dB). Electrodermal responses (SCR-A, SCL, N-SCR) demonstrated gradual increment with increased intensity of noise, and this increase of response magnitude with higher intensity of noise was typical also for r skin temperature (phasic SKT decrease) and pulse volume (phasic and tonic PV decrease). However, some cardiovascular and respiratory responses did not exhibit same tendency of gradual increase of reactivity , namely HR, as well as RESP-R and RESP-A showed decrement of response magnitudes. Important finding in terms of cardiovascular reactivity was that 55 and 70dB evoked similar profiles, while 85dB WN resulted in significantly different profile of reactions, suggesting that there exists a threshold level after which intensive auditory stimulation elicits psychophyslological responses pattern of different quality. There are discussed potential autonomic mechanism involved in mediation of observed physiological responses.

  • PDF

Isolated Word Recognition Using k-clustering Subspace Method and Discriminant Common Vector (k-clustering 부공간 기법과 판별 공통벡터를 이용한 고립단어 인식)

  • Nam, Myung-Woo
    • Journal of the Institute of Electronics Engineers of Korea TE
    • /
    • v.42 no.1
    • /
    • pp.13-20
    • /
    • 2005
  • In this paper, I recognized Korean isolated words using CVEM which is suggested by M. Bilginer et al. CVEM is an algorithm which is easy to extract the common properties from training voice signals and also doesn't need complex calculation. In addition CVEM shows high accuracy in recognition results. But, CVEM has couple of problems which are impossible to use for many training voices and no discriminant information among extracted common vectors. To get the optimal common vectors from certain voice classes, various voices should be used for training. But CVEM is impossible to get continuous high accuracy in recognition because CVEM has a limitation to use many training voices and the absence of discriminant information among common vectors can be the source of critical errors. To solve above problems and improve recognition rate, k-clustering subspace method and DCVEM suggested. And did various experiments using voice signal database made by ETRI to prove the validity of suggested methods. The result of experiments shows improvements in performance. And with proposed methods, all the CVEM problems can be solved with out calculation problem.

Clinical Study for YMG-1, 2's Effects on Learning and Memory Abilities (육미지황탕가감방-1, 2가 학습과 기억능력에 미치는 영향에 관한 임상연구)

  • Park Eun Hye;Chung Myung Suk;Park Chang Bum;Chi Sang Eun;Lee Young Hyurk;Bae Hyun Su;Shin Min Kyu;Kim Hyun taek;Hong Moo Chang
    • Journal of Physiology & Pathology in Korean Medicine
    • /
    • v.16 no.5
    • /
    • pp.976-988
    • /
    • 2002
  • The aim of this study was to examine the memory and attention enhancement effect of YMG-1 and YMG-2, which are modified herbal extracts from Yukmijihwang-tang (YMJ). YMJ, composing six herbal medicine, has been used for restoring the normal functions of the body to consolidate the constitution, nourishing and invigorating the kidney functions for hundreds years in Asian countries. A series of studies reported that YMJ and its components enhance memory retention, protects neuronal cell from reactive oxygen attack and boost immune activities. Recently the microarray analysis suggested that YMG-1 protects neurodegeneration through modulating various neuron specific genes. A total of 55 subjects were divided into three groups according to the treatment of YMG-1 (n=20), YMG-2 (n=20) and control (C; n=15) groups. Before treatments, all of subjects were subjected to the assessments on neuropsychological tests of K-WAIS test, Rey-Kim memory test, and psychophysiological test of Event-Related Potential (ERP) during auditory oddball task and repeated word recognition task. They were repeatedly assessed with the same methods after drug treatment for 6 weeks. Although no significant effect of drug was found in Rey-Kim memory test, a significant interaction (P = .010, P < 0.05) between YMG-2 and C groups was identified in the scores digit span and block design, which are the subscales of K-WAIS. The very similar but marginal interaction (P = .064) between YMG-1 and C groups was found too. In ERP analysis, only YMG-1 group showed decreasing tendency of P300 latency during oddball task while the others tended to increase, and it caused significant interaction between session and group (p= .004). This result implies the enhancement of cognitive function in due to consideration of relationship between P300 latency and the speed of information processing. However, no evidence which could demonstrate the significant drug effect was found in neither amplitude or latency. These results come together suggest that YMG-1, 2 may enhance the attention, resulting in enhancement of memory processing. For elucidating detailed mechanism of YMG on learning and memory, the further studies are necessary.