• Title/Abstract/Keywords: auditory word recognition

Search results: 21 (processing time: 0.022 s)

The Korean Word Length Effect on Auditory Word Recognition

  • 최원일;남기춘
    • 대한음성학회: Conference Proceedings / 대한음성학회 November 2002 Conference Proceedings / pp. 137-140 / 2002
  • This study examined Korean word length effects on auditory word recognition. Linguistically, word length can be defined in terms of several sublexical units, such as letters, phonemes, and syllables. To investigate which units are used in auditory word recognition, a lexical decision task was used. Experiments 1 and 2 showed that syllable length affected response time and that syllable length interacted with word frequency. These results indicate that syllable length is an important variable in auditory word recognition.


The Korean Word Length Effect on Auditory Word Recognition

  • 최원일;남기춘
    • 대한음성학회지: 말소리 / No. 44 / pp. 33-46 / 2002
  • This study examined the effect of word length on auditory word recognition. Word length can be defined in terms of several sublexical units, such as letters, phonemes, and syllables. To find out which sublexical units influence auditory word recognition, an auditory lexical decision task was used. In Experiment 1, we examined the partial correlation between reaction time and the number of sublexical units, and in Experiment 2, we ran an ANOVA to find out which sublexical length variable was influential. From these two experiments, we concluded that syllable length was the most important variable in auditory word recognition.
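
The partial correlation mentioned for Experiment 1 can be illustrated with a minimal sketch. The abstract does not say which variables were controlled or how the analysis was run, so the function names, the use of a single control variable, and the toy numbers below are assumptions for illustration only, not the authors' data or procedure.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def partial_corr(x, y, z):
    """First-order partial correlation of x and y, controlling for z."""
    r_xy, r_xz, r_yz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

# Hypothetical toy data (not taken from the study).
syllable_count = [1, 2, 3, 4]
reaction_time_ms = [520.0, 560.0, 600.0, 640.0]
control = [3, 2, 2, 3]  # hypothetical control variable

r = partial_corr(syllable_count, reaction_time_ms, control)
```

In this toy case the control variable is uncorrelated with both other variables, so the partial correlation equals the plain Pearson correlation. With real stimuli, length measures such as letters, phonemes, and syllables are usually correlated, which is why partialling out is informative.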


Implementation of the Auditory Sense for the Smart Robot: Speaker/Speech Recognition

  • 조현;김경호;박영진
    • 한국소음진동공학회: Conference Proceedings / 한국소음진동공학회 Spring 2007 Conference Proceedings / pp. 1074-1079 / 2007
  • We introduce a speech/speaker recognition algorithm for isolated words. In speaker verification, a Gaussian Mixture Model (GMM) is generally used to model the feature vectors of reference speech signals. Dynamic Time Warping (DTW) based template matching, on the other hand, was proposed for isolated word recognition several years ago. We combine these two concepts in a single method and implement it in a real-time speaker/speech recognition system. With the proposed method, a small number of reference utterances (5 or 6 training repetitions) is sufficient to build a reference model that reaches 90% recognition performance.
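
As a rough illustration of the DTW template matching mentioned in this abstract, the sketch below computes a textbook DTW distance and picks the closest stored template. It does not reproduce the authors' combined GMM/DTW method. The function names, the Euclidean local cost, and the one-dimensional toy features are assumptions made for the example.

```python
import math

def dtw_distance(a, b):
    """DTW distance between two sequences of feature vectors."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j]: minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # Euclidean local cost
            cost[i][j] = d + min(cost[i - 1][j],      # a advances, b frame repeats
                                 cost[i][j - 1],      # b advances, a frame repeats
                                 cost[i - 1][j - 1])  # both advance
    return cost[n][m]

def recognize(utterance, templates):
    """Return the label of the template with the smallest DTW distance."""
    return min(templates, key=lambda label: dtw_distance(utterance, templates[label]))

# Hypothetical one-dimensional "features" for two word templates.
templates = {
    "yes": [(0.0,), (1.0,), (2.0,)],
    "no": [(5.0,), (5.0,), (5.0,)],
}
# The same contour as "yes", spoken more slowly at the start.
utterance = [(0.0,), (0.0,), (1.0,), (2.0,)]
best = recognize(utterance, templates)
```

Because DTW allows frames to repeat, the slower utterance still aligns with the "yes" template at zero cost, which is the property that makes template matching tolerant of speaking-rate differences.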


Isolated-Word Speech Recognition in Telephone Environment Using Perceptual Auditory Characteristic

  • 최형기;박기영;김종교
    • 대한전자공학회논문지TE / Vol. 39, No. 2 / pp. 60-65 / 2002
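  • In this paper, we propose GFCC (gammatone filter frequency cepstrum coefficients), a parameter based on auditory characteristics, as a speech feature parameter for improving the speech recognition rate. Recognition experiments were carried out on isolated words obtained over the telephone network. For performance comparison, recognition experiments were also run using MFCC (mel frequency cepstrum coefficients) and LPCC (linear predictive cepstrum coefficient). In addition, for each parameter, recognition experiments were run both with and without CMS (cepstral mean subtraction), introduced as a compensation technique for telephone channel distortion. The experiments showed that recognition using GFCC gave improved results compared with the other parameters.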
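
The CMS step named in this abstract can be sketched in a few lines. A stationary channel acts as a convolution in time, which becomes an additive offset in the cepstral domain, so subtracting the per-coefficient mean over an utterance removes it. The function name and the toy two-coefficient frames below are assumptions for illustration; the paper's actual feature extraction is not shown.

```python
def cepstral_mean_subtraction(frames):
    """Subtract the per-coefficient mean taken over all frames of one utterance."""
    if not frames:
        return []
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    means = [sum(f[k] for f in frames) / n_frames for k in range(n_coeffs)]
    return [[f[k] - means[k] for k in range(n_coeffs)] for f in frames]

# Hypothetical cepstral frames (two coefficients per frame).
clean = [[1.0, 2.0], [3.0, 6.0]]
# The same frames with a constant channel offset of (+0.5, -1.0) added.
distorted = [[1.5, 1.0], [3.5, 5.0]]

clean_cms = cepstral_mean_subtraction(clean)
distorted_cms = cepstral_mean_subtraction(distorted)
```

After CMS, the clean and distorted versions coincide, which shows why the technique compensates for a fixed channel regardless of whether the features are GFCC, MFCC, or LPCC.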

Phonological awareness skills in terms of visual and auditory stimulus and syllable position in typically developing children

  • 최유미;하승희
    • 말소리와 음성과학 / Vol. 9, No. 4 / pp. 123-128 / 2017
  • This study compares performance on a syllable identification task according to stimulus presentation method (auditory or visual) and syllable position. Twenty-two typically developing children (ages 4-6) participated. Three-syllable words were used, and the children identified the first and the final syllable of each word with auditory and visual stimuli. For the auditory presentation, the researcher presented the test word by oral speech only. For the visual presentation, the test words were presented as pictures, and each child was asked to choose the appropriate pictures for the task. The results showed that phonological awareness performance was significantly higher when the tasks were presented visually than when they were presented with auditory stimuli. Performance on first-syllable identification was also significantly higher than on last-syllable identification. When phonological awareness tasks are presented with auditory stimuli, it is necessary to go through all the steps of the speech production process. Phonological awareness performance with auditory stimuli may therefore be low because of weakness at other stages of the speech production process. When phonological awareness tasks are presented with picture stimuli, they can be performed directly at the phonological representation stage without going through peripheral auditory processing, phonological recognition, and motor programming. This study suggests that phonological awareness skills can differ depending on the method of stimulus presentation and the syllable position of the tasks. Comparing performance between visual and auditory stimulus tasks will help identify where children may show weakness and vulnerability in the speech production process.

The Design of Keyword Spotting System based on Auditory Phonetical Knowledge-Based Phonetic Value Classification

  • 김학진;김순협
    • 정보처리학회논문지B / Vol. 10B, No. 2 / pp. 169-178 / 2003
  • This study addresses two issues: the classification of phone-like units (PLUs), which are the foundation of Korean large-vocabulary speech recognition, and the effectiveness of Chiljongseong (7 final consonants) and Paljongseong (8 final consonants) in the Korean language. Phone-like units classify phonemes phonetically according to place and manner of articulation, and about 50 phone-like units are used in Korean speech recognition. In this study, auditory phonetic knowledge was applied to the classification of phone-like units to yield 45 units. The vowels 'ㅔ, ㅐ' were classified as the phone-like unit [e]; 'ㅒ, ㅖ' as [ye]; and 'ㅚ, ㅙ, ㅞ' as [we]. Secondly, the Chiljongseong system of the Draft for a Unified Spelling System, which is currently in use, and the Paljongseonggajokyong of the Haerye of the Korean script were illustrated. Whether 'ㄷ' and 'ㅅ', among the phonemes used as final consonants in the Korean language, have the same phonetic value has long been debated in academia. In this study, the transition stages of Korean consonants were investigated, Chiljongseong and Paljongseonggajokyong were used in speech recognition, and their effectiveness was verified. The experiment was divided into isolated word recognition and continuous speech recognition. PBW452 was used to test isolated word recognition: about 50 men and women, divided into 5 groups, uttered 50 words each. For the continuous speech recognition experiment, intended for use in an implemented stock exchange system, a sentence corpus of 71 stock exchange sentences and a speech corpus of those sentences were collected; 5 men and women each uttered a sentence twice. As a result, when Paljongseonggajokyong was used for the final consonants, recognition performance improved by an average of about 1.45%; when phone-like units with both Paljongseonggajokyong and auditory phonetic knowledge applied were used, the recognition rate increased by an average of 1.5% to 2.02%. In the continuous speech recognition experiment, recognition performance improved by an average of about 1% to 2% compared with the existing 49 or 56 phone-like units.

The text-to-speech system assessment based on word frequency and word regularity effects

  • 남기춘;최원일;이동훈;구민모;김종진
    • 대한음성학회: Conference Proceedings / 대한음성학회 November 2002 Conference Proceedings / pp. 105-108 / 2002
  • In the present study, the intelligibility of synthesized speech sounds was evaluated using psycholinguistic and fMRI techniques. To examine the difference in word recognition between natural and synthesized speech sounds, word regularity and word frequency were varied. The results of Experiments 1 and 2 showed that the intelligibility difference of the synthesized speech comes from word regularity. There was smaller activation of the auditory areas in the brain and slower recognition time for the regular words.


Glottal Weighted Cepstrum for Robust Speech Recognition

  • 전선도;강철호
    • 한국음향학회지 / Vol. 18, No. 5 / pp. 78-82 / 1999
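  • This study concerns the weighted cepstrum, which is widely used as a noise-robust speech parameter. In particular, we propose extracting the cepstrum from PLP (Perceptual Linear Predictive), an auditory model, and then using the shape of an asymmetric glottal pulse waveform as the weighting function. We also analyzed this weighted cepstrum in relation to the vocal tract waveform and cepstrum of the vocal tract model. By weighting the cepstrum of the PLP auditory model, we obtained a speech parameter that applies both the auditory model and the vocal tract model. To evaluate the method, isolated word recognition experiments were carried out in in-car noise and street noise environments, and the results were compared with the conventional weighted window cepstrum based on LP (Linear Prediction) and the weighted liftering cepstrum based on PLP. The simulation results show that the proposed glottal weighted cepstrum yields a higher recognition rate than the conventional weighted cepstra.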


A Study on Development of a Hearing Impairment Simulator considering Frequency Selectivity and Asymmetrical Auditory Filter of the Hearing Impaired

  • 주상익;강현덕;송영록;이상민
    • 전기학회논문지 / Vol. 59, No. 4 / pp. 831-840 / 2010
  • In this paper, we propose a hearing impairment simulator that considers the reduced frequency selectivity and asymmetrical auditory filters of the hearing impaired, and we verify through experiments that reduced frequency selectivity and asymmetrical auditory filters affect speech perception. The reduced frequency selectivity was implemented by spectral smearing using LPC (linear prediction coding). The shapes of the auditory filters are asymmetrical and differ with each center frequency. The auditory filters of a person with hearing loss differ from those of normal-hearing people and yield different speech quality values for speech passed through them. The experiments comprised subjective and objective tests. The subjective experiments consisted of 4 kinds of tests: a pure tone test, an SRT (speech reception threshold) test, a WRS (word recognition score) test without spectral smearing, and a WRS test with spectral smearing. The hearing impairment simulator experiment was performed with 9 subjects with normal hearing. The amount of spectral smearing was controlled by the LPC order. The asymmetrical auditory filter of the proposed simulator was simulated, and tests were then performed to estimate the filter's performance objectively. The objective experiment evaluated the simulated auditory filter's performance using PESQ (perceptual evaluation of speech quality) and LLR (log likelihood ratio) on speech passed through the auditory filter. The objective speech quality and distortion of the processed speech were evaluated using the PESQ and LLR values. When hearing loss was processed, the PESQ and LLR values differed greatly depending on the asymmetrical auditory filter in the hearing impairment simulator.

Case Study of Auditory Training for the Acquired Hearing loss Adult with Cochlear Implant

  • 홍하나
    • 재활복지 / Vol. 17, No. 4 / pp. 371-382 / 2013
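  • With the recent expansion of health insurance coverage for cochlear implant surgery, the number of implant recipients has increased. About 3,300 patients received cochlear implant surgery in the most recent 6 years, between 2005 and 2009, and the number of adult implants among them is increasing. Young children actively receive auditory training after cochlear implantation and there is much related research, but there has been little research on post-implantation auditory training for adults. In this study, one adult woman (age 54) who received a cochlear implant after language acquisition underwent 10 weeks of auditory training using the Ling 6 sound test, standardized consonant and vowel listening tests, a sentence test, and an assessment tool for the recognition and identification of environmental sounds and words needed in daily life. After the 10 weeks of auditory training, the participant identified all the phonemes of the Ling 6 sounds and showed close to 100% performance on the standardized consonant-vowel and sentence listening tests. In addition, recognition and identification of everyday environmental sounds and words improved from 57% to 95%. The results show that auditory training for adults requires systematic and effective planning and a rehabilitation program that takes individual characteristics into account.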