• 제목/요약/키워드: speechTool

검색결과 155건 처리시간 0.024초

Central Auditory Processing Tests as Diagnostic Tools for the Early Identification of Elderly Individuals with Mild Cognitive Impairment

  • Jalaei, Bahram;Valadbeigi, Ayub;Panahi, Rasool;Nahrani, Morteza Hamidi;Arefi, Hossein Namvar;Zia, Maryam;Ranjbar, Nastaran
    • 대한청각학회지
    • /
    • 제23권2호
    • /
    • pp.83-88
    • /
    • 2019
  • Background and Objectives: Mild cognitive impairment (MCI) is a disorder that usually occurs in the elderly, leading to dementia in some progressive cases. The purpose of this study is to examine the utility of central auditory processing tests as early diagnostic tools for identifying the elderly with MCI. Subjects and Methods: This study was conducted on 20 elderly patients with MCI and 20 healthy matched peers. The speech perception ability in a quiet environment and in the presence of background noise and also temporal resolution were assessed by using Speech Perception in Noise (SPIN) and Gap in Noise (GIN) tests, respectively. Results: The results indicated that the ability to understand speech in a quiet environment did not differ significantly between the two groups. However, SPIN at the three signal-to-noise ratios and the temporal resolution scores were significantly different between the two groups (p<0.001). Conclusions: Individuals with MCI appear to have poorer speech comprehension in noise and a lower temporal resolution than those of the same age, but without cognitive defects. Considering the utility of these tests in identifying cognitive problems, we propose that since the GIN test seems to be less influenced by intervening factors, this test can therefore, be a useful tool for the early screening of elderly people with cognitive problems.

인식 단위로서의 한국어 음절에 대한 연구 (A Study on the Korean Syllable As Recognition Unit)

  • 김유진;김회린;정재호
    • 한국음향학회지
    • /
    • 제16권3호
    • /
    • pp.64-72
    • /
    • 1997
  • 본 논문에서는 한국어 대용량 어휘 인식 시스템에 적합한 인식 단위에 대하여 연구 및 실험하였다. 특히 현재 인식 시스템의 인식 단위로 주로 사용되는 음소와 한국어의 특징을 잘 나타내는 음절을 선택하고, 인식 실험을 통해 음절이 한국어 인식 시스템의 인식 단위로서 적합한가를 음소와 비교하였다. 객관적인 비교 인식 실험 결과를 제시하기 위하여 동일한 남성 화자의 음성 데이터를 수집하고, 수작업 음소 경계 및 레이블링 과정을 거친 음성 데이터 베이스를 구축하였다. 또한 각 인식 단위에 동일한 HMM 기반의 훈련 및 인식 알고리즘을 적용하기 위해 Entropic사의 HTK (HMM Tool Kit) 2.0을 사용하였다. 각 인식 단위의 훈련을 위해 5상태 3출력, 8상태 6출력 HMM 모델의 연속 HMM (Continuous HMM)을 적용하였고, PBW 3회분, POW 1회분을 훈련에 사용하고 PBW 1회분을 각 인식 단위로서 인식하는 화자 종속 단어 인식 실험을 구성하였다. 실험 결과 8상태 6출력 모델을 사용한 경우 음소 단위는 95.65%, 음절 단위는 94.41%의 인식률을 나타내었다. 한편 인식 속도에서는 음절이 음소보다 약 25% 빠른 것으로 나타났다.

  • PDF

정상 성인에서 청성유발 피부전위 (Auditory Evoked Skin Potential in Normal Subjects)

  • 허승덕;정동근;서덕준;김광년;김기련;강명구;김리석
    • 음성과학
    • /
    • 제12권2호
    • /
    • pp.81-88
    • /
    • 2005
  • Electrodermal activity(EDA) is a bio-electric signal which occurs at the skin surface during the sweating. EDA reflects the activity of the sympathetic axis of the autonomic nervous system. EDA is associated with the eccrine sweat gland at the palmar and plamar surface. This study was aimed to characterize the relationship between EDA and auditory stimulus intensities. Acoustic stimulus used in this study were 500 Hz, 1 kHz, 2 kHz of narrow band noise, which were representative of speech frequencies in audible range. Stimulus intensity between 90 and 30 dB in 10 dB within dynamic range. After deriving the minimum stimulus intensity(threshold of skin potential) which elicited skin potential, and then the latency and amplitude were derived from waveform of skin potential, each latency and amplitude were compared to stimulus intensity. The waveform of skin potential were recorded stably, and the threshold of skin potential appeared nearly the hearing threshold level of the participant. The latency was decreased and the amplitude was increased according to the increase of the stimulus intensity. These results suggest that auditory evoked skin potential can be applicable to auditory assessment and audiological diagnosis tool.

  • PDF

컴퓨터를 이용한 한국어 발음 검사 (Lee-Kim Test of Korean Articulation Using Multimedia Computer Software)

  • 이현복;김선희;정태충
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 1996년도 10월 학술대회지
    • /
    • pp.421-425
    • /
    • 1996
  • A multimedia Version of ${\ulcorner}Lee-Kim test of Korean Articulation{\lrcorner}$ "Lee-Kim Test of Korean Articulation" consisting of picture test, Sentence test, user's manual and notation, analysis sheets was published in 1990 to serve as a standard tool for testing and analysing the articulation errors of normal and abnormal speakers. It has been found, however that, the picture and sentences test using the printed version of Lee-Kim test of Korean Articulation revealed several limitations, in i, e, a) inefficiency in inducing desirable response from the informants b) lack of concentration and interest on the part of informants c) no consistent way of providing the informant with a clue in case the informant is unfamiliar with the word represented by the picture or the sentence. d) no reliable means for the speech-language pathologist to analyze and evaluate the informant's speech in relation to the standard pronunciation A multimedia version of Lee-Kim Korean articulation Test which features picture and word as well as recorded voice has been developed with a view to eliminating the limitation mentioned above and facilitating the articulation test-with ease and accuracy.

  • PDF

편측 인공와우 이식자의 보청기 사용 (Use of Hearing Aids in Unilateral Cochlear Implantee)

  • 허승덕;김리석;정동근;최아현;고도홍;김현기
    • 음성과학
    • /
    • 제12권4호
    • /
    • pp.197-202
    • /
    • 2005
  • The cochlear implantation(CI) as an useful tool for aural rehabilitation in bilateral severe to profound hearing impairment. However, CI prefer to usually one ear in spite of bilateral hearing impaired. because of the various characteristics of hearing loss, the hearing conservation for the future possibility, and socioeconomic condition of hearing impaired person and their families. The unilateral CI has limitations such as a directional loss, a difficult speech understanding in noise and a neural plasticity. These limitations will be overcome by hearing aid(HA) which is familiar with hearing impairer. but HA fitting for bimodal-binaural hearing are difficult because the difference output characteristic of HA and CI. This study will be confirm realities of use of HA in unilateral cochlear implantee. For this goal, 25(m:f=10:15) child participated who are used to HA for 1 to 17 months. We had telephone interviews with their mother about use of HA, change of auditory performance and own voice. As the results, hearing threshold levels of unimplanted ear, the use of a appropriate HA, implanted and aided hearing threshold level(HTL) are must be considered for successful biomodal-binaural hearing. Especially, implanted and aided HTL should be very useful parameter for a prediction of HA effect and a criterion of selection for bilateral cochlear implantation.

  • PDF

한국어판 음성장애지수와 음성관련 삶의 질의 타당도 및 신뢰도 연구 (Validity and Reliability of Korean-Version of Voice Handicap Index and Voice-Related Quality of Life)

  • 김재옥;임성은;박선영;최성희;최재남;최홍식
    • 음성과학
    • /
    • 제14권3호
    • /
    • pp.111-125
    • /
    • 2007
  • It is important to examine patients' subjective evaluation as well as objective measures and clinician's rating to assess voice disorders. This study aimed to evaluate validity and reliability of Korean-version of Voice Handicap Index (KVHI) and Voice-Related Quality of Life (KVQOL) with 113 adults with voice disorders and 111 normal adults. Content validity was verified by three experienced speech-language pathologists. Concurrent validity was revealed by examining the correlation among KVHI, KVQOL, and Voice Rating Scale as well as item discrimination coefficients. Total scores of KVHI and KVQOL of adults with voice disorders were significantly different from those of normal adults. Test-retest reliability and internal consistencies were significantly high in both KVHI and KVQOL. Correlations among scores of each subscale and total score were also significantly high in each tool. The study revealed that KVHI and KVQOL are suitable tools to be used in clinics and research areas in Korea, which can subjectively evaluate the effects of voice disorders on daily life as well as on quality of life.

  • PDF

국제 음소 기술에 의한 언어에 독립적인 발음사전 생성에 관한 연구 (A Study on the Language Independent Dictionary Creation Using International Phoneticizing Engine Technology)

  • 신좌철;우인성;강흥순;황인수;김석동
    • The Journal of the Acoustical Society of Korea
    • /
    • 제26권1E호
    • /
    • pp.1-7
    • /
    • 2007
  • One result of the trend towards globalization is an increased number of projects that focus on natural language processing. Automatic speech recognition (ASR) technologies, for example, hold great promise in facilitating global communications and collaborations. Unfortunately, to date, most research projects focus on single widely spoken languages. Therefore, the cost to adapt a particular ASR tool for use with other languages is often prohibitive. This work takes a more general approach. We propose an International Phoneticizing Engine (IPE) that interprets input files supplied in our Phonetic Language Identity (PLI) format to build a dictionary. IPE is language independent and rule based. It operates by decomposing the dictionary creation process into a set of well-defined steps. These steps reduce rule conflicts, allow for rule creation by people without linguistics training, and optimize run-time efficiency. Dictionaries created by the IPE can be used with the Sphinx speech recognition system. IPE defines an easy-to-use systematic approach that can lead to internationalization of automatic speech recognition systems.

확률적 문법규칙에 기반한 국어사전의 뜻풀이말 구문분석기 (A Parser of Definitions in Korean Dictionary based on Probabilistic Grammar Rules)

  • 이수광;옥철영
    • 한국정보과학회논문지:소프트웨어및응용
    • /
    • 제28권5호
    • /
    • pp.448-448
    • /
    • 2001
  • The definitions in Korean dictionary not only describe meanings of title, but also include various semantic information such as hypernymy/hyponymy, meronymy/holonymy, polysemy, homonymy, synonymy, antonymy, and semantic features. This paper purposes to implement a parser as the basic tool to acquire automatically the semantic information from the definitions in Korean dictionary. For this purpose, first we constructed the part-of-speech tagged corpus and the tree tagged corpus from the definitions in Korean dictionary. And then we automatically extracted from the corpora the frequency of words which are ambiguous in part-of-speech tag and the grammar rules and their probability based on the statistical method. The parser is a kind of the probabilistic chart parser that uses the extracted data. The frequency of words which are ambiguous in part-of-speech tag and the grammar rules and their probability resolve the noun phrase's structural ambiguity during parsing. The parser uses a grammar factoring, Best-First search, and Viterbi search In order to reduce the number of nodes during parsing and to increase the performance. We experiment with grammar rule's probability, left-to-right parsing, and left-first search. By the experiments, when the parser uses grammar rule's probability and left-first search simultaneously, the result of parsing is most accurate and the recall is 51.74% and the precision is 87.47% on raw corpus.

음성 및 음향분석 프로그램 Praat의 임상적 활용법 (Guidance to the Praat, a Software for Speech and Acoustic Analysis)

  • 성철재
    • 대한후두음성언어의학회지
    • /
    • 제33권2호
    • /
    • pp.64-76
    • /
    • 2022
  • Praat is a useful analysis tool for linguists, engineers, doctors, speech-language pathologits, music majors, and natural scientists. Basic parameters including duration, pitch, energy and perturbation parameters such as jitter and shimmer can be easily measured and manipulated in the sound editor. When a more in-depth analysis is needed, it is recommended to understand the advanced menus of the object window and learn how to use them. Among the object window menus, vowel formant analysis, spectrum analysis, and cepstrum analysis can be cited as useful ones in the clinical field. The spectrum object can be usefully used for voice quality measurement and diagnosis of patients with voice disorders by showing the energy distribution according to frequency axis (domain). A cepstrum object is useful for speech analysis when periodicity of the sound object is not measurable. The low to high ratio obtained from the spectral object and the CPPs measured from the cepstrum object have attracted many researchers, and it has been proven that the CPPs measured in Praat are relatively excellent.

전화 음성의 Segmentation 및 Labeling에 관한 연구 (A Study on the Segmentation and Labeling of telephone-based Speech)

  • 어범석;최갑근;김학진;김순협
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2000년도 제13회 신호처리 합동 학술대회 논문집
    • /
    • pp.803-806
    • /
    • 2000
  • 상용 가능한 대규모 음성인식 시스템의 개발을 위해서는 음성 데이터베이스 구축이 중요한 과제의 하나로써, 많은 시간과 노력이 요구되며 특히 세그멘테이션과 라벨링은 그 노력의 상당부분이 된다. 본 논문은 ARS 주식 거래 시스템에서 사용되는 대용량 음성 DB의 효과적 구축을 위해 세그멘테이션 및 라벨링의 자동화에 대한 연구를 하였다. 본 연구를 위해 20대 성인 남녀를 대상으로 증권거래와 관련한 15개의 문장을 발성하도록 하였으며 Dialogic사의 D/41ESC보드를 장착하고, Window NT4.0 플렛폼에서 음성을 수집하였다. 또한 자동 Segmentation과 labeling은 Aligner를 사용하였으며 수동과 비교하기 위해 CSLU speech Tool Kit을 사용하였고 수작업은 숙련도가 있는 전문가가 하도록 하였다.

  • PDF