• Title/Abstract/Keywords: speechTool

Search results: 155 items (processing time 0.021 s)

A Cepstral Analysis of Breathy Voice with Vocal Fold Paralysis

  • 강영애;성철재
    • 말소리와 음성과학
    • /
    • Vol. 4, No. 2
    • /
    • pp.89-94
    • /
    • 2012
  • The aim of this study is to investigate the usefulness of the parameters CPP (cepstral peak prominence) and LTAS (long-term average spectrum) band energy for the analysis of breathy voice with vocal fold paralysis. Thirty-four female subjects who had vocal fold paralysis after thyroidectomy participated in this study. According to the perceptual judgements of three speech pathologists and one phonetic scholar, subjects were divided into two groups: a breathy voice group (n = 21) and a non-breathy voice group (n = 13). A maximum sustained phonation task was recorded for acoustic analysis. CPP-related (i.e., mean F0, mean CPP, and mean CPPs) and LTAS-related (i.e., minimum, maximum, and mean) parameters were used. An independent samples t-test was conducted. Regarding CPP, there are significant differences in mean CPP and mean CPPs between groups. The values of mean CPP and CPPs in the non-breathy voice group are higher than those in the breathy voice group. The CPP could be regarded as a useful parameter for breathy voice analysis in the clinic. When it comes to LTAS, energy from 0 to 2 kHz is significantly different between groups. The minimum value of the non-breathy group is lower than that of the breathy group, whereas the maximum value of the non-breathy group is higher. The frequency band below 2 kHz seems to be related to breathy voice.
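To make the CPP parameter in this abstract concrete, the following is a minimal sketch of how a cepstral peak prominence value can be computed for a single frame. It is a simplified illustration, not the procedure used in the study: the sample rate (8000 Hz), frame length (512 samples), F0 search range (80-400 Hz), the regression range, and both synthetic test signals are assumptions chosen for the example, and the function names `dft` and `cpp` are hypothetical.

```python
import cmath
import math
import random


def dft(x):
    """Naive O(N^2) DFT; adequate for one short analysis frame."""
    n = len(x)
    twiddle = [cmath.exp(-2j * cmath.pi * m / n) for m in range(n)]
    return [sum(x[t] * twiddle[(k * t) % n] for t in range(n)) for k in range(n)]


def cpp(frame, fs, f0_min=80.0, f0_max=400.0):
    """Return (CPP in dB, F0 estimate in Hz) for one frame (simplified sketch)."""
    n = len(frame)
    # Hann window to limit spectral leakage.
    windowed = [s * 0.5 * (1 - math.cos(2 * math.pi * i / (n - 1)))
                for i, s in enumerate(frame)]
    # Log-magnitude spectrum in dB (small offset avoids log of zero).
    spec_db = [20 * math.log10(abs(v) + 1e-12) for v in dft(windowed)]
    # Real cepstrum: the dB spectrum is real and symmetric, so the real part
    # of its forward DFT divided by n equals its inverse DFT.
    cep = [v.real / n for v in dft(spec_db)]
    # Search the quefrency range (in samples) that matches plausible F0 values.
    qs = list(range(int(fs / f0_max), int(fs / f0_min) + 1))
    peak_q = max(qs, key=lambda q: cep[q])
    # Least-squares line through the cepstrum over the same range (assumed range).
    mean_q = sum(qs) / len(qs)
    mean_c = sum(cep[q] for q in qs) / len(qs)
    slope = (sum((q - mean_q) * (cep[q] - mean_c) for q in qs)
             / sum((q - mean_q) ** 2 for q in qs))
    baseline = mean_c + slope * (peak_q - mean_q)
    # CPP is the height of the cepstral peak above the regression line.
    return cep[peak_q] - baseline, fs / peak_q


# Synthetic signals (assumptions for illustration, not study data):
# a strongly periodic 200 Hz harmonic tone and seeded white noise.
fs, n = 8000, 512
voiced = [sum(math.sin(2 * math.pi * 200 * h * i / fs) / h for h in range(1, 20))
          for i in range(n)]
rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in range(n)]

cpp_voiced, f0_voiced = cpp(voiced, fs)
cpp_noise, _ = cpp(noise, fs)
print(f"periodic: CPP = {cpp_voiced:.2f} dB, F0 = {f0_voiced:.1f} Hz")
print(f"noise:    CPP = {cpp_noise:.2f} dB")
```

A strongly periodic signal produces a tall cepstral peak at the quefrency of its pitch period, while a noisy signal does not, which matches the direction of the group difference reported in the abstract. The smoothed variant (CPPs) additionally averages the cepstrum across time and quefrency before the peak is measured; that step is omitted here.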

PRAAT Software: A Speech Interaction Tool to Analyze Teacher Voices

  • Ella Kidd
    • 융합정보논문지
    • /
    • Vol. 9, No. 9
    • /
    • pp.158-165
    • /
    • 2019
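  • This paper examines the effects of speech interaction within the inner circle of English through the use of speech software technology. Fundamental frequency (F0) was obtained by analyzing the vocal effects of native speakers (aged 30-55) by nationality, age, and gender. The study found that a Caucasian British female (33 years old) and a Caucasian American male (55 years old) produced the most interactive speech. A contributing factor is that the students had experienced a variety of speech styles through their language acquisition studies. The results are consistent with Traunmüller & Eriksson (1995) and earlier studies, which agreed that above-average continuous speech matters most for student participation and interaction.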

Artificial Intelligence for Clinical Research in Voice Disease

  • 석준걸;권택균
    • 대한후두음성언어의학회지
    • /
    • Vol. 33, No. 3
    • /
    • pp.142-155
    • /
    • 2022
  • Diagnosis using voice is non-invasive and can be implemented through various voice recording devices; therefore, it can be used as a screening or diagnostic assistant tool for laryngeal voice disease to help clinicians. The development of artificial intelligence algorithms, such as machine learning, led by the latest deep learning technology, began with binary classification that distinguishes normal from pathological voices; it has since contributed to improving the accuracy of multi-class classification of various types of pathological voices. However, no conclusions that can be applied in the clinical field have yet been reached. Most studies on pathological speech classification have used the sustained short vowel /ah/, which is relatively easier to work with than continuous or running speech. However, continuous speech has the potential to yield more accurate results because additional information can be obtained from the change in the voice signal over time. This review explains terms related to artificial intelligence research and surveys the latest trends in machine learning and deep learning algorithms; furthermore, the latest research results and limitations are introduced to provide future directions for researchers.

A Pilot Study of the Korean Version of Speech Handicap Index (SHI)

  • 박성신;최성희;차원재;홍영혜;정년기;성명훈;하정훈
    • 대한후두음성언어의학회지
    • /
    • Vol. 24, No. 1
    • /
    • pp.41-46
    • /
    • 2013
  • Background & Objectives: The Speech Handicap Index (SHI) is used to assess the speech problems of head and neck cancer patients. The aim of this preliminary study was to evaluate the reliability and validity of the Korean version of the SHI. Materials and Methods: Sixteen patients with oral cavity cancer and 26 normal controls participated in the study. For test-retest reliability, 20 of the 42 subjects completed the Korean version of the SHI again after 2 weeks. The Mann-Whitney U test was used to compare the Korean version of SHI scores between the normal population and the patient group. The relationship between the Korean version of the SHI and diadochokinesis (DDK) was investigated using Spearman correlation coefficients. Results: The Korean version of the SHI showed high internal consistency ($\alpha = 0.99$) and test-retest reliability (Spearman rho 0.98). The mean SHI scores [total (T), speech (S), and psychosocial (P)] in the normal population were 0.8 (T), 0.2 (S), and 0.4 (P), whereas those in the patient group were 58.1 (T), 27.9 (S), and 27.0 (P), respectively. There were significant differences in the total SHI score as well as in all of the sub-SHI scores between the two groups. Moreover, significant correlations between the Korean version of SHI parameters (T, S, P) and sequential motion rate (SMR) were found in the patient group. Conclusion: The Korean version of the SHI was reliable and valid. It can be useful as a supplementary clinical tool for diagnosing and measuring treatment efficacy of speech problems related to oral cavity cancer.
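The two reliability statistics named in this abstract, internal consistency ($\alpha$) and the Spearman rank correlation, can be sketched as follows. This is a minimal illustration using only the Python standard library; the function names are hypothetical, the $\alpha$ is assumed to be Cronbach's alpha (the abstract does not name it), and the small arrays are made-up toy values, not data from the study.

```python
import math
from statistics import variance


def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman_rho(x, y):
    """Spearman rho: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def cronbach_alpha(items):
    """Cronbach's alpha; items is a list of per-item score lists."""
    k = len(items)
    totals = [sum(scores) for scores in zip(*items)]
    item_var_sum = sum(variance(item) for item in items)
    return k / (k - 1) * (1 - item_var_sum / variance(totals))


# Toy values for illustration only (not study data).
scores = [12, 30, 45, 58, 71]       # hypothetical questionnaire totals
rates = [6.1, 5.4, 4.8, 4.0, 3.2]   # hypothetical syllable rates
rho = spearman_rho(scores, rates)

toy_items = [[1, 2, 3, 4], [2, 2, 4, 4], [1, 3, 3, 5]]  # 3 items, 4 respondents
alpha = cronbach_alpha(toy_items)
print(f"rho = {rho:.2f}, alpha = {alpha:.3f}")
```

Because Spearman's rho depends only on rank order, it suits ordinal questionnaire scores such as the SHI, which is presumably why the authors chose it over a Pearson correlation.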

Development of Cone-Shaped Electrode for Promontory Stimulation Electrically Auditory Brainstem Response

  • 허승덕;정동근;강명구;김리석;고도흥
    • 음성과학
    • /
    • Vol. 10, No. 4
    • /
    • pp.181-187
    • /
    • 2003
  • This paper introduces a new zinc-coated copper wire electrode with a coiled cone shape, which has low surface resistance and tolerance to motion artifacts, for promontory stimulation electrically auditory brainstem responses (PSEABR). Auditory brainstem responses (ABR) can be used to predict hearing threshold level with a great deal of accuracy, particularly for a young child who cannot cooperate and for some hearing-impaired individuals who exaggerate a hearing loss for economic compensation. While severe to profound sensorineural hearing losses may not be implemented by auditory potentials, PSEABR has proven to be a useful tool even for some individuals with sensorineural hearing impairment. PSEABR delivers electrical stimuli to the promontory of the cochlea instead of acoustic stimuli. For this reason, PSEABR can be used as an alternative for cochlear implantation, and can also be used for optimal device selection and as neural information for MAP. It was found that the role of the electrode is very important in PSEABR. Even though this cone-shaped electrode was applied in animal experiments, waveforms were well produced by PSEABR. Thus, it was concluded that the cone-shaped electrode is a useful preoperative audiological evaluation tool for deciding the timing of cochlear implantation surgery.

Implementation of a High-Quality Audio Collaboration System Over IP Networks

  • 강진아;김홍국
    • 한국HCI학회:학술대회논문집
    • /
    • HCI Society of Korea 2008 Conference, Part 1
    • /
    • pp.218-223
    • /
    • 2008
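  • This paper implements methods for improving the performance of an audio collaboration system operating over IP networks and verifies their performance. Characteristics of IP networks such as delay (jitter) and packet loss degrade speech and audio quality. To reduce this degradation, and noting that the packet loss rate can be lowered by reducing the packet size, we propose a low-bit-rate audio transmission method using the MPEG-2 AAC audio codec. In addition, to improve speech quality under packet loss during speech transmission, we apply the iLBC (Internet Low-Bitrate Codec) speech codec, which is robust to packet loss, and the ITU-T G.711 packet loss concealment algorithm to the audio collaboration system. The proposed methods were implemented in an audio collaboration system built on RAT (Robust-Audio Tool) [7]. The results confirmed that the MPEG-2 AAC audio codec operating at 256 kbit/s provides quality similar to that of audio transmitted without compression, and that iLBC and the G.711 packet loss concealment algorithm improve sound quality under 2-10% packet loss.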

A Preliminary Study on the Determining Indicatory Factors for Frenulotomy: Maximum Lingual Length-Protrusion of 3-6 Year Old Normal Children with Boley Gauge (Digimatic Caliper®)

  • 최재남;표화영;심현섭;최흥식
    • 음성과학
    • /
    • Vol. 8, No. 3
    • /
    • pp.161-172
    • /
    • 2001
  • Ankyloglossia (tongue-tie) limits movement of the tongue connected with feeding and has adverse impacts on both dental health and speech. For patients with ankyloglossia, surgical intervention is recommended as the primary treatment. This study proposes an efficient tool for determining indicatory factors for frenulotomy by quantifying Maximum Lingual Length-Protrusion (MLL-P) with a Boley gauge and, as a preliminary study, reports measurement results obtained from normal children with the tool. The subjects were 61 normal children, and the distance (MLL-P) between the mandibular central incisor and the tongue tip during tongue protrusion was measured with a Boley gauge (Digimatic Caliper®). The results of this study can be summarized as follows: (1) the mean value of MLL-P (N=61 normal children) was 21.44 mm; (2) the mean value of MLL-P was 20.69 mm in males (N=33) and 21.91 mm in females (N=28), with no statistically significant difference between males and females; (3) the mean value of MLL-P was 19.34 mm, 21.19 mm, 22.33 mm, and 22.61 mm for 3-, 4-, 5-, and 6-year-old children, respectively; and (4) the mean value of MLL-P showed a statistically significant difference between 3- and 5-year-old children and between 3- and 6-year-old children.

Clinical Significance of Auditory Brainstem Response (ABR) in Speech/Language Disorders

  • 오기원;박우생;권순학;김진경;이준화
    • Clinical and Experimental Pediatrics
    • /
    • Vol. 45, No. 10
    • /
    • pp.1199-1203
    • /
    • 2002
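  • Purpose: This study was conducted to minimize disability and support language development in children presenting with speech/language developmental disorder as the chief complaint, by using the auditory brainstem response (ABR) as a screening test to detect and treat the cause early. Methods: ABR was performed in 139 children aged 18 months or older who visited the Department of Pediatrics at Kyungpook National University Hospital with a chief complaint of speech/language developmental disorder during the three years from January 1999 to December 2001, and the children were classified by clinical diagnosis and ABR abnormality. Results: Of the 139 children, 15 (10.8%) showed abnormal ABR findings: 7 with pervasive developmental disorder, 4 with developmental language disorder, and 4 with other diagnoses. Of the children with abnormal findings, 7 had conductive hearing loss and 8 had sensorineural hearing loss. Among children aged 18 months to 7 years with normal ABR, the mean latencies (± standard deviation) of waves I and V were $1.40 \pm 0.13$ and $5.57 \pm 0.26$, respectively, and the I-V interpeak latency was $4.18 \pm 0.24$. Four children with moderate or greater sensorineural hearing loss were fitted with hearing aids and followed up. Conclusion: ABR is considered essential as a screening test for children presenting with speech/language developmental disorder; it allows early detection of sensorineural hearing loss, and early treatment with hearing aids in such cases may help these children's neurological development and social adjustment.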

Vector Quantizer Based Speaker Normalization for Continuous Speech Recognition

  • 신옥근
    • 한국음향학회지
    • /
    • Vol. 23, No. 8
    • /
    • pp.583-589
    • /
    • 2004
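  • We propose a vector quantizer based speaker normalization method for continuous speech recognition (CSR) that does not use acoustic information such as formants. The method improves on a previously proposed speaker normalization method for a simple digit recognizer: a normalized codebook is obtained by iteratively training the vector quantizer while increasing the codebook size, and the codebook is then used to estimate the warping factor of each test speaker. Two phoneme sets, one consisting of vowels only and one consisting of all phonemes including consonants and vowels, were used in experiments on codebook generation and warping factor estimation, and the feature vectors used for training and testing the recognizer were warped with a piecewise-linear warping function corresponding to the estimated warping factor. Phoneme recognition experiments with the TIMIT corpus and the HTK toolkit showed that the proposed method performs comparably to a formant-based warping method.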
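The abstract does not give the exact form of its piecewise-linear warping function. As an illustration, the sketch below uses a two-segment form that is common in vocal tract length normalization: frequencies are scaled by the warping factor up to a break point, and a second linear segment then maps the Nyquist frequency onto itself. The function name, the break point at 85% of Nyquist, and the 8000 Hz Nyquist default (which corresponds to 16 kHz audio such as TIMIT) are assumptions of this example, not details from the paper.

```python
def piecewise_linear_warp(freq_hz, alpha, nyquist_hz=8000.0, break_ratio=0.85):
    """Warp one frequency with a two-segment piecewise-linear function.

    alpha is the speaker's warping factor; break_ratio (an assumed value)
    sets where the second segment starts, as a fraction of Nyquist.
    """
    if not 0.0 <= freq_hz <= nyquist_hz:
        raise ValueError("frequency must lie in [0, nyquist_hz]")
    break_hz = break_ratio * nyquist_hz
    if alpha <= 0 or alpha * break_hz >= nyquist_hz:
        raise ValueError("alpha out of range for this break point")
    if freq_hz <= break_hz:
        # First segment: plain scaling by the warping factor.
        return alpha * freq_hz
    # Second segment: straight line from (break_hz, alpha * break_hz)
    # to (nyquist_hz, nyquist_hz), so the bandwidth is preserved.
    upper_slope = (nyquist_hz - alpha * break_hz) / (nyquist_hz - break_hz)
    return alpha * break_hz + upper_slope * (freq_hz - break_hz)


# Warp a few frequencies for a compressing and an expanding factor.
for alpha in (0.9, 1.0, 1.1):
    warped = [round(piecewise_linear_warp(f, alpha), 1)
              for f in (0.0, 1000.0, 4000.0, 8000.0)]
    print(alpha, warped)
```

Keeping the end points fixed means that every speaker's warped spectrum spans the same frequency range, so warped and unwarped features stay comparable inside one recognizer.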

Central Auditory Processing Tests as Diagnostic Tools for the Early Identification of Elderly Individuals with Mild Cognitive Impairment

  • Jalaei, Bahram;Valadbeigi, Ayub;Panahi, Rasool;Nahrani, Morteza Hamidi;Arefi, Hossein Namvar;Zia, Maryam;Ranjbar, Nastaran
    • Journal of Audiology & Otology
    • /
    • Vol. 23, No. 2
    • /
    • pp.83-88
    • /
    • 2019
  • Background and Objectives: Mild cognitive impairment (MCI) is a disorder that usually occurs in the elderly, leading to dementia in some progressive cases. The purpose of this study is to examine the utility of central auditory processing tests as early diagnostic tools for identifying the elderly with MCI. Subjects and Methods: This study was conducted on 20 elderly patients with MCI and 20 healthy matched peers. Speech perception ability in a quiet environment and in the presence of background noise, as well as temporal resolution, were assessed using the Speech Perception in Noise (SPIN) and Gap in Noise (GIN) tests, respectively. Results: The results indicated that the ability to understand speech in a quiet environment did not differ significantly between the two groups. However, SPIN at the three signal-to-noise ratios and the temporal resolution scores were significantly different between the two groups (p<0.001). Conclusions: Individuals with MCI appear to have poorer speech comprehension in noise and lower temporal resolution than individuals of the same age without cognitive deficits. Considering the utility of these tests in identifying cognitive problems, and since the GIN test seems to be less influenced by intervening factors, we propose that it can be a useful tool for the early screening of elderly people with cognitive problems.