• Title/Abstract/Keywords: speech factors

352 search results, processing time 0.026 seconds

Correlation analysis of linguistic factors in non-native Korean speech and proficiency evaluation

  • 양승희;정민화
    • 말소리와 음성과학
    • /
    • Vol. 9, No. 3
    • /
    • pp.49-56
    • /
    • 2017
  • Much research attention has been directed to identifying how native speakers perceive non-native speakers' oral proficiency. To investigate the generalizability of previous findings, this study examined segmental, phonological, accentual, and temporal correlates of native speakers' evaluation of the proficiency of L2 Korean speech produced by learners of various levels and nationalities. Our experimental results show that proficiency ratings by native speakers correlate significantly not only with rate of speech but also with segmental accuracy. Segmental errors have the highest correlation with the proficiency of L2 Korean speech. We further verified this finding across substitution, deletion, and insertion error rates. Although phonological accuracy was expected to be highly correlated with the proficiency score, it was the least influential measure. Another new finding of this study is that the role of pitch and accent has so far been underemphasized in studies of non-native Korean speech perception. This work will serve as the groundwork for the development of an automatic assessment module in a Korean CAPT system.
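The abstract above reports correlations between native-speaker proficiency ratings and measures such as segmental error rate and speech rate. As a rough illustration of that kind of analysis, here is a minimal sketch of a Pearson correlation. The abstract does not say which correlation statistic the authors used, so the choice of Pearson's r is an assumption, and the data and variable names below are hypothetical, not taken from the paper.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products and sums of squares around the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical per-speaker values, invented purely for illustration.
ratings = [1.0, 2.0, 3.0, 4.0, 5.0]                  # proficiency scores
segmental_error_rate = [0.50, 0.40, 0.30, 0.20, 0.10]  # fraction of segment errors
speech_rate = [2.0, 2.5, 3.5, 3.0, 4.5]              # syllables per second

r_errors = pearson_r(ratings, segmental_error_rate)
r_rate = pearson_r(ratings, speech_rate)
```

With this toy data the error rate falls exactly linearly as the rating rises, so `r_errors` is close to -1, while `r_rate` is positive but weaker. Real ratings would of course be far noisier.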

Speech Recognition of Multi-Syllable Words Using Soft Computing Techniques

  • 이종수;윤지원
    • 정보저장시스템학회논문집
    • /
    • Vol. 6, No. 1
    • /
    • pp.18-24
    • /
    • 2010
  • The performance of speech recognition depends mainly on uncertain factors such as the speaker's condition and environmental effects. The present study deals with the recognition of a number of multi-syllable isolated Korean words using soft computing techniques such as a back-propagation neural network, a fuzzy inference system, and a fuzzy neural network. Feature patterns for the speech recognition are analyzed with thirty 12th-order frames that are normalized by linear predictive coding and cepstra. Using four speech recognizer models, experiments are conducted for both single-speaker and multiple-speaker cases. In this study, the recognizers based on combined fuzzy logic and back-propagation neural network and on the fuzzy neural network show better recognition performance.

SPEECH SYNTHESIS USING LARGE SPEECH DATA-BASE

  • Lee, Kyu-Keon;Mochida, Takemi;Sakurai, Naohiro;Shirai, Katasuhiko
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1994 FIFTH WESTERN PACIFIC REGIONAL ACOUSTICS CONFERENCE SEOUL KOREA
    • /
    • pp.949-956
    • /
    • 1994
  • In this paper, we introduce a new speech synthesis method for arbitrary Japanese and Korean sentences using a natural speech data-base. We also discuss the application of this method to a CAI system. In our synthesis method, a basic sentence and basic accent-phrases are selected from the data-base for a target sentence. The factors for those selections are phrase dependency structure (separation degree), number of morae, type of accent, and phonemic labels. The target pitch pattern and phonemic parameter series are generated using the selected basic units. Because the pitch pattern is generated from patterns extracted directly from real speech, it is expected to be more natural than any pattern estimated by a model. So far, we have examined this method on Japanese sentence speech and confirmed that the synthetic sound preserves human-like features fairly well. We now extend this method to Korean sentence speech synthesis. Furthermore, we are trying to apply this synthesis unit to a CAI system.

Comparisons of Utility of Various Speech Intelligibility Evaluations of Adults with Hearing Impairment

  • 도연지;김수진
    • 음성과학
    • /
    • Vol. 11, No. 4
    • /
    • pp.173-184
    • /
    • 2004
  • This study discusses test methodologies that evaluate the speech intelligibility of hearing-impaired adults using various contexts. Seven adults with severe hearing loss participated in the experiment. The test contexts consisted of 77 pairs of one-syllable words with phonemic contrasts, 30 two-syllable words, and two sentence lists of 12 and 10 sentences. Intelligibility scores across the contexts were significantly correlated, and both the one-syllable words with phonemic contrasts and sentence list 1 showed higher correlations than the other tests. The one-syllable words with phonemic contrasts took longer to test than the others, and selecting the word pairs demanded more effort. However, for identifying segmental difficulties, the one-syllable words with phonemic contrasts were useful because they reflect the segmental factors that contribute to intelligibility.

The Factors Affecting Job Satisfaction in Speech-Language Pathologists

  • Moon, Kyung-Im;Cho, In-Sook;Park, Woong-Sik
    • 한국컴퓨터정보학회논문지
    • /
    • Vol. 24, No. 11
    • /
    • pp.263-270
    • /
    • 2019
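  • This survey study aims to identify the factors that affect job satisfaction among speech-language pathologists and to provide baseline data for developing intervention programs. A structured questionnaire was administered to 145 speech-language pathologists. The mean job satisfaction score was 3.62 points; job satisfaction was positively correlated with self-efficacy and negatively correlated with job stress. Self-efficacy and job stress were found to be the factors affecting job satisfaction, with an explanatory power of 46.8%. The results are expected to serve as useful baseline data for finding ways to raise the job satisfaction of speech-language pathologists and for developing intervention programs.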

Robust Distributed Speech Recognition under noise environment using MESS and EH-VAD

  • 최갑근;김순협
    • 전자공학회논문지CI
    • /
    • Vol. 48, No. 1
    • /
    • pp.101-107
    • /
    • 2011
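  • The factors that most hinder the practical use of speech recognition are background noise and channel distortion. In general, noise degrades the performance of speech recognition systems, which heavily restricts where they can be used. Speech recognition based on DSR (Distributed Speech Recognition) struggles to improve performance because of the same problem. To improve DSR-based recognition rates in noisy environments, this paper presents a design that detects speech segments accurately and removes noise so that noise-robust features can be extracted. The proposed method detects speech segments using entropy and the harmonics of speech, and removes noise using multi-band spectral subtraction. Speech detection based on the entropy of the spectral energy of speech performs well at relatively high SNR (15 dB), but because the threshold between speech and non-speech shifts as the noise environment changes, accurate detection is difficult at low SNR (0 dB). This paper uses the spectral entropy and harmonic components of speech so that speech can be detected accurately even at low SNR (0 dB), and removes noise based on the accurately detected speech segments in order to extract noise-robust features. Experiments showed improved recognition performance under the recognition conditions of the different noise environments.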
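The abstract above relies on the entropy of the spectral energy of a frame as one cue for speech detection. The sketch below shows one common way to compute a normalized spectral entropy for a single frame. It is not the authors' EH-VAD: the harmonic component, the decision threshold, and the multi-band spectral subtraction are all omitted, and the frame length, test signals, and function names are assumptions made for illustration.

```python
import cmath
import math


def power_spectrum(frame):
    """Power of the non-negative-frequency DFT bins (naive O(n^2) DFT)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n)
                  for t in range(n))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def spectral_entropy(frame):
    """Shannon entropy of the normalized power spectrum, scaled to [0, 1]."""
    power = power_spectrum(frame)
    total = sum(power)
    if total == 0.0:
        return 0.0  # silent frame: no spectral distribution to measure
    probs = [p / total for p in power]
    entropy = -sum(p * math.log2(p) for p in probs if p > 0.0)
    return entropy / math.log2(len(probs))


# Hypothetical 64-sample frames, chosen only to show the two extremes.
N = 64
tone_frame = [math.sin(2 * math.pi * 4 * t / N) for t in range(N)]  # one spectral peak
impulse_frame = [1.0] + [0.0] * (N - 1)                             # flat spectrum

tone_entropy = spectral_entropy(tone_frame)
flat_entropy = spectral_entropy(impulse_frame)
```

A frame whose energy is concentrated in a few bins (as with the harmonic peaks of voiced speech) gives a low value, while a flat, noise-like spectrum gives a value near 1. A detector would compare this value against a threshold, which is the quantity the abstract describes as drifting with the noise environment.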

Acoustic Variation Conditioned by Prosody in English Motherese

  • Choi, Han-Sook
    • 말소리와 음성과학
    • /
    • Vol. 2, No. 1
    • /
    • pp.41-50
    • /
    • 2010
  • The current study explores acoustic variation induced by prosodic contexts in different speech styles, with a focus on motherese or child-directed speech (CDS). The patterns of variation in the acoustic expression of voicing contrast in English stops, and the role of prosodic factors in governing such variation, are investigated in CDS. Prosody-induced acoustic strengthening reported from adult-directed speech (ADS) is examined in the speech data directed to infants at the one-word stage. The target consonants are collected from Utterance-initial and -medial positions, with or without focal accent. Overall, CDS shows that the prosodic prominence of constituents under focal accent conditions varies in the acoustic correlates of the stop laryngeal contrasts. The initial position is not found with enhanced acoustic values in the current study, which is similar to the finding from ADS (Choi, 2006; Cole et al., 2007). Individualized statistical results, however, indicate that the effect of accent on acoustic measures is not very robust, compared to the effect of accent in ADS. Enhanced distinctiveness under focal accent is observed from the limited subjects' acoustic measures in CDS. The results indicate dissimilar strategies to mark prosodic structures in different speech styles as well as the consistent prosodic effect across speech styles. The stylistic variation is discussed in relation to the listener under linguistic development in CDS.

Overlapping of /o/ and /u/ in modern Seoul Korean: focusing on speech rate in read speech

  • Igeta, Takako;Hiroya, Sadao;Arai, Takayuki
    • 말소리와 음성과학
    • /
    • Vol. 9, No. 1
    • /
    • pp.1-7
    • /
    • 2017
  • Previous studies have reported on the overlapping of $F_1$ and $F_2$ distributions for the vowels /o/ and /u/ produced by young Korean speakers of the Seoul dialect. It has been suggested that the overlapping of /o/ and /u/ occurs due to sound change. However, few studies have examined whether speech rate influences the overlapping of /o/ and /u/. On the other hand, previous studies have reported that the overlapping of /o/ and /u/ in syllables produced by male speakers is smaller than in those produced by female speakers. Few reports have investigated the overlapping of the two vowels in read speech produced by male speakers. In the current study, we examined whether speech rate affects the overlapping of /o/ and /u/ in read speech by male and female speakers. Read speech produced by twelve young adult native speakers of the Seoul dialect was recorded at three speech rates. For female speakers, discriminant analysis showed that the discriminant rate became lower as the speech rate increased from slow to fast. This indicates that speech rate is one of the factors affecting the overlapping of /o/ and /u/. For male speakers, on the other hand, the discriminant rate was not correlated with speech rate, but the overlapping was larger than that of female speakers in read speech. Moreover, read speech by male speakers was less clear than that by female speakers. This indicates that, for male speakers, the overlapping may be related to unclear speech for sociolinguistic reasons.

Affixation effects on word-final coda deletion in spontaneous Seoul Korean speech

  • Kim, Jungsun
    • 말소리와 음성과학
    • /
    • Vol. 8, No. 4
    • /
    • pp.9-14
    • /
    • 2016
  • This study investigated the patterns of coda deletion in spontaneous Seoul Korean speech. More specifically, the current study focused on three factors in promoting coda deletion, namely, word position, consonant type, and morpheme type. The results revealed that, first, coda deletion frequently occurred when affixes were attached to the ends of words, rather than in affixes in word-internal positions or in roots. Second, alveolar consonants [n] and [l] in the coda positions of high-frequency affixes [nɨn] and [lɨl] were most likely to be deleted. Additionally, regarding affix reduction in the word-final position, all subjects seemed to depend on this articulatory strategy to a similar degree. In sum, the current study found that affixes without primary semantic content in spontaneous speech tend to undergo the process of reduction, favoring the occurrence of specific pronunciation variants.

Review And Challenges In Speech Recognition (ICCAS 2005)

  • Ahmed, M.Masroor;Ahmed, Abdul Manan Bin
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2005 ICCAS
    • /
    • pp.1705-1709
    • /
    • 2005
  • This paper reviews the area of speech recognition and its challenges, taking into account different classes of recognition mode. The recognition mode can be either speaker independent or speaker dependent. The size of the vocabulary and the input mode are two crucial factors for a speech recognizer. The input mode refers to a continuous or isolated speech recognition system, and the vocabulary size can be small (fewer than a hundred words) or large (fewer than a few thousand words). This varies according to system design and objectives [2]. The paper is organized as follows: first it covers various fundamental methods of speech recognition, then it takes into account various deficiencies in the existing systems, and finally it discloses various probable application areas.