• 제목/요약/키워드: expressive speech

검색결과 37건 처리시간 0.028초

A 3D Audio-Visual Animated Agent for Expressive Conversational Question Answering

  • Martin, J.C.;Jacquemin, C.;Pointal, L.;Katz, B.
    • 한국정보컨버전스학회:학술대회논문집
    • /
    • 한국정보컨버전스학회 2008년도 International conference on information convergence
    • /
    • pp.53-56
    • /
    • 2008
  • This paper reports on the ACQA(Animated agent for Conversational Question Answering) project conducted at LIMSI. The aim is to design an expressive animated conversational agent(ACA) for conducting research along two main lines: 1/ perceptual experiments(eg perception of expressivity and 3D movements in both audio and visual channels): 2/ design of human-computer interfaces requiring head models at different resolutions and the integration of the talking head in virtual scenes. The target application of this expressive ACA is a real-time question and answer speech based system developed at LIMSI(RITEL). The architecture of the system is based on distributed modules exchanging messages through a network protocol. The main components of the system are: RITEL a question and answer system searching raw text, which is able to produce a text(the answer) and attitudinal information; this attitudinal information is then processed for delivering expressive tags; the text is converted into phoneme, viseme, and prosodic descriptions. Audio speech is generated by the LIMSI selection-concatenation text-to-speech engine. Visual speech is using MPEG4 keypoint-based animation, and is rendered in real-time by Virtual Choreographer (VirChor), a GPU-based 3D engine. Finally, visual and audio speech is played in a 3D audio and visual scene. The project also puts a lot of effort for realistic visual and audio 3D rendering. A new model of phoneme-dependant human radiation patterns is included in the speech synthesis system, so that the ACA can move in the virtual scene with realistic 3D visual and audio rendering.

  • PDF

구개열 아동의 초기 어휘에 나타난 음운 특성 연구 (Phonological Characteristics of Early Vocabulary in Young Children with Cleft Palate)

  • 하승희
    • 말소리와 음성과학
    • /
    • 제6권2호
    • /
    • pp.65-71
    • /
    • 2014
  • The purpose of this study was to investigate whether young children with cleft palate differ from those of noncleft typically developing children in terms of expressive vocabulary size, phonological characteristics and lexical selectivity. A total of 12 children with cleft palate and 12 noncleft children who were matched by age and gender participated in the study. The groups were compared by size of expressive vocabulary reported on Korean version of MacArthur-Bates Communicative Development Inventories and the number of different words, consonant inventory, the percentage of words beginning with obstruents and vowels, nasal, and glottal sounds, and the percentage of words which do not include obstruents in a language sample. Also, correlation analysis were performed to examine the relationship between measures on size of expressive vocabulary and phonological characteristics. The results showed that expressive vocabulary size and consonant inventory for children with cleft palate produced significantly smaller than those for noncleft children. Children with cleft palate produced significantly more words beginning with vowel or which do not include obstruents, and fewer words beginning with obstruents than noncleft children. The two groups showed different results on significant correlations between measures on size of expressive vocabulary and phonological characteristics indicating that children with cleft palate show different lexical selectivity from their noncleft peers. The results suggest that children with cleft palate aged 18-30 months demonstrate a slower rate of lexical and phonological development compared with their noncleft peers and they develop lexical selectivity reflecting cleft palate speech. The results will have a clinical implication on speech-language intervention for young children with cleft palates.

한국어 구어 실행증 환자에 대한 점진적 8단계 치료 기법의 임상적 효과: 사례연구 (Eight-step Continuum Treatment for Korean Apraxia of Speech Patient: A Case Study)

  • 이무경;정옥란
    • 음성과학
    • /
    • 제12권4호
    • /
    • pp.247-254
    • /
    • 2005
  • This study aimed at clarifing clinical effects of eight-step continuum treatment in a patient who showed apraxia of speech after stroke. The eight-step continuum treatment consisted of 8 steps and its clinical efficacy has been proven with American apraxic patients. However, it has not been clinically proven to be effective in Korean patients with apraxia of speech as of yet. Therefore, this study was conducted in an effort to provide preliminary clinical evidence regarding its effectiveness regardless of the linguistic differences between Korean and English. The therapy took place twice a week for 6 months, a total of 48 times. The results showed that the patient's receptive language was improved from 83% to 89% and 37% in accuracy, and expressive language from 15% to 37%. It seemed that spontaneous recovery did not playa role in his improvement since the study was conducted 2 years after the stroke. In addition, the improvement of expressive language was much greater(22%) than that of receptive language(6%), which implied that the therapy was effective in apraxia of speech because apraxia of speech is relatively confined to expressive ability, more specifically motor programming and sequencing.

  • PDF

구개파열 영유아의 언어발달 특성 연구 (Language Development in Cleft Palate Infants)

  • 김효선;김영태;김석화
    • 음성과학
    • /
    • 제9권3호
    • /
    • pp.121-131
    • /
    • 2002
  • The purpose of the present study was to investigate the characteristics of receptive and expressive language development of cleft palate infants aged under 3. Twenty-six cleft palate infants and 52 normal infants were grouped into 3 chronological age groups with 1 year intervals: less than 1, 1, and 2 years old. The cleft palate infants were divided into 2 groups: cleft palate only, and cleft lip & palate. Each mother of the infants was asked to complete the questionnaire, Sequenced Language Scale for Infants (SELSI). Receptive and Expressive language scores of SELSI were computed. The scores of receptive and expressive language were respectively analyzed into 4 categories of language: phonology/prosody, semantics, syntax, and pragmatics. The results, concerning the differences of language development between the cleft palate and the normal infants, were as follows: (1) expressive language scores were significantly different at age 2 between the cleft palate and the normal infants; (2) cleft plate groups aged less than 1 and 1 showed lower scores of phonology/prosody of expressive language than the normal groups; (3) cleft palate group aged 2 showed lower scores than the normal group in semantics, syntax, and pragmatics. The results, concerning the characteristics of language development between the cleft palate only and the cleft lip & palate infants, were as follows: (1) the scores of expressive and receptive language were not statistically different between the 2 groups; (2) both groups did not show any difference in .the scores of phonology/ prosody, semantics, syntax, and pragmatics of receptive and expressive language.

  • PDF

세 가지 자극 양식이 실어증자의 언어이해력과 언어표현력에 미치는 영향 (The Effects of Three Stimulus Modes on receptive Language Performance and expressive Language Performance in Aphasics.)

  • 이무경;유재연;이옥분;정옥란
    • 음성과학
    • /
    • 제7권3호
    • /
    • pp.263-272
    • /
    • 2000
  • The purpose of this study was to compare receptive language performance and expressive language performance in 13 patients with aphasia by using three stimulus presentation modes: Stimulus mode I (picture), Stimulus mode II (written word), Stimulus m (question using verbal explanation). The stimulus consisted of 10 words. They included 5 functional words and 5 non-functional words. The 13 subjects with aphasia were divided into 2 aphasic types: 5 Broca's aphasics and 8 anomie aphasics. The results were as follows: Firstly, the three stimulus modes didn't affect receptive language performance of aphasia subjects. Secondly; the three stimulus modes were effective on expressive language performance of aphasia subjects. Particularly, stimulus mode II (written words) was effective in naming the aphasia subjects. Thirdly, the functional words with high frequency were better than non-functional words with low frequency on expressive language performance, but not on receptive language performance of aphasia subjects. Finally, the interaction between three stimulus modes and the functional (nonfunctional) words affected expressive language performance only, but not receptive language performance. Particularly, presenting stimulus in written words which are functional words produced the best expressive language performance.

  • PDF

쇼핑호스트의 스피치스타일이 구매의도에 미치는 영향 (The Impact of Speech Style of Shopping Host on Purchase Intention)

  • 윤치영;이성환
    • 한국산학기술학회논문지
    • /
    • 제10권10호
    • /
    • pp.2893-2899
    • /
    • 2009
  • 본 연구는 쇼핑호스트의 스피치스타일을 분류해내고, 분류된 스피치스타일이 쇼핑호스트의 평가와 구매의도에 미치는 영향관계를 파악하고자 하였다. 쇼핑호스트의 스피치스타일이 쇼핑 호스트 평가에 미치는 영향에서 설파형의 고객지향성, 신뢰성, 적격성과 서술형의 적격성을 제외한 다른 스피치스타일은 영향을 미치는 것으로 분석되었다. 특히 가설에서 검증된 스피치스타일과 쇼핑 호스트 평가의 영향관계를 살펴보면 논증형, 표출형, 서술형 순으로 영향력이 나타나고 있다. 쇼핑호스트의 스피치스타일이 구매의도에 미치는 영향에서는 설파형, 서술형을 제외하고 논증형, 표출형에 영향을 미치는 것으로 분석되었다. 즉, 서술적이면서 주도적인 스피치스타일 보다 논리적으로 설득하면서 주어진 내용을 잘 표현하는 스피치스타일을 선호하고 있는 것으로 나타났다.

말소리가 제한된 아동을 위한 말리듬을 이용한 난타 프로그램의 개발과 효과 (Development and effects of Nanta program using speech rhythm for children with limited speech sound production)

  • 박영혜;최성희
    • 말소리와 음성과학
    • /
    • 제13권2호
    • /
    • pp.67-76
    • /
    • 2021
  • 난타는 북과 같은 타악기를 이용한 "두드리기"라는 뜻으로 한국 전통 음악인 사물놀이의 리듬이다. 말소리 산출이 제한된 아이들을 위해 난타 프로그램이 개발되어 적용되었다. 또한, 이 연구는 언어 리듬을 이용한 난타 프로그램의 효과에 대한 증거를 제공한다. 난타 음성 리듬 중재 프로그램은 말리듬을 이용하여 개발되었다. 난타 프로그램은 청각 자극, 다양한 소리와 박자, 리듬을 제공했으며, 리듬과 함께 호흡, 발성, 조음의 세 단계로 구성되어 있다. 말소리 목록이 제한된 6명의 아이들이 이 연구에 참여했다. 아동들에게 소리와 박자를 탐색하고 소리와 박자를 자유롭게 표현하도록 하였다. 또한, 리듬과 함께 단어를 모방하고 모방하는 단어에서 음절의 길이를 늘림으로써 다양한 말소리를 산출하도록 격려하였다. 매 회당 40분 동안 주 2회씩 총 15회의 세션이 실시되었다. 중재 효과를 조사하기 위해 치료 전후 취학전 아동의 수용언어 및 표현언어 발달척도(PRES)와 수용-표현 어휘력 검사(REVT) 점수를 비교하였다. Wilcoxon rank test 결과, 중재 후 PRES에서 수용언어 점수(p=.027)와 표현언어 점수(p=.024) 및 수용어휘력(p=.028)과 표현어휘력 (p=.028) 점수가 유의하게 향상되었음을 보여주었다. 난타 리듬 컨트롤 프로그램은 수용적이고 표현적인 어휘와 언어 발달에 상당한 긍정적인 영향을 미쳤다. 이러한 발견들은 리듬 컨트롤 프로그램이 제한된 음성 소리 생성을 가진 어린이들의 언어 발달과 어휘 향상에 유용할 수 있다는 것을 암시한다.

대구 실어증 진단검사 개발 및 표준화 연구 -신뢰도와 타당도- (The Validity and Reliability of the Daegu Diagnostic Aphasia Examination)

  • 김지채;안종복;이옥분;황영진;정옥란
    • 음성과학
    • /
    • 제12권3호
    • /
    • pp.7-17
    • /
    • 2005
  • This study aimed at investigating the validity and reliability of the Daegu Diagnostic Aphasia Examination (DDAE). The DDAE has been developed to assess aphasics' receptive and expressive language ability. One hundred and forty eight aphasics (96 males and 52 females) diagnosed as aphasics participated in this study. Reliability coefficients showed that the DDAE was highly consistent and accurate (Cronbach's a = .76$\sim$.82). For its' content-validity, a 5-point scale was administered. Four speech and language pathologists served as evaluator. The receptive language mean score was 4.29, the expressive language mean score was 4.09, and the right-hemisphere function mean score was 4.00. For construct validity, the correlation total scores were calculated. The results showed a significant correlation.

  • PDF

생후 12개월 이전에 인공와우이식을 받은 선천성 농 아동의 말지각과 언어능력 (Speech Perception and Language Outcome in Congenitally Deaf Children Receiving Cochlear Implants in the First Year of Life)

  • 정성욱;서지원;부성현;김리석
    • Korean Journal of Otorhinolaryngology-Head and Neck Surgery
    • /
    • 제61권12호
    • /
    • pp.644-649
    • /
    • 2018
  • Background and Objectives The objective of this study was to examine the effect of early cochlear implantation (CI) performed in infants less than 12 months of age. Subjects and Method Twenty-five children who received their first CI before 12 months of age were included in this study (infant group). The speech perception and language outcomes of these children were compared with those of 14 children who received their first CI between 13 and 24 months of age (older group). All children received sequential bilateral CI with the inter-stage interval of less than 2 years. Speech perception was measured using Categories of Auditory Performance, monosyllabic word test and sentence test, and language ability was measured using Sequenced Language Scale for Infants, Preschool Receptive-Expressive Language Scale, or Receptive & Expressive Vocavulary Test, depending on the age at the time of testing. Results There were no significant differences in speech perception abilities between the infant group and the older group. The mean expressive language score of infant group was higher than that of the older group, but the difference was not statistically significant. However, the receptive language score of infant group was significantly higher than that of the older group. Conclusion Children who received CI before 12 months of age achieved better receptive language ability than those who received it after 12 months of age. Thus CI should be performed as early as before 12 months of age to achieve better language ability.

RNN을 이용한 Expressive Talking Head from Speech의 합성 (Synthesis of Expressive Talking Heads from Speech with Recurrent Neural Network)

  • 사쿠라이 류헤이;심바 타이키;야마조에 히로타케;이주호
    • 로봇학회논문지
    • /
    • 제13권1호
    • /
    • pp.16-25
    • /
    • 2018
  • The talking head (TH) indicates an utterance face animation generated based on text and voice input. In this paper, we propose the generation method of TH with facial expression and intonation by speech input only. The problem of generating TH from speech can be regarded as a regression problem from the acoustic feature sequence to the facial code sequence which is a low dimensional vector representation that can efficiently encode and decode a face image. This regression was modeled by bidirectional RNN and trained by using SAVEE database of the front utterance face animation database as training data. The proposed method is able to generate TH with facial expression and intonation TH by using acoustic features such as MFCC, dynamic elements of MFCC, energy, and F0. According to the experiments, the configuration of the BLSTM layer of the first and second layers of bidirectional RNN was able to predict the face code best. For the evaluation, a questionnaire survey was conducted for 62 persons who watched TH animations, generated by the proposed method and the previous method. As a result, 77% of the respondents answered that the proposed method generated TH, which matches well with the speech.