통합 검색 | Korea Science

Text-driven Speech Animation with Emotion Control

Chae, Wonseok;Kim, Yejin
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제14권8호
- /
- pp.3473-3487
- /
- 2020
In this paper, we present a new approach to creating speech animation with emotional expressions using a small set of example models. To generate realistic facial animation, two example models called key visemes and expressions are used for lip-synchronization and facial expressions, respectively. The key visemes represent lip shapes of phonemes such as vowels and consonants while the key expressions represent basic emotions of a face. Our approach utilizes a text-to-speech (TTS) system to create a phonetic transcript for the speech animation. Based on a phonetic transcript, a sequence of speech animation is synthesized by interpolating the corresponding sequence of key visemes. Using an input parameter vector, the key expressions are blended by a method of scattered data interpolation. During the synthesizing process, an importance-based scheme is introduced to combine both lip-synchronization and facial expressions into one animation sequence in real time (over 120Hz). The proposed approach can be applied to diverse types of digital content and applications that use facial animation with high accuracy (over 90%) in speech recognition.
https://doi.org/10.3837/tiis.2020.08.018 인용 PDF KSCI HTML

벅아이 코퍼스 오류 수정과 코퍼스 활용을 위한 프랏 스크립트 툴 (Error Correction and Praat Script Tools for the Buckeye Corpus of Conversational Speech)

윤규철
- 말소리와 음성과학
- /
- 제4권1호
- /
- pp.29-47
- /
- 2012
The purpose of this paper is to show how to convert the label files of the Buckeye Corpus of Spontaneous Speech [1] into Praat format and to introduce some of the Praat scripts that will enable linguists to study various aspects of spoken American English present in the corpus. During the conversion process, several types of errors were identified and corrected either manually or automatically by the use of scripts. The Praat script tools that have been developed can help extract from the corpus massive amounts of phonetic measures such as the VOT of plosives, the formants of vowels, word frequency information and speech rates that span several consecutive words. The script tools can extract additional information concerning the phonetic environment of the target words or allophones.
https://doi.org/10.13064/KSSS.2012.4.1.029 인용 PDF

초점과 주제의 음성학적 관련성 (Focus, Topic and Their Phonetic Relevance.)

김용범
- 한국언어정보학회지:언어와정보
- /
- 제8권1호
- /
- pp.27-52
- /
- 2004
This paper attempts to define various notions involving focus and topic found in Korean and also employs phonetic measures to verify the plausibility of those notions that are theoretically argued for. This paper crucially relies on Prince (1981) for the notion of familiarity and its pragmatic significance, and adopts Rooth's (1985) notion of alternative set and utilizes it in the light of pragmatic interpretation. The basic idea of this paper is to decompose the notion alternative set into finer-grained components and to assign various levels of familiarity to those finer components, thereby helping define different kinds of focus and topic according to the properties of those sub-components.
PDF

정보 시스템의 유니코드 기반 한자 검색 지원 (Support on Ideograph Characters Search of Unicode Based Information System)

윤소영
- 정보관리학회지
- /
- 제24권4호
- /
- pp.375-391
- /
- 2007
현재 유니코드 CJK 한자코드는 부수 기준 배열방식을 따르고 있어 한자의 한글음가를 기준으로 하는 우리의 문자생활 방식과 차이가 있으며, 우리나라 고유한자나 동형이음어, 이두문자, 그리고 이체자 관계 등을 모두 수록하고 있지 않아 정보시스템에 그대로 적용하기에는 무리가 있다. 따라서 유니코드 기반 정보시스템의 정확한 한자표현 및 한자검색을 위해서는 한자를 포함하는 자료에 대한 정확한 이해를 바탕으로 여러 가지 지원방안을 마련해야 한다. 이러한 측면에서 역사분야 정보검색시스템에서는 한글음가 및 한국에서만 사용되는 동형이음어 처리를 위한 한자-한글음가 사전, 본래 한자의 음가와 다르게 읽히는 한자를 위한 특수용어사전, 이형자와이체자를 위한 이체자사전, 그리고 유니코드 CJK 통합한자에 등록되어 있지 않은 한자를 위한 신출 한자목록을 지원하고 있다.
https://doi.org/10.3743/KOSIM.2007.24.4.375 인용 PDF

단어의 음성학적 특징을 이용한 한국어 기계 번역 데이터 세트 구축 방안 (Proposed Methodology for Building Korean Machine Translation Data sets Considering Phonetic Features)

장칭하오;양홍진;김세린;권혁철
- 한국정보과학회 언어공학연구회:학술대회논문집(한글 및 한국어 정보처리)
- /
- 한국정보과학회언어공학연구회 2022년도 제34회 한글 및 한국어 정보처리 학술대회
- /
- pp.592-595
- /
- 2022
한국어에서 한자어와 외래어가 차지하는 비중은 매우 높다. 일상어의 경우 한자어와 외래어의 비중이 약 53%, 전문어의 경우 약 92%에 달한다. 한자어나 외래어는 중국이나 다른 나라로부터 영향을 받아 한국에서 쓰이는 단어들이다. 한국어에서 사용되는 한자어와 외래어의 한글 표기과 원어 표기를 발음해보면, 발음이 상당히 유사하다는 것을 알 수 있다. 한자어인 도서관(图书馆)을 중국어로 발음해보면 t^hu.ʂu.kwan'로 해당 단어에 대한 한국 사람의 발음과 상당히 유사하다. 본 논문에서는 Source Length, Source IPA Length, Target Length, Target IPA Length, IPA Distance 등 총 5가지의 음성학적 특징을 고려한 한국어-중국어 한국어-영어 단어 기계번역 데이터 세트를 구축하고자 한다.
PDF

음성학적 지식과 DAC 기반 분할 알고리즘 (Phonetic Acoustic Knowledge and Divide And Conquer Based Segmentation Algorithm)

구찬모;왕지남
- 정보처리학회논문지B
- /
- 제9B권2호
- /
- pp.215-222
- /
- 2002
본 논문에서는 음절이 잘 발달되어 있는 한국어에 대해서 신뢰할 수 있는 완전 자동화된 레이블링 시스템을 제안한다. 음운 및 음향학적인 정보를 최대한 이용하고 분할에러를 줄이기 위해서 조절 메카니즘의 하나로 DAC개념을 사용하여 음성을 speechlet으로 나누고 분할 된 음성 구간에 대해서 레이블링을 시도하는 DAC기반 분할알고리즘이다. HMM방법이 획일적이고 확정적인 성능을 갖는 반면 본 제안 방법은 음성학적인 특화지식을 컴포넌트로 개발 추가 계속 향상시킬 수 있는 프레임워크를 제시하고 있다는 점에서 주요 의의가 있다고 하겠다. MM과 같은 통계학적인 방법을 이용하지 않고 음운학적, 음향학적 지식만을 이용하는 새로운 방법은 수행속도와 음성학적인 특화 지식컴포넌트를 확장함에 따라 일관성이 있으며 효과적 방법으로 적용가능 할 것이다. 제안 방법을 검증하기 위하여 실험결과를 제시하였다.
https://doi.org/10.3745/KIPSTB.2002.9B.2.215 인용 PDF KSCI

Electromyographic evidence for a gestural-overlap analysis of vowel devoicing in Korean

Jun, Sun-A;Beckman, M.;Niimi, Seiji;Tiede, Mark
- 음성과학
- /
- 제1권
- /
- pp.153-200
- /
- 1997
In languages such as Japanese, it is very common to observe that short peripheral vowel are completely voiceless when surrounded by voiceless consonants. This phenomenon has been known as Montreal French, Shanghai Chinese, Greek, and Korean. Traditionally this phenomenon has been described as a phonological rule that either categorically deletes the vowel or changes the [+voice] feature of the vowel to [-voice]. This analysis was supported by Sawashima (1971) and Hirose (1971)'s observation that there are two distinct EMG patterns for voiced and devoiced vowel in Japanese. Close examination of the phonetic evidence based on acoustic data, however, shows that these phonological characterizations are not tenable (Jun & Beckman 1993, 1994). In this paper, we examined the vowel devoicing phenomenon in Korean using data from ENG fiberscopic and acoustic recorders of 100 sentences produced by one Korean speaker. The results show that there is variability in the 'degree of devoicing' in both acoustic and EMG signals, and in the patterns of glottal closing and opening across different devoiced tokens. There seems to be no categorical difference between devoiced and voiced tokens, for either EMG activity events or glottal patterns. All of these observations support the notion that vowel devoicing in Korean can not be described as the result of the application of a phonological rule. Rather, devoicing seems to be a highly variable 'phonetic' process, a more or less subtle variation in the specification of such phonetic metrics as degree and timing of glottal opening, or of associated subglottal pressure or intra-oral airflow associated with concurrent tone and stricture specifications. Some of token-pair comparisons are amenable to an explanation in terms of gestural overlap and undershoot. However, the effect of gestural timing on vocal fold state seems to be a highly nonlinear function of the interaction among specifications for the relative timing of glottal adduction and abduction gestures, of the amplitudes of the overlapped gestures, of aerodynamic conditions created by concurrent oral tonal gestures, and so on. In summary, to understand devoicing, it will be necessary to examine its effect on phonetic representation of events in many parts of the vocal tracts, and at many stages of the speech chain between the motor intent and the acoustic signal that reaches the hearer's ear.
PDF

Sequence-to-Sequence Model을 이용한 영어 발음 기호 자동 변환 (Automatic Conversion of English Pronunciation Using Sequence-to-Sequence Model)

이공주;최용석
- 정보처리학회논문지:소프트웨어 및 데이터공학
- /
- 제6권5호
- /
- pp.267-278
- /
- 2017
영어는 동일 철자의 발음이 매우 다양한 언어이기 때문에 사전에 기술되어 있는 단어의 발음기호를 읽어야만 정확한 발음을 알 수 있다. 영어 사전마다 사용하는 발음기호(phonetic alphabet) 시스템이 다르며 같은 단어에 대해 기술하고 있는 발음 역시 다르다. 본 연구에서는 최근 딥 러닝 분야에서 널리 사용되고 있는 sequence-to-sequence (seq2seq) model을 이용하여 사전마다 다른 발음을 자동으로 변환해 보고자 한다. 4가지 다른 종류의 사전에서 추출한 발음 데이터를 이용하여 모두 12개의 seq2seq model을 구현하였으며, 발음 자동 변환 모듈의 정확 일치율은 74.5% ~ 89.6%의 성능을 보였다. 본 연구의 주요 목적은 다음의 두 가지이다. 첫째 영어 발음기호 시스템과 각 사전의 발음 데이터 특성을 살펴보는 것이고, 둘째, 발음 정보의 자동 변환과 오류 분석을 통해 seq2seq model의 특성을 살펴보는 것이다.
https://doi.org/10.3745/KTSDE.2017.6.5.267 인용 PDF KSCI

영문 브랜드네임의 중문 브랜드네임 전환 방식에 대한 중화권 소비자들의 브랜드 평가에 관한 연구 -브랜드컨셉의 조절효과를 중심으로- (Evaluations of Chinese Brand Name by Different Translation Types: Focusing on The Moderating Role of Brand Concept)

이지은;전주언
- Asia Marketing Journal
- /
- 제12권4호
- /
- pp.1-25
- /
- 2011
중화권 시장의 지속적인 성장으로 글로벌 브랜드들은 중화권 시장에 성공적으로 진입하기 위해서 다양한 노력을 기울이고 있다. 중화권 시장에 속한 국가들은 중국어를 공통적으로 사용하기 때문에 이 시장에 진출하려는 기업들은 영문 브랜드네임을 중문 브랜드 네임으로 전환해야 한다. 중문 브랜드네임에 대한 기존 연구들을 살펴보면, 중화권 시장에 진출한 글로벌 브랜드네임에 대한 사례분석에 초점이 맞추어져 있을 뿐 브랜드가 제공하는 편익과 중문 브랜드네임 전환방식과의 관계를 실증적으로 검증한 연구는 부족한 실정이다. 따라서 본 연구는 영문 브랜드네임이 중문 브랜드네임으로 전환되는 방식에 대해 살펴보았으며, 브랜드컨셉에 따라 중문 브랜드네임 전환방식에 대해 중화권 소비자들은 다른 평가를 내릴 것이라 예측하고 이에 대해 실증적으로 검증하였다. 연구결과 중화권 소비자들은 기능적 브랜드 컨셉제품의 경우 음운과 의미가 반영된 음운어의론식 전환방식에 대해서 가장 호의적으로 평가하였으며, 다음으로 의미가 반영된 의차식 전환이 호의적인 평가를 받았다. 반면에 상징적 브랜드 컨셉제품의 경우는 음운어의론식 전환방식을 가장 호의적으로 평가하였으며, 다음으로 음운이 반영된 음차식 전환에 대해 호의적으로 평가하는 것으로 나타났다. 실증결과를 토대로 글로벌 브랜드를 담당하는 브랜드 매니저들에게 자사 브랜드가 중화권 소비자들에게 제공하는 목표 편익에 따라서 중문 브랜드네임 전환방식이 달라져야 함을 제안하였다.
PDF

입말 표기를 이용한 영어 단어 검색 (Retrieving English Words with a Spoken Work Transliteration)

김지승;김광현;이준호
- 한국문헌정보학회지
- /
- 제39권3호
- /
- pp.93-103
- /
- 2005
영어 사전 검색 서비스 이용자들은 원하는 영어 단어의 철자를 정확하게 기억하지 못하고, 발음만을 기억하는 경우가 있다. 이러한 이용자들에게 도움을 주기 위해 본 연구에서는 입말 표기, 즉 영어 단어 발음의 한글 표기를 이용하여 영어 단어를 효과적으로 검색할 수 있는 방법을 제안한다. 이를 위하여 코닉스(KONIX) 코드를 개발하며, 입말 표기와 영어 단어를 코닉스 코드들로 변환한다. 그리고 변환된 코닉스 코드들 사이의 음성적 유사도를 편집 거리 방법과 2-그램 방법을 이용하여 계산한다. 또한 제안한 방법이 입말 표기에 의한 영어 단어 검색에 매우 효과적임을 실험을 통하여 입증한다.
https://doi.org/10.4275/KSLIS.2005.39.3.093 인용 PDF

검색결과 276건 처리시간 0.034초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)