Search | Korea Science

Modality-Based Sentence-Final Intonation Prediction for Korean Conversational-Style Text-to-Speech Systems

Oh, Seung-Shin;Kim, Sang-Hun
- ETRI Journal
- /
- v.28 no.6
- /
- pp.807-810
- /
- 2006
This letter presents a prediction model for sentence-final intonations for Korean conversational-style text-to-speech systems in which we introduce the linguistic feature of 'modality' as a new parameter. Based on their function and meaning, we classify tonal forms in speech data into tone types meaningful for speech synthesis and use the result of this classification to build our prediction model using a tree structured classification algorithm. In order to show that modality is more effective for the prediction model than features such as sentence type or speech act, an experiment is performed on a test set of 970 utterances with a training set of 3,883 utterances. The results show that modality makes a higher contribution to the determination of sentence-final intonation than sentence type or speech act, and that prediction accuracy improves up to 25% when the feature of modality is introduced.
PDF

Conveyed Message in YouTube Product Review Videos: The discrepancy between sponsored and non-sponsored product review videos

Kim, Do Hun;Suh, Ji Hae
- The Journal of Information Systems
- /
- v.32 no.4
- /
- pp.29-50
- /
- 2023
Purpose The impact of online reviews is widely acknowledged, with extensive research focused on text-based reviews. However, there's a lack of research regarding reviews in video format. To address this gap, this study aims to explore the connection between company-sponsored product review videos and the extent of directive speech within them. This article analyzed viewer sentiments expressed in video comments based on the level of directive speech used by the presenter. Design/methodology/approach This study involved analyzing speech acts in review videos based on sponsorship and examining consumer reactions through sentiment analysis of comments. We used Speech Act theory to perform the analysis. Findings YouTubers who receive company sponsorship for review videos tend to employ more directive speech. Furthermore, this increased use of directive speech is associated with a higher occurrence of negative consumer comments. This study's outcomes are valuable for the realm of user-generated content and natural language processing, offering practical insights for YouTube marketing strategies.
https://doi.org/10.5859/KAIS.2023.32.4.29 인용 PDF

A Domain Action Classification Model Using Conditional Random Fields (Conditional Random Fields를 이용한 영역 행위 분류 모델)

Kim, Hark-Soo
- Korean Journal of Cognitive Science
- /
- v.18 no.1
- /
- pp.1-14
- /
- 2007
In a goal-oriented dialogue, speakers' intentions can be represented by domain actions that consist of pairs of a speech act and a concept sequence. Therefore, if we plan to implement an intelligent dialogue system, it is very important to correctly infer the domain actions from surface utterances. In this paper, we propose a statistical model to determine speech acts and concept sequences using conditional random fields at the same time. To avoid biased learning problems, the proposed model uses low-level linguistic features such as lexicals and parts-of-speech. Then, it filters out uninformative features using the chi-square statistic. In the experiments in a schedule arrangement domain, the proposed system showed good performances (the precision of 93.0% on speech act classification and the precision of 90.2% on concept sequence classification).
PDF

An analysis of Speech Acts for Korean Using Support Vector Machines (지지벡터기계(Support Vector Machines)를 이용한 한국어 화행분석)

En Jongmin;Lee Songwook;Seo Jungyun
- The KIPS Transactions:PartB
- /
- v.12B no.3 s.99
- /
- pp.365-368
- /
- 2005
We propose a speech act analysis method for Korean dialogue using Support Vector Machines (SVM). We use a lexical form of a word, its part of speech (POS) tags, and bigrams of POS tags as sentence features and the contexts of the previous utterance as context features. We select informative features by Chi square statistics. After training SVM with the selected features, SVM classifiers determine the speech act of each utterance. In experiment, we acquired overall $90.54\%$ of accuracy with dialogue corpus for hotel reservation domain.
https://doi.org/10.3745/KIPSTB.2005.12B.3.365 인용 PDF KSCI

Indirect Speech Acts We Live by: A Case Study of Daddy-Son Interactions in Extended speech Act Theory

Kubo, Susumu
- Proceedings of the Korean Society for Language and Information Conference
- /
- 1994.02a
- /
- pp.203-212
- /
- 1994
PDF

Dialogue Strategies to Overcome Speech Recognition Errors in Form-Filling Dialogue (양식 채우기 대화에서 음성 인식 오류의 보완을 위한 대화 전략)

Kang Sang-Woo;Lee Song-Wook;Seo Jung-Yun
- Korean Journal of Cognitive Science
- /
- v.17 no.2
- /
- pp.139-150
- /
- 2006
Speech recognition errors cause fatal results in a spoken dialogue system. When a system can not determine the speech-act of u utterance due to speech recognition errors, a dialogue system has a difficulty in continuing conversation. In this paper, we propose strategies for sub-dialogue generation by inferring the speech-act of an utterance with patterns of recognition errors on the field of form-filling dialogue. We used the proposed method on a plan-based dialogue model, corrected 27% of incomplete tasks, and acquired overall 89% of task completion rate.
PDF

Clinic Study on the Speech Retardation Complained Problems of Articulation & Reading Fluency (조음과 읽기 유창성의 문제를 호소한 어지(語遲) 환자 치험 1례)

Kang, Hee-Chul;Jung, Myong-Suk;Lee, Seung-Gi
- Journal of Physiology & Pathology in Korean Medicine
- /
- v.22 no.6
- /
- pp.1585-1588
- /
- 2008
The purpose of this study was to investigate the clinical application of oriental medical therapy(OMT) to Speech retardation complained problems of Articulation & Reading fluency. We treated the patient with OMT & others. The recovery of Speech retardation was evaluated by Articulation correction test(ACT) & Reading fluency test(RFT). The applicability of OMT & other therapy has positive effects on the patient with Speech retardation complained problems of Articulation & Reading fluency. The scores of ACT & RFT were increased.
PDF KSCI

Review of Korean Speech Act Classification: Machine Learning Methods

Kim, Hark-Soo;Seon, Choong-Nyoung;Seo, Jung-Yun
- Journal of Computing Science and Engineering
- /
- v.5 no.4
- /
- pp.288-293
- /
- 2011
To resolve ambiguities in speech act classification, various machine learning models have been proposed over the past 10 years. In this paper, we review these machine learning models and present the results of experimental comparison of three representative models, namely the decision tree, the support vector machine (SVM), and the maximum entropy model (MEM). In experiments with a goal-oriented dialogue corpus in the schedule management domain, we found that the MEM has lighter hardware requirements, whereas the SVM has better performance characteristics.
https://doi.org/10.5626/JCSE.2011.5.4.288 인용 PDF KPUBS

CNN Based Speech-act Classification Using Sentence Types and Modalities (문장 유형과 양태 정보를 이용한 합성곱 신경망 기반의 대화체 발화 화행 분석)

Park, Yongsin;Ko, Youngjoong
- Annual Conference on Human and Language Technology
- /
- 2018.10a
- /
- pp.642-644
- /
- 2018
화행(Speech-act)이란 어떤 목적을 달성하기 위해 발화를 통해 이루어지는 화자의 행위를 뜻하며, 화행 분석(Speech-act analysis)이란 주어진 발화의 화행을 결정하는 것을 뜻한다. 문장 유형과 양태는 화행의 일종으로, 문장 유형의 경우 화자의 기본적인 발화 의도에 따라 평서문, 명령문, 청유문, 의문문, 감탄문의 다섯 가지 유형으로 나눌 수 있고, 양태는 문장이 표현하는 명제나, 명제가 기술하는 상황에 대해서 화자가 갖는 의견이나 태도를 말한다. 본 논문에서는 종결어미와 보조용언으로부터 비교적 간단하게 추출 가능한 문장 유형과 양태 정보를 활용하여 대화체 발화문의 화행 분석 성능을 높이는 방법을 보인다. 본 논문에서 제안하는 모델은 합성곱 신경망(CNN)을 사용한 기본 모델에 비해 0.52%p 성능 향상을 보였다.
PDF

Meaning and Intonation of Endings with Polysemous Modality: Through the Analysis of the Spontaneous Speech (인식·행위 양태 다의성 어미의 의미와 억양 -구어 자유발화 분석을 통하여-)

Jo, Min-ha
- Korean Linguistics
- /
- v.77
- /
- pp.331-357
- /
- 2017
The purpose of this paper is to identify the workings of intonation realized in the endings through the spoken language. To achieve this objective, this paper has analyzed 300 minutes of spontaneous speech by women from Seoul and discussed the meanings of modality and their relationship with intonation. Intonation functions significantly in polysemous modal endings in epistemic and act modality. Epistemic modality is usually expressed through indirect and soft intonations such as L:, M: and LH, whereas act modality is expressed through direct and strong intonations such as H, HL and LHL. Intonation appears to be related to the Certainty degree of information, rather than classification of modality, Lengthening relate to indirectness, H with uncertainty, L with statements or affirmation, and HL and LHL relates to assertive attitude. This paper is significant as it has overcome the abstractness of existing modality studies and has engaged in objective and comprehensive analysis with actual spontaneous speech data.
https://doi.org/10.20405/kl.2017.11.77.331 인용

Search Result 98, Processing Time 0.019 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)