• Title/Summary/Keyword: Utterance

A VOWEL TRAJECTORY DISPLAY FOR SPEECH TRAINING

  • Kido, Ken'iti;Tanahashi, Kenji;Ohuchi, Yasuhiro
    • Proceedings of the Acoustical Society of Korea Conference / 1994.06a / pp.971-976 / 1994
  • A speech display system is developed for evaluating and training speech utterances. The speech is analyzed by linear prediction every 5 ms, and the frequencies of the two lowest local spectral peaks, P1 and P2, are extracted. The vowel trajectory is displayed using those frequencies on the P1-P2 plane. In most cases, P1 and P2 correspond to the first and second formants, but for indistinct utterances the correspondence between the local spectral peaks and the formants tends to break down. The system is therefore considered useful for evaluating speech quality. Examples of words uttered by normal speakers and by patients with speech difficulties are compared with each other to discuss the effectiveness of the system.

A Study on the Analysis of Korean Native Speakers's Utterance Fluency (한국어 모어 화자의 발화 유창성 분석 연구)

  • Lee, Jin
    • Korean Linguistics / v.81 / pp.245-265 / 2018
  • The purpose of this study is to lay the groundwork for a more objective evaluation of oral fluency by analyzing the utterances of native Korean speakers. Traditionally, fluency evaluation has tended to rely on the evaluators' experience and subjective judgment, so there has been a need for an evaluation standard in an easily measurable numeric form. This study analyzes the utterances of native Korean speakers with a focus on pauses. A total of 875 pauses were extracted from the 21st Century Sejong Korean spoken corpus, and the elements before and after each pause were annotated. Based on the analysis, the pauses were divided into fluent pauses and disfluent pauses. If the length of a fluent pause does not exceed a reasonable pause length for native Korean speakers, no points are deducted. If, on the other hand, disfluent pauses occur more frequently than they do for native Korean speakers, points are deducted.

Approximated Posterior Probability for Scoring Speech Recognition Confidence

  • Kim Kyuhong;Kim Hoirin
    • MALSORI / no.52 / pp.101-110 / 2004
  • This paper proposes a new confidence measure for utterance verification based on posterior probability approximation. The proposed method approximates probabilistic likelihoods by using Viterbi search characteristics and a clustered phoneme confusion matrix. Our measure is a weighted linear combination of acoustic and phonetic confidence scores. The proposed algorithm outperforms algorithms that use conventional confidence measures, even with reduced computational complexity.
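
The weighted linear combination mentioned in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the assumption that both scores lie in [0, 1], the choice of weights that sum to 1, and the default weight and threshold are all hypothetical.

```python
def combined_confidence(acoustic: float, phonetic: float, weight: float = 0.5) -> float:
    """Weighted linear combination of two confidence scores.

    Assumption: both scores lie in [0, 1] and the two weights sum to 1,
    so `weight` applies to the acoustic score and `1 - weight` to the
    phonetic score. The abstract does not specify the weighting scheme.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return weight * acoustic + (1.0 - weight) * phonetic


def accept_utterance(acoustic: float, phonetic: float,
                     weight: float = 0.5, threshold: float = 0.6) -> bool:
    """Accept the recognition result if the combined score reaches the threshold.

    The threshold value is a hypothetical placeholder, not a published figure.
    """
    return combined_confidence(acoustic, phonetic, weight) >= threshold
```

In practice, the weight and threshold would be tuned on held-out data. The values here only make the sketch runnable.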

Prosodic characteristics of French language in conversational discourse (프랑스어의 대화 담화에 나타난 운율 연구)

  • Ko, Young-Lim;Yoon, Ae-Sun
    • Speech Sciences / v.8 no.2 / pp.165-180 / 2001
  • In this paper, prosodic characteristics of French are analysed using a corpus of radio interviews. Intonation patterns are interpreted in terms of a rising pattern, a focal rising pattern and a falling pattern. Accentual prominence is classified into two types, rhythmic accent and focal accent. Focal accent helps explain cohesion within an utterance or between two utterances. As a prosodic variable of discourse, pauses are described by their form of realization (filled pause, silent pause, hesitation, etc.), their distribution and their function in the utterance.

On the Role of the Phatic Function of Intonation in Russian (러시아어 발화시 억양의 역할)

  • Park, Kun-Woo
    • Speech Sciences / v.4 no.1 / pp.81-89 / 1998
  • This paper investigates the phatic function of intonation in Russian by recording and analysing 11 female native speakers of standard Moscow Russian. It shows that differences in the intonation pattern of a sentence are associated with differences in the degree of the listener's involvement in the speech. The intonation pattern of an utterance with a phatic function appears to be determined by 1) the speaker's readiness to talk in order to attract the listener's attention; 2) the speaker's intention to continue the communication. Some emphasis is placed on the relationship between the intonation pattern of an utterance and speaker-listener interaction.

Acoustic characteristics of Korean vowels on pitch alteration utterance (피치 변경 발성에 따른 모음의 음향적 특성)

  • 조창수;홍광석
    • Proceedings of the IEEK Conference / 2003.07e / pp.2439-2442 / 2003
  • In this paper, we examine the acoustic characteristics of Korean vowels in utterances with altered pitch. Prosody is known to be an indicator of the acoustic characteristics of emotions. Speech also differs acoustically with emotional and environmental variation, even when the speaker utters the same speech. We analyzed the spectral envelopes and formants of the voiced regions as data points on the speech waveform.

A Study on Out-of-Vocabulary Rejection Algorithms using Variable Confidence Thresholds (가변 신뢰도 문턱치를 사용한 미등록어 거절 알고리즘에 대한 연구)

  • Bhang, Ki-Duck;Kang, Chul-Ho
    • Journal of Korea Multimedia Society / v.11 no.11 / pp.1471-1479 / 2008
  • In this paper, we propose a technique to improve Out-Of-Vocabulary (OOV) rejection algorithms in variable-vocabulary recognition systems, which are widely used in Automatic Speech Recognition (ASR). Rejection systems can be classified into two categories by implementation method: the keyword spotting method and the utterance verification method. The utterance verification method decides whether an input is OOV using the likelihood ratio of each phoneme's Viterbi score to the anti-phoneme score. We add a speaker verification system before utterance verification and calculate a speaker verification probability, which is then used to determine the proposed variable confidence threshold. Using the proposed method, we achieve a significant performance improvement: CA (Correctly Accepted for keyword) of 94.23% and CR (Correctly Rejected for out-of-vocabulary) of 95.11% in an office environment, and CA of 91.14% and CR of 92.74% in a noisy environment.

Lip Reading Method Using CNN for Utterance Period Detection (발화구간 검출을 위해 학습된 CNN 기반 입 모양 인식 방법)

  • Kim, Yong-Ki;Lim, Jong Gwan;Kim, Mi-Hye
    • Journal of Digital Convergence / v.14 no.8 / pp.233-243 / 2016
  • Because of speech recognition problems in noisy environments, Audio Visual Speech Recognition (AVSR) systems, which combine speech information and visual information, have been proposed since the mid-1990s, and lip reading has played a significant role in AVSR systems. This study aims to improve the recognition rate of uttered words using only lip shape detection, for an efficient AVSR system. After preprocessing for lip region detection, Convolutional Neural Network (CNN) techniques are applied for utterance period detection and lip shape feature vector extraction, and Hidden Markov Models (HMMs) are then used for recognition. The utterance period detection achieves a 91% success rate, which is higher than that of general threshold methods. In lip reading recognition, the user-dependent experiment records a recognition rate of 88.5% and the user-independent experiment 80.2%, both improvements over previous studies.

Topic and Topic Change Detection in Instant Messaging (인스턴트 메시징에서의 대화 주제 및 주제 전환 탐지)

  • Choi, Yoon-Jung;Shin, Wook-Hyun;Jeong, Yoon-Jae;Myaeng, Sung-Hyon;Han, Kyoung-Soo
    • Journal of the Korea Society of Computer and Information / v.13 no.7 / pp.59-66 / 2008
  • This paper describes a novel method for identifying the main topic and detecting topic changes in a text-based dialogue such as Instant Messaging (IM). Compared to other forms of text, dialogues are characterized by short texts with few words, two or more participants, and a history that affects the current utterance. Taking these characteristics into account, our method detects the main topic of a dialogue by considering keywords not only in the user's utterances but also in the dialogue system's responses. Dialogue histories are also considered in the detection process to increase accuracy. For topic change detection, the similarity between the former utterance's topic and the current utterance's topic is calculated. If the similarity is smaller than a certain threshold, our system judges that the topic has changed at the current utterance. We obtained 88.2% and 87.4% accuracy in topic detection and topic change detection, respectively.
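
The threshold rule in this abstract can be sketched in a few lines of Python. The abstract does not say which similarity measure or threshold the authors use. This sketch assumes cosine similarity over keyword counts and an arbitrary threshold of 0.2, and the function names are hypothetical.

```python
import math
from collections import Counter


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two keyword-count vectors (assumed measure)."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty keyword set shares nothing with any topic
    return dot / (norm_a * norm_b)


def topic_changed(previous_keywords, current_keywords, threshold: float = 0.2) -> bool:
    """Flag a topic change when similarity falls below the threshold.

    `threshold` is a hypothetical value chosen for illustration only.
    """
    similarity = cosine_similarity(Counter(previous_keywords), Counter(current_keywords))
    return similarity < threshold
```

For example, `topic_changed(["movie", "ticket"], ["lunch", "pizza"])` flags a change because the two keyword sets share no terms, whereas `["movie", "ticket"]` followed by `["movie", "time"]` does not.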

Preceded Utterance Conversational Agent's Effect on User Experience with User's Task Performance and Conversational Agent's Self-Disclosure (선제 발화하는 대화형 에이전트가 사용자 경험에 미치는영향: 사용자 과제 수행과 대화형 에이전트의 자기노출을 중심으로)

  • Shin, Hyorim;Lee, Soyeon;Kang, Hyunmin
    • The Journal of the Convergence on Culture Technology / v.8 no.1 / pp.565-576 / 2022
  • The scope and functions of conversational agents are gradually expanding. In particular, research and technology development are under way on conversational agents that can speak first without being called by the user. However, because this work is still in its early stages, there is little research on how a preceded utterance conversational agent (one that speaks first) affects users. Accordingly, this study used a 2×3 mixed design with the user's task performance condition and the agent's self-disclosure as independent variables, and measured Intimacy, Functional Satisfaction, Psychological Reactance, and Workload as dependent variables, to identify the effects of such an agent on user experience.