• Title/Summary/Keyword: Speech Class

A Study of Keyword Spotting System Based on the Weight of Non-Keyword Model (비핵심어 모델의 가중치 기반 핵심어 검출 성능 향상에 관한 연구)

  • Kim, Hack-Jin;Kim, Soon-Hyub
    • The KIPS Transactions:PartB
    • /
    • v.10B no.4
    • /
    • pp.381-388
    • /
    • 2003
  • This paper presents a method of assigning weights to garbage class clustering and the Filler model to improve the performance of a keyword spotting system, together with a time-saving method for a dialogue speech processing system that calculates keyword transition probabilities through speech analysis of task domain users. The key to the method is grouping phonemes by phonetic similarity, which makes it more effective to detect similar phoneme groups than individual phonemes. The paper proposes five phoneme groups obtained from an analysis of Korean morphology and of the speech sentences used in a stock-trading speech processing system. In addition, task-specific Filler model weights are applied to the phoneme groups, and the keyword transition probabilities in consecutive speech sentences are calculated and applied to the system to reduce processing time. To evaluate the proposed system, a corpus of 4,970 sentences was built for the task domain, and a test was conducted with five subjects in their twenties and thirties. As a result, the figure of merit (FOM) with weights on the proposed five phoneme groups reaches 85%, which is better than the seven phoneme groups of Yapanel [1] with 88.5% and slightly poorer than LVCSR with 89.8%. Calculation time is also lower, at 0.70 seconds compared with 0.72 seconds for the seven phoneme groups. Lastly, a time-saving test confirms that applying the keyword transition probability saves 0.04 to 0.07 seconds.

The Dramatization of Habitus: A Bourdieun Reading of Pygmalion

  • Hwang, Hoon-Sung
    • Journal of English Language & Literature
    • /
    • v.55 no.3
    • /
    • pp.383-398
    • /
    • 2009
  • Based on the Greek myth of Pygmalion and the fairy tale of Cinderella, Shaw's Pygmalion demonstrates a masterful coalescence of these two narrative motifs into a coherent plot scheme. Even more significant is his keen insight into the conflicts created at the tripartite intersection of human activity concerning language/class/culture, which, as the leitmotif, revolves around lessons in language learning. This play basically deals with human transformation and, by its very nature, Higgins's experimentation with transforming Eliza cannot stop at language alone. Her cultural transformation ripples over into the realms of gesture and even a unique way of living (modus vivendi) intimately associated with taste and manners, which Bourdieu terms habitus. By acquiring a new fashion and language, Eliza is reborn as a new lady aspiring to be filled with a newly acquired habitus. While separating her from her old Cockney style, Higgins inculcates Queen's English in Eliza, and in this process her changed speech styles gradually transform and restructure her deportment and manners, finally generating new practices, perceptions and attitudes. The gist of Pygmalion, however, is less Eliza's ascent into the middle class than her battle for symbolic capital waged at the level of language. By problematizing the contemporary practice of habitus, conventionalized and warped by class distinctions based on economic, social and cultural capital, Shaw creates a new humanist model of man founded on spiritual and rational virtues. In conclusion, Eliza is not a frigid Galatea but a dynamic character who goes through a brilliant transformation in three stages: 1) linguistic, 2) cultural, and 3) humanist. Finally she is built into a "consort battleship" on an equal standing with her sculptor. The process of her character-building cannot be illuminated without resorting to the dynamic notion of habitus, which highlights the process of inculcation, structuring, generation and transposing.
Given the overwhelming weight of the heroine's role and the dynamic process of her transformation as the major plot scheme, this play should be christened Galatea in lieu of Pygmalion.

Investigation of the listening environment for lower grade students in elementary school using subjective tests (주관적 평가법을 이용한 초등학교 저학년 교실의 청취환경 조사)

  • Park, Chan-Jae;Haan, Chan-Hoon
    • The Journal of the Acoustical Society of Korea
    • /
    • v.40 no.3
    • /
    • pp.201-212
    • /
    • 2021
  • The present study was conducted as a pilot investigation to suggest standards of acoustic performance for classrooms suited to listeners whose hearing is not yet fully developed, such as children under 9 years of age. Subjective evaluations, consisting of a questionnaire and a speech intelligibility test, were conducted with a total of 264 students at two elementary schools in Cheong-ju in order to analyze the characteristics of the listening environment in lower-grade elementary school classrooms and to investigate the students' satisfaction with it. The students responded that the teacher's voice is the most helpful type of information for understanding class content. They also rated the volume of their current teacher's voice as normal and its clarity as highly satisfactory. As for the acoustic performance of the classroom, the opinion that the noise was normal and the reverberation was very short was found to be dominant in overall satisfaction with the listening environment. Meanwhile, the speech intelligibility test, which used a word list selected for lower-grade elementary school students, suggests that for 8-year-olds the distance from the sound source along the longitudinal axis is a factor that affects speech recognition.

How to Teach English Intonation to Japanese Students

  • Masaki Tsuzuki
    • Proceedings of the KSPS conference
    • /
    • 1996.02a
    • /
    • pp.47-61
    • /
    • 1996
  • The phonetic study of the English language in Japan is a matter of great importance, a problem of major concern and a vital subject. The special difficulties which Japanese college students have in learning English lie in the field of the prosodic features of English, such as syllable, rhythm, stress, intonation, prominence, etc. These difficulties have made Japanese students' pronunciation relatively monotonous or mora(ness). In my presentation, the specific phonetic features of the Japanese language will first be discussed and clarified. Then an effective method of teaching intonation to improve Japanese students' pronunciation will be suggested. Finally, an oral dialogue with intonation analysis and transcription in the classroom will be demonstrated to highlight the presentation.

Glottal Area and Voice Onset Time

  • Kim, Dae-Won
    • MALSORI
    • /
    • no.15_18
    • /
    • pp.19-34
    • /
    • 1989
  • There is general agreement that voice onset time (VOT) is functionally related to the glottal opening at the moment of the oral release of a stop. However, systematic investigations of tempo and the place of articulation as factors affecting the glottal opening and VOT have been relatively neglected. Various instrumental techniques were used to verify the claim with BrEng and Korean speakers under controlled experimental conditions, tempo being one of them. It was found that voiceless aspiration (i.e. VOT) is not simply a function of the glottal area at the moment of the oral release of a stop, as it is normally defined in the existing literature. Within a given place of articulation and across tempo, VOT was generally not significantly related to the glottal area. It is inferred that the glottal adduction onset time for the following vowel is actively controlled by the speaker to meet aerodynamic requirements in relation to class (i.e. aspirated and unaspirated) and tempo. Some possible underlying physiological mechanisms for various phonetic aspects of intervocalic stops, associated with the glottal area and VOT, are discussed.

Airborne Noise Level of Navy Ships (함정의 공기중 소음수준)

  • 김종철;박일권;김경용;안호일
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.5 no.4
    • /
    • pp.27-37
    • /
    • 2002
  • Airborne noise is an important environmental factor for navy ship personnel because they carry out their tasks in confined ship spaces. In this study, the effects of airborne noise on personnel and the existing criteria for acceptable airborne noise on ships are briefly reviewed. Statistical results for the airborne noise levels of Korean navy ships are analyzed by ship class and compared with the airborne noise levels of US navy ships. These results can be used to propose airborne noise criteria for future navy ships.

Performance Improvement of Korean Connected Digit Recognition Using Various Discriminant Analyses (다양한 변별분석을 통한 한국어 연결숫자 인식 성능향상에 관한 연구)

  • Song Hwa Jeon;Kim Hyung Soon
    • MALSORI
    • /
    • no.44
    • /
    • pp.105-113
    • /
    • 2002
  • In Korean, each digit is monosyllabic and some pairs are known to be highly confusable, which degrades the performance of connected digit recognition systems. To improve the performance, in this paper we employ various discriminant analyses (DA), including Linear DA (LDA), Weighted Pairwise Scatter LDA (WPS-LDA), Heteroscedastic Discriminant Analysis (HDA), and Maximum Likelihood Linear Transformation (MLLT). We also examine several combinations of these DA methods for additional performance improvement. Experimental results show that applying any of the DA methods mentioned above improves the string accuracy, but the amount of improvement from each method varies with the model complexity, i.e. the number of mixtures per state. In particular, a string error reduction of more than 20% relative to the baseline system is achieved by applying MLLT after WPS-LDA when the class level of DA is defined as a tied state and 1 mixture per state is used.

An acoustic study of Korean lenis stop voicing - in relation to prosodic structure - (국어 파열연자음 유성음화에 관한 음향음성학적 고찰 -운율구조와 관련하여-)

  • Kim Hyo Sook;Kim Sun Ju;Kim Sunmi
    • MALSORI
    • /
    • no.39
    • /
    • pp.15-24
    • /
    • 2000
  • This study aims to reexamine Korean Lenis Stop Voicing (henceforth, LSV) and to specify its conditions in phonetic terms. LSV optionally occurs within certain prosodic domains, which are called 'Malthomak' (Lee, 1996), 'phonological phrase' (Kang, 1992), or 'accentual phrase' (Jun, 1993). On the basis of Jun's phrasing, this study focuses on the more specific phonetic conditions of LSV in accentual phrase medial position, sub-classifying voicing as complete and partial. The results show that whether the stops become completely or partially voiced is determined by various phonetic environments, such as adjacent segments and following intonational phrase boundaries. It is shown that the conditions of LSV should be described in terms of more detailed phonetic environments and that these could be used to predict the class of voicing.

The Effects of Critical Friends on the Self-Esteem and Academic Oral Presentation Ability of Teacher Students

  • Malisuwan, Pattapee
    • Asian Journal for Public Opinion Research
    • /
    • v.4 no.4
    • /
    • pp.246-259
    • /
    • 2017
  • The purpose of this study is to evaluate the effects of critical friends on the self-esteem and academic oral presentation ability of undergraduate students. A pretest was conducted in the first week of the semester. Pre-academic oral presentation preparation was held from the second week to the seventh week, followed by pedagogical speech activities from the eighth week to the eleventh week. The research instruments were academic oral presentation behavior and self-esteem evaluation forms. The sample comprised 37 third-year undergraduate students, who were purposively selected from the educational technology class at Chulalongkorn University. The statistics used for analyzing the quantitative data were frequencies, means, standard deviations, one-sample t-tests, and Pearson's product-moment correlations. It was found that the 37 third-year undergraduate teacher students had higher self-esteem, statistically significant at the .05 level, and that their academic oral presentation scores after the activity were also statistically significant at the .05 level.

Design and Implementation of a Call Control Markup Interpreter and Its Interaction with Voice Dialog Systems (호 제어 마크업 해석기 개발 및 음성 대화 시스템과의 연동)

  • Lee, Kyung-A;Kwon, Ji-Hye;Kim, Ji-Young;Hong, Ki-Hyung
    • MALSORI
    • /
    • no.53
    • /
    • pp.171-183
    • /
    • 2005
  • Call Control eXtensible Markup Language (CCXML) is a standard language that supports call control for voice dialog systems such as VoiceXML-based systems. CCXML allows developers to handle telephony calls easily, without deep knowledge of telephony networks and their switching systems. We design and implement a call control markup interpreter. The implementation uses a Dialogic JCT-LS board, but because we designed a wrapper class for CTI (computer telephony integration) board features, the interpreter can easily adopt other CTI boards. We also design and implement an event-based interaction scheme between the interpreter and voice dialog systems. To verify the interaction scheme, we implement a simple voice dialog system.
