• 제목/요약/키워드: Prosodic features

검색결과 75건 처리시간 0.021초

한국어 구조적 중의성 문장에 대한 일본인 중급 한국어 학습자들의 발화양상 (Prosodic aspects of structural ambiguous sentences in Korean produced by Japanese intermediate Korean learners)

  • 윤영숙
    • 말소리와 음성과학
    • /
    • 제7권3호
    • /
    • pp.89-97
    • /
    • 2015
  • The aim of this study is to investigate the prosodic aspects of structural ambiguous sentences in Korean produced by Japanese Korean learners and the influence of their first language prosody. Previous studies reported that structural ambiguous sentences in Korean are different especially in prosodic phrasing. So we examined whether Japanese Korean leaners can also distinguish, in production, between two types of structural ambiguous sentences on the basis of prosodic features. For this purpose 4 Korean native speakers and 8 Japanese Korean learners participated in the production test. Analysis materials are 6 sentences where a relative clause modify either NP1 or NP1+NP2. The results show that Korean native speakers produced ambiguous sentences by different prosodic structure depending on their semantic and syntactic structure (left branching or right branching sentence). Japanese speakers also show distinct prosodic structure for two types of ambiguous sentences in most cases, but they have more errors in producing left branching sentences than right branching sentences. In addition to that, interference of Japanese pitch accent in the production of Korean ambiguous sentences was observed.

운율 특성 벡터와 가우시안 혼합 모델을 이용한 감정인식 (Emotion Recognition using Prosodic Feature Vector and Gaussian Mixture Model)

  • 곽현석;김수현;곽윤근
    • 한국소음진동공학회:학술대회논문집
    • /
    • 한국소음진동공학회 2002년도 추계학술대회논문집
    • /
    • pp.762-766
    • /
    • 2002
  • This paper describes the emotion recognition algorithm using HMM(Hidden Markov Model) method. The relation between the mechanic system and the human has just been unilateral so far. This is the why people don't want to get familiar with multi-service robots of today. If the function of the emotion recognition is granted to the robot system, the concept of the mechanic part will be changed a lot. Pitch and Energy extracted from the human speech are good and important factors to classify the each emotion (neutral, happy, sad and angry etc.), which are called prosodic features. HMM is the powerful and effective theory among several methods to construct the statistical model with characteristic vector which is made up with the mixture of prosodic features

  • PDF

The role of prosody in dialect authentication Simulating Masan dialect with Seoul speech segments

  • Yoon, Kyu-Chul
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2007년도 한국음성과학회 공동학술대회 발표논문집
    • /
    • pp.234-239
    • /
    • 2007
  • The purpose of this paper is to examine the viability of simulating one dialect with the speech segments of another dialect through prosody cloning. The hypothesis is that, among Korean regional dialects, it is not the segmental differences but the prosodic differences that play a major role in authentic dialect perception. This work intends to support the hypothesis by simulating Masan dialect with the speech segments from Seoul dialect. The dialect simulation was performed by transplanting the prosodic features of Masan utterances unto the same utterances produced by a Seoul speaker. Thus, the simulated Masan utterances were composed of Seoul speech segments but their prosody came from the original Masan utterances. The prosodic features involved were the fundamental frequency contour, the segmental durations, and the intensity contour. The simulated Masan utterances were evaluated by four native Masan speakers and the role of prosody in dialect authentication and speech synthesis was discussed.

  • PDF

표준한국어 악센트의 실험음성학적 연구 -청취 테스트 및 음향분석- (The Experimental Phonetic Study of Word Accent in Standard Korean)

  • 성철재
    • 대한음성학회지:말소리
    • /
    • 제21_24호
    • /
    • pp.43-89
    • /
    • 1992
  • In this thesis, the prominent aspect of word accent in standard Korean is studied by auditory test and acoustic analysis experiment. The definition of 'accent' is, following Hoyoung Lee's discussion(1990), to be described as 'the means whereby a focused part of an utterance is made to stand out in order to concentrate the hearer's attention on it.' That is to say, the ten of 'accent' may be described in terms of phonological phenomenon and the accented syllable can be phonetically prominent as the result of those phonological process. Prosodic features may have different characteristics in different languages whether they contain linguistically important functions or not. Thus the characteristics of word accent in standard Korean will be determined as the content and trait of prosodic features. Following this viewpoint, present study looked over prosodic features which may effect the characteristics of word accent in standard Korean, through systematic experimental procedure. And the result of this experiment has been verified by statistical method, the T-test, for the purpose of identifying the relatedness among prosodic features(parameters). This thesis, therefore, aimed to investigate the intrinsic acoustic and physical qualities of the word accent in standard Korean. Nonsense words composed by 'mal' and 'ma' which can be divided into 'heavy syllable' and 'light syllable' quoted from Hyman(1975) have been classified into 28 types with respect to syllable numbers(2 syl., 3 sy1., 4 syl.) and these words have become the target of auditory test and acoustic experiment. As the result of those experimental Procedures, the word accent in standard Korean may be said that it has a tendency of fixing first two syllables regardless of syllable numbers. The syllable types of HH, HL, LL in the first two syllables may be prominent at first syllable and the type of H may be at second syllable. Various prosodic features(parameters) including duration, intensity, and Fo(purely phonetic terms) were also strengthened in those positions. The result of this experiment can be cleared up like these : 1. The most important feature is proved as 'duration', the feature of intensity resulted in more subsidiary one than the feature of duration. 2. Fo( fundamental frequency) could be observed as having some coherent contour through almost all syllable types(99 %), that is, in 2 syllable types, it had rising contour, in 2 syllable types, rising-falling contour, and in 4 syllable types, it contained rising-falling-rising contour. The result of auditory test was different with those contour forms of all Fo surveyed. With respect to these results, the discuss for Fo is determined' to be excluded comparing other features. 3. Finally, this thesis resulted in a decision that the word accent in standard Korean may has fixed(somewhat weaker) accent, especially fixed at first two syllables in almost all words. 4. Various kinds of syllable types related with 2,3,4 syllables, therefore, can be reclassified into 4 types of HH, HL, LH, LL following the concept of accent fixing placement(i.e. first two syllables). In these 4 types, the types of HH, HL, LL were prominent at the position of the first syllable , and the type of LH was prominent at the second syllable otherwise.

  • PDF

운율이식을 통해 나타난 감정인지 양상 연구 (A Study on the Perceptual Aspects of an Emotional Voice Using Prosody Transplantation)

  • 이서배
    • 대한음성학회지:말소리
    • /
    • 제62호
    • /
    • pp.19-32
    • /
    • 2007
  • This study investigated the perception of emotional voices by transplanting some or all of the prosodic aspects, i.e. pitch, duration, and intensity, of the utterances produced with emotional voices onto those with normal voices and vice versa. Listening evaluation by 24 raters revealed that prosodic effect was greater than segmental & vocal quality effect on the preception of the emotion. The degree of influence of prosody and that of segments & vocal quality varied according to the type of emotion. As for fear, prosodic elements had far greater influence than segmental & vocal quality elements whereas segmental and vocal elements had as much effect as prosody on the perception of happy voices. Different amount of contribution to the perception of emotion was found among prosodic features with the descending order of pitch, duration and intensity. As for the length of the utterances, the perception of emotion was more effective with long utterances than with short utterances.

  • PDF

Recognition of Emotion and Emotional Speech Based on Prosodic Processing

  • Kim, Sung-Ill
    • The Journal of the Acoustical Society of Korea
    • /
    • 제23권3E호
    • /
    • pp.85-90
    • /
    • 2004
  • This paper presents two kinds of new approaches, one of which is concerned with recognition of emotional speech such as anger, happiness, normal, sadness, or surprise. The other is concerned with emotion recognition in speech. For the proposed speech recognition system handling human speech with emotional states, total nine kinds of prosodic features were first extracted and then given to prosodic identifier. In evaluation, the recognition results on emotional speech showed that the rates using proposed method increased more greatly than the existing speech recognizer. For recognition of emotion, on the other hands, four kinds of prosodic parameters such as pitch, energy, and their derivatives were proposed, that were then trained by discrete duration continuous hidden Markov models(DDCHMM) for recognition. In this approach, the emotional models were adapted by specific speaker's speech, using maximum a posteriori(MAP) estimation. In evaluation, the recognition results on emotional states showed that the rates on the vocal emotions gradually increased with an increase of adaptation sample number.

자동 구두점 삽입을 이용한 Rich Transcription 생성 (Rich Transcription Generation Using Automatic Insertion of Punctuation Marks)

  • 김지환
    • 대한음성학회지:말소리
    • /
    • 제61호
    • /
    • pp.87-100
    • /
    • 2007
  • A punctuation generation system which combines prosodic information with acoustic and language model information is presented. Experiments have been conducted first for the reference text transcriptions. In these experiments, prosodic information was shown to be more useful than language model information. When these information sources are combined, an F-measure of up to 0.7830 was obtained for adding punctuation to a reference transcription. This method of punctuation generation can also be applied to the 1-best output of a speech recogniser. The 1-best output is first time aligned. Based on the time alignment information, prosodic features are generated. As in the approach applied in the punctuation generation for reference transcriptions, the best sequence of punctuation marks for this 1-best output is found using the prosodic feature model and an language model trained on texts which contain punctuation marks.

  • PDF

음성감정인식에서 음색 특성 및 영향 분석 (Analysis of Voice Quality Features and Their Contribution to Emotion Recognition)

  • 이정인;최정윤;강홍구
    • 방송공학회논문지
    • /
    • 제18권5호
    • /
    • pp.771-774
    • /
    • 2013
  • 본 연구는 감정상태와 음색특성의 관계를 확인하고, 추가로 cepstral 피쳐와 조합하여 감정인식을 진행하였다. Open quotient, harmonic-to-noise ratio, spectral tilt, spectral sharpness를 포함하는 특징들을 음색검출을 위해 적용하였고, 일반적으로 사용되는 피치와 에너지를 기반한 운율피쳐를 적용하였다. ANOVA분석을 통해 각 특징벡터의 유효성을 살펴보고, sequential forward selection 방법을 적용하여 최종 감정인식 성능을 분석하였다. 결과적으로, 제안된 피쳐들으로부터 성능이 향상되는 것을 확인하였고, 특히 화남과 기쁨에 대하여 에러가 줄어드는 것을 확인하였다. 또한 음색관련 피쳐들이 cepstral 피쳐와 결합할 경우 역시 인식 성능이 향상되었다.

자폐 범주성 장애아동과 정상아동의 평서문 읽기에서의 운율구 특성 비교 (A Comparative Study on the Characteristics of the Prosodic Phrases between Autism Spectrum Disorder and Normal Children in the Reading of Korean Read Sentences)

  • 정금수;성철재
    • 대한음성학회지:말소리
    • /
    • 제65호
    • /
    • pp.51-65
    • /
    • 2008
  • The aim of this study is to compare ASD (Autism Spectrum Disorder) children with normal children in terms of the prosodic features. Materials are collected by the reading of Korean read sentences. They are composed of 10 declarative sentences, each of which was consisted of 5-6 words. Subjects are consisted of 10 ASD and 10 normal male children with a receptive vocabulary age of 5;0-6;5 years. We found out that both groups showed the differences not only in the tonal patterns at the end of the prosodic phrases, but also in both the degree of rising and falling slope related to pitch contour. While HL% and HLH% were highly emerged in sentence final position in normal group, HL% and HLH% were prominent in ASD group in the same position. LH% and LHL% IP types were observed only in ASD group in sentence medial position. The slope showing the variation in the fundamental frequency at the end of the prosodic phrase was twice as steep in the group of ASD children as in the group of normal children.

  • PDF

음성인식.합성을 위한 한국어 운율단위 음운론의 계산적 연구:음운단위에 따른 경계의 발견 (A Computation Study of Prosodic Structures of Korean for Speech Recognition and Synthesis:Predicting Phonological Boundaries)

  • 이찬도
    • 한국정보처리학회논문지
    • /
    • 제4권1호
    • /
    • pp.280-287
    • /
    • 1997
  • 성공적인 음성인식·합성 시스템을 구축하기 위해서는 음운론적 지식, 특히 운율 정보의 도입이 매우 중요하다. 본 연구에서는 우선 음성인식·합성을 위한 운율음운 론의 연구동향을 개관하고, 국어의 음운단위와 경계의 설정에 관한 이론적·실험적 고찰을 정리하였으며, 음운단위에 따른 경계의 자동적 발견을 위하여, 데이터를 수집 하고 시스템을 구현하여 실험을 행하였다. 단순회귀 신경망을 이용하여, 2,200여 개 의 문장에 있는 12,000여개의 음운단어를 외부정보의 도움이 전혀 없이 훈련시킨 결 과, 70%정도의 예측률을 보였다. 본 연구에서 사용한 방법을 다른 정보와 결합하여 사용한다면, 음운경계의 발전과 그에 따른 분절화를 정확하게 행할 수 있으리라 기대 된다.

  • PDF