• Title/Summary/Keyword: phonetic information

Comparison of feature parameters for emotion recognition using speech signal (음성 신호를 사용한 감정인식의 특징 파라메터 비교)

  • 김원구
    • Journal of the Institute of Electronics Engineers of Korea SP / v.40 no.5 / pp.371-377 / 2003
  • This paper compares feature parameters for emotion recognition from speech signals. A corpus of emotional speech, recorded and classified by emotion through subjective evaluation, was used to build statistical feature vectors (the average, standard deviation, and maximum value of pitch and energy) and phonetic features (MFCC parameters). To evaluate the feature parameters, a speaker- and context-independent emotion recognition system was constructed. In the experiments, pitch and energy parameters and their derivatives were used as prosodic information, and MFCC parameters and their derivatives were used as phonetic information. Results from a vector quantization based emotion recognition system showed that MFCC parameters and their derivatives performed better than the pitch and energy parameters.
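
The statistical prosodic features named in this abstract (average, standard deviation, and maximum of pitch and energy) can be illustrated with a minimal sketch. The function names, the use of a first-order difference as the "derivative", and the choice to summarize the derivative contours with the same three statistics are all assumptions made for illustration; the abstract does not specify them, and this is not the author's implementation.

```python
import statistics

def contour_stats(contour):
    """Return [mean, standard deviation, maximum] of a per-frame contour."""
    values = [float(v) for v in contour]
    return [statistics.fmean(values), statistics.pstdev(values), max(values)]

def delta(contour):
    """First-order difference, used here as an assumed stand-in for the derivative."""
    return [float(b) - float(a) for a, b in zip(contour, contour[1:])]

def prosodic_vector(pitch, energy):
    """Concatenate the statistics of pitch, energy, and their deltas (12 values)."""
    vector = []
    for contour in (pitch, energy, delta(pitch), delta(energy)):
        vector.extend(contour_stats(contour))
    return vector

if __name__ == "__main__":
    # Hypothetical per-frame values, not data from the paper.
    pitch = [100.0, 110.0, 120.0]
    energy = [0.2, 0.4, 0.9]
    print(prosodic_vector(pitch, energy))
```

Each contour needs at least two frames so that its delta is non-empty.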

A Research on the Format for Romanization of Korean Personal Name (한국인명의 로마자표기 형식에 대한 연구)

  • Kim, Sung-Won;Kim, Jeong-Woo
    • Journal of Information Management / v.43 no.2 / pp.199-222 / 2012
  • Growing international business and exchange have increased Koreans' need to present their personal identity to foreigners, and exchanging personal names is the first step. Korean names therefore frequently have to be transliterated phonetically into the Roman alphabet for people who cannot read the Korean alphabet. However, because Korean pronunciation can be rendered in the Roman (English) alphabet in many different ways, this conversion has caused confusion. This study examines the formats used for Romanizing Korean personal names in order to identify and suggest an optimal one. It first examines the structures of Korean and Western personal names, their differences, and their usage patterns; reviews the issues surrounding Romanization of Korean personal names; and categorizes the diverse Romanization formats currently in use. Based on these findings, the study suggests the format considered best for Romanizing Korean personal names.

Study on Efficient Generation of Dictionary for Korean Vocabulary Recognition (한국어 음성인식을 위한 효율적인 사전 구성에 관한 연구)

  • Lee Sang-Bok;Choi Dae-Lim;Kim Chong-Kyo
    • Proceedings of the KSPS conference / 2002.11a / pp.41-44 / 2002
  • This paper concerns improving the speech recognition rate with an enhanced pronunciation dictionary. Modern large vocabulary continuous speech recognition systems rely on pronunciation dictionaries. A pronunciation dictionary provides the pronunciation of each word in the vocabulary in phonemic units, which are modeled in detail by the acoustic models. In most speech recognition systems based on Hidden Markov Models, however, actual pronunciation variations are disregarded. Without these variations, the phonetic transcriptions in the dictionary do not match the actual occurrences in the database. In this paper, we propose adding an unvoiced-semivowel rule, as one of the allophone rules, to the pronunciation dictionary. In experiments, the speech recognition system performed better with the proposed dictionary than with existing pronunciation dictionaries.

A Study on Formants of Voice Imitation (모방발화의 모음 포만트 연구)

  • Ahn, Byoung-Seob;Shin, Ji-Young;Kang, Sun-Mee
    • Proceedings of the KSPS conference / 2004.05a / pp.209-213 / 2004
  • The aim of this paper is to analyze vowels in voice imitation and to find the invariant phonetic features of the speaker. We examined the formants of the vowels /a, u, i/. The results of the present study are as follows: (1) Speakers change their vocal tract cavity features. (2) F1 changes easily compared to F2 through F4. (3) F3-F2 appears to be a constituent feature for speaker identification in the vowel /a/, and F4-F2 in the vowel /i/.

Secure Blocking + Secure Matching = Secure Record Linkage

  • Karakasidis, Alexandros;Verykios, Vassilios S.
    • Journal of Computing Science and Engineering / v.5 no.3 / pp.223-235 / 2011
  • Performing approximate data matching has always been an intriguing problem for both industry and academia. The task becomes even more challenging when data privacy is also required. In this paper, we propose a novel technique for efficient privacy-preserving approximate record linkage. The secure framework we propose consists of two basic components. First, we utilize a secure blocking component based on phonetic algorithms, statistically enhanced to improve security. Second, we use a secure matching component in which the actual approximate matching is performed using a novel private version of the Levenshtein distance algorithm. Our goal is to combine the speed of private blocking with the increased accuracy of approximate secure matching.
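
The two-stage structure described in this abstract (phonetic blocking, then approximate matching with Levenshtein distance) can be sketched without any of the privacy machinery. The sketch below is a plain, non-secure illustration only: Soundex is an assumed choice of phonetic algorithm (the abstract does not name one), the `link` function and its `max_dist` threshold are hypothetical, and the statistical enhancement and private matching that make the authors' framework secure are omitted entirely.

```python
from collections import defaultdict

# American Soundex digit groups.
_CODES = {}
for _letters, _digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                         ("L", "4"), ("MN", "5"), ("R", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit

def soundex(name):
    """Return the 4-character Soundex code of a name ('' if it has no letters)."""
    letters = [c for c in name.upper() if c.isalpha()]
    if not letters:
        return ""
    out = [letters[0]]
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters with the same code
        code = _CODES.get(ch, "")
        if code and code != prev:
            out.append(code)
        prev = code  # a vowel resets prev, so a repeated code is kept
    return ("".join(out) + "000")[:4]

def levenshtein(a, b):
    """Edit distance with unit-cost insertions, deletions, and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]

def link(names_a, names_b, max_dist=2):
    """Block names_b by Soundex code, then match within blocks by edit distance."""
    blocks = defaultdict(list)
    for nb in names_b:
        blocks[soundex(nb)].append(nb)
    matches = []
    for na in names_a:
        for nb in blocks.get(soundex(na), []):
            if levenshtein(na.lower(), nb.lower()) <= max_dist:
                matches.append((na, nb))
    return matches

if __name__ == "__main__":
    print(link(["Robert", "Smith"], ["Rupert", "Smyth", "Jones"]))
```

Blocking restricts the quadratic edit-distance comparison to candidate pairs that share a phonetic code, which is where the speed of the combined approach comes from.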

Modified Phonetic Decision Tree For Continuous Speech Recognition

  • Kim, Sung-Ill;Kitazoe, Tetsuro;Chung, Hyun-Yeol
    • The Journal of the Acoustical Society of Korea / v.17 no.4E / pp.11-16 / 1998
  • For large vocabulary speech recognition using HMMs, context-dependent subword units have often been employed. However, context-dependent phone models result in a system with too many parameters to train. The problem of too many parameters and too little training data is crucial in the design of a statistical speech recognizer. Furthermore, when building large vocabulary speech recognition systems, the unseen triphone problem is unavoidable. In this paper, we propose a modified phonetic decision tree algorithm for the automatic prediction of unseen triphones, and show its advantages in solving these problems through two experiments in Japanese contexts. The baseline experimental results show that the modified tree-based clustering algorithm is effective for clustering and for reducing the number of states without any degradation in performance. The task experimental results show that the proposed algorithm also provides automatic prediction of unseen triphones.

Implementation of an Effective Rule Base System for the Change of Korean Vocal Sound (한국어 음운 변동 처리를 위한 효율적인 Rule Base System의 구성)

  • 이규영;이상범
    • Journal of the Korean Institute of Telematics and Electronics B / v.28B no.12 / pp.9-18 / 1991
  • In this paper, a rule-based method for handling Korean sound changes is proposed. The method can be used to bridge the gap between the written form (Hangul) and the spoken form (Korean pronunciation) in Korean speech processing research. The sound-change rules are rearranged for the rule base around final consonants, following the standard pronunciation rules. The proposed rule base system is simplified by this implementation of the sound changes. It is also useful for creating a database with phonetic values for syllable-unit Korean speech processing.
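
A single final-consonant rule of the kind this abstract refers to can be sketched as follows. The sketch relies on the Unicode arithmetic for precomposed Hangul syllables and applies only a simplified liaison rule (a single final consonant moves to the next syllable when that syllable begins with the silent initial ㅇ). The rule choice, the function names, and the table names are assumptions for illustration; this is not the rule base from the paper, and real pronunciation rules have exceptions (for example palatalization and word boundaries) that it ignores.

```python
HANGUL_BASE = 0xAC00          # code point of the first precomposed syllable
N_MEDIALS, N_FINALS = 21, 28
N_SYLLABLES = 19 * N_MEDIALS * N_FINALS

INITIALS = "ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ"
FINALS = [""] + list("ㄱㄲㄳㄴㄵㄶㄷㄹㄺㄻㄼㄽㄾㄿㅀㅁㅂㅄㅅㅆㅇㅈㅊㅋㅌㅍㅎ")
SILENT_INITIAL = INITIALS.index("ㅇ")
# Single consonants that can simply move into the next initial position.
MOVABLE = set(INITIALS) - {"ㅇ", "ㅎ"}

def decompose(ch):
    """Return (initial, medial, final) indices, or None for non-Hangul input."""
    offset = ord(ch) - HANGUL_BASE
    if not 0 <= offset < N_SYLLABLES:
        return None
    initial, rest = divmod(offset, N_MEDIALS * N_FINALS)
    medial, final = divmod(rest, N_FINALS)
    return initial, medial, final

def compose(initial, medial, final):
    """Inverse of decompose: build one precomposed syllable."""
    return chr(HANGUL_BASE + (initial * N_MEDIALS + medial) * N_FINALS + final)

def liaison(word):
    """Apply the simplified liaison rule left to right over a word."""
    units = [decompose(ch) for ch in word]
    for k in range(len(units) - 1):
        cur, nxt = units[k], units[k + 1]
        if cur is None or nxt is None:
            continue
        final = FINALS[cur[2]]
        if nxt[0] == SILENT_INITIAL and final in MOVABLE:
            units[k] = (cur[0], cur[1], 0)                      # drop the final
            units[k + 1] = (INITIALS.index(final), nxt[1], nxt[2])
    return "".join(ch if u is None else compose(*u) for u, ch in zip(units, word))

if __name__ == "__main__":
    print(liaison("음악"), liaison("한국어"))
```

Working on index triples rather than characters keeps each rule a small table lookup, which suits processing by syllable unit.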

A Study On the Realization of the Lexical Contrastive Focus and the Segmental Contrastive Focus (어휘 대조 초점과 음소 대조 초점 실현에 관한 음성학적 연구)

  • Kwak, Sook-young;Shin, Ji-young
    • Proceedings of the KSPS conference / 2005.11a / pp.179-184 / 2005
  • The aim of this paper is to analyze the phonetic features of lexical contrastive focus and segmental contrastive focus. Two variables were used to study the realization of contrastive focus. One is the three phonation types of Korean plosives: lenis, fortis, and aspirated. The other is the position within the word of the syllable carrying segmental contrastive focus. Pitch, duration, intensity, VOT, formants, and other measures were examined. The realization of focus differs according to the phonation type and the position of the focused syllable.

Segmentation and Labeling in Creation of Speech Corpus (음성 코퍼스 구축에서 분절과 레이블링의 문제)

  • Um Yongnam;Lee Yong-Ju
    • Proceedings of the KSPS conference / 2002.11a / pp.27-32 / 2002
  • This paper discusses what should be taken into consideration with respect to segmentation and labeling when creating a speech corpus: what levels of annotation and what kinds of content should be included, what kinds of acoustic information should be checked during segmentation, and so on.

Meta-data Standardization of Speech Database (음성 DB의 메타데이타 표준화)

  • Kim Sanghun
    • Proceedings of the KSPS conference / 2003.10a / pp.61-64 / 2003
  • In this paper, we introduce a new method for describing the annotation information of a speech database. As a structured description method, an XML-based description, standardized by the W3C, is applied to represent the metadata of the speech database. The description will be revised continuously through the speech technology standard forum during this year.
