• Title/Abstract/Keywords: phonemes

226 search results (processing time: 0.018 s)

A STUDY ON THE SIMULATED ANNEALING OF SELF ORGANIZED MAP ALGORITHM FOR KOREAN PHONEME RECOGNITION

  • Kang, Myung-Kwang;Ann, Tae-Ock;Kim, Lee-Hyung;Kim, Soon-Hyob
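    • Acoustical Society of Korea: Conference Proceedings / Proceedings of the 11th Workshop on Speech Communication and Signal Processing, Acoustical Society of Korea, 1994 (SCAS Vol. 11, No. 1) / pp.407-410 / 1994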
  • In this paper, we describe a new unsupervised learning algorithm, SASOM. It addresses a defect of the conventional SOM, in which the network state cannot converge to the minimum point. The proposed algorithm uses an objective function that evaluates the network state during learning, and it adjusts the learning rate flexibly according to that evaluation. Using the objective function and the learning rate, we apply simulated annealing to the conventional network so that the network state can converge to the global minimum. Using two-dimensional input vectors with a uniform distribution, we graphically compare the ordering ability of SOM with that of SASOM. We also carried out recognition experiments with the new algorithm on all Korean phonemes and some continuous speech.

음성 에너지계산에서 창함수-길이 변화영향의 개선에 관한 연구 (On Improving the Effects of Varying the Window Length on Speech Energy Computation)

  • 배명진;안수길
    • The Journal of the Acoustical Society of Korea / Vol. 9, No. 2 / pp.34-41 / 1990
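  • In the preprocessing of speech signals, the energy parameter is widely used because it reflects how phonemes change over time. However, because a window function is applied during extraction, the result is affected by the window length. This paper measures the effect of the window length and proposes a new energy extraction method that minimizes it. Because the energy contour extracted with this method is free of the window-length effect, it represents the changing characteristics of phonemes well. In addition, the computation requires only one subtraction, one addition, and two comparisons per sample.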
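
To make the window-length dependence concrete, the following is a minimal sketch of the conventional short-time energy with a rectangular window, not the extraction method proposed in the paper (which the abstract does not specify). The function name `short_time_energy` and the running-sum formulation are assumptions chosen for illustration.

```python
def short_time_energy(signal, window_length):
    """Sum of squared samples inside a sliding rectangular window.

    Illustrative baseline only: this is the conventional measure whose
    value depends on the window length, not the paper's proposed method.
    """
    if window_length <= 0 or window_length > len(signal):
        raise ValueError("window_length must be between 1 and len(signal)")
    squares = [x * x for x in signal]
    # Energy of the first full window.
    energy = sum(squares[:window_length])
    energies = [energy]
    # Slide by one sample: add the newest square, subtract the oldest.
    for n in range(window_length, len(squares)):
        energy += squares[n] - squares[n - window_length]
        energies.append(energy)
    return energies


print(short_time_energy([1, 2, 3, 4], 2))  # [5, 13, 25]
```

On a constant signal the raw value grows in proportion to the window length, which is the kind of dependence the abstract sets out to remove.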

유성음 구간 검출을 위한 간단한 알고리즘에 관한 연구 (A Study on the Simple Algorithm for Discrimination of Voiced Sounds)

  • 장규철;우수영;박용규;유창동
    • The Journal of the Acoustical Society of Korea / Vol. 21, No. 8 / pp.727-734 / 2002
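  • This paper proposes a simple algorithm for detecting voiced and unvoiced segments. The proposed method uses three parameters: low-band energy and zero-crossing rate, which complement the periodicity characteristics of voiced and unvoiced speech, and pitch variation, which is used to judge the stability of the periodicity. Voiced/unvoiced segment detection is approached as phoneme-level detection, yielding detection rates for phoneme groups and for the phonemes within each group. In experiments using the TIMIT corpus as the database, the detection rate for voiced phonemes improved by about 13%.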
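
As a rough illustration of two of the parameters named above, the sketch below computes a zero-crossing rate and a crude low-band energy for one frame and combines them with fixed thresholds. It is not the authors' algorithm: the moving-average low-pass filter, the threshold values, and all function names are assumptions, and the pitch-variation parameter is omitted.

```python
import math


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def low_band_energy(frame, taps=4):
    """Mean square of the frame after a moving-average low-pass (assumed filter)."""
    if taps > len(frame):
        raise ValueError("frame is shorter than the filter length")
    smoothed = [sum(frame[i:i + taps]) / taps for i in range(len(frame) - taps + 1)]
    return sum(s * s for s in smoothed) / len(smoothed)


def is_voiced(frame, energy_threshold=0.01, zcr_threshold=0.25):
    """Hypothetical decision rule: strong low-band energy and few zero crossings."""
    return (low_band_energy(frame) > energy_threshold
            and zero_crossing_rate(frame) < zcr_threshold)


# A 100 Hz sine sampled at 8 kHz stands in for a voiced frame;
# a rapidly alternating sequence stands in for an unvoiced one.
voiced_like = [0.5 * math.sin(2 * math.pi * 100 * n / 8000) for n in range(240)]
unvoiced_like = [0.1 if n % 2 == 0 else -0.1 for n in range(240)]
print(is_voiced(voiced_like), is_voiced(unvoiced_like))
```

In practice the thresholds would be tuned on labeled data such as the TIMIT corpus mentioned in the abstract.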

동화를 이용한 음운인식활동이 저소득층 초등 방과후 교실 1, 2 학년 아동의 읽기, 학습동기 및 자아개념에 미치는 영향 (Phonological Awareness Activities Using Story Books : Effects on Reading, Self-Concept, and Learning Motivation in an After-School Program for 1st and 2nd Grade Low Income Children)

  • 이지현;김유정;이정아
    • Korean Journal of Child Studies / Vol. 27, No. 5 / pp.123-141 / 2006
  • The phonemic awareness program consisted of 45 activities, built around a storybook, that emphasized various speech sounds and letter names. The subjects were thirty 1st and 2nd grade low-income children (15 in the experimental group and 15 in the control group) attending an after-school program in Seoul. Pre- and post-tests assessed the children's reading, self-concept, and learning motivation. Over a 12-week period, the experimental group had rich opportunities to work with and discuss sounds, syllables, phonemes, and the names of the Korean alphabet letters during storybook reading, games, and play, while the control group was given worksheets, subject tutoring, and homework guidance. Results showed that the phonemic activities were an effective and useful way to enhance children's reading ability, self-concept, and learning motivation.

코퍼스기반 음성합성기의 데이터베이스 최적화 방안 (An Optimization of Speech Database in Corpus-based speech synthesis system)

  • 장경애;정민화
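    • Korean Society of Phonetic Sciences and Speech Technology: Conference Proceedings / Proceedings of the Korean Society of Phonetic Sciences and Speech Technology Conference, November 2002 / pp.209-213 / 2002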
  • This paper describes how to reduce the database (DB) of a corpus-based speech synthesizer for Korean without degrading speech quality. First, we propose that the frequency of every unit in the reduced DB should reflect the frequency of that unit in the Korean language. The target population of every unit is therefore set to be proportional to its frequency in a large Korean corpus (780K sentences, 45 million phonemes). Second, instances that are used frequently during synthesis should be retained in the reduced DB. Finally, we propose that the frequency of every instance should be reflected in the clustering criterion and used as a criterion for selecting representative instances. Evaluation shows that the proposed methods yield better quality than conventional methods.
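
The proportional target population described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' procedure: the function name `target_counts`, the rounding rule, and the floor of one instance per unit are all hypothetical choices.

```python
def target_counts(unit_frequencies, reduced_size):
    """Allocate a reduced DB size across units in proportion to corpus frequency.

    unit_frequencies maps a unit label to its count in the large corpus.
    Every unit keeps at least one instance (an assumption, so that rare
    units do not vanish from the reduced DB).
    """
    total = sum(unit_frequencies.values())
    if total == 0:
        raise ValueError("unit_frequencies must contain at least one occurrence")
    return {
        unit: max(1, round(reduced_size * count / total))
        for unit, count in unit_frequencies.items()
    }


# Hypothetical corpus counts for three units.
print(target_counts({"a": 600, "n": 300, "k": 100}, 100))
```

Because of rounding and the one-instance floor, the allocated counts may not sum exactly to `reduced_size`; a real system would need a rule for distributing the remainder.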

An Implementation of Speaker Verification System Based on Continuants and Multilayer Perceptrons

  • Lee, Tae-Seung;Park, Sung-Won;Lim, Sang-Seok;Hwang, Byong-Won
    • Korean Institute of Intelligent Systems: Conference Proceedings / Korea Fuzzy Logic and Intelligent Systems Society, ISIS 2003 / pp.216-219 / 2003
  • Among the techniques that protect private information by adopting biometrics, speaker verification is expected to be widely used because it is convenient to use and inexpensive to implement. Speaker verification should achieve a high degree of reliability in verification, flexibility in speech text usage, and efficiency in verification system complexity. Continuants have excellent speaker-discriminant power and comprise a modest number of phonemes, and multilayer perceptrons (MLPs) have superior recognition ability and fast operation speed. Consequently, the two provide viable ways for a speaker verification system to obtain the above properties. This paper implements a system to which continuants and MLPs are applied and evaluates it using a Korean speech database. The experimental results show that continuants and MLPs enable the system to acquire the three properties.

Phonetic Transcription Rules and Quantitative Analysis of Phoneme Distribution in French

  • Bae, Hee-Sook;Yun, Young-Sun;Oh, Yung-Hwan
    • Speech Sciences / Vol. 9, No. 1 / pp.149-171 / 2002
  • After establishing rules for phonetic transcription in French, we perform a quantitative analysis of a given text, Waiting for Godot. Analyzing a text through its phoneme distribution is of particular interest from a phonostylistic point of view. Because phonetic transcription rules are useful for automating this analysis, the rules are carefully established in this paper. From the results of the phonetic transcription, we investigate the distribution of individual phonemes and of different phoneme groups in the dialogues and the stage directions for the various characters.

일본어 /p/의 청각인상 연구 (Auditory Images of Japanese /p/ by Koreans)

  • 이재강
    • Speech Sciences / Vol. 11, No. 3 / pp.83-93 / 2004
  • The objectives of this study are to analyze Korean speakers' pronunciations of various Japanese /p/ patterns and to provide desirable pronunciation models. This is part of ongoing research that aims to propose a useful method of teaching the Japanese pronunciation of /p/ to Koreans. The experimental data consist of /p/ phonemes in word-initial, word-medial, and 'yoon' positions. In Japanese, a yoon is written in small size after a letter and forms a syllable only together with the preceding letter. There were 22 different phoneme positions. They were pronounced by 48 students majoring in Japanese (24 females and 24 males), who were in their twenties and were raised in Daejeon and its vicinity. The individual pronunciations were collected and digitized into 528 files. The results show that Koreans pronounced the Japanese phoneme /p/ in a variety of ways, according to the auditory environments in which the phoneme was tested: as [ph] in word-initial position, [pp] or [ph] in word-medial position, and [ph] in 'yoon', unlike native speakers, who pronounced Japanese /p/ as [ph] in word-initial position, [pp] in word-medial position, and [pp] or [ph] in 'yoon'.

뇌성마비로 인한 마비말장애 성인의 자음 오류 분석 (Consonant Confusions Matrices in Adults with Dysarthria Associated with Cerebral Palsy)

  • 이영미;성지은;심현섭
    • Phonetics and Speech Sciences / Vol. 5, No. 1 / pp.47-54 / 2013
  • The aim of this study was to analyze consonant articulation errors produced by 90 speakers with cerebral palsy (CP). Phonetic transcriptions were made for 37 single-word utterances containing 70 phonemes: 48 initial consonants and 22 final consonants. Errors of substitution, omission, and distortion were analyzed using a confusion matrix paradigm that visualizes error patterns. Results showed that substitution errors were the most frequent in both initial and final consonants, followed by omission and distortion. Consonant omission occurred more frequently in final consonants. In both initial and final consonants, within-place errors were more prominent than within-manner errors. The current results suggest that consonant confusion matrices for dysarthric speech may provide useful information for evaluating speech intelligibility and for developing automatic speech recognition systems for adults with CP-associated dysarthria.
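
A confusion matrix of the kind described above can be sketched as a count of (target, produced) consonant pairs. This is a minimal illustration, not the study's analysis pipeline: the function names, the `OMITTED` marker, and the sample pairs are hypothetical, and distortion errors are not modeled because they would need a separate transcription label.

```python
from collections import Counter

OMITTED = None  # hypothetical marker for a consonant that was not produced


def build_confusion_matrix(pairs):
    """Count how often each target consonant was realized as each produced one."""
    return Counter(pairs)


def error_summary(matrix):
    """Collapse the matrix into correct, substitution, and omission counts."""
    summary = Counter()
    for (target, produced), count in matrix.items():
        if produced == target:
            summary["correct"] += count
        elif produced is OMITTED:
            summary["omission"] += count
        else:
            summary["substitution"] += count
    return summary


# Hypothetical transcription results as (target, produced) pairs.
pairs = [("s", "s"), ("s", "t"), ("s", "t"), ("k", "k"), ("k", OMITTED)]
matrix = build_confusion_matrix(pairs)
print(matrix[("s", "t")], dict(error_summary(matrix)))
```

Keeping the full pair counts, rather than only the summary, is what allows error patterns such as within-place or within-manner confusions to be examined afterwards.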

BMS 알고리즘을 이용한 핵심어 검출기 거절기능 성능 향상 실험 (Improvement of Confidence Measure Performance in Keyword Spotting using Background Model Set Algorithm)

  • 김병돈;김진영;최승호
    • Journal of the Korean Society of Phonetic Sciences and Speech Technology: Malsori / No. 46 / pp.103-115 / 2003
  • In this paper, we propose applying the Background Model Set (BMS) algorithm, which is used in speaker verification, to improve the calculation of the confidence measure (CM) in speech recognition. The CM expresses the relative likelihood between the recognized models and the antiphone models. In the previous method of calculating the CM, the probability and standard deviation were computed using all phonemes that make up the antiphone models. In this process, the antiphone CM degraded the recognition results and increased the recognition time. To solve this problem, we studied a method that recomputes the mean and standard deviation in the CM calculation using the BMS algorithm.
