• 제목/요약/키워드: Korean first phoneme

검색결과 47건 처리시간 0.023초

확장된 버로우즈-휠러 변환을 이용한 개선된 한글 초성 탐색 (Improved First-Phoneme Searches Using an Extended Burrows-Wheeler Transform)

  • 김성환;조환규
    • 정보과학회 컴퓨팅의 실제 논문지
    • /
    • 제20권12호
    • /
    • pp.682-687
    • /
    • 2014
  • 한글 초성 질의는 내비게이션 시스템이나 모바일 기기와 같이 입력 환경에 제약이 있어 오류가 빈번한 인터페이스 상에서 사용자 편의성 향상을 위하여 제공되는 중요한 기능이다. 본 논문에서는 한글 문자열을 자소 단위로 분해하여 재배열하여 환형 문자열로 변환한 후, 확장된 버로우즈-휠러 변환을 이용하여 색인함으로써 초성 질의 탐색을 위한 시공간 효율적인 자료구조를 제안한다. 또한 실험을 통하여 기존 기법에 비하여 더 적은 공간만을 사용하면서도 보다 다양한 형태의 질의를 처리할 수 있으며, 특히 질의어의 길이가 짧고, 초성의 비율이 높을수록 탐색 속도가 향상됨을 확인하였다.

청각장애 아동의 음운인식 능력과 단어확인 능력의 상관연구 (A Study of Correlation Between Phonological Awareness and Word Identification Ability of Hearing Impaired Children)

  • 김유경;김문정;안종복;석동일
    • 음성과학
    • /
    • 제13권3호
    • /
    • pp.155-167
    • /
    • 2006
  • Hearing impairment children possess poor underlying perceptual knowledge of the sound system and show delayed development of segmental organization of that system. The purpose of this study was to investigate the relationship between phonological awareness ability and word identification ability in hearing impaired children. 14 children with moderately severe hearing loss participated in this study. All tasks were individually administered. Phonological awareness tests consisted of syllable blending, syllable segmentation, syllable deletion, body-coda discrimination, phoneme blending, phoneme segmentation and phoneme deletion. Close-set Monosyllabic Words(12 items) and lists 1 and 2 of open-set Monosyllabic Words in EARS-K were examined for word identification. Results of this study were as follows: First, from the phonological awareness task, the close-set word identification showed a high positive correlation with the coda discrimination, phoneme blending and phoneme deletion. The open-set word identification showed a high positive correlation with phoneme blending, phoneme deletion and phoneme segmentation. Second, from the level of phonological awareness, the close-set word identification showed a high positive correlation with the level of body-coda awareness and phoneme awareness while the open-set word identification showed a high positive correlation only with the level of phoneme awareness.

  • PDF

유성음과 무성음의 경계를 이용한 연속 음성의 세그먼테이션 (Segmentation of continuous Korean Speech Based on Boundaries of Voiced and Unvoiced Sounds)

  • 유강주;신욱근
    • 한국정보처리학회논문지
    • /
    • 제7권7호
    • /
    • pp.2246-2253
    • /
    • 2000
  • In this paper, we show that one can enhance the performance of blind segmentation of phoneme boundaries by adopting the knowledge of Korean syllabic structure and the regions of voiced/unvoiced sounds. eh proposed method consists of three processes : the process to extract candidate phoneme boundaries, the process to detect boundaries of voiced/unvoiced sounds, and the process to select final phoneme boundaries. The candidate phoneme boudaries are extracted by clustering method based on similarity between two adjacent clusters. The employed similarity measure in this a process is the ratio of the probability density of adjacent clusters. To detect he boundaries of voiced/unvoiced sounds, we first compute the power density spectrum of speech signal in 0∼400 Hz frequency band. Then the points where this paper density spectrum variation is greater than the threshold are chosen as the boundaries of voiced/unvoiced sounds. The final phoneme boundaries consist of all the candidate phoneme boundaries in voiced region and limited number of candidate phoneme boundaries in unvoiced region. The experimental result showed about 40% decrease of insertion rate compared to the blind segmentation method we adopted.

  • PDF

초급 중국어 학습자를 위한 발음교육 개선방안 - 말하기 중심 발음 교수법 - (A Study of the Speaking-Centered Chinese Pronunciation Teaching Method for Basic Chinese Learners.)

  • 임승규
    • 비교문화연구
    • /
    • 제35권
    • /
    • pp.339-368
    • /
    • 2014
  • In Teaching Chinese as a Foreign Language, phoneme-based pronunciation teaching such as tone, consonants, vowels is the most common teaching methods. Based on main character of Chinese grammar: 'lack of morphological change' in a narrow sense, was proposed by Lv Shuxiang and Zhu Dexi, I designed 'Communicative oriented Chinese pronunciation teaching method'. This teaching method is composed of seven elements: one kind is the 'structural elements': phoneme, word, phrase, sentence; another kind is the 'functional elements': listening, speaking and translation. This pronunciation teaching method has four kinds of practice methods: 1) phoneme learning method; 2) word based pronunciation practice; 3) phrase based pronunciation practice; 4) sentence based pronunciation practice. When the teachers use these practice methods, they can use the dialogue and Korean-Chinese translation. In particular, when the teachers use 'phoneme learning method', they must use Korean and Chinese phonetic comparison results. When the teachers try to correct learner's errors, they must first consider the speech communication.

소음문장 제거를 위한 음소지속시간 사용 (The Usage of Phoneme Duration Information for Rejecting Garbage Sentences)

  • 구명완;김호경;박성준;김재인
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2003년도 5월 학술대회지
    • /
    • pp.219-222
    • /
    • 2003
  • In this paper, we study the usage of phoneme duration information for rejection garbage sentence. First, we build a phoneme duration modeling in a speech recognition system based on dicicion tree state tying, We assume that phone duration has a Gamma distribution. Next, we build a verification module in which word-level confidence measure is used. Finally, we make a comparative study on phoneme duration with speech DB obtained from the live system. This DB consistes of OOT(out-of-task) and ING(in-grammar) utterences. the usage of phone duration information yields that OOT recognition rate is improved by 46% and that another 8.4% error rate is reduced when combined with utterence verification module.

  • PDF

Support Vector Machine Based Phoneme Segmentation for Lip Synch Application

  • Lee, Kun-Young;Ko, Han-Seok
    • 음성과학
    • /
    • 제11권2호
    • /
    • pp.193-210
    • /
    • 2004
  • In this paper, we develop a real time lip-synch system that activates 2-D avatar's lip motion in synch with an incoming speech utterance. To realize the 'real time' operation of the system, we contain the processing time by invoking merge and split procedures performing coarse-to-fine phoneme classification. At each stage of phoneme classification, we apply the support vector machine (SVM) to reduce the computational load while retraining the desired accuracy. The coarse-to-fine phoneme classification is accomplished via two stages of feature extraction: first, each speech frame is acoustically analyzed for 3 classes of lip opening using Mel Frequency Cepstral Coefficients (MFCC) as a feature; secondly, each frame is further refined in classification for detailed lip shape using formant information. We implemented the system with 2-D lip animation that shows the effectiveness of the proposed two-stage procedure in accomplishing a real-time lip-synch task. It was observed that the method of using phoneme merging and SVM achieved about twice faster speed in recognition than the method employing the Hidden Markov Model (HMM). A typical latency time per a single frame observed for our method was in the order of 18.22 milliseconds while an HMM method applied under identical conditions resulted about 30.67 milliseconds.

  • PDF

한국어 음성 인식에서 변동성과 벌크 지표에 기반한 음소 경계 검출 (Phoneme Segmentation based on Volatility and Bulk Indicators in Korean Speech Recognition)

  • 이재원
    • 정보과학회 컴퓨팅의 실제 논문지
    • /
    • 제21권10호
    • /
    • pp.631-638
    • /
    • 2015
  • 최근 모바일 환경에서 작동 가능한 음성 인식 시스템에 대한 수요가 급격히 증대되고 있다. 본 논문은 음소 기반 한국어 음성 인식 시스템에 적용하기 위한 새로운 한국어 음소 경계 검출 방안을 제안한다. 먼저 입력 신호는 동일한 크기의 블록들을 구성한다. 제안하는 방식은 입력 음성 신호의 각 블록에 대해 계산되는 변동성 지표와, 부호가 동일한 인접 샘플들의 집합인, 블록 내의 각 벌크에 대해 계산되는 벌크 지표를 음소 경계 검출의 기반 지표로 사용한다. 두 가지 기반 지표를 결합하여 활용하는 세 개의 전용 인식 알고리즘을 사용하여, 모음, 유성 자음, 그리고 무성 자음을 차례로 인식하여 음소 간 경계를 검출한다. 실험 결과를 통해, 제안하는 방식을 사용함으로써 기존의 경계 검출 방식에 비해 오류율을 현저히 감소시킬 수 있음을 확인하였다.

접촉점에서의 국소 그래프 패턴에 의한 필기체 한글의 자소분리에 관한 연구 (A Study on the Phoneme Segmentation of Handwritten Korean Characters by Local Graph Patterns on Contacting Points)

  • 최필웅;이기영;구하성;고형화
    • 전자공학회논문지B
    • /
    • 제30B권4호
    • /
    • pp.1-10
    • /
    • 1993
  • In this paper, a new method of phoneme segmentation of handwritten Korean characters using the local graph pattern is proposed. At first, thinning was performed before extracting features. End-point, inflexion-point, branch-point and cross-point were extracted as features. Using these features and the angular relations between these features, local graph pattern was made. When local graph pattern is made, the of strokes is investigated on contacting point. From this process, pattern is simplified as contacting pattern of the basic form and the contacting form we must take into account can be restricted within fixed region, 4therefore phoneme segmentation not influenced by characters form and any other contact in a single character is performed as matching this local graph pattern with base patterns searched ahead. This experiments with 540 characters have been conducted. From the result of this experiment, it is shown that phoneme segmentation is independent of characters form and other contact in a single character to obtain a correct segmentation rate of 95%, manages it efficiently to reduce the time spent in lock operation when the lock.

  • PDF

악리론으로 본 정음창제와 정음소 분절 알고리즘 (Ortho-phonic Alphabet Creation by the Musical Theory and its Segmental Algorithm)

  • 진용옥;안정근
    • 음성과학
    • /
    • 제8권2호
    • /
    • pp.49-59
    • /
    • 2001
  • The phoneme segmentation is a very difficult problem in speech sound processing because it has found out segmental algorithm in many kinds of allophone and coarticulation's trees. Thus system configuration for the speech recognition and voice retrieval processing has a complex system structure. To solve it, we discuss a possibility of new segmental algorithm, which is called the minus a thirds one or plus in tripartitioning(삼분손익) of twelve temporament(12 율려), first proposed by Prof. T. S. Han. It is close to oriental and western musical theory. He also has suggested a 3 consonant and 3 vowel phonemes in Hunminjungum(훈민정음) invented by the King Sejong in the 15th century. In this paper, we suggest to newly name it as ortho-phonic phoneme(OPP/정음소), which carries the meaning of 'the absoluteness and independency'. OPP also is acceptable to any other languages, for example IPA. Lastly we know that this algorithm is constantly applicable to the global language and is very useful to construct a voice recognition and retrieval structuring engineering.

  • PDF

음성 질의 기반 디지털 사진 검색 기법 (A Query-by-Speech Scheme for Photo Albuming)

  • 김태성;서영주;이용주;김회린
    • 대한음성학회지:말소리
    • /
    • 제57호
    • /
    • pp.99-112
    • /
    • 2006
  • In this paper, we introduce two retrieval methods for photos with speech documents. We compare the pattern of speech query with those of speech documents recorded in digital cameras, and measure the similarities, and retrieve photos corresponding to the speech documents which have high similarity scores. As the first approach, a phoneme recognition scheme is used as the pre-processor for the pattern matching, and in the second one, the vector quantization (VQ) and the dynamic time warping (DTW) are applied to match the speech query with the documents in signal domain itself. Experimental results show that the performance of the first approach is highly dependent on that of phoneme recognition while the processing time is short. The second method provides a great improvement of performance. While the processing time is longer than that of the first method due to DTW, but we can reduce it by taking approximated methods.

  • PDF