• Title/Summary/Keyword: New Words

Search Result 1,475, Processing Time 0.03 seconds

New Postprocessing Methods for Rejectin Out-of-Vocabulary Words

  • Song, Myung-Gyu
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.3E
    • /
    • pp.19-23
    • /
    • 1997
  • The goal of postprocessing in automatic speech recognition is to improve recognition performance by utterance verification at the output of recognition stage. It is focused on the effective rejection of out-of vocabulary words based on the confidence score of hypothesized candidate word. We present two methods for computing confidence scores. Both methods are based on the distance between each observation vector and the representative code vector, which is defined by the most likely code vector at each state. While the first method employs simple time normalization, the second one uses a normalization technique based on the concept of on-line garbage mode[1]. According to the speaker independent isolated words recognition experiment with discrete density HMM, the second method outperforms both the first one and conventional likelihood ratio scoring method[2].

  • PDF

Development of an algorithm for the control of prosodic factors to synthesize unlimited isolated words in the time domain (시간 영역에서의 무제한 고립어 합성을 위한 운율 요소 제어용 알고리즘 개발)

  • 강찬희
    • Journal of the Korean Institute of Telematics and Electronics C
    • /
    • v.35C no.7
    • /
    • pp.59-68
    • /
    • 1998
  • This paper is to develop an algorithm for the unlimited korean speech synthesis. We present the results controlled of prosodic factors with isolated words as aynthesis basis unit int he time domain. With a new pitch-synchronous and parametric speech synthesis mehtod in the time domain here we mainly present the results of controlled prosody factors such a spitch periods, energy envelops and durations and the evaluaton of synthetic speech qualities. In the case of synthesis, it is possible ot synthesize connected words by controlling of a continuous unified prosody that makes to improve the naturalities. In the results of experiment, it also has been to be improved uncontinuities of pitch and zeroing of energy in the junction parts of speech waveforms. Specially it has been to be possible to synthesize speeches with unlimitted durations and tones. So on it makes the noisiness and the clearness better by improving the degradation effects from the phase distortion due to the discontinuities in the waveform connection parts.

  • PDF

Intelligent Wordcloud Using Text Mining (텍스트 마이닝을 이용한 지능적 워드클라우드)

  • Kim, Yeongchang;Ji, Sangsu;Park, Dongseo;Lee, Choong Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2019.05a
    • /
    • pp.325-326
    • /
    • 2019
  • This paper proposes an intelligent word cloud by improving the existing method of representing word cloud by examining the frequency of nouns with text mining technique. In this paper, we propose a method to visually show word clouds focused on other parts, such as verbs, by effectively adding newly-coined words and the like to a dictionary that extracts noun words in text mining. In the experiment, the KoNLP package was used for extracting the frequency of existing nouns, and 80 new words that were not supported were added manually by examining frequency.

  • PDF

Korean Document Classification Using Extended Vector Space Model (확장된 벡터 공간 모델을 이용한 한국어 문서 분류 방안)

  • Lee, Samuel Sang-Kon
    • The KIPS Transactions:PartB
    • /
    • v.18B no.2
    • /
    • pp.93-108
    • /
    • 2011
  • We propose a extended vector space model by using ambiguous words and disambiguous words to improve the result of a Korean document classification method. In this paper we study the precision enhancement of vector space model and we propose a new axis that represents a weight value. Conventional classification methods without the weight value had some problems in vector comparison. We define a word which has same axis of the weight value as ambiguous word after calculating a mutual information value between a term and its classification field. We define a word which is disambiguous with ambiguous meaning as disambiguous word. We decide the strengthness of a disambiguous word among several words which is occurring ambiguous word and a same document. Finally, we proposed a new classification method based on extension of vector dimension with ambiguous and disambiguous words.

An Implementation of the Spam Mail Prevention System Using Reply Message with Secrete Words (비밀단어의 회신을 이용한 스팸메일 차단 시스템의 구현)

  • Ko Joo Young;Shim Jae Chang;Kim Hyun Ki
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.1
    • /
    • pp.111-118
    • /
    • 2005
  • This paper describes an implementation of the spam mail prevention system using reply message with secrete words. When user receives a new e-mail, the e-mail address is compared with the white e-mail addresses in database by the system. If user receives a new e-mail which does not exist in a white e-mail addresses database, a reply e-mail attached with secrete words is delivered automatically. And the system is compared with the white domains first for intranet environment. It speeds up processing time. proposed algorithm is required a small database and faster than the black e-mail addresses comparison. This system is implemented using procmail, PHP and IMAP on Linux and the user can manage the databases on the web.

  • PDF

Fast Algorithm for Recognition of Korean Isolated Words (한국어 고립단어인식을 위한 고속 알고리즘)

  • 남명우;박규홍;정상국;노승용
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.1
    • /
    • pp.50-55
    • /
    • 2001
  • This paper presents a korean isolated words recognition algorithm which used new endpoint detection method, auditory model, 2D-DCT and new distance measure. Advantages of the proposed algorithm are simple hardware construction and fast recognition time than conventional algorithms. For comparison with conventional algorithm, we used DTW method. At result, we got similar recognition rate for speaker dependent korean isolated words and better it for speaker independent korean isolated words. And recognition time of proposed algorithm was 200 times faster than DTW algorithm. Proposed algorithm had a good result in noise environments too.

  • PDF

Research on the Value of Korean Neologism Education and the Method of Building Data (한국어 신조어 교육의 가치와 자료 구축을 위한시론)

  • Kim, Deok-shin
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.1
    • /
    • pp.371-377
    • /
    • 2022
  • This study examines whether there are subjects and learners to pay attention to as 'processes' that have not been dealt with in Korean vocabulary education due to prioritizing learning outcomes, educational outcomes, and objects. In addition, the purpose of this study was to examine the educational value of the neologism and to suggest data construction method for it. Proposal to create a 'single-level list' of neologisms as a preliminary work to create a dictionary as a learning material to teach new words to academic purpose learners, taking neologism as the vocabulary in the blind spot and foreign academic purpose learners as learners in the blind spot stage. did The 'single-layered list' is to divide new words by period into coined words, meanings, culture, etc. and construct them as data. Through this study, we will help systematically teach Korean vocabulary by adding vocabulary to be learned as a 'process' to the results of Korean vocabulary education so far.

A Study on Hybrid System of Affordance-based Future Housing using Convergence Technology (컨버젼스 기술을 이용한 어포던스 기반 미래주거 공간의 하이브리드 구조에 관한 연구)

  • Kang, Min-Soo;Choo, Seung-Yeon
    • Proceeding of Spring/Autumn Annual Conference of KHA
    • /
    • 2009.04a
    • /
    • pp.95-100
    • /
    • 2009
  • In the coming 21st centuries, words of development of information communication technology among the key words being emerged as an important concern has been talked about frequently and ubiquitous environment that helps human living being networked with humans, objects and environments has been rapidly progressed, influencing significantly over the various fields as well as architectural area. And eventually in this architectural area, the space that is desired to be shown to and experienced by the people could be found in the creation of a space in a new form that has not been existed in this world by utilizing the information communication technology. The purpose of this study is to develop one-step advanced space from the existing space and to form a new paradigm of the future space by utilizing convergence technology and the psychology-based design principle of behavioral inducement called affordance.

  • PDF

A Study on the Types of Design Problem Solving by Analogical Thinking - Focused on the Analysis of Associated Words and Sketch - (유추적 사고에 의한 디자인 문제해결의 유형 - 연상된 단어와 스케치 분석을 중심으로 -)

  • Choi, Eun-Hee;Choi, Yoon-Ah
    • Korean Institute of Interior Design Journal
    • /
    • v.16 no.2 s.61
    • /
    • pp.63-70
    • /
    • 2007
  • Analogy in problem solving is similarity-based reasoning facilitated by verbal and visual operation. This similarity-based reasoning generally supports initial phase of idea search. Therefore, this study intends to infer the types of problem solving by tracing the analogy use of verbal and visual representation through a experimental research. According to the result of this research, the types of problem solving by analogy are classified into 'evolving', 'divergent', and 'poor conversion' type. Firstly, 'evolving type' is distinguished between 'combination type' associated different contents to develope a new design and 'transformation type' associated similar words and sketches to be continuously revised and developed. In these types usually structural analogy rather than surface analogy is used. Secondly, in 'divergent type' associated words or sketches are individually represented, and among them one design solution is selected. In this type usually surface analogy is used. Thirdly, in 'poor conversion type' interaction between verbal representation and visual representation does not go on smoothly, and the generation of idea is poor. In here surface analogy is mostly used. These findings could form the basis of skill development of idea generation and conversion in design education.

A study of Traditional Korean Medicine(TKM) term's Normalization for Enlarged Reference terminology model (참조용어(Reference Terminology) 모델 확장을 위한 한의학용어 정형화(Normalization) 연구)

  • Jeon, Byoung-Uk;Hong, Seong-Cheon
    • Journal of the Korean Institute of Oriental Medical Informatics
    • /
    • v.15 no.2
    • /
    • pp.1-6
    • /
    • 2009
  • The discipline of terminology is based on its own theoretical principles and consists primarily of the following aspects: analysing the concepts and concept structures used in a field or domain of activity, identifying the terms assigned to the concepts, in the case of bilingual or multilingual terminology, establishing correspondences between terms in the various languages, creating new terms, as required. The word properties has syntax, morphology and orthography. The syntax is that how words are put together. The morphology is consist of inflection, derivation, and compounding. The orthography is spelling. Otherwise, the terms of TKM(Traditional Korean Medicine) is two important element of visual character and phonetic notation. A visual character consist of spell, sort words, stop words, etc. For example, that is a case of sort words in which this '다한', '한다', '多汗', '汗多' as same. A phonetic notation consist of palatalization, initial law, etc. For example, that is a case of palatalization in which this '수족랭', '수족냉', '手足冷', '手足冷' as same. Therefore, to enlarged reference terminology is a method by term's normalization. For such a reason, TKM's terms of normalization is necessary.

  • PDF