• Title/Summary/Keyword: vocabulary size

1-Pass Semi-Dynamic Network Decoding Using a Subnetwork-Based Representation for Large Vocabulary Continuous Speech Recognition (대어휘 연속음성인식을 위한 서브네트워크 기반의 1-패스 세미다이나믹 네트워크 디코딩)

  • Chung Minhwa;Ahn Dong-Hoon
    • MALSORI / no.50 / pp.51-69 / 2004
  • In this paper, we present a one-pass semi-dynamic network decoding framework that combines the fast decoding speed of static network decoders with the memory efficiency of dynamic network decoders. Our method is based on a novel language model network representation that is essentially a finite state machine (FSM). The static network derived from the language model network [1][2] is partitioned into smaller subnetworks that are static by nature or self-structured. The whole network is managed dynamically so that the subnetworks required for decoding are cached in memory. The network is near-minimized by applying the tail-sharing algorithm. Our decoder is evaluated on a 25k-word Korean broadcast news transcription task. The tail-sharing algorithm reduces the search network itself by 73.4%. Compared with the equivalent static network decoder, the semi-dynamic network decoder increases decoding time by at most 6%, while it can be flexibly adapted to various memory configurations, with a minimum memory usage of 37.6% of the complete network size.
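
As a rough illustration of the caching idea in this abstract, the sketch below keeps a bounded number of subnetworks in memory and loads the rest on demand. The least-recently-used eviction policy, the `SubnetworkCache` class, and the `load_fn` callback are assumptions made for illustration; the abstract does not say how the authors' decoder chooses which subnetworks to keep.

```python
from collections import OrderedDict


class SubnetworkCache:
    """Hypothetical LRU cache for search subnetworks (illustrative only)."""

    def __init__(self, load_fn, capacity):
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self._load = load_fn          # assumed callback: subnet_id -> subnetwork
        self._capacity = capacity     # maximum number of subnetworks in memory
        self._cache = OrderedDict()   # ordered from least to most recently used
        self.loads = 0                # number of on-demand loads performed

    def get(self, subnet_id):
        """Return the subnetwork, loading it and evicting the oldest if needed."""
        if subnet_id in self._cache:
            self._cache.move_to_end(subnet_id)  # mark as most recently used
            return self._cache[subnet_id]
        subnet = self._load(subnet_id)
        self.loads += 1
        self._cache[subnet_id] = subnet
        if len(self._cache) > self._capacity:
            self._cache.popitem(last=False)     # evict least recently used
        return subnet

    def resident_ids(self):
        """Ids of the subnetworks currently held in memory."""
        return list(self._cache)
```

A smaller `capacity` trades extra loads for lower memory use, which is the kind of trade-off between decoding time and memory configuration that the abstract reports.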

A Collocational Analysis of Korean High School English Textbooks and Suggestions for Collocation Instruction

  • Kim, Nahk-Bohk
    • English Language & Literature Teaching / v.10 no.3 / pp.41-66 / 2004
  • Under the textbook-driven approach to English education in the Korean setting, the importance of the English textbook as the main source of learning materials cannot be overemphasized. Recently, with the development of computer-based language corpora, recognition of the importance of collocations and the availability of computerized word databases have revived and facilitated collocation instruction. The primary purpose of the present study is to identify the characteristics of lexical collocation and the extent of its use in high school 10th-grade textbooks. The analyses reveal that the language materials reflect variously constructed adjective+noun and noun+noun collocations in natural contexts. However, verb+noun and adverb+verb collocations are not fully reflected. The same is true of delexicalized verbs and of verb and adjective intensifiers. The language materials also do not provide sufficient support for the lexical syllabus, even though all the textbooks may be somewhat adequate in terms of vocabulary size. Finally, based on the analyses of the texts, suggestions for English collocation instruction are made within the lexical approach.

Monophone and Biphone Compound Unit for Korean Vocabulary Speech Recognition (한국어 어휘 인식을 위한 혼합형 음성 인식 단위)

  • 이기정;이상운;홍재근
    • Journal of the Korea Computer Industry Society / v.2 no.6 / pp.867-874 / 2001
  • In this paper, considering the pronunciation characteristics of Korean, we suggest recognition units that can shorten the recognition time and reflect the coarticulation effect at the same time. These units are composed of monophone and biphone units. Monophone units are applied to vowels, which have stable characteristics. Biphone units are used for consonants, which vary according to the adjacent vowel. In a word recognition experiment on the PBW445 database, the compound units give comparable recognition accuracy with a 57% speed-up compared with triphone units, and better recognition accuracy at similar speed. In addition, the memory size can be reduced because there are fewer units.

Comparative Analysis of Current Science Textbooks on Category (중학교 과학 교과서의 범주별 분석 비교)

  • Koo, Soo-Jeong;Choi, Don-Hyung
    • Journal of The Korean Association For Science Education / v.12 no.2 / pp.97-107 / 1992
  • In this study, we quantitatively analyzed 5 science textbooks currently used for 7th graders, using the science textbook rating system of Collette and Chiappetta (1986) and a meta-analysis of the results from 17 graduate students of Seoul National University. The rating system consists of 11 categories, each with detailed items: content, organization, reading level, instruction approach, illustrations, end-chapter teaching aids, laboratory activities in text and/or accompanying manual, teacher aids, indices and glossaries and mechanical makeup of text. Each item in the checklist is given between one and five points, and the total number of possible points in this rating system is 290. The 5 science textbooks currently used for 7th-grade students were all rated "poor" in terms of total points and showed largely uniform results, especially on 10 items: 7 items received low points (moral and ethical implications of science, vocabulary lists, accompanying laboratory manual, annotated editions for test, supply list for laboratory program, student workbook, and glossary), while 3 items received high points (facilities needed for laboratory activities, activities relevant to the content, and textbook size). Science teachers could gain a broad view and a correct impression of the books' usefulness when evaluating available textbooks.

A Study on Type and Spatial Sense of Contemporary Architecture Integrated Structure and Skin - Focused on Contemporary Architecture case after 2000 years - (구조와 표피가 일체화된 현대건축의 유형과 공간감에 관한 연구 - 2000년 이후 건축사례를 중심으로 -)

  • Lee, Sang-Ho;Ban, Ja-Yuen
    • Korean Institute of Interior Design Journal / v.26 no.1 / pp.83-90 / 2017
  • The purpose of this study is to investigate the possibilities of architectural planning and expression arising from the relationship between structure and skin in contemporary architecture. For this purpose, we showed images of interior spaces in architecture that integrates structure and skin to students and experts in related majors, had them mark their impressions on a questionnaire composed of spatial expression vocabulary extracted through a literature study on spatial sensibility, and analyzed the data. As a result, in contemporary architecture where the structure and the skin are integrated, form elements have a stronger influence on the formation of spatial sense than elements of light and size, and aesthetics, characteristic, and temporality are common to the interior spaces. Three of the four types showed unique characteristics, and it was confirmed that there is a causal relationship between the spatial feeling factors and the spatial feeling. This means that the relationship between the structure and the skin can be considered as a planning factor, and this study is expected to be used as basic data for that purpose.

A Study on Pseudo N-gram Language Models for Speech Recognition (음성인식을 위한 의사(疑似) N-gram 언어모델에 관한 연구)

  • 오세진;황철준;김범국;정호열;정현열
    • Journal of the Institute of Convergence Signal Processing / v.2 no.3 / pp.16-23 / 2001
  • In this paper, we propose pseudo n-gram language models for speech recognition with a medium-sized vocabulary, in contrast to large-vocabulary speech recognition using statistical n-gram language models. The proposed method is very simple: it follows the standard ARPA structure and sets the word probabilities arbitrarily. First, the 1-gram sets the word occurrence probability to 1 (log likelihood 0.0). Second, the 2-gram also sets the word occurrence probability to 1, and can only connect the word start symbol and WORD, and WORD and the word end symbol. Finally, the 3-gram also sets the word occurrence probability to 1, and can only connect the word start symbol, WORD, and the word end symbol. To verify the effectiveness of the proposed method, word recognition experiments were carried out. The preliminary (off-line) experimental results show an average word accuracy of 97.7% for 452 words uttered by 3 male speakers. The on-line word recognition results show an average word accuracy of 92.5% for 20 words uttered by 20 male speakers on a 1,500-word stock-name task. These experiments verify the effectiveness of the pseudo n-gram language models for speech recognition.
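
The construction described in this abstract can be sketched as a small generator for an ARPA-style file in which every log probability is 0.0. This is a minimal sketch, not the authors' tool: the function name `pseudo_ngram_arpa`, the `<s>` and `</s>` tokens (the usual ARPA convention, since the symbols are missing from the abstract text), and the choice to write back-off weights of 0.0 are all assumptions.

```python
def pseudo_ngram_arpa(words, bos="<s>", eos="</s>"):
    """Build ARPA-format text for a pseudo 3-gram model over isolated words.

    Every log10 probability is 0.0 (probability 1), as the abstract describes.
    The bos/eos tokens and the 0.0 back-off weights are assumptions.
    """
    words = list(words)
    unigrams = [bos, eos] + words
    # 2-grams only connect <s> WORD and WORD </s>.
    bigrams = [(bos, w) for w in words] + [(w, eos) for w in words]
    # 3-grams only connect <s> WORD </s>.
    trigrams = [(bos, w, eos) for w in words]

    lines = [
        "\\data\\",
        "ngram 1=%d" % len(unigrams),
        "ngram 2=%d" % len(bigrams),
        "ngram 3=%d" % len(trigrams),
        "",
        "\\1-grams:",
    ]
    lines += ["0.0\t%s\t0.0" % w for w in unigrams]
    lines += ["", "\\2-grams:"]
    lines += ["0.0\t%s\t0.0" % " ".join(bg) for bg in bigrams]
    lines += ["", "\\3-grams:"]
    # The highest-order n-grams carry no back-off weight.
    lines += ["0.0\t%s" % " ".join(tg) for tg in trigrams]
    lines += ["", "\\end\\", ""]
    return "\n".join(lines)
```

Calling `pseudo_ngram_arpa(["SAMSUNG", "HYUNDAI"])` (two made-up vocabulary entries) yields a file with 4 unigrams, 4 bigrams, and 2 trigrams, so the only complete word sequences the model allows are single words between the start and end symbols.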

A Study on the Korean Broadcasting Speech Recognition (한국어 방송 음성 인식에 관한 연구)

  • 김석동;송도선;이행세
    • The Journal of the Acoustical Society of Korea / v.18 no.1 / pp.53-60 / 1999
  • This paper is a study on Korean broadcasting speech recognition. Here we present methods for large vocabulary continuous speech recognition. Our main concerns are the language modeling and the search algorithm. The acoustic model used is the uni-phone semi-continuous hidden Markov model, and the linguistic model used is the N-gram model. The search algorithm consists of three phases in order to utilize all available acoustic and linguistic information. First, we use the forward Viterbi beam search to find word end frames and to estimate related scores. Second, we use the backward Viterbi beam search to find word begin frames and to estimate related scores. Finally, we use A* search to combine the above two results with the N-gram language model and to obtain the recognition results. Using these methods, a maximum word recognition rate of 96.0% and a maximum syllable recognition rate of 99.2% are achieved for speaker-independent continuous speech recognition with a vocabulary of about 12,000 words.
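
The first phase mentioned in this abstract relies on Viterbi beam search. The sketch below is a generic textbook version on a discrete HMM with log-domain scores, not the authors' decoder: it does not model word boundaries, the backward pass, or the A* combination step. The function names, the dictionary-based model layout, and the default beam width are assumptions for illustration.

```python
def _prune(hyps, beam):
    """Keep hypotheses whose log score is within `beam` of the best one."""
    best = max(score for score, _ in hyps.values())
    return {s: h for s, h in hyps.items() if h[0] >= best - beam}


def viterbi_beam(obs, states, log_init, log_trans, log_emit, beam=10.0):
    """Forward Viterbi search with beam pruning.

    obs       : non-empty list of observation symbols
    log_init  : log_init[s] = log P(first state is s)
    log_trans : log_trans[p][s] = log P(s | p)
    log_emit  : log_emit[s][o] = log P(o | s)
    Returns (best_state_path, best_log_score).
    """
    if not obs:
        raise ValueError("obs must not be empty")
    # active maps each surviving state to (log score, best path ending there).
    active = {s: (log_init[s] + log_emit[s][obs[0]], [s]) for s in states}
    active = _prune(active, beam)
    for o in obs[1:]:
        nxt = {}
        for prev, (score, path) in active.items():
            for s in states:
                cand = score + log_trans[prev][s] + log_emit[s][o]
                # Viterbi recursion: keep only the best predecessor per state.
                if s not in nxt or cand > nxt[s][0]:
                    nxt[s] = (cand, path + [s])
        active = _prune(nxt, beam)  # drop states far below the frame's best
    best_state = max(active, key=lambda s: active[s][0])
    score, path = active[best_state]
    return path, score
```

With a very wide beam this reduces to exact Viterbi decoding; narrowing `beam` discards low-scoring states at each frame, which saves computation at the risk of pruning the true best path.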

A Study on the Remodeling of The Training Center for Performers of Korean Traditional Music(Studio 'Byeol') for Historicity Conservation (역사성 보존을 위한 구 국악사양성소(별오름극장)의 리모델링에 관한 연구)

  • Lee, Wan-Geon
    • Korean Institute of Interior Design Journal / v.19 no.5 / pp.165-172 / 2010
  • Recently, perceptions of cultural heritage have been changing, and various types of modern and contemporary buildings and facilities have been designated as cultural properties since the Registered Cultural Properties System came into force. The purpose of this study is to examine how modern and contemporary historic buildings can be given new life, through the remodeling process of Studio 'Byeol' (the Training Center for Performers of Korean Traditional Music) in the National Theater of Korea, a so-called microcosm of performing arts history. In the process, it examines the merits and demerits of various alternatives and the direction of the remodeling, and proposes its use as basic data for post-evaluation of the remodeling of a historic building. The results are as follows. Firstly, remodeling that gives new physical properties to a building can be used as a method of conservation and reuse of a historic building. The remodeling of a historic building must proceed by compromise between the owner and the citizens, and between economic value and the conservation of historicity. In addition, the remodeling of historic buildings such as the Training Center for Performers of Korean Traditional Music must consider conserving the exterior walls, in whole or at least in part. Secondly, the architect Lee Hee Tae (李喜泰), who sought to develop and test his own architectural vocabulary based on Korean traditional architecture, and the Training Center for Performers of Korean Traditional Music must be newly evaluated today. Lastly, the remodeling alternatives for the Training Center for Performers of Korean Traditional Music were analyzed as three types: 'repairing only the interior while maintaining the present size and appearance', 'extending the outer wall to the external column line', and 'extending the basement'. The final decision to remodel only the interior was judged appropriate in the current situation because of historicity, budget, relevant laws, and other factors.