• 제목/요약/키워드: Language generation

검색결과 735건 처리시간 0.033초

자동 구두점 삽입을 이용한 Rich Transcription 생성 (Rich Transcription Generation Using Automatic Insertion of Punctuation Marks)

  • 김지환
    • 대한음성학회지:말소리
    • /
    • 제61호
    • /
    • pp.87-100
    • /
    • 2007
  • A punctuation generation system which combines prosodic information with acoustic and language model information is presented. Experiments have been conducted first for the reference text transcriptions. In these experiments, prosodic information was shown to be more useful than language model information. When these information sources are combined, an F-measure of up to 0.7830 was obtained for adding punctuation to a reference transcription. This method of punctuation generation can also be applied to the 1-best output of a speech recogniser. The 1-best output is first time aligned. Based on the time alignment information, prosodic features are generated. As in the approach applied in the punctuation generation for reference transcriptions, the best sequence of punctuation marks for this 1-best output is found using the prosodic feature model and an language model trained on texts which contain punctuation marks.

  • PDF

언어 중추 영역에 대한 기능적 자기공명영상: 시각적, 청각적 지시 과제에 관한 비교 (Functional MR Imaging of Language System : Comparative Study between Visual and Auditory Instructions in Word Generation Task)

  • 구은회;권대철;김동성;송인찬
    • 대한의용생체공학회:의공학회지
    • /
    • 제24권4호
    • /
    • pp.241-246
    • /
    • 2003
  • 정상 성인을 대상으로 언어생성과제의 시각적 지시과제와 청각적 지시과제에 대한 기능적 MR영상들을 비교하고자 하였다. 6명의 정상 성인을 대상으로 자기공명영상기기를 이용하였다 시각적, 청각적 언어 지시과제는 명사의 언어를 이용하여. 휴지기. 확성기를 각각 6번 구성하였다. 뇌의 활성영역의 휴지기와 확성기의 신호변화에 따른 데이터는 MRDx 프로그램으로 처리하였고 cross correlation 방법으로 통계 처리하였다. 시각적, 청각적 지시과제의 분석방법은 lateralization index 공식으로 뇌 반구의 활성화 voxel의 총합과 좌$.$우 반구의 픽셀 수를 구하였고, 지시 과제에 대한 활성화 영역을 “Area of Activation Cells” 값에 따라 4등급으로 분류하여 비교 분석하였다. 시간적 청각적 지시과제에서 Broca, Wernicke 영역 등에서 크기는 다르지만 모두 활성화가 관찰되었다. lateralization index에서 뇌반구의 활성화 값에 차이는 시각적 지시과제에 비해 청각적 지시과제가 높은 활성화 값을 보였다. 그러므로 언어영역에 대한 뇌의 기능적 영상을 얻는데 두 방법 모두 유용하다고 생각된다.

단위 테스트를 위한 테스트 어서션 언어 설계 및 도구 구현 (Test Assertion Language for Unit Test : Design and Implementation)

  • 신우창
    • 한국IT서비스학회지
    • /
    • 제10권4호
    • /
    • pp.205-218
    • /
    • 2011
  • Unit testing which is the first task to perform software testing has a major impact on reducing s/w development cost and improving s/w quality. In order to make unit testing become the formal procedure rather than ad-hoc tasks performed by developer, the language-neutral measures are necessary in the programming which is able to specify the test cases. This paper presents a test assertion language for the language-neutral specification of the test cases. The suggested language is based on the ISO/IEC 24709-1, but it supports object oriented features and can describe test cases for unit testing. The test cases that are specified by the language can be used for generating test oracle source codes by test oracle generation tools.

문맥의존 철자오류 후보 생성을 위한 통계적 언어모형 개선 (Improved Statistical Language Model for Context-sensitive Spelling Error Candidates)

  • 이정훈;김민호;권혁철
    • 한국멀티미디어학회논문지
    • /
    • 제20권2호
    • /
    • pp.371-381
    • /
    • 2017
  • The performance of the statistical context-sensitive spelling error correction depends on the quality and quantity of the data for statistical language model. In general, the size and quality of data in a statistical language model are proportional. However, as the amount of data increases, the processing speed becomes slower and storage space also takes up a lot. We suggest the improved statistical language model to solve this problem. And we propose an effective spelling error candidate generation method based on a new statistical language model. The proposed statistical model and the correction method based on it improve the performance of the spelling error correction and processing speed.

엔터테인먼트 데이터를 위한 자연어 검색시스템 (A Natural Language Retrieval System for Entertainment Data)

  • 김정인
    • 한국멀티미디어학회논문지
    • /
    • 제18권1호
    • /
    • pp.52-64
    • /
    • 2015
  • Recently, as the quality of life has been improving, search items in the area of entertainment represent an increasing share of the total usage of Internet portal sites. Information retrieval in the entertainment area is mainly depending on keywords that users are inputting, and the results of information retrieval are the contents that contain those keywords. In this paper, we propose a search method that takes natural language inputs and retrieves the database pertaining to entertainment. The main components of our study are the simple Korean morphological analyzer using case particle information, predicate-oriented token generation, standardized pattern generation coherent to tokens, and automatic generation of the corresponding SQL queries. We also propose an efficient retrieval system that searches the most relevant results from the database in terms of natural language querying, especially in the restricted domain of music, and shows the effectiveness of our system.

분산전원 상세모델을 적용한 DC Micro-grid의 동작특성 분석 (Operational Characteristic Analysis of DC Micro-grid with Detail Model of Distributed Generation)

  • 이지헌;권기현;한병문;차한주
    • 전기학회논문지
    • /
    • 제58권11호
    • /
    • pp.2175-2184
    • /
    • 2009
  • This paper describes operational analysis results of the DC micro-grid using detailed model of distributed generation. Detailed model of wind power generation, photo-voltaic generation, fuel-cell generation was implemented with the user-defined model of PSCAD/EMTDC software that is coded with C-language. The operation analysis was carried out using PSCAD/EMTDC software, in which the power circuit is implemented by built-in model and the controller is modelled by user-defined model that is also coded with C-language. Various simulation results confirm that the DC micro-grid can operate without any problem in both the interconnected mode and the islanded mode. The operation analysis result confirms that the DC micro-grid make it feasible to provide power to the load stably. And it can be utilize to develop the actual system design and building.

Kant 시스템에서의 한국어 생성을 위한 언어 정보의 구축 (Construction of Korean Linguistic Information for the Korean Generation on KANT)

  • 윤덕호
    • 한국정보처리학회논문지
    • /
    • 제6권12호
    • /
    • pp.3539-3547
    • /
    • 1999
  • KANT(Knowledge-based Accurate Natural language Translation) 시스템 생성 엔진을 위한 한국어 언어 정보를 구축하였다. KANT 시스템은 언어 중립적인 생성 엔진을 갖고 있기 때문에 한국어 언어 정보의 구축은 사실상 한국어 생성 모듈의 개발을 의미한다. 구축된 언어 정보는 개념별 한국어 대응 규칙, 범주별 한국어 대응 규칙, 한국어 사전 및 템플리트 선언, 한국어 문법 규칙, 한국어 어휘 유형, 한국어 어휘 규칙, 한국어 다시 쓰기 규칙 등으로 구성된다. 구축된 언어 정보를 이용해 KANT 시스템 개발 측이 준비한 118 문장 분량의 중간 언어 표현로부터 106 문장을 올바르며 완전한 한국어 문장으로서 생성하였다.

  • PDF

지식 기반 프랑스어 발음열 생성 시스템 (A knowledge-based pronunciation generation system for French)

  • 김선희
    • 말소리와 음성과학
    • /
    • 제10권1호
    • /
    • pp.49-55
    • /
    • 2018
  • This paper aims to describe a knowledge-based pronunciation generation system for French. It has been reported that a rule-based pronunciation generation system outperforms most of the data-driven ones for French; however, only a few related studies are available due to existing language barriers. We provide basic information about the French language from the point of view of the relationship between orthography and pronunciation, and then describe our knowledge-based pronunciation generation system, which consists of morphological analysis, Part-of-Speech (POS) tagging, grapheme-to-phoneme generation, and phone-to-phone generation. The evaluation results show that the word error rate of POS tagging, based on a sample of 1,000 sentences, is 10.70% and that of phoneme generation, using 130,883 entries, is 2.70%. This study is expected to contribute to the development and evaluation of speech synthesis or speech recognition systems for French.

Localization of Broca's Area Using Functional MR Imaging: Quantitative Evaluation of Paradigms

  • Kim, Chi-Heon;Kim, Jae-Hun;Chung, Chun-Kee;Kim, June-Sic;Lee, Jong-Min;Lee, Sang-Kun
    • Journal of Korean Neurosurgical Society
    • /
    • 제45권4호
    • /
    • pp.219-223
    • /
    • 2009
  • Objective : Functional magnetic resonance imaging (fMRI) is frequently used to localize language areas in a non-invasive manner. Various paradigms for presurgical localization of language areas have been developed, but a systematic quantitative evaluation of the efficiency of those paradigms has not been performed. In the present study, the authors analyzed different language paradigms to see which paradigm is most efficient in localizing frontal language areas. Methods : Five men and five women with no neurological deficits participated (mean age, 24 years) in this study. All volunteers were right-handed. Each subject performed 4 tasks, including fixation (Fix), sentence reading (SRI. pseudoword reading (PR), and word generation (WG). Fixation and pseudoword reading were used as contrasts. The functional area was defined as the area(s) with a t-value of more than 3.92 in fMRI with different tasks. To apply an anatomical constraint, we used a brain atlas mapping system, which is available in AFNI, to define the anatomical frontal language area. The numbers of voxels in overlapped area between anatomical and functional area were individually counted in the frontal expressive language area. Results : Of the various combinations, the word generation task was most effective in delineating the frontal expressive language area when fixation was used as a contrast (p<0.05). The sensitivity of this test for localizing Broca's area was 81 % and specificity was 70%. Conclusion : Word generation versus fixation could effectively and reliably delineate the frontal language area. A customized effective paradigm should be analyzed in order to evaluate various language functions.

한국어-수화 번역시스템을 위한 형태소 변환 (Morpheme Conversion for korean Text-to-Sign Language Translation System)

  • 박수현;강석훈;권혁철
    • 한국정보처리학회논문지
    • /
    • 제5권3호
    • /
    • pp.688-702
    • /
    • 1998
  • 본 논문에서는 한국어 각 품사별로 형태소 해석 규칙에 대응하는 수화 형태소 생성규칙을 제안한다. 한국어 자연수화는 한국어 자연언어에 비하여 극히 한정된 어휘를 가지며, 문법 요소의 수도 매우 한정적으로 사용되고 있다. 따라서 본 논문에서는 자연스러운 한국어 문장을 대응하는 수화로 변환시키기 위해서 한국어 문법에 대응하는 자연수화 문법을 정의한다. 각 phrase는 한국어 해석 문법과는 별도의 수화 형태소 생성문법을 정의 해야 하며, 이 문법은 형태소 해석/결합 규칙 및 구구조 해석규칙에 적용되고, 이 규칙의 정의로 가장 자연스러운 자연수화를 생성할 수 있게 된다.

  • PDF