• Title/Summary/Keyword: generative language model (생성 언어 모델)

An English-to-Korean Transliteration Model based on Grapheme and Phoneme (자소 및 음소 정보를 이용한 영어-한국어 음차표기 모델)

  • Oh Jong-Hoon;Choi Key-Sun
    • Journal of KIISE:Software and Applications / v.32 no.4 / pp.312-326 / 2005
  • Interest in English-to-Korean transliteration has been increasing recently. Previous work falls into two groups: a direct method, <English graphemes $\rightarrow$ Korean graphemes>, and a pivot method, <English graphemes $\rightarrow$ English phonemes $\rightarrow$ Korean graphemes>. Most previous work focuses on the direct method, yet transliteration is a phonetic process rather than an orthographic one. From this point of view, we present an English-Korean transliteration model that uses both grapheme and phoneme information. Unlike previous work, our method uses phonetic information such as phonemes and their context, together with the graphemes corresponding to those phonemes. Our method shows about $60\%$ word accuracy.

An Abstract Grammar for XML Document Editing (XML 문서 편집을 위한 추상문법)

  • 신경희;최종명;유재우
    • Journal of KIISE:Software and Applications / v.30 no.3_4 / pp.268-277 / 2003
  • A document type definition (DTD), which defines the tags for a document, is an XML document grammar that defines the document's syntactic structure. An XML document must follow these rules and must be parsed to check its validity. The deterministic parsing methods used for programming languages are unsuitable for parsing XML documents because they do not satisfy the definition of the deterministic content model in element declarations. In this paper, we consider editing a valid XML document in a syntax-directed editing environment, and we propose internal storage representations of the DTD syntax and their algorithms. As a result, the syntactic structure of a textual DTD is transformed into graph and table structures. The table structure of the DTD is interpreted as a context-free grammar with attribute values and is used in a syntax-directed editor for XML. We call this the XML abstract grammar and present generated results and examples.

Development of Verification and Conformance Testing Tools for Communication Protocol (통신 프로토콜 검정기 및 적합성시험 도구 개발)

  • Seo Mi-Seon;Hwang Jong-Gyu;Lee Jae-Ho;Kim Sung-Un
    • Journal of Korea Multimedia Society / v.8 no.8 / pp.1119-1133 / 2005
  • Verification and conformance testing of protocol specifications are an important part of protocol development; they are complementary techniques used to increase confidence that a system functions as prescribed by its specification. In this paper, we verify the safety and liveness properties of rail signal control protocol type 1, specified as an LTS, using model checking, and show experimentally that deadlock, livelock, and reachability of states and actions can be checked on the LTS. The implemented formal checker can verify whether properties expressed in modal logic hold in a specification, using the modal mu-calculus. We also propose a formal method for generating conformance test cases from the verified protocol specification using the concept of UIO sequences. The proposed tools are implemented in C++ on Windows NT.
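
The idea of a UIO (unique input/output) sequence mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' tool (which the abstract says is written in C++): the three-state Mealy machine `FSM`, the helper names `run` and `find_uio`, and the brute-force search are all assumptions made for illustration, and the machine is assumed to be completely specified (every state accepts every input).

```python
from itertools import product

# Hypothetical Mealy machine: FSM[state][input] = (next_state, output).
FSM = {
    "s0": {"a": ("s1", 0), "b": ("s0", 1)},
    "s1": {"a": ("s2", 0), "b": ("s1", 1)},
    "s2": {"a": ("s0", 1), "b": ("s2", 1)},
}


def run(fsm, state, inputs):
    """Apply an input sequence from `state` and return the output sequence."""
    outputs = []
    for symbol in inputs:
        state, out = fsm[state][symbol]
        outputs.append(out)
    return tuple(outputs)


def find_uio(fsm, state, max_len=4):
    """Return a shortest input sequence whose output from `state` differs
    from the output produced by every other state, or None if no such
    sequence exists up to `max_len` inputs."""
    alphabet = sorted({sym for trans in fsm.values() for sym in trans})
    for length in range(1, max_len + 1):
        for seq in product(alphabet, repeat=length):
            expected = run(fsm, state, seq)
            others = (s for s in fsm if s != state)
            if all(run(fsm, s, seq) != expected for s in others):
                return seq
    return None
```

For this toy machine, `find_uio(FSM, "s2")` returns `("a",)`, because only `s2` outputs 1 on input `a`, while `s0` and `s1` each need the two-input sequence `("a", "a")`. In conformance testing, applying a state's UIO sequence after a transition is one way to check that the implementation ended up in the expected state.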

Emotion and Sentiment Analysis from a Film Script: A Case Study (영화 대본에서 감정 및 정서 분석: 사례 연구)

  • Yu, Hye-Yeon;Kim, Moon-Hyun;Bae, Byung-Chull
    • Journal of Digital Contents Society / v.18 no.8 / pp.1537-1542 / 2017
  • Emotion plays a key role in both generating and understanding narrative. In this article, we analyze the emotions represented in a movie script based on the 8 emotion types in Plutchik's wheel of emotions. First, we conducted manual emotion tagging scene by scene. The most dominant emotions in the manual tagging were anger, fear, and surprise, which makes sense because the film script we analyzed is a thriller. We assumed that emotions around the climax of the story would be heightened as the tension rose. From the manual tagging we identified three such intervals in which the tension is high. Next, we analyzed the emotions in the same script using the Python-based NLTK VADERSentiment tool. The results showed that anger and fear matched best. Surprise, anticipation, and disgust, however, showed lower matching scores.

A Study on the Conceptual Modeling and Implementation of a Semantic Search System (시맨틱 검색 시스템의 개념적 모형화와 그 구현에 대한 연구)

  • Han, Dong-Il;Kwon, Hyeong-In;Chong, Hak-Jin
    • Journal of Intelligence and Information Systems / v.14 no.1 / pp.67-84 / 2008
  • This paper proposes a design and implementation of a semantic search system. The proposed model comprises three architecture layers of a semantic search system, conceptually named Knowledge Acquisition, Knowledge Representation, and Knowledge Utilization. The three layers are designed to work together interactively so as to best satisfy users' information needs. The Knowledge Acquisition Layer covers indexing and storage of semantic metadata from various sources of web content (e.g., text, images, and multimedia). The Knowledge Representation Layer covers the ontology schema and instances, used in semantic search through ontology-based query expansion. Finally, the Knowledge Utilization Layer lets users enter search queries intuitively and get results without knowledge of semantic web languages or ontologies. As far as the design and implementation of a semantic search site are concerned, the proposed semantic search system offers useful implications for researchers and practitioners seeking to bring research-level work to commercial use.
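
Ontology-based query expansion, which the abstract names but does not detail, could look roughly like the following Python sketch. The toy `ONTOLOGY` dictionary, the relation names `synonyms` and `narrower`, and the function `expand_query` are hypothetical; the paper's actual ontology schema and expansion rules are not given in the abstract.

```python
# Hypothetical toy ontology: each term maps to related terms by relation type.
ONTOLOGY = {
    "car": {"synonyms": ["automobile"], "narrower": ["sedan", "suv"]},
    "sedan": {"narrower": ["compact sedan"]},
}


def expand_query(terms, ontology, max_depth=1):
    """Breadth-first expansion of query terms with synonyms and narrower
    terms, following relations at most `max_depth` steps from the input."""
    expanded = []
    seen = set()
    frontier = [(term.lower(), 0) for term in terms]
    while frontier:
        term, depth = frontier.pop(0)
        if term in seen:
            continue
        seen.add(term)
        expanded.append(term)
        if depth < max_depth:
            entry = ontology.get(term, {})
            for relation in ("synonyms", "narrower"):
                for related in entry.get(relation, []):
                    frontier.append((related, depth + 1))
    return expanded
```

With this toy data, `expand_query(["car"], ONTOLOGY)` yields the original term followed by its synonym and its two narrower terms. Capping the depth is a simple guard against the expanded query drifting too far from what the user typed.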

3D Graphic Nursery Contents Developed by Mobile AR Technology (모바일 기반 증강현실 기술을 활용한 3D전래동화 콘텐츠 연구)

  • Park, Young-sook;Park, Dea-woo
    • Journal of the Korea Institute of Information and Communication Engineering / v.20 no.11 / pp.2125-2130 / 2016
  • In this paper, we study the merits of 3D graphic nursery content developed with mobile AR technology. AR technology is currently attracting attention because of its potential to become core content of the future ICT industry. We applied the AR nursery content to children's Korean, Chinese, and English education, with selectable subtitle languages. Each original fairy tale was adapted and translated, and organized into 6 to 8 scenes for 3D content production. Dubbing was performed by native speakers using standard pronunciation, and sound effects were edited separately to fit each scene. After the scenario is composed and the 3D models, interactions, sound effects, and content metadata are created, the Unity 3D game engine is used to create a project and describe the behavior in scripts. The result presents traditional fairy tales in a fun and informative way, with rich content that incorporates ICT, and gives children opportunities to receive advanced technology-based education and to encounter software in daily life.

Considerations for BIBFRAME Acceptance of Expression and Representative Expression Attributes in LRM (BIBFRAME에서 LRM 표현형 및 대표표현형 속성 적용시 고려사항)

  • Lee, Mihwa
    • Journal of the Korean BIBLIA Society for library and Information Science / v.30 no.2 / pp.33-50 / 2019
  • Cataloging principles, cataloging rules, and encoding formats should consider accepting LRM because LRM has replaced FRBR as the conceptual model. This study identifies considerations for BIBFRAME's acceptance of the expression and representative expression attributes in LRM, using literature reviews and expert interviews. First, a work in BIBFRAME, which has no expression entity, could map to both the work and the expression of LRM, and expression could be maintained by linking 2 works (work and expression). Second, BIBFRAME must consider the association between representative expression attributes and the specific expressions whose values can be transferred to those attributes. Third, representative expression attributes differ according to work type in LRM, and language, media, intended audience, and scale, which can be used as representative expression attributes in BIBFRAME, should be changed in class. Fourth, relation properties should be articulated to expand the networks between expressions originating from a work in BIBFRAME. This study analyzes LRM and BIBFRAME with a focus on the expression entity and representative expression attributes. Further study of LRM is needed with respect to cataloging principles and cataloging rules.

GPU-based modeling and rendering techniques of 3D clouds using procedural functions (절차적 함수를 이용한 GPU기반 실시간 3D구름 모델링 및 렌더링 기법)

  • Sung, Mankyu
    • Journal of the Korea Institute of Information and Communication Engineering / v.23 no.4 / pp.416-422 / 2019
  • This paper proposes GPU-based modeling and rendering of 3D clouds using procedural functions. Cloud formation is based on a modified noise function built with fBm (fractional Brownian motion). The noise values are converted into densities of liquid water droplets, a critical parameter for forming the three different types of clouds. At the rendering stage, the algorithm applies ray marching to determine cloud colors from the density values obtained from the noise function. In this process, all light attenuation and scattering are calculated in a physically based manner. Once the clouds are generated, they are blended onto the sky, which is also rendered in a physically based way. The clouds are also moved across the sky by wind force. All algorithms are implemented and tested on the GPU using GLSL.
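
As a rough illustration of how fBm turns noise into a density field, here is a minimal CPU sketch in Python. It is not the paper's GLSL implementation or its modified noise function: the integer hash in `lattice_value`, the use of 2D value noise as the base signal, the octave parameters, and the `coverage` threshold in `cloud_density` are all assumptions chosen to keep the example short.

```python
import math


def lattice_value(ix, iy, seed=0):
    """Deterministic pseudo-random value in [0, 1) for an integer grid point."""
    n = (ix * 374761393 + iy * 668265263 + seed * 1442695040888963407) & 0xFFFFFFFF
    n = ((n ^ (n >> 13)) * 1274126177) & 0xFFFFFFFF
    n ^= n >> 16
    return n / 0x100000000


def smoothstep(t):
    """Cubic fade curve so the noise has no creases at grid lines."""
    return t * t * (3.0 - 2.0 * t)


def value_noise(x, y, seed=0):
    """Bilinearly interpolated value noise in [0, 1)."""
    ix, iy = math.floor(x), math.floor(y)
    u, v = smoothstep(x - ix), smoothstep(y - iy)
    a = lattice_value(ix, iy, seed)
    b = lattice_value(ix + 1, iy, seed)
    c = lattice_value(ix, iy + 1, seed)
    d = lattice_value(ix + 1, iy + 1, seed)
    top = a + (b - a) * u
    bottom = c + (d - c) * u
    return top + (bottom - top) * v


def fbm(x, y, octaves=5, lacunarity=2.0, gain=0.5, seed=0):
    """Sum octaves of noise: each octave raises the frequency by
    `lacunarity` and scales the amplitude by `gain`."""
    total, norm = 0.0, 0.0
    amplitude, frequency = 1.0, 1.0
    for octave in range(octaves):
        total += amplitude * value_noise(x * frequency, y * frequency, seed + octave)
        norm += amplitude
        amplitude *= gain
        frequency *= lacunarity
    return total / norm  # normalized back to [0, 1)


def cloud_density(x, y, coverage=0.5):
    """Assumed mapping from noise to droplet density: values below
    `coverage` become empty sky, the rest is rescaled to [0, 1]."""
    return max(0.0, fbm(x, y) - coverage) / (1.0 - coverage)
```

A ray marcher would sample a 3D version of such a density function at steps along each view ray and accumulate attenuation and scattering; the 2D version here only shows the octave summation itself.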

Deep Learning-Based Automation Cyber Attack Convergence Trend Analysis Mechanism for Deep Learning-Based Security Vulnerability Analysis (사이버공격 융합 동향 분석을 위한 딥러닝 기반 보안 취약점 분석 자동화 메커니즘)

  • Kim, Jinsu;Park, Namje
    • Journal of the Korea Institute of Information Security & Cryptology / v.32 no.1 / pp.99-107 / 2022
  • In today's technological society, where various technologies converge and are transformed into new ones, new cyber attacks emerge in step with changes in society. In particular, because various attacks converge into one, it is difficult to protect a system with existing security systems alone. A large amount of information is being generated to respond to such cyber attacks. However, indiscriminately generated vulnerability information can cause confusion by giving administrators unnecessary information. This paper therefore proposes a mechanism that assists in analyzing emerging cyber attack convergence technologies: it learns from documents using a deep learning-based language model, extracts vulnerability information, classifies it according to the MITRE ATT&CK framework, and provides differentiated vulnerability information to administrators.

A Morpheme Analyzer based on Transformer using Morpheme Tokens and User Dictionary (사용자 사전과 형태소 토큰을 사용한 트랜스포머 기반 형태소 분석기)

  • DongHyun Kim;Do-Guk Kim;ChulHui Kim;MyungSun Shin;Young-Duk Seo
    • Smart Media Journal / v.12 no.9 / pp.19-27 / 2023
  • Since morphemes are the smallest units of meaning in Korean, an accurate morpheme analyzer is needed to improve the performance of Korean language models. However, most existing analyzers produce morpheme analysis results by learning from word-unit tokens as input. Because Korean words consist of postpositions and affixes attached to a root, the meaning of words with the same root tends to change depending on the postpositions or affixes. Learning morphemes from word-unit tokens can therefore lead to misclassification of postpositions or affixes. In this paper, we use morpheme-level tokens to capture the inherent meaning of Korean sentences and propose a morpheme analyzer based on sequence generation with a Transformer. In addition, a user dictionary is constructed from corpus data to address the out-of-vocabulary problem. In the experiments, the morphemes and morpheme tags output by each analyzer were compared with the ground-truth data, and the results showed that the proposed morpheme analyzer performed better than existing morpheme analyzers.