• Title/Summary/Keyword: 전자문서처리

Search Result 457, Processing Time 0.026 seconds

Korean Named Entity Recognition Using BIT Representation (BIT 표기법을 활용한 한국어 개체명 인식)

  • Yoon, Ho;Kim, Chang-Hyun;Cheon, Min-Ah;Park, Ho-Min;Namgoong, Young;Choi, Min-Seok;Kim, Jae-Kyun;Kim, Jae-Hoon
    • Annual Conference on Human and Language Technology
    • /
    • 2019.10a
    • /
    • pp.190-194
    • /
    • 2019
  • 개체명 인식이란 주어진 문서에서 개체명의 범위를 찾고 개체명을 분류하는 것이다. 최근 많은 연구는 신경망 모델을 이용하며 하나 이상의 단어로 구성된 개체명을 BIO 표기법으로 표현한다. BIO 표기법은 개체명이 시작되는 단어의 표지에 B(Beginning)-를 붙이고, 개체명에 포함된 그 외의 단어의 표지에는 I(Inside)-를 붙이며, 개체명과 개체명 사이의 모든 단어의 표지를 O로 간주하는 방법이다. BIO 표기법으로 표현된 말뭉치는 O 표지가 90% 이상을 차지하므로 O 표지에 대한 혼잡도가 높아지는 문제와 불균형 학습 문제가 발생된다. 본 논문에서는 BIO 표기법 대신에 BIT 표기법을 제안한다. BIT 표기법이란 BIO 표기법에서 O 표지를 T(Tag) 표지로 변환하는 방법이며 본 논문에서 T 표지는 품사 표지를 나타낸다. 실험을 통해서 BIT 표기법이 거의 모든 경우에 성능이 향상됨을 확인할 수 있었다.

  • PDF

Construction of Evaluation-Annotated Datasets for EA-based Clothing Recommendation Chatbots (패션앱 후기글 평가분석에 기반한 의류 검색추천 챗봇 개발을 위한 학습데이터 EVAD 구축)

  • Choi, Su-Won;Hwang, Chang-Hoe;Yoo, Gwang-Hoon;Nam, Jee-Sun
    • Annual Conference on Human and Language Technology
    • /
    • 2021.10a
    • /
    • pp.467-472
    • /
    • 2021
  • 본 연구는 패션앱 후기글에 나타나는 구매자의 의견에 대한 '평가분석(Evaluation Analysis: EA)'을 수행하여, 이를 기반으로 상품의 검색 및 추천을 수행하는 의류 검색추천 챗봇을 개발하는 LICO 프로젝트의 언어데이터 구축의 일환으로 수행되었다. '평가분석 트리플(EAT)'과 '평가기반요청 쿼드러플(EARQ)'의 구성요소들에 대한 주석작업은, 도메인 특화된 단일형 핵심어휘와 다단어(MWE) 핵심패턴들을 FST 방식으로 구조화하는 DECO-LGG 언어자원에 기반하여 반자동 언어데이터 증강(SSP) 방식을 통해 진행되었다. 이 과정을 통해 20여만 건의 후기글 문서(230만 어절)로 구성된 EVAD 평가주석데이터셋이 생성되었다. 여성의류 도메인의 평가분석을 위한 '평가속성(ASPECT)' 성분으로 14가지 유형이 분류되었고, 각 '평가속성'에 연동된 '평가내용(VALUE)' 쌍으로 전체 35가지의 {ASPECT-VALUE} 카테고리가 분류되었다. 본 연구에서 구축된 EVAD 평가주석 데이터의 성능을 평가한 결과, F1-Score 0.91의 성능 평가를 획득하였으며, 이를 통해 향후 다른 도메인으로의 확장된 적용 가능성이 유효함을 확인하였다.

  • PDF

Design and Implementation of Feature Catalogue Builder based on the S-100 Standard (S-100 표준 기반 피처 카탈로그 제작지원 시스템의 설계 및 구현)

  • Park, Daewon;Kwon, Hyuk-Chul;Park, Suhyun
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.2 no.8
    • /
    • pp.571-578
    • /
    • 2013
  • The IHO S-100 is a standard on the universal hydorgraphic data model for supporting information services that integrate various data in maritime and provide proper information for safety of vessels. The S-100 is used to develop S-10x product specifications which are standards on guideline for creation and delivery of specific data set in maritime. The product specification for feature-based data such as ENC(Electronic Navigational Chart) data includes a feature catalogue that describes characteristics of features in that feature-based data. The feature catalogue is developed by domain experts with knowledge on data of the target domain. However, it is not feasible to develop a feature catalogue according to the XML schema by manual. In the IHO TSMAD committee meeting, needs of developing technology on building feature catalogue has been discussed. Therefore, we present a feature catalogue builder that is a GUI(Graphic User Interface) system supporting domain experts to build feature catalogues in XML. The feature catalogue builder is developed to connect with the FCD(Feature Concept Dictionary) register in the IHO(International Hydrographic Organization) GI(Geographic Information) registry. Also, it supports domain experts to select proper feature items based on the relationships between register items.

The Construction of Multiform User Profiles Based on Transaction for Effective Recommendation and Segmentation (효과적인 추천과 세분화를 위한 트랜잭션 기반 여러 형태 사용자 프로파일의 구축)

  • Koh, Jae-Jin;An, Hyoung-Keun
    • The KIPS Transactions:PartD
    • /
    • v.13D no.5 s.108
    • /
    • pp.661-670
    • /
    • 2006
  • With the development of e-Commerce and the proliferation of easily accessible information, information filtering systems such as recommender and SDI systems have become popular to prune large information spaces so that users are directed toward those items that best meet their needs and preferences. Until now, many information filtering methods have been proposed to support filtering systems. XML is emerging as a new standard for information. Recently, filtering systems need new approaches in dealing with XML documents. So, in this paper our system suggests a method to create multiform user profiles with XML's ability to represent structure. This system consists of two parts; one is an administrator profile definition part that an administrator defines to analyze users purchase pattern before a transaction such as purchase happens directly. an other is a user profile creation part module which is applied by the defined profile. Administrator profiles are made from DTD information and it is supposed to point the specific part of a document conforming to the DTD. Proposed system builds user's profile more accurately to get adaptability for user's behavior of buying and provide useful product information without inefficient searching based on such user's profile.

A Study on Development and Operation of Standards Information System (차세대 표준정보유통시스템 구축 및 운용에 관한 연구)

  • 구경철;이준섭;송기평;박기식
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2000.05a
    • /
    • pp.93-97
    • /
    • 2000
  • In an era of very rapid technological change, standardization organizations have been confronted with the problem that they should develop more new and more complex standards as strategic tools for enforcement of competitive power in shorter time according to the reduction of Life Cycle of products and the increase in the number of standards and their complexity. To cope with this challenge, we introduce the functionalities and development directions for implementing standardization information system. Also this paper describes the architecture of the next generation system for standard information that enabling information sharing and exchanging using XML. Finally, we suggest implementation framework of Standardization Vortal Site for the One-stop service of integrated standards information which provide users with standards development roadmap and give brief introduction to SOL(Standards On-Line) site which is operated by ETRI/PEC.

  • PDF

Wiki-based Interactive Electronic Technical Manuals (IETM) for Construction Project Management : Through a Case Study of Urban Regeneration Projects (Wiki기반 건설 사업관리 전자매뉴얼 : 도시환경정비사업 사례연구를 통해)

  • Park, Moon-Seo;Kang, Sung-Hoon;Lee, Hyun-Soo
    • Korean Journal of Construction Engineering and Management
    • /
    • v.11 no.3
    • /
    • pp.3-12
    • /
    • 2010
  • Recently, workers on construction sites can hardly understand their tasks and the process of the tasks because the construction projects are becoming large and complex. Due to the complexity, workers need a tool that can help them understand their works and some paper-based manuals exist to support them. However, the existing paperbased manuals are not actively used by workers due to the low credibility of information on the manual. In particular, paper-based manual can not be updated frequently because those manuals have to be manually updated by experts. Thus, the credibility of information is decreased and the decrease let the users distrust information of the paper-based manual. Therefore, this research analyzed the feature of wiki-based system and suggested an wiki-based IETM(Interactive Electronic Technical Manual) system model that can be modified by users, and then conducted a survey through a prototype based on urban regeneration project. The result of the survey indicated that wikibased IETM can improve the stakeholder communication by reinforcing the process of creating knowledge as well as knowledge itself.

Digital Holographic Security Identification System (디지털 홀로그래픽 보안 인증 시스템)

  • Kim, Jung-Hoi;Kim, Nam;Jeon, Seok-Hee
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.41 no.2
    • /
    • pp.89-98
    • /
    • 2004
  • In this paper, we implement a digital holographic security card system that combines digital holographic memory using random phase encoded reference beams with electrical biometrics. Digitally encoded data including a document, a picture of face, and a fingerprint are recorded by multiplexing of holographic memory. A random phase mask encoding reference beams are used as a decoded key to protect illegal counterfeit. As a result, we can achieve a raw BER of 3.6${\times}$10-4 and shift selectivity of 4${\mu}{\textrm}{m}$ using the 2D random phase mask. Also, we develop a recording pattern and image processing which are suitable for a low cost reader without a position sensing photo-detector for real time data extraction and remove danger of fraud from unauthorized person by comparing the reconstructed holographic data with the live fingerprint data.

Interactive Interface Design Through VR Hand Tracking (VR 핸드트레킹을 통한 상호작용 인터페이스 설계)

  • Ju-Sang Lee;Hyo-Seung Lee;Woo-Jun Choi
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.18 no.1
    • /
    • pp.213-218
    • /
    • 2023
  • In order to use the VR HMD, operation through separate controllers in both hands is required. Methods for text input in VR include a method of selecting virtual keyboard keys on the screen one by one using a controller, a method of inputting through a keyboard that is a resource of a computer by connecting a computer and VR, or a method of inputting through a keyboard that is a resource of a computer by purchasing a wireless keyboard in VR and wireless keyboard connection method. As such, the text input method in current VR equipment causes inconvenience and additional costs to users. For these reasons, most of the VR-related contents are limited to simple functions such as games or viewers, and there is a risk that VR equipment will be recognized as a simple game machine. Therefore, in this study, a multi-input interface using hand tracking provided by the Oculus Quest2 device is designed and partially implemented. Through this, it is expected that various tasks such as document work and business processing as well as games can be conveniently used using VR equipment.

Methodology of Automatic Editing for Academic Writing Using Bidirectional RNN and Academic Dictionary (양방향 RNN과 학술용어사전을 이용한 영문학술문서 교정 방법론)

  • Roh, Younghoon;Chang, Tai-Woo;Won, Jongwun
    • The Journal of Society for e-Business Studies
    • /
    • v.27 no.2
    • /
    • pp.175-192
    • /
    • 2022
  • Artificial intelligence-based natural language processing technology is playing an important role in helping users write English-language documents. For academic documents in particular, the English proofreading services should reflect the academic characteristics using formal style and technical terms. But the services usually does not because they are based on general English sentences. In addition, since existing studies are mainly for improving the grammatical completeness, there is a limit of fluency improvement. This study proposes an automatic academic English editing methodology to deliver the clear meaning of sentences based on the use of technical terms. The proposed methodology consists of two phases: misspell correction and fluency improvement. In the first phase, appropriate corrective words are provided according to the input typo and contexts. In the second phase, the fluency of the sentence is improved based on the automatic post-editing model of the bidirectional recurrent neural network that can learn from the pair of the original sentence and the edited sentence. Experiments were performed with actual English editing data, and the superiority of the proposed methodology was verified.

Registration and Description of Public Records in Korea : A Comparative Analysis of Korean Recordskeeping System with the International Standards (한국의 기록물 둥록 및 기술에 대한 기록관리적 접근)

  • Si, Kwi-Sun
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.3 no.1
    • /
    • pp.69-92
    • /
    • 2003
  • Registration and description of records are important elements of processing which provide with the background information of production of records and business-related information. They also enable to search and use the records. In this paper, I examined the Korean registration and description system defined in the Public Records Management Act which directs the records creating agency to register records in creating offices and directs the "professional archives" to make "basic registrations" and "detailed registrations" of the records. In the analysis and comparison of two different registration and description systems with the known international standards of records and archives management, such as ISO15489 and ISAD(G), I intended to evaluate the Korean records and archives management system and suggested recommendations for the renovation of the Korean recordskeeping system. Despite we have unique office business procedures and the culture of officialdom, and despite we have developed our system based on the established business procedures and office culture, it would be preferable to adopt or follow the international standards and established best practices. After the comparative analysis, I recommended some innovations in the filed of registration and description. For instance, in the basic registration. we would better to install an item of "simple contents summary." We may also need the multiple-level description. The fonds level description and the series level description should be introduced to our archival automated management system. We need to establish a Korean standard of description adopting the rules of the ISAD(G) and ISAAR(CPF). Essential requirements for electronic records management, such as contextual and structural information, should be incorporated in the new standard. Documentation of records disposition also should be reinforced to guarantee the authenticity of records and to ensure control of the records. To implement the recommendations for the standard, we need to amend the Public Records Management Act and its Regulations and Rules. Also it is imperative to redesign the GARS integrated archival automated management system.