• Title/Summary/Keyword: 용어사전

Search Result 399, Processing Time 0.028 seconds

Automatic Recognition of Translation Phrases Enclosed with Parenthesis in Korean-English Mixed Documents (한영 혼용문에서 괄호 안 대역어구의 자동 인식)

  • Lee, Jae-Sung;Seo, Young-Hoon
    • The KIPS Transactions:PartB
    • /
    • v.9B no.4
    • /
    • pp.445-452
    • /
    • 2002
  • In Korean-English mixed documents, translated technical words are usually used with the attached full words or original words enclosed with parenthesis. In this paper, a collective method is presented to recognize and extract the translation phrases with using a base translation dictionary. In order to process the unregistered title words and translation words in the dictionary, a phonetic similarity matching method, a translation partial matching method, and a compound word matching method are newly proposed. The experiment result of each method was measured in F-measure(the alpha is set to 0.4) ; exact matching of dictionary terms as a baseline method showed 23.8%, the hybrid method of translation partial matching and phonetic similarity matching 75.9%, and the compound word matching method including the hybrid method 77.3%, which is 3.25 times better than the baseline method.

User Interaction-based Graph Query Formulation and Processing (사용자 상호작용에 기반한 그래프질의 생성 및 처리)

  • Jung, Sung-Jae;Kim, Taehong;Lee, Seungwoo;Lee, Hwasik;Jung, Hanmin
    • Journal of KIISE:Databases
    • /
    • v.41 no.4
    • /
    • pp.242-248
    • /
    • 2014
  • With the rapidly growing amount of information represented in RDF format, efficient querying of RDF graph has become a fundamental challenge. SPARQL is one of the most widely used query languages for retrieving information from RDF dataset. SPARQL is not only simple in its syntax but also powerful in representation of graph pattern queries. However, users need to make a lot of efforts to understand the ontology schema of a dataset in order to compose a relevant SPARQL query. In this paper, we propose a graph query formulation and processing scheme based on ontology schema information which can be obtained by summarizing RDF graph. In the context of the proposed querying scheme, a user can interactively formulate the graph queries on the graphic user interface without making efforts to understand the ontology schema and even without learning SPARQL syntax. The graph query formulated by a user is transformed into a set of class paths, which are stored in a relational database and used as the constraint for search space reduction when the relational database executes the graph search operation. By executing the LUBM query 2, 8, and 9 over LUBM (10,0), it is shown that the proposed querying scheme returns the complete result set.

A Study on Automatic Text Categorization of Web-Based Query Using Synonymy List (유사어 사전을 이용한 웹기반 질의문의 자동 범주화에 관한 연구)

  • Nam, Young-Joon;Kim, Gyu-Hwan
    • Journal of Information Management
    • /
    • v.35 no.4
    • /
    • pp.81-105
    • /
    • 2004
  • In this study, the way of the automatic text categorization on web-based query was implemented. X2 methods based on the Supported Vector Machine were used to test the efficiency of text categorization on queries. This test is carried out by the model using the Synonymy List. 713 synonyms were extracted manually from the tested documents. As the result of this test, the precision ratio and the recall ratio were decreased by -0.01% and by 8.53%, respectively whether the synonyms were assigned or not. It also shows that the Value of F1 Measure was increased by 4.58%. The standard deviation between the recall and precision ratio was improve by 18.39%.

Detection of Adverse Drug Reactions Using Drug Reviews with BERT+ Algorithm (BERT+ 알고리즘 기반 약물 리뷰를 활용한 약물 이상 반응 탐지)

  • Heo, Eun Yeong;Jeong, Hyeon-jeong;Kim, Hyon Hee
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.10 no.11
    • /
    • pp.465-472
    • /
    • 2021
  • In this paper, we present an approach for detection of adverse drug reactions from drug reviews to compensate limitations of the spontaneous adverse drug reactions reporting system. Considering negative reviews usually contain adverse drug reactions, sentiment analysis on drug reviews was performed and extracted negative reviews. After then, MedDRA dictionary and named entity recognition were applied to the negative reviews to detect adverse drug reactions. For the experiment, drug reviews of Celecoxib, Naproxen, and Ibuprofen from 5 drug review sites, and analyzed. Our results showed that detection of adverse drug reactions is able to compensate to limitation of under-reporting in the spontaneous adverse drugs reactions reporting system.

The legal status of the breast in assessing physical disability (신체장애 평가에서 유방의 법적 지위 - 장기 해당 여부, 수유장애, 노동력상실에 대하여 -)

  • Kim, Bong Kyum
    • The Korean Society of Law and Medicine
    • /
    • v.18 no.1
    • /
    • pp.265-295
    • /
    • 2017
  • Breast tissue is composed of skin, mammary gland(including lactiferous duct), subcutaneous fat layer. The anatomical position is on the anterior chest wall(the outside of the chest cavity) but not on the inside of the thorax. Therefore, when the internal organs in the thoracic cavity are defined and expressed as 'organs' and the internal organs of each are labeled for a long time, for the breast located outside the thoracic cavity, it is thought that there is considerable difficulty in defining and recognizing the breast tissue as organs. For this reason, it is necessary to discourage the controversy over whether or not the breast is contained in the chest(or intra-thoracic cavity). In order to completely exclude it, it is assumed that the "chest-abdomen" can be called the "intra-thoraxic or intra-abdominal." But it is difficult to change the terms in various laws and regulations, I think that it would be necessary to insert only the clue clause "Breasts are excluded" in the detailed criteria for grading. In order to include it, it is necessary to change the terms of the ordinance or to say that the breast is exceptionally included.

  • PDF

A System for converting natural language queries Into boolean queries for Information Retrieval (정보검색을 위한 자연언어 질의어의 불리언 질의로의 변환)

  • 서광준;최기선;나동열
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06c
    • /
    • pp.258-261
    • /
    • 1994
  • 자연언어 인터페이스는 초보자나 비숙련가의 입장에서는 새로운 시스템의 적응에 있어서 어떤 학습도 필요하지 않다는 장점이 있다. 이 연구에서는 불리언 질의를 처리하는 정보검색 시스템의 자연언어 인터페이스를 구혐하였다. 즉, 한국어 자연언어 질의를 불리언 질의로 변환해주는 시스템이다. 접근 방법은 먼저 자연언어 질의를 구문 해석한 후에, 그 결과인 문자의 의존 구조와 불용어 정보를 사용하여 기본적인 불리언 질의를 만든다음, 시소러스를 이용하여 불리언 질의를 확장한다. 여기에서 사용한 구문 해석 방법은 기존 문법에 기반한 방법이다. 변환 시스템은 SPARC-II 호환기종에서 구현되었으며, 약 5만 단어의 사전을 사용한다. 가공된 120 개의 질의를 대상으로 실험한 결과, 전체 소요시간은 13.5초가 걸렸다. 그리고, 변환된 불리언 연산식중에 110개가 적절하게 변환된 것으로 조사되었다.

  • PDF

A Study for Keyword Extraction Method (키워드 추출 기법에 관한 연구)

  • Shin, Seong-Yoon;Jeong, Kyong-Taek;Rhee, Yang-Won
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2009.01a
    • /
    • pp.463-466
    • /
    • 2009
  • 본 논문에서는 대량의 문제를 자동으로 분류하기 위하여 비감독 학습 기법에 의해 카테고리별 키워드를 구성하기 위한 방법을 제안하였다. 제안된 방법에서는 사전에 문제를 분류하지 않고 키워드를 추출하기 위하여 데이터마이닝 기법 중의 하나인 연관 규칙 탐사 알고리즘을 이용하였다. 먼저, 각 카테고리를 대표하는 핵심 키워드를 선정하고, 연관 규칙 탐사 알고리즘을 적용하여 각 핵심 키워드와 관련된 용어 집합을 추출한다.

  • PDF

The design and implementation of Manual XML DTD (교범 XML DTD 설계 및 구현)

  • Park, Se-Chul;Lee, Sang-Hoon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2001.10b
    • /
    • pp.1189-1192
    • /
    • 2001
  • XML을 이용하여 다른 조직이나 사용자간에 원활한 데이터 교환과 사용을 위해서는 공통적으로 사용할 수 있는 태그나 용어가 표준화되어야 한다. 현재 군 교범(야전교범 및 기술교범)을 XML을 이용하여 개발하려 노력하고 있으나, 사전연구가 미홉하고 표준이 정해지지 않은 상태에서 각 개별 기관별로 문서구조를 정의하고 태그를 사용함으로써 상호 호환성의 결여 및 차후 변환을 위한 낭비적 요소가 우려된다. 따라서 군 내부에서 XML을 적용한 문서 유형별 표준화가 시급히 요구되고 있다. 본 연구에서는 교범에 대한 XML 문서형정의를 설계하고 국방 표준 교범 XML DTD를 제안한다. 표준으로서의 문서형정의는 전자도서관에서 전문을 구축하는데 이용한 수 있을 뿐만 아니라 대화형 전자식 매뉴얼 구축에 기여할 수 있다.

  • PDF

A Study on the Systematic Compilation Method of Electrical Dictionaries (체계적인 전기용어사전 편찬방법론에 관한 연구)

  • Hwang, Sung-Wook;Kim, Jung-Hoon;Kwak, Hee-Ro
    • Proceedings of the KIEE Conference
    • /
    • 2000.07a
    • /
    • pp.581-583
    • /
    • 2000
  • So many terms of electrical engineering are nationalized words and Japanese words written in Chinese characters because electrical engineering is introduced from foreign countries. Many students who are not familiar to Chinese characters are difficult to study with this terms in the first step of electrical Engineering, In this study, the systematic compilation method of electrical dictionaries is proposed, which is based on the method of the standard Korean dictionary. Through this method, more systematic Korean electrical dictionaries will be compiled.

  • PDF

Onlotogy Modelling of Material Information for Offshore Plant (해양플랜트 기자재 정보의 온톨로지 모델링)

  • Park, Ho-Byung;Kim, Hyoung-Jean;Choe, Ji-Woong
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2007.05a
    • /
    • pp.550-553
    • /
    • 2007
  • 본 논문에서는 제품 정보를 교환 및 공유하기 위한 국제 표준인 ISO 15926에 근거한 해양 플랜트 기자재의 제품 정보의 온톨로지 모델링을 소개한다. 모델링 방법은 코어 데이터 모델과, 참조 데이터 라이브러리, 템플릿과 객체 정보 모델을 이용한다. 코어 데이터 모델은 보편적인 개념을 정의하고, 참조데이터 라이브러리는 코어 데이터 모델을 확장한 공통 용어 사전이다. 의미를 표현하는 가장 작은 조각으로 템플릿을 사용하고, 객체 정보 모델을 통하여 객체들 사이의 관계를 정의한다. 모델링은 OWL을 이용하여 제품 데이터의 온톨로지를 생성하여 이기종 소프트웨어 간의 제품 정보를 교환하고 공유하도록 한다.

  • PDF