• Title/Summary/Keyword: Entity Resolution

Search Result 39, Processing Time 0.021 seconds

A Study on the Integration of Recognition Technology for Scientific Core Entities (과학기술 핵심개체 인식기술 통합에 관한 연구)

  • Choi, Yun-Soo;Jeong, Chang-Hoo;Cho, Hyun-Yang
    • Journal of the Korean Society for information Management
    • /
    • v.28 no.1
    • /
    • pp.89-104
    • /
    • 2011
  • Large-scaled information extraction plays an important role in advanced information retrieval as well as question answering and summarization. Information extraction can be defined as a process of converting unstructured documents into formalized, tabular information, which consists of named-entity recognition, terminology extraction, coreference resolution and relation extraction. Since all the elementary technologies have been studied independently so far, it is not trivial to integrate all the necessary processes of information extraction due to the diversity of their input/output formation approaches and operating environments. As a result, it is difficult to handle scientific documents to extract both named-entities and technical terms at once. In order to extract these entities automatically from scientific documents at once, we developed a framework for scientific core entity extraction which embraces all the pivotal language processors, named-entity recognizer and terminology extractor.

A Study on the Integration of Information Extraction Technology for Detecting Scientific Core Entities based on Large Resources (대용량 자원 기반 과학기술 핵심개체 탐지를 위한 정보추출기술 통합에 관한 연구)

  • Choi, Yun-Soo;Cheong, Chang-Hoo;Choi, Sung-Pil;You, Beom-Jong;Kim, Jae-Hoon
    • Journal of Information Management
    • /
    • v.40 no.4
    • /
    • pp.1-22
    • /
    • 2009
  • Large-scaled information extraction plays an important role in advanced information retrieval as well as question answering and summarization. Information extraction can be defined as a process of converting unstructured documents into formalized, tabular information, which consists of named-entity recognition, terminology extraction, coreference resolution and relation extraction. Since all the elementary technologies have been studied independently so far, it is not trivial to integrate all the necessary processes of information extraction due to the diversity of their input/output formation approaches and operating environments. As a result, it is difficult to handle scientific documents to extract both named-entities and technical terms at once. In this study, we define scientific as a set of 10 types of named entities and technical terminologies in a biomedical domain. in order to automatically extract these entities from scientific documents at once, we develop a framework for scientific core entity extraction which embraces all the pivotal language processors, named-entity recognizer, co-reference resolver and terminology extractor. Each module of the integrated system has been evaluated with various corpus as well as KEEC 2009. The system will be utilized for various information service areas such as information retrieval, question-answering(Q&A), document indexing, dictionary construction, and so on.

Heuristic-based Korean Coreference Resolution for Information Extraction

  • Euisok Chung;Soojong Lim;Yun, Bo-Hyun
    • Proceedings of the Korean Society for Language and Information Conference
    • /
    • 2002.02a
    • /
    • pp.50-58
    • /
    • 2002
  • The information extraction is to delimit in advance, as part of the specification of the task, the semantic range of the output and to filter information from large volumes of texts. The most representative word of the document is composed of named entities and pronouns. Therefore, it is important to resolve coreference in order to extract the meaningful information in information extraction. Coreference resolution is to find name entities co-referencing real-world entities in the documents. Results of coreference resolution are used for name entity detection and template generation. This paper presents the heuristic-based approach for coreference resolution in Korean. We constructed the heuristics expanded gradually by using the corpus and derived the salience factors of antecedents as the importance measure in Korean. Our approach consists of antecedents selection and antecedents weighting. We used three kinds of salience factors that are used to weight each antecedent of the anaphor. The experiment result shows 80% precision.

  • PDF

Legislation Trend Referring to Burden of Proof in Medical Malpractice Lawsuit (의료과오소송 입증책임 관련 입법의 동향)

  • Cho, Hyong-Won
    • The Korean Society of Law and Medicine
    • /
    • v.9 no.1
    • /
    • pp.129-162
    • /
    • 2008
  • Nowadays it is important for us to resolute medical disputes. Because a high incidence of medical accidents may be brought about according to many chances of treatment in the operation of health insurance and increasing concern of patient health. Patients and medical doctors have plenty of difficulty in uncomfortable treatment circumstances of a high incidence of medical accidents. It is especially desirable that our society should prevent medical accidents and resolute speedy, fairly and rationally the happened medical disputes. Many legislations were suggested to resolute medical dispute. But legal issue points stress only speedy medical dispute resolution procedure and don't compromise fair and professional procedure. Accordingly these legal arguing points had not been accepted by the National Assembly and people. If the speedy resolution of medical dispute was demanded to solve unsafe treatment circumstances, it is necessitated that the legislation containing legal issue points to procedure is enacted. Of course the interest of patients and doctors to legal issue points must be balanced. Because an arguing points to the reversal of proof burden is consisted of the entity judgement in connection with setting the basis of resolution of medical dispute, the legislation to these is checked carefully.

  • PDF

Resolution of Context Anaphora for Text Summarization (문서요약을 위한 조응 대용 해결)

  • 김상수;김계성;노태길;이상조
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2002.10d
    • /
    • pp.679-681
    • /
    • 2002
  • 한 문서에서 동일한 개체(Entity)를 지칭하는 고유명사가 다른 형태로 출현하는 현상은 문서요약의 품질을 떨어지게 만드는 요소이다. 이런 문제를 해결하기 위해서는 각각의 고유명사 및 지칭어를 인식하고 이들간의 상관 관계를 밝혀야 한다. 본 논문에서는 이런 문제를 개체명 조응 대용 관계로 정의하고 출현 특성에 따라 분류한 후 특성에 맡는 처리 방법을 보인다. 이를 위하여 고유명사의 조응 출현 양상에 따른 휴리스틱을 만들고, 고유명사를 지칭하는 명사들의 시소러스를 구축한 후 이들을 처리하는 방법을 제안한다.

  • PDF

Design of 8K Broadcasting System based on MMT over Heterogeneous Networks

  • Sohn, Yejin;Cho, Minju;Paik, Jongho
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.8
    • /
    • pp.4077-4091
    • /
    • 2017
  • This paper presents the design of a broadcasting scenario and system for an 8K-resolution content. Due to an 8K content is four times larger than the 4K content in terms of size, many technologies such as content acquisition, video coding, and transmission are required to deal with it. Therefore, high-quality video and audio for 8K (ultra-high definition television) service is not possible to be transmitted only using the current terrestrial broadcasting system. The proposed broadcasting system divides the 8K content into four 4K contents by area, and each area is hierarchically encoded by Scalable High-efficiency Video Coding (SHVC) into three layers: L0, L1, and L2. Every part of the 8K video content divided into areas and hierarchy is independently treated. These parts are transmitted over heterogeneous networks such as digital broadcasting and broadband networks after going through several processes of generating signal messages, encapsulation, and packetization based on MPEG media transport. We propose three methods of generating streams at the sending entity to merge the divided streams into the original content at the receiving entity. First, we design the composition information, which defines the presentation structure for displays. Second, a descriptor for content synchronization is included in the signal message. Finally, we define the rules for generating "packet_id" among the packet header fields and design the transmission scheduler to acquire the divided streams quickly. We implement the 8K broadcasting system by adapting the proposed methods and show that the 8K-resolution contents are stably received and serviced with a low delay.

Coreference Resolution for Korean using Mention Pair with SVM (SVM 기반의 멘션 페어 모델을 이용한 한국어 상호참조해결)

  • Choi, Kyoung-Ho;Park, Cheon-Eum;Lee, Changki
    • KIISE Transactions on Computing Practices
    • /
    • v.21 no.4
    • /
    • pp.333-337
    • /
    • 2015
  • In this paper, we suggest a Coreference Resolution system for Korean using Mention Pair with SVM. The system introduced in this paper, also be able to extract Mention from document which is including automatically tagged name entity information, dependency trees and POS tags. We also built a corpus, including 214 documents with Coreference tags, referencing online news and Wikipedia for training the system and testing the system's performance. The corpus had 14 documents from online news, along with 200 question-and-answer documents from Wikipedia. When we tested the system by corpus, the performance of the system was extracted by MUC-F1 55.68%, B-cube-F1 57.19%, and CEAFE-F1 61.75%.

A Study on the Application Scope of Most-Favored Nation Treatment in the FTA Investment Provisions Based on the Arbitral Award Cases (FTA투자규정에 있어서 최혜국대우 조항의 적용범위에 관한 중재판정 사례연구)

  • Kim, Kyung-Bae
    • Journal of Arbitration Studies
    • /
    • v.20 no.1
    • /
    • pp.109-131
    • /
    • 2010
  • Investment Agreement is to be a part of FTA, as negotiating together both trade and investment. For example, it has a separate chapter about investment in KORUS FTA contract and is more detailed and inclusive than BIT contents which are traditional investment provisions. It is called to the investment norm of FT A. The investment agreement lures a foreign investment by providing the environment which is stable to the foreign investors. Hence, it plans in goal for the economic development of the home country. In international investment, the arbitration award cases are coming out to be divided into two parts applying MFN provisions in investor protective principles and dispute resolution process; the tendency of broad interpretation and the tendency of limited interpretation. In the case of RosInvest Co UK Ltd v. the Russian Federation awarded in 2007, the arbitration tribunal interprets that the application scope of MFN provisions contain the more lucrative dispute provision than other BITs without limitations in entity right of the investor. This judgment is the same view as arbitration tribunal position of Maffezini case. The arbitration tribunal of Plama case has kept out an assertion magnifying the arbitration tribunal's jurisdiction. That is, for applying more inclusive investor-nation resolution method from different treaty, tribunal mentioned that MFN provision had to see clearly a point of applying the investor-nation dispute resolution method. Dispute resolution process providing inclusive MFN provision has both the tendency of broad interpretation and the tendency of limited interpretation. It needs ceaselessly to do the monitoring about cases of arbitration award. In conclusion, the point where MFN provisions are applied conclusively is recognized, but it is still controversial whether or not to magnify the jurisdiction of arbitration tribunal applying MFN provisions. Therefore, it does not exist clear principle in the theory or in the award eases about the application scope for entity protection provision of MFN. Hence, The Korean government of Korea and local autonomous entities needs to keep their eyes on the trend of the international arbitration award cases in relation to the investment dispute for the future. Also, Korean government or local self-governing group must consider MFN provisions when they make a contract of international investment treaty such as writing concretely the application of MFN provisions from KORUS FTA.

  • PDF

The Current Situation and Improvement in International Commercial Arbitration in China (중국국제상사중재제도의 운용실태와 개선방안)

  • Choi Seok-Beom
    • Journal of Arbitration Studies
    • /
    • v.14 no.2
    • /
    • pp.135-172
    • /
    • 2004
  • While doing business in China foreign companies occasionally find themselves embroiled in disputes with Chinese individuals, companies or the Chinese Government. There are three primary ways to resolve a commercial dispute in China are negotiation, arbitration and litigation. The best way of dispute resolution is negotiation as it is the least expensive method and the working relationship of both parties concerned in dispute. But negotiations do not always give rise to resolution. Arbitration is the next choice. Unless the parties concerned can agree to resort to arbitration after the dispute has arisen, the underlying contract namely, sales contract or separate agreement must show that disputes will be resolved by arbitration. Agreements to arbitration specify arbitration body and governing law. There are two Chinese government -sponsored arbitration bodies for handling cases involving at least one foreign party: China International Economic and Trade Arbitration Commission(CIETAC) and China Maritime Arbitration Commission(CMAC) for maritime disputes. Contracts regarding foreign companies doing business in China often designate CIETAC arbitration. CIETAC distinguishes between two kinds of dispute resolutions, foreign-related arbitration and domestic arbitration. For a dispute to be classified as foreign-related arbitration, one of the companies must be a foreign entity without a major production facility or investment in China. CIETAC has published rules which govern the selection of a panel if the contract does not specify how the choice of arbitration will be handled. CIETAC's list of arbitrators for foreign-related disputes, from which CIETAC's arbitrators must en chosen, includes may non-Chines arbitrators. But many foreign experts believe that some aspects of CIETAC needs to be improved. The purpose of this paper is to improve the understanding of arbitration in China, CIETAC by way of studying the current situation and improvement of international commercial arbitration in China.

  • PDF

The Finality of Arbitral Awards: The U.S. Practices

  • Ha, Choong-Lyong
    • Journal of Arbitration Studies
    • /
    • v.30 no.3
    • /
    • pp.3-19
    • /
    • 2020
  • With the advent of the Free Trade Agreement between Korea and the U.S. and an increase in trade volume between the two countries, the possibility of commercial disputes has escalated among international merchants. It has been well-known that arbitration as an alternative dispute resolution is an efficient way to resolve international commercial disputes. When arbitral awards are enforced in the judicial system, the court will inevitably have to be involved with the enforcement procedures. The court is a typical legal entity to confirm arbitral awards. Through a confirmation process, the winning party obtains the same legal status of final judgment rendered by the court. However, a winning party in arbitration will have to overcome a legal hurdle in the enforcement process of arbitral awards. This article aims to investigate how the courts control the arbitration practices and what the basic legal issues in the enforcement of arbitral awards are. The US Federal Arbitration Act is investigated, while relevant cases are reviewed and updated for legal analysis.