Search | Korea Science

A Method to Solve the Entity Linking Ambiguity and NIL Entity Recognition for efficient Entity Linking based on Wikipedia (위키피디아 기반의 효과적인 개체 링킹을 위한 NIL 개체 인식과 개체 연결 중의성 해소 방법)

Lee, Hokyung;An, Jaehyun;Yoon, Jeongmin;Bae, Kyoungman;Ko, Youngjoong
- Journal of KIISE
- /
- v.44 no.8
- /
- pp.813-821
- /
- 2017
Entity Linking find the meaning of an entity mention, which indicate the entity using different expressions, in a user's query by linking the entity mention and the entity in the knowledge base. This task has four challenges, including the difficult knowledge base construction problem, multiple presentation of the entity mention, ambiguity of entity linking, and NIL entity recognition. In this paper, we first construct the entity name dictionary based on Wikipedia to build a knowledge base and solve the multiple presentation problem. We then propose various methods for NIL entity recognition and solve the ambiguity of entity linking by training the support vector machine based on several features, including the similarity of the context, semantic relevance, clue word score, named entity type similarity of the mansion, entity name matching score, and object popularity score. We sequentially use the proposed two methods based on the constructed knowledge base, to obtain the good performance in the entity linking. In the result of the experiment, our system achieved 83.66% and 90.81% F1 score, which is the performance of the NIL entity recognition to solve the ambiguity of the entity linking.
https://doi.org/10.5626/JOK.2017.44.8.813 인용 KSCI

Classifying Articles in Chinese Wikipedia with Fine-Grained Named Entity Types

Zhou, Jie;Li, Bicheng;Tang, Yongwang
- Journal of Computing Science and Engineering
- /
- v.8 no.3
- /
- pp.137-148
- /
- 2014
Named entity classification of Wikipedia articles is a fundamental research area that can be used to automatically build large-scale corpora of named entity recognition or to support other entity processing, such as entity linking, as auxiliary tasks. This paper describes a method of classifying named entities in Chinese Wikipedia with fine-grained types. We considered multi-faceted information in Chinese Wikipedia to construct four feature sets, designed different feature selection methods for each feature, and fused different features with a vector space using different strategies. Experimental results show that the explored feature sets and their combination can effectively improve the performance of named entity classification.
https://doi.org/10.5626/JCSE.2014.8.3.137 인용 PDF KSCI KPUBS

Named Entity Recognition with Structural SVMs and Pegasos algorithm (Structural SVMs 및 Pegasos 알고리즘을 이용한 한국어 개체명 인식)

Lee, Chang-Ki;Jang, Myun-Gil
- Korean Journal of Cognitive Science
- /
- v.21 no.4
- /
- pp.655-667
- /
- 2010
The named entity recognition task is one of the most important subtasks in Information Extraction. In this paper, we describe a Korean named entity recognition using structural Support Vector Machines (structural SVMs) and modified Pegasos algorithm. Using the proposed approach, we could achieve an 85.43% F1 and an 86.79% F1 for 15 named entity types on TV domain and sports domain, respectively. Moreover, we reduced the training time to 4% without loss of performance compared to Conditional Random Fields (CRFs).
PDF

Study of Export Platform's Activation Plan on Public Services (국내 수출기업 해외시장 진출을 위한 공공서비스 수출플랫폼 활성화 방안에 관한 연구)

Bae, Hong Kyun;Choi, Young Jun;Kang, Shin Won
- THE INTERNATIONAL COMMERCE & LAW REVIEW
- /
- v.61
- /
- pp.249-272
- /
- 2014
The purpose of this study is to seek the revitalization plans of export platform for supporting export enterprise of the domestic public services. To activate the export platform, alternatives to two aspects are required as follows. First of all, a governance plan of export platform should be established. At present, an operation entity of export platform, which is still developing stage, is not yet established. In this situation, even though export platform developed, it will faced with inefficient operation. Therefore, an operation entity and appropriate business model are important to the platform activation. second, to activate platform, it is necessary to support export process of life cycle. In particular, the service, which can support the business strategies setting and project implementation stage, is required. In addition, in order to activate the export platform is need to operate with he government's support program.
PDF

A case-based DSS to support enterprise data model development (전사적 데이터 모델 개발을 지원하는 사례기반 의사결정지원시스템)

박동진
- Proceedings of the Korean Operations and Management Science Society Conference
- /
- 1996.10a
- /
- pp.218-221
- /
- 1996
전사적 데이터 모델을 개발하기 위해서는, 먼저, 기업에 있어서 중요하게 관리되어져야 할 주요 entity들을 파악하는 것이 선행되어야 한다. entity의 결정은 시스템 개발 전 단계에 걸쳐 지대한 영향을 끼치는 중요한 의사결정이나, 아직까지 이는 매우 주관적일 뿐 아니라 의사결정자의 경험 및 전문성에 매우 의존적이다. 또한 때로는 entity의 결정에 필요 이상의 많은 시간이 소요되기도 한다. 본 연구에서는 entity결정에 직면한 의사결정자를 지원하기 위하여, 사례기반 추론 기술을 채택한 의사결정지원시스템을 설계 개발하였다. 본 시스템에서는 과거에 성공적으로 entity를 결정했었다고 평가되는 사례로부터, 해당 기업의 상황에 적합한 새로운 결론을 도출해서 의사결정자를 효과적으로 지원한다.
PDF

Relation Extraction Using Convolution Tree Kernel Expanded with Entity Features

Qian, Longhua;Zhou, Guodong;Zhu, Qiaomin;Qian, Peide
- Proceedings of the Korean Society for Language and Information Conference
- /
- 2007.11a
- /
- pp.415-421
- /
- 2007
This paper proposes a convolution tree kernel-based approach for relation extraction where the parse tree is expanded with entity features such as entity type, subtype, and mention level etc. Our study indicates that not only can our method effectively capture both syntactic structure and entity information of relation instances, but also can avoid the difficulty with tuning the parameters in composite kernels. We also demonstrate that predicate verb information can be used to further improve the performance, though its enhancement is limited. Evaluation on the ACE2004 benchmark corpus shows that our system slightly outperforms both the previous best-reported feature-based and kernel-based systems.
PDF

Representing and constructing liquefaction cycle alternatives for FLNG FEED using system entity structure concepts

Ha, Sol;Lee, Kyu-Yeul
- International Journal of Naval Architecture and Ocean Engineering
- /
- v.6 no.3
- /
- pp.598-625
- /
- 2014
To support the procedure for determining an optimal liquefaction cycle for FLNG FEED, an ontological modeling method which can automatically generate various alternative liquefaction cycles were carried out in this paper. General rules in combining equipment are extracted from existing onshore liquefaction cycles like C3MR and DMR cycle. A generic relational model which represents whole relations of the plant elements has all these rules, and it is expressed by using the system entity structure (SES), an ontological framework that hierarchically represents the elements of a system and their relationships. By using a process called pruning which reduces the SES to a candidate, various alternative relational models of the liquefaction cycles can be automatically generated. These alternatives were provided by XML-based formats, and they can be used for choosing an optimal liquefaction cycle on the basis of the assessments such as process simulation and reliability analysis.
https://doi.org/10.2478/IJNAOE-2013-0200 인용 PDF KSCI KPUBS

A study on the improvement of BCM industry through legal systems (BCM(재해경감활동관리)산업 활성화를 위한 법·제도 개선 방안 연구)

Han, Jong-U
- Disaster and Security
- /
- v.5 no.1
- /
- pp.93-100
- /
- 2015
Although many years passed since 'The Legislative bill on the support of voluntary activities of enterprises for disaster reduction'(hereinafter referred to as 'enterprise disaster reduction act') has been first enacted in 2007, BCMS is still not activated in our society. In contrast, after 911 Terror, importance of BCM is getting magnified and standardization research & institutionalization i s a lso proceeding i all over world. Lately, Disaster preventing activities is urgently needed like the sinking of 'Sewol ferry'. So the purpose of this paper is proposed for establishment of 'BCMS' and activation of the certificate system for Best-Run Business by analyzing the problem of 'enterprise disaster reduction act' and weak of activation as following. First, propel changing the policy of self-regulated participation to mandatory about the certificate system for Best-Run Business from public entity to government ministry and it is able to activate by propelling demo business of the certificate system for Best-Run Business. Second, public entity that has been given the certificate system for Best-Run Business by affiliating with Disaster Management Assessment of government management can be exempted from Disaster Management Assessment or those entity can arrange for connectivity acquisition method of 'Excellent rate'. Third, to publicize the activation of the law mentioned above, makes public entity r ecognizable by incorporating 'BCMS' into National safety management plan and establishment of National critical infrastructures security plan. Fourth, it should be reviewed to improving the related act regarding to inclusion of public organizations as well as private enterprises.
PDF

Conceptual Data Modeling: Entity-Relationship Models as Thinging Machines

Al-Fedaghi, Sabah
- International Journal of Computer Science & Network Security
- /
- v.21 no.9
- /
- pp.247-260
- /
- 2021
Data modeling is a process of developing a model to design and develop a data system that supports an organization's various business processes. A conceptual data model represents a technology-independent specification of structure of data to be stored within a database. The model aims to provide richer expressiveness and incorporate a set of semantics to (a) support the design, control, and integrity parts of the data stored in data management structures and (b) coordinate the viewing of connections and ideas on a database. The described structure of the data is often represented in an entity–relationship (ER) model, which was one of the first data-modeling techniques and is likely to continue to be a popular way of characterizing entity classes, attributes, and relationships. This paper attempts to examine the basic ER modeling notions in order to analyze the concepts to which they refer as well as ways to represent them. In such a mission, we apply a new modeling methodology (thinging machine; TM) to ER in terms of its fundamental building constructs, representation entities, relationships, and attributes. The goal of this venture is to further the understanding of data models and enrich their semantics. Three specific contributions to modeling in this context are incorporated: (a) using the TM model's five generic actions to inject processing in the ER structure; (b) relating the single ontological element of TM modeling (i.e., a thing/machine or thimac) to ER entities and relationships; and (c) proposing a high-level integrated, extended ER model that includes structural and time-oriented notions (e.g., events or behavior).
https://doi.org/10.22937/IJCSNS.2021.21.9.33 인용 PDF KSCI

Extending TextAE for annotation of non-contiguous entities

Lever, Jake;Altman, Russ;Kim, Jin-Dong
- Genomics & Informatics
- /
- v.18 no.2
- /
- pp.15.1-15.6
- /
- 2020
Named entity recognition tools are used to identify mentions of biomedical entities in free text and are essential components of high-quality information retrieval and extraction systems. Without good entity recognition, methods will mislabel searched text and will miss important information or identify spurious text that will frustrate users. Most tools do not capture non-contiguous entities which are separate spans of text that together refer to an entity, e.g., the entity "type 1 diabetes" in the phrase "type 1 and type 2 diabetes." This type is commonly found in biomedical texts, especially in lists, where multiple biomedical entities are named in shortened form to avoid repeating words. Most text annotation systems, that enable users to view and edit entity annotations, do not support non-contiguous entities. Therefore, experts cannot even visualize non-contiguous entities, let alone annotate them to build valuable datasets for machine learning methods. To combat this problem and as part of the BLAH6 hackathon, we extended the TextAE platform to allow visualization and annotation of non-contiguous entities. This enables users to add new subspans to existing entities by selecting additional text. We integrate this new functionality with TextAE's existing editing functionality to allow easy changes to entity annotation and editing of relation annotations involving non-contiguous entities, with importing and exporting to the PubAnnotation format. Finally, we roughly quantify the problem across the entire accessible biomedical literature to highlight that there are a substantial number of non-contiguous entities that appear in lists that would be missed by most text mining systems.
https://doi.org/10.5808/GI.2020.18.2.e15 인용 PDF KSCI

Search Result 136, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)