• Title/Abstract/Keywords: open information extraction

105 search results, processing time 0.019 seconds

A Distance Approach for Open Information Extraction Based on Word Vector

  • Liu, Peiqian;Wang, Xiaojie
    • KSII Transactions on Internet and Information Systems (TIIS), Vol. 12, No. 6, pp. 2470-2491, 2018
  • Web-scale open information extraction (Open IE) plays an important role in NLP tasks such as acquiring common-sense knowledge, learning selectional preferences, and automatic text understanding. A large number of Open IE approaches have been proposed in the last decade, and the majority of them are based on supervised learning or dependency parsing. In this paper, we present a novel method for web-scale open information extraction, which employs cosine distance based on Google word vectors as the confidence score of the extraction. The proposed method is a purely unsupervised learning algorithm that requires no hand-labeled training data or dependency parse features. We also present a mathematically rigorous proof for the new method using Bayesian inference and artificial neural network theory. It turns out that the proposed algorithm is equivalent to maximum likelihood estimation of the joint probability distribution over the elements of the candidate extraction. The proof itself also theoretically suggests a typical usage of word vectors for other NLP tasks. Experiments show that the distance-based method improves on recently presented Open IE systems on three benchmark datasets, in terms of both effectiveness and efficiency.
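The idea of scoring a candidate extraction by a cosine measure over word vectors can be illustrated with a minimal sketch. This is not the authors' algorithm: the 3-dimensional vectors, the `TOY_VECTORS` table, and the `confidence` helper are invented for illustration, and the abstract does not say how the elements of an extraction are combined into one score.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

# Hypothetical toy embeddings; a real system would load pretrained word vectors.
TOY_VECTORS = {
    "paris": [0.9, 0.1, 0.3],
    "capital": [0.8, 0.2, 0.4],
    "banana": [0.1, 0.9, 0.2],
}

def confidence(argument_word, relation_word, vectors=TOY_VECTORS):
    """Assumed scoring rule: similarity between an argument word and a relation word."""
    return cosine_similarity(vectors[argument_word], vectors[relation_word])

if __name__ == "__main__":
    print(confidence("paris", "capital"))
    print(confidence("banana", "capital"))
```

With these toy vectors, the pair ("paris", "capital") scores higher than ("banana", "capital"), which is the kind of ranking a confidence score is meant to provide.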

3-D Object Recognition Using a Feature Extraction Scheme: Open-Ball Operator

  • 김성수
    • 한국정보처리학회논문지, Vol. 6, No. 3, pp. 821-831, 1999
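  • In 3-D object recognition, recognizing objects that have both concave and convex parts is a very difficult problem. This paper proposes the Open-Ball operator as a feature extraction method for object recognition. The new method not only generates features that are invariant to the scale, translation, and rotation of an object, but also implements recognition of the compared objects as a measurement of their relative degree of similarity.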

Grammatical Structure Oriented Automated Approach for Surface Knowledge Extraction from Open Domain Unstructured Text

  • Tissera, Muditha;Weerasinghe, Ruvan
    • Journal of information and communication convergence engineering, Vol. 20, No. 2, pp. 113-124, 2022
  • News in the form of web data generates increasingly large amounts of information as unstructured text. The capability of understanding the meaning of news is limited to humans, which causes information overload and hinders the effective use of the knowledge embedded in such texts. Therefore, Automatic Knowledge Extraction (AKE) has become an integral part of the Semantic Web and Natural Language Processing (NLP). Although recent literature shows that AKE has progressed, the results still fall short of expectations. This study proposes a method to automatically extract surface knowledge from English news into a machine-interpretable semantic format (triples). The proposed technique was designed using the grammatical structure of the sentence, and 11 original rules were discovered. The initial experiment extracted triples from a Sri Lankan news corpus, of which 83.5% were meaningful. The experiment was extended to the British Broadcasting Corporation (BBC) news dataset to demonstrate its generic nature, where it achieved a higher meaningful triple extraction rate of 92.6%. These results were validated using the inter-rater agreement method, which indicated high reliability.
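A grammar-driven triple extractor of the general kind described above might look like the following minimal sketch. The abstract does not list the 11 rules, so the single rule here (nouns before the first verb group form the subject, the verb group forms the predicate, nouns after it form the object) is a hypothetical stand-in, and the input is assumed to be already tagged with Penn Treebank style part-of-speech tags.

```python
def extract_triple(tagged_tokens):
    """Return (subject, predicate, object) from POS-tagged tokens, or None.

    Hypothetical rule for illustration only: nouns (NN*) before the first
    verb group are the subject, the verb group (VB*) is the predicate, and
    nouns after it are the object. All other tags are ignored.
    """
    subject, predicate, obj = [], [], []
    state = "subject"
    for word, tag in tagged_tokens:
        if tag.startswith("VB"):
            if state == "object":
                break  # a second verb group starts a new clause; stop here
            state = "predicate"
            predicate.append(word)
        elif tag.startswith("NN"):
            if state == "predicate":
                state = "object"
            if state == "subject":
                subject.append(word)
            else:
                obj.append(word)
    if subject and predicate and obj:
        return (" ".join(subject), " ".join(predicate), " ".join(obj))
    return None

if __name__ == "__main__":
    sentence = [("The", "DT"), ("minister", "NN"), ("opened", "VBD"),
                ("the", "DT"), ("new", "JJ"), ("bridge", "NN")]
    print(extract_triple(sentence))
```

A real system would need many more rules (passives, prepositional objects, coordination) and a tagger or parser in front of it; the sketch only shows the shape of one rule.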

Automatic Recognition of Direction Information in Road Sign Image Using OpenCV

  • 김기홍;정규수;윤준희
    • 한국측량학회지, Vol. 31, No. 4, pp. 293-300, 2013
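  • Road signs are important facilities that ensure safe and smooth traffic by providing useful information to drivers. Systematic management of road signs requires building a database of their contents, but this work is done almost entirely by hand and therefore consumes considerable time and cost. This study proposes an algorithm for automatically recognizing and extracting direction information from road sign images. The algorithm was implemented with OpenCV and applied to road sign images. To extract the direction information automatically, image processing techniques such as image enhancement, image binarization, extraction of direction-indicator shape regions, feature point extraction, and template image matching were coded and applied, which confirmed the feasibility of automatic direction information recognition.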

Extracting meeting location from seminar and conference announcement in English

  • Kim, Anatoliy;Choi, Dong-Hyun;Choi, Key-Sun
    • 한국정보과학회:학술대회논문집, 한국정보과학회 2011 한국컴퓨터종합학술대회논문집 Vol.38 No.1(C), pp. 258-261, 2011
  • Living in the age of information, people face problems related to information overload. Information is easy to produce, store, and distribute through various communication channels, one of which is email. With the appearance of mobile devices such as smartphones and tablets, people can access their email inbox at any time from anywhere. In this paper we present an information extraction system with the specific goal of extracting the meeting location from seminar or conference announcements. We apply a machine learning method (conditional random fields, CRF), train the system on an annotated corpus of seminar and conference announcements, and validate the results by applying various extracted correction rules and patterns. Furthermore, we normalize the extracted location and reference it against geo-coding databases, OpenStreetMap, and Wikipedia resources to determine real geographical coordinates.

Development of Wireless License Plate Region Extraction Module Based on Raspberry Pi

  • 김동경;우종호
    • 한국멀티미디어학회논문지, Vol. 18, No. 10, pp. 1172-1179, 2015
  • A wireless license plate region extraction module is proposed for an LPR system controlling multiple gates. The module is implemented cheaply using the Raspberry Pi, which is open source and high performance. First, the upper 1/3 of the captured image is discarded because it carries no useful information about the license plate. Using the OpenCV libraries, the edge image is obtained with the Canny algorithm after applying Gaussian filtering to the gray image, and labeling is conducted for the 4 consecutive numbers in the license plate. These numbers are located using various decision equations, and the final license plate region is extracted by expanding the number region. The result image is transferred to the server using Wi-Fi Direct. The proposed module makes it easy to set up and maintain the LPR system. Experimental results showed a successful extraction rate of 98.4% on 500 car images with 640 × 480 resolution.
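The first preprocessing step, discarding the upper third of the frame, can be sketched in plain Python. This is only an illustration under stated assumptions: the image is represented as a list of pixel rows, the function name `discard_upper_third` is hypothetical, and the later stages (Gaussian filtering, Canny edge detection, labeling) that the abstract attributes to OpenCV are not shown.

```python
def discard_upper_third(image_rows):
    """Drop the top third of an image given as a list of pixel rows.

    Assumes row 0 is the top of the frame; integer division decides the cut,
    so a 480-row frame loses its first 160 rows.
    """
    cut = len(image_rows) // 3
    return image_rows[cut:]

if __name__ == "__main__":
    # Synthetic 640 x 480 gray frame where every pixel stores its row index.
    frame = [[y] * 640 for y in range(480)]
    roi = discard_upper_third(frame)
    print(len(roi), len(roi[0]), roi[0][0])
```

Cropping before filtering reduces the number of pixels the later stages must process, which matters on a low-cost board.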

Equivalent Circuit Model Parameter Extraction for Packaged Bipolar Transistors

  • 이성현
    • 대한전자공학회논문지SD, Vol. 41, No. 12, pp. 21-26, 2004
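  • This paper develops a method for directly extracting the RF equivalent circuit model of a packaged BJT without an optimization process. First, the parasitic components of the plastic package were accurately removed from the measured S-parameters using open and short package structures. From these package de-embedded S-parameters, a simple method was established for directly extracting the bonding wire inductance between the package lead and the chip pad, as well as the chip pad capacitance. The small-signal model parameters of the internal BJT device were then determined using Z- or Y-parameter equations derived from the RF equivalent circuit. The S-parameters of the packaged BJT modeled in this way agreed very well with the measured data, which demonstrates the accuracy of the new extraction method.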

GEO-MAPPING MASHUPS USING OPEN APIS: THE STATE-OF-THE-ART AND AN APPLICATION FOR GEO-SPATIAL WEB

  • Park, Yong-Jae;Lee, Ki-Won
    • 대한원격탐사학회:학술대회논문집, 대한원격탐사학회 2008 International Symposium on Remote Sensing, pp. 45-48, 2008
  • Since the mid-2000s, the so-called Web 2.0 paradigm has emerged and been widely extended to actual Web markets with supporting Web technologies. Among Web 2.0 services, mashups with Open APIs are regarded as one of the core technologies. Web mapping is also the most dominant mashup application among Open API-based applications, so various mashups with the Google Map API and other Open APIs are being developed. In this paper, the current technological status is explained together with the KML web GIS file format. The prototype is a web-based GIS layer authoring system for general users that requires no GIS tools or external spatial databases. This mashup application will be used for web-based feature extraction for the geo-spatial web by public users.

Parallelization of Feature Detection and Panorama Image Generation using OpenCL and Embedded GPU

  • 강승헌;이승재;이만희;박인규
    • 방송공학회논문지, Vol. 19, No. 3, pp. 316-328, 2014
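  • This paper proposes a method that parallelizes image feature extraction algorithms (SIFT, SURF) on a recent embedded GPU and uses the feature extraction and matching results to generate panorama images at high speed on the GPU. The GPGPU (general purpose computation on GPU) implementation of the parallelized algorithms uses OpenCL, which the embedded GPUs of recent smartphones have begun to support. The paper compares an existing GPU parallelization based on the OpenGL Shading Language (GLSL) with the new OpenCL-based parallelization in terms of effective code implementation and execution speed. In the experiments, OpenCL showed execution speed similar to GLSL and ran about 3 to 4 times faster than the embedded CPU. As an application of the implemented feature extraction, image registration through feature matching is executed in parallel on the GPU to generate a panorama image from multiple images at high speed.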

AUTOMATIC DETECTION OF NARROW OPEN WATER STREAMS IN AMAZON FORESTS FROM JERS-1 SAR IMAGERY

  • Amano, Takako-Sakurai;Iisaka, Joji;Kamiyama, Masataka;Takagi, Mikio
    • 대한원격탐사학회:학술대회논문집, 대한원격탐사학회 1999 Proceedings of International Symposium on Remote Sensing, pp. 310-315, 1999
  • We extracted narrow open water streams from JERS-1 SAR images of the Amazon rain forest. The extracted range of these streams was almost comparable to a high-level extraction of the same streams from near-IR images of JERS-1 VNIR data, even though these features in SAR images show a strong dependence on the observation angle. Large water bodies are relatively easy to extract from JERS-1 SAR images, as they tend to appear as very dark areas, but streams whose width is nearly equal to or less than the spatial resolution no longer appear as very dark features. By using strong scatterers distributed sparsely along the radar-facing sides of the streams, we can successfully estimate the approximate ranges of waterways and then extract relatively dark line-like features within these ranges.
