• Title/Abstract/Keywords: Automatic Extraction

Search results: 879 items (processing time: 0.022 s)

Grammatical Structure Oriented Automated Approach for Surface Knowledge Extraction from Open Domain Unstructured Text

  • Tissera, Muditha;Weerasinghe, Ruvan
    • Journal of information and communication convergence engineering / Vol. 20, No. 2 / pp.113-124 / 2022
  • News published as web data generates increasingly large amounts of information in the form of unstructured text. Because only humans can understand the meaning of news, this causes information overload and hinders the effective use of the knowledge embedded in such texts. Automatic Knowledge Extraction (AKE) has therefore become an integral part of the Semantic Web and Natural Language Processing (NLP). Although recent literature shows that AKE has progressed, its results still fall short of expectations. This study proposes a method to automatically extract surface knowledge from English news into a machine-interpretable semantic format (triples). The proposed technique was designed using the grammatical structure of the sentence, and 11 original rules were discovered. In the initial experiment, triples were extracted from a Sri Lankan news corpus, of which 83.5% were meaningful. The experiment was then extended to the British Broadcasting Corporation (BBC) news dataset to demonstrate the method's generic nature; this yielded a higher meaningful-triple extraction rate of 92.6%. These results were validated using the inter-rater agreement method, which confirmed their high reliability.
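
The abstract above does not say which inter-rater agreement statistic was used. As a minimal sketch, assuming two raters and Cohen's kappa (an assumption, not the authors' stated method), agreement on whether each extracted triple is meaningful could be computed as follows. The function name and the ratings are hypothetical.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of items")
    n = len(rater_a)
    # Observed agreement: fraction of items on which the raters agree.
    p_observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: derived from each rater's marginal label frequencies.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    p_chance = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if p_chance == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (p_observed - p_chance) / (1.0 - p_chance)


# Hypothetical judgements on ten extracted triples (1 = meaningful, 0 = not).
rater_a = [1, 1, 1, 0, 1, 1, 0, 1, 1, 1]
rater_b = [1, 1, 1, 0, 1, 0, 0, 1, 1, 1]
kappa = cohen_kappa(rater_a, rater_b)
print(f"kappa = {kappa:.3f}")
```

Kappa corrects the raw agreement rate for the agreement expected by chance, which matters when most triples fall into one class, as they would with meaningful-triple rates above 80%.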

Extraction of Informative Features for Automatic Indexation of Human Sensibility Ergonomic Documents

  • 배희숙;곽현민;채균식;이상태
    • 감성과학 / Vol. 7, No. 2 / pp.133-140 / 2004
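  • To turn the large volume of recently published human sensibility ergonomics research results and papers into valuable resources, sensibility data must be organized into indices so that it can be used across industry. Based on the sensibility data indices produced by the project "A Study on the Construction and Dissemination of a Web-Based Sensibility Database," this paper proposes automating the indexing process so that the large amounts of sensibility ergonomics data expected in the future can be indexed continuously. Noting that indexing document data resembles automatic summarization, we analyze a sensibility ergonomics corpus to study the techniques underlying an automatic indexing system: extraction of information types and key terms, and extraction of informative sentences through feature expressions. The results can be applied to knowledge management systems and automatic summarization systems in the field of sensibility ergonomics.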

FPGA-Based Hardware Accelerator for Feature Extraction in Automatic Speech Recognition

  • Choo, Chang;Chang, Young-Uk;Moon, Il-Young
    • Journal of information and communication convergence engineering / Vol. 13, No. 3 / pp.145-151 / 2015
  • This paper describes a hardware-based scheme for improving the speed of a real-time automatic speech recognition (ASR) system by designing a parallel feature extraction algorithm on a Field-Programmable Gate Array (FPGA). A computationally intensive block in the algorithm is identified and implemented in hardware logic on the FPGA. One such block is the mel-frequency cepstrum coefficient (MFCC) algorithm used in the feature extraction process. We demonstrate that the FPGA platform can perform feature extraction in the speech recognition system more efficiently than general-purpose CPUs, including the ARM processor. The Xilinx Zynq-7000 System on Chip (SoC) platform is used for the MFCC implementation. With this implementation, we confirmed that the FPGA platform is approximately 500× faster than a sequential CPU implementation and 60× faster than a sequential ARM implementation. We thus verified that a parallelized and optimized MFCC architecture on the FPGA platform can significantly improve the execution time of an ASR system compared with the CPU and ARM platforms.

Automatic Road Extraction by Gradient Direction Profile Algorithm (GDPA) using High-Resolution Satellite Imagery: Experiment Study

  • Lee, Ki-Won;Yu, Young-Chul;Lee, Bong-Gyu
    • 대한원격탐사학회지 / Vol. 19, No. 5 / pp.393-402 / 2003
  • With commercial high-resolution satellite imagery now available for civil use, applications of remote sensing have extended widely into new fields and problems beyond traditional application domains. Transportation applications of this sensor data, particularly automatic or semiautomatic road extraction, are regarded as an important issue in the use of remote sensing imagery. In line with these trends, this study focuses on automatic road extraction using the Gradient Direction Profile Algorithm (GDPA) scheme with IKONOS panchromatic imagery at 1 meter resolution. The GDPA scheme and its main modules were reviewed together with their processing steps and implemented as prototype software. Using the extracted bi-level image and ground truth from an actual GIS layer, overall accuracy evaluation and ranking error assessment were performed. The results show that road information can be extracted automatically; however, some user-defined variables must be chosen carefully when using high-resolution satellite imagery in dense or low-contrast areas. The GDPA method also needs additional processing, because its direct results do not produce high overall accuracy or ranking values. The main advantages of the GDPA scheme for road feature extraction are its performance and further applicability. This experimental study can be extended to practical application fields related to remote sensing.

A Study on the Extraction of Linear Features from Satellite Images and Automatic GCP Filing

  • 김정기;강치우;박래홍;이쾌희
    • 대한원격탐사학회지 / Vol. 5, No. 2 / pp.133-145 / 1989
  • This paper describes an implementation of linear feature extraction algorithms for satellite images and a method of automatic GCP (Ground Control Point) filing using the extracted linear features. We propose a new linear feature extraction algorithm that uses the magnitude and direction information of edges. The results of applying the proposed algorithm to satellite images are presented and compared with those of other algorithms. Using the proposed algorithm, automatic GCP filing was performed successfully.

A New Framework for Automatic Extraction of Key Frames Using DC Image Activity

  • Kim, Kang-Wook
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 8, No. 12 / pp.4533-4551 / 2014
  • The effective extraction of key frames from a video stream is an essential task for summarizing and representing the content of a video. Accordingly, this paper proposes a new and fast method for extracting key frames from a compressed video. In the proposed approach, after the entire video sequence has been segmented into elementary content units, called shots, key frame extraction is performed by first assigning a number of key frames to each shot and then distributing the key frames over the shot using a probabilistic approach to locate their optimal positions. Moreover, we implement the proposed framework on Android to confirm its validity, availability, and usefulness. The main advantage of the proposed method is that no time-consuming computations are needed to distribute the key frames within the shots, and the procedure for key frame extraction is completely automatic. Furthermore, the set of key frames is independent of any subjective thresholds or manually set parameters.

Automatic extraction of golf swing features using a single Kinect

  • 김병기
    • 한국컴퓨터정보학회논문지 / Vol. 19, No. 12 / pp.197-207 / 2014
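  • This paper proposes an efficient method for automatically extracting the swing features needed for automatic golf swing analysis using a single Kinect, a practical TOF camera. The proposed method uses the joint information and depth information provided by the Kinect to automatically extract seven key frames that are important in a golf swing, along with the important swing features in each key frame. Performance was verified on data from 50 swings collected from 10 golfers. The proposed method is significant in that it enables meaningful 3D golf swing feature extraction in an environment that is simple to set up and inexpensive, and, because it automatically reports concrete numerical values, it can be used to develop practical self-analysis systems for golf swings.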

Feature Extraction for Automatic Golf Swing Analysis by Image Processing

  • 김병기
    • 한국컴퓨터정보학회논문지 / Vol. 11, No. 5 / pp.53-58 / 2006
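  • This paper proposes a feature extraction method for automatic golf swing analysis using image processing techniques. Whereas most existing swing analysis systems require an expert such as a golf coach, the proposed method can extract important swing features without expert help. The extracted features include not only key frames such as addressing, backswing, swing top, forward swing, impact, and follow-through, but also the positions of the golfer's body parts and the club, such as the hands, shoulders, club head, feet, and knees. Experiments on swing videos confirmed that the proposed method is useful for extracting important golf swing features.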

A Feature Vector Extraction Method for the Automatic Classification of Power Quality Disturbances

  • 이철호;이재상;조관영;정지현;남상원
    • 대한전기학회: Conference Proceedings / 대한전기학회 1996 Fall Conference Proceedings, Society Headquarters / pp.404-406 / 1996
  • The objective of this paper is to present a new feature-vector extraction method for the automatic detection and classification of power quality (PQ) disturbances, in which the FFT, the DWT (Discrete Wavelet Transform), and data compression are used to extract an appropriate feature vector. The proposed classifier consists of three parts: (i) automatic detection of PQ disturbances, in which the wavelet transform and a signal power estimation method are used to detect each disturbance; (ii) feature vector extraction from the detected disturbance; and (iii) automatic classification, in which a Multi-Layer Perceptron (MLP) classifies each disturbance from the corresponding extracted feature vector. To demonstrate the performance and applicability of the proposed classification algorithm, test results obtained by analyzing seven classes of power quality disturbances generated by the EMTP are also provided.
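
The feature-vector step described above can be illustrated with a small sketch. It assumes a Haar wavelet and per-level detail energies as features; the abstract names only FFT, DWT, and data compression, so the wavelet, the feature definition, the function names, and the test signal here are hypothetical choices rather than the authors' design.

```python
import math


def haar_dwt_step(signal):
    """One level of the orthonormal Haar DWT: returns (approximation, detail)."""
    if len(signal) % 2:
        raise ValueError("signal length must be even")
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail


def dwt_energy_features(signal, levels):
    """Detail-coefficient energy at each level, then the final approximation energy."""
    features = []
    current = list(signal)
    for _ in range(levels):
        current, detail = haar_dwt_step(current)
        features.append(sum(d * d for d in detail))
    features.append(sum(a * a for a in current))
    return features


# Hypothetical test signal: one sinusoid cycle over 32 samples, plus a short spike.
n = 32
clean = [math.sin(2 * math.pi * k / n) for k in range(n)]
disturbed = list(clean)
disturbed[10] += 0.8  # assumed impulsive disturbance at sample 10

f_clean = dwt_energy_features(clean, 3)
f_dist = dwt_energy_features(disturbed, 3)
print("clean:    ", [round(v, 4) for v in f_clean])
print("disturbed:", [round(v, 4) for v in f_dist])
```

Because the Haar transform used here is orthonormal, the features sum to the total signal energy, and a short spike shows up mainly as extra energy in the finest detail level. A vector of this kind is the sort of input a classifier such as an MLP could consume.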

An Ontology-based Knowledge Management System - Integrated System of Web Information Extraction and Structuring Knowledge -

  • Mima, Hideki;Matsushima, Katsumori
    • 한국전자거래학회: Conference Proceedings / 한국전자거래학회 2005 e-Biz World Conference 2005 / pp.55-61 / 2005
  • We introduce a new web-based knowledge management system, currently in progress, in which XML-based web information extraction and our knowledge structuring technologies are combined using ontology-based natural language processing. Our aim is to provide efficient access to heterogeneous information on the web, enabling users to draw effortlessly on a wide range of textual and non-textual resources, such as newspapers and databases, to accelerate knowledge acquisition from such sources. To achieve efficient knowledge management, we first propose an XML-based Web information extraction method that includes a sophisticated control language for extracting data from Web pages. By using standard XML technologies, our approach makes information extraction easy through a) detaching rules from processing, b) restricting the target of processing, and c) interactive operations for developing extraction rules. We then propose a knowledge structuring system that includes 1) automatic term recognition, 2) domain-oriented automatic term clustering, 3) similarity-based document retrieval, 4) real-time document clustering, and 5) visualization. The system supports integrating different types of databases (textual and non-textual) and retrieving different types of information simultaneously. Through further explanation of the specification and implementation technique of the system, we demonstrate how it can accelerate knowledge acquisition on the Web even for novice users of the field.
