• 제목/요약/키워드: extraction techniques

검색결과 888건 처리시간 0.036초

A Dependency Graph-Based Keyphrase Extraction Method Using Anti-patterns

  • Batsuren, Khuyagbaatar;Batbaatar, Erdenebileg;Munkhdalai, Tsendsuren;Li, Meijing;Namsrai, Oyun-Erdene;Ryu, Keun Ho
    • Journal of Information Processing Systems
    • /
    • 제14권5호
    • /
    • pp.1254-1271
    • /
    • 2018
  • Keyphrase extraction is one of fundamental natural language processing (NLP) tools to improve many text-mining applications such as document summarization and clustering. In this paper, we propose to use two novel techniques on the top of the state-of-the-art keyphrase extraction methods. First is the anti-patterns that aim to recognize non-keyphrase candidates. The state-of-the-art methods often used the rich feature set to identify keyphrases while those rich feature set cover only some of all keyphrases because keyphrases share very few similar patterns and stylistic features while non-keyphrase candidates often share many similar patterns and stylistic features. Second one is to use the dependency graph instead of the word co-occurrence graph that could not connect two words that are syntactically related and placed far from each other in a sentence while the dependency graph can do so. In experiments, we have compared the performances with different settings of the graphs (co-occurrence and dependency), and with the existing method results. Finally, we discovered that the combination method of dependency graph and anti-patterns outperform the state-of-the-art performances.

Intelligent Methods to Extract Knowledge from Process Data in the Industrial Applications

  • Woo, Young-Kwang;Bae, Hyeon;Kim, Sung-Shin;Woo, Kwang-Bang
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제3권2호
    • /
    • pp.194-199
    • /
    • 2003
  • Data are an expression of the language or numerical values that show some features. And the information is extracted from data for the specific purposes. The knowledge is utilized as information to construct rules that recognize patterns or make a decision. Today, knowledge extraction and application of that are broadly accomplished for the easy comprehension and the performance improvement of systems in the several industrial fields. The knowledge extraction can be achieved by some steps that include the knowledge acquisition, expression, and implementation. Such extracted knowledge is drawn by rules with data mining techniques. Clustering (CL), input space partition (ISP), neuro-fuzzy (NF), neural network (NN), extension matrix (EM), etc. are employed for the knowledge expression based upon rules. In this paper, the various approaches of the knowledge extraction are surveyed and categorized by methodologies and applied industrial fields. Also, the trend and examples of each approaches are shown in the tables and graphes using the categories such as CL, ISP, NF, NN, EM, and so on.

특징 추출과 검출 오차 최소화 알고리듬을 이용한 회전기계의 결함 진단 (Fault Diagnosis for Rotating Machine Using Feature Extraction and Minimum Detection Error Algorithm)

  • 정의필;조상진;이재열
    • 한국소음진동공학회논문집
    • /
    • 제16권1호
    • /
    • pp.27-33
    • /
    • 2006
  • Fault diagnosis and condition monitoring for rotating machines are important for efficiency and accident prevention. The process of fault diagnosis is to extract the feature of signals and to classify each state. Conventionally, fault diagnosis has been developed by combining signal processing techniques for spectral analysis and pattern recognition, however these methods are not able to diagnose correctly for certain rotating machines and some faulty phenomena. In this paper, we add a minimum detection error algorithm to the previous method to reduce detection error rate. Vibration signals of the induction motor are measured and divided into subband signals. Each subband signal is processed to obtain the RMS, standard deviation and the statistic data for constructing the feature extraction vectors. We make a study of the fault diagnosis system that the feature extraction vectors are applied to K-means clustering algorithm and minimum detection error algorithm.

KOMPSAT-2 영상을 이용한 토지피복정보 자동 추출 (Automatic Extraction of Land Cover information By Using KOMPSAT-2 Imagery)

  • 이현직;유지호;유영걸
    • 한국측량학회:학술대회논문집
    • /
    • 한국측량학회 2010년 춘계학술발표회 논문집
    • /
    • pp.277-280
    • /
    • 2010
  • There is a need to convert the old low- or medium-resolution satellite image-based thematic mapping to the high-resolution satellite image-based mapping of GSD 1m grade or lower. There is also a need to generate middle- or large-scale thematic maps of 1:5,000 or lower. In this study, the DEM and orthoimage is generated with the KOMPSAT-2 stereo image of Yuseong-gu, Daejeon Metropolitan City. By utilizing the orthoimage, automatic extraction experiments of land cover information are generated for buildings, roads and urban areas, raw land(agricultural land), mountains and forests, hydrosphere, grassland, and shadow. The experiment results show that it is possible to classify, in detail, for natural features such as the hydrosphere, mountains and forests, grassland, shadow, and raw land. While artificial features such as roads, buildings, and urban areas can be easily classified with automatic extraction, there are difficulties on detailed classifications along the boundaries. Further research should be performed on the automation methods using the conventional thematic maps and all sorts of geo-spatial information and mapping techniques in order to classify thematic information in detail.

  • PDF

사상채질 분류를 위한 안면부내 특징 요소 추출 (Facial Features Extraction for Sasang Constitution Classification)

  • 배나영;안택원;조동욱;이화섭
    • 사상체질의학회지
    • /
    • 제17권2호
    • /
    • pp.46-51
    • /
    • 2005
  • 1. Objectives The purpose of this study is to objectify the diagnosis of Sasang Constitution. Using the methods of this study, it will improve to classificate Sasang Constitution. 2. Methods 1) Automatic feature extraction of human frontal faces for Sasang Constitution classification. 2) Color feature extraction of human frontal faces (1)Erosion filtering (skin-white, the other-black) (2) Median median 3. Results and Conclusions Observing a person's shape has been the major method for Sasang Constitution classification, which usually has been dependent upon doctor's intuition as of these days. We are developing an automatic system which provides objective basic data for Sasang Constitution classification. For this, in this paper, firstly, the signal processing techniques are applied to automatic feature extraction of human frontal faces for Sasang Constitution classification. The experiment is conducted to verify the effectiveness of the proposed system.

  • PDF

원격탐사자료에 의한 해남지역 비금속광상 및 관련 특성 추출을 위한 연구 (A Study on Extraction of Non-metallic Ore Deposits from Remote Sensing Data of the Haenam Area)

  • 박인석;박종남
    • 대한원격탐사학회지
    • /
    • 제8권2호
    • /
    • pp.105-123
    • /
    • 1992
  • A study was made on the feature extraction for non-metallic one deposits and their related geology using the Remote Sensing and Airborne Radiometric data. The area chosen is around the Haenam area, where dickite and Quarzite mines are distributed in. The geology of the area consists mainly of Cretaceous volcanics and PreCambrian metamorphic. The methods applied are study on the reflectance characteristics of minerals and rocks sampled in the study area, and the feature extraction extraction of histogram normalized images for Landsat TM and Airborne Radiometric data, and finally evaluation of applicability of some useful pattern recognition techniques for regional lithological mapping. As a result, reflectances of non-metallic minerals are much higher than rock samples in the area. However, low grade dickites are slightly higher than rock samples, probably due to their greyish colour and also their textural features which may scatter the reflectance and may be capable of capturing much hychoryl ions. The reflectances of rock samples may depend on the degree of whiteness of samples. The outcrops or mine dumps in the study area were most effectively extracted on the histogram normalized image of TM Band 1, 2 and 3, due to their high reflectivity. The Masking technique using the above bands may be the most effective and the natural colour composite may provide some success as well. The colour composite image of PCA may also be effective in extracting geological features, and airborne radiometric data may be useful to some degree as an complementary tool.

Prediction of Protein-Protein Interactions from Sequences using a Correlation Matrix of the Physicochemical Properties of Amino Acids

  • Kopoin, Charlemagne N'Diffon;Atiampo, Armand Kodjo;N'Guessan, Behou Gerard;Babri, Michel
    • International Journal of Computer Science & Network Security
    • /
    • 제21권3호
    • /
    • pp.41-47
    • /
    • 2021
  • Detection of protein-protein interactions (PPIs) remains essential for the development of therapies against diseases. Experimental studies to detect PPI are longer and more expensive. Today, with the availability of PPI data, several computer models for predicting PPIs have been proposed. One of the big challenges in this task is feature extraction. The relevance of the information extracted by some extraction techniques remains limited. In this work, we first propose an extraction method based on correlation relationships between the physicochemical properties of amino acids. The proposed method uses a correlation matrix obtained from the hydrophobicity and hydrophilicity properties that it then integrates in the calculation of the bigram. Then, we use the SVM algorithm to detect the presence of an interaction between 2 given proteins. Experimental results show that the proposed method obtains better performances compared to the approaches in the literature. It obtains performances of 94.75% in accuracy, 95.12% in precision and 96% in sensitivity on human HPRD protein data.

CutPaste-Based Anomaly Detection Model using Multi Scale Feature Extraction in Time Series Streaming Data

  • Jeon, Byeong-Uk;Chung, Kyungyong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제16권8호
    • /
    • pp.2787-2800
    • /
    • 2022
  • The aging society increases emergency situations of the elderly living alone and a variety of social crimes. In order to prevent them, techniques to detect emergency situations through voice are actively researched. This study proposes CutPaste-based anomaly detection model using multi-scale feature extraction in time series streaming data. In the proposed method, an audio file is converted into a spectrogram. In this way, it is possible to use an algorithm for image data, such as CNN. After that, mutli-scale feature extraction is applied. Three images drawn from Adaptive Pooling layer that has different-sized kernels are merged. In consideration of various types of anomaly, including point anomaly, contextual anomaly, and collective anomaly, the limitations of a conventional anomaly model are improved. Finally, CutPaste-based anomaly detection is conducted. Since the model is trained through self-supervised learning, it is possible to detect a diversity of emergency situations as anomaly without labeling. Therefore, the proposed model overcomes the limitations of a conventional model that classifies only labelled emergency situations. Also, the proposed model is evaluated to have better performance than a conventional anomaly detection model.

제련소 주변토양 중금속 존재형태 (Existing Forms of Heavy Metals in the Vicinity of a Smelter)

  • 우상덕;김건하;김영진;남경필
    • 한국지하수토양환경학회지:지하수토양환경
    • /
    • 제15권5호
    • /
    • pp.16-22
    • /
    • 2010
  • Heavy metals in soils exist in various forms dependent upon surrounding conditions. As the Janghang smelter area is of concern for its high elevated heavy metal concentrations, Korean government decided to remediate the area. Main objectives of this research were; to analyze heavy metal concentrations and their existing forms in the vicinity of the smelter; and to understand differences made by analysis techniques of heavy metals. Top soils of rice field, crop field, bare field, and forestry in the area were sampled and analyzed for their physicochemical characteristics. Concentrations of Cu, Cd, Pb, and As were analyzed with two pretreatment techniques adopted using 0.1 N HCl and aqua regia. To analyze existing forms of heavy metals, Tessier's schemes for sequential extraction technique were adopted. Exchangeable fraction and carbonate bound fraction of heavy metals may pose potential threat to environment and were in the order of Pb > As > Cu > Cd. If assessing mobile fraction of heavy metals by land uses, the order was forestry > bare land > crop field > rice field. When analyzed using Tessier's scheme, high ratio of residual fractions to total arsenic concentration should be considered for remediation design of the area.

신호처리 기술에 의한 부분방전 방사전자파의 특징 추출 (The Feature Extraction of Partial Discharge Electromagnetic Wave utilizing Signal Processing Techniques)

  • 이현동;이광식
    • 조명전기설비학회논문지
    • /
    • 제16권1호
    • /
    • pp.44-49
    • /
    • 2002
  • 최근 고전압 전력기기에서의 부분방전을 측정하기 위한 다양한 절연진단 기술들이 소개되었다. 부분방전 신호는 아주 미약하고 주변환경의 여러잡음에 쉽게 영향을 받으므로 주위 노이즈와의 구별이 어려운 실정이다. 본 논문에서는 부분방전 검출법중 부분방전에 의해 방사되는 전자파를 안테나로 측정하는 방사전자파법을 이용하여 변전소 구내의 배경잡음과 실험실내의 모의 부분방전을 방사전자파법에 의해 측정분석하였다. 또한 간섭신호와 모의 부분방전시 방사되는 방사전자파의 특징을 추출하고, 그 인식을 위하여 웨이브렛 패킷 변환을 이용하였다. 그 결과 간섭신호와 부분방전의 특정주파수대역의 시간정보 특징으로 그 차이를 구별할 수 있었다.