• 제목/요약/키워드: Information Mining

검색결과 3,350건 처리시간 0.032초

Text Mining in Online Social Networks: A Systematic Review

  • Alhazmi, Huda N
    • International Journal of Computer Science & Network Security
    • /
    • 제22권3호
    • /
    • pp.396-404
    • /
    • 2022
  • Online social networks contain a large amount of data that can be converted into valuable and insightful information. Text mining approaches allow exploring large-scale data efficiently. Therefore, this study reviews the recent literature on text mining in online social networks in a way that produces valid and valuable knowledge for further research. The review identifies text mining techniques used in social networking, the data used, tools, and the challenges. Research questions were formulated, then search strategy and selection criteria were defined, followed by the analysis of each paper to extract the data relevant to the research questions. The result shows that the most social media platforms used as a source of the data are Twitter and Facebook. The most common text mining technique were sentiment analysis and topic modeling. Classification and clustering were the most common approaches applied by the studies. The challenges include the need for processing with huge volumes of data, the noise, and the dynamic of the data. The study explores the recent development in text mining approaches in social networking by providing state and general view of work done in this research area.

자유트리 기반의 그래프마이닝 기법 분석 (Analysis of Graph Mining based on Free-Tree)

  • 노영상;윤은일;류근호;김명준
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2008년도 추계학술발표대회
    • /
    • pp.275-278
    • /
    • 2008
  • 데이터마이닝은 현재 매우 각광 받고 있는 분야다. 연관규칙탐사는 트랜잭션 데이터베이스에서 일정빈도 이상의 패턴을 찾아내는 작업을 말한다. 그중 빈발서브그래프패턴 마이닝은 최근 관심이 늘어나고 있으며, 그 활용도 또한 매우 높다. 그래프마이닝은 아이템셋마이닝보다 훨씬 더 많은 계산을 필요로 한다. 중복을 최소화 하는 방법이 필요하며, 그중 가장 좋은 성능을 보이는 GASTON 알고리즘을 분석한다.

Low-Rank Representation-Based Image Super-Resolution Reconstruction with Edge-Preserving

  • Gao, Rui;Cheng, Deqiang;Yao, Jie;Chen, Liangliang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제14권9호
    • /
    • pp.3745-3761
    • /
    • 2020
  • Low-rank representation methods already achieve many applications in the image reconstruction. However, for high-gradient image patches with rich texture details and strong edge information, it is difficult to find sufficient similar patches. Existing low-rank representation methods usually destroy image critical details and fail to preserve edge structure. In order to promote the performance, a new representation-based image super-resolution reconstruction method is proposed, which combines gradient domain guided image filter with the structure-constrained low-rank representation so as to enhance image details as well as reveal the intrinsic structure of an input image. Firstly, we extract the gradient domain guided filter of each atom in high resolution dictionary in order to acquire high-frequency prior information. Secondly, this prior information is taken as a structure constraint and introduced into the low-rank representation framework to develop a new model so as to maintain the edges of reconstructed image. Thirdly, the approximate optimal solution of the model is solved through alternating direction method of multipliers. After that, experiments are performed and results show that the proposed algorithm has higher performances than conventional state-of-the-art algorithms in both quantitative and qualitative aspects.

자취 군집화를 통한 프로세스 마이닝의 성능 개선 (Improving Process Mining with Trace Clustering)

  • 송민석;;;정재윤
    • 대한산업공학회지
    • /
    • 제34권4호
    • /
    • pp.460-469
    • /
    • 2008
  • Process mining aims at mining valuable information from process execution results (called "event logs"). Even though process mining techniques have proven to be a valuable tool, the mining results from real process logs are usually too complex to interpret. The main cause that leads to complex models is the diversity of process logs. To address this issue, this paper proposes a trace clustering approach that splits a process log into homogeneous subsets and applies existing process mining techniques to each subset. Based on log profiles from a process log, the approach uses existing clustering techniques to derive clusters. Our approach are implemented in ProM framework. To illustrate this, a real-life case study is also presented.

TFT-LCD 산업에서의 품질마이닝 시스템

  • 이현우;남호수;최경호
    • 한국품질경영학회:학술대회논문집
    • /
    • 한국품질경영학회 2006년도 춘계학술대회
    • /
    • pp.142-148
    • /
    • 2006
  • Data mining is a useful tool for analyzing data from different perspectives and for summarizing them into useful information. Recently, the data mining methods are applied to solving quality problems of the manufacturing processes. This paper discusses the problems of construction of a quality mining system, which is based on the various data mining methods. The quality mining system includes recipe optimization, significant difference test, finding critical processes, forecasting the yield. The contents and system of this paper are focused on the TFT-LCD manufacturing process. We also provide some illustrative field examples of the quality mining system.

  • PDF

센서 네트워크의 데이터 스트림 마이닝을 위한 온톨로지 기반의 전처리 기법 (Ontology based Preprocessing Scheme for Mining Data Streams from Sensor Networks)

  • 정재은
    • 지능정보연구
    • /
    • 제15권3호
    • /
    • pp.67-80
    • /
    • 2009
  • 다양한 센서의 개발과 센서 네트워크 구축으로 인해 특정 공간의 환경 데이터를 수집할 수 있다. 보다 유용한 정보 및 지식의 발견을 위하여 데이터 마이닝(Data mining) 기법이 활용되는 연구들이 소개되었다. 본 연구에서는 이와 같은 데이터 마이닝 기법의 효율성 증대를 위하여 센서 네트워크로부터의 데이터 스트림의 전처리 과정(Preprocessing)을 수행하고자 한다. 제안하는 센서 스트림 데이터의 전처리 과정은 i) 세션확인(Session identification)과 ii) 오류검증(Error detection) 문제를 해결하고자 한다. 특히, 이를 위해 각센서 장비로부터 수집되는 데이터의 의미(Semantics)를 표현하고 있는 온톨로지(Ontology)를 적용한다. 본 연구 결과의 성능 평가를 위하여 센서 네트워크 테스팅 환경을 교내에 설치하였으며 30여일 동안 수집된 데이터를 이용하여 시뮬레이션을 실행하였다.

  • PDF

교육에서의 효율적인 정보 활용을 위한 데이터 마이닝 기법 (Data Mining Technology for Efficient Information Application)

  • 이철환;한선관
    • 정보교육학회논문지
    • /
    • 제3권1호
    • /
    • pp.75-85
    • /
    • 1999
  • 본 연구는 초 중등교육에서 사용되고 있는 데이터 베이스 시스템에 데이터 마이닝 기법을 적용하여 보다 효율적인 교육자료로 활용하기 위한 방안 제시에 그 목적이 있다. 데이터 마이닝에 대한 전반적인 내용과 기계학습과 관련된 내용을 고찰하였다. 교육에서 많이 사용되는 데이터베이스 시스템으로 종합생활기록과 건강 기록, 성적 자료가 있으며, 이러한 자료에서 나타난 특별한 형식과 집합을 데이터 마이닝 기법과 기계학습을 이용하여 유용한 정보를 추출하는 방법에 대해 제시하였다. 그리고 이러한 데이터 마이닝 기술을 사용함에 있어 교육 현장에서 문제가 되는 점과 이를 해결하기 위한 방안을 제안하였다.

  • PDF

빅데이터마이닝을 이용한 회계정보처리 모형 (Accounting Information Processing Model Using Big Data Mining)

  • 김경일
    • 융합정보논문지
    • /
    • 제10권7호
    • /
    • pp.14-19
    • /
    • 2020
  • 확장성 보고서 언어인 XML기술을 회계보고 영역에 응용한 인터넷 표준인 XBRL에 기초한 회계정보처리 모형을 제안하고자 한다. 기업마다 문서의 특성이 상이하기에 의사결정자에게 유용한 정보를 제공하여야 한다는 회계의 목적에 비추어 그 중요성이 크다. 본 연구는 X-Hive 데이터베이스 내에 XBRL로 저장된 XML 계층구조를 기반으로 하는 데이터 마이닝 모형을 제안하고자 한다. 데이터마이닝 분석은 연관규칙으로 실험되었고 XBRL을 기반으로 DC-Apriori 데이터마이닝 방법을 Apriori알고리즘과 X쿼리를 결합하여 제안한다. 마지막으로 제안 모형의 타당성과 유효성에 대해서는 실험을 통해 검증하였다.

Finding Naval Ship Maintenance Expertise Through Text Mining and SNA

  • Kim, Jin-Gwang;Yoon, Soung-woong;Lee, Sang-Hoon
    • 한국컴퓨터정보학회논문지
    • /
    • 제24권7호
    • /
    • pp.125-133
    • /
    • 2019
  • Because military weapons systems for special purposes are small and complex, they are not easy to maintain. Therefore, it is very important to maintain combat strength through quick maintenance in the event of a breakdown. In particular, naval ships are complex weapon systems equipped with various equipment, so other equipment must be considered for maintenance in the event of equipment failure, so that skilled maintenance personnel have a great influence on rapid maintenance. Therefore, in this paper, we analyzed maintenance data of defense equipment maintenance information system through text mining and social network analysis(SNA), and tried to identify the naval ship maintenance expertise. The defense equipment maintenance information system is a system that manages military equipment efficiently. In this study, the data(2,538cases) of some naval ship maintenance teams were analyzed. In detail, we examined the contents of main maintenance and maintenance personnel through text mining(word cloud, word network). Next, social network analysis(collaboration analysis, centrality analysis) was used to confirm the collaboration relationship between maintenance personnel and maintenance expertise. Finally, we compare the results of text mining and social network analysis(SNA) to find out appropriate methods for finding and finding naval ship maintenance expertise.

IGBT Open-Circuit Fault Diagnosis for 3-Phase 4-Wire 3-Level Active Power Filters based on Voltage Error Correlation

  • Wang, Ke;Tang, Yi;Zhang, Xiao;Wang, Yang;Zhang, Chuan-Jin;Zhang, Hui
    • Journal of Power Electronics
    • /
    • 제16권5호
    • /
    • pp.1950-1963
    • /
    • 2016
  • A novel open-circuit fault diagnosis method for 3-phase 4-wire 3-level active power filters based on voltage error correlation is proposed in this paper. This method is based on observing the output pole voltage error of the active power filter through two kinds of algorithms. One algorithm is a voltage error analytical algorithm, which derives four output voltage error analytic expressions through the pulse state, current value and dc bus voltage, respectively, assuming that all of the IGBTs of a certain phase come to an OC fault. The other algorithm is a current circuit equation algorithm, which calculates the real-time output voltage error through basic circuit theory. A correlation is introduced to measure the similarity of the output voltage errors between the two algorithms, and OC faults are located by the maximum of the correlations. A FPGA has been chosen to implement the proposed method due to its fast prototyping. Simulation and experimental results are presented to show the performance of the proposed OC fault diagnosis method.