• Title/Summary/Keyword: 실시간 마이닝

Search Result 174, Processing Time 0.026 seconds

Predicting changes of realtime search words using time series analysis and artificial neural networks (시계열분석과 인공신경망을 이용한 실시간검색어 변화 예측)

  • Chong, Min-Yeong
    • Journal of Digital Convergence
    • /
    • v.15 no.12
    • /
    • pp.333-340
    • /
    • 2017
  • Since realtime search words are centered on the fact that the search growth rate of an issue is rapidly increasing in a short period of time, it is not possible to express an issue that maintains interest for a certain period of time. In order to overcome these limitations, this paper evaluates the daily and hourly persistence of the realtime words that belong to the top 10 for a certain period of time and extracts the search word that are constantly interested. Then, we present the method of using the time series analysis and the neural network to know how the interest of the upper search word changes, and show the result of forecasting the near future change through the actual example derived through the method. It can be seen that forecasting through time series analysis by date and artificial neural networks learning by time shows good results.

Extracting week key issues and analyzing differences from realtime search keywords of portal sites (포털사이트 실시간 검색키워드의 주간 핵심 이슈 선정 및 차이 분석)

  • Chong, Min-Yeong
    • Journal of Digital Convergence
    • /
    • v.14 no.12
    • /
    • pp.237-243
    • /
    • 2016
  • Since realtime search keywords of portal sites are arranged in descending order by instant increasing rates of search numbers, they easily show issues increasing in interests for a short time. But they have the limits extracted different results by portal sites and not shown issues by a period. Thus, to find key issues from the whole realtime search keywords for certain period, and to show results of summarizing them and analyzing differences, is significant in providing the basis of understanding issues more practically and in maintaining consistency of them. This paper analyzes differences of week key issues extracted from week analysis of realtime search keywords provided by two typical portal sites. The results of experiments show that the portal group means of realtime search keywords by the independent t-test and the survival functions of realtime search keywords by the survival analysis are statistically significant differences.

Evaluation of Web Pages using User's Activities in a Page and Page Visiting Duration Time (사용자 활동과 폐이지 이용 시간을 이용한 웹 페이지 평가 기법)

  • Lee, Dong-Hun;Yun, Tae-Bok;Kim, Geon-Su;Lee, Ji-Hyeong
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2007.04a
    • /
    • pp.99-102
    • /
    • 2007
  • 웹 사용 마이닝은 사용자의 웹 이용 패턴에 대해 분석하여 정보를 찾아내는 분야이다. 사용자에 대한 분석은 웹을 통한 비즈니스의 근간이 되고 있다. 때문에 웹 마이닝 분야에서 주목받고 중요시 되는 기술이 되었다. 그러나 최근에는 공개된 기술의 취약점을 이용해 악의적으로 정보를 교란하는 일이 발생되고 있어 사회적으로 이슈가 되고 있다. 이러한 문제는 특히 단순한 페이지 뷰 횟수에 기반을 둔 정보 추출 방식에 주로 발생하고 있다. 따라서 본 논문에서는 이러한 추출 방식의 단순함을 줄이고 사용자의 정보를 더 반영하기 위하여 페이지 이용 시간과 페이지 내의 행동을 분석하여 콘텐츠의 질을 평가하는 방안을 제시한다. 구현 부분에는 사용자의 개인정보 침해 없이 사용자의 행동을 수집하기 위하여 최근 인기를 얻고 있는 Ajax 기술을 사용하였다. 그리고 실시간으로 웹 페이지에 대한 평가를 수행하기 위해 서버에 로그 필터 모듈을 추가하는 수집 기법을 제안하였다.

  • PDF

A Webtoon Recommendation System using Opinion Mining and Collaborate Filtering (오피니언 마이닝과 협업필터링을 이용한 웹툰 추천 시스템)

  • Sim, Dae-Su;Park, Jin-Soo;Park, Doo-Soon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2017.04a
    • /
    • pp.521-524
    • /
    • 2017
  • 최근 다양한 웹툰 콘텐츠의 증가와 함께 스마트폰 보급률이 높아지면서, 사용자들의 실시간 웹툰 서비스의 이용이 증가하고 있다. 웹툰 콘텐츠의 가치가 갈수록 점점 높아지고 있으며, 각종 영화 애니메이션 게임 등 다양한 콘텐츠 사업에 많은 데이터가 사용되고 있다. 본 논문에서는 기존 웹툰의 리뷰를 오피니언 마이닝기법을 사용하여 각 웹툰의 선호도를 평가하며 나이, 성별, 선호 장르, 선호 웹툰 플랫폼 등과 같은 개인 성향을 통하여 사용자간의 유사도를 측정하는 협업 필터링 방법을 적용해 각각의 사용자들이 보고 싶어하는 웹툰을 자동적으로 추천해주는 웹툰 추천 시스템을 제안한다.

Public opinion analysis system using opinion mining (오피니언 마이닝을 이용한 여론분석 시스템)

  • Kim, Young-Ah;Kim, Sung-Kwon;Hao, Fei;Park, Doo-soon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2015.04a
    • /
    • pp.291-293
    • /
    • 2015
  • 최근 스마트폰 사용자와 SNS를 이용하는 사용자들이 늘어나고 있다. 또 다양한 SNS가 등장하면서 SNS데이터의 양이 방대해지고 SNS데이터의 가치와 신뢰성도 점점 높아지고 있다. 이러한 SNS 데이터를 사용하여 특정 키워드의 여론을 분석하고 사용자들의 반응을 얻는 것은 좋은 정보로 여러 분야에 사용될 수 있을 것이다. 본 논문에서는 SNS를 기반으로 오피니언 마이닝을 사용해 특정 키워드에 대한 SNS사용자들의 여론을 분석하였다. 그 결과 실시간으로 올라오는 글들에 대하여 해당 키워드가 어떤 여론을 가지고 있는지 분석 결과를 얻었다.

A Design of false alarm analysis framework of intrusion detection system by using incremental mining method (점진적 마이닝 기법을 적용한 침입탐지 시스템의 오 경보 분석 프레임워크 설계)

  • Kim Eun-Hee;Ryu Keun-Ho
    • The KIPS Transactions:PartC
    • /
    • v.13C no.3 s.106
    • /
    • pp.295-302
    • /
    • 2006
  • An intrusion detection system writes a lot of alarms against attack behaviors in real time. These alarms contain not only actual attack alarms, but also false alarms that are mistakes made by the intrusion detection system. False alarms are the main reason that reduces the efficiency of the intrusion detection system, and we propose framework for false alarms analysis in the paper. Also, we apply an incremental data mining method for pattern analysis of false alarms increasing continuously. The framework consists of GUI, DB Manager, Alert Preprocessor, and False Alarm Analyzer. We analyze the false alarms increasingly through the experiment of the proposed framework and show that false alarms are reduced by applying the analyzed false alarm rules in the intrusion detection system.

The Knowledge-Based Design Paradigm through Web Data Mining and Knowledge Management Framework (웹 데이터 마이닝과 지식경영 프레임웍을 통한 지식-기반 디자인 패러다임 구축)

  • 양종열
    • Archives of design research
    • /
    • v.15 no.4
    • /
    • pp.159-168
    • /
    • 2002
  • The world has rushed into knowledge information society. Information technology is one of the causes to show up knowledge management and one of the motives to accelerate knowledge management. And, these days information technology and internet have made staffing progress. Therefore, the objective of this study is to take out latent knowledge of customers through web data mining in a vast amount of data on the internet in rapidly developing digital environments, to develop the knowledge-based design paradigm applied to knowledge management framework, and finally to develop design which satisfies customers' needs. To reach the objective, knowledge management process and varied previous studies related to web data mining are reviewed on a theoretical basis, and then a new knowledge-based design paradigm (in this study, eCRM in a true sense which combines web data mining with knowledge management process is called knowledge-based design paradigm) combining knowledge management process with web data mining is suggested.

  • PDF

In-depth Analysis of Soccer Game via Webcast and Text Mining (웹 캐스트와 텍스트 마이닝을 이용한 축구 경기의 심층 분석)

  • Jung, Ho-Seok;Lee, Jong-Uk;Yu, Jae-Hak;Lee, Han-Sung;Park, Dai-Hee
    • The Journal of the Korea Contents Association
    • /
    • v.11 no.10
    • /
    • pp.59-68
    • /
    • 2011
  • As the role of soccer game analyst who analyzes soccer games and creates soccer wining strategies is emphasized, it is required to have high-level analysis beyond the procedural ones such as main event detection in the context of IT based broadcasting soccer game research community. In this paper, we propose a novel approach to generate the high-level in-depth analysis results via real-time text based soccer Webcast and text mining. Proposed method creates a metadata such as attribute, action and event, build index, and then generate available knowledges via text mining techniques such as association rule mining, event growth index, and pathfinder network analysis using Webcast and domain knowledges. We carried out a feasibility experiment on the proposed technique with the Webcast text about Spain team's 2010 World Cup games.

Performance Analysis of Siding Window based Stream High Utility Pattern Mining Methods (슬라이딩 윈도우 기반의 스트림 하이 유틸리티 패턴 마이닝 기법 성능분석)

  • Ryang, Heungmo;Yun, Unil
    • Journal of Internet Computing and Services
    • /
    • v.17 no.6
    • /
    • pp.53-59
    • /
    • 2016
  • Recently, huge stream data have been generated in real time from various applications such as wireless sensor networks, Internet of Things services, and social network services. For this reason, to develop an efficient method have become one of significant issues in order to discover useful information from such data by processing and analyzing them and employing the information for better decision making. Since stream data are generated continuously and rapidly, there is a need to deal with them through the minimum access. In addition, an appropriate method is required to analyze stream data in resource limited environments where fast processing with low power consumption is necessary. To address this issue, the sliding window model has been proposed and researched. Meanwhile, one of data mining techniques for finding meaningful information from huge data, pattern mining extracts such information in pattern forms. Frequency-based traditional pattern mining can process only binary databases and treats items in the databases with the same importance. As a result, frequent pattern mining has a disadvantage that cannot reflect characteristics of real databases although it has played an essential role in the data mining field. From this aspect, high utility pattern mining has suggested for discovering more meaningful information from non-binary databases with the consideration of the characteristics and relative importance of items. General high utility pattern mining methods for static databases, however, are not suitable for handling stream data. To address this issue, sliding window based high utility pattern mining has been proposed for finding significant information from stream data in resource limited environments by considering their characteristics and processing them efficiently. In this paper, we conduct various experiments with datasets for performance evaluation of sliding window based high utility pattern mining algorithms and analyze experimental results, through which we study their characteristics and direction of improvement.

Estimating long-term sustainability of real-time issues on portal sites (포털사이트 실시간이슈 지속가능성 평가)

  • Chong, Min-Young
    • Journal of Digital Convergence
    • /
    • v.17 no.12
    • /
    • pp.255-260
    • /
    • 2019
  • Real-time search keywords are not only limited to search keywords that are rapidly increasing interest in real-time, but also have a limitation that they are difficult to determine the sustainability as there is a difference in ranking between portal sites. Estimating sustainability for real-time search keywords is significant in terms of overcoming these limitations and providing some predictability. In particular, long-term search keywords that last for more than a month are of high value as long-lasting social issues. Therefore, in this paper, we analyze the interest based on the ranking of the real-time search keywords and the duration based on sustained weeks, days and hours of real-time search keywords by each portal site and the integrated portal site, and then estimating sustainability based on high level of interest and duration, and present a method to derive real-time search issues with high long-term sustainability.