• Title/Summary/Keyword: 시간 마이닝

Search Result 401, Processing Time 0.033 seconds

Music Recommendation Using Data Mining (데이터 마이닝을 이용한 음악 추천)

  • Lee, Hye-In;Yun, So-Young;Youn, Sung-Dae
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2018.10a
    • /
    • pp.372-375
    • /
    • 2018
  • 본 논문은 온라인 음원 서비스 이용자들이 겪는 선택의 어려움을 최소화하고, 낭비되는 시간을 줄이기 위한 음악 추천 기법을 제안하고자 한다. 제안하는 기법은 개인정보의 이용 없이 아이템을 추천할 수 있는 아이템 기반 협업필터링 알고리즘을 사용한다. 더 정확한 추천을 위해 음원의 메타데이터를 이용한다. 실험을 통해 제안하는 기법이 메타데이터를 이용하지 않을 때보다 추천 성능이 향상되는 것을 확인하였다.

Blocking Elimination Method Using Graph Clustering In Influence Propagation (그래프 클러스터링을 이용한 영향력 전파에서의 블로킹 제거 방법)

  • Lee, Rich. Chul-Ghi;Lee, Wookey
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2015.04a
    • /
    • pp.706-709
    • /
    • 2015
  • 영향력 전파 문제는 주어진 네트워크 환경에서 영향력을 최대화 할 수 있는 top-k 노드를 찾는 문제로 데이터 마이닝 분야에서 활발히 연구되어왔다. 본 논문에서는 그래프 클러스터링 기법을 사용하여 영향력을 전파하는 방법을 제안하고자 한다. 이러한 방법에는 두 가지 이점이 있는데 먼저 서로 다른 시드 사이에 영향력이 중복되는 블로킹 현상을 제거하여 수행시간을 단축시킬 수 있다. 다음으로는 유 방향 그래프인 경우 기존의 탐욕 알고리즘보다 더 많은 노드에 전파를 가능하게 한다.

Inferring Undiscovered Public Knowledge by Using Text Mining-driven Graph Model (텍스트 마이닝 기반의 그래프 모델을 이용한 미발견 공공 지식 추론)

  • Heo, Go Eun;Song, Min
    • Journal of the Korean Society for information Management
    • /
    • v.31 no.1
    • /
    • pp.231-250
    • /
    • 2014
  • Due to the recent development of Information and Communication Technologies (ICT), the amount of research publications has increased exponentially. In response to this rapid growth, the demand of automated text processing methods has risen to deal with massive amount of text data. Biomedical text mining discovering hidden biological meanings and treatments from biomedical literatures becomes a pivotal methodology and it helps medical disciplines reduce the time and cost. Many researchers have conducted literature-based discovery studies to generate new hypotheses. However, existing approaches either require intensive manual process of during the procedures or a semi-automatic procedure to find and select biomedical entities. In addition, they had limitations of showing one dimension that is, the cause-and-effect relationship between two concepts. Thus;this study proposed a novel approach to discover various relationships among source and target concepts and their intermediate concepts by expanding intermediate concepts to multi-levels. This study provided distinct perspectives for literature-based discovery by not only discovering the meaningful relationship among concepts in biomedical literature through graph-based path interference but also being able to generate feasible new hypotheses.

Analysis on Research Trend of Productivity Using Text Mining - Focusing on KSCE Journal - (텍스트 마이닝을 통한 건설 생산성 분야의 연구동향 분석 - KSCE 저널을 중심으로 -)

  • Gu, Bongil;Huh, Youngki
    • Korean Journal of Construction Engineering and Management
    • /
    • v.21 no.2
    • /
    • pp.15-21
    • /
    • 2020
  • The relationship between keywords, found in all productivity related papers published in the KSCE journal for last 15 years, were analyzed in order to reveal a research trend in the area using text mining and A-Priori algorithm. As the results, it is found that the word of 'productivity' is most closely related to the words of 'work' and 'labor'. Futhermore, the word is somewhat related to those of 'factor', 'model', simulation', and 'work time'. It is also revealed that, on the other hand, the words of 'machine' and 'equipment' have little relationships with the keyword. This research will be a great help for academia to understand a research trend in the area of construction productivity.

A Personalized Automatic TV Program Scheduler using Sequential Pattern Mining (순차 패턴 마이닝 기법을 이용한 개인 맞춤형 TV 프로그램 스케줄러)

  • Pyo, Shin-Jee;Kim, Eun-Hui;Kim, Mun-Churl
    • Journal of Broadcast Engineering
    • /
    • v.14 no.5
    • /
    • pp.625-637
    • /
    • 2009
  • With advent of TV environment and increasing of variety of program contents, users are able to experience more various and complex environment for watching TV contents. According to the change of content watching environment, users have to make more efforts to choose his/her interested TV program contents or TV channels than before. Also, the users usually watch the TV program contents with their own regular way. So, in this paper, we suggests personalized TV program schedule recommendation system based on the analyzing users' TV watching history data. And we extract the users' watched program patterns using the sequential pattern mining method. Also, we proposed a new sequential pattern mining which is suitable for TV watching environment and verify our proposed method have better performance than existing sequential pattern mining method in our application area. In the future, we will consider a VoD characteristic for extending to IPTV program schedule recommendation system.

Research Dynamics in Innovation Studies Using Text Mining (텍스트 마이닝을 이용한 혁신 분야의 국외 연구 동향 분석)

  • Jung, Hyojung
    • Journal of Technology Innovation
    • /
    • v.24 no.4
    • /
    • pp.249-275
    • /
    • 2016
  • For the past 50 years, innovation field has gone through an evolution. The range of research topics on innovation has expanded and diversified, along with a quantitative increase. In a multi-disciplinary field like innovation, to explore new topics and understand research trends, it is necessary to possess a comprehensive understanding regarding the current status of, and trends in, the research. In this study, the research trend in innovation studies from 2000 to 2015 was analyzed in a holistic perspective. For this, a novel technique, text mining was used. The result shows that innovation studies has focused on the traditional and emerging topics. Also, the differentiations has appeared in some of the traditional topics. This study provides not only an understanding of research dynamics, but also an opportunity to gain insights into the evolution of a new paradigm from an academic perspective.

BLOCS: Block Correlation Aware Sequential Pattern Mining based Caching Algorithm for Hybrid Storages (BLOCS: 블록 상관관계를 인지하는 시퀀스 패턴 마이닝 기반 하이브리드 스토리지 캐슁 알고리즘)

  • Lee, Seongjin;Won, Youjip
    • Journal of the Korea Society of Computer and Information
    • /
    • v.19 no.7
    • /
    • pp.113-130
    • /
    • 2014
  • In this paper, we propose BLOCS algorithm to find sequence of data that should be saved in cache device of hybrid storage system which uses SSD as a cache device. BLOCS algorithm which uses a sequence pattern mining scheme, creates a set of frequently requested sectors with respect to requested order of sectors. To compare the performance of the proposed scheme, we introduce Distance (DIST) based scheme, Request Frequency (FREQ) based scheme, and Frequency times Size (F-S) based scheme. We measure the hit ratio and I/O latency of different caching schemes using hybrid storage caching simulator. We acquired booting workload along with ten scenarios of launching applications and use the workloads as input to the cache simulator. After experiment with booting workload, we find that BLOCS scheme gives hit ratio of 61% which is about 15% higher than the least performing DIST scheme.

Keyword Extraction from News Corpus using Modified TF-IDF (TF-IDF의 변형을 이용한 전자뉴스에서의 키워드 추출 기법)

  • Lee, Sung-Jick;Kim, Han-Joon
    • The Journal of Society for e-Business Studies
    • /
    • v.14 no.4
    • /
    • pp.59-73
    • /
    • 2009
  • Keyword extraction is an important and essential technique for text mining applications such as information retrieval, text categorization, summarization and topic detection. A set of keywords extracted from a large-scale electronic document data are used for significant features for text mining algorithms and they contribute to improve the performance of document browsing, topic detection, and automated text classification. This paper presents a keyword extraction technique that can be used to detect topics for each news domain from a large document collection of internet news portal sites. Basically, we have used six variants of traditional TF-IDF weighting model. On top of the TF-IDF model, we propose a word filtering technique called 'cross-domain comparison filtering'. To prove effectiveness of our method, we have analyzed usefulness of keywords extracted from Korean news articles and have presented changes of the keywords over time of each news domain.

  • PDF

The Goods Recommendation System based on modified FP-Tree Algorithm (변형된 FP-Tree를 기반한 상품 추천 시스템)

  • Kim, Jong-Hee;Jung, Soon-Key
    • Journal of the Korea Society of Computer and Information
    • /
    • v.15 no.11
    • /
    • pp.205-213
    • /
    • 2010
  • This study uses the FP-tree algorithm, one of the mining techniques. This study is an attempt to suggest a new recommended system using a modified FP-tree algorithm which yields an association rule based on frequent 2-itemsets extracted from the transaction database. The modified recommended system consists of a pre-processing module, a learning module, a recommendation module and an evaluation module. The study first makes an assessment of the modified recommended system with respect to the precision rate, recall rate, F-measure, success rate, and recommending time. Then, the efficiency of the system is compared against other recommended systems utilizing the sequential pattern mining. When compared with other recommended systems utilizing the sequential pattern mining, the modified recommended system exhibits 5 times more efficiency in learning, and 20% improvement in the recommending capacity. This result proves that the modified system has more validity than recommended systems utilizing the sequential pattern mining.

A SNS Data-driven Comparative Analysis on Changes of Attitudes toward Artificial Intelligence (SNS 데이터 분석을 기반으로 인공지능에 대한 인식 변화 비교 분석)

  • Yun, You-Dong;Yang, Yeong-Wook;Lim, Heui-Seok
    • Journal of Digital Convergence
    • /
    • v.14 no.12
    • /
    • pp.173-182
    • /
    • 2016
  • AI (Artificial Intelligence) has attracted interest as a key element for technological advancement in various fields. In Korea, internet companies are leading the development of AI business technology. Active government funding plans for AI technology has also drawn interest. But not everyone is optimistic about AI. Both positive and negative opinions coexist about AI. However, attempts on analyzing people's opinions about AI in a quantitative way was scarce. In this study, we used text mining on SNS (Social Networking Service) to collect opinions about AI. And then we performed a comparative analysis about whether people view it as a positive thing or a negative thing and performed a comparative analysis to recognize popular key-words. Based on the results, it was confirmed that the change of key-words and negative posts have increased through time. And through these results, we were able to predict trend about AI.