• Title/Summary/Keyword: Review data mining

A review of big data analytics and healthcare (빅데이터 분석과 헬스케어에 대한 동향)

  • Moon, Seok-Jae;Lee, Namju
    • Journal of the Korean Applied Science and Technology / v.37 no.1 / pp.76-82 / 2020
  • Big data analysis in healthcare research appears to be a necessary strategy for the convergence of sports science and technology in the era of the Fourth Industrial Revolution. The purpose of this study is to provide a basic review that supports the diversity of big data and healthcare convergence by discussing the concept, analysis methods, and application examples of big data and by exploring its applications. Text mining, data mining, opinion mining, process mining, cluster analysis, and social network analysis are currently used. Big data analysis would make it possible to identify high-risk factors for a given condition, determine specific health determinants for diseases, monitor bio-signals, predict diseases, provide training and treatments, and analyze healthcare measurements. As for further work, the characteristics of big data provide an appropriate basis for using promising software platforms to develop applications that can handle big data in healthcare and, even more so, in sports science.

Data-Mining Bootstrap Procedure with Potential Predictors in Forecasting Models: Evidence from Eight Countries in the Asia-Pacific Stock Markets

  • Lee, Hojin
    • East Asian Economic Review / v.23 no.4 / pp.333-351 / 2019
  • We use a data-mining bootstrap procedure to test return predictability in eight Asia-Pacific regional stock markets using in-sample and out-of-sample forecasting models. We address data-mining bias by using the data-mining bootstrap procedure proposed by Inoue and Kilian and applied to US stock market data by Rapach and Wohar. The empirical findings show that stock returns are predictable not only in-sample but also out-of-sample in Hong Kong, Malaysia, Singapore, and Korea, with a few exceptions for some forecasting horizons. However, we find a significant disparity between in-sample and out-of-sample predictability in the Korean stock market. For Hong Kong, Malaysia, and Singapore, stock returns have predictable components both in-sample and out-of-sample. For the US, Australia, and Canada, we find no evidence of return predictability either in-sample or out-of-sample, with a few exceptions. For Japan, stock returns have a predictable component with the price-earnings ratio as a forecasting variable for some out-of-sample forecasting horizons.

DSS Architectures to Support Data Mining Activities for Supply Chain Management (데이터 마이닝을 활용한 공급사슬관리 의사결정지원시스템의 구조에 관한 연구)

  • Jhee, Won-Chul;Suh, Min-Soo
    • Asia Pacific Journal of Information Systems / v.8 no.3 / pp.51-73 / 1998
  • This paper evaluates the application potential of data mining in Supply Chain Management (SCM) and suggests architectures for Decision Support Systems (DSS) that support data mining activities. We first briefly introduce data mining and review the recent literature on SCM, and then evaluate data mining applications to SCM in three aspects: marketing, operations management, and information systems. By analyzing cases on pricing models in distribution channels, demand forecasting, and quality control, we show that artificial intelligence techniques such as artificial neural networks, case-based reasoning, and expert systems, combined with traditional analysis models, effectively mine useful knowledge from the large volume of SCM data. An agent-based information system is addressed as an important architecture that enables the pursuit of global optimization of SCM through communication and information sharing among supply chain constituents without loss of their characteristics and independence. We expect the suggested architectures of intelligent DSS to provide a basis for developing information systems for SCM that improve the quality of organizational decisions.

Analysis of Healthcare Quality Indicators using Data Mining and Development of a Decision Support System (데이터마이닝을 이용한 의료의 질 측정지표 분석 및 의사결정지원시스템 개발)

  • Kim, Hye Sook;Chae, Young-Moon;Tark, Kwan-Chul;Park, Hyun-Ju;Ho, Seung-Hee
    • Quality Improvement in Health Care / v.8 no.2 / pp.186-207 / 2001
  • Background: This study presents an analysis of healthcare quality indicators using data mining and the development of a decision support system for quality improvement. Method: Important factors influencing the key quality indicators were identified using a decision tree method for data mining, based on 8,405 patients who were discharged from a medical center between December 1, 2000 and January 31, 2001. In addition, a decision support system was developed in Visual Basic 6.0 to analyze and monitor trends in these quality indicators. Guidelines and a tutorial for quality improvement activities were also included in the system. Result: Among 12 selected quality indicators, decision tree analysis was performed for the 3 indicators that had more than 100 cases during the period: unscheduled readmission due to the same or a related condition, unscheduled return to the intensive care unit, and inpatient mortality. The optimum range of the target group for the healthcare quality indicators was identified from the gain chart. Important influencing factors for these 3 indicators were diagnosis, attribute of the disease, and age of the patient in the unscheduled return to ICU group; and length of stay, diagnosis, and department in the inpatient mortality group. Conclusion: Through the analysis of healthcare quality indicators and a data mining technique, we developed a decision support system that can be effectively implemented for utilization review and quality management in a healthcare organization. In the future, more quality indicators should be developed to effectively support hospital-wide Continuous Quality Improvement activity. Through these endeavours, a decision support system can be developed, and the newly developed system should be well integrated with the hospital Order Communication System to support concurrent review, utilization review, and quality and risk management.

Online Social Media Review Mining for Living Items with Probabilistic Approach: A Case Study

  • Li, Shuai;Hao, Fei;Kim, Hee-Cheol
    • Smart Media Journal / v.2 no.2 / pp.20-27 / 2013
  • Social media is at the top of the agenda for many business executives and decision makers, and consultants are trying to identify ways in which companies can make profitable use of applications such as Netflix and Flixster. Social media is playing an increasingly important role as an information source for customers making product choices. With the flourishing of Web 2.0 technology, customer reviews are becoming increasingly useful and important information resources that help people save time and energy when purchasing the products they want. This paper proposes a Bayesian Probabilistic Classification algorithm to mine social media reviews and evaluates it on a real data set using different splits and a cross-validation mechanism. The experimental results show the robustness and effectiveness of the proposed approach for mining social media reviews.
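
The abstract above does not specify the classifier in detail. As a rough illustration only, the following sketch assumes a multinomial naive Bayes model with Laplace smoothing and a simple interleaved k-fold cross-validation. The function names (`train`, `predict`, `cross_validate`) and the four toy reviews are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def train(docs):
    """Count labels and per-label words. docs is a list of (tokens, label)."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for tokens, label in docs:
        label_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return label_counts, word_counts, vocab


def predict(model, tokens):
    """Return the label with the highest log posterior for the tokens."""
    label_counts, word_counts, vocab = model
    total_docs = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label, count in label_counts.items():
        score = math.log(count / total_docs)  # log prior
        denom = sum(word_counts[label].values()) + len(vocab)
        for tok in tokens:
            if tok in vocab:  # words unseen in training are skipped
                # Laplace (add-one) smoothed log likelihood
                score += math.log((word_counts[label][tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def cross_validate(docs, k):
    """Interleaved k-fold cross-validation; returns overall accuracy."""
    correct = 0
    for fold in range(k):
        held_out = docs[fold::k]
        training = [d for i, d in enumerate(docs) if i % k != fold]
        model = train(training)
        correct += sum(predict(model, toks) == label for toks, label in held_out)
    return correct / len(docs)


# Toy reviews invented for illustration; not data from the paper.
reviews = [
    ("great fun movie".split(), "pos"),
    ("boring slow movie".split(), "neg"),
    ("slow boring plot".split(), "neg"),
    ("great acting fun plot".split(), "pos"),
]
model = train(reviews)
print(predict(model, ["great", "fun"]))
print(cross_validate(reviews, k=2))
```

Working in log space avoids numeric underflow when reviews are long, and the add-one smoothing keeps a word that never appeared under a label from forcing that label's probability to zero.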

Current Status and Trend of Data Mining Techniques (데이터 마이닝 기법의 현황 및 추세)

  • 오승준;송영덕;오민근
    • KSCI Review / v.8 no.2 / pp.67-74 / 2001
  • Recent times have seen explosive growth in the availability of various kinds of data. This has resulted in an unprecedented opportunity to develop automated, data-driven techniques for extracting useful knowledge. Data mining, an important step in this process of knowledge discovery, consists of methods that discover interesting, non-trivial, and useful patterns hidden in the data. In this paper, we survey data mining techniques. We identify data mining techniques that are effective in real-world applications and suggest appropriate application areas for each technique. We conclude the paper with some research issues.

IMPLEMENTATION OF SUBSEQUENCE MAPPING METHOD FOR SEQUENTIAL PATTERN MINING

  • Trang, Nguyen Thu;Lee, Bum-Ju;Lee, Heon-Gyu;Ryu, Keun-Ho
    • Proceedings of the KSRS Conference / v.2 / pp.627-630 / 2006
  • Sequential pattern mining is a mining approach that addresses the problem of discovering the maximal frequent sequences in a given database. In daily and scientific life, sequential data are available and used everywhere in representative forms such as text, weather data, satellite data streams, business transactions, telecommunications records, experimental runs, DNA sequences, and histories of medical records. Discovering sequential patterns can assist users or scientists in predicting upcoming activities, interpreting recurring phenomena, or extracting similarities. To that end, the core of sequential pattern mining is finding the sequences that occur frequently across the data sequences. Besides the discovery of frequent itemsets, sequential pattern mining requires arranging those itemsets into sequences and discovering which of those sequences are frequent. Therefore, before mining sequences, the main task is to check whether one sequence is a subsequence of another sequence in the database. In this paper, we implement the subsequence matching method as a preprocessing step for sequential pattern mining. The matched sequences in our implementation are normalized sequences in the form of number chains. The result given by this method is a review of the matching information between the input mapped sequences.

Implementation of Subsequence Mapping Method for Sequential Pattern Mining

  • Trang Nguyen Thu;Lee Bum-Ju;Lee Heon-Gyu;Park Jeong-Seok;Ryu Keun-Ho
    • Korean Journal of Remote Sensing / v.22 no.5 / pp.457-462 / 2006
  • Sequential pattern mining is a mining approach that addresses the problem of discovering the maximal frequent sequences in a given database. In daily and scientific life, sequential data are available and used everywhere in representative forms such as text, weather data, satellite data streams, business transactions, telecommunications records, experimental runs, DNA sequences, and histories of medical records. Discovering sequential patterns can assist users or scientists in predicting upcoming activities, interpreting recurring phenomena, or extracting similarities. To that end, the core of sequential pattern mining is finding the sequences that occur frequently across the data sequences. Besides the discovery of frequent itemsets, sequential pattern mining requires arranging those itemsets into sequences and discovering which of those sequences are frequent. Therefore, before mining sequences, the main task is to check whether one sequence is a subsequence of another sequence in the database. In this paper, we implement the subsequence matching method as a preprocessing step for sequential pattern mining. The matched sequences in our implementation are normalized sequences in the form of number chains. The result given by this method is a review of the matching information between the input mapped sequences.
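
The subsequence check described in the abstract can be sketched as follows. This is a minimal illustration under the usual definition from the sequential pattern mining literature (each itemset of the candidate must be contained, in order, in a distinct itemset of the target), not the authors' implementation. The names `is_subsequence` and `support` and the small integer-coded database are hypothetical.

```python
from typing import FrozenSet, Sequence

Itemset = FrozenSet[int]  # items are assumed to be mapped to integers


def is_subsequence(candidate: Sequence[Itemset], target: Sequence[Itemset]) -> bool:
    """True if each candidate itemset is a subset of a later and later target itemset."""
    position = 0
    for itemset in candidate:
        # Greedily advance to the earliest target itemset containing this one.
        while position < len(target) and not itemset <= target[position]:
            position += 1
        if position == len(target):
            return False
        position += 1  # the next candidate itemset must match strictly later
    return True


def support(candidate: Sequence[Itemset], database: Sequence[Sequence[Itemset]]) -> int:
    """Number of data sequences that contain the candidate as a subsequence."""
    return sum(is_subsequence(candidate, seq) for seq in database)


# Toy database invented for illustration; not data from the paper.
database = [
    [frozenset({1}), frozenset({2, 3}), frozenset({4})],
    [frozenset({1, 2}), frozenset({4})],
    [frozenset({2}), frozenset({1})],
]
candidate = [frozenset({1}), frozenset({4})]
print(support(candidate, database))
```

Matching each candidate itemset at the earliest possible position is sufficient here, because taking an earlier match never rules out a later one. A candidate is then called frequent when its support reaches a user-chosen threshold.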

Students' Performance Prediction in Higher Education Using Multi-Agent Framework Based Distributed Data Mining Approach: A Review

  • M.Nazir;A.Noraziah;M.Rahmah
    • International Journal of Computer Science & Network Security / v.23 no.10 / pp.135-146 / 2023
  • An effective educational program warrants an innovative design that enhances the efficacy of higher education in a way that accelerates the achievement of desired results and reduces the risk of failure. The Educational Decision Support System (EDSS) is currently a hot topic in educational systems, as it facilitates monitoring and evaluating student results during their development. Inadequate information systems have difficulty taking full advantage of EDSS owing to a lack of accuracy, incorrect analysis of characteristics, and inadequate databases. Data Mining Techniques (DMTs) provide helpful tools for finding models or forms in data and are extremely useful in the decision-making process. Several researchers have taken part in research on distributed data mining with multi-agent technology. The rapid growth of network technology and IT use has led to the widespread use of distributed databases. This article explains the available data mining technology and the distributed data mining system framework. A Distributed Data Mining approach is used in this work so that a classifier capable of predicting the success of students in the economic domain can be constructed. This research also discusses the Intelligent Knowledge Base Distributed Data Mining framework for assessing student performance through mid-term and final-term exams, employing multi-agent system-based educational mining techniques. Using single and ensemble-based classifiers, this study investigates the factors that influence student performance in higher education and constructs a classification model that can predict academic achievement. We also discuss the importance of multi-agent systems and comparative machine learning approaches in EDSS development.

Genome data mining for everyone

  • Lee, Gir-Won;Kim, Sang-Soo
    • BMB Reports / v.41 no.11 / pp.757-764 / 2008
  • The genomic sequences of a huge number of species have been determined. Typically, these genome sequences and the associated annotation data are accessed through Internet-based genome browsers that offer a user-friendly interface. Intelligent use of the data should expedite biological knowledge discovery. Such activity is collectively called data mining and involves queries that can be simple, complex, and even combinational. Various tools have been developed to make genome data mining available to computational and experimental biologists alike. In this mini-review, some tools that have proven successful will be introduced along with examples taken from published reports.