• Title/Summary/Keyword: Data mining analysis

Search Result 2,164, Processing Time 0.038 seconds

A Case Study of OLAP and Data Mining on the Analytical Knowledge Creation in Organizations (OLAP과 데이터마이닝을 이용한 조직내 분석지 생성에 관한 사례연구)

  • Cho, Jae-Hee
    • Knowledge Management Research
    • /
    • v.5 no.1
    • /
    • pp.69-82
    • /
    • 2004
  • Prior research on knowledge management focused more on the experiential knowledge based on individual's experience or knowhow than on the analytical knowledge extracted from corporate data. This study examines the effects of the data warehouse technology, especially OLAP(on line analytical processing) and data mining techniques, on the analytical knowledge creation in organizations, linking analytical knowledge creation to data analysis method through real world case studies.

  • PDF

Designing Cost Effective Open Source System for Bigdata Analysis (빅데이터 분석을 위한 비용효과적 오픈 소스 시스템 설계)

  • Lee, Jong-Hwa;Lee, Hyun-Kyu
    • Knowledge Management Research
    • /
    • v.19 no.1
    • /
    • pp.119-132
    • /
    • 2018
  • Many advanced products and services are emerging in the market thanks to data-based technologies such as Internet (IoT), Big Data, and AI. The construction of a system for data processing under the IoT network environment is not simple in configuration, and has a lot of restrictions due to a high cost for constructing a high performance server environment. Therefore, in this paper, we will design a development environment for large data analysis computing platform using open source with low cost and practicality. Therefore, this study intends to implement a big data processing system using Raspberry Pi, an ultra-small PC environment, and open source API. This big data processing system includes building a portable server system, building a web server for web mining, developing Python IDE classes for crawling, and developing R Libraries for NLP and visualization. Through this research, we will develop a web environment that can control real-time data collection and analysis of web media in a mobile environment and present it as a curriculum for non-IT specialists.

Students' Performance Prediction in Higher Education Using Multi-Agent Framework Based Distributed Data Mining Approach: A Review

  • M.Nazir;A.Noraziah;M.Rahmah
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.10
    • /
    • pp.135-146
    • /
    • 2023
  • An effective educational program warrants the inclusion of an innovative construction which enhances the higher education efficacy in such a way that accelerates the achievement of desired results and reduces the risk of failures. Educational Decision Support System (EDSS) has currently been a hot topic in educational systems, facilitating the pupil result monitoring and evaluation to be performed during their development. Insufficient information systems encounter trouble and hurdles in making the sufficient advantage from EDSS owing to the deficit of accuracy, incorrect analysis study of the characteristic, and inadequate database. DMTs (Data Mining Techniques) provide helpful tools in finding the models or forms of data and are extremely useful in the decision-making process. Several researchers have participated in the research involving distributed data mining with multi-agent technology. The rapid growth of network technology and IT use has led to the widespread use of distributed databases. This article explains the available data mining technology and the distributed data mining system framework. Distributed Data Mining approach is utilized for this work so that a classifier capable of predicting the success of students in the economic domain can be constructed. This research also discusses the Intelligent Knowledge Base Distributed Data Mining framework to assess the performance of the students through a mid-term exam and final-term exam employing Multi-agent system-based educational mining techniques. Using single and ensemble-based classifiers, this study intends to investigate the factors that influence student performance in higher education and construct a classification model that can predict academic achievement. We also discussed the importance of multi-agent systems and comparative machine learning approaches in EDSS development.

An Efficient Algorithm for Mining Frequent Closed Itemsets Using Transaction Link Structure (트랜잭션 연결 구조를 이용한 빈발 Closed 항목집합 마이닝 알고리즘)

  • Han, Kyong Rok;Kim, Jae Yearn
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.32 no.3
    • /
    • pp.242-252
    • /
    • 2006
  • Data mining is the exploration and analysis of huge amounts of data to discover meaningful patterns. One of the most important data mining problems is association rule mining. Recent studies of mining association rules have proposed a closure mechanism. It is no longer necessary to mine the set of all of the frequent itemsets and their association rules. Rather, it is sufficient to mine the frequent closed itemsets and their corresponding rules. In the past, a number of algorithms for mining frequent closed itemsets have been based on items. In this paper, we use the transaction itself for mining frequent closed itemsets. An efficient algorithm is proposed that is based on a link structure between transactions. Our experimental results show that our algorithm is faster than previously proposed methods. Furthermore, our approach is significantly more efficient for dense databases.

A Clustering Algorithm Considering Structural Relationships of Web Contents

  • Kang Hyuncheol;Han Sang-Tae;Sun Young-Su
    • Communications for Statistical Applications and Methods
    • /
    • v.12 no.1
    • /
    • pp.191-197
    • /
    • 2005
  • Application of data mining techniques to the world wide web, referred to as web mining, has been the focus of several recent researches. With the explosive growth of information sources available on the world wide web, it has become increasingly necessary to track and analyze their usage patterns. In this study, we introduce a process of pre-processing and cluster analysis on web log data and suggest a distance measure considering the structural relationships between web contents. Also, we illustrate some real examples of cluster analysis for web log data and look into practical application of web usage mining for eCRM.

Analysis of Purchase Process Using Process Mining (프로세스 마이닝을 이용한 구매 프로세스 분석)

  • Kim, Seul-Gi;Jung, Jae-Yoon
    • The Journal of Bigdata
    • /
    • v.3 no.1
    • /
    • pp.47-54
    • /
    • 2018
  • Previous studies of business process analysis have analyzed various factors such as task, customer service, operator convenience, and execution time prediction. To accurately analyze these factors, it is effective to utilize actual historical data recorded in information systems. Process mining is a technique for analyzing various elements of a business process from event log data. In this case study, process mining was applied to the transaction data of a purchase agency to analyze the business process of their procurement process, the execution time, and the operators.

Process analysis in Supply Chain Management with Process Mining: A Case Study (프로세스 마이닝 기법을 활용한 공급망 분석: 사례 연구)

  • Lee, Yonghyeok;Yi, Hojeong;Song, Minseok;Lee, Sang-Jin;Park, Sera
    • The Journal of Bigdata
    • /
    • v.1 no.2
    • /
    • pp.65-78
    • /
    • 2016
  • In the rapid change of business environment, it is crucial that several companies with core competence cooperate together in order to deliver competitive products to the market faster. Thus a lot of companies are participating in supply chains and SCM (Supply Chain Management) become more important. To efficiently manage supply chains, the analysis of data from SCM systems is required. In this paper, we explain how to analyze SCM related data with process mining techniques. After discussing the data requirement for process mining, several process mining techniques for the data analysis are explained. To show the applicability of the techniques, we have performed a case study with a company in South Korea. The case study shows that process mining is useful tool to analyze SCM data. On specifically, an overall process, several performance measures, and social networks can be easily discovered and analyzed with the techniques.

  • PDF

DSS Architectures to Support Data Mining Activities for Supply Chain Management (데이터 마이닝을 활용한 공급사슬관리 의사결정지원시스템의 구조에 관한 연구)

  • Jhee, Won-Chul;Suh, Min-Soo
    • Asia pacific journal of information systems
    • /
    • v.8 no.3
    • /
    • pp.51-73
    • /
    • 1998
  • This paper is to evaluate the application potentials of data mining in the areas of Supply Chain Management (SCM) and to suggest the architectures of Decision Support Systems (DSS) that support data mining activities. We first briefly introduce data mining and review the recent literatures on SCM and then evaluate data mining applications to SCM in three aspects: marketing, operations management and information systems. By analyzing the cases about pricing models in distribution channels, demand forecasting and quality control, it is shown that artificial intelligence techniques such as artificial neural networks, case-based reasoning and expert systems, combined with traditional analysis models, effectively mine the useful knowledge from the large volume of SCM data. Agent-based information system is addressed as an important architecture that enables the pursuit of global optimization of SCM through communication and information sharing among supply chain constituents without loss of their characteristics and independence. We expect that the suggested architectures of intelligent DSS provide the basis in developing information systems for SCM to improve the quality of organizational decisions.

  • PDF

A Study on the Methods for the Robust Job Stress Management for Nuclear Power Plant Workers using Response Surface Data Mining (반응표면 데이터마이닝 기법을 이용한 원전 종사자의 강건 직무 스트레스 관리 방법에 관한 연구)

  • Lee, Yonghee;Jang, Tong Il;Lee, Yong Hee
    • Journal of the Korean Society of Safety
    • /
    • v.28 no.1
    • /
    • pp.158-163
    • /
    • 2013
  • While job stress evaluations are reported in the recent surveys upon the nuclear power plants(NPPs), any significant advance in the types of questionnaires is not currently found. There are limitations to their usefulness as analytic tools for the management of safety resources in NPPs. Data mining(DM) has emerged as one of the key features for data computing and analysis to conduct a survey analysis. There are still limitations to its capability such as dimensionality associated with many survey questions and quality of information. Even though some survey methods may have significant advantages, often these methods do not provide enough evidence of causal relationships and the statistical inferences among a large number of input factors and responses. In order to address these limitations on the data computing and analysis capabilities, we propose an advanced procedure of survey analysis incorporating the DM method into a statistical analysis. The DM method can reduce dimensionality of risk factors, but DM method may not discuss the robustness of solutions, either by considering data preprocesses for outliers and missing values, or by considering uncontrollable noise factors. We propose three steps to address these limitations. The first step shows data mining with response surface method(RSM), to deal with specific situations by creating a new method called response surface data mining(RSDM). The second step follows the RSDM with detailed statistical relationships between the risk factors and the response of interest, and shows the demonstration the proposed RSDM can effectively find significant physical, psycho-social, and environmental risk factors by reducing the dimensionality with the process providing detailed statistical inferences. The final step suggest a robust stress management system which effectively manage job stress of the workers in NPPs as a part of a safety resource management using the surrogate variable concept.

A study on 3-step complex data mining in society indicator survey (사회지표조사에서의 3단계 복합 데이터마이닝의 적용 방안)

  • Cho, Kwang-Hyun;Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.5
    • /
    • pp.983-992
    • /
    • 2012
  • Social indicator survey can identify the state of society as a whole. When we create a policy, social indicator survey can reflect the public opinion of the region. Social indicator survey is an important measure of social change. Social indicator survey has been conducted in many municipalities (Seoul, Incheon, Busan, Ulsan, Gyeongsangnamdo, etc.). But, the result of social indicator survey analysis is mainly the basic statistical analysis. In this study, we propose a new data mining methodology for effective analysis. We propose a 3-step complex data mining in society indicator survey. 3-step complex data mining uses three data mining method (intervening association rule, clustering, decision tree).