• Title/Summary/Keyword: Data mining analysis

Search Result 2,174, Processing Time 0.042 seconds

A Methodology for Customer Core Requirement Analysis by Using Text Mining : Focused on Chinese Online Cosmetics Market (텍스트 마이닝을 활용한 사용자 핵심 요구사항 분석 방법론 : 중국 온라인 화장품 시장을 중심으로)

  • Shin, Yoon Sig;Baek, Dong Hyun
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.44 no.2
    • /
    • pp.66-77
    • /
    • 2021
  • Companies widely use survey to identify customer requirements, but the survey has some problems. First of all, the response is passive due to pre-designed questionnaire by companies which are the surveyor. Second, the surveyor needs to have good preliminary knowledge to improve the quality of the survey. On the other hand, text mining is an excellent way to compensate for the limitations of surveys. Recently, the importance of online review is steadily grown, and the enormous amount of text data has increased as Internet usage higher. Also, a technique to extract high-quality information from text data called Text Mining is improving. However, previous studies tend to focus on improving the accuracy of individual analytics techniques. This study proposes the methodology by combining several text mining techniques and has mainly three contributions. Firstly, able to extract information from text data without a preliminary design of the surveyor. Secondly, no need for prior knowledge to extract information. Lastly, this method provides quantitative sentiment score that can be used in decision-making.

Design and Implementation of Mobile CRM Utilizing Big Data Analysis Techniques (빅데이터 분석 기법을 활용한 모바일 CRM 설계 및 구현)

  • Kim, Young-Il;Yang, Seung-Su;Lee, Sang-Soon;Park, Seok-Cheon
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.14 no.6
    • /
    • pp.289-294
    • /
    • 2014
  • In the recent enterprises and are utilizing the CRM using data mining techniques and new marketing plan. However, data mining techniques are necessary expertise, general public access is difficult, it will be subject to constraints of time and space. in this paper, in order to solve this problem, we have proposed a Mobile CRM applying the data mining method. Thus, to analyze the structure of an existing CRM system, and defines the data flow and format. Also, define the process of the system, was designed sales trend analysis algorithm and customer sales recommendation algorithm using data mining techniques. Evaluation of the proposed system, through the test scenario to ensure proper operation, it was carried out the comparison and verification with the existing system. Results of the test, the value of existing programs and data matches to verify the reliability and use queries the proposed statistical tables to reduce the analysis time of data, it was verified rapidity.

Adaptive Data Mining Model using Fuzzy Performance Measures (퍼지 성능 측정자를 이용한 적응 데이터 마이닝 모델)

  • Rhee, Hyun-Sook
    • The KIPS Transactions:PartB
    • /
    • v.13B no.5 s.108
    • /
    • pp.541-546
    • /
    • 2006
  • Data Mining is the process of finding hidden patterns inside a large data set. Cluster analysis has been used as a popular technique for data mining. It is a fundamental process of data analysis and it has been Playing an important role in solving many problems in pattern recognition and image processing. If fuzzy cluster analysis is to make a significant contribution to engineering applications, much more attention must be paid to fundamental decision on the number of clusters in data. It is related to cluster validity problem which is how well it has identified the structure that Is present in the data. In this paper, we design an adaptive data mining model using fuzzy performance measures. It discovers clusters through an unsupervised neural network model based on a fuzzy objective function and evaluates clustering results by a fuzzy performance measure. We also present the experimental results on newsgroup data. They show that the proposed model can be used as a document classifier.

Personal Sentiment Analysis and Opinion Mining (개인감정분석과 마이닝)

  • Lee, Hyun Chang;Shin, Seong Yoon
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2017.07a
    • /
    • pp.344-345
    • /
    • 2017
  • Opinion mining and sentiment analysis(OMSA) as a research discipline has emerged during last 15 years and provides a methodology to computationally process the unstructured data mainly to extract opinions and identify their sentiments. The relatively new but fast growing research discipline has changed a lot during these years. This paper presents a scientometric analysis of research work done on OMSA during 2007-2016. For the literature analysis, research publications indexed in Web of Science (WoS) database are used as input data. The publication data is analyzed computationally to identify year-wise publication pattern, rate of growth of publications, research areas.

  • PDF

Polyclass in Data Mining (데이터 마이닝에서의 폴리클라스)

  • 구자용;박헌진;최대우
    • The Korean Journal of Applied Statistics
    • /
    • v.13 no.2
    • /
    • pp.489-503
    • /
    • 2000
  • Data mining means data analysis and model selection using various types of data in order to explore useful information and knowledge for making decisions. Examples of data mining include scoring for credit analysis of a new customer and scoring for churn management, where the customers with high scores are given special attention. In this paper, scoring is interpreted as a modeling process of the conditional probability and polyclass scoring method is described. German credit data, a PC communication company data and a mobile communication company data are used to compare the performance of polyclass scoring method with that of the scoring method based on a tree model.

  • PDF

Toward Successful Management of Vocational Rehabilitation Services for People with Disabilities: A Data Mining Approach

  • Kim, Yong Seog
    • Industrial Engineering and Management Systems
    • /
    • v.11 no.4
    • /
    • pp.371-384
    • /
    • 2012
  • This study proposes a multi-level data analysis approach to identify both superficial and latent relationships among variables in the data set obtained from a vocational rehabilitation (VR) services program of people with significant disabilities. At the first layer, data mining and statistical predictive models are used to extract the superficial relationships between dependent and independent variables. To supplement the findings and relationships from the analysis at the first layer, association rule mining algorithms at the second layer are employed to extract additional sets of interesting associative relationships among variables. Finally, nonlinear nonparametric canonical correlation analysis (NLCCA) along with clustering algorithm is employed to identify latent nonlinear relationships. Experimental outputs validate the usefulness of the proposed approach. In particular, the identified latent relationship indicates that disability types (i.e., physical and mental) and severity (i.e., severe, most severe, not severe) have a significant impact on the levels of self-esteem and self-confidence of people with disabilities. The identified superficial and latent relationships can be used to train education program designers and policy developers to maximize the outcomes of VR training programs.

A Six Sigma Methodology Using Data Mining : A Case Study of "P" Steel Manufacturing Company (데이터 마이닝 기반의 6 시그마 방법론 : 철강산업 적용사례)

  • Jang, Gil-Sang
    • The Journal of Information Systems
    • /
    • v.20 no.3
    • /
    • pp.1-24
    • /
    • 2011
  • Recently, six sigma has been widely adopted in a variety of industries as a disciplined, data-driven problem solving approach or methodology supported by a handful of powerful statistical tools in order to reduce variation through continuous process improvement. Also, data mining has been widely used to discover unknown knowledge from a large volume of data using various modeling techniques such as neural network, decision tree, regression analysis, etc. This paper proposes a six sigma methodology based on data mining for effectively and efficiently processing massive data in driving six sigma projects. The proposed methodology is applied in the hot stove system which is a major energy-consuming process in a "P" steel company for improvement of heat efficiency through reduction of energy consumption. The results show optimal operation conditions and reduction of the hot stove energy cost by 15%.

Frequency Analysis of Scientific Texts on the Hypoxia Using Bibliographic Data (논문 서지정보를 이용한 빈산소수괴 연구 분야의 연구용어 빈도분석)

  • Lee, GiSeop;Lee, JiYoung;Cho, HongYeon
    • Ocean and Polar Research
    • /
    • v.41 no.2
    • /
    • pp.107-120
    • /
    • 2019
  • The frequency analysis of scientific terms using bibliographic information is a simple concept, but as relevant data become more widespread, manual analysis of all data is practically impossible or only possible to a very limited extent. In addition, as the scale of oceanographic research has expanded to become much more comprehensive and widespread, the allocation of research resources on various topics has become an important issue. In this study, the frequency analysis of scientific terms was performed using text mining. The data used in the analysis is a general-purpose scholarship database, totaling 2,878 articles. Hypoxia, which is an important issue in the marine environment, was selected as a research field and the frequencies of related words were analyzed. The most frequently used words were 'Organic matter', 'Bottom water', and 'Dead zone' and specific areas showed high frequency. The results of this research can be used as a basis for the allocation of research resources to the frequency of use of related terms in specific fields when planning a large research project represented by single word.

Development of Car Accidents Person Fatality Model using Data Mining (데이터 마이닝을 이용한 차량 사고자 사망확률 모형)

  • Kim Cheon-Shik;Hong You-Shik;Jung Myung-Hee
    • Journal of the Institute of Electronics Engineers of Korea TC
    • /
    • v.43 no.9 s.351
    • /
    • pp.25-31
    • /
    • 2006
  • In this paper, a fatality model of car accident using data mining is proposed with the goal of reducing fatality of traffic accident. The analysis results with a proposed fatality model are utilized to improve a technology and environment for driving. For this, traffic accident data are collected, a data mining algorithm is applied to this data, and then, a fatality model of car accident is developed based on the analysis. The training data as well as test data are utilized to develop the fatality model. The important factors to cause fatality in traffic accidents can be investigated using the model. If these factors are taken into account in traffic policies and driving environment, it is expected that the fatality rate of traffic accident can be reduced hereafter.

A Study on Customer's Purchase Trend Using Association Rule (연관규칙을 이용한 고객의 구매경향에 관한 연구)

  • 임영문;최영두
    • Proceedings of the Safety Management and Science Conference
    • /
    • 2000.11a
    • /
    • pp.299-306
    • /
    • 2000
  • General definition of data mining is the knowledge discovery or is to extract hidden necessary information from large databases. Its technique can be applied into decision making, prediction, and information analysis through analyzing of relationship and pattern among data. One of the most important work is to find association rules in data mining. The objective of this paper is to find customer's trend using association rule from analysis of database and the result can be used as fundamental data for CRM(Customer Relationship Management). This paper uses Apriori algorithm and FoodMart data in order to find association rules.

  • PDF