• Title/Summary/Keyword: web data mining

Search Result 408, Processing Time 0.022 seconds

Analysis of Web Log for e-CRM on B2B of the Make-To-Order Company (수주생산기업 B2B에서 e-CRM을 위한 웹 로그 분석)

  • Go, Jae-Moon;Seo, Jun-Yong;Kim, Woon-Sik
    • IE interfaces
    • /
    • v.18 no.2
    • /
    • pp.205-220
    • /
    • 2005
  • This study presents a web log analysis model for e-CRM, which combines the on-line customer's purchasing pattern data and transaction data between companies in B2B environment of make-to-order company. With this study, the customer evaluation and the customer subdivision are available. We can forecast the estimate demands with periodical products sales records. Also, the purchasing rate per each product, the purchasing intention rate, and the purchasing rate per companies can be used as the basic data for the strategy for receiving the orders in future. These measures are used to evaluate the business strategy, the quality ability on products, the customer's demands, the benefits of customer and the customer's loyalty. And it is used to evaluate the customer's purchasing patterns, the response analysis, the customer's secession rate, the earning rate, and the customer's needs. With this, we can satisfy various customers' demands, therefore, we can multiply the company's benefits. And we presents case of the 'H' company, which has the make-to-order manufacture environment, in order to verify the effect of the proposal system.

Distributed FTP Server for Log Mining System on ACE (분산 FTP 서버의 ACE 기반 로그 마이닝 시스템)

  • Min, Su-Hong;Cho, Dong-Sub
    • Proceedings of the KIEE Conference
    • /
    • 2002.11c
    • /
    • pp.465-468
    • /
    • 2002
  • Today large corporations are constructing distributed server environment. Many corporations are respectively operating Web server, FTP server, Mail server and DB server on heterogeneous operation. However, there is the problem that a manager must manage each server individually. In this paper, we present distributed FTP server for log mining system on ACE. Proposed log mining system is based upon ACE (Adaptive Communication Environment) framework and data mining techniques. This system provides a united operation with distributed FTP server.

  • PDF

Implementation of a Web-Based Intelligent Decision Support System for Apartment Auction (아파트 경매를 위한 웹 기반의 지능형 의사결정지원 시스템 구현)

  • Na, Min-Yeong;Lee, Hyeon-Ho
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.11
    • /
    • pp.2863-2874
    • /
    • 1999
  • Apartment auction is a system that is used for the citizens to get a house. This paper deals with the implementation of a web-based intelligent decision support system using OLAP technique and data mining technique for auction decision support. The implemented decision support system is working on a real auction database and is mainly composed of OLAP Knowledge Extractor based on data warehouse and Auction Data Miner based on data mining methodology. OLAP Knowledge Extractor extracts required knowledge and visualizes it from auction database. The OLAP technique uses fact, dimension, and hierarchies to provide the result of data analysis by menas of roll-up, drill-down, slicing, dicing, and pivoting. Auction Data Miner predicts a successful bid price by means of applying classification to auction database. The Miner is based on the lazy model-based classification algorithm and applies the concepts such as decision fields, dynamic domain information, and field weighted function to this algorithm and applies the concepts such as decision fields, dynamic domain information, and field weighted function to this algorithm to reflect the characteristics of auction database.

  • PDF

Understanding the Food Hygiene of Cruise through the Big Data Analytics using the Web Crawling and Text Mining

  • Shuting, Tao;Kang, Byongnam;Kim, Hak-Seon
    • Culinary science and hospitality research
    • /
    • v.24 no.2
    • /
    • pp.34-43
    • /
    • 2018
  • The objective of this study was to acquire a general and text-based awareness and recognition of cruise food hygiene through big data analytics. For the purpose, this study collected data with conducting the keyword "food hygiene, cruise" on the web pages and news on Google, during October 1st, 2015 to October 1st, 2017 (two years). The data collection was processed by SCTM which is a data collecting and processing program and eventually, 899 kb, approximately 20,000 words were collected. For the data analysis, UCINET 6.0 packaged with visualization tool-Netdraw was utilized. As a result of the data analysis, the words such as jobs, news, showed the high frequency while the results of centrality (Freeman's degree centrality and Eigenvector centrality) and proximity indicated the distinct rank with the frequency. Meanwhile, as for the result of CONCOR analysis, 4 segmentations were created as "food hygiene group", "person group", "location related group" and "brand group". The diagnosis of this study for the food hygiene in cruise industry through big data is expected to provide instrumental implications both for academia research and empirical application.

Sequential Pattern Mining with Optimization Calling MapReduce Function on MapReduce Framework (맵리듀스 프레임웍 상에서 맵리듀스 함수 호출을 최적화하는 순차 패턴 마이닝 기법)

  • Kim, Jin-Hyun;Shim, Kyu-Seok
    • The KIPS Transactions:PartD
    • /
    • v.18D no.2
    • /
    • pp.81-88
    • /
    • 2011
  • Sequential pattern mining that determines frequent patterns appearing in a given set of sequences is an important data mining problem with broad applications. For example, sequential pattern mining can find the web access patterns, customer's purchase patterns and DNA sequences related with specific disease. In this paper, we develop the sequential pattern mining algorithms using MapReduce framework. Our algorithms distribute input data to several machines and find frequent sequential patterns in parallel. With synthetic data sets, we did a comprehensive performance study with varying various parameters. Our experimental results show that linear speed up can be achieved through our algorithms with increasing the number of used machines.

Exploring Information Ethics Issues based on Text Mining using Big Data from Web of Science (Web of Science 빅데이터를 활용한 텍스트 마이닝 기반의 정보윤리 이슈 탐색)

  • Kim, Han Sung
    • The Journal of Korean Association of Computer Education
    • /
    • v.22 no.3
    • /
    • pp.67-78
    • /
    • 2019
  • The purpose of this study is to explore information ethics issues based on academic big data from Web of Science (WoS) and to provide implications for information ethics education in informatics subject. To this end, 318 published papers from WoS related to information ethics were text mined. Specifically, this paper analyzed the frequency of key-words(TF, DF, TF-IDF), information ethics issues using topic modeling, and frequency of appearances by year for each issue. This paper used 'tm', 'topicmodel' package of R for text mining. The main results are as follows. First, this paper confirmed that the words 'digital', 'student', 'software', and 'privacy' were the main key-words through TF-IDF. Second, the topic modeling analysis showed 8 issues such as 'Professional value', 'Cyber-bullying', 'AI and Social Impact' et al., and the proportion of 'Professional value' and 'Cyber-bullying' was relatively high. This study discussed the implications for information ethics education in Korea based on the results of this analysis.

On development of supporting tool for Folksonomy Mining based on Formal Concept Analysis (형식개념분석을 이용한 폭소노미 마이닝 기법과 지원도구의 개발)

  • Kang, Yu-Kyung;Hwang, Suk-Hyung;Yang, Hae-Sool
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.10 no.8
    • /
    • pp.1877-1893
    • /
    • 2009
  • Folksonomy is a user-generated taxonomy to organize information by which a user assigns tags to resources published on the web. Triadic datas that indicate relations of between users, tags, and resources, are created by collaborative tagging from many users in folksonomy-based system. Such the folksonomy data has been utilized in the field of the semantic web and web2.0 as metadata about web resources. In this paper, we propose FCA-based folksonomy data mining approach in order to extract the useful information from folksonomy data with various points of view. And we developed tool for supporting our approach. In order to verify the usefulness of our proposed approach and FMT, we have done some experiments for data of del.icio.us, which is a popular folksonomy-based bookmarking system. And we report about result of our experiments.

A study on the Analysis and Forecast of Effect Factors in e-Learning Reuse Intention Using Rule Induction Techniques (규칙유도기법을 이용한 이러닝 시스템의 재이용의도 영향요인 분석 및 예측에 관한 연구)

  • Bae, Jae-Kwon;Kim, Jin-Hwa;Jeong, Hwa-Min
    • Journal of Information Technology Applications and Management
    • /
    • v.17 no.2
    • /
    • pp.71-90
    • /
    • 2010
  • Electronic learning(or e-learning) has created hype for companies, universities, and other educational institutions. It has led to the phenomenal growth in the use of web-based learning and experimentation with multimedia, video conferencing, and internet-based technologies. Many researchers are interested in the factors that affect to the performance of e-learning or e-learning services. In this sense, this study is aimed at proposing e-learning system reuse prediction models in which e-learner intention to reuse influence factors(i.e., system accessibility, system stability, information clarity, information validity, self-regulated efficacy, computer self-efficacy, perceived usefulness, perceived ease of use, flow, and parental expectation) affect e-learner intention to reuse positively. A web survey was conducted for the full members of the e-learning education institute A in Seoul, Republic of Korea, an exclusive e-learning company that provides real time video lectures via the desktop conferencing system. The web survey was conducted for 20 days from November 5, 2009, through the e-learning web site of the company A. In this study, three data mining techniques were used : the multivariate discriminant analysis, CART, and C5.0 algorithm. This study was conducted to provide the e-learning service providers, e-learning operators, and contents developers with marketing and management strategies for improving the e-learning service companies, based on the data mining analysis results.

  • PDF

Design and Implementation of specialized Web 2.0 Travel Agency System (특화된 웹2.0 여행사 시스템의 설계 및 구현)

  • Kim, Jung Sook;Lee, Ya Ri;Hong, Kyung Pyo
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.5 no.1
    • /
    • pp.9-22
    • /
    • 2009
  • This paper is an explanation of a design and an implementation of Web 2.0 online travel agency system for frequent decision-making. On the Web 2.0 travel agency system, optimized information is obtained by applying data mining technology such as association rules, decision trees, and neural networks, and this system is a unified system that consists of the block systems of hotels, ground traffic, and flights in tour packages of a travel agency system. Furthermore, it is implemented to manage the system that is not for the administrator of a travel agency system, but for users or communities that use the system need their own information. The expected effect of this system is to maximize the investment company's efficiency through a new-concept interest model created by B2C customers, and also B2B small and medium-sized travel agencies adopting the system. As a result, it is a system that stimulates dormant customer activity and prevents good customers from leaving by maximizing the merit and capacity of the existed web site for marketing. Moreover, this system is also a model for people who plan customized travel agency business, and will show a way for the domestic and international travel agency industry's globalization.

A Study on Development of A Web-Based Forecasting System of Industrial Accidents (웹 기반의 산업재해 예측시스템 개발에 관한 연구)

  • Leem, Young-Moon;Hwang, Young-Seob;Choi, Yo-Han
    • Proceedings of the Safety Management and Science Conference
    • /
    • 2007.11a
    • /
    • pp.269-274
    • /
    • 2007
  • Ultimate goal of this research is to develop a web-based forecasting system of industrial accidents. As an initial step for the purpose of this study, this paper provides a comparative analysis of 4 kinds of algorithms including CHAID, CART, C4.5, and QUEST. In addition, this paper presents the logical process for development of a forecasting system. Decision tree algorithm is utilized to predict results using objective and quantified data as a typical technique of data mining. The sample for this work was chosen from 10,536 data related to manufacturing industries during three years(2002$^{\sim}$2004) in korea.

  • PDF