• 제목/요약/키워드: Data Analyzing

검색결과 9,916건 처리시간 0.042초

Purchase Prediction by Analyzing Users' Online Behaviors Using Machine Learning and Information Theory Approaches

  • Kim, Minsung;Im, Il;Han, Sangman
    • Asia pacific journal of information systems
    • /
    • 제26권1호
    • /
    • pp.66-79
    • /
    • 2016
  • The availability of detailed data on customers' online behaviors and advances in big data analysis techniques enable us to predict consumer behaviors. In the past, researchers have built purchase prediction models by analyzing clickstream data; however, these clickstream-based prediction models have had several limitations. In this study, we propose a new method for purchase prediction that combines information theory with machine learning techniques. Clickstreams from 5,000 panel members and data on their purchases of electronics, fashion, and cosmetics products were analyzed. Clickstreams were summarized using the 'entropy' concept from information theory, while 'random forests' method was applied to build prediction models. The results show that prediction accuracy of this new method ranges from 0.56 to 0.83, which is a significant improvement over values for clickstream-based prediction models presented in the past. The results indicate further that consumers' information search behaviors differ significantly across product categories.

통계적 분석기법을 이용한 디젤기관의 고장진단 방법에 관한 연구 (The Fault Diagnosis Method of Diesel Engines Using a Statistical Analysis Method)

  • 김영일;오현경;유영호
    • Journal of Advanced Marine Engineering and Technology
    • /
    • 제30권2호
    • /
    • pp.247-252
    • /
    • 2006
  • Almost ship monitoring systems are event driven alarm system which warn only when the measurement value is over or under set point. These kinds of system cannot warn until signal is growing to abnormal state that the signal is over or under the set point. therefore cannot play a role for preventive maintenance system. This paper proposes fault diagnosis method which is able to diagnose and forecast the fault from present operating condition by analyzing monitored signals with present ship monitoring system without any additional sensors. By analyzing the data with high correlation coefficient(CC), correlation level of interactive data can be defined. Knowledge base of abnormal detection can be built by referring level of CC(Fault Detection CC. FDCC) to detect abnormal data among monitored data from monitoring system and knowledge base of diagnosis built by referring CC among interactive data for related machine each other to diagnose fault part.

Spatial Prediction Based on the Bayesian Kriging with Box-Cox Transformation

  • Choi, Jung-Soon;Park, Man-Sik
    • Communications for Statistical Applications and Methods
    • /
    • 제16권5호
    • /
    • pp.851-858
    • /
    • 2009
  • In the last decades, there has been much interest in climate variability because its change has dramatic effects on humanity. Especially, the precipitation data are measured over space and their spatial association is so complicated. So we should take into account such a spatial dependency structure while analyzing the data. However, in linear models for analyzing the data, data sets show severely skewed distribution. In the paper, we consider the Box-Cox transformation to satisfy the normal distribution prior to the analysis, and employ a Bayesian hierarchical framework to investigate the spatial patterns. The data set we considered is monthly average precipitation of the third quarter of 2007 obtained from 347 automated monitoring stations in Contiguous South Korea.

통계적분석기법을 이용한 디젤기관의 고장진단 방법에 관한 연구 (The Fault Diagnosis Method of Diesel Engines Using a Statistical Analysis Method)

  • 김영일;오현경;천행춘;유영호
    • 한국마린엔지니어링학회:학술대회논문집
    • /
    • 한국마린엔지니어링학회 2005년도 전기학술대회논문집
    • /
    • pp.281-286
    • /
    • 2005
  • Almost ship monitoring systems are event driven alarm system which warn only when the measurement value is over or under set point. These kinds of system cannot warn while signal is growing to abnormal state until the signal is over or under the set point and cannot play a role for preventive maintenance system. This paper proposes fault diagnosis method which is able to diagnose and forecast the fault from present operating condition by analyzing monitored signals with present ship monitoring system without additional sensors. By analyzing this data having high correlation coefficient(CC), correlation level of interactive data can be understood. Knowledge base of abnormal detection can be built by referring level of CC(Fault Detection CC, FDCC) to detect abnormal data among monitored data from monitoring system and knowledge base of diagnosis built by referring CC among interactive data for related machine each other to diagnose fault part.

  • PDF

Product Recommendation System based on User Purchase Priority

  • Bang, Jinsuk;Hwang, Doyeun;Jung, Hoekyung
    • Journal of information and communication convergence engineering
    • /
    • 제18권1호
    • /
    • pp.55-60
    • /
    • 2020
  • As personalized customer services create a society that emphasizes the personality of an individual, the number of product reviews and quantity of user data generated by users on the internet in mobile shopping apps and sites are increasing. Such product review data are classified as unstructured data. Unstructured data have the potential to be transformed into information that companies and users can employ, using appropriate processing and analyses. However, existing systems do not reflect the detailed information they collect, such as user characteristics, purchase preference, or purchase priority while analyzing review data. Thus, it is challenging to provide customized recommendations for various users. Therefore, in this study, we have developed a product recommendation system that takes into account the user's priority, which they select, when searching for and purchasing a product. The recommendation system then displays the results to the user by processing and analyzing their preferences. Since the user's preference is considered, the user can obtain results that are more relevant.

빅데이터 기반의 잠재적 붕괴위험 노후건축물 도출 방법 및 서울특별시 적용 연구 (The Method for Analyzing Potentially Collapsible Aged Buildings Using Big Data and its Application to Seoul)

  • 임혜연;박철영;조성현;이강
    • 대한건축학회논문집:계획계
    • /
    • 제35권2호
    • /
    • pp.139-146
    • /
    • 2019
  • The purpose of this study is to derive an improved method for analyzing old buildings with risk of collapse using public big data. Previous studies on the risk of building collapse focused on internal factors such as building age and structural vulnerability. However, this study suggests a method to derive potentially collapsible buildings considering not only internal factors of buildings but also external factors such as nearby new construction data. Based on the big data analysis, this study develops a system to visualize vulnerable buildings that require safety diagnosis and proposed a future utilization plan.

요인분석시 형태요인을 도출시키기 위한 자료 변환 연구 - 20대 남성 체형을 중심으로 - (Data Transition Research to Derive Shape Factor in Analyzing Factors -Centering on the body shape of males in twenties-)

  • 석혜정;김인숙
    • 대한가정학회지
    • /
    • 제40권2호
    • /
    • pp.67-86
    • /
    • 2002
  • The purpose of this research is to offer a method to derive the body shapes and pose factors with resets from analyzing factors by using the measurement values of human body. 297 male subjects in their twenties participated in this study. 54 anthropometric and 35 photographic measurements were taken from each subject. Data is the several typical items selected among items and you have to make an index value by using these typical items and convert it. The index has high correlation with each item, is easy to be measured and is to be used after selecting an important item in designing prototype. With results of analyzing reliability level by each item, correlation and items that factor loading is low, the item that the correlation is high is to be removed within the range that maintains the reliability level. Five factors of the front part and seven factors of the profile came to be derived from this kind of process.

신생산기술 실행과정상에 조직 적응노력의 효과성 분석

  • 이상곤;이진주
    • 한국경영과학회:학술대회논문집
    • /
    • 대한산업공학회/한국경영과학회 1994년도 춘계공동학술대회논문집; 창원대학교; 08월 09일 Apr. 1994
    • /
    • pp.627-632
    • /
    • 1994
  • This study suggests the contingency framework understanding the implementation of NIMT and conducts empirical analysis using survey data. The main objectives of the thesis are as follows : (1) analyzing the relationship between adaptation requirements and organizational adaptation efforts. (2) analyzing the relationship between organizational adaptation efforts and implementation performance, and the moderating effect of the adaptation requirements. (3) analyzing the relationship between organizational adaptation efforts and organization characteristics(organization size, technical capability, top management support). A field study was undertaken to test the hypothesized relationships among adaptation efforts, adaptations requirements, organization characteristics, implementation performance. Data were collected from 52 NMT implementation projects of 45 machine tool, metal component firms. The emprical relflts indicated that adaptation requirrnents, organization characteristics were significantly related to organizational adaptation efforts and adaptation efforts can significantly improve implementation perfomance, but the moderating effect of adaptation requirements on adaptation efforts and implementation performance was not supported.

한국과 일본 5학년 과학 교과서 내용 분석 (Content Analysis of the 5th grade Science Textbooks in Japan and Korea)

  • 김효남;이영미
    • 한국과학교육학회지
    • /
    • 제15권4호
    • /
    • pp.452-458
    • /
    • 1995
  • Science textbooks are very important materials in order to know elementary science learning in Japan and Korea. In this research the 5th grade science textbooks in Japan and Korea are analyzed by an analyzing category. The analyzing category is consisted of knowledge and scientific inquiry. Knowledge is divided by fact, concept, and rule. Scientific inquiry is divided by problem cognition, variable control, experiment planning, observing, measuring, categorizing, inferring, data transformation, predicting, correlation, cause and effect, result, communication, which are 13 subcategories. Analyzing methods are counting the frequency of each subcategory and tabulating the data. The results of this study are: 1. The frequency of scientific inquiry appeared in Korean 5th grade science textbooks is three times more than that in Japanese textbooks. 2. In scientific inquiry category, Japanese science textbooks emphasized observing, predicting, measuring and problem cognition; Korean science textbooks emphasized experiment planning, observing and problem cognition. 3. In knowledge category, fact subcategory is mostly emphasized in both countries.

  • PDF

Data Mining을 이용한 전략시뮬레이션 게임 데이터 분석 (A Study of Analyzing Realtime Strategy Game Data using Data Mining)

  • 용혜련;김도진;황현석
    • 한국게임학회 논문지
    • /
    • 제15권4호
    • /
    • pp.59-68
    • /
    • 2015
  • 정보통신기술의 발달로 빅데이터 분석을 통해 사람들 일상의 기록과 잠재적 요구까지 통찰할 수 있게 되었으며, 우리의 일상 속에서 방대한 정보를 실시간으로 도출하고 있다. 여러 산업이나 기업에서 이미 빅데이터와 결합시켜 비즈니스 등 다양한 분야에 활용하고 있지만 게임 산업에서의 빅데이터 활용은 아직까지 미흡한 실정이다. 이에 본 연구에서는 데이터 마이닝을 기법을 적용하여 전략시뮬레이션 게임 데이터를 분석하였다. 전략시뮬레이션 게임 데이터를 Decision Tree, Random Forest, Multi-class SVM, Linear Regression 분석 기법을 적용하여 게임 유저의 게임수준에 영향을 미치는 요인을 분석하였다. 게임수준을 예측하는데 있어 가장 우수한 성능을 보인 기법과 변수들을 도출하여 게임 디자인과 사용성을 증대시키기 위한 제안을 하고자 한다.