Search | Korea Science

Customer Behavior Prediction of Binary Classification Model Using Unstructured Information and Convolution Neural Network: The Case of Online Storefront (비정형 정보와 CNN 기법을 활용한 이진 분류 모델의 고객 행태 예측: 전자상거래 사례를 중심으로)

Kim, Seungsoo;Kim, Jongwoo
- Journal of Intelligence and Information Systems
- /
- v.24 no.2
- /
- pp.221-241
- /
- 2018
Deep learning is getting attention recently. The deep learning technique which had been applied in competitions of the International Conference on Image Recognition Technology(ILSVR) and AlphaGo is Convolution Neural Network(CNN). CNN is characterized in that the input image is divided into small sections to recognize the partial features and combine them to recognize as a whole. Deep learning technologies are expected to bring a lot of changes in our lives, but until now, its applications have been limited to image recognition and natural language processing. The use of deep learning techniques for business problems is still an early research stage. If their performance is proved, they can be applied to traditional business problems such as future marketing response prediction, fraud transaction detection, bankruptcy prediction, and so on. So, it is a very meaningful experiment to diagnose the possibility of solving business problems using deep learning technologies based on the case of online shopping companies which have big data, are relatively easy to identify customer behavior and has high utilization values. Especially, in online shopping companies, the competition environment is rapidly changing and becoming more intense. Therefore, analysis of customer behavior for maximizing profit is becoming more and more important for online shopping companies. In this study, we propose 'CNN model of Heterogeneous Information Integration' using CNN as a way to improve the predictive power of customer behavior in online shopping enterprises. In order to propose a model that optimizes the performance, which is a model that learns from the convolution neural network of the multi-layer perceptron structure by combining structured and unstructured information, this model uses 'heterogeneous information integration', 'unstructured information vector conversion', 'multi-layer perceptron design', and evaluate the performance of each architecture, and confirm the proposed model based on the results. In addition, the target variables for predicting customer behavior are defined as six binary classification problems: re-purchaser, churn, frequent shopper, frequent refund shopper, high amount shopper, high discount shopper. In order to verify the usefulness of the proposed model, we conducted experiments using actual data of domestic specific online shopping company. This experiment uses actual transactions, customers, and VOC data of specific online shopping company in Korea. Data extraction criteria are defined for 47,947 customers who registered at least one VOC in January 2011 (1 month). The customer profiles of these customers, as well as a total of 19 months of trading data from September 2010 to March 2012, and VOCs posted for a month are used. The experiment of this study is divided into two stages. In the first step, we evaluate three architectures that affect the performance of the proposed model and select optimal parameters. We evaluate the performance with the proposed model. Experimental results show that the proposed model, which combines both structured and unstructured information, is superior compared to NBC(Naïve Bayes classification), SVM(Support vector machine), and ANN(Artificial neural network). Therefore, it is significant that the use of unstructured information contributes to predict customer behavior, and that CNN can be applied to solve business problems as well as image recognition and natural language processing problems. It can be confirmed through experiments that CNN is more effective in understanding and interpreting the meaning of context in text VOC data. And it is significant that the empirical research based on the actual data of the e-commerce company can extract very meaningful information from the VOC data written in the text format directly by the customer in the prediction of the customer behavior. Finally, through various experiments, it is possible to say that the proposed model provides useful information for the future research related to the parameter selection and its performance.
https://doi.org/10.13088/jiis.2018.24.2.221 인용 PDF KSCI

The Influences of Perfusion Temperature on Inflammatory and Hematologic Responses during Cardiopulmonary Bypass (체외순환시 염증과 혈액학적 반응에 대한 관류온도의 영향)

김상필;최석철;박동욱;한일용;이양행;조광현;황윤호
- Journal of Chest Surgery
- /
- v.37 no.10
- /
- pp.817-826
- /
- 2004
Background: Several studies have demonstrated that conventional hypothermic cardiopulmonary bypass (CPB) causes cellular injury, abnormal responses in peripheral vascular beds and increased postoperative bleeding, whereas normothermic CPB provides protection of the hypothermic-induced effects and better cardiac recovery. The present study was prospectively performed to compare the effects of normothermic CPB to those of hypothermic CPB on the inflammatory and hematologic responses during cardiac surgery. Material and Method: Thirty-four adult patients scheduled for elective cardiac surgery were randomly assigned to hypothermic CPB (nasopharyngeal temperature $26～28^{\circ}C,$ n=17) or normothermic CPB (nasopharyngeal $temperature>35.5^{\circ}C,$ n=17) group. In both groups, cold $(4^{\circ}C)$ crystalloid cardioplegia was applied for myocardial protection. Blood samples were drawn from radial artery before (Pre-CPB), 10 minutes after starting (CPB-10) and immediately after ending (CPB-OFF) CPB. Total leukocyte and platelet counts, interleukin-6 (IL-6) level(expressed as percent to the baseline of Pre-CPB), D-dimer level, protein C and protein S activity were measured with the blood samples. The amount of bleeding for postoperative 24 hours and blood transfusion after operation were also assessed. All parameters were compared between the two groups. Result: The total leukocyte counts $(10,032\pm65/mm^3)$ and the increased ratio of IL-6 $(353\pm7.0%)$ at CPB-OFF in the normothermic group were higher than that $(7,254\pm48/mm^3$ and $298\pm7.3%)$ of the hypothermic group(p=0.02 and p=0.03). In the normothermic group, protein C activity $(32\pm3.8%)$ and protein S activity $(35\pm4.1%)$ at CPB-OFF were significantly lower than that $(45\pm4.3%$ and $51\pm3.8%)$ of the hypothermic group (p=0.04 and p=0.009). However, there were no differences in platelet counts and D-dimer concentration. In the normothermic group, the amount of bleeding for postoperative 24 hours $(850\pm23.2$ mL) and requirements for blood transfusion after operation such as packed cell $(1,402\pm20.5$ mL), fresh frozen plasma $(970\pm20.8$ mL) and platelet $(252\pm6.4$ mL) were higher than that $(530\pm21.5$ mL, $696\pm15.7$ mL, $603\pm18.2$ mL and $50\pm0.0$ mL) of the hypothermic group. Conclusion: These results indicate that normothermic CPB with cold crystalloid cardioplegia was associated with higher increase in inflammatory response, hemostatic abnormalities and postoperative bleeding problem than moderate hypothermic CPB.
PDF KSCI

Sentiment Analysis of Movie Review Using Integrated CNN-LSTM Mode (CNN-LSTM 조합모델을 이용한 영화리뷰 감성분석)

Park, Ho-yeon;Kim, Kyoung-jae
- Journal of Intelligence and Information Systems
- /
- v.25 no.4
- /
- pp.141-154
- /
- 2019
Rapid growth of internet technology and social media is progressing. Data mining technology has evolved to enable unstructured document representations in a variety of applications. Sentiment analysis is an important technology that can distinguish poor or high-quality content through text data of products, and it has proliferated during text mining. Sentiment analysis mainly analyzes people's opinions in text data by assigning predefined data categories as positive and negative. This has been studied in various directions in terms of accuracy from simple rule-based to dictionary-based approaches using predefined labels. In fact, sentiment analysis is one of the most active researches in natural language processing and is widely studied in text mining. When real online reviews aren't available for others, it's not only easy to openly collect information, but it also affects your business. In marketing, real-world information from customers is gathered on websites, not surveys. Depending on whether the website's posts are positive or negative, the customer response is reflected in the sales and tries to identify the information. However, many reviews on a website are not always good, and difficult to identify. The earlier studies in this research area used the reviews data of the Amazon.com shopping mal, but the research data used in the recent studies uses the data for stock market trends, blogs, news articles, weather forecasts, IMDB, and facebook etc. However, the lack of accuracy is recognized because sentiment calculations are changed according to the subject, paragraph, sentiment lexicon direction, and sentence strength. This study aims to classify the polarity analysis of sentiment analysis into positive and negative categories and increase the prediction accuracy of the polarity analysis using the pretrained IMDB review data set. First, the text classification algorithm related to sentiment analysis adopts the popular machine learning algorithms such as NB (naive bayes), SVM (support vector machines), XGboost, RF (random forests), and Gradient Boost as comparative models. Second, deep learning has demonstrated discriminative features that can extract complex features of data. Representative algorithms are CNN (convolution neural networks), RNN (recurrent neural networks), LSTM (long-short term memory). CNN can be used similarly to BoW when processing a sentence in vector format, but does not consider sequential data attributes. RNN can handle well in order because it takes into account the time information of the data, but there is a long-term dependency on memory. To solve the problem of long-term dependence, LSTM is used. For the comparison, CNN and LSTM were chosen as simple deep learning models. In addition to classical machine learning algorithms, CNN, LSTM, and the integrated models were analyzed. Although there are many parameters for the algorithms, we examined the relationship between numerical value and precision to find the optimal combination. And, we tried to figure out how the models work well for sentiment analysis and how these models work. This study proposes integrated CNN and LSTM algorithms to extract the positive and negative features of text analysis. The reasons for mixing these two algorithms are as follows. CNN can extract features for the classification automatically by applying convolution layer and massively parallel processing. LSTM is not capable of highly parallel processing. Like faucets, the LSTM has input, output, and forget gates that can be moved and controlled at a desired time. These gates have the advantage of placing memory blocks on hidden nodes. The memory block of the LSTM may not store all the data, but it can solve the CNN's long-term dependency problem. Furthermore, when LSTM is used in CNN's pooling layer, it has an end-to-end structure, so that spatial and temporal features can be designed simultaneously. In combination with CNN-LSTM, 90.33% accuracy was measured. This is slower than CNN, but faster than LSTM. The presented model was more accurate than other models. In addition, each word embedding layer can be improved when training the kernel step by step. CNN-LSTM can improve the weakness of each model, and there is an advantage of improving the learning by layer using the end-to-end structure of LSTM. Based on these reasons, this study tries to enhance the classification accuracy of movie reviews using the integrated CNN-LSTM model.
https://doi.org/10.13088/jiis.2019.25.4.141 인용 PDF KSCI

Search Result 4,033, Processing Time 0.025 seconds

Customer Behavior Prediction of Binary Classification Model Using Unstructured Information and Convolution Neural Network: The Case of Online Storefront (비정형 정보와 CNN 기법을 활용한 이진 분류 모델의 고객 행태 예측: 전자상거래 사례를 중심으로)

The Influences of Perfusion Temperature on Inflammatory and Hematologic Responses during Cardiopulmonary Bypass (체외순환시 염증과 혈액학적 반응에 대한 관류온도의 영향)

Sentiment Analysis of Movie Review Using Integrated CNN-LSTM Mode (CNN-LSTM 조합모델을 이용한 영화리뷰 감성분석)

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)