• Title/Summary/Keyword: transaction data (거래 데이터)


Customer Behavior Prediction of Binary Classification Model Using Unstructured Information and Convolution Neural Network: The Case of Online Storefront (비정형 정보와 CNN 기법을 활용한 이진 분류 모델의 고객 행태 예측: 전자상거래 사례를 중심으로)

  • Kim, Seungsoo;Kim, Jongwoo
    • Journal of Intelligence and Information Systems / v.24 no.2 / pp.221-241 / 2018
  • Deep learning has recently attracted considerable attention. The deep learning technique applied in the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) and in AlphaGo is the Convolutional Neural Network (CNN). A CNN divides the input image into small sections, recognizes partial features in each, and combines them to recognize the whole. Deep learning technologies are expected to bring many changes to our lives, but so far their applications have been largely limited to image recognition and natural language processing. The use of deep learning for business problems is still at an early research stage. If its performance is proven, deep learning can be applied to traditional business problems such as marketing response prediction, fraudulent transaction detection, and bankruptcy prediction. It is therefore a meaningful experiment to assess whether deep learning can solve business problems, using the case of online shopping companies, which hold big data, make customer behavior relatively easy to identify, and offer high utilization value. Competition among online shopping companies in particular is changing rapidly and becoming more intense, so analyzing customer behavior to maximize profit is increasingly important for them. In this study, we propose a 'CNN model of Heterogeneous Information Integration' as a way to improve the prediction of customer behavior in online shopping enterprises. 
The proposed model learns with a convolutional neural network that has a multi-layer perceptron structure and combines structured and unstructured information. To optimize its performance, we design three architectural components, 'heterogeneous information integration', 'unstructured information vector conversion', and 'multi-layer perceptron design', evaluate the performance of each, and confirm the proposed model based on the results. The target variables for predicting customer behavior are defined as six binary classification problems: re-purchaser, churn, frequent shopper, frequent refund shopper, high amount shopper, and high discount shopper. To verify the usefulness of the proposed model, we conducted experiments using actual transaction, customer, and VOC (voice of customer) data from a specific online shopping company in Korea. The data cover 47,947 customers who registered at least one VOC in January 2011 (one month). We use the profiles of these customers, a total of 19 months of transaction data from September 2010 to March 2012, and the VOCs posted during that month. The experiment is divided into two stages. In the first stage, we evaluate the three architectural components that affect the performance of the proposed model and select optimal parameters. In the second stage, we evaluate the performance of the proposed model. Experimental results show that the proposed model, which combines structured and unstructured information, outperforms NBC (Naïve Bayes classification), SVM (support vector machine), and ANN (artificial neural network). The results therefore indicate that unstructured information contributes to predicting customer behavior, and that CNN can be applied to business problems as well as to image recognition and natural language processing problems. 
The experiments also confirm that CNN is effective in understanding and interpreting contextual meaning in text VOC data. This empirical research on actual data from an e-commerce company shows that very meaningful information for predicting customer behavior can be extracted from VOC data written directly by customers in text form. Finally, the various experiments provide useful information for future research on parameter selection and its effect on performance.
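
The idea of 'heterogeneous information integration' described above can be illustrated with a minimal pure-Python sketch. It is not the authors' architecture: the function names, the single ReLU convolution filter with max pooling, and the simple concatenation step are all assumptions made for illustration. The sketch turns a sequence of token vectors (standing in for vectorized VOC text) into one feature per filter and appends those features to a customer's structured features, producing one input vector that a multi-layer perceptron could consume.

```python
def conv1d_max_pool(embedded_tokens, kernel):
    """Slide one convolution filter over token vectors, apply ReLU, then max-pool.

    embedded_tokens: list of token vectors (each a list of floats).
    kernel: list of weight vectors, one per position in the filter window.
    Both names are hypothetical; the paper does not specify this interface.
    """
    width = len(kernel)
    activations = []
    for start in range(len(embedded_tokens) - width + 1):
        window = embedded_tokens[start:start + width]
        # Dot product between the filter and the current window of tokens.
        total = sum(
            w * x
            for kernel_row, token in zip(kernel, window)
            for w, x in zip(kernel_row, token)
        )
        activations.append(max(0.0, total))  # ReLU
    # Max pooling keeps the strongest response; 0.0 if the text is too short.
    return max(activations) if activations else 0.0


def integrate(structured_features, embedded_tokens, kernels):
    """Concatenate structured features with one pooled text feature per filter."""
    text_features = [conv1d_max_pool(embedded_tokens, k) for k in kernels]
    return list(structured_features) + text_features
```

For example, `integrate([0.5, 3.0], tokens, [kernel])` returns the two structured values followed by one text feature per filter. Concatenation is only one possible way to merge the two information types.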

The Adaptive Personalization Method According to Users Purchasing Index : Application to Beverage Purchasing Predictions (고객별 구매빈도에 동적으로 적응하는 개인화 시스템 : 음료수 구매 예측에의 적용)

  • Park, Yoon-Joo
    • Journal of Intelligence and Information Systems / v.17 no.4 / pp.95-108 / 2011
  • This study proposes a personalization method that intelligently adapts the level of clustering to a customer's purchasing index. In the e-business era, many companies gather customers' demographic and transactional information such as age, gender, purchase date, and product category. They use this information to predict customers' preferences or purchasing patterns so that they can provide more customized services. The conventional Customer-Segmentation method provides customized services for each customer group. It clusters the whole customer set into groups based on similarity and builds a predictive model for each resulting group. It thus keeps the number of predictive models manageable and, by drawing on the data of similar customers, supplies more data for customers who do not have enough to build a good predictive model on their own. However, this method often fails to provide highly personalized services to each customer, which is especially important for VIP customers. Furthermore, it clusters customers who already have a considerable amount of data together with customers who have only a small amount, which increases computational cost unnecessarily without significant performance improvement. The other conventional method, the 1-to-1 method, provides more customized services than the Customer-Segmentation method because each predictive model is built using only the data of the individual customer. This method not only provides highly personalized services but also builds a relatively simple and less costly model for each customer. However, the 1-to-1 method does not produce a good predictive model when a customer has only a few data points. In other words, if a customer has too few transactional records, the performance of this method deteriorates. 
To overcome the limitations of these two conventional methods, we suggest a new method, the Intelligent Customer Segmentation method, which provides adaptive personalized services according to the customer's purchasing index. The suggested method clusters customers according to their purchasing index, so that predictions for customers who purchase less are based on data from more intensively clustered groups, while VIP customers, who already have a considerable amount of data, are clustered to a much lesser extent or not at all. The main idea is to apply a clustering technique only when the number of transactional records of the target customer is less than a predefined criterion data size. To find this criterion, we suggest an algorithm called sliding window correlation analysis. The algorithm aims to find the transactional data size at which the performance of the 1-to-1 method drops sharply due to data sparsity. After finding this criterion data size, we apply the conventional 1-to-1 method to customers who have more data than the criterion, and apply a clustering technique to customers who have less, until they can use at least the criterion amount of data for model building. We apply the two conventional methods and the newly suggested method to Nielsen's beverage purchasing data to predict customers' purchasing amounts and purchasing categories. We use two data mining techniques (Support Vector Machine and Linear Regression) and two performance measures (MAE and RMSE) to predict these two dependent variables. The results show that the suggested Intelligent Customer Segmentation method can outperform the conventional 1-to-1 method in many cases and achieves the same level of performance as the Customer-Segmentation method at much lower computational cost.
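
The adaptive rule described above (use only a customer's own data when there is enough of it, otherwise borrow data from similar customers until the criterion size is reached) can be sketched as follows. This is a hedged illustration, not the paper's algorithm: the function name, the data layout, and the use of a precomputed most-similar-first neighbor list as a stand-in for clustering are assumptions. The criterion value is passed in and would come from something like the sliding window correlation analysis, which is not reproduced here.

```python
def select_training_data(target_id, transactions, similarity_order, criterion):
    """Pick the records used to build a predictive model for one customer.

    transactions: dict mapping customer id -> list of transaction records.
    similarity_order: other customer ids, most similar first (hypothetical
        stand-in for the clustering step in the paper).
    criterion: minimum number of records needed for a usable model.
    Returns (records, mode), where mode is "1-to-1" or "clustered".
    """
    own = list(transactions[target_id])  # copy so the input is not mutated
    if len(own) >= criterion:
        # Enough data: build the model from this customer's records only.
        return own, "1-to-1"

    pooled = own
    for other_id in similarity_order:
        if len(pooled) >= criterion:
            break  # stop borrowing once the criterion size is reached
        pooled.extend(transactions[other_id])
    return pooled, "clustered"
```

A customer with plentiful history is left unclustered, while a sparse customer pulls in neighbors one at a time, so the degree of pooling shrinks as the customer's own data grows.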

A Study on Trust Transfer in Traditional Fintech of Smart Banking (핀테크 서비스에서 오프라인에서 온라인으로의 신뢰전이에 관한 연구 - 스마트뱅킹을 중심으로 -)

  • Ai, Di;Kwon, Sun-Dong;Lee, Su-Chul;Ko, Mi-Hyun;Lee, Bo-Hyung
    • Management & Information Systems Review / v.36 no.3 / pp.167-184 / 2017
  • In this study, we investigated the effect of offline banking trust on smart banking trust. As factors influencing smart banking trust, this study compared offline banking trust with smart banking's system quality and information quality. For the empirical study, 186 questionnaire responses were collected from smart banking users and analyzed using Smart-PLS 2.0. The results verified that trust transfer occurs in FinTech services, as shown by the significant effect of offline banking trust on smart banking trust. They also showed that the effect of offline banking trust on smart banking trust is weaker than that of smart banking itself. This study contributes in both academic and industrial respects. Academically, previous studies on banking focused on either offline banking or smart banking, whereas this study focused on the relationship between offline and online banking and showed that offline banking trust affects smart banking trust. Industrially, this study showed that the offline banking characteristics of traditional commercial banks affect trust in emerging smart banking services. This means that emerging FinTech companies are at a disadvantage relative to traditional commercial banks in the competition to build trust. Unlike traditional commercial banks, emerging FinTech companies are improving customer convenience by drawing on new technologies such as the mobile Internet, social networks, cloud technology, and big data. However, these FinTech strengths alone cannot guarantee the trust needed for financial transactions, because banking customers do not easily change the habits or inertia they have acquired while using traditional banks. 
Therefore, emerging FinTech companies should strive to create disruptive value that reflects their connection with various Internet services and the strength of online interaction, such as social services, where they have an advantage in customer contact. Emerging FinTech companies should also strive to build service trust, focusing on young people, who have low resistance to new services.


Optimization of Multiclass Support Vector Machine using Genetic Algorithm: Application to the Prediction of Corporate Credit Rating (유전자 알고리즘을 이용한 다분류 SVM의 최적화: 기업신용등급 예측에의 응용)

  • Ahn, Hyunchul
    • Information Systems Review / v.16 no.3 / pp.161-177 / 2014
  • Corporate credit rating assessment consists of complicated processes in which various factors describing a company are taken into consideration. Such assessment is known to be very expensive because domain experts must be employed to assess the ratings. As a result, data-driven corporate credit rating prediction using statistical and artificial intelligence (AI) techniques has received considerable attention from researchers and practitioners. In particular, statistical methods such as multiple discriminant analysis (MDA) and multinomial logistic regression analysis (MLOGIT), and AI methods including case-based reasoning (CBR), artificial neural network (ANN), and multiclass support vector machine (MSVM), have been applied to corporate credit rating. Among them, MSVM has recently become popular because of its robustness and high prediction accuracy. In this study, we propose a novel optimized MSVM model and apply it to corporate credit rating prediction in order to enhance accuracy. Our model, named 'GAMSVM (Genetic Algorithm-optimized Multiclass Support Vector Machine),' is designed to simultaneously optimize the kernel parameters and the feature subset selection. Prior studies such as Lorena and de Carvalho (2008) and Chatterjee (2013) show that proper kernel parameters may improve the performance of MSVMs. Also, results from studies such as Shieh and Yang (2008) and Chatterjee (2013) imply that appropriate feature selection may lead to higher prediction accuracy. Based on these prior studies, we propose to apply GAMSVM to corporate credit rating prediction. As a tool for optimizing the kernel parameters and the feature subset selection, we suggest the genetic algorithm (GA). GA is known as an efficient and effective search method that attempts to simulate biological evolution. By applying genetic operations such as selection, crossover, and mutation, it is designed to gradually improve the search results. 
In particular, the mutation operator helps prevent GA from falling into local optima, so that a globally optimal or near-optimal solution can be found. GA has been widely applied to search for optimal parameters or feature subsets of AI techniques including MSVM. For these reasons, we also adopt GA as an optimization tool. To empirically validate the usefulness of GAMSVM, we applied it to a real-world case of credit rating in Korea. Our application is in bond rating, which is the most frequently studied area of credit rating for specific debt issues or other financial obligations. The experimental dataset was collected from a large credit rating company in South Korea. It contained 39 financial ratios of 1,295 companies in the manufacturing industry, together with their credit ratings. Using various statistical methods including one-way ANOVA and stepwise MDA, we selected 14 financial ratios as the candidate independent variables. The dependent variable, i.e. credit rating, was labeled as four classes: 1 (A1); 2 (A2); 3 (A3); 4 (B and C). Eighty percent of the data for each class was used for training, and the remaining 20 percent was used for validation. To overcome the small sample size, we applied five-fold cross validation to our dataset. To examine the competitiveness of the proposed model, we also experimented with several comparative models including MDA, MLOGIT, CBR, ANN, and MSVM. For MSVM, we adopted the One-Against-One (OAO) and DAGSVM (Directed Acyclic Graph SVM) approaches because they are known to be the most accurate among the various MSVM approaches. GAMSVM was implemented using LIBSVM, an open-source software package, and Evolver 5.5, a commercial software package that enables GA. The other comparative models were run using various statistical and AI packages such as SPSS for Windows, Neuroshell, and Microsoft Excel VBA (Visual Basic for Applications). Experimental results showed that the proposed model, GAMSVM, outperformed all the competing models. 
In addition, the model was found to use fewer independent variables while showing higher accuracy. In our experiments, five variables, X7 (total debt), X9 (sales per employee), X13 (years after founded), X15 (accumulated earning to total asset), and X39 (the index related to the cash flows from operating activity), were found to be the most important factors in predicting corporate credit ratings. However, the values of the finally selected kernel parameters were found to be almost the same across the data subsets. To examine whether the predictive performance of GAMSVM was significantly greater than that of the other models, we used the McNemar test. As a result, we found that GAMSVM was better than MDA, MLOGIT, CBR, and ANN at the 1% significance level, and better than OAO and DAGSVM at the 5% significance level.
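
The McNemar test mentioned above compares two classifiers on the same validation cases by looking only at the cases where they disagree. The sketch below is a minimal illustration under stated assumptions: the abstract does not say which variant was used, so the continuity-corrected chi-square form is assumed here, and the function name and the example counts are hypothetical rather than taken from the paper.

```python
import math


def mcnemar_test(b, c):
    """Continuity-corrected McNemar test for two classifiers on the same cases.

    b: number of cases model A classified correctly and model B incorrectly.
    c: number of cases model A classified incorrectly and model B correctly.
    Returns (chi-square statistic, p-value) with 1 degree of freedom.
    """
    if b + c == 0:
        return 0.0, 1.0  # the models never disagree, so there is no evidence
    # Assumed variant: continuity correction, clamped at zero.
    statistic = max(abs(b - c) - 1, 0) ** 2 / (b + c)
    # For a chi-square variable with 1 degree of freedom,
    # P(X > x) = erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(statistic / 2))
    return statistic, p_value
```

With hypothetical disagreement counts such as `mcnemar_test(25, 10)`, the p-value falls between 0.01 and 0.05, which is the kind of result that would be reported as significant at the 5% level but not at the 1% level.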

A Study on Perceived Quality affecting the Service Personal Value in the On-off line Channel - Focusing on the moderate effect of the need for cognition - (온.오프라인 채널에서 지각된 품질이 서비스의 개인가치에 미치는 영향에 관한 연구 -인지욕구의 조정효과를 중심으로-)

  • Sung, Hyung-Suk
    • Journal of Distribution Research / v.15 no.3 / pp.111-137 / 2010
  • The basic purpose of this study is to investigate perceived quality and service personal value, which affect the long-term relationship between service buyers and suppliers. This research presents a model of perceived quality affecting service personal value, with the moderating effect of need for cognition (NFC), in online and offline channels, and proposes the research model based on prior studies of the relationships among components of service. Data were gathered from respondents who visited the education service market and were analyzed with AMOS 7.0. We integrate the literature on services marketing with research on personal values and perceived quality. The SERPVAL scale presented here allows for the creation of a common ground for assessing service personal values, giving a clear understanding of the key value dimensions behind service choice and usage. It will lead to a focus of future research in services marketing, extending knowledge in the field and stimulating further empirical research on service personal values. At the managerial level, the SERPVAL scale should allow practitioners to evaluate and improve the value of a service and, consequently, to define strategies and actions to address services for customers based on their fundamental personal values. Through qualitative and empirical research, we find that the service quality construct conforms to the structure of a second-order factor model that ties service quality perceptions to distinct and actionable dimensions: outcome, interaction, and environmental quality. In turn, each has two subdimensions that define the basis of service quality perceptions. The authors further suggest that for each of these subdimensions to contribute to improved service quality perceptions, the quality received by consumers must be perceived to be reliable, responsive, and empathetic. 
Although service personal value may be found in research that explores individual values and their consequences for consumer behavior, there is no established operationalization of a SERPVAL scale. The absence of an established scale, duly adapted in order to understand and analyze personal values behind services usage, exposes the need for a measurement scale with such a purpose. This need has to be rooted, however, in a conceptualization of the construct being scaled. Service personal values can be defined as a customer's overall assessment of the use of a service based on the perception of what is achieved in terms of his own personal values. As consumer behaviors serve to show an individual's values, the use of a service can also be a way to fulfill and demonstrate consumers' personal values. In this sense, a service can provide more to the customer than its concrete and abstract attributes at both the attribute and the quality levels, and more than its functional consequences at the value level. Both the values and services literatures agree that personal value is the highest-level concept, followed by instrumental values, attitudes, and finally product attributes. Purchasing behaviors are agreed to be the end result of these concepts' interaction, with personal values taking a major role in the final decision process. From both consumers' and practitioners' perspectives, values are extremely relevant, as they are desirable goals that serve as guiding principles in people's lives. While building on previous research, we propose to assess service personal values through three broad groups of individual dimensions: at the self-oriented level, we use (1) service value to peaceful life (SVPL), and at the social-oriented level, we use (2) service value to social recognition (SVSR) and (3) service value to social integration (SVSI). Service value to peaceful life is our first dimension. 
This dimension emerged as a combination of values coming from the RVS scale, a scale built specifically to assess general individual values. If a service promotes a pleasurable life, brings or improves tranquility, safety and harmony, then its user recognizes the value of this service. Generally, this service can improve the user's pleasure of life, since it protects or defends the consumer from threats to life or pressures on it. While building upon both the LOV scale, a scale built specifically to assess consumer values, and the RVS scale for individual values, we develop the other two dimensions: SVSR and SVSI. The roles of social recognition and social integration to improve service personal value have been seriously neglected. Social recognition derives its outcome utility from its predictive utility. When applying this underlying belief to our second dimension, SVSR, we assume that people use a service while taking into consideration the content of what is delivered. Individuals consider whether the service aids in gaining respect from others, social recognition and status, as well as whether it allows achieving a more fulfilled and stimulating life, which might then be revealed to others. People also tend to engage in behavior that receives social recognition and to avoid behavior that leads to social disapproval, and this contributes to an individual's social integration. This leads us to the third dimension, SVSI, which is based on the fact that if the consumer perceives that a service strengthens friendships, provides the possibility of becoming more integrated in the group, or promotes better relationships at the social, professional or family levels, then the service will contribute to social integration, and naturally the individual will recognize personal value in the service. Most of the research in business values deals with individual values. 
However, to our knowledge, no study has dealt with assessing overall personal values as well as their dimensions in a service context. Our final results show that the scales adapted from the Schwartz list were excluded. A possible explanation is that although Schwartz builds on Rokeach work in order to explore individual values, its dimensions might be especially focused on analyzing societal values. As we are looking for individual dimensions, this might explain why the values inspired by the Schwartz list were excluded from the model. The hierarchical structure of the final scale presented in this paper also presents theoretical implications. Although we cannot claim to definitively capture the dimensions of service personal values, we believe that we come close to capturing these overall evaluations because the second-order factor extracts the underlying commonality among dimensions. In addition to obtaining respondents' evaluations of the dimensions, the second-order factor model captures the common variance among these dimensions, reflecting the respondents' overall assessment of service personal values. Towards this fact, we expect that the service personal values conceptualization and measurement scale presented here contributes to both business values literature and the service marketing field, allowing for the delineation of strategies for adding value to services. This new scale also presents managerial implications. The SERPVAL dimensions give some guidance on how to better pursue a highly service-oriented business strategy. Indeed, the SERPVAL scale can be used for benchmarking purposes, as this scale can be used to identify whether or not a firms' marketing strategies are consistent with consumers' expectations. Managerial assessment of the personal values of a service might be extremely important because it allows managers to better understand what customers want or value. 
Thus, this scale allows us to identify what services are really valuable to the final consumer, providing knowledge for making choices regarding which services to include. Traditional approaches have focused their attention on service attributes (such as quality) and service consequences (such as service value), but personal values may be an important set of variables to be considered in understanding what attracts consumers to a certain service. By using the SERPVAL scale to assess the personal values associated with a service's usage, managers may better understand the reasons behind that usage, so that they may handle them more efficiently. While testing nomological validity, our empirical findings demonstrate that the three SERPVAL dimensions are positively and significantly associated with satisfaction. Additionally, while service value to social integration is related only with loyalty, service value to peaceful life is associated with both loyalty and repurchase intent. It is also interesting and surprising that service value to social recognition appears not to be significantly linked with loyalty and repurchase intent. A possible explanation is that no mobile service provider has yet emerged in the market as a luxury provider. All of the Portuguese providers are still trying to capture market share by means of low-end pricing. This research has implications for consumers as well. As more companies seek to build relationships with their customers, consumers are easily able to examine whether these relationships provide real value or not to their own lives. The selection of a strategy for a particular service depends on its customers' personal values. Being highly customer-oriented means having a strong commitment to customers, trying to create customer value and understanding customer needs. 
Enhancing service distinctiveness in order to provide a peaceful life, increase social recognition, and gain better social integration are all possible strategies that companies may pursue, but the one to pursue depends on the dominant personal values held by the service customers. Data were gathered from 284 respondents in the Korean discount store and online shopping mall market. This research proposed 3 hypotheses on 6 latent variables and tested them through structural equation modeling. Six alternative measurements were compared through statistical significance tests of the six paths of the research model and the overall fit of the structural equation model, and the result was successful. Perceived quality influences service personal value more positively when NFC is high than when NFC is low in the offline market. The results of the study indicate that service quality is properly modeled as an antecedent of service personal value. We consider the research and managerial implications of the study and its limitations. In sum, by knowing the dimensions a consumer takes into account when choosing a service, a better understanding of purchasing behaviors may be realized, guiding managers toward customers' expectations. By defining strategies and actions that address potential problems with service personal values, managers might ultimately influence their firm's performance. We expect to contribute to both the business values and service marketing literatures through the development of the service personal value construct. At a time when marketing researchers are challenged to provide research with practical implications, it is also believed that this framework may be used by managers to pursue service-oriented business strategies while taking into consideration what customers value.
