• Title/Summary/Keyword: Target Information

Search Result 6,217, Processing Time 0.042 seconds

Effects of Initiation and Perceived Similarity on the Evaluation of Online Communities (온라인 커뮤니티 속 가입절차 및 지각된 유사성에 따른 평가의 차이)

  • Yoo, Jihyun;Kang, Hyunmin;Han, Kwanghee
    • Science of Emotion and Sensibility
    • /
    • v.21 no.4
    • /
    • pp.25-36
    • /
    • 2018
  • Nowadays, it is hard to imagine one's life without smart phones or the internet. Furthermore, not only do people form groups offline, but also online. Based on the cognitive dissonance theory, there have been many studies about how an offline group's initiation affects attitudes toward the group. However, there has not been a study about how an online group's initiation can affect attitudes toward the group. Therefore, this study aims to find out how cognitive dissonance aroused by initiation affects the attitudes toward the online community, which represents groups that are formed online. In addition, this study examined how perceived similarity affects changes in attitude aroused by cognitive dissonance. Participants were assigned to a group in three ways as follows: without a registration process, with a simple registration process, and/or with a complex registration process. Perceived similarity was calculated by the difference between the current body mass index (BMI) and the target BMI of the participant. Attitudes toward the online group were measured by perceived source credibility, perceived information quality, satisfaction, information usefulness, and continuance intention. Contrary to the cognitive dissonance theory, the results showed that when applied to offline social groups, there were conflicting results. There were cases where there was no difference in the evaluation between initiation conditions. However, other cases showed that groups with the most complex registration process were found to have the worst evaluation. People were more favorable toward the group when the perceived similarity was larger. Interestingly, people who had higher perceived similarity had more positive attitudes toward the groups that had been assigned with a registration process compared to the group formed without a registration process. Conversely, people with lower perceived similarity had more positive attitudes toward the group when there was no initiation process. Online communities may use the results of this study to design more suitable registration processes for their communities.

Recirculation Prohibition of Fair Value through Other Comprehensive Income on Realization and Earnings Management (기타포괄이익측정 금융자산 평가손익의 재순환금지와 이익조정)

  • Gong, Kyung-Tae
    • Management & Information Systems Review
    • /
    • v.38 no.2
    • /
    • pp.67-81
    • /
    • 2019
  • In accordance with K-IFRS 1109, financial instruments are classified to amortized cost (AC), fair value through other comprehensive income (FVOCI) and fair value through profit or loss (FVPL). And disposal gains are prohibited to be recirculated for net income when FVOCI financial instruments would be sold in the future, so-called recirculation prohibition. This research investigates whether accumulated other comprehensive income of available-for sale financial assets(AFS) under K-IFRS 1039, could affect reclassified amounts to the FVPL securities from the AFS securities. Also, this study investigates the effects of the reported income on the reclassified FVPL, because CEOs are likely to try earnings management when net income is predicted to be less than target or is low, comparing other firms. As a result of empirical analysis, first, I find that accumulated other comprehensive income of the AFS has a positive impact on the reclassified FVPL. Second, level of reporting income has no significant impact on the reclassified FVPL. Third, interaction effects are significantly positive on the firms which have more other comprehensive income and less level of reported income. Fourth, the effects of the bank and securities are more distinct than those of the manufactures. This study is the first research to investigate earnings management through AFS at the timing of the first adoption of K-IFRS 1109. Empirical results of this study provide evidence of earnings management on the reclassification of FVPL which gives meaningful implications to regulators, academic researchers and auditors.

A Proposal of Direction of Wind Ventilation Forest through Urban Condition Analysis - A Case Study of Pyeongtaek-si - (도시 여건 분석을 통한 바람길숲 조성방향 제시 - 평택시를 사례로 -)

  • SON, Jeong-Min;EUM, Jeong-Hee;SUNG, Uk-Je;BAEK, Jun-Beom;KIM, Ju-Eun;OH, Jeong-Hak
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.23 no.4
    • /
    • pp.101-119
    • /
    • 2020
  • Recently, as a plan to improve the particulate matter and thermal environment in the city, urban forests acting as wind ventilation corridor(wind ventilation forest) are promoted nationwide. This study analyzed the conditions for the creation of wind ventilation forest(vulnerable areas of the particulate matter and thermal environment, distribution of wind ventilation forest, characteristics of ventilation corridor) of in Pyeongtae-si, one of the target cities of wind ventilation forest project. Based on the results, the direction of developing on the wind ventilation forest in Pyeongtaek-si was suggested. As a result of deriving areas vulnerable to particulate matter and thermal environment, it was most vulnerable in urban areas in the eastern area of Pyeongtaek-si. Especially, emissions were high from industrial complexes and roads such as the Pyeongtaek-si thermal power plant, ports, and the national road no. 1. The wind ventilation forest in Pyeongtaek-si was distributed with small-scale windgenerating forests, wind-spreading forests, and wind-connection forests fragmented and disconnected. The characteristic of the overall wind ventilation corridor in Pyeongtaek-si is that the cold air generated from Mt.Mubong, etc., strongly flowed into Pyeongtaek-si and flowed in the northwest direction. Therefore, it is necessary to preserve and expand the wind-generating forests in Pyeongtaek-si in the long term, and it was important to create wind-spreading forests and wind-connection forests so that cold air could flow into the vulnerable area. In addition, in industrial complexes and roads where particulate matter is generated, planting techniques should be applied to prevent the spread of particulate matte to surrounding areas by creating wind-spreading forests considering the particulate matter blocking. This study can be used not only as the basis data for wind ventilation forest project in Pyeongtaek-si, but also as the basis data for urban forest creation and management.

Change Prediction of Future Forestland Area by Transition of Land Use Types in South Korea (로지스틱 회귀모형을 이용한 우리나라 산지면적의 공간변화 예측에 관한 연구)

  • KWAK, Doo-Ahn;PARK, So-Hee
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.24 no.4
    • /
    • pp.99-112
    • /
    • 2021
  • This study was performed to predict spatial change of future forestland area in South Korea at regional level for supporting forest-related plans established by local governments. In the study, land use was classified to three types which are forestland, agricultural land, and urban and other lands. A logistic regression model was developed using transitional interaction between each land use type and topographical factors, land use restriction factors, socioeconomic indices, and development infrastructures. In this model, change probability from a target land use type to other land use types was estimated using raster dataset(30m×30m) for each variable. With priority order map based on the probability of land use change, the total annual amount of land use change was allocated to the cells in the order of the highest transition potential for the spatial analysis. In results, it was found that slope degree and slope standard value by the local government were the main factors affecting the probability of change from forestland to urban and other land. Also, forestland was more likely to change to urban and other land in the conditions of a more gentle slope, lower slope criterion allowed to developed, and higher land price and population density. Consequently, it was predicted that forestland area would decrease by 2027 due to the change from forestland to urban and others, especially in metropolitan and major cities, and that forestland area would increase between 2028 and 2050 in the most local provincial cities except Seoul, Gyeonggi-do, and Jeju Island due to locality extinction with decline in population. Thus, local government is required to set an adequate forestland use criterion for balanced development, reasonable use and conservation, and to establish the regional forest strategies and policies considering the future land use change trends.

The Effect of Domain Specificity on the Performance of Domain-Specific Pre-Trained Language Models (도메인 특수성이 도메인 특화 사전학습 언어모델의 성능에 미치는 영향)

  • Han, Minah;Kim, Younha;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.4
    • /
    • pp.251-273
    • /
    • 2022
  • Recently, research on applying text analysis to deep learning has steadily continued. In particular, researches have been actively conducted to understand the meaning of words and perform tasks such as summarization and sentiment classification through a pre-trained language model that learns large datasets. However, existing pre-trained language models show limitations in that they do not understand specific domains well. Therefore, in recent years, the flow of research has shifted toward creating a language model specialized for a particular domain. Domain-specific pre-trained language models allow the model to understand the knowledge of a particular domain better and reveal performance improvements on various tasks in the field. However, domain-specific further pre-training is expensive to acquire corpus data of the target domain. Furthermore, many cases have reported that performance improvement after further pre-training is insignificant in some domains. As such, it is difficult to decide to develop a domain-specific pre-trained language model, while it is not clear whether the performance will be improved dramatically. In this paper, we present a way to proactively check the expected performance improvement by further pre-training in a domain before actually performing further pre-training. Specifically, after selecting three domains, we measured the increase in classification accuracy through further pre-training in each domain. We also developed and presented new indicators to estimate the specificity of the domain based on the normalized frequency of the keywords used in each domain. Finally, we conducted classification using a pre-trained language model and a domain-specific pre-trained language model of three domains. As a result, we confirmed that the higher the domain specificity index, the higher the performance improvement through further pre-training.

A Study on Risk Assessment Method for Earthquake-Induced Landslides (지진에 의한 산사태 위험도 평가방안에 관한 연구)

  • Seo, Junpyo;Eu, Song;Lee, Kihwan;Lee, Changwoo;Woo, Choongshik
    • Journal of the Society of Disaster Information
    • /
    • v.17 no.4
    • /
    • pp.694-709
    • /
    • 2021
  • Purpose: In this study, earthquake-induced landslide risk assessment was conducted to provide basic data for efficient and preemptive damage prevention by selecting the erosion control work before the earthquake and the prediction and restoration priorities of the damaged area after the earthquake. Method: The study analyzed the previous studies abroad to examine the evaluation methodology and to derive the evaluation factors, and examine the utilization of the landslide hazard map currently used in Korea. In addition, the earthquake-induced landslide hazard map was also established on a pilot basis based on the fault zone and epicenter of Pohang using seismic attenuation. Result: The earthquake-induced landslide risk assessment study showed that China ranked 44%, Italy 16%, the U.S. 15%, Japan 10%, and Taiwan 8%. As for the evaluation method, the statistical model was the most common at 59%, and the physical model was found at 23%. The factors frequently used in the statistical model were altitude, distance from the fault, gradient, slope aspect, country rock, and topographic curvature. Since Korea's landslide hazard map reflects topography, geology, and forest floor conditions, it has been shown that it is reasonable to evaluate the risk of earthquake-induced landslides using it. As a result of evaluating the risk of landslides based on the fault zone and epicenter in the Pohang area, the risk grade was changed to reflect the impact of the earthquake. Conclusion: It is effective to use the landslide hazard map to evaluate the risk of earthquake-induced landslides at the regional scale. The risk map based on the fault zone is effective when used in the selection of a target site for preventive erosion control work to prevent damage from earthquake-induced landslides. In addition, the risk map based on the epicenter can be used for efficient follow-up management in order to prioritize damage prevention measures, such as to investigate the current status of landslide damage after an earthquake, or to restore the damaged area.

The Effect of Price Promotional Information about Brand on Consumer's Quality Perception: Conditioning on Pretrial Brand (품패개격촉소신식대소비자질량인지적영향(品牌价格促销信息对消费者质量认知的影响))

  • Lee, Min-Hoon;Lim, Hang-Seop
    • Journal of Global Scholars of Marketing Science
    • /
    • v.19 no.3
    • /
    • pp.17-27
    • /
    • 2009
  • Price promotion typically reduces the price for a given quantity or increases the quantity available at the same price, thereby enhancing value and creating an economic incentive to purchase. It often is used to encourage product or service trial among nonusers of products or services. Thus, it is important to understand the effects of price promotions on quality perception made by consumer who do not have prior experience with the promoted brand. However, if consumers associate a price promotion itself with inferior brand quality, the promotion may not achieve the sales increase the economic incentives otherwise might have produced. More specifically, low qualitative perception through price promotion will undercut the economic and psychological incentives and reduce the likelihood of purchase. Thus, it is important for marketers to understand how price promotional informations about a brand have impact on consumer's unfavorable quality perception of the brand. Previous literatures on the effects of price promotions on quality perception reveal inconsistent explanations. Some focused on the unfavorable effect of price promotion on consumer's perception. But others showed that price promotions didn't raise unfavorable perception on the brand. Prior researches found these inconsistent results related to the timing of the price promotion's exposure and quality evaluation relative to trial. And, whether the consumer has been experienced with the product promotions in the past or not may moderate the effects. A few studies considered differences among product categories as fundamental factors. The purpose of this research is to investigate the effect of price promotional informations on consumer's unfavorable quality perception under the different conditions. The author controlled the timing of the promotional exposure and varied past promotional patterns and information presenting patterns. Unlike previous researches, the author examined the effects of price promotions setting limit to pretrial situation by controlling potentially moderating effects of prior personal experience with the brand. This manipulations enable to resolve possible controversies in relation to this issue. And this manipulation is meaningful for the work sector. Price promotion is not only used to target existing consumers but also to encourage product or service trial among nonusers of products or services. Thus, it is important for marketers to understand how price promotional informations about a brand have impact on consumer's unfavorable quality perception of the brand. If consumers associate a price promotion itself with inferior quality about unused brand, the promotion may not achieve the sales increase the economic incentives otherwise might have produced. In addition, if the price promotion ends, the consumer that have purchased that certain brand will likely to display sharply decreased repurchasing behavior. Through a literature review, hypothesis 1 was set as follows to investigate the adjustive effect of past price promotion on quality perception made by consumers; The influence that price promotion of unused brand have on quality perception made by consumers will be adjusted by past price promotion activity of the brand. In other words, a price promotion of an unused brand that have not done a price promotion in the past will have a unfavorable effect on quality perception made by consumer. Hypothesis 2-1 was set as follows : When an unused brand undertakes price promotion for the first time, the information presenting pattern of price promotion will have an effect on the consumer's attribution for the cause of the price promotion. Hypothesis 2-2 was set as follows : The more consumer dispositionally attribute the cause of price promotion, the more unfavorable the quality perception made by consumer will be. Through test 1, the subjects were given a brief explanation of the product and the brand before they were provided with a $2{\times}2$ factorial design that has 4 patterns of price promotion (presence or absence of past price promotion * presence or absence of current price promotion) and the explanation describing the price promotion pattern of each cell. Then the perceived quality of imaginary brand WAVEX was evaluated in the scale of 7. The reason tennis racket was chosen is because the selected product group must have had almost no past price promotions to eliminate the influence of average frequency of promotion on the value of price promotional information as Raghubir and Corfman (1999) pointed out. Test 2 was also carried out on students of the same management faculty of test 1 with tennis racket as the product group. As with test 1, subjects with average familiarity for the product group and low familiarity for the brand was selected. Each subjects were assigned to one of the two cells representing two different information presenting patterns of price promotion of WAVEX (case where the reason behind price promotion was provided/case where the reason behind price promotion was not provided). Subjects looked at each promotional information before evaluating the perceived quality of the brand WAVEX in the scale of 7. The effect of price promotion for unfamiliar pretrial brand on consumer's perceived quality was proved to be moderated with the presence or absence of past price promotion. The consistency with past promotional behavior is important variable that makes unfavorable effect on brand evaluations get worse. If the price promotion for the brand has never been carried out before, price promotion activity may have more unfavorable effects on consumer's quality perception. Second, when the price promotion of unfamiliar pretrial brand was executed for the first time, presenting method of informations has impact on consumer's attribution for the cause of firm's promotion. And the unfavorable effect of quality perception is higher when the consumer does dispositional attribution comparing with situational attribution. Unlike the previous studies where the main focus was the absence or presence of favorable or unfavorable motivation from situational/dispositional attribution, the focus of this study was exaus ing the fact that a situational attribution can be inferred even if the consumer employs a dispositional attribution on the price promotional behavior, if the company provides a persuasive reason. Such approach, in academic perspectih sis a large significance in that it explained the anchoring and adjng ch approcedures by applying it to a non-mathematical problem unlike the previous studies where it wis ionaly explained by applying it to a mathematical problem. In other wordn, there is a highrspedency tmatispositionally attribute other's behaviors according to the fuedach aal attribution errors and when this is applied to the situation of price promotions, we can infer that consumers are likely tmatispositionally attribute the company's price promotion behaviors. Ha ever, even ueder these circumstances, the company can adjng the consumer's anchoring tmareduce the po wibiliute thdispositional attribution. Furthermore, unlike majority of previous researches on short/long-term effects of price promotion that only considered the effect of price promotions on consumer's purchasing behaviors, this research measured the effect on perceived quality, one of man elements that affects the purchasing behavior of consumers. These results carry useful implications for the work sector. A guideline of effectively providing promotional informations for a new brand can be suggested through the outcomes of this research. If the brand is to avoid false implications such as inferior quality while implementing a price promotion strategy, it must provide a clear and acceptable reasons behind the promotion. Especially it is more important for the company with no past price promotion to provide a clear reason. An inconsistent behavior can be the cause of consumer's distrust and anxiety. This is also one of the most important factor of risk of endless price wars. Price promotions without prior notice can buy doubt from consumers not market share.

  • PDF

Clickstream Big Data Mining for Demographics based Digital Marketing (인구통계특성 기반 디지털 마케팅을 위한 클릭스트림 빅데이터 마이닝)

  • Park, Jiae;Cho, Yoonho
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.3
    • /
    • pp.143-163
    • /
    • 2016
  • The demographics of Internet users are the most basic and important sources for target marketing or personalized advertisements on the digital marketing channels which include email, mobile, and social media. However, it gradually has become difficult to collect the demographics of Internet users because their activities are anonymous in many cases. Although the marketing department is able to get the demographics using online or offline surveys, these approaches are very expensive, long processes, and likely to include false statements. Clickstream data is the recording an Internet user leaves behind while visiting websites. As the user clicks anywhere in the webpage, the activity is logged in semi-structured website log files. Such data allows us to see what pages users visited, how long they stayed there, how often they visited, when they usually visited, which site they prefer, what keywords they used to find the site, whether they purchased any, and so forth. For such a reason, some researchers tried to guess the demographics of Internet users by using their clickstream data. They derived various independent variables likely to be correlated to the demographics. The variables include search keyword, frequency and intensity for time, day and month, variety of websites visited, text information for web pages visited, etc. The demographic attributes to predict are also diverse according to the paper, and cover gender, age, job, location, income, education, marital status, presence of children. A variety of data mining methods, such as LSA, SVM, decision tree, neural network, logistic regression, and k-nearest neighbors, were used for prediction model building. However, this research has not yet identified which data mining method is appropriate to predict each demographic variable. Moreover, it is required to review independent variables studied so far and combine them as needed, and evaluate them for building the best prediction model. The objective of this study is to choose clickstream attributes mostly likely to be correlated to the demographics from the results of previous research, and then to identify which data mining method is fitting to predict each demographic attribute. Among the demographic attributes, this paper focus on predicting gender, age, marital status, residence, and job. And from the results of previous research, 64 clickstream attributes are applied to predict the demographic attributes. The overall process of predictive model building is compose of 4 steps. In the first step, we create user profiles which include 64 clickstream attributes and 5 demographic attributes. The second step performs the dimension reduction of clickstream variables to solve the curse of dimensionality and overfitting problem. We utilize three approaches which are based on decision tree, PCA, and cluster analysis. We build alternative predictive models for each demographic variable in the third step. SVM, neural network, and logistic regression are used for modeling. The last step evaluates the alternative models in view of model accuracy and selects the best model. For the experiments, we used clickstream data which represents 5 demographics and 16,962,705 online activities for 5,000 Internet users. IBM SPSS Modeler 17.0 was used for our prediction process, and the 5-fold cross validation was conducted to enhance the reliability of our experiments. As the experimental results, we can verify that there are a specific data mining method well-suited for each demographic variable. For example, age prediction is best performed when using the decision tree based dimension reduction and neural network whereas the prediction of gender and marital status is the most accurate by applying SVM without dimension reduction. We conclude that the online behaviors of the Internet users, captured from the clickstream data analysis, could be well used to predict their demographics, thereby being utilized to the digital marketing.

A Study on the Critical Success Factors of Social Commerce through the Analysis of the Perception Gap between the Service Providers and the Users: Focused on Ticket Monster in Korea (서비스제공자와 사용자의 인식차이 분석을 통한 소셜커머스 핵심성공요인에 대한 연구: 한국의 티켓몬스터 중심으로)

  • Kim, Il Jung;Lee, Dae Chul;Lim, Gyoo Gun
    • Asia pacific journal of information systems
    • /
    • v.24 no.2
    • /
    • pp.211-232
    • /
    • 2014
  • Recently, there is a growing interest toward social commerce using SNS(Social Networking Service), and the size of its market is also expanding due to popularization of smart phones, tablet PCs and other smart devices. Accordingly, various studies have been attempted but it is shown that most of the previous studies have been conducted from perspectives of the users. The purpose of this study is to derive user-centered CSF(Critical Success Factor) of social commerce from the previous studies and analyze the CSF perception gap between social commerce service providers and users. The CSF perception gap between two groups shows that there is a difference between ideal images the service providers hope for and the actual image the service users have on social commerce companies. This study provides effective improvement directions for social commerce companies by presenting current business problems and its solution plans. For this, This study selected Korea's representative social commerce business Ticket Monster, which is dominant in sales and staff size together with its excellent funding power through M&A by stock exchange with the US social commerce business Living Social with Amazon.com as a shareholder in August, 2011, as a target group of social commerce service provider. we have gathered questionnaires from both service providers and the users from October 22, 2012 until October 31, 2012 to conduct an empirical analysis. We surveyed 160 service providers of Ticket Monster We also surveyed 160 social commerce users who have experienced in using Ticket Monster service. Out of 320 surveys, 20 questionaries which were unfit or undependable were discarded. Consequently the remaining 300(service provider 150, user 150)were used for this empirical study. The statistics were analyzed using SPSS 12.0. Implications of the empirical analysis result of this study are as follows: First of all, There are order differences in the importance of social commerce CSF between two groups. While service providers regard Price Economic as the most important CSF influencing purchasing intention, the users regard 'Trust' as the most important CSF influencing purchasing intention. This means that the service providers have to utilize the unique strong point of social commerce which make the customers be trusted rathe than just focusing on selling product at a discounted price. It means that service Providers need to enhance effective communication skills by using SNS and play a vital role as a trusted adviser who provides curation services and explains the value of products through information filtering. Also, they need to pay attention to preventing consumer damages from deceptive and false advertising. service providers have to create the detailed reward system in case of a consumer damages caused by above problems. It can make strong ties with customers. Second, both service providers and users tend to consider that social commerce CSF influencing purchasing intention are Price Economic, Utility, Trust, and Word of Mouth Effect. Accordingly, it can be learned that users are expecting the benefit from the aspect of prices and economy when using social commerce, and service providers should be able to suggest the individualized discount benefit through diverse methods using social network service. Looking into it from the aspect of usefulness, service providers are required to get users to be cognizant of time-saving, efficiency, and convenience when they are using social commerce. Therefore, it is necessary to increase the usefulness of social commerce through the introduction of a new management strategy, such as intensification of search engine of the Website, facilitation in payment through shopping basket, and package distribution. Trust, as mentioned before, is the most important variable in consumers' mind, so it should definitely be managed for sustainable management. If the trust in social commerce should fall due to consumers' damage case due to false and puffery advertising forgeries, it could have a negative influence on the image of the social commerce industry in general. Instead of advertising with famous celebrities and using a bombastic amount of money on marketing expenses, the social commerce industry should be able to use the word of mouth effect between users by making use of the social network service, the major marketing method of initial social commerce. The word of mouth effect occurring from consumers' spontaneous self-marketer's duty performance can bring not only reduction effect in advertising cost to a service provider but it can also prepare the basis of discounted price suggestion to consumers; in this context, the word of mouth effect should be managed as the CSF of social commerce. Third, Trade safety was not derived as one of the CSF. Recently, with e-commerce like social commerce and Internet shopping increasing in a variety of methods, the importance of trade safety on the Internet also increases, but in this study result, trade safety wasn't evaluated as CSF of social commerce by both groups. This study judges that it's because both service provider groups and user group are perceiving that there is a reliable PG(Payment Gateway) which acts for e-payment of Internet transaction. Accordingly, it is understood that both two groups feel that social commerce can have a corporate identity by website and differentiation in products and services in sales, but don't feel a big difference by business in case of e-payment system. In other words, trade safety should be perceived as natural, basic universal service. Fourth, it's necessary that service providers should intensify the communication with users by making use of social network service which is the major marketing method of social commerce and should be able to use the word of mouth effect between users. The word of mouth effect occurring from consumers' spontaneous self- marketer's duty performance can bring not only reduction effect in advertising cost to a service provider but it can also prepare the basis of discounted price suggestion to consumers. in this context, it is judged that the word of mouth effect should be managed as CSF of social commerce. In this paper, the characteristics of social commerce are limited as five independent variables, however, if an additional study is proceeded with more various independent variables, more in-depth study results will be derived. In addition, this research targets social commerce service providers and the users, however, in the consideration of the fact that social commerce is a two-sided market, drawing CSF through an analysis of perception gap between social commerce service providers and its advertisement clients would be worth to be dealt with in a follow-up study.

A Hybrid SVM Classifier for Imbalanced Data Sets (불균형 데이터 집합의 분류를 위한 하이브리드 SVM 모델)

  • Lee, Jae Sik;Kwon, Jong Gu
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.2
    • /
    • pp.125-140
    • /
    • 2013
  • We call a data set in which the number of records belonging to a certain class far outnumbers the number of records belonging to the other class, 'imbalanced data set'. Most of the classification techniques perform poorly on imbalanced data sets. When we evaluate the performance of a certain classification technique, we need to measure not only 'accuracy' but also 'sensitivity' and 'specificity'. In a customer churn prediction problem, 'retention' records account for the majority class, and 'churn' records account for the minority class. Sensitivity measures the proportion of actual retentions which are correctly identified as such. Specificity measures the proportion of churns which are correctly identified as such. The poor performance of the classification techniques on imbalanced data sets is due to the low value of specificity. Many previous researches on imbalanced data sets employed 'oversampling' technique where members of the minority class are sampled more than those of the majority class in order to make a relatively balanced data set. When a classification model is constructed using this oversampled balanced data set, specificity can be improved but sensitivity will be decreased. In this research, we developed a hybrid model of support vector machine (SVM), artificial neural network (ANN) and decision tree, that improves specificity while maintaining sensitivity. We named this hybrid model 'hybrid SVM model.' The process of construction and prediction of our hybrid SVM model is as follows. By oversampling from the original imbalanced data set, a balanced data set is prepared. SVM_I model and ANN_I model are constructed using the imbalanced data set, and SVM_B model is constructed using the balanced data set. SVM_I model is superior in sensitivity and SVM_B model is superior in specificity. For a record on which both SVM_I model and SVM_B model make the same prediction, that prediction becomes the final solution. If they make different prediction, the final solution is determined by the discrimination rules obtained by ANN and decision tree. For a record on which SVM_I model and SVM_B model make different predictions, a decision tree model is constructed using ANN_I output value as input and actual retention or churn as target. We obtained the following two discrimination rules: 'IF ANN_I output value <0.285, THEN Final Solution = Retention' and 'IF ANN_I output value ${\geq}0.285$, THEN Final Solution = Churn.' The threshold 0.285 is the value optimized for the data used in this research. The result we present in this research is the structure or framework of our hybrid SVM model, not a specific threshold value such as 0.285. Therefore, the threshold value in the above discrimination rules can be changed to any value depending on the data. In order to evaluate the performance of our hybrid SVM model, we used the 'churn data set' in UCI Machine Learning Repository, that consists of 85% retention customers and 15% churn customers. Accuracy of the hybrid SVM model is 91.08% that is better than that of SVM_I model or SVM_B model. The points worth noticing here are its sensitivity, 95.02%, and specificity, 69.24%. The sensitivity of SVM_I model is 94.65%, and the specificity of SVM_B model is 67.00%. Therefore the hybrid SVM model developed in this research improves the specificity of SVM_B model while maintaining the sensitivity of SVM_I model.