• Title/Summary/Keyword: 정보매개자

Search Result 363, Processing Time 0.026 seconds

Extension Method of Association Rules Using Social Network Analysis (사회연결망 분석을 활용한 연관규칙 확장기법)

  • Lee, Dongwon
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.4
    • /
    • pp.111-126
    • /
    • 2017
  • Recommender systems based on association rule mining significantly contribute to seller's sales by reducing consumers' time to search for products that they want. Recommendations based on the frequency of transactions such as orders can effectively screen out the products that are statistically marketable among multiple products. A product with a high possibility of sales, however, can be omitted from the recommendation if it records insufficient number of transactions at the beginning of the sale. Products missing from the associated recommendations may lose the chance of exposure to consumers, which leads to a decline in the number of transactions. In turn, diminished transactions may create a vicious circle of lost opportunity to be recommended. Thus, initial sales are likely to remain stagnant for a certain period of time. Products that are susceptible to fashion or seasonality, such as clothing, may be greatly affected. This study was aimed at expanding association rules to include into the list of recommendations those products whose initial trading frequency of transactions is low despite the possibility of high sales. The particular purpose is to predict the strength of the direct connection of two unconnected items through the properties of the paths located between them. An association between two items revealed in transactions can be interpreted as the interaction between them, which can be expressed as a link in a social network whose nodes are items. The first step calculates the centralities of the nodes in the middle of the paths that indirectly connect the two nodes without direct connection. The next step identifies the number of the paths and the shortest among them. These extracts are used as independent variables in the regression analysis to predict future connection strength between the nodes. The strength of the connection between the two nodes of the model, which is defined by the number of nodes between the two nodes, is measured after a certain period of time. The regression analysis results confirm that the number of paths between the two products, the distance of the shortest path, and the number of neighboring items connected to the products are significantly related to their potential strength. This study used actual order transaction data collected for three months from February to April in 2016 from an online commerce company. To reduce the complexity of analytics as the scale of the network grows, the analysis was performed only on miscellaneous goods. Two consecutively purchased items were chosen from each customer's transactions to obtain a pair of antecedent and consequent, which secures a link needed for constituting a social network. The direction of the link was determined in the order in which the goods were purchased. Except for the last ten days of the data collection period, the social network of associated items was built for the extraction of independent variables. The model predicts the number of links to be connected in the next ten days from the explanatory variables. Of the 5,711 previously unconnected links, 611 were newly connected for the last ten days. Through experiments, the proposed model demonstrated excellent predictions. Of the 571 links that the proposed model predicts, 269 were confirmed to have been connected. This is 4.4 times more than the average of 61, which can be found without any prediction model. This study is expected to be useful regarding industries whose new products launch quickly with short life cycles, since their exposure time is critical. Also, it can be used to detect diseases that are rarely found in the early stages of medical treatment because of the low incidence of outbreaks. Since the complexity of the social networking analysis is sensitive to the number of nodes and links that make up the network, this study was conducted in a particular category of miscellaneous goods. Future research should consider that this condition may limit the opportunity to detect unexpected associations between products belonging to different categories of classification.

Efficient Topic Modeling by Mapping Global and Local Topics (전역 토픽의 지역 매핑을 통한 효율적 토픽 모델링 방안)

  • Choi, Hochang;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.3
    • /
    • pp.69-94
    • /
    • 2017
  • Recently, increase of demand for big data analysis has been driving the vigorous development of related technologies and tools. In addition, development of IT and increased penetration rate of smart devices are producing a large amount of data. According to this phenomenon, data analysis technology is rapidly becoming popular. Also, attempts to acquire insights through data analysis have been continuously increasing. It means that the big data analysis will be more important in various industries for the foreseeable future. Big data analysis is generally performed by a small number of experts and delivered to each demander of analysis. However, increase of interest about big data analysis arouses activation of computer programming education and development of many programs for data analysis. Accordingly, the entry barriers of big data analysis are gradually lowering and data analysis technology being spread out. As the result, big data analysis is expected to be performed by demanders of analysis themselves. Along with this, interest about various unstructured data is continually increasing. Especially, a lot of attention is focused on using text data. Emergence of new platforms and techniques using the web bring about mass production of text data and active attempt to analyze text data. Furthermore, result of text analysis has been utilized in various fields. Text mining is a concept that embraces various theories and techniques for text analysis. Many text mining techniques are utilized in this field for various research purposes, topic modeling is one of the most widely used and studied. Topic modeling is a technique that extracts the major issues from a lot of documents, identifies the documents that correspond to each issue and provides identified documents as a cluster. It is evaluated as a very useful technique in that reflect the semantic elements of the document. Traditional topic modeling is based on the distribution of key terms across the entire document. Thus, it is essential to analyze the entire document at once to identify topic of each document. This condition causes a long time in analysis process when topic modeling is applied to a lot of documents. In addition, it has a scalability problem that is an exponential increase in the processing time with the increase of analysis objects. This problem is particularly noticeable when the documents are distributed across multiple systems or regions. To overcome these problems, divide and conquer approach can be applied to topic modeling. It means dividing a large number of documents into sub-units and deriving topics through repetition of topic modeling to each unit. This method can be used for topic modeling on a large number of documents with limited system resources, and can improve processing speed of topic modeling. It also can significantly reduce analysis time and cost through ability to analyze documents in each location or place without combining analysis object documents. However, despite many advantages, this method has two major problems. First, the relationship between local topics derived from each unit and global topics derived from entire document is unclear. It means that in each document, local topics can be identified, but global topics cannot be identified. Second, a method for measuring the accuracy of the proposed methodology should be established. That is to say, assuming that global topic is ideal answer, the difference in a local topic on a global topic needs to be measured. By those difficulties, the study in this method is not performed sufficiently, compare with other studies dealing with topic modeling. In this paper, we propose a topic modeling approach to solve the above two problems. First of all, we divide the entire document cluster(Global set) into sub-clusters(Local set), and generate the reduced entire document cluster(RGS, Reduced global set) that consist of delegated documents extracted from each local set. We try to solve the first problem by mapping RGS topics and local topics. Along with this, we verify the accuracy of the proposed methodology by detecting documents, whether to be discerned as the same topic at result of global and local set. Using 24,000 news articles, we conduct experiments to evaluate practical applicability of the proposed methodology. In addition, through additional experiment, we confirmed that the proposed methodology can provide similar results to the entire topic modeling. We also proposed a reasonable method for comparing the result of both methods.

The Influence of Art-provoked Affect on Product and Product Attributes Evaluation (명화(名畵)에서 유발된 감정이 차용된 제품과 제품속성 평가에 미치는 영향)

  • Kim, Hanku;Jung, Bohee;Chu, Wujin
    • Asia Marketing Journal
    • /
    • v.13 no.2
    • /
    • pp.99-130
    • /
    • 2011
  • In recent years, a new way of differentiating product design has emerged -better known as 'masterpiece marketing,' this is a strategy where famous art pieces are borrowed on to product designs. Because the recent trends of well-being and LOHAS have encouraged the consumers' desires to enjoy culture and live a more opulent lifestyle, famous and notable paintings have grown to be more of "approachable masterpieces" to the public. As a strategy intended to develop a new consumerism, while still prioritizing customers' values and their satisfaction, companies have been drawn to this new type of marketing. The current consumption society has converted renowned art pieces from simply works of 'high culture' to a further way of marketing, aimed to differentiate products and dominate the market. Though many products have had masterpieces applied to their designs and have been noticed for their marketability, there has been less systematic research done on the scientific background behind this marketing approach. This research focused on the art pieces' fundamental nature of inducing emotions in the viewer, and hypothesized about how the evaluation of a product may be influenced by the affect provoked by the art piece used. To be more specific, if art pieces with different levels of pleasure and arousal -the two axis of emotion suggested by existing research on emotion -were used on each product, the goal was to see how the different levels influenced the consumer's assessment of the products, focusing on product's type as well as the evaluation of their attributes. First, a pretest was done to verify the relationship between the emotion provoked by the art piece and the consumer's preference. There were two types of surveys, each with five drawings from the ten that were assumed to differ in levels of the two axis of emotion. The survey was composed of questions asking for positive emotion, negative emotion, level of arousal, and preference. The correlation between the measurements of positive and negative emotions was -0.792, so an integrated entry was used in the analysis by subtracting the measurement of negative emotions from that of positive emotions. The first hypothesis that paintings that provoke positive emotions will be more preferred than paintings that bring out negative emotions was supported; and through this research, paintings that were to be used for the products were selected. The second pretest was conducted to settle on an item that would be used in the research. Items meant to measure utilitarian and hedonic attributes of milk and chocolate, the two products to be used in the research, were extracted. Because milk is a utilitarian product with strong practical attributes while chocolate is a hedonic product with strong hedonic attributes, these two were selected to be used in this research. The first study was executed to see if there is a difference in attitude about products that have different painting on their designs, which either induces positive or negative emotions. It was also to verify whether this difference in attitude was mediated by the viewer's preference for the art piece. This study showed that when positive emotion inducing painting was used, the product was better evaluated compared to the product with a painting that provokes a negative emotion, thus supporting the second hypothesis. It was also supported that the effect of affect on product evaluation was mediated by preference for the art piece. The second study was done to see the influence of the level of arousal on the evaluation of the product's attributes. Art pieces that differ in the level of arousal were selected through the pretest, and later it verified the hypothesis that the level of arousal has an effect on the assessment of the attributes of the product. In the case of milk, a utilitarian product, the fourth hypothesis that a high-arousal painting will better evaluated for its hedonic attributes was supported, as well as the fifth, which hypothesized that a low-arousal painting will receive a higher assessment for its utilitarian attributes. However, for chocolate, a hedonic product, both fourth and fifth hypotheses were not supported. This study is significant for the following basis: first, it verified the importance of the emotion induced by the painting on the evaluation of the product's attributes, by applying a systematic and scientific method. Second, it expanded from the existing research on positive/negative emotions to confirm the additional influence of the state of arousal on product evaluation.

  • PDF