• Title/Summary/Keyword: Three step search


Clickstream Big Data Mining for Demographics based Digital Marketing (인구통계특성 기반 디지털 마케팅을 위한 클릭스트림 빅데이터 마이닝)

  • Park, Jiae;Cho, Yoonho
    • Journal of Intelligence and Information Systems / v.22 no.3 / pp.143-163 / 2016
  • The demographics of Internet users are the most basic and important sources for target marketing or personalized advertisements on the digital marketing channels which include email, mobile, and social media. However, it gradually has become difficult to collect the demographics of Internet users because their activities are anonymous in many cases. Although the marketing department is able to get the demographics using online or offline surveys, these approaches are very expensive, long processes, and likely to include false statements. Clickstream data is the recording an Internet user leaves behind while visiting websites. As the user clicks anywhere in the webpage, the activity is logged in semi-structured website log files. Such data allows us to see what pages users visited, how long they stayed there, how often they visited, when they usually visited, which site they prefer, what keywords they used to find the site, whether they purchased any, and so forth. For such a reason, some researchers tried to guess the demographics of Internet users by using their clickstream data. They derived various independent variables likely to be correlated to the demographics. The variables include search keyword, frequency and intensity for time, day and month, variety of websites visited, text information for web pages visited, etc. The demographic attributes to predict are also diverse according to the paper, and cover gender, age, job, location, income, education, marital status, presence of children. A variety of data mining methods, such as LSA, SVM, decision tree, neural network, logistic regression, and k-nearest neighbors, were used for prediction model building. However, this research has not yet identified which data mining method is appropriate to predict each demographic variable. Moreover, it is required to review independent variables studied so far and combine them as needed, and evaluate them for building the best prediction model. 
The objective of this study is to choose the clickstream attributes most likely to be correlated with the demographics from the results of previous research, and then to identify which data mining method is best suited to predicting each demographic attribute. Among the demographic attributes, this paper focuses on predicting gender, age, marital status, residence, and job. From the results of previous research, 64 clickstream attributes are applied to predict the demographic attributes. The overall process of predictive model building is composed of four steps. In the first step, we create user profiles which include 64 clickstream attributes and 5 demographic attributes. The second step performs dimension reduction of the clickstream variables to address the curse of dimensionality and the overfitting problem. We utilize three approaches, which are based on decision tree, PCA, and cluster analysis. In the third step, we build alternative predictive models for each demographic variable. SVM, neural network, and logistic regression are used for modeling. The last step evaluates the alternative models in terms of model accuracy and selects the best model. For the experiments, we used clickstream data which represents 5 demographics and 16,962,705 online activities for 5,000 Internet users. IBM SPSS Modeler 17.0 was used for our prediction process, and 5-fold cross validation was conducted to enhance the reliability of our experiments. The experimental results verify that there is a specific data mining method well suited to each demographic variable. For example, age prediction performs best when using decision tree based dimension reduction and a neural network, whereas the prediction of gender and marital status is most accurate when applying SVM without dimension reduction. 
We conclude that the online behaviors of Internet users, captured through clickstream data analysis, can be used to predict their demographics and can thereby be utilized in digital marketing.
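
The final two steps described in this abstract (building alternative models and choosing the most accurate one under 5-fold cross validation) can be illustrated with a minimal sketch. It is not the authors' SPSS Modeler workflow: the two toy models, the synthetic data, and every function name below are assumptions introduced only for illustration.

```python
import random
from statistics import mean

def k_fold_indices(n_samples, k=5, seed=0):
    """Shuffle sample indices and split them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]

def cross_val_accuracy(fit, X, y, k=5, seed=0):
    """Mean accuracy over k folds.

    `fit(X_train, y_train)` must return a function mapping one sample to a label.
    """
    scores = []
    for held_out in k_fold_indices(len(X), k, seed):
        held = set(held_out)
        train = [i for i in range(len(X)) if i not in held]
        predict = fit([X[i] for i in train], [y[i] for i in train])
        hits = sum(predict(X[i]) == y[i] for i in held_out)
        scores.append(hits / len(held_out))
    return mean(scores)

# Toy stand-in models (hypothetical; the study compared SVM, neural network,
# and logistic regression).
def fit_majority(X_train, y_train):
    majority = max(set(y_train), key=y_train.count)
    return lambda x: majority

def fit_threshold(X_train, y_train):
    # Cut at the midpoint between the class means of the first feature.
    pos = [x[0] for x, label in zip(X_train, y_train) if label == 1]
    neg = [x[0] for x, label in zip(X_train, y_train) if label == 0]
    cut = (mean(pos) + mean(neg)) / 2
    return lambda x: 1 if x[0] > cut else 0

def select_best(candidates, X, y, k=5):
    """Return (name, score) of the candidate with the highest CV accuracy."""
    scores = {name: cross_val_accuracy(fit, X, y, k) for name, fit in candidates.items()}
    best = max(scores, key=scores.get)
    return best, scores[best]

# Synthetic data (assumption): the label is 1 when the single feature exceeds 0.5.
rng = random.Random(1)
X = [[rng.random()] for _ in range(200)]
y = [1 if x[0] > 0.5 else 0 for x in X]
best_name, best_score = select_best(
    {"majority": fit_majority, "threshold": fit_threshold}, X, y
)
```

In the setting of the abstract, `select_best` would be run once per demographic attribute, with one candidate for each combination of dimension reduction approach and classifier.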

A CF-based Health Functional Recommender System using Extended User Similarity Measure (확장된 사용자 유사도를 이용한 CF-기반 건강기능식품 추천 시스템)

  • Sein Hong;Euiju Jeong;Jaekyeong Kim
    • Journal of Intelligence and Information Systems / v.29 no.3 / pp.1-17 / 2023
  • With the recent rapid development of ICT (Information and Communication Technology) and the popularization of digital devices, the size of the online market continues to grow. As a result, we live in a flood of information, and customers face an information overload problem in which selecting products requires a lot of time and money. A personalized recommender system has therefore become an essential methodology for addressing such issues. Collaborative Filtering (CF) is the most widely used recommender system approach. Traditional recommender systems mainly utilize quantitative data such as rating values, which cannot fully reflect the user's preference and therefore result in poor recommendation accuracy. To solve this problem, studies that reflect qualitative data, such as review contents, are being actively conducted. In this study, text mining was used to quantify user review contents. General CF consists of the following three steps: user-item matrix generation, Top-N neighborhood group search, and Top-K recommendation list generation. We propose a recommendation algorithm that applies an extended similarity measure, which utilizes quantified review contents in addition to user rating values. After calculating review similarity by applying TF-IDF, Word2Vec, and Doc2Vec techniques to review content, the extended similarity is created by combining user rating similarity and quantified review contents. To verify this, we used user ratings and review data from the "Health and Personal Care" category of the e-commerce site Amazon. The proposed recommendation model using the extended similarity measure showed superior performance to the traditional recommendation model using only a similarity measure based on user rating values. In addition, among the various text mining techniques, the similarity obtained using the TF-IDF technique showed the best performance when used in the neighbor group search and recommendation list generation steps.
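
As a rough illustration of the extended similarity idea, the sketch below combines a rating-based cosine similarity with a TF-IDF review similarity and uses the result for the Top-N neighbor search. The abstract does not state how the two similarities are combined, so the weighted sum with `alpha`, the whitespace tokenizer, the smoothed IDF formula, and the sample users are all assumptions rather than the authors' method.

```python
import math
from collections import Counter

def cosine(u, v):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def tfidf_vectors(reviews):
    """Build TF-IDF vectors from a dict {user: review text}."""
    tokens = {user: text.lower().split() for user, text in reviews.items()}
    n_docs = len(tokens)
    doc_freq = Counter(term for words in tokens.values() for term in set(words))
    vectors = {}
    for user, words in tokens.items():
        counts = Counter(words)
        vectors[user] = {
            # term frequency times a smoothed inverse document frequency
            term: (count / len(words)) * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        }
    return vectors

def extended_similarity(ratings, vectors, a, b, alpha=0.5):
    """Weighted sum of rating similarity and review similarity (assumed form)."""
    rating_sim = cosine(ratings[a], ratings[b])
    review_sim = cosine(vectors[a], vectors[b])
    return alpha * rating_sim + (1 - alpha) * review_sim

def top_n_neighbors(ratings, reviews, target, n=1, alpha=0.5):
    """Users most similar to `target` under the extended similarity."""
    vectors = tfidf_vectors(reviews)
    scored = [
        (extended_similarity(ratings, vectors, target, other, alpha), other)
        for other in ratings if other != target
    ]
    return [user for _, user in sorted(scored, reverse=True)[:n]]

# Hypothetical users, items, and reviews.
ratings = {
    "u1": {"vitamin_c": 5, "omega3": 4},
    "u2": {"vitamin_c": 5, "omega3": 4},
    "u3": {"vitamin_c": 1, "probiotic": 5},
}
reviews = {
    "u1": "helps my energy every morning",
    "u2": "great energy boost every morning",
    "u3": "upset stomach after taking",
}
neighbors = top_n_neighbors(ratings, reviews, "u1", n=1)
```

Setting `alpha=1.0` reduces the measure to the rating-only similarity of traditional CF, which is the baseline the abstract compares against.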

A Qualitative Study on Facilitating Factors of User-Created Contents: Based on Theories of Folklore (사용자 제작 콘텐츠의 활성화 요인에 대한 정성적 연구: 구비문학 이론을 중심으로)

  • Jung, Seung-Ki;Lee, Ki-Ho;Lee, In-Seong;Kim, Jin-Woo
    • Asia pacific journal of information systems / v.19 no.2 / pp.43-72 / 2009
  • Recently, user-created content (UCC) has emerged as a popular medium of online participation among users. The Internet environment has been constantly evolving, attracting active participation and information sharing among common users. This tendency is a significant deviation from the earlier use of the Internet as a one-way information channel through which users passively received information or contents from contents providers. Thanks to UCC, online users can now more freely generate and exchange contents; therefore, identifying the critical factors that affect content-generating activities has increasingly become an important issue. This paper proposes a set of critical factors for stimulating contents generation and sharing activities by Internet users. These factors were derived from theories of folklore such as tales and songs. Based on some shared traits of folklore and UCC, we found four critical elements which should be heeded in constructing UCC: context of culture, context of situation, skill of generator, and response of audience. In addition, we selected three major UCC websites: a specialized contents portal, a general internet portal, and an official contents service site. They have different use environments, user interfaces, and service policies. To identify critical factors for generating, sharing and transferring UCC, we traced user activities, interactions and flows of content in the three UCC websites. Moreover, we conducted extensive interviews with users and operators as well as policy makers in each site. Based on qualitative and quantitative analyses of the data, this research identifies nine critical factors that facilitate contents generation and sharing activities among users. 
In the context of culture, we suggest voluntary community norms, proactive use of copyrights, strong user relationships, and a fair monetary reward system as critical elements in facilitating the process of contents generation and sharing activities. Norms which were established by users themselves regulate user behavior and influence content format. Strong relationships among users stimulate content generation activities by enhancing collaborative content generation. In particular, users generate contents through collaboration with others, based on their enhanced relationships and specialized skills. They send and receive contents by leaving messages on websites or blogs, or by using instant messenger or SMS. This is an interesting and important phenomenon, because the quality of contents can be constantly improved and revised, depending on the specialized abilities of those engaged in a particular content. In this process, the reward system is an essential driving factor. Yet, monetary reward should be considered only after some fair criterion is established. In terms of the context of situation, the quality of the content uploading system was proposed to have a strong influence on content generating activities. Generators' specialized skills and user involvement were also proposed as influential factors in content generation activities. In addition, the audience response, especially effective development of shared interests as well as feedback, was suggested to have a significant influence on content generation activities. Content generators usually reflect the shared interest of others. Shared interest is a distinct characteristic of UCC and is observed in all three websites, in which common interest is formed by the "threads" embedded with content. Through such threads of information and contents, users discuss and share ideas while continuously extending and updating shared contents in the process. 
Evidently, UCC is a new paradigm representing the next generation of the Internet. In order to fully utilize this innovative paradigm, we need to understand how users take advantage of this medium in generating contents, and what affects their content generation activities. Based on these findings, UCC service providers should design their websites as a common playground where users freely interact and share their common interests. As such, this paper takes an important first step toward gaining a better understanding of this new communication paradigm created by UCC.

On decrease program of Radioactive Wastewater and Sewages in High Dose Radioiodine Therapy Ward (고용량 방사성옥소 치료병실의 오.폐수 저감화를 위한 연구)

  • Ryu, Jae-Kwang;Jung, Woo-Young;Shin, Sang-Ki;Cho, Shee-Man
    • The Korean Journal of Nuclear Medicine Technology / v.12 no.1 / pp.19-26 / 2008
  • Purpose: In general, radioactive wastewater and sewage are discharged from an exclusive water-purifier tank only at concentrations below $8.1{\times}10^{-13}$ Ci/ml. Our hospital operates three exclusive water-purifier tanks for radioactive wastewater and sewage, each with a capacity of 60 tons. To meet the criterion, each tank needs a decay period of more than 125 days. Recently, however, we faced a serious situation in which the decay period decreased remarkably because the amount of wastewater increased rapidly after the therapy ward was enlarged. For that reason, this article describes a rational way to reduce radioactive wastewater and sewage. Materials and Methods: From January 2006 to October, 402 cases were analyzed. All patients were hospitalized for 3 days and 2 nights. We calculated the average amount of water used (including toilet, shower, and washstand water, among others) and each exclusive water-purifier tank's decay period, and searched for the factors that increased inflow to the water-purifier tanks by re-analyzing the radioisotope therapy procedure step by step. Results: We increased each exclusive water-purifier tank's decay period from 84 days to 130 days through the following improvements: (1) replacement of conventional toilets that wasted excessive water with water-saving toilets; (2) prevention of unnecessary showering and washing; (3) discontinuation of diuretics during hospitalization; (4) analysis of the relationship between water intake and residual dose in the body; (5) education about using outside toilets before administration; (6) raising each water-purifier tank's maximum level from 85% to 90%. Conclusion: The originality of our efforts lies in improvements to both software and hardware. 
On the software side, we changed therapy procedures and protocols; on the hardware side, we replaced toilets with water-saving models and changed each water-purifier tank's maximum level. Thus, even after a long lapse of time, problems such as a return to the former conditions are unlikely to occur. We also expect that our trials will become a new, reasonable model for similar situations.
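
To see why the length of the decay period matters, the following sketch applies the standard exponential decay law to the decay periods quoted above. It assumes the radioiodine is iodine-131 with a physical half-life of about 8.02 days and considers only physical decay in a closed tank; neither assumption is stated in the abstract, and the function names are illustrative.

```python
import math

I131_HALF_LIFE_DAYS = 8.02  # assumed nuclide: iodine-131

def remaining_fraction(days, half_life=I131_HALF_LIFE_DAYS):
    """Fraction of the initial activity left after `days` of decay."""
    return 0.5 ** (days / half_life)

def days_to_reach(initial, limit, half_life=I131_HALF_LIFE_DAYS):
    """Days of decay needed for a concentration `initial` to fall to `limit`."""
    return half_life * math.log2(initial / limit)

before = remaining_fraction(84)    # decay period before the improvements
after = remaining_fraction(130)    # decay period after the improvements
extra_reduction = before / after   # additional reduction gained by 46 more days
```

Under these assumptions, extending the decay period from 84 to 130 days lowers the remaining activity by roughly a further factor of 50, since 46 extra days correspond to almost six additional half-lives.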


The Effects of Use Patterns and Service Quality on Performance and Use Satisfaction on Library Information System (도서관의 이용패턴과 서비스품질이 정보화성과지각 및 만족에 미치는 영향)

  • Jung, Hyung-Shik;Yeoum, Seoung-Yeoub
    • Journal of Global Scholars of Marketing Science / v.18 no.4 / pp.217-244 / 2008
  • Consumers' overall satisfaction with a specific library is inferred to accrue primarily from their performance perception of, and use satisfaction with, the library information service system, as information technology is rapidly improving and more libraries are being equipped with advanced information technologies. However, prior research has been conducted only on general library service quality and visitors' satisfaction, leaving out the important aspects of visitors' library use and information performance perception. Thus, the objectives of this research are to examine the effect of library use patterns, such as general visits for book reading and more professional information search, coupled with service quality, on library users' performance perception of the information system, which in turn affects use satisfaction with the same information system. More specifically, this study examines whether library visitors perceive the information system performance differently according to their library use patterns: professional library users may have a less positive perception of the information system service due to their higher expectations, or a more positive perception due to their variety of information uses and positive judgment of the advanced information system. Next, three dimensions of service quality, consisting of interaction, outcome, and physical evidence quality in visitors' library use situations, are hypothesized to affect performance perception of the library information system. Thirdly, the performance perception of the library information system is hypothesized to influence system use satisfaction, while these two constructs are hypothesized to affect visitors' overall satisfaction. We develop the research model in accordance with the above theoretical reasoning. 
All variables used in this study (General Use Patterns, Professional Use Patterns, Interaction Quality, Outcome Quality, Physical Evidence Quality, Information Performance Perception, Information Use Satisfaction, Overall Satisfaction) were defined operationally based on the underlying prior studies. A survey was conducted with prepared questionnaires given to about 400 visitors of a specific university library. Among them, 353 valid questionnaires were finally used for the analyses. A two-step approach was used to test the hypotheses. First, confirmatory factor analysis was conducted to check the validity and reliability of the variables. The results showed that all variables had not only convergent and discriminant validity, but also reliability. Then, the research model was examined with a structural equation model using LISREL version 8.30. The fit of the research model was found to be within the acceptable level. The findings of this study are as follows. First, the professional library use pattern was found to affect users' performance perception of the library information system, while the general library use pattern was not. Second, each of the three dimensions of service quality (interaction, outcome, physical evidence) was found to influence the information system performance perception, while none of them influenced information use satisfaction. Third, library users' performance perception of the information system operation was found to affect information system use satisfaction, and both of these also influence users' overall satisfaction with the library. The findings of this study suggest that contemporary libraries should strengthen their advanced information system operation in a user-oriented way and, more importantly, maximize their visitors' utilization of the information system, accompanied by the development of proper materials and various programs. 
This study conceptualized the new constructs of library users' performance perception of the information system and information use satisfaction, which could better explain library users' overall satisfaction. Thus, future studies related to library service could utilize the constructs of information system performance and satisfaction as well as the variety of library use patterns from the users' viewpoint.


Extension Method of Association Rules Using Social Network Analysis (사회연결망 분석을 활용한 연관규칙 확장기법)

  • Lee, Dongwon
    • Journal of Intelligence and Information Systems / v.23 no.4 / pp.111-126 / 2017
  • Recommender systems based on association rule mining significantly contribute to seller's sales by reducing consumers' time to search for products that they want. Recommendations based on the frequency of transactions such as orders can effectively screen out the products that are statistically marketable among multiple products. A product with a high possibility of sales, however, can be omitted from the recommendation if it records insufficient number of transactions at the beginning of the sale. Products missing from the associated recommendations may lose the chance of exposure to consumers, which leads to a decline in the number of transactions. In turn, diminished transactions may create a vicious circle of lost opportunity to be recommended. Thus, initial sales are likely to remain stagnant for a certain period of time. Products that are susceptible to fashion or seasonality, such as clothing, may be greatly affected. This study was aimed at expanding association rules to include into the list of recommendations those products whose initial trading frequency of transactions is low despite the possibility of high sales. The particular purpose is to predict the strength of the direct connection of two unconnected items through the properties of the paths located between them. An association between two items revealed in transactions can be interpreted as the interaction between them, which can be expressed as a link in a social network whose nodes are items. The first step calculates the centralities of the nodes in the middle of the paths that indirectly connect the two nodes without direct connection. The next step identifies the number of the paths and the shortest among them. These extracts are used as independent variables in the regression analysis to predict future connection strength between the nodes. 
The strength of the connection between the two nodes of the model, which is defined by the number of nodes between the two nodes, is measured after a certain period of time. The regression analysis results confirm that the number of paths between the two products, the distance of the shortest path, and the number of neighboring items connected to the products are significantly related to their potential strength. This study used actual order transaction data collected for three months from February to April in 2016 from an online commerce company. To reduce the complexity of analytics as the scale of the network grows, the analysis was performed only on miscellaneous goods. Two consecutively purchased items were chosen from each customer's transactions to obtain a pair of antecedent and consequent, which secures a link needed for constituting a social network. The direction of the link was determined in the order in which the goods were purchased. Except for the last ten days of the data collection period, the social network of associated items was built for the extraction of independent variables. The model predicts the number of links to be connected in the next ten days from the explanatory variables. Of the 5,711 previously unconnected links, 611 were newly connected for the last ten days. Through experiments, the proposed model demonstrated excellent predictions. Of the 571 links that the proposed model predicts, 269 were confirmed to have been connected. This is 4.4 times more than the average of 61, which can be found without any prediction model. This study is expected to be useful regarding industries whose new products launch quickly with short life cycles, since their exposure time is critical. Also, it can be used to detect diseases that are rarely found in the early stages of medical treatment because of the low incidence of outbreaks. 
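
The figures quoted above can be checked with a few lines of arithmetic. The sketch below uses only the numbers in the abstract; the variable names are illustrative.

```python
candidates = 5711       # previously unconnected item pairs
newly_connected = 611   # pairs that became connected in the last ten days
predicted = 571         # pairs flagged by the proposed model
hits = 269              # flagged pairs that were actually connected

base_rate = newly_connected / candidates       # chance that a random pair connects
expected_random_hits = predicted * base_rate   # hits expected without any model
precision = hits / predicted                   # share of flagged pairs that connected
lift = precision / base_rate                   # improvement over random selection
```

Picking 571 pairs at random would be expected to find about 61 new connections, so 269 confirmed connections correspond to a lift of about 4.4, matching the abstract.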
Since the complexity of the social networking analysis is sensitive to the number of nodes and links that make up the network, this study was conducted in a particular category of miscellaneous goods. Future research should consider that this condition may limit the opportunity to detect unexpected associations between products belonging to different categories of classification.
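
The path-based explanatory variables described in this abstract can be sketched as follows: a directed item network is built from consecutive purchases, and for a pair without a direct link the shortest path length, the number of connecting paths, and the number of neighboring items are extracted. The sample transactions, the `max_len` cutoff on path length, and the function names are assumptions; the centrality measures and the regression step are omitted.

```python
from collections import defaultdict, deque

def build_item_graph(transactions):
    """Directed graph with a link from each item to the item purchased next."""
    graph = defaultdict(set)
    for items in transactions:
        for antecedent, consequent in zip(items, items[1:]):
            if antecedent != consequent:
                graph[antecedent].add(consequent)
    return graph

def shortest_path_length(graph, source, target):
    """Breadth-first search; returns None when target is unreachable."""
    seen = {source}
    queue = deque([(source, 0)])
    while queue:
        node, dist = queue.popleft()
        if node == target:
            return dist
        for nxt in graph.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    return None

def count_simple_paths(graph, source, target, max_len=3):
    """Number of loop-free paths from source to target with at most max_len links."""
    def walk(node, visited, depth):
        if node == target:
            return 1
        if depth == max_len:
            return 0
        return sum(
            walk(nxt, visited | {nxt}, depth + 1)
            for nxt in graph.get(node, ())
            if nxt not in visited
        )
    return walk(source, {source}, 0)

def neighbor_count(graph, node):
    """Distinct items linked to or from `node`."""
    outgoing = set(graph.get(node, ()))
    incoming = {src for src, targets in graph.items() if node in targets}
    return len(outgoing | incoming)

# Hypothetical purchase sequences of miscellaneous goods.
transactions = [
    ["bag", "wallet", "belt"],
    ["bag", "scarf", "belt"],
    ["wallet", "scarf"],
]
graph = build_item_graph(transactions)
features = {
    "shortest_path": shortest_path_length(graph, "bag", "belt"),
    "path_count": count_simple_paths(graph, "bag", "belt"),
    "neighbors": neighbor_count(graph, "bag") + neighbor_count(graph, "belt"),
}
```

In the study, features of this kind serve as independent variables in a regression that predicts the future connection strength of the unconnected pair.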

An Examination of Knowledge Sourcing Strategies Effects on Corporate Performance in Small Enterprises (소규모 기업에 있어서 지식소싱 전략이 기업성과에 미치는 영향 고찰)

  • Choi, Byoung-Gu
    • Asia pacific journal of information systems / v.18 no.4 / pp.57-81 / 2008
  • Knowledge is an essential strategic weapon for sustaining competitive advantage and is the key determinant for organizational growth. When knowledge is shared and disseminated throughout the organization, it increases an organization's value by providing the ability to respond to new and unusual situations. The growing importance of knowledge as a critical resource has forced executives to pay attention to their organizational knowledge. Organizations are increasingly undertaking knowledge management initiatives and making significant investments. Knowledge sourcing is considered the first important step in effective knowledge management. Most firms continue to make an effort to realize the benefits of knowledge management by using various knowledge sources effectively. Appropriate knowledge sourcing strategies enable organizations to create, acquire, and access knowledge in a timely manner by reducing search and transfer costs, which results in better firm performance. In response, the knowledge management literature has devoted substantial attention to the analysis of knowledge sourcing strategies. Many studies have categorized knowledge sourcing strategies into internal- and external-oriented. Internal-oriented sourcing strategy attempts to increase firm performance by integrating knowledge within the boundary of the firm. In contrast, external-oriented strategy attempts to bring knowledge in from outside sources via either acquisition or imitation, and then to transfer that knowledge across the organization. However, the extant literature on knowledge sourcing strategies focuses primarily on large organizations. 
Although many studies have clearly highlighted major differences between large and small firms and the need to adopt different strategies for different firm sizes, scant attention has been given to analyzing how knowledge sourcing strategies affect firm performance in small firms and how small and large firms differ in their patterns of knowledge sourcing strategy adoption. This study attempts to advance the current literature by examining the impact of knowledge sourcing strategies on small firm performance from a holistic perspective. By drawing on knowledge based theory from organization science and complementarity theory from the economics literature, this paper is motivated by the following questions: (1) what are the adoption patterns of different knowledge sourcing strategies in small firms (i.e., what sourcing strategies should be adopted and which sourcing strategies work well together in small firms)?; and (2) what are the performance implications of these adoption patterns? In order to answer the questions, this study developed three hypotheses. The first hypothesis, based on knowledge based theory, is that internal-oriented knowledge sourcing is positively associated with small firm performance. The second hypothesis, also developed on the basis of knowledge based theory, is that external-oriented knowledge sourcing is positively associated with small firm performance. The third, based on complementarity theory, is that pursuing both internal- and external-oriented knowledge sourcing simultaneously is negatively or less positively associated with small firm performance. As a sampling frame, 700 firms were identified from the Annual Corporation Report in Korea. Survey questionnaires were mailed to owners or executives who were most knowledgeable about the firm's knowledge sourcing strategies and performance. A total of 188 companies replied, yielding a response rate of 26.8%. 
Due to incomplete data, 12 responses were eliminated, leaving 176 responses for the final analysis. Since all independent variables were measured using continuous variables, a supermodularity function was used to test the hypotheses based on the cross partial derivative of the payoff function. The results indicated no significant impact of internal-oriented sourcing strategy but a positive impact of external-oriented sourcing strategy on small firm performance. This intriguing result could be explained on the basis of various resource and capital constraints of small firms. Small firms typically have restricted financial and human resources. They do not have enough assets to always develop knowledge internally. Another possible explanation is competency traps or core rigidities. Building up a knowledge base from internal knowledge creates core competences, but at the same time, excessive internally focused knowledge exploration leads to behaviors blind to other knowledge. Interestingly, this study found that internal- and external-oriented knowledge sourcing strategies had a substitutive relationship, which was inconsistent with previous studies that suggested a complementary relationship between them. This result might be explained using organizational identification theory. Internal organizational members may perceive external knowledge as a threat, and tend to ignore knowledge from external sources because they prefer to maintain their own knowledge, legitimacy, and homogeneous attitudes. Therefore, integrating knowledge from internal and external sources might not be effective, resulting in failure to improve firm performance. Another possible explanation is small firms' resource and capital constraints and lack of management expertise and absorptive capacity. Although the integration of different knowledge sources is critical, high levels of knowledge sourcing in many areas are quite expensive and so are often unrealistic for small enterprises. 
This study provides several implications for research as well as practice. First, this study extends existing knowledge by examining the substitutability (and complementarity) of knowledge sourcing strategies. Most prior studies have tended to investigate the independent effects of these strategies on performance without considering their combined impacts. Furthermore, this study tests complementarity based on the productivity approach, which has been considered a definitive test method for complementarity. Second, this study sheds new light on knowledge management research by identifying the relationship between knowledge sourcing strategies and small firm performance. Most current literature has asserted a complementary relationship between knowledge sourcing strategies on the basis of data from large firms. Contrary to the conventional wisdom, this study identifies a substitutive relationship between knowledge sourcing strategies using data from small firms. Third, the implications for practice highlight that managers of small firms should focus on knowledge sourcing from external-oriented strategies. Moreover, adopting both sourcing strategies simultaneously impedes small firm performance.
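
The complementarity test mentioned in this abstract rests on the sign of the cross partial derivative of the payoff function: a positive sign means the two strategies reinforce each other (complements), and a negative sign means one reduces the marginal return of the other (substitutes). The sketch below approximates that sign with a mixed finite difference. The two payoff functions and their coefficients are hypothetical and are not estimates from the study.

```python
def mixed_difference(payoff, x, y, h=1.0):
    """Finite-difference analogue of the cross partial derivative of payoff(x, y)."""
    return payoff(x + h, y + h) - payoff(x + h, y) - payoff(x, y + h) + payoff(x, y)

def relationship(payoff, x, y, h=1.0, tol=1e-9):
    """Classify two strategies by the sign of the mixed difference."""
    d = mixed_difference(payoff, x, y, h)
    if d > tol:
        return "complements"
    if d < -tol:
        return "substitutes"
    return "independent"

# Hypothetical payoff functions of internal and external sourcing levels.
def payoff_substitutes(internal, external):
    return 0.1 * internal + 0.6 * external - 0.2 * internal * external

def payoff_complements(internal, external):
    return 0.1 * internal + 0.6 * external + 0.2 * internal * external

small_firm_case = relationship(payoff_substitutes, 1.0, 1.0)
large_firm_case = relationship(payoff_complements, 1.0, 1.0)
```

With a negative interaction term the classification is "substitutes", the pattern the abstract reports for small firms, while a positive interaction term yields "complements", the pattern attributed to earlier studies of large firms.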

The Usage of the Vulgate Bible in the European Catholicism: from the Council of Trent until the Second Council of Vatican (유럽 천주교의 불가타 성경 사용 양상: 트렌토 공의회 이후부터 2차 바티칸 공의회 이전까지)

  • CHO, Hyeon Beom
    • The Critical Review of Religion and Culture / no.32 / pp.257-287 / 2017
  • It is quite an ambitious endeavor to trace the translation history of the Catholic Vulgate Bible from Latin into Asian languages since the 16th century. I try to bring out the procedure by which the Latin Bible was translated into the Chinese version, which eventually led to the Korean version. This work has been supported and funded by the National Research Foundation of Korea and follows a three-year plan. As the first step, I examined the situation of the Vulgate Bible in the European Catholic Church, i.e., the ritual use of the Vulgate Bible in the Mass and the religious retreat. To begin with, the liturgical texts were analysed to disclose how the Vulgate Bible was reflected in them. The Lectionary and the Evangeliary were the typical ones. The structure of the Lectionaries for Mass was based on the liturgical year cycle. In this way, the Vulgate Bible was rooted in the religious life of European Catholics after the Council of Trent, which had proclaimed the Vulgate to be the authentic source of Revelation and therefore to be respected as the only authoritative Bible. How did the Catholic Church use the Vulgate Bible outside the context and boundary of liturgy? Meditation guide books for instructing the religious retreat were published and circulated among priests, religious persons, and even laymen. These books also included citations, interpretations, and commentaries of the Vulgate Bible. Most of the devotees in Europe read the biblical phrases in the meditation guide books. There still remain unsolved problems in understanding the actual status of the Vulgate Bible in the European Catholic Church. All the Biblical verses were translated into French and included in the meditation guide books published in France. 
What did the Holy See think of the French translation of the Vulgate Bible? Unfortunately, no Vatican Decrees about the European translations of the Vulgate Bible were found. The relationship between the Vulgate Bible and the meditation guides will be very important for the study of its Chinese translation. The search for the Decrees and research on them, and on the European and non-European translations of the Vulgate Bible, will be a continuing task for me as well as for other researchers on these subjects.