• Title/Summary/Keyword: 데이터 종속성

Search Result 374, Processing Time 0.026 seconds

Clickstream Big Data Mining for Demographics based Digital Marketing (인구통계특성 기반 디지털 마케팅을 위한 클릭스트림 빅데이터 마이닝)

  • Park, Jiae;Cho, Yoonho
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.3
    • /
    • pp.143-163
    • /
    • 2016
  • The demographics of Internet users are the most basic and important sources for target marketing or personalized advertisements on the digital marketing channels which include email, mobile, and social media. However, it gradually has become difficult to collect the demographics of Internet users because their activities are anonymous in many cases. Although the marketing department is able to get the demographics using online or offline surveys, these approaches are very expensive, long processes, and likely to include false statements. Clickstream data is the recording an Internet user leaves behind while visiting websites. As the user clicks anywhere in the webpage, the activity is logged in semi-structured website log files. Such data allows us to see what pages users visited, how long they stayed there, how often they visited, when they usually visited, which site they prefer, what keywords they used to find the site, whether they purchased any, and so forth. For such a reason, some researchers tried to guess the demographics of Internet users by using their clickstream data. They derived various independent variables likely to be correlated to the demographics. The variables include search keyword, frequency and intensity for time, day and month, variety of websites visited, text information for web pages visited, etc. The demographic attributes to predict are also diverse according to the paper, and cover gender, age, job, location, income, education, marital status, presence of children. A variety of data mining methods, such as LSA, SVM, decision tree, neural network, logistic regression, and k-nearest neighbors, were used for prediction model building. However, this research has not yet identified which data mining method is appropriate to predict each demographic variable. Moreover, it is required to review independent variables studied so far and combine them as needed, and evaluate them for building the best prediction model. The objective of this study is to choose clickstream attributes mostly likely to be correlated to the demographics from the results of previous research, and then to identify which data mining method is fitting to predict each demographic attribute. Among the demographic attributes, this paper focus on predicting gender, age, marital status, residence, and job. And from the results of previous research, 64 clickstream attributes are applied to predict the demographic attributes. The overall process of predictive model building is compose of 4 steps. In the first step, we create user profiles which include 64 clickstream attributes and 5 demographic attributes. The second step performs the dimension reduction of clickstream variables to solve the curse of dimensionality and overfitting problem. We utilize three approaches which are based on decision tree, PCA, and cluster analysis. We build alternative predictive models for each demographic variable in the third step. SVM, neural network, and logistic regression are used for modeling. The last step evaluates the alternative models in view of model accuracy and selects the best model. For the experiments, we used clickstream data which represents 5 demographics and 16,962,705 online activities for 5,000 Internet users. IBM SPSS Modeler 17.0 was used for our prediction process, and the 5-fold cross validation was conducted to enhance the reliability of our experiments. As the experimental results, we can verify that there are a specific data mining method well-suited for each demographic variable. For example, age prediction is best performed when using the decision tree based dimension reduction and neural network whereas the prediction of gender and marital status is the most accurate by applying SVM without dimension reduction. We conclude that the online behaviors of the Internet users, captured from the clickstream data analysis, could be well used to predict their demographics, thereby being utilized to the digital marketing.

The Effect of Alcohol Availability on Drinking Behavior : A Multilevel Analysis on Urban Regions (알코올가용성이 음주행태에 미치는 영향: 도시지역을 대상으로 한 다수준 분석)

  • Kwon, RIA;Shin, Sangsoo;Shin, Young-jeon
    • 한국사회정책
    • /
    • v.25 no.2
    • /
    • pp.125-163
    • /
    • 2018
  • Social and health problems related to drinking are serious. Drinking behavior is affected not only by personal factors but also by environment factors. The purpose of this study is to find out how the alcoholic beverage stores in community influence the drinking behaviors of individuals after adjusting the individual level variables and provide it as basic data for alcohol related regulatory policies. In order to identify the factors affecting drinking behavior, we conducted a multilevel logistic regression analysis with high-risk drinking and current drinking as dependent variables. Individual-level data provided by 2015 community health survey from respondents of urban residents, and regional level data provided by the National Statistical office. The variables such as age, education level, and income level were used as individual level variables and the number of basic living allowances, divorce rate, and the number of pubs were used as community level variables. According to the research results, after controlling all variables, the number of bar, retail per $1km^2$ in residential area effect on current drinking. But, they are not effect on high risk drinking. In the high risk drinking, only the divorce rate effect on drinking behavior. As a result of the stratified analysis, there was no difference in the current drinking. But, it shows that the higher the number of retail stores and the total alcohol availability, the higher risk drinking behavior in the 60s. The results of this study suggest that policies aimed not only on individuals but also on the local environment are necessary.

A Study of the Influencing Factors for Decision Making on Construction Contract Types : Focused on DoD Construction Acquisitions with Firm Fixed Price and Cost Reimbursable in FAR (건설공사 대가지급방식의 의사결정 영향요인에 관한 연구 - 미국 연방조달규정에 따른 미국 국방성의 정액계약과 실비정산계약을 중심으로 -)

  • Son, Young-Hoon;Kim, Kyung-Rai
    • Korean Journal of Construction Engineering and Management
    • /
    • v.25 no.2
    • /
    • pp.23-35
    • /
    • 2024
  • This study analyzed the correlation between each of the 12 influencing factors in FAR 16.04 and the decision-making process for construction contract types, using data from a total of 2,406 DoD Construction Acquisitions spanning from 2008 to 2022. The study considered 12 independent variables, grouped into 4 Characteristics with 3 factors each. Meanwhile, all other contract types were categorized into two types: Firm-Fixed-Price (FFP) and Cost-Reimbursement Contract (CRC), which served as the dependent variables. The findings revealed that FFP contracts significantly dominated in terms of acquisition volume. In line with prevailing beliefs, logistic data analysis and Analytical Hierarchy Process (AHP) analysis of Relative Weights from Experts' Survey demonstrated that independent variables like Uncertainty of the Scope of Work and Complexity found out to be increasing the likelihood of selecting CRC. The number of contractors in the market does indeed influence the possibilities of contract decision-making between CRC and FFP. Meanwhile, the p-values of the top 3 influencing factors on CRC from the AHP analysis-namely, Appropriateness of CAS, Project Urgency, and Cost Analysis-exceeded 0.05 in the binominal regression results, rendering it inconclusive whether they significantly influenced the construction contract type decision, particularly with respect to payment methods. This outcome partly results from the fact that a majority of respondents possessed specific experiences related to the USFK relocation project. Furthermore, influencing factors in construction projects behave differently than common beliefs suggest. As a result, it is imperative to consider the 12 influencing factors categorized into 4 Characteristics areas before establishing acquisition strategies for targeted construction projects.

Analysis of Determinant Factors of Apartment Price Considering the Spatial Distribution and Housing Attributes (공간지리적 요인과 주거특성을 고려한 공동주택 가격결정 분석)

  • Moon, Tae-Heon;Jeong, Yoon-Young
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.11 no.1
    • /
    • pp.68-79
    • /
    • 2008
  • Because local cities are different from large cities, they need to reflect their own characteristics of housing market. Thus in order to obtain useful implications for the establishing sound housing market in Jinju City, this paper investigated the characteristics of spatial distribution and determinant factors that affect apartment price in Jinju City. GIS representation of the apartments showed that most of old and small apartments were built in 'land readjustment project' areas executed in 1970s. On the contrary, new and large scale apartment complexes were built quite recently and distributed in the western and southern parts of the city. Next, in order to examine the factors which affect apartment price, this paper subtracted firstly several variables from the related studies. However in order to avoid multi-colinearity, variables were summarized by means of factor analysis. Then, setting apartment price as a dependant variable, 12 hedonic price models were established with 33 independent variables. As results, building age, floor area, accessibility to university and hospital, accessibility to arterial road, and stair-type building were turned out to be significant. These results will be used in making the supply and allocation plan of urban facilities and housing. Finally as conclusions this paper emphasized the need of periodic analysis of local housing market and establishing detailed housing information systems.

  • PDF

An Analysis on the Yield Curves for Active Bond Managements (적극적 채권운용전략을 위한 수익률곡선 분석)

  • Jeong, Hee-Joon
    • The Korean Journal of Financial Management
    • /
    • v.25 no.2
    • /
    • pp.1-31
    • /
    • 2008
  • Before the financial crisis in 1997, Korean bond markets had been those of corporate bonds with relatively high market yield. During the period, most of major institutional investors tend to utilize passive strategies such as buying and holding. After the crisis, however, they could not help choosing active bond management strategies because of lowed yield level and intensified competition among the financial institutions. This study is forced on the yield curve, which is the reflection of all information on the bond investment environments. The study also make analysis on the major economic and securities market factors and its structural relationship with the shape of the curve such as level, curvature and slope. For these purposes, an empirical model based on the Nelson-Siegel Model is estimated with the data during $1999{\sim}2006$. Out-of-sample forecasting is also made to test the usefulness of the estimated model. In addition, the dependent variables which are the estimates of level and slope are estimated on the macro variables and securities market variables. VAR and SUR models are used for the estimation. Estimation results show that level and slope of the yield curve are influenced by the target call rate change, exchange rate change rate, inflation rate. These results provide practical implications for the active managements in the overall treasury bond markets.

  • PDF

Comparison of General Composition of Cooked Krill and Alcalase Optimization for Maximum Antioxidative Activity by Using Response Surface Methodology (자숙크릴의 일반성분 분석과 항산화 활성을 위한 반응표면법에 의한 알카라제 가수분해 최적화)

  • Kim, Kyoung-Myo;Cho, Yong-Bum;Hwang, Young-Jeong;Lee, Da-Sun;Lee, Yang-Bong
    • Culinary science and hospitality research
    • /
    • v.18 no.1
    • /
    • pp.15-26
    • /
    • 2012
  • The objective of this study is to optimize enzymatic hydrolysis of cooked krill by using Alcalase. To optimize krill hydrolysis on such dependent variables as TCA, DPPH-scavenging, and Fe-chelating activities by using Alcalase, independent variables of hydrolysis pH and temperature were investigated Their formulas and three dimensional graphs were obtained by using SAS and Maple softwares, respectively. For comparison of general composition of raw krill, its contents of moisture, crude protein, crude fat, and ash were 17.48%, 53.74%, 15.66%, and 10.21%, respectively, and for cooked krill, its contents were 4.80%, 71.84%, 5.26%, and 15.09%, respectively. The composition of fatty acids for cooked krill was similar to that of raw krill. The most abundant fatty acid was palmitic acid(16:0) and the following order was oleic acid(18:1), eicosapentaenoic acid (20:5), palmitoleic acid(16:1), and docosahexaenoic acid(22:6). For DH optimization of hydrolysates from cooked krill, its result was pH 8.5 and $66.6^{\circ}C$ hydrolysis temperature for the maximum DH of 29.4% For DPPH-antioxidative optimization of hydrolysates from raw krill, its maximum result of 27.1% was obtained in the hydrolysis condition of pH 7.4 and $67.5^{\circ}C$. For Fe-chelating optimization of hydrolysates from cooked krill, its maximum result of 24.9% was in the condition of pH 8.7 and $65.5^{\circ}C$. These results can be used for basic data for using krill products and other fish products as bioactive ingredients.

  • PDF

A Study of The Determinants of Turnover Intention and Organizational Commitment by Data Mining (데이터마이닝을 활용한 이직의도와 조직몰입의 결정요인에 대한 연구)

  • Choi, Young Joon;Shim, Won Shul;Baek, Seung Hyun
    • Journal of the Korea Society for Simulation
    • /
    • v.23 no.1
    • /
    • pp.21-31
    • /
    • 2014
  • In this article, data mining simulation is applied to find a proper approach and results of analysis for study of variables related to organization. Also, turnover intention and organizational commitment are used as target (dependent) variables in this simulation. Classification and regression tree (CART) with ensemble methods are used in this study for simulation. Human capital corporate panel data of Korea Research Institute for Vocation Education & Training (KRIVET) is used. The panel data is collected in 2005, 2007, and 2009. Organizational commitment variables are analyzed with combined measure variables which are created after investigation of reliability and single dimensionality for multiple-item measurement details. The results of this study are as follows. First, major determinants of turnover intention are trust, communication, and talent management-oriented trend. Second, the main determining factors for organizational commitment are trust, the number of years worked, innovation, communication. CART with ensemble methods has two ensemble CART methods which are CART with Bagging and CART with Arcing. Comparing two methods, CART with Arcing (Arc-x4) extracted scenarios with very high coefficients of determination. In this study, a scenario with maximum coefficient of determinant and minimum error is obtained and practical implications are presented. Using one of data mining methods, CART with ensemble method. Also, the limitation and future research are discussed.

Analysis of Factors of IRRs and Spread on Korea's BTO Projects (우리나라 민간투자사업의 수익률과 가산금리의 결정요인 분석)

  • Ju, Jae-Hong;Ha, Heon-Gu;Park, Dong-Gyu
    • Journal of Korean Society of Transportation
    • /
    • v.28 no.2
    • /
    • pp.135-150
    • /
    • 2010
  • This paper focuses on finding out which covenants are different among the concession agreements of Korean BTO projects and how these influenced IRR(Internal Rate of Return). That is, to figure out the political and economical determinants of IRR by analyzing the concession agreements which are the basic contract of implementing projects. As IRR is an index of profitability, so spread is an indicator of risk to collect debts. That's the reason why the analysis of spread is included. For the empirical analysis, the data of concession agreements for 75 projects and financial models are used. These 75 concession agreements are contracted from 1995 to 2008. The dependent variables are after tax nominal IRR and the spread of long term interest rates of 75 BTO projects. The independent variables are project's proceeding factors, the feasibility variables, the variables related to financial character and the variables related to covenants or the government's policy. The analysis shows that IRR has been influenced by the equity level of financial investors, the national government managed projects, the projects with minimum revenue guarantee (MRG), etc. And the equity level of financial investors, the national government managed projects and the implementation of supplementary project have an effect on spread also.

Effect of Usage Habits and Hardware Characteristics of Smartphone Users on Functional Performance (스마트폰 사용자의 사용습관 및 하드웨어 특성이 기능 수행도에 미치는 영향)

  • Yoon, Cheol-Ho
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.20 no.5
    • /
    • pp.599-604
    • /
    • 2019
  • This study examined how the characteristics of smartphone affect the functional performance of smartphones. In particular, this study focused on an understanding of the correlation between smartphone functional factors and usage habits. Functionality is defined as 11 kinds of functional elements. The characteristics of the smartphones were defined as the hardware characteristics and the user habits characteristics. Eighty subjects were organized to collect actual data by the smartphone function. The actual time required to perform each function was measured and observed five times for each functional element. Regression analysis was performed using Minitab ver.14 by classifying the measured values of the functional elements as dependent variables, the hardware characteristics collected through the questionnaire, and the user's usage habits as 12 independent variables. Overall, it is difficult to conclude that demographic and hardware characteristics of smartphone users have a significant effect on the performance. On the other hand, the variables related to smartphone usage habits have had a great impact on the performance of smartphone tasks, and as a result, the task execution time has increased. In simple input variables or viewing variables, the effects on usability was relatively small, but in all active variables, the execution time increased 10% - 30% in all tasks except for phone calls, seeking phone numbers, and dictionary search. Thus far, if the smartphone user interface has been provided uniformly in a large and simple manner, users with various usage habits can be utilized even if the input method and task processing method are more complicated and various interface types are provided.

A Study on the Determinants of "Decent Work" in the Logistics Industry : Focusing on the comparison with whole industries (물류산업의 "괜찮은 일자리(Decent Work)" 결정요인에 관한 연구 : 전체산업 모형과의 비교를 중심으로)

  • So, Ae-Rim;Shin, Seung-Sik
    • Journal of Korea Port Economic Association
    • /
    • v.38 no.4
    • /
    • pp.139-169
    • /
    • 2022
  • This study derived determinants of 'Decent Work' in the logistics industry and aims to use the analysis results as basic data for policymaking related to labor in the logistics industry and to prepare policies suitable for the characteristics of the logistics industry. As the dependent variable of the model, the Decent Job derived from the first study was used, and the target model was derived from panel data of whole industries to understand the unique characteristics of logistics industry jobs and applied to the logistics industry model. This study found that in the logistics industry, developing the expertise of the logistics industry through "vocational training" compared to whole industries is an important factor rather than raising the "academic level" through the regular curriculum. This seems to reflect the characteristics of the logistics industry as specialized vocational training is required in the case of "railway transportation", "inland water and port transportation", and "air cargo transportation", which have a high proportion of decent job workers among the detailed logistics industries analyzed in this study. Therefore, developing job expertise through additional manpower training programs such as vocational training as well as academic fields learned through regular curriculum is a very important factor in engaging in "Decent Work" not only in the logistics industry but also in other industries.