Browse > Article
http://dx.doi.org/10.5351/KJAS.2015.28.5.933

A Study on Domestic Drama Rating Prediction  

Kang, Suyeon (Department of Statistics, Ewha Womans University)
Jeon, Heejeong (Department of Statistics, Ewha Womans University)
Kim, Jihye (Department of Statistics, Ewha Womans University)
Song, Jongwoo (Department of Statistics, Ewha Womans University)
Publication Information
The Korean Journal of Applied Statistics / v.28, no.5, 2015 , pp. 933-949 More about this Journal
Abstract
Audience rating competition in the domestic drama market has increased recently due to the introduction of commercial broadcasting and diversification of channels. There is now a need for thorough studies and analysis on audience rating. Especially, a drama rating is an important measure to estimate advertisement costs for producers and advertisers. In this paper, we study the drama rating prediction models using various data mining techniques such as linear regression, LASSO regression, random forest, and gradient boosting. The analysis results show that initial drama ratings are affected by structural elements such as broadcasting station and broadcasting time. Average drama ratings are also influenced by earlier public opinion such as the number of internet searches about the drama.
Keywords
drama rating; linear regression; LASSO regression; random forest; gradient boosting; important variables;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Bae, J. (2005). An analysis on the factors in drama ratings - focusing on the drama attributes and audience factors, Korean Journal of Broadcasting and Telecommunication Studies, 19, 270-309.
2 Breiman, L. (1996). Bagging predictors, Machine Learning, 24, 123-140.
3 Breiman, L. (2001). Random forests, Machine Learning, 45, 5-32.   DOI
4 Breiman, L., Friedman, J., Olshen, R. and Stone, C. (1984). Classification and Regression Trees, Chapman and Hall, New York.
5 Cohen, J. (2002). Television viewing preferences: Programs, schedules, and the structure of viewing choices made by Israeli adults, Journal of Broadcasting & Electronic Media, 46, 204-221.   DOI   ScienceOn
6 Cortes, C. and Vapnik, V. (1995). Support-vector networks, Machine Learning, 20, 273-297.
7 Friedman, J. (2002). Stochastic gradient boosting, Computational Statistics & Data Analysis, 38, 367-378.   DOI   ScienceOn
8 Hastie, T., Tibshirani, R. and Friedman, J. (2009). The Elements of Statistical Learning, Springer, New York. USA.
9 Park, C., Kim, Y., Kim, J., Song, J. and Choi, H. (2011). Datamining using R, Kyowoo, Seoul.
10 R Development Core Team (2010). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, ISBN 3-900051-07-0. http://www.R-project.org.
11 Ridgeway, G. (2012). Generalized Boosted Models: A guide to the gbm package.
12 Tibshirani, R. (1996). Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society B, 58, 267-288.
13 Venables, W. N. and Ripley, B. D. (2003). Modern Applied Statistics with S, Springer, New York.