Browse > Article
http://dx.doi.org/10.5351/KJAS.2017.30.1.025

Analysis of cycle racing ranking using statistical prediction models  

Park, Gahee (Department of Statistics, Ewha Womans University)
Park, Rira (Department of Statistics, Ewha Womans University)
Song, Jongwoo (Department of Statistics, Ewha Womans University)
Publication Information
The Korean Journal of Applied Statistics / v.30, no.1, 2017 , pp. 25-39 More about this Journal
Abstract
Over 5 million people participate in cycle racing betting and its revenue is more than 2 trillion won. This study predicts the ranking of cycle racing using various statistical analyses and identifies important variables which have influence on ranking. We propose competitive ranking prediction models using various classification and regression methods. Our model can predict rankings with low misclassification rates most of the time. We found that the ranking increases as the grade of a racer decreases and as overall scores increase. Inversely, we can observe that the ranking decreases when the grade of a racer increases, race number four is given, and the ranking of the last race of a racer decreases. We also found that prediction accuracy can be improved when we use centered data per race instead of raw data. However, the real profit from the future data was not high when we applied our prediction model because our model can predict only low-return events well.
Keywords
cycle racing; linear regression; stepwise regression; logistic regression; random forest; generalized additive model; gradient boosting; ridge regression; lasso regression; principal components regression; important variables;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Breiman, L. (2001). Random forests, Machine Learning, 45, 5-32.   DOI
2 Cho, H. C., Kang, S. K., and Kim, J. K. (2008). Relationship of lower extremity factors, 200m record and Wingate anaerobic power in racing and competitive cyclists, Korean Journal of Sport Science, 19, 9-20.   DOI
3 Choe, H., Hwang, N., Hwang, C., and Song, J. (2015). Analysis of horse races : prediction of winning horses in horse races using statistical models, Korean Journal of Applied Statistics, 28, 1133-1146.   DOI
4 Hastie, T., Tibshirani, R., and Friedman, J. (2001). The Elements of Statistical Learning; Data Mining, Inference and Prediction, Springer, New York.
5 Hoerl, A. E. and Kennard, R. W. (1970). Ridge regression: biased estimation for nonorthogonal problems, Technometrics, 12, 55-67.   DOI
6 Kim, B. S. and Kim, J. S. (2007). Relationship between customer satisfaction and patterns of ticket purchase of cycle racing customers, Journal of Sport and Leisure Studies, 30, 203-211.
7 Tibshirani, R. (1996). Regression shrinkage and selection via the LASSO, Journal of the Royal Statistical Society Series B (Methodological), 58, 267-288.
8 Park, C., Kim, Y., Kim, J., Song, J., and Choi, H. (2011). Datamining using R, Kyowoo, Seoul.
9 Ridgeway, G. (2006). Generalized boosted models: a guide to the gbm package, Available from: http://cran.r-project.org/web/packages/gbm