DOI QR코드

DOI QR Code

A Generation and Accuracy Evaluation of Common Metadata Prediction Model Using Public Bicycle Data and Imputation Method

  • Kim, Jong-Chan (Dept. of Computer Engineering, Sunchon National University) ;
  • Jung, Se-Hoon (School of Creative Convergence, Andong National University)
  • Received : 2022.01.12
  • Accepted : 2022.02.03
  • Published : 2022.02.28

Abstract

Today, air pollution is becoming a severe issue worldwide and various policies are being implemented to solve environmental pollution. In major cities, public bicycles are installed and operated to reduce pollution and solve transportation problems, and operational information is collected in real time. However, research using public bicycle operation information data has not been processed. This study uses the daily weather data of Korea Meteorological Agency and real-time air pollution data of Korea Environment Corporation to predict the amount of daily rental bicycles. Cross- validation, principal component analysis and multiple regression analysis were used to determine the independent variables of the predictive model. Then, the study selected the elements that satisfy the significance level, constructed a model, predicted the amount of daily rental bicycles, and measured the accuracy.

Keywords

Acknowledgement

This work was supported by a grant from 2020 Research Fund of Andong National University

References

  1. O. Eoin and D.B. Shmoys, "Data Analysis and Optimization for Citi Bike Sharing," Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, pp. 687-694, 2015.
  2. C. Chekuri, N. Korula, and M. Pal, "Improved Algorithms for Orienteering and Related Problems," ACM Transactions on Algorithms, Vol. 8, No. 3 pp. 1-27, 2012. https://doi.org/10.1145/2229163.2229167
  3. Z. Yongping and Z. Mi, "Environmental Benefits of Bike Sharing: A Big Data-Based Analysis," Applied Energy, Vol. 220, pp. 296-301, 2018. https://doi.org/10.1016/j.apenergy.2018.03.101
  4. G.N. Oliveiraa, J.L. Sotomayor, R.P. Torchelsen C.T. Silva, and J.L-D. Comba, "Visual Analysis of Bike-Sharing Systems," Computers & Graphics, Vol. 60, pp. 119-129, 2016. https://doi.org/10.1016/j.cag.2016.08.005
  5. F. Yang, F. Ding, X. Qu, and B. Ran, "Estimating Urban Shared-Bike Trips with Location-Based Social Networking Data," Sustainability, Vol. 11, No. 11, pp. 1-14, 2019.
  6. V. Mohammad, "Analysis of Potential Evapotranspiration Using Limited Weather Data," Applied Water Science, Vol. 7, No. 1, pp. 187-197, 2017. https://doi.org/10.1007/s13201-014-0234-2
  7. W. Ling, Q. Shi, and M. Abdel-Aty, "Predicting Crashes on Expressway Ramps with Real-Time Traffic and Weather Data," Transportation research record, Vol. 2514, No. 1, pp. 32-38, 2015. https://doi.org/10.3141/2514-04
  8. Changwon City(2018), https://www.nubija.com/main/main.do(accessed July 1, 2018).
  9. National Institute for Transportation and Commities(2018), https://nitc.trec.pdx.edu/research/project/1041/National_Electric_Bike_Owner_Survey_(accessed July 1, 2018).
  10. Peopleforbikes(2018), https://peopleforbikes.org (accessed July 1, 2018).
  11. L. Alexandros and H. V. Jagadish, "Challeng-Es and Opportunities with Big Data," Proceedings of the VLDB Endowment, Vol. 5, No. 12, pp. 2032-2033, 2012.
  12. V. Marx, "Biology: The Big Challenges of Big Data," Nature, Vol. 255, pp. 255-260, 2013. https://doi.org/10.1038/255255a0
  13. I.T. Jolliffe, "Principal Components in Regression Analysis," Springer Series in Statistics, New York, NY, pp. 129-155, 1986.
  14. S.H. Jung, C.S. Shin, C.Y. Yun, J.W. Park, M.H. Park, Y.H. Kim, S.B. Lee, and Ch.B. Sim, "Analysis Process based on Modify K-means for Efficiency Improvement of Electric Power Data Pattern Detection," Journal of Korea Multimedia Society, Vol. 20, No. 12, pp. 1960-1969, 2017. https://doi.org/10.9717/KMMS.2017.20.12.1960
  15. S. Qinbao and M. Shepperd, "A New Imputa-Tion Method for Small Software Project Data Sets," Journal of Systems and Software, Vol. 80, No. 1, pp. 51-62, 2007. https://doi.org/10.1016/j.jss.2006.05.003
  16. P. William, "Pseudo Maximum Likelihood Estimation: The Asymptotic Distribution," The Annals of Statistics, Vol. 14, No. 1, pp. 355-357, 1986. https://doi.org/10.1214/aos/1176349862
  17. T. Venta and J. Sumner, "Maximum Likelihood Estimates of Rearrangement Distance: Implementing a Representation-Theoretic Approach," Bulletin of Mathematical Biology, Vol. 81, No. 2, pp. 535-567, 2019. https://doi.org/10.1007/s11538-018-0511-6
  18. J.C. Kim, C.B. Sim, and S.H. Jung, "A Study on Automatic Missing Value Imputation Replacement Method for Data Processing in Digital Data," Journal of Korea Multimedia Society, Vol. 24, No. 2, pp. 245-254, 2021. https://doi.org/10.9717/KMMS.2020.24.2.245
  19. P. Burman, "A Comparative Study of Ordinary Cross-validation, V-Fold Cross validation and the Repeated Learning-Testing Methods," Biometrika, Vol. 76, pp. 503-514, 1989. https://doi.org/10.1093/biomet/76.3.503