• Title/Summary/Keyword: extreme gradient boosting model

Search Result 41, Processing Time 0.015 seconds

A study on the number of passengers using the subway stations in Seoul (데이터마이닝 기법을 이용한 서울시 지하철역 승차인원 예측)

  • Cho, Soojin;Kim, Bogyeong;Kim, Nahyun;Song, Jongwoo
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.1
    • /
    • pp.111-128
    • /
    • 2019
  • Subways are eco-friendly public transportation that can transport large numbers of passengers safely and quickly. It is necessary to predict the accurate number of passengers in order to increase public interest in subway. This study groups stations on Lines 1 to 9 of the Seoul Metropolitan Subway using clustering analysis. We propose one final prediction model for all stations and three optimal prediction models for each cluster. We found three groups of stations out of 294 total subway stations. The Group 1 area is industrial and commercial, the Group 2 ares is residential and commercial, and the Group 3 area is residential districts. Various data mining techniques were conducted for each group, as well as driving some influential factors on demand prediction. We use our model to predict the number of passengers for 8 new stations which are part of the 3rd extension plan of Seoul metro line 9 opened in October 2018. The estimated average number of passengers per hour is from 241 to 452 and the estimated maximum number of passengers per hour is from 969 to 1515. We believe our analysis can help improve the efficiency of public transportation policy.