DOI QR코드

DOI QR Code

A Study on the Prediction of Cabbage Price Using Ensemble Voting Techniques

앙상블 Voting 기법을 활용한 배추 가격 예측에 관한 연구

  • Lee, Chang-Min (Department of Computer Engineering, Changwon National University) ;
  • Song, Sung-Kwang (Department of Computer Engineering, Changwon National University) ;
  • Chung, Sung-Wook (Department of Computer Engineering, Changwon National University)
  • 이창민 (창원대학교 컴퓨터공학과) ;
  • 송성광 (창원대학교 컴퓨터공학과) ;
  • 정성욱 (창원대학교 컴퓨터공학과)
  • Received : 2022.02.10
  • Accepted : 2022.03.20
  • Published : 2022.03.28

Abstract

Vegetables such as cabbage are greatly affected by natural disasters, so price fluctuations increase due to disasters such as heavy rain and disease, which affects the farm economy. Various efforts have been made to predict the price of agricultural products to solve this problem, but it is difficult to predict extreme price prediction fluctuations. In this study, cabbage prices were analyzed using the ensemble Voting technique, a method of determining the final prediction results through various classifiers by combining a single classifier. In addition, the results were compared with LSTM, a time series analysis method, and XGBoost and RandomForest, a boosting technique. Daily data was used for price data, and weather information and price index that affect cabbage prices were used. As a result of the study, the RMSE value showing the difference between the actual value and the predicted value is about 236. It is expected that this study can be used to select other time series analysis research models such as predicting agricultural product prices

배추와 같은 채소류는 자연재해의 영향을 많이 받기 때문에 폭우나 병해와 같은 재해로 인해 가격 변동이 심해져 농가 경제에 영향을 미치게 된다. 이러한 문제를 해결하기 위해서 농산물 가격 예측을 위한 다양한 노력이 행해졌지만 극심한 가격 예측 변동을 예측하기는 어렵다. 본 연구에서는 단일 분류기를 결합하여 다양한 여러 개의 분류기를 통해 최종 예측 결과를 결정하는 방식인 앙상블 Voting 기법으로 배추 가격을 분석하였다. 또한 시계 열 분석 방법인 LSTM과 부스팅 기법인 XGBoost와 RandomForest로 결과 비교를 하였다. 가격 데이터는 일별 데이터를 사용하였고 배추 가격에 영향을 주는 기상정보와 물가지수 등을 사용하였다. 연구 결과로는 실제값과 예측값의 차이를 보여주는 RMSE 값이 약 236 수준이다. 이 연구를 활용하여 농산물 가격 예측과 같은 다른 시계 열 분석 연구 모델 선정에 활용할 수 있을 것으로 기대된다.

Keywords

Acknowledgement

This research is financially supported by Changwon National University in 2021~2022

References

  1. M. Y. Jin. (2021). 7 Benefits of Cabbage. Health care NEWS (Online). www.hcnews.or.kr/news_gisa/gisa_view.htm?gisa_category=02010200&gisa_idx=10026
  2. E. G. Pyo. (2006). Kimchi, selected as one of the world's top 5 health foods.. SBS news (Online). https://news.sbs.co.kr/news/endPage.do?news_id=N1000090983&plink=COPYPASTE&cooper=SBSNEWSEND
  3. aTKAMIS. (n. d.). Food Archives. aTKAMIS (Online). www.kamis.or.kr/customer/archive/archive.do?action=detail&archiveNo=182
  4. D. M. Hawkins. (2004). The problem of overfitting. Journal of chemical information and computer sciences, 44(1), 1-12. DOI : 10.1021/ci0342472
  5. J. O. Nam. (2014). Forecast for Laver Producer Price Using Time Series Models. KMI, 29(2), 271-303.
  6. S. H. Shin, M. K. Lee & S. K. Song. (2018). A Prediction Model for Agricultural Products Price with LSTM Network. The Korea Contents Association, 18(11), 416-429.
  7. H. J. Lim. (2020). Dual Attention-based LSTM Model for produce price prediction. Domestic Master's Thesis. Sejong University. Seoul.
  8. J. H. Park. (2020). [Park Jung-hyun's Getting Started with Data Science] ⑥ Feature Engineering (2). AiTimes (Online). www.aitimes.com/news/articleView.html?idxno=134913
  9. wikipedia. (2022). linear regression. wikipedia (Online). https://en.wikipedia.org/wiki/Linear_regression
  10. wikipedia. (2021). Least square method.wikipedia (Online). https://en.wikipedia.org/wiki/Least_squares