DOI QR코드

DOI QR Code

Role of unstructured data on water surface elevation prediction with LSTM: case study on Jamsu Bridge, Korea

LSTM 기법을 활용한 수위 예측 알고리즘 개발 시 비정형자료의 역할에 관한 연구: 잠수교 사례

  • Lee, Seung Yeon (Department of Civil and Environmental Engineering, Hongik University) ;
  • Yoo, Hyung Ju (Department of Civil and Environmental Engineering, Hongik University) ;
  • Lee, Seung Oh (Department of Civil and Environmental Engineering, Hongik University)
  • 이승연 (홍익대학교 건설환경공학과) ;
  • 유형주 (홍익대학교 건설환경공학과) ;
  • 이승오 (홍익대학교 건설환경공학과)
  • Received : 2021.10.08
  • Accepted : 2021.11.25
  • Published : 2021.12.31

Abstract

Recently, local torrential rain have become more frequent and severe due to abnormal climate conditions, causing a surge in human and properties damage including infrastructures along the river. In this study, water surface elevation prediction algorithm was developed using the LSTM (Long Short-term Memory) technique specialized for time series data among Machine Learning to estimate and prevent flooding of the facilities. The study area is Jamsu Bridge, the study period is 6 years (2015~2020) of June, July and August and the water surface elevation of the Jamsu Bridge after 3 hours was predicted. Input data set is composed of the water surface elevation of Jamsu Bridge (EL.m), the amount of discharge from Paldang Dam (m3/s), the tide level of Ganghwa Bridge (cm) and the number of tweets in Seoul. Complementary data were constructed by using not only structured data mainly used in precedent research but also unstructured data constructed through wordcloud, and the role of unstructured data was presented through comparison and analysis of whether or not unstructured data was used. When predicting the water surface elevation of the Jamsu Bridge, the accuracy of prediction was improved and realized that complementary data could be conservative alerts to reduce casualties. In this study, it was concluded that the use of complementary data was relatively effective in providing the user's safety and convenience of riverside infrastructure. In the future, more accurate water surface elevation prediction would be expected through the addition of types of unstructured data or detailed pre-processing of input data.

최근 이상기후로 인한 국지성호우가 잦아져 하천변 사회기반시설을 포함한 인적·물적 피해가 급증하고 있다. 본 연구에서는 해당 시설들의 침수 피해를 예측·방지하고자 기계학습 중 시계열자료에 특화된 LSTM(Long Short- term Memory)기법을 활용하여 수위 예측 알고리즘을 개발하였다. 연구대상지는 잠수교로 연구기간은 총 6년(2015년~2020년)의 6, 7, 8월로 3시간 후의 잠수교 수위를 예측하였다. 입력자료(Input data)는 잠수교 수위(EL.m), 팔당댐 방류량(m3/s), 강화대교 조위(cm), 서울시 트윗의 개수로 기존 연구에 주로 사용된 정형자료뿐만 아니라 워드클라우드를 통해 구축된 비정형자료도 함께 사용하여 상호 보완형 자료를 구축하고, 비정형자료 활용 유무의 비교·분석을 통해 비정형자료의 역할도 제시하였다. 잠수교의 수위 예측 시 상호 보완형의 자료가 정형자료만을 사용한 경우에 비해 예측 정확도가 향상하였는 데, 이는 인명 피해를 감소시킬 수 있는 보수적인 예/경보가 가능함을 알 수 있었다. 본 연구에서는 하천변 사회기반시설의 이용자 안전 및 편의 제공에 상호 보완형 자료의 사용이 보다 효과적이라 판단하였다. 향후에는 비정형자료의 종류를 추가하거나 입력자료의 세밀한 전처리를 통하여 더욱 정확한 수위 예측을 기대해본다.

Keywords

Acknowledgement

본 연구는 환경부의 재원으로 한국환경산업기술원의 물관리연구사업(127572)에 의해 수행되었습니다.

References

  1. Asmai, S.A., Abidin, Z.Z., Basiron, H., and Ahmad, S. (2019). "An intelligent crisis-mapping framework for flood prediction." International Journal of Recent Technology and Engineering, Vol. 8, No. 2, pp. 1304-1310. https://doi.org/10.35940/ijrte.b1058.0882s819
  2. Bae, D.H., and Lee, B.J. (2011). "Development of continuous rainfall-runoff model for flood forecasting on the large-scale basin." Journal of Korea Water Resources Association, Vol. 44, No. 1, pp. 51-64. https://doi.org/10.3741/JKWRA.2011.44.1.51
  3. Behzad, M., Asghari, K., and Coppola Jr, E.A. (2010). "Comparative study of SVMs and ANNs in aquifer water level prediction." Journal of Computing in Civil Engineering, Vol. 24, No. 5, pp. 408-413. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000043
  4. Choi, D.W., Lee, W.B., Song, Y.H., Kang, T.H., and Han, Y.J. (2020). "Prediction of Highy Pathogenic Avian Influenza (HPAI) diffusion path using LSTM." The Journal of Bigdata, Vol. 5, No. 1, pp. 1-9.
  5. Gautam, Y. (2021). "Transfer Learning for COVID-19 cases and deaths forecast using LSTM network." ISA transactions. doi: 10.1016/j.isatra.2020.12.057
  6. Ha, M., and Ahn, H. (2019). "A machine learning-based vocational training dropout prediction model considering structured and unstructured data." The Journal of the Korea Contents Association, Vol. 19, No. 1, pp. 1-15. https://doi.org/10.5392/JKCA.2019.19.01.001
  7. Han, G.Y., Son, I.H., and Lee, J.Y. (2000). "Hydraulic model for real time forecasting of inundation risk." Journal of Korea Water Resources Association, Vol. 33, No. 3, pp. 331-340.
  8. Hochreiter, S., and Schmidhuber, J. (1997). "LSTM can solve hard long time lag problems." Advances in Neural Information Processing Systems, 473-479.
  9. Jang, S., Chun, H., Cho, I., and Kim, D. (2017). "A study on cabbage wholesale price forecasting model using unstructured agricultural meteorological data." Journal of the Korean Data and Information Science Society, Vol. 28, No. 3, pp. 617-624.
  10. Jung, S.H., Lee, D.E., and Lee, K.S. (2018). "Prediction of river water level using deep-learning open library." Journal of the Korean Society of Hazard Mitigation, Vol. 18, No. 1, pp. 1-11. https://doi.org/10.9798/KOSHAM.2018.18.1.1
  11. Kim, J.H., Kang, M.S., and Kim, S.H. (2019). "Comparing the performance of artificial neural networks and long short-term memory networks for rainfall-runoff analysis." Proceedings of the Korea Water Resources Association Conference, KWRA, pp. 320-320.
  12. Kim, M.S., Jung, S.H., Kim, J.G., Lee, H.S., and Kim, S.S. (2021). "A study on solar radiation forecasting based on long short-term memory considering hourly weather changes." Journal of Korean Institute of Intelligent Systems, Vol. 31, No. 1, pp. 88-94. https://doi.org/10.5391/JKIIS.2021.31.1.088
  13. Lee, J., and Hwang, S. (2019). "A study on the application of social network service data for monitoring flood damage." Journal of the Korean Society of Hazard Mitigation, Vol. 19, No. 7, pp. 77-85. https://doi.org/10.9798/kosham.2019.19.7.77
  14. Liang, C., Li, H., Lei, M., and Du, Q. (2018). "Dongting Lake water level forecast and its relationship with the three gorges dam based on a long short-term memory network." Water, Vol. 10, No. 10, 1389. https://doi.org/10.3390/w10101389
  15. Ruxton, G.D. (2006). The unequal variance t-test is an underused alternative to Student's t-test and the Mann-Whitney U test. Behavioral Ecology, Vol. 17, No. 4, pp. 688-690. https://doi.org/10.1093/beheco/ark016
  16. Tran, Q.K., and Song, S.K. (2017). "Water level forecasting based on deep learning: A use case of Trinity River-Texas-The United States." Journal of KIISE, Vol. 44, No. 6, pp. 607-612. https://doi.org/10.5626/JOK.2017.44.6.607
  17. Tran, Q.T., Hao, L., and Trinh, Q.K. (2016). "A novel procedure to model and forecast mobile communication traffic by ARIMA/GARCH combination models." Advances in Computer Science Research, Vol. 58, pp. 29-34.
  18. Yu, J.D., and Lee, I.S. (2018). "A prediction of stock price through the big-data analysis." Journal of the Society of Korea Industrial and Systems Engineering, Vol. 41, No. 3, pp. 154-161. https://doi.org/10.11627/jkise.2018.41.3.154
  19. Zhang, C., Zhou, G., Yuan, Q., Zhuang, H., Zheng, Y., Kaplan, L., Wang, S., and Han, J. (2016). "Geoburst: real-time local event detection in geo-tagged tweet streams." Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval (SIGIR '16). ACM, Pisa Italy, pp. 513-522.