DOI QR코드

DOI QR Code

A Novel Road Segmentation Technique from Orthophotos Using Deep Convolutional Autoencoders

  • Received : 2017.07.18
  • Accepted : 2017.08.22
  • Published : 2017.08.31

Abstract

This paper presents a deep learning-based road segmentation framework from very high-resolution orthophotos. The proposed method uses Deep Convolutional Autoencoders for end-to-end mapping of orthophotos to road segmentations. In addition, a set of post-processing steps were applied to make the model outputs GIS-ready data that could be useful for various applications. The optimization of the model's parameters is explained which was conducted via grid search method. The model was trained and implemented in Keras, a high-level deep learning framework run on top of Tensorflow. The results show that the proposed model with the best-obtained hyperparameters could segment road objects from orthophotos at an average accuracy of 88.5%. The results of optimization revealed that the best optimization algorithm and activation function for the studied task are Stochastic Gradient Descent (SGD) and Exponential Linear Unit (ELU), respectively. In addition, the best numbers of convolutional filters were found to be 8 for the first and second layers and 128 for the third and fourth layers of the proposed network architecture. Moreover, the analysis on the time complexity of the model showed that the model could be trained in 4 hours and 50 minutes on 1024 high-resolution images of size $106{\times}106pixels$, and segment road objects from similar size and resolution images in around 14 minutes. The results show that the deep learning models such as Convolutional Autoencoders could be a best alternative to traditional machine learning models for road segmentation from aerial photographs.

Keywords

References

  1. Bergstra, J. and Y. Bengio, 2012. Random search for hyper-parameter optimization, Journal of Machine Learning Research, 13: 281-305
  2. Colak, S., A. Lima, and M.C. Gonzalez, 2016. Understanding congested travel in urban areas, Nature communications, 7
  3. Dahl, G. E., D. Yu, L. Deng, and A. Acero, 2012. Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition. IEEE Transactions on audio, speech, and language processing, 20(1): 30-42 https://doi.org/10.1109/TASL.2011.2134090
  4. Devkota, K. C., A. D. Regmi, H. R. Pourghasemi, K. Yoshida, B. Pradhan, I. C. Ryu, M. R. Dhital, and O.F. Althuwaynee, 2013. Landslide susceptibility mapping using certainty factor, index of entropy and logistic regression models in GIS and their comparison at Mugling- Narayanghat road section in Nepal Himalaya, Natural hazards, 65(1): 135-165 https://doi.org/10.1007/s11069-012-0347-6
  5. Jones, D. R., 2001. A taxonomy of global optimization methods based on response surfaces, Journal of global optimization, 21(4): 345-383 https://doi.org/10.1023/A:1012771025575
  6. Kaddah, W., Y. Ouerhani, A. Alfalou, M. Desthieux, C. Brosseau, and C. Gutierrez, 2016. Roadmarking features extraction using theVIAPIX(R) system, Optics Communications,371: 117-127 https://doi.org/10.1016/j.optcom.2016.03.065
  7. Krizhevsky, A., I. Sutskever, and G.E. Hinton, 2012. Imagenet classification with deep convolutional neural networks, Advances in neural information processing systems, 1097-1105
  8. Kumar, P., 2012. Road features extraction using terrestrial mobile laser scanning system, National University of Ireland Maynooth, Ireland.
  9. Kumar, P., C. P. McElhinney, P. Lewis, and T. McCarthy, 2014. Automated road markings extraction from mobile laser scanning data, International Journal of Applied Earth Observation and Geoinformation, 32: 125-137 https://doi.org/10.1016/j.jag.2014.03.023
  10. Kussul, N., M. Lavreniuk, S. Skakun, and A. Shelestov, 2017. Deep Learning Classification of Land Cover and Crop Types Using Remote Sensing Data, IEEE Geoscience and Remote Sensing Letters, 14(5): 778-782 https://doi.org/10.1109/LGRS.2017.2681128
  11. Latinopoulos, D. and K. Kechagia, 2015. A GIS-based multi-criteria evaluation for wind farm site selection, A regional scale application in Greece. Renewable Energy, 78: 550-560 https://doi.org/10.1016/j.renene.2015.01.041
  12. LeCun, Y., Y. Bengio, and G. Hinton, 2015. Deep learning, Nature, 521(7553): 436-444 https://doi.org/10.1038/nature14539
  13. Li, M., A. Stein, W. Bijker, and Q. Zhan, 2016. Regionbased urban road extraction from VHR satellite images using binary partition tree, International Journal of Applied Earth Observation and Geoinformation, 44: 217-225 https://doi.org/10.1016/j.jag.2015.09.005
  14. Majumder, N., S. Poria, A. Gelbukh, and E. Cambria, 2017. Deep Learning-Based Document Modeling for Personality Detection from Text, IEEE Intelligent Systems, 32(2): 74-79 https://doi.org/10.1109/MIS.2017.23
  15. Mnih, V. and G.E. Hinton. 2010. Learning to detect roads in high-resolution aerial images, Proc. of 11th European Conference on Computer Vision, Grete, Greece, Sep. 5-11, pp. 210-223
  16. Mockus, J., 1975. On Bayesian methods for seeking the extremum, Proc. of Optimization Techniques IFIP Technical Conference Novosibirsk, Berlin Heidelberg, Germany, Jul. 1-7, pp. 400-404
  17. Nareyek, A., 2003. Choosing search heuristics by nonstationary reinforcement learning, Applied Optimization, 86: 523-544 https://doi.org/10.1007/978-1-4757-4137-7_25
  18. Osborne, B. P., V. J. Osborne, and M. L. Kruger, 2012. Comparison of satellite surveying to traditional surveying methods for the resources industry, Journal of the British Interplanetary Society, 65(2): 98-104
  19. Raschka, S., 2015. Python machine learning, Packt Publishing Ltd., U.K
  20. Rathore, M. M., A. Ahmad, A. Paul, and S. Rho, 2016. Urban planning and building smart cities based on the internet of things using big data analytics, Computer Networks, 101: 63-80 https://doi.org/10.1016/j.comnet.2015.12.023
  21. Saito, S., T. Yamashita, and Y. Aoki, 2016. Multiple object extraction from aerial imagery with convolutional neural networks, Electronic Imaging, 2016(10): 1-9
  22. Sameen, M. I. and B. Pradhan, 2016a. Assessment of the effects of expressway geometric design features on the frequency of accident crash rates using high-resolution laser scanning data and GIS, Geomatics, Natural Hazards and Risk, 1-15
  23. Sameen, M. I. and B. Pradhan, 2016b. A Simplified Semi-Automatic Technique for Highway Extraction from High-Resolution Airborne LiDAR Data and Orthophotos, Journal of the Indian Society of Remote Sensing, 45(3): 1-11
  24. Sameen, M. I. and B. Pradhan, 2017. Severity Prediction of Traffic Accidents with Recurrent Neural Networks, Applied Sciences, 7(6): 476
  25. Sameen, M. I. and B. Pradhan, 2017a. Severity Prediction of Traffic Accidents with Recurrent Neural Networks, Applied Sciences, 7(6): 476 https://doi.org/10.3390/app7060476
  26. Sameen, M. I. and B. Pradhan, 2017b. A Two-Stage Optimization Strategy for Fuzzy Object-Based Analysis Using Airborne LiDAR and High- Resolution Orthophotos for Urban Road Extraction, Journal of Sensors.
  27. Saxe, A., P.W. Koh, Z. Chen, M. Bhand, B. Suresh, and A.Y Ng, 2011. On random weights and unsupervised feature learning, Proc. of the 28th international conference on machine learning (ICML-11), Bellevue, WA, Jun. 28-Jul. 2, pp. 1089-1096
  28. Shi, W., Z. Miao, and J. Debayle, 2014. An integrated method for urban main-road centerline extraction from optical remotely sensed imagery, IEEE Transactions on Geoscience and Remote Sensing, 52(6): 3359-3372 https://doi.org/10.1109/TGRS.2013.2272593
  29. Sohail, M., D.A.C. Maunder, and S. Cavill, 2006. Effective regulation for sustainable public transport in developing countries, Transport Policy, 13(3): 177-190 https://doi.org/10.1016/j.tranpol.2005.11.004
  30. Unsalan, C. and B. Sirmacek, 2012. Road network detection using probabilistic and graph theoretical methods, IEEE Transactions on Geoscience and Remote Sensing, 50(11): 4441-4453 https://doi.org/10.1109/TGRS.2012.2190078
  31. Vitor, G.B., D.A. Lima, A.C. Victorino, and J.V. Ferreira, 2013. A 2D/3D vision based approach applied to road detection in urban environments, Proc. of 2013 IEEE Intelligent Vehicles Symposium (IV), Gold Coast City, Australia, Jun. 23-26, pp. 952-957.
  32. Wang, J., J. Song, M. Chen, and Z. Yang, 2015. Road network extraction: A neural-dynamic framework based on deep learning and a finite state machine, International Journal of Remote Sensing, 36(12): 3144-3169 https://doi.org/10.1080/01431161.2015.1054049
  33. Wegner, J. D., J.A. Montoya-Zegarra, and K. Schindler, 2013. A higher-order CRF model for road network extraction, Proc. of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, Jun. 23-28, pp. 1698-1705
  34. Weihs, C., K. Luebke, and I. Czogiel, 2006. Response surface methodology for optimizing hyper parameters, Universität Dortmund, Dortmund, Germany
  35. Xie, X., K.B.Y. Wong, H. Aghajan, P. Veelaert, and W. Philips, 2016. Road network inference through multiple track alignment, Transportation Research Part C: Emerging Technologies, 72: 93-108 https://doi.org/10.1016/j.trc.2016.09.010
  36. Yang, B., L. Fang, Q. Li, and J. Li, 2012. Automated extraction of road markings from mobile LiDAR point clouds, Photogrammetric Engineering & Remote Sensing, 78(4): 331-338 https://doi.org/10.14358/PERS.78.4.331
  37. Zeyer, A., P. Doetsch, P. Voigtlaender, R. Schlüter, and H. Ney, 2017. A comprehensive study of deep bidirectional LSTM RNNs for acoustic modeling in speech recognition, Proc. of Acoustics, Speech and Signal Processing (ICASSP), 2017 IEEE International Conference, New Orleans, LA, Mar. 5-9, pp. 2462-2466
  38. Zhan, C., 1993. A hybrid line thinning approach, Proc. of Autocarto-conference-, ASPRS American society for photogrammetry and remote sensing, Bethesda, MD, p. 396

Cited by

  1. A hybrid model using machine learning methods and GIS for potential rockfall source identification from airborne laser scanning data vol.15, pp.9, 2018, https://doi.org/10.1007/s10346-018-0990-4