A Novel Road Segmentation Technique from Orthophotos Using Deep Convolutional Autoencoders

Sameen, Maher Ibrahim; Pradhan, Biswajeet

doi:10.7780/kjrs.2017.33.4.8

Korean Journal of Remote Sensing (대한원격탐사학회지)

Volume 33, Issue 4 / Pages 423-436 / 2017 / 1225-6161 (pISSN) / 2287-9307 (eISSN)

Korean Society of Remote Sensing (대한원격탐사학회)

Sameen, Maher Ibrahim (Department of Civil Engineering, University Putra Malaysia) ;
Pradhan, Biswajeet (Department of Civil Engineering, University Putra Malaysia)

Received : 2017.07.18
Accepted : 2017.08.22
Published : 2017.08.31

https://doi.org/10.7780/kjrs.2017.33.4.8

Abstract

This paper presents a deep learning-based framework for road segmentation from very high-resolution orthophotos. The proposed method uses deep convolutional autoencoders for end-to-end mapping of orthophotos to road segmentations. In addition, a set of post-processing steps was applied to turn the model outputs into GIS-ready data that could be useful for various applications. The model's hyperparameters were optimized via grid search, and the optimization procedure is explained. The model was implemented and trained in Keras, a high-level deep learning framework that runs on top of TensorFlow. The results show that the proposed model, with the best hyperparameters found, could segment road objects from orthophotos with an average accuracy of 88.5%. The optimization revealed that the best optimization algorithm and activation function for the studied task are Stochastic Gradient Descent (SGD) and Exponential Linear Unit (ELU), respectively. In addition, the best numbers of convolutional filters were found to be 8 for the first and second layers and 128 for the third and fourth layers of the proposed network architecture. Moreover, the time-complexity analysis showed that the model could be trained in 4 hours and 50 minutes on 1024 high-resolution images of size $106 \times 106$ pixels, and could segment road objects from images of similar size and resolution in around 14 minutes. The results suggest that deep learning models such as convolutional autoencoders could be a strong alternative to traditional machine learning models for road segmentation from aerial photographs.
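The grid search over hyperparameters mentioned in the abstract can be illustrated with a minimal Python sketch. Everything below is an assumption made for illustration: the candidate values in `SEARCH_SPACE`, the names `grid_search` and `toy_score`, and the scoring rule are hypothetical and are not taken from the paper. In particular, `toy_score` is only a placeholder for "train the autoencoder with this configuration and return its validation accuracy"; it simply rewards agreement with the configuration the abstract reports as best, so that the example has a deterministic answer.

```python
from itertools import product

# Hypothetical search space. The abstract names only SGD, ELU, 8 and 128;
# the alternative candidates are assumptions added for illustration.
SEARCH_SPACE = {
    "optimizer": ["sgd", "adam"],
    "activation": ["relu", "elu"],
    "filters_12": [8, 16],    # filters in the first and second layers
    "filters_34": [64, 128],  # filters in the third and fourth layers
}


def grid_search(space, score_fn):
    """Evaluate every combination in the space and return the best one."""
    names = list(space)
    best_cfg, best_score = None, float("-inf")
    for values in product(*(space[name] for name in names)):
        cfg = dict(zip(names, values))
        score = score_fn(cfg)
        if score > best_score:
            best_cfg, best_score = cfg, score
    return best_cfg, best_score


def toy_score(cfg):
    """Placeholder for training a model and measuring validation accuracy.

    Not the authors' evaluation: it returns the fraction of settings that
    match the configuration reported as best in the abstract.
    """
    reported = {
        "optimizer": "sgd",
        "activation": "elu",
        "filters_12": 8,
        "filters_34": 128,
    }
    return sum(cfg[key] == value for key, value in reported.items()) / len(reported)


# Number of configurations an exhaustive grid search has to train.
n_combos = len(list(product(*SEARCH_SPACE.values())))

best_cfg, best_score = grid_search(SEARCH_SPACE, toy_score)
print(n_combos, best_cfg, best_score)
```

In a real run, `toy_score` would be replaced by a function that builds and trains the network for each configuration. The cost of grid search grows multiplicatively with the number of candidates per hyperparameter, which matters when a single training run takes hours.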

Cited by

A hybrid model using machine learning methods and GIS for potential rockfall source identification from airborne laser scanning data vol.15, pp.9, 2018, https://doi.org/10.1007/s10346-018-0990-4
