Browse > Article
http://dx.doi.org/10.3837/tiis.2019.08.007

Stage-GAN with Semantic Maps for Large-scale Image Super-resolution  

Wei, Zhensong (Institute of Information Science, Beijing Jiaotong University)
Bai, Huihui (Institute of Information Science, Beijing Jiaotong University)
Zhao, Yao (Institute of Information Science, Beijing Jiaotong University)
Publication Information
KSII Transactions on Internet and Information Systems (TIIS) / v.13, no.8, 2019 , pp. 3942-3961 More about this Journal
Abstract
Recently, the models of deep super-resolution networks can successfully learn the non-linear mapping from the low-resolution inputs to high-resolution outputs. However, for large scaling factors, this approach has difficulties in learning the relation of low-resolution to high-resolution images, which lead to the poor restoration. In this paper, we propose Stage Generative Adversarial Networks (Stage-GAN) with semantic maps for image super-resolution (SR) in large scaling factors. We decompose the task of image super-resolution into a novel semantic map based reconstruction and refinement process. In the initial stage, the semantic maps based on the given low-resolution images can be generated by Stage-0 GAN. In the next stage, the generated semantic maps from Stage-0 and corresponding low-resolution images can be used to yield high-resolution images by Stage-1 GAN. In order to remove the reconstruction artifacts and blurs for high-resolution images, Stage-2 GAN based post-processing module is proposed in the last stage, which can reconstruct high-resolution images with photo-realistic details. Extensive experiments and comparisons with other SR methods demonstrate that our proposed method can restore photo-realistic images with visual improvements. For scale factor ${\times}8$, our method performs favorably against other methods in terms of gradients similarity.
Keywords
Super-resolution; Stage-GAN; Generative adversarial networks; Semantic maps; Large scaling factors;
Citations & Related Records
연도 인용수 순위
  • Reference
1 C. Dong, C. C. Loy, K. He, and X. Tang, "Image Super-Resolution Using Deep Convolutional Networks," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 38, no. 2, pp.295-307, Feb. 2016.   DOI
2 J. Kim, JK. Lee, and KM. Lee, "Deeply-Recursive Convolutional Network for Image Super-Resolution," in Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1637-1645, 2016.
3 J. Pan, S. Liu, J. Zhang, Y. Liu, J. Ren, Z. Li, J. Tang, H. Lu, Y.-W. Tai, and Ming-Hsuan Yang, "Learning Dual Convolutional Neural Networks for Low-Level Vision," in Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp.3070-3079, 2018.
4 D. Glasner, S. Bagon and M. Irani, "Super-resolution from a single image," in Proc. of IEEE International Conference on Computer Vision, pp.349-356, 2009.
5 B. Lim, S. Son, H. Kim, S. Nah, and K. M. Lee, "Enhanced deep residual networks for image super-resolution," in Proc. of IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017.
6 Y. Zhang, Y. Tian, Y. Kong, B. Zhong, and Y. Fu, "Residual dense network for image super-resolution," in Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp.2474-2481, 2018.
7 J. Bruna, P. Sprechmann, and Y. LeCun, "Super-Resolution with Deep Convolutional Sufficient Statistics," in Proc. of IEEE International Conference on Learning Representations, 2016.
8 J. Zhao, M. Mathieu, and Y. LeCun, "Energy-based Generative Adversarial Network," in Proc. of International Conference on Learning Representations, pp. 1-17, 2017.
9 I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, and D. Warde-Farley, S. Ozair, A. Courville and Y. Bengio, "Generative adversarial nets," in Proc. of International Conference on Neural Information Processing Systems MIT Press, pp. 2672-2680, 2014.
10 M. Arjovsky, S. Chintala, and L. Bottou, "Wasserstein GAN," in Proc. of International Conference on Machine Learning, 2017.
11 J. Y. Zhu, P. Krahenbuhl, E. Shechtman, and A. A. Efros, "Generative Visual Manipulation on the Natural Image Manifold," in Proc. of European Conference on Computer Vision, pp. 597-613, 2016.
12 M. Mathieu, J. Zhao, A. Ramesh, P. Sprechmann, and Y. LeCun, "Disentangling factors of variation in deep representations using adversarial training," in Proc. of International Conference on Neural Information Processing Systems, pp. 5040-5048, 2016.
13 T. Salimans, I. Goodfellow, W. Zaremba, V. Cheung, A. Radford, and X. Chen, "Improved Techniques for Training GANs," in Proc. of International Conference on Neural Information Processing Systems, 2016.
14 C. Li, and M. Wand, "Precomputed Real-Time Texture Synthesis with Markovian Generative Adversarial Networks," in Proc. of European Conference on Computer Vision, pp.702-716, 2016.
15 S. Reed, Z. Akata, X. Yan, L. Logeswaran, B. Schiele, and H. Lee, "Generative adversarial text to image synthesis," in Proc. of International Conference on International Conference on Machine Learning, pp. 1060-1069, 2016.
16 D. Emily, C. Soumith, S. Arthur, and F. Rob, "Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks," in Proc. of International Conference on Neural Information Processing Systems, pp. 1486-1494, 2015.
17 M. Mirza, and S. Osindero, "Conditional generative adversarial nets," arXiv preprint arXiv: 1411.1784, 2014.
18 D. Pathak, P. Krahenbuhl, J. Donahue, T. Darrell, and A. A. Efros, "Context Encoders: Feature Learning by Inpainting," in Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 2536-2544, 2016.
19 H. Zhang, T. Xu, H. Li, S. Zhang, X. Wang, X. Huang, and D. Metaxas, "StackGAN: Text to Photo-Realistic Image Synthesis with Stacked Generative Adversarial Networks," in Proc. of IEEE International Conference on Computer Vision, pp.5908-5916, 2017.
20 C. Ledig, L. Theis, F. Huszar, J. Caballero, A. Cunningham, A. Acosta, A. Aitken, A. Tejani, J. Totz, Z. Wang, and W. Shi, "Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network," in Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 105-114, 2017.
21 A. Bulat, J. Yang, and G. Tzimiropoulos, "To learn image super-resolution, use a GAN to learn how to do image degradation first," in Proc. of European Conference on Computer Vision, pp. 187-202, 2018.
22 W.-S. Lai, J.-B. Huang, N. Ahuja, and M.-H. Yang, "Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution," in Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp.5835-5843, 2017.
23 D. Kingma, and J. Ba, "Adam: A method for Stochastic Optimization," arXiv preprint arXiv: 1412.6980, 2014.
24 R. Olaf, F. Philipp, and B. Thomas, "U-Net: Convolutional Networks for Biomedical Image Segmentation," in Proc. of the International Conference on Medical Image Computing and Computer Assisted Intervention, pp. 234-241, 2015.
25 J. Long, E. Shelhamer, and T. Darrell, "Fully convolutional networks for semantic segmentation," in Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp.3431-3440, 2015.
26 S. Ioffe, and C. Szegedy, "Batch normalization: accelerating deep network training by reducing internal covariate shift," in Proc. of International Conference on International Conference on Machine Learning, pp.448-456, 2015.
27 K. He, X. Zhang, S. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," in Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp.770-778, 2016.
28 R. Tylecek, and R. Sara, "Spatial Pattern Templates for Recognition of Objects with Regular Structure," in Proc. of German Conference Pattern Recognition, pp.364-374, 2013.
29 M. Irani, and S. Peleg, "Motion Analysis for Image Enhancement: Resolution, Occlusion, and Transparency," Journal of Visual Communications and Image Representation, vol. 4, no.4, pp. 324-335, Dec. 1993.   DOI
30 Z. Wang, A.C. Bovik, H. R. Sheikh, and E. P. Simoncelli, "Image quality assessment: from error visibility to structural similarity," IEEE Transactions on Image Processing, vol.13, no.4, pp.600-612, April 2004.   DOI
31 A. Liu, W. Lin, and M. Narwaria, "Image Quality Assessment Based on Gradient Similarity," IEEE Transactions on Image Processing, vol.21, no.4, pp.1500-1512, April 2012.   DOI
32 M. W. Thornton, P. M. Atkinson, and D. A. Holland, "Sub-pixel mapping of rural land cover objects from fine spatial resolution satellite sensor imagery using super-resolution pixel-swapping," International Journal of Remote Sensing, vol. 27, no. 3, pp. 473-491, Feb. 2007.   DOI
33 S. Peled, and Y. Yeshurun, "Superresolution in MRI: Application to human white matter fiber tract visualization by diffusion tensor imaging," Magnetic Resonance in Medicine Official Journal of the Society of Magnetic Resonance in Medicine, vol.45, no.1, pp. 29-35, Jan. 2001.   DOI
34 W. Shi, J. Caballero, C. Ledig, X. Zhuang, W. Bai, K. Bhatia, A. Marvao, T. Dawes, D. ORegan, and D. Rueckert, "Cardiac Image Super-Resolution with Global Correspondence using Multi-Atlas PatchMatch," Medical Image Computing and Computer Assisted Intervention, pp. 9-16, 2013.
35 L. Zhang, H. Zhang, H. Shen, and P. Li, "A super-resolution reconstruction algorithm for surveillance images," IEEE Transactions on Signal Processing, vol. 90, no. 3, pp. 848-859, Sep. 2009.
36 B. K. Gunturk, A. U. Batur, Y. Altunbasak, M. H. Hayes, and R. M. Mersereau, "Eigenface-domain super-resolution for face recognition," IEEE Transactions on Image Processing, vol.12, no. 5, pp. 597-606, May 2003.   DOI
37 R. Keys, "Cubic convolution interpolation for digital image processing," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 29, no.6, pp. 1153-1160, Dec. 1981.   DOI
38 J.-B. Huang, A. Singh, and N. Ahuja, "Single image super-resolution from transformed self-exemplars," in Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 5197-5206, 2015.
39 K. In Kim, and Y. Kwon, "Single-Image Super-Resolution Using Sparse Regression and Natural Image Prior," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.32, no. 6, pp. 1127-1133, June 2010.   DOI
40 J. Sun, Z. Xu, and H.-Y. Shum, "Image super-resolution using gradient profile prior," in Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-8, 2008.
41 F. Wang, M. Jiang, C. Qian, S. Yang, C. Li, H. Zhang, X. Wang and X. Tang, "Residual attention network for image classification," in Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp.6450-6458, 2017.
42 P. Isola, J.-Y. Zhu, T. Zhou, and A. A. Efros, "Image-to-Image Translation with Conditional Adversarial Networks," in Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 5967-5976, 2017.
43 Z. Li, J. Tang, and T. Mei, "Deep Collaborative Embedding for Social Image Understanding," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 41, no. 9, pp. 2070-2083, 2019.   DOI
44 Z. Li, and J. Tang, "Weakly supervised deep matrix factorization for social image understanding," IEEE Transactions on Image Processing, vol. 26, no. 1, pp. 276-288, Jan. 2017.   DOI