A Study on Image Generation from Sentence Embedding Applying Self-Attention
![]() |
Yu, Kyungho
(조선대학교 컴퓨터공학과 대학원)
No, Juhyeon (조선대학교 컴퓨터공학과 대학원) Hong, Taekeun (조선대학교 컴퓨터공학과 대학원) Kim, Hyeong-Ju (조선대학교 컴퓨터공학과 대학원) Kim, Pankoo (조선대학교 컴퓨터공학과) |
1 | Reed, S., Akata, Z., Yan, X., Logeswaran, L., Schiele, B. & Lee, H., "Generative adversarial text to image synthesis," In International Conference on Machine Learning, pp. 1060-1069, 2016. |
2 | Zhang, H., Xu, T., Li, H., Zhang, S., Wang, X., Huang, X. & Metaxas, D. N., "Stackgan: Text to photo-realistic image synthesis with stacked generative adversarial networks," In Proceedings of the IEEE international conference on computer vision, pp. 5907-5915, 2017. |
3 | Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J. & Wojna, Z., "Rethinking the inception architecture for computer vision," In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2818-2826, 2016. |
4 | Xu, T., Zhang, P., Huang, Q., Zhang, H., Gan, Z., Huang, X. & He, X, "Attngan: Fine-grained text to image generation with attentional generative adversarial networks," In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1316-1324, 2018. |
5 | Zhu, J. Y., Park, T., Isola, P. & Efros, A. A, "Unpaired image-to-image translation using cycle-consistent adversarial networks," In Proceedings of the IEEE international conference on computer vision, pp. 2223-2232, 2017. |
6 | Qiao, T., Zhang, J., Xu, D. & Tao, D, "Mirrorgan: Learning text-to-image generation by redescription," In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1505-1514, 2019. |
7 | Mikolov, T., Chen, K., Corrado, G. & Dean, J, "Efficient estimation of word representations in vector space," arXiv preprint arXiv:1301.3781, 2013. |
8 | 임명진, 박원호, 신주현, "Word2Vec과 LSTM을 활용한 이별 가사 감정 분류," 스마트미디어저널, 제9권 제3호, 90-97쪽, 2020년 9월 DOI |
9 | Devlin, J., Chang, M. W., Lee, K. & Toutanova, K, "Bert: Pre-training of deep bidirectional transformers for language understanding," arXiv preprint arXiv:1810.04805, 2018. |
10 | Vaswani, Ashish, et al. "Attention is all you need," Advances in neural information processing systems, pp. 5998-6008. 2017. |
11 | Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D. & Stoyanov, V, "Roberta: A robustly optimized bert pretraining approach," arXiv preprint arXiv:1907.11692, 2019. |
12 | Yang, Z., Dai, Z., Yang, Y., Carbonell, J., Salakhutdinov, R. & Le, Q. V, "Xlnet: Generalized autoregressive pretraining for language understanding," arXiv preprint arXiv:1906.08237, 2019. |
13 | Haseeb Nazki, Jaehwan Lee, Sook Yoon, Dong Sun Park, "Image-to-Image Translation with GAN for Synthetic Data Augmentation in Plant Disease Datasets," 스마트미디어저널, 제8권, 제2호, 46-57쪽, 2019년 06월 |
14 | 이태석, 강승식, "LSTM 기반의 sequence-to-sequence 모델을 이용한 한글 자동 띄어쓰기," 스마트미디어저널, 제7권, 제4호, 17-23쪽, 2018년 DOI |
![]() |