DOI QR코드

DOI QR Code

Text-to-Face Generation Using Multi-Scale Gradients Conditional Generative Adversarial Networks

다중 스케일 그라디언트 조건부 적대적 생성 신경망을 활용한 문장 기반 영상 생성 기법

  • Bui, Nguyen P. (Dept. of Superintelligence, Sungkyunkwan University) ;
  • Le, Duc-Tai (College of Computing. Sungkyunkwan University) ;
  • Choo, Hyunseung (College of Computing. Sungkyunkwan University)
  • ;
  • ;
  • 추현승 (성균관대학교 소프트웨어대학)
  • Published : 2021.11.04

Abstract

While Generative Adversarial Networks (GANs) have seen huge success in image synthesis tasks, synthesizing high-quality images from text descriptions is a challenging problem in computer vision. This paper proposes a method named Text-to-Face Generation Using Multi-Scale Gradients for Conditional Generative Adversarial Networks (T2F-MSGGANs) that combines GANs and a natural language processing model to create human faces has features found in the input text. The proposed method addresses two problems of GANs: model collapse and training instability by investigating how gradients at multiple scales can be used to generate high-resolution images. We show that T2F-MSGGANs converge stably and generate good-quality images.

Keywords

Acknowledgement

This research was supported by Korea government(MSIT, IITP), under the ICT Creative Consilience program(IITP-2021-2020-0-01821) and National Research Foundation of Korea (NRF-2020R1A2C2008447, Deep Adversarial Learning Driven Virtual Edge: Self-supervised virtual edge mobility, resource placement and allocation)" This research was supported by the MSIT(Ministry of Science and ICT), Korea, under the Grand Information Technology Research Center support program(IITP-2021-2015-0-00742) supervised by the IITP(Institute for Information & communications Technology Planning & Evaluation)".