Acknowledgement
이 논문은 2021학년도 조선대학교 학술연구비의 지원과 연구개발특구진흥재단의 '기술사업화 협업 플랫폼' 사업으로 수행되었습니다.(과제명: 인공지능 산업 육성 및 기술사업화를 위한 지능형 디지털 콘텐츠 제작 기술 개발 및 플랫폼 구축 사업, 과제고유번호: 1711177250)
References
- 이승진, 왕덕원, "인공지능 웹툰 작가의 가능성과 성공에 대한 연구: 혁신 확산 이론을 중심으로," 애니메이션연구, pp 231-254, 2023년 6월
- Dhariwal, P. & Nichol, A., "Diffusion models beat gans on image synthesis," Advances in neural information processing systems 34, pp 8780-8794, 2021.
- 유경호, 노주현, 홍택은, 김형주, 김판구, "Self-Attention을 적용한 문장 임베딩으로부터 이미지 생성 연구," 스마트미디어저널, 제10권, 제1호, 63-69쪽, 2021년 03월
- 유경호, 양희덕, "Generative Adversarial Network 를 이용한 카툰 원화의 라인 드로잉 추출," 스마트미디어저널, 제10권, 제2호, 30-37쪽, 2021년 06월
- 김광진, 이칠우, "Bi-LSTM 모델을 이용한 음악 생성 시계열 예측," 스마트미디어저널, 제11권, 제10호, 65-75쪽, 2022년 11월 https://doi.org/10.30693/SMJ.2022.11.10.65
- Ramesh, A., Dhariwal, P., Nichol, A., Chu, C. & Chen, M., "Hierarchical text-conditional image generation with clip latents," arxiv.org/abs/2204.06125, 2022.
- Rombach, R., Blattmann, A., Lorenz, D., Esser, P. & Ommer, B., "High-resolution image synthesis with latent diffusion models," Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 10684-10695, 2022.
- Ruiz, N., Li, Y., Jampani, V., Pritch, Y., Rubinstein, M. & Aberman, K., "Dreambooth: Fine tuning text-to-image diffusion models for subject-driven generation," Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 22500-22510, 2023.
- Hu, E. J., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., ... & Chen, W., "Lora: Low-rank adaptation of large language models," arXiv:2106.09685, 2021.
- 웹툰 AI 페인터, https://ai.webtoons.com/ko/painter (accessed Jul., 28, 2023).
- Kang, S., Choo, J. & Chang, J., "Consistent comic colorization with pixel-wise background classification," In Proceedings of the NIPS, Vol. 17, 2017.
- Xu, X., Xie, M., Miao, P., Qu, W., Xiao, W., Zhang, H., ... & Wong, T. T., "Perceptual-aware sketch simplification based on integrated VGG layers," IEEE transactions on visualization and computer graphics, Vol. 27, Issue 1, pp. 178-189, Jan. 2021. https://doi.org/10.1109/TVCG.2019.2930512
- Yu, K., Kim, H., Kim, J., Chun, C., & Kim, P., "A Study on Generating Webtoons Using Multilingual Text-to-Image Models," Applied Sciences, 13(12), 7278, 2023.
- Ho, J., Jain, A. & Abbeel, P., "Denoising diffusion probabilistic models," Advances in neural information processing systems33, pp 6840-6851, 2020.
- Kim, G., Kwon, T. & Ye, J. C., "Diffusionclip: Text-guided diffusion models for robust image manipulation," Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2426-2435, 2022.
- Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., ... & Sutskever, "I. Learning transferable visual models from natural language supervision," International conference on machine learning, pp. 8748-8763, 2021.
- Xu, T., Zhang, P., Huang, Q., Zhang, H., Gan, Z., Huang, X. & He, X., "Attngan: Fine-grained text to image generation with attentional generative adversarial networks," Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1316-1324, Salt Lake City, USA, Jun. 2018.
- Zhang, H., Xu, T., Li, H., Zhang, S., Wang, X., Huang, X. & Metaxas, D. N., "Stackgan: Text to photo-realistic image synthesis with stacked generative adversarial networks," 2017 IEEE International Conference on Computer Vision (ICCV), pp. 5907-5915, Venice, Italy, Oct. 2017.
- Zhu, M., Pan, P., Chen, W. & Yang, Y., "Dm-gan: Dynamic memory generative adversarial networks for text-to-image synthesis," Proceedings of the IEEE/CVF conference on computer vision and pattern recognitio, pp. 5802-5810, 2019.
- PEFT, https://github.com/huggingface/peft/tree/main (accessed Jul., 28, 2023).
- MSCOCO, https://cocodataset.org/#home, (accessed Jul., 28, 2023).
- Chen, Y., Lai, Y. K. & Liu, Y. J., "Cartoongan: Generative adversarial networks for photo cartoonization," 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9465-9474, Salt Lake City, USA, Jun. 2018.