Browse > Article
http://dx.doi.org/10.56977/jicce.2022.20.4.288

Automatic Generation of Video Metadata for the Super-personalized Recommendation of Media  

Yong, Sung Jung (Department of Computer Science and Engineering, Korea University of Technology and Education)
Park, Hyo Gyeong (Department of Computer Science and Engineering, Korea University of Technology and Education)
You, Yeon Hwi (Department of Computer Science and Engineering, Korea University of Technology and Education)
Moon, Il-Young (Department of Computer Science and Engineering, Korea University of Technology and Education)
Abstract
The media content market has been growing, as various types of content are being mass-produced owing to the recent proliferation of the Internet and digital media. In addition, platforms that provide personalized services for content consumption are emerging and competing with each other to recommend personalized content. Existing platforms use a method in which a user directly inputs video metadata. Consequently, significant amounts of time and cost are consumed in processing large amounts of data. In this study, keyframes and audio spectra based on the YCbCr color model of a movie trailer were extracted for the automatic generation of metadata. The extracted audio spectra and image keyframes were used as learning data for genre recognition in deep learning. Deep learning was implemented to determine genres among the video metadata, and suggestions for utilization were proposed. A system that can automatically generate metadata established through the results of this study will be helpful for studying recommendation systems for media super-personalization.
Keywords
AI; Metadata; OTT; Keyframe; YCbCr;
Citations & Related Records
연도 인용수 순위
  • Reference
1 C. A. Gomez-Uribe and N. Hunt, "The netflix recommender system: Algorithms, business value, and innovation," ACM Transactions on Management Information Systems, vol. 6, no. 13, pp. 1-19, Jan. 2016. DOI: 10.1145/2843948.   DOI
2 P. Covington, J. Adams, and E. Sargin, "Deep neural networks for YouTube recommendations," in Proceedings of the 10th ACM Conference on Recommender Systems (RecSys '16), New York: NY, USA, pp. 191-198, 2016. DOI: 10.1145/ 2959100.2959190.   DOI
3 J. Jung, "The correlation of bach music and the scene as seen in films," M. S. thesis, The Graduate School of Ewha Womans University, Seoul, Korea, pp. 1, 2007.
4 Y. Tan, J. Qin, X. Xiang, W. Ma, W. Pan, and N. Xiong, "A robust watermarking scheme in YCbCr color space based on channel coding," IEEE Access, vol. 7, pp. 1-1, Jan. 2019. DOI: 10.1109/ACCESS.2019.2896304.   DOI
5 K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition," arXiv preprint, arXiv: 1409.1556. Sep. 2014.
6 Z. Wang, P. Song, Q. Tang, and Y. Rui, "A Non-Stationary Signal Preprocessing Method based on STFT for CW Radio Doppler Signal," in Proceedings of the 2020 4th International Conference on Vision, Image and Signal Processing, Bangkok, Thailand, pp. 1-5, 2020. DOI: 10.1145/3448823.3448845.   DOI
7 K. Liu, L. Gong, N. Tian, F. Gong, and Q. Wang, "Feature extraction method of power grid load data based on STFT-CRNN," in Proceedings of the 6th International Conference on Big Data and Computing (ICBDC'21), Shenzhen, Cina, pp. 55-60, 2021. DOI: 10.1145/3469968.3469978.   DOI
8 K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas: NV, USA, pp. 770-778, 2016. DOI: 10.1109/CVPR.2016.90.   DOI