Acknowledgement
This work was supported by Seokyeong University in 2022 and 2023.
References
- Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi, "You Only Look Once: Unified, Real-Time Object Detection," CVPR 2016, pp.779-788, 2016. DOI: 10.48550/arXiv.1506.02640
- Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton, "ImageNet Classification with Deep Convolutional Neural Networks," Communications of the ACM, Vol.60, No.6, pp.84-90, 2017. https://doi.org/10.1145/3065386
- Fuzhen Zhuang, Zhiyuan Qi, Keyu Duan, Dongbo Xi, Yongchun Zhu, Hengshu Zhu, "A Comprehensive Survey on Transfer Learning," Proceedings of the IEEE, Vol.109, No.1, pp.43-76, 2021. DOI: 10.48550/arXiv.1911.02685
- Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin, "Attention Is All You Need," NIPS 2017, pp.6000-6010, 2017. DOI: 10.48550/arXiv.1706.03762
- Anurag Arnab, Mostafa Dehghani, Georg Heigold, Chen Sun, Mario Lucic, Cordelia Schmid, "ViViT: A Video Vision Transformer," ICCV 2021, pp.6836-6846, 2021. DOI: 10.48550/arXiv.2103.15691
- Xiaolong Wang, Ross Girshick, Abhinav Gupta, Kaiming He, "Non-local Neural Networks," CVPR 2018, pp.7794-7803, 2018. DOI: 10.48550/arXiv.1711.07971
- Khurram Soomro, Amir Roshan Zamir, Mubarak Shah, "UCF101: A Dataset of 101 Human Action Classes From Videos in The Wild," CRCV-TR-12-01, 2012. DOI: 10.48550/arXiv.1212.0402
- Reza Ghoddoosian, Marnim Galib, Vassilis Athitsos, "A Realistic Dataset and Baseline Temporal Model for Early Drowsiness Detection," CVPRW 2019, pp.178-187, 2019. DOI: 10.48550/arXiv.1904.07312
- Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, Neil Houlsby, "An Image is Worth 16×16 Words: Transformers for Image Recognition at Scale," arXiv:2010.11929, 2020. DOI: 10.48550/arXiv.2010.11929
- Jimmy Lei Ba, Jamie Ryan Kiros, Geoffrey E. Hinton, "Layer Normalization," arXiv:1607.06450, 2016. DOI: 10.48550/arXiv.1607.06450
- A. Buades, B. Coll, J.-M. Morel, "A Non-Local Algorithm for Image Denoising," CVPR 2005, Vol.2, pp.60-65, 2005. DOI: 10.1109/CVPR.2005.38
- https://newindow.tistory.com/254