Facial Manipulation Detection with Transformer-based Discriminative Features Learning Vision

Van-Nhan Tran;Minsu Kim;Philjoo Choi;Suk-Hwan Lee;Hoanh-Su Le;Ki-Ryong Kwon;

doi:10.3745/PKIPS.y2023m11a.540

한국정보처리학회:학술대회논문집 (Proceedings of the Korea Information Processing Society Conference)

한국정보처리학회 2023년도 추계학술발표대회
/
Pages.540-542
/
2023
/
2005-0011(pISSN)
/
2671-7298(eISSN)

한국정보처리학회 (Korea Information Processing Society)

DOI QR Code

트랜스포머 기반 판별 특징 학습 비전을 통한 얼굴 조작 감지

Facial Manipulation Detection with Transformer-based Discriminative Features Learning Vision

;
김민수 (부경대학교 인공지능융합학과) ;
최필주 (부경대학교 인공지능융합학과) ;
이석환 (동아대학교 컴퓨터공학부) ;
;
권기룡 (부경대학교 인공지능융합학과)

Van-Nhan Tran (Dept. of Artificial Intelligence Convergence, Pukyong National University) ;
Minsu Kim (Dept. of Artificial Intelligence Convergence, Pukyong National University) ;
Philjoo Choi (Dept. of Artificial Intelligence Convergence, Pukyong National University) ;
Suk-Hwan Lee (Division of Computer and AI Engineering, Dong-A University) ;
Hoanh-Su Le (Faculty of Information Systems, University of Economics and Law, Ho Chi Minh City and Vietnam National University) ;
Ki-Ryong Kwon (Dept. of Artificial Intelligence Convergence, Pukyong National University)

발행 : 2023.11.02

https://doi.org/10.3745/PKIPS.y2023m11a.540 인용 PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

Due to the serious issues posed by facial manipulation technologies, many researchers are becoming increasingly interested in the identification of face forgeries. The majority of existing face forgery detection methods leverage powerful data adaptation ability of neural network to derive distinguishing traits. These deep learning-based detection methods frequently treat the detection of fake faces as a binary classification problem and employ softmax loss to track CNN network training. However, acquired traits observed by softmax loss are insufficient for discriminating. To get over these limitations, in this study, we introduce a novel discriminative feature learning based on Vision Transformer architecture. Additionally, a separation-center loss is created to simply compress intra-class variation of original faces while enhancing inter-class differences in the embedding space.

키워드

과제정보

This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2023-2020-0-01797) supervised by the IITP (Institute for Information & Communications Technology Planning & Evaluation) and the MSIT (Ministry of Science and ICT), Korea, under the ICT Consilience Creative program (IITP-2023-2016-0-00318) supervised by the IITP (Institute for Information & communications Technology Planning & Evaluation).