The Science and Information (SAI) Organization
  • Home
  • About Us
  • Journals
  • Conferences
  • Contact Us

Publication Links

  • IJACSA
  • Author Guidelines
  • Publication Policies
  • Outstanding Reviewers

IJACSA

  • About the Journal
  • Call for Papers
  • Editorial Board
  • Author Guidelines
  • Submit your Paper
  • Current Issue
  • Archives
  • Indexing
  • Fees/ APC
  • Reviewers
  • Apply as a Reviewer

IJARAI

  • About the Journal
  • Archives
  • Indexing & Archiving

Special Issues

  • Home
  • Archives
  • Proposals
  • ICONS_BA 2025

Computer Vision Conference (CVC)

  • Home
  • Call for Papers
  • Submit your Paper/Poster
  • Register
  • Venue
  • Contact

Computing Conference

  • Home
  • Call for Papers
  • Submit your Paper/Poster
  • Register
  • Venue
  • Contact

Intelligent Systems Conference (IntelliSys)

  • Home
  • Call for Papers
  • Submit your Paper/Poster
  • Register
  • Venue
  • Contact

Future Technologies Conference (FTC)

  • Home
  • Call for Papers
  • Submit your Paper/Poster
  • Register
  • Venue
  • Contact
  • Home
  • Call for Papers
  • Editorial Board
  • Guidelines
  • Submit
  • Current Issue
  • Archives
  • Indexing
  • Fees
  • Reviewers
  • RSS Feed

DOI: 10.14569/IJACSA.2025.01612110
PDF

VidAvDetect: A Deepfake-Inspired Vision Transformer Approach for Detecting Real Humans vs. AI-Avatars in Video Streams

Author 1: Btissam Acim
Author 2: Hamid Ouhnni
Author 3: Nassim Kharmoum
Author 4: Soumia Ziti

International Journal of Advanced Computer Science and Applications(IJACSA), Volume 16 Issue 12, 2025.

  • Abstract and Keywords
  • How to Cite this Article
  • {} BibTeX Source

Abstract: The pace of advancement in Generative AI has made it possible to realize highly realistic synthetic identities in the form of avatars for non-existent persons, thus paving the way for a paradigm beyond state-of-the-art deepfake attacks that aim to manipulate real identities in people. This rapidly emerging trend poses a challenge to digital media forensics in a most critical way, in terms of deciding whether a facial identity observed in a video clip represents a real human identity versus a fully synthetic identity created using advanced tools in the realm of Generative AI. To address this gap, we introduce VidAvDetect, a deepfake-inspired Vision Transformer approach specifically designed to discriminate real human faces from AI-generated avatars in video streams, addressing a novel identity-existence verification task. The proposed system integrates efficient frame sampling, robust facial preprocessing, patch-based embeddings, and global structural modeling through a transformer encoder, enabling the detection of subtle geometric and textural regularities characteristic of synthetic identities. Experimental results demonstrate strong performance, with training accuracy reaching 97–98%, video-level accuracy of 95.1%, a macro F1-score of 0.944, and a ROC-AUC of 0.991, confirming the model’s robustness across heterogeneous real, manipulated, and fully synthetic datasets. By moving beyond manipulation detection to focus on identity-existence verification, VidAvDetect establishes a new methodological direction for transparency, regulation, and trust in modern digital media environments where AI-generated avatars increasingly resemble real humans.

Keywords: Vision transformer; deepfake; Artificial Intelligence (AI); Generative AI; AI Avatar; video streams

Btissam Acim, Hamid Ouhnni, Nassim Kharmoum and Soumia Ziti. “VidAvDetect: A Deepfake-Inspired Vision Transformer Approach for Detecting Real Humans vs. AI-Avatars in Video Streams”. International Journal of Advanced Computer Science and Applications (IJACSA) 16.12 (2025). http://dx.doi.org/10.14569/IJACSA.2025.01612110

@article{Acim2025,
title = {VidAvDetect: A Deepfake-Inspired Vision Transformer Approach for Detecting Real Humans vs. AI-Avatars in Video Streams},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2025.01612110},
url = {http://dx.doi.org/10.14569/IJACSA.2025.01612110},
year = {2025},
publisher = {The Science and Information Organization},
volume = {16},
number = {12},
author = {Btissam Acim and Hamid Ouhnni and Nassim Kharmoum and Soumia Ziti}
}



Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.

IJACSA

Upcoming Conferences

Computer Vision Conference (CVC) 2026

21-22 May 2026

  • Amsterdam, The Netherlands

Computing Conference 2026

9-10 July 2026

  • London, United Kingdom

Artificial Intelligence Conference 2026

3-4 September 2026

  • Amsterdam, The Netherlands

Future Technologies Conference (FTC) 2026

15-16 October 2026

  • Berlin, Germany
The Science and Information (SAI) Organization
BACK TO TOP

Computer Science Journal

  • About the Journal
  • Call for Papers
  • Submit Paper
  • Indexing

Our Conferences

  • Computer Vision Conference
  • Computing Conference
  • Intelligent Systems Conference
  • Future Technologies Conference

Help & Support

  • Contact Us
  • About Us
  • Terms and Conditions
  • Privacy Policy

The Science and Information (SAI) Organization Limited is a company registered in England and Wales under Company Number 8933205.