Undercover Deepfakes: Detecting Fake Segments in Videos

Accepted at DFAD Workshop in ICCV 2023: [arXiv | pdf]

Evaluate on Temporal Deepfakes

Download the trained timeseries transformer model from here.
Also download the preprocessed data from here. Please note, this data corresponds to the preprocessed ViT-embeddings, not raw images.
Run the script evaluate.py like below:

python evaluate.py --model </path/to>/model/temporal_dfd.h5 --data </path/to>/embeddings/subtle/ --variation subtle

There are three types of embeddings: subtle for videos with carefully selected fake segments, random (TBA) for videos with randomly selected fake segments, and video (TBA) for videos that have same type of frames throughout i.e. they do not have a mix of real and fake frames.

Use the argument --data and --variation accordingly i.e. if you change --data to the random directory, also change --variation to random.

ViT Model Weights

Fine-tuned ViT model weights can be found here. ViT-embeddings for FF+ dataset can be downloaded here.

We also thank the authors of the SSF for providing their source code.

Abstract

The recent renaissance in generative models, driven primarily by the advent of diffusion models and iterative improvement in GAN methods, has enabled many creative applications. However, each advancement is also accompanied by a rise in the potential for misuse. In the arena of the deepfake generation, this is a key societal issue. In particular, the ability to modify segments of videos using such generative techniques creates a new paradigm of deepfakes which are mostly real videos altered slightly to distort the truth. This paradigm has been under-explored by the current deepfake detection methods in the academic literature. In this paper, we present a deepfake detection method that can address this issue by performing deepfake prediction at the frame and video levels. To facilitate testing our method, we prepared a new benchmark dataset where videos have both real and fake frame sequences with very subtle transitions. We provide a benchmark on the proposed dataset with our detection method which utilizes the Vision Transformer based on Scaling and Shifting to learn spatial features, and a Timeseries Transformer to learn temporal features of the videos to help facilitate the interpretation of possible deepfakes. Extensive experiments on a variety of deepfake generation methods show excellent results by the proposed method on temporal segmentation and classical video-level predictions as well. In particular, the paradigm we address will form a powerful tool for the moderation of deepfakes, where human oversight can be better targeted to the parts of videos suspected of being deepfakes.

Temporal Dataset

Temporal dataset is prepared based on the FaceForensics++ (FF++) dataset. We publish the start and end frame number of the fake segment(s) in the CSV files in temporal_dataset folder.

Manually selected fake segments where transition from real to fake frames and vice versa are very subtle.

Model Architecture

We leverage Parameter Efficient Fine-Tuning (PEFT) to build an efficient transformer-based architecture that achieves results comparable to and outperforming SOTA methods.

Results

Temporal Segmentation

On the subset with subtle transitions between real and fake frames.

On the subset with random (not subtle) transitions between real and fake frames.

Video Level Classification

Cite this paper

@InProceedings{Saha_2023_ICCV,
author    = {Saha, Sanjay and Perera, Rashindrie and Seneviratne, Sachith and Malepathirana, Tamasha and Rasnayaka, Sanka and Geethika, Deshani and Sim, Terence and Halgamuge, Saman},
title     = {Undercover Deepfakes: Detecting Fake Segments in Videos},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
month     = {October},
year      = {2023},
pages     = {415-425}

}

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
assets		assets
temporal_dataset		temporal_dataset
.gitignore		.gitignore
README.md		README.md
evaluate.py		evaluate.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Undercover Deepfakes: Detecting Fake Segments in Videos

Evaluate on Temporal Deepfakes

ViT Model Weights

Abstract

Temporal Dataset

Model Architecture

Results

Temporal Segmentation

Video Level Classification

Cite this paper

About

Releases

Packages

Contributors 3

Languages

rgb91/temporal-deepfake-segmentation

Folders and files

Latest commit

History

Repository files navigation

Undercover Deepfakes: Detecting Fake Segments in Videos

Evaluate on Temporal Deepfakes

ViT Model Weights

Abstract

Temporal Dataset

Model Architecture

Results

Temporal Segmentation

Video Level Classification

Cite this paper

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages