code for Image Manipulation Detection by Multi-View Multi-Scale Supervision

Last update: Dec 30, 2022

Overview

MVSS-Net

Code and models for ICCV 2021 paper: Image Manipulation Detection by Multi-View Multi-Scale Supervision

Update

22.02.17, Pretrained model for the Real-World Image Forgery Localization Challenge

To Be Done.

21.12.17, Something new: MVSS-Net++

We now have an improved version of MVSS-Net, denoted as MVSS-Net++. Check here.

Environment

Ubuntu 16.04.6 LTS
Python 3.6
CUDA 10.1 + cuDNN 7.6.3

Requirements

Install nvidia-apex and move it into the current directory.
pip install -r requirements.txt

Usage

Dataset

An example of the dataset index file is given as data/CASIAv1plus.txt, where each line contains:

img_path mask_path label

The label is 0 for an authentic image and 1 for a manipulated image.
For an authentic image, the mask_path is "None".
For wild images without ground-truth masks, each line of the index must contain at least the img_path.
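The index format described above can be read with a few lines of Python. This is a minimal sketch, not the repository's own loader: it assumes fields are separated by whitespace and that paths contain no spaces, and the names `IndexEntry`, `parse_index_line`, and `parse_index` are hypothetical.

```python
from typing import List, NamedTuple, Optional


class IndexEntry(NamedTuple):
    img_path: str
    mask_path: Optional[str]  # None for authentic or unlabeled (wild) images
    label: Optional[int]      # 0 = authentic, 1 = manipulated, None if absent


def parse_index_line(line: str) -> IndexEntry:
    """Parse one line of the form: img_path [mask_path [label]]."""
    # Assumption: fields are whitespace-separated and paths contain no spaces.
    fields = line.split()
    if not fields:
        raise ValueError("empty index line")
    img_path = fields[0]
    mask_path = None
    label = None
    # The literal string "None" marks an image without a mask.
    if len(fields) >= 2 and fields[1] != "None":
        mask_path = fields[1]
    if len(fields) >= 3:
        label = int(fields[2])
    return IndexEntry(img_path, mask_path, label)


def parse_index(text: str) -> List[IndexEntry]:
    """Parse the full contents of an index file, skipping blank lines."""
    return [parse_index_line(l) for l in text.splitlines() if l.strip()]
```

For example, `parse_index(open("data/CASIAv1plus.txt").read())` would return one entry per image; a line holding only an img_path yields an entry whose mask_path and label are both None.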

Training sets

DEFACTO-84k
CASIAv2 / Edge-Mask

Test sets

DEFACTO-12k
Columbia
COVER
NIST16
CASIAv1plus: Note that some of the authentic images in CASIAv1 also appear in CASIAv2. By replacing those images with Corel images that are new to both CASIAv1 and CASIAv2, we constructed a revision of CASIAv1 termed CASIAv1plus. We recommend using CASIAv1plus as an alternative to the original CASIAv1.

Trained Models

We offer FCN and MVSS-Net models, each trained on either CASIAv2 or DEFACTO-84k. Please download the models and place them in the ckpt directory:

Baidu Netdisk (extraction code: mvss)
Google drive

The performance of these models for image-level manipulation detection (metric: AUC and image-level F1) is as follows. More details are reported in the paper.

Performance metric: AUC

Model	Training data	CASIAv1plus	Columbia	COVER	DEFACTO-12k
MVSS_Net	CASIAv2	0.932	0.980	0.731	0.573
MVSS_Net	DEFACTO-84k	0.771	0.563	0.525	0.886
FCN	CASIAv2	0.769	0.762	0.541	0.551
FCN	DEFACTO-84k	0.629	0.535	0.543	0.840

Performance metric: Image-level F1 (threshold=0.5)

Model	Training data	CASIAv1plus	Columbia	COVER	DEFACTO-12k
MVSS_Net	CASIAv2	0.759	0.802	0.244	0.404
MVSS_Net	DEFACTO-84k	0.685	0.353	0.360	0.799
FCN	CASIAv2	0.684	0.481	0.180	0.458
FCN	DEFACTO-84k	0.561	0.492	0.511	0.709
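For readers who want to see how metrics of this kind are computed from per-image scores, here is a minimal pure-Python sketch. It is not the repository's evaluation code: the function names `auc` and `image_f1` are hypothetical, F1 is taken in its textbook precision/recall form, and a score strictly above the threshold counts as "manipulated". The repository's own script may define image-level F1 or the threshold comparison differently, so results from this sketch need not match the tables above.

```python
from typing import Sequence


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve via pairwise comparison (ties count as 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    # Fraction of (manipulated, authentic) pairs ranked in the right order.
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def image_f1(
    labels: Sequence[int], scores: Sequence[float], threshold: float = 0.5
) -> float:
    """Textbook F1 (assumed definition) after thresholding image-level scores."""
    preds = [1 if s > threshold else 0 for s in scores]
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
    fp = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0
```

AUC is independent of the threshold, which is why it is reported separately from the thresholded F1.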

Inference & Evaluation

You can specify which pre-trained model to use by setting model_path in do_pred_and_eval.sh. Given a test_collection (e.g. CASIAv1plus or DEFACTO12k-test), the prediction maps and evaluation results will be saved under save_dir. The default threshold is 0.5.

bash do_pred_and_eval.sh $test_collection
#e.g. bash do_pred_and_eval.sh CASIAv1plus
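To illustrate what the 0.5 threshold does to a prediction map, the sketch below binarizes a small map of per-pixel manipulation probabilities. It is illustrative only: the helper names are hypothetical, the map is a plain nested list rather than a tensor, and deriving an image-level score as the maximum pixel value is an assumption made here, not something this README states.

```python
from typing import List

Map = List[List[float]]


def binarize(pred_map: Map, threshold: float = 0.5) -> List[List[int]]:
    """Mark a pixel as manipulated (1) when its score exceeds the threshold."""
    return [[1 if v > threshold else 0 for v in row] for row in pred_map]


def image_score(pred_map: Map) -> float:
    """Assumed image-level score: the largest per-pixel score in the map."""
    return max(max(row) for row in pred_map)


pred = [[0.1, 0.7], [0.5, 0.2]]
mask = binarize(pred)      # only the 0.7 pixel exceeds 0.5
score = image_score(pred)
```

Whether a pixel exactly at the threshold counts as manipulated depends on the comparison used; this sketch uses a strict `>`.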

For inference only, use the following command to skip evaluation:

bash do_pred.sh $test_collection
#e.g. bash do_pred.sh CASIAv1plus

Demo

demo.ipynb: A step-by-step notebook tutorial showing the usage of a pre-trained model to detect manipulation in a specific image.

Citation

If you find this work useful in your research, please consider citing:

@InProceedings{MVSS_2021ICCV,  
author = {Chen, Xinru and Dong, Chengbo and Ji, Jiaqi and Cao, Juan and Li, Xirong},  
title = {Image Manipulation Detection by Multi-View Multi-Scale Supervision},  
booktitle = {The IEEE International Conference on Computer Vision (ICCV)},  
year = {2021}  
}

Acknowledgments

Nvidia-apex is adopted for mixed-precision training and inference.
The implementation of the DA module is based on awesome-semantic-segmentation-pytorch.

Contact

If you encounter any issue when running the code, please feel free to reach us either by creating a new issue on GitHub or by emailing:

Xinru Chen ([email protected])
Chengbo Dong ([email protected])

Owner

dong_chengbo
