CVPR2020 Counterfactual Samples Synthesizing for Robust VQA

Last update: Dec 22, 2022

Overview

CVPR2020 Counterfactual Samples Synthesizing for Robust VQA

This repo contains code for our paper "Counterfactual Samples Synthesizing for Robust Visual Question Answering" This repo contains code modified from here,many thanks!

Prerequisites

Make sure you are on a machine with a NVIDIA GPU and Python 2.7 with about 100 GB disk space.
h5py==2.10.0
pytorch==1.1.0
Click==7.0
numpy==1.16.5
tqdm==4.35.0

Data Setup

You can use

bash tools/download.sh

to download the data
and the rest of the data and trained model can be obtained from BaiduYun(passwd:3jot) or GoogleDrive unzip feature1.zip and feature2.zip and merge them into data/rcnn_feature/
use

bash tools/process.sh

to process the data

Training

Run

CUDA_VISIBLE_DEVICES=0 python main.py --dataset cpv2 --mode q_v_debias --debias learned_mixin --topq 1 --topv -1 --qvp 5 --output [] --seed 0

to train a model

Testing

Run

CUDA_VISIBLE_DEVICES=0 python eval.py --dataset cpv2 --debias learned_mixin --model_state []

to eval a model

Citation

If you find this code useful, please cite the following paper:

@inproceedings{chen2020counterfactual,
title={Counterfactual Samples Synthesizing for Robust Visual Question Answering},
author={Chen, Long and Yan, Xin and Xiao, Jun and Zhang, Hanwang and Pu, Shiliang and Zhuang, Yueting},
booktitle={CVPR},
year={2020}
}

CVPR2020 Counterfactual Samples Synthesizing for Robust VQA

Related tags

Overview

CVPR2020 Counterfactual Samples Synthesizing for Robust VQA

Prerequisites

Data Setup

Training

Testing

Citation

Owner

Playing around with FastAPI and streamlit to create a YoloV5 object detector

Code for the TPAMI paper: "Syntax Customized Video Captioning by Imitating Exemplar Sentences"

Improving Generalization Bounds for VC Classes Using the Hypergeometric Tail Inversion

Meta-meta-learning with evolution and plasticity

《DeepViT: Towards Deeper Vision Transformer》(2021)

Contrastive Learning with Non-Semantic Negatives

[Nature Machine Intelligence' 21] "Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence"

MINIROCKET: A Very Fast (Almost) Deterministic Transform for Time Series Classification

The source code of the paper "SHGNN: Structure-Aware Heterogeneous Graph Neural Network"

This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in the Wild"

Unofficial TensorFlow implementation of the Keyword Spotting Transformer model

The codes and related files to reproduce the results for Image Similarity Challenge Track 2.

The Multi-Mission Maximum Likelihood framework (3ML)

The code for Bi-Mix: Bidirectional Mixing for Domain Adaptive Nighttime Semantic Segmentation

Chatbot in 200 lines of code using TensorLayer

Simple and ready-to-use tutorials for TensorFlow

Multi-agent reinforcement learning algorithm and environment

Library of various Few-Shot Learning frameworks for text classification

Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand

A minimal implementation of Gaussian process regression in PyTorch