[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias

Overview

Counterfactual VQA (CF-VQA)

This repository is the PyTorch implementation of our CVPR 2021 paper "Counterfactual VQA: A Cause-Effect Look at Language Bias". The code is implemented as a fork of RUBi.

CF-VQA is proposed to capture and mitigate language bias in VQA from a causal perspective. CF-VQA (1) formulates the language bias as the direct causal effect of questions on answers, and (2) reduces the language bias by subtracting the direct language effect from the total causal effect.
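
As a hedged illustration (not the repository's exact code), the sketch below shows the idea behind step (2) under the simplified causal graph with SUM fusion: the factual score combines the question-only and fused vision-question branches (total effect), the counterfactual score replaces the fused branch with a constant so that only the direct language effect remains (natural direct effect), and their difference (the total indirect effect) is used for prediction. The function name, tensor names, and the constant c are assumptions made for illustration.

import torch
import torch.nn.functional as F

def tie_scores(z_vq: torch.Tensor, z_q: torch.Tensor, c: float = 0.0) -> torch.Tensor:
    # z_vq: logits of the fused vision-question branch
    # z_q:  logits of the question-only branch
    # c:    constant standing in for the blocked fused branch in the counterfactual world
    factual = F.logsigmoid(z_vq + z_q)                            # total effect: both branches active
    counterfactual = F.logsigmoid(torch.full_like(z_q, c) + z_q)  # only the language effect remains
    return factual - counterfactual                               # total indirect effect = TE - NDE

The branches are trained jointly; the subtraction above is only applied at inference time.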

If you find this paper helpful for your research, please consider citing it in your publications.

@inproceedings{niu2020counterfactual,
  title={Counterfactual VQA: A Cause-Effect Look at Language Bias},
  author={Niu, Yulei and Tang, Kaihua and Zhang, Hanwang and Lu, Zhiwu and Hua, Xian-Sheng and Wen, Ji-Rong},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year={2021}
}

Summary

  • Installation
  • Quick start
  • Useful commands
  • Acknowledgment

Installation

1. Setup and dependencies

Install the Anaconda or Miniconda distribution (Python 3+) from the official download site.

conda create --name cfvqa python=3.7
source activate cfvqa
pip install -r requirements.txt

2. Download datasets

Download annotations, images and features for VQA experiments:

bash cfvqa/datasets/scripts/download_vqa2.sh
bash cfvqa/datasets/scripts/download_vqacp2.sh

Quick start

Train a model

The bootstrap/run.py file loads the options contained in a YAML file, creates the corresponding experiment directory, and starts the training procedure. For instance, you can train our best model on VQA-CP v2 (CF-VQA+SUM+SMRL) by running:

python -m bootstrap.run -o cfvqa/options/vqacp2/smrl_cfvqa_sum.yaml

Several files will then be created in logs/vqacp2/smrl_cfvqa_sum/:

  • options.yaml (copy of the options)
  • logs.txt (history of the printed logs)
  • logs.json (batch and epoch statistics; see the sketch after this list)
  • _vq_val_oe.json (statistics for the language-prior based strategy, e.g., RUBi)
  • _cfvqa_val_oe.json (statistics for CF-VQA)
  • _q_val_oe.json (statistics for the language-only branch)
  • _v_val_oe.json (statistics for the vision-only branch)
  • _all_val_oe.json (statistics for the ensembled branch)
  • ckpt_last_engine.pth.tar (engine checkpoint of the last epoch)
  • ckpt_last_model.pth.tar (model checkpoint of the last epoch)
  • ckpt_last_optimizer.pth.tar (optimizer checkpoint of the last epoch)
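
The exact schema of logs.json depends on the bootstrap.pytorch version in use, so the hedged sketch below only loads the file and lists the recorded statistic names rather than assuming specific keys (the path matches the experiment above):

import json

# Assumes logs.json is a JSON object mapping statistic names to lists of values.
with open("logs/vqacp2/smrl_cfvqa_sum/logs.json") as f:
    logs = json.load(f)

print(sorted(logs.keys()))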

Many options are available in the options directory: cfvqa refers to the complete causal graph, while cfvqas refers to the simplified one.
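
For instance, to train the simplified-graph counterpart of the model above, point the same command at the matching option file (the filename below is an assumption; check the options directory for the exact name):

python -m bootstrap.run -o cfvqa/options/vqacp2/smrl_cfvqas_sum.yaml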

Evaluate a model

There is no test set for VQA-CP v2, our main dataset, so the evaluation is done on the validation set (for a model trained on VQA v2, you can evaluate on the test set instead). In the following example, bootstrap/run.py loads the options from your experiment directory, resumes the last checkpoint, and starts an evaluation on the validation set while skipping the training set (train_split is empty). Thanks to --misc.logs_name, the logs will be written to the new logs_test.txt and logs_test.json files instead of being appended to logs.txt and logs.json.

python -m bootstrap.run \
-o ./logs/vqacp2/smrl_cfvqa_sum/options.yaml \
--exp.resume last \
--dataset.train_split '' \
--dataset.eval_split val \
--misc.logs_name test 

Useful commands

Use a specific GPU

For a specific experiment:

CUDA_VISIBLE_DEVICES=0 python -m bootstrap.run -o cfvqa/options/vqacp2/smrl_cfvqa_sum.yaml

For the current terminal session:

export CUDA_VISIBLE_DEVICES=0

Overwrite an option

The bootstrap.pytorch framework makes it easy to overwrite a hyperparameter. In this example, we run an experiment with a non-default learning rate, so we also overwrite the experiment directory path:

python -m bootstrap.run -o cfvqa/options/vqacp2/smrl_cfvqa_sum.yaml \
--optimizer.lr 0.0003 \
--exp.dir logs/vqacp2/smrl_cfvqa_sum_lr,0.0003

Resume training

If a problem occurs, it is easy to resume the last epoch by specifying the options file from the experiment directory while overwriting the exp.resume option (default is None):

python -m bootstrap.run -o logs/vqacp2/smrl_cfvqa_sum/options.yaml \
--exp.resume last

Acknowledgment

Special thanks to the authors of RUBi, BLOCK, and bootstrap.pytorch, and to the creators of the datasets used in this research project.
