Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17

Last update: Dec 11, 2022

Overview

2017 VQA Challenge Winner (CVPR'17 Workshop)

pytorch implementation of Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge by Teney et al.

Prerequisites

python 3.6+
numpy
pytorch 0.4
tqdm
nltk
pandas

Data

Preparation

To download and extract vqav2, glove, and pretrained visual features:
```
bash scripts/download_extract.sh
```
To prepare data for training:
```
python scripts/preproc.py
```

The structure of data/ directory should look like this:

- data/
  - zips/
    - v2_XXX...zip
    - ...
    - glove...zip
    - trainval_36.zip
  - glove/
    - glove...txt
    - ...
  - v2_XXX.json
  - ...
  - trainval_resnet...tsv
  (The above are files created after executing scripts/download_extract.sh)
  - tokenizers/
    - ...
  - dict_ans.pkl
  - dict_q.pkl
  - glove_pretrained_300.npy
  - train_qa.pkl
  - val_qa.pkl
  - train_vfeats.pkl
  - val_vfeats.pkl
  (The above are files created after executing scripts/preproc.py)

Train

Use default parameters:

bash scripts/train.sh

Notes

Huge re-factor (especially data preprocessing), tested based on pytorch 0.4.1 and python 3.6
Training for 20 epochs reach around 50% training accuracy. (model seems buggy in my implementation)
After all the preprocessing, data/ directory may be up to 38G+
Some of preproc.py and utils.py are based on this repo

Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17

Related tags

Overview

2017 VQA Challenge Winner (CVPR'17 Workshop)

Prerequisites

Data

Preparation

Train

Notes

Resources

Owner

Mark Dong

YoloAll is a collection of yolo all versions. you you use YoloAll to test yolov3/yolov5/yolox/yolo_fastest

tsai is an open-source deep learning package built on top of Pytorch & fastai focused on state-of-the-art techniques for time series classification, regression and forecasting.

imbalanced-DL: Deep Imbalanced Learning in Python

Project dự đoán giá cổ phiếu bằng thuật toán LSTM gồm: code train và code demo

Matching python environment code for Lux AI 2021 Kaggle competition, and a gym interface for RL models.

Deploy optimized transformer based models on Nvidia Triton server

This repository attempts to replicate the SqueezeNet architecture and implement the same on an image classification task.

🌎 The Modern Declarative Data Flow Framework for the AI Empowered Generation.

Code and models for ICCV2021 paper "Robust Object Detection via Instance-Level Temporal Cycle Confusion".

Code for "Finding Regions of Heterogeneity in Decision-Making via Expected Conditional Covariance" at NeurIPS 2021

Personalized Transfer of User Preferences for Cross-domain Recommendation (PTUPCDR)

MultiTaskLearning - Multi Task Learning for 3D segmentation

Transfer-Learn is an open-source and well-documented library for Transfer Learning.

Fast and Context-Aware Framework for Space-Time Video Super-Resolution (VCIP 2021)

DockStream: A Docking Wrapper to Enhance De Novo Molecular Design

Transformer model implemented with Pytorch

Resources complimenting the Machine Learning Course led in the Faculty of mathematics and informatics part of Sofia University.

Pytorch cuda extension of grid_sample1d

A pytorch implementation of the CVPR2021 paper "VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild"

DIVeR: Deterministic Integration for Volume Rendering