Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17

Last update: Dec 11, 2022

Overview

2017 VQA Challenge Winner (CVPR'17 Workshop)

pytorch implementation of Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge by Teney et al.

Prerequisites

python 3.6+
numpy
pytorch 0.4
tqdm
nltk
pandas

Data

Preparation

To download and extract vqav2, glove, and pretrained visual features:
```
bash scripts/download_extract.sh
```
To prepare data for training:
```
python scripts/preproc.py
```

The structure of data/ directory should look like this:

- data/
  - zips/
    - v2_XXX...zip
    - ...
    - glove...zip
    - trainval_36.zip
  - glove/
    - glove...txt
    - ...
  - v2_XXX.json
  - ...
  - trainval_resnet...tsv
  (The above are files created after executing scripts/download_extract.sh)
  - tokenizers/
    - ...
  - dict_ans.pkl
  - dict_q.pkl
  - glove_pretrained_300.npy
  - train_qa.pkl
  - val_qa.pkl
  - train_vfeats.pkl
  - val_vfeats.pkl
  (The above are files created after executing scripts/preproc.py)

Train

Use default parameters:

bash scripts/train.sh

Notes

Huge re-factor (especially data preprocessing), tested based on pytorch 0.4.1 and python 3.6
Training for 20 epochs reach around 50% training accuracy. (model seems buggy in my implementation)
After all the preprocessing, data/ directory may be up to 38G+
Some of preproc.py and utils.py are based on this repo

Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17

Related tags

Overview

2017 VQA Challenge Winner (CVPR'17 Workshop)

Prerequisites

Data

Preparation

Train

Notes

Resources

Owner

Mark Dong

A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

ByT5: Towards a token-free future with pre-trained byte-to-byte models

Use PaddlePaddle to reproduce the paper：mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer

DataCLUE: 国内首个以数据为中心的AI测评（含模型分析报告）

OpenChat: Opensource chatting framework for generative models

Repositório do trabalho de introdução a NLP

Contact Extraction with Question Answering.

Concept Modeling: Topic Modeling on Images and Text

自然言語で書かれた時間情報表現を抽出/規格化するルールベースの解析器

NVDA, the free and open source Screen Reader for Microsoft Windows

多语言降噪预训练模型MBart的中文生成任务

A very simple framework for state-of-the-art Natural Language Processing (NLP)

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

Official PyTorch implementation of "Dual Path Learning for Domain Adaptation of Semantic Segmentation".

A python package for deep multilingual punctuation prediction.

A fast and easy implementation of Transformer with PyTorch.

LewusBot - Twitch ChatBot built in python with twitchio library

Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further languages

Graph Coloring - Weighted Vertex Coloring Problem