
Adaptively Aligned Image Captioning via Adaptive Attention Time

This repository contains the implementation of the NeurIPS 2019 paper Adaptively Aligned Image Captioning via Adaptive Attention Time (AAT).
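At a high level, AAT lets the decoder take a variable number of attention steps per generated word, halting once an accumulated confidence signal is high enough, in the spirit of Adaptive Computation Time. The sketch below illustrates only the halting rule; the function name and the `epsilon`/`max_steps` parameters are illustrative, not the repo's API:

```python
def adaptive_attention_steps(confidences, epsilon=0.01, max_steps=4):
    """Illustrative halting loop: keep attending while the accumulated
    halting confidence stays below 1 - epsilon, up to max_steps.
    Returns the number of attention steps taken for one decoded word."""
    total = 0.0
    for step, p in enumerate(confidences[:max_steps], start=1):
        total += p
        if total >= 1.0 - epsilon:
            return step  # confident enough: stop attending
    return min(len(confidences), max_steps)
```

In the actual model these confidences come from a learned network conditioned on the decoder state, so "easy" words stop after one attention step while harder ones attend several times.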

Requirements

Training AAT

Prepare data (with Python 2)

See details in data/README.md.

(Note: set word_count_threshold in scripts/prepro_labels.py to 4 to obtain a vocabulary of size 10,369.)
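The thresholding itself is simple: rare words are dropped from the vocabulary and mapped to an UNK token. A minimal sketch of the idea (function name is illustrative; whether the cutoff is strict or inclusive here is an assumption, so check scripts/prepro_labels.py for the exact rule):

```python
from collections import Counter

def build_vocab(captions, word_count_threshold=4):
    """Illustrative vocabulary construction: words whose corpus count does
    not exceed `word_count_threshold` are replaced by a single UNK token."""
    counts = Counter(w for cap in captions for w in cap.split())
    vocab = sorted(w for w, c in counts.items() if c > word_count_threshold)
    return vocab + ['UNK']  # UNK absorbs all filtered-out rare words
```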

You should also preprocess the dataset to build the n-gram cache used for computing CIDEr scores during SCST (self-critical sequence training):

$ python scripts/prepro_ngrams.py --input_json data/dataset_coco.json --dict_json data/cocotalk.json --output_pkl data/coco-train --split train
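What this cache stores is essentially n-gram statistics over the training captions: CIDEr compares 1- to 4-grams of a candidate caption against the references, weighted by tf-idf, so the document frequencies can be precomputed once. A rough sketch of the counting step (function names are illustrative; the real script's storage format and exact df definition may differ):

```python
from collections import Counter

def extract_ngrams(tokens, n_max=4):
    """Count all 1- to n_max-grams in one tokenized caption."""
    counts = Counter()
    for n in range(1, n_max + 1):
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts

def document_frequencies(captions, n_max=4):
    """Number of captions in which each n-gram appears at least once;
    CIDEr turns these counts into idf weights at scoring time."""
    df = Counter()
    for cap in captions:
        for ngram in extract_ngrams(cap.split(), n_max):
            df[ngram] += 1
    return df
```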

Training

$ sh train-aat.sh

See opts.py for the options.
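For the SCST phase of training, the objective follows self-critical sequence training: the CIDEr reward of a sampled caption is baselined by the reward of the greedy decode, so no learned value function is needed. A one-function sketch of that objective for a single caption (hypothetical helper, not the repo's loss code):

```python
def scst_loss(sample_logprob, sample_reward, greedy_reward):
    """Illustrative SCST objective for one sampled caption: the greedy
    decode's reward is the baseline, so the gradient raises the
    log-probability of samples that beat greedy decoding and lowers it
    otherwise."""
    advantage = sample_reward - greedy_reward
    return -advantage * sample_logprob
```

Minimizing this loss over mini-batches of sampled captions directly optimizes the expected CIDEr reward.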

Evaluation

$ CUDA_VISIBLE_DEVICES=0 python eval.py --model log/log_aat_rl/model.pth --infos_path log/log_aat_rl/infos_aat.pkl  --dump_images 0 --dump_json 1 --num_images -1 --language_eval 1 --beam_size 2 --batch_size 100 --split test
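The --beam_size 2 flag above selects beam search decoding: at each step the decoder keeps the 2 highest-scoring partial captions instead of committing to a single greedy choice. A toy sketch of the mechanism, with per-step log-probabilities precomputed and independent of history purely for illustration:

```python
def beam_search(step_logprobs, beam_size=2):
    """Toy beam search: step_logprobs[t][token] is the log-probability of
    `token` at step t. Keeps the beam_size best partial sequences per step
    and returns the highest-scoring complete sequence."""
    beams = [([], 0.0)]  # (sequence, cumulative log-probability)
    for logprobs in step_logprobs:
        candidates = []
        for seq, score in beams:
            for token, lp in logprobs.items():
                candidates.append((seq + [token], score + lp))
        beams = sorted(candidates, key=lambda b: b[1], reverse=True)[:beam_size]
    return beams[0][0]
```

In the real decoder the step distribution depends on the tokens chosen so far, which is why beam search can beat greedy decoding.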

Reference

If you find this repo helpful, please consider citing:

@inproceedings{huang2019adaptively,
  title     = {Adaptively Aligned Image Captioning via Adaptive Attention Time},
  author    = {Huang, Lun and Wang, Wenmin and Xia, Yaxian and Chen, Jie},
  booktitle = {Advances in Neural Information Processing Systems 32},
  year      = {2019}
}

Acknowledgements

This repository is based on Ruotian Luo's self-critical.pytorch.
