HOI Transformer

Code for the CVPR 2021 paper "End-to-End Human Object Interaction Detection with HOI Transformer".

Reproduction

We recommend setting up with the following steps:

1. Clone the repo.

git clone https://github.com/bbepoch/HoiTransformer.git

2. Download the MS-COCO pretrained DETR model.

cd data/detr_coco && bash download_model.sh

3. Create a soft link named 'images' in 'data/hico/' that points to your HICO-DET image directory; otherwise you will have to modify the data path manually in hico.py.

ln -s /path-to-your-hico-det-dataset/hico_20160224_det/images images

4. Train a model.

python3 -m torch.distributed.launch --nproc_per_node=8 --use_env main.py --epochs=250 --lr_drop=200 --dataset_file=hico --batch_size=2 --backbone=resnet50

5. Test a model.

python3 test.py --dataset_file=hico --batch_size=1 --log_dir=./ --model_path=your_model_path
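Before launching a long training run, it can help to confirm that the soft link from step 3 actually resolves. The following is a minimal sketch, not part of the repository: the helper names `check_image_link` and `effective_batch_size` are hypothetical. The batch-size helper assumes that `--batch_size` is counted per process, as in DETR; this README does not confirm that.

```python
import os


def check_image_link(link_path):
    """Return (ok, message) describing whether link_path resolves to a directory."""
    if not os.path.lexists(link_path):
        return False, f"{link_path} does not exist; create it with ln -s (step 3)"
    if not os.path.isdir(link_path):
        # A dangling symlink or a regular file both end up here.
        return False, f"{link_path} does not resolve to a directory"
    target = os.path.realpath(link_path)
    kind = "symlink" if os.path.islink(link_path) else "plain directory"
    return True, f"{link_path} is a {kind} resolving to {target}"


def effective_batch_size(nproc_per_node, batch_size):
    """Images per optimizer step, ASSUMING --batch_size is per process."""
    return nproc_per_node * batch_size


if __name__ == "__main__":
    # Path taken from step 3; run this from the repository root.
    ok, message = check_image_link(os.path.join("data", "hico", "images"))
    print("OK:" if ok else "PROBLEM:", message)
    # Values taken from the training command in step 4.
    print("effective batch size:", effective_batch_size(8, 2))
```

If you train on fewer GPUs by lowering `--nproc_per_node`, the effective batch size shrinks accordingly under the same assumption, so results may differ from the 8-process setting shown in step 4.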

Citation

@inproceedings{zou2021_hoitrans,
author = {Zou, Cheng and Wang, Bohan and Hu, Yue and Liu, Junqi and Wu, Qian and Zhao, Yu and Li, Boxun and Zhang, Chenguang and Zhang, Chi and Wei, Yichen and Sun, Jian},
title = {End-to-End Human Object Interaction Detection with HOI Transformer},
booktitle={CVPR},
year = {2021},
}

Acknowledgement

We sincerely thank the authors of all previous works, especially DETR, PPDM, and iCAN, as some of our code is built upon theirs.
