HOI Transformer

Code for CVPR 2021 accepted paper End-to-End Human Object Interaction Detection with HOI Transformer.

Reproduction

We recomend you to setup in the following steps:

1.Clone the repo.

git clone https://github.com/bbepoch/HoiTransformer.git

2.Download the MS-COCO pretrained DETR model.

cd data/detr_coco && bash download_model.sh

3.You are supposed to make a soft link named 'images' in 'data/hico/' to refer to your HICO-DET path, or your will have to modify the data path manually in hico.py.

ln -s /path-to-your-hico-det-dataset/hico_20160224_det/images images

4.Train a model.

python3 -m torch.distributed.launch --nproc_per_node=8 --use_env main.py --epochs=250 --lr_drop=200 --dataset_file=hico --batch_size=2 --backbone=resnet50

5.Test a model.

python3 test.py --dataset_file=hico --batch_size=1 --log_dir=./ --model_path=your_model_path

Citation

@inproceedings{zou2021_hoitrans,
author = {Zou, Cheng and Wang, Bohan and Hu, Yue and Liu, Junqi and Wu, Qian and Zhao, Yu and Li, Boxun and Zhang, Chenguang and Zhang, Chi and Wei, Yichen and Sun, Jian},
title = {End-to-End Human Object Interaction Detection with HOI Transformer},
booktitle={CVPR},
year = {2021},
}

Acknowledgement

We sincerely thank all previous works, especially DETR, PPDM, iCAN, for some of the codes are built upon them.

This is the code for HOI Transformer

Related tags

Overview

HOI Transformer

Reproduction

Citation

Acknowledgement

Owner

BigBangEpoch

The final project of "Applying AI to 3D Medical Imaging Data" from "AI for Healthcare" nanodegree - Udacity.

KE-Dialogue: Injecting knowledge graph into a fully end-to-end dialogue system.

PyTorch and Tensorflow functional model definitions

Supervised Contrastive Learning for Product Matching

the code for our CVPR 2021 paper Bilateral Grid Learning for Stereo Matching Network [BGNet]

Python scripts form performing stereo depth estimation using the HITNET model in ONNX.

Awesome Human Pose Estimation

Hierarchical Cross-modal Talking Face Generation with Dynamic Pixel-wise Loss （ATVGnet）

Causal Imitative Model for Autonomous Driving

Source code of the paper PatchGraph: In-hand tactile tracking with learned surface normals.

Official implementation of the paper DeFlow: Learning Complex Image Degradations from Unpaired Data with Conditional Flows

NeROIC: Neural Object Capture and Rendering from Online Image Collections

Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

PerfFuzz: Automatically Generate Pathological Inputs for C/C++ programs

Spatially-Adaptive Pixelwise Networks for Fast Image Translation, CVPR 2021

A PyTorch-based library for fast prototyping and sharing of deep neural network models.

Hands-On Machine Learning for Algorithmic Trading, published by Packt

Data stream analytics: Implement online learning methods to address concept drift in data streams using the River library. Code for the paper entitled "PWPAE: An Ensemble Framework for Concept Drift Adaptation in IoT Data Streams" accepted in IEEE GlobeCom 2021.

IRON Kaggle project done while doing IRONHACK Bootcamp where we had to analyze and use a Machine Learning Project to predict future sales