Joint-task Self-supervised Learning for Temporal Correspondence (NeurIPS 2019)

Last update: Dec 14, 2022

Project | Paper

Overview

Xueting Li*, Sifei Liu*, Shalini De Mello, Xiaolong Wang, Jan Kautz, Ming-Hsuan Yang.

(* equal contributions)

In Neural Information Processing Systems (NeurIPS), 2019.

Citation

If you use our code in your research, please cite it with the following BibTeX entry:

@inproceedings{uvc_2019,
    Author = {Xueting Li and Sifei Liu and Shalini De Mello and Xiaolong Wang and Jan Kautz and Ming-Hsuan Yang},
    Title = {Joint-task Self-supervised Learning for Temporal Correspondence},
    Booktitle = {NeurIPS},
    Year = {2019},
}

Instance segmentation propagation on DAVIS2017

Method	J_mean	J_recall	J_decay	F_mean	F_recall	F_decay
Ours	0.563	0.650	0.289	0.592	0.641	0.354
Ours - track	0.577	0.683	0.263	0.613	0.698	0.324
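
For readers unfamiliar with the column names, the sketch below illustrates how DAVIS-style statistics are commonly derived from per-frame scores: the mean is the average score, recall is the fraction of frames scoring above a threshold, and decay is the average of the earliest frames minus the average of the latest frames. This is not the official evaluation code. The function names `region_similarity` and `summarize`, the 0.5 threshold, and the four-bin split are assumptions chosen to mirror the usual benchmark convention.

```python
def region_similarity(pred, gt):
    """Jaccard index (IoU) between two binary masks given as nested lists of 0/1."""
    inter = union = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            inter += 1 if (p and g) else 0
            union += 1 if (p or g) else 0
    # Two empty masks are treated as a perfect match (an assumption).
    return 1.0 if union == 0 else inter / union


def summarize(scores, threshold=0.5, bins=4):
    """Return (mean, recall, decay) for a non-empty list of per-frame scores."""
    n = len(scores)
    mean = sum(scores) / n
    # Recall: fraction of frames whose score exceeds the threshold.
    recall = sum(1 for s in scores if s > threshold) / n
    # Decay: average of the first temporal bin minus average of the last bin.
    size = max(1, n // bins)
    first, last = scores[:size], scores[-size:]
    decay = sum(first) / len(first) - sum(last) / len(last)
    return mean, recall, decay
```

The same `summarize` helper would apply to contour accuracy (F) scores, since only the per-frame measure differs between the J and F columns.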

Prerequisites

The code is tested in the following environment:

Ubuntu 16.04
PyTorch 1.1.0, tqdm, scipy 1.2.1

Testing on DAVIS2017

Testing without tracking

To test on DAVIS2017 for instance segmentation mask propagation, please run:

python test.py -d /workspace/DAVIS/ -s 480

Important parameters:

-c: checkpoint path.
-o: results path.
-d: DAVIS 2017 dataset path.
-s: test resolution. All results in the paper are obtained on 480p images, i.e. -s 480.

Please check the test.py file for other parameters.
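
When running several evaluations, it can be convenient to assemble the command from Python. The following is a minimal sketch that uses only the four flags listed above; the helper name `build_davis_command` and its keyword arguments are hypothetical and not part of this repository.

```python
import shlex


def build_davis_command(dataset_dir, resolution=480, checkpoint=None,
                        out_dir=None, script="test.py"):
    """Assemble the argument list for a DAVIS2017 test run."""
    cmd = ["python", script, "-d", dataset_dir, "-s", str(resolution)]
    # -c and -o are only added when given, so the script's own defaults apply.
    if checkpoint is not None:
        cmd += ["-c", checkpoint]
    if out_dir is not None:
        cmd += ["-o", out_dir]
    return cmd


if __name__ == "__main__":
    # Print the command so it can be inspected before running it.
    print(shlex.join(build_davis_command("/workspace/DAVIS/")))
```

The returned list can be passed to `subprocess.run(cmd, check=True)` from the repository root to launch the test.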

Testing with tracking

To test on DAVIS2017 by tracking & propagation, please run:

python test_with_track.py -d /workspace/DAVIS/ -s 480

The parameters are similar to those of test.py; please see test_with_track.py for details.

Testing on the VIP dataset

To test on VIP, please run the following command with your own VIP path:

python test_mask_vip.py -o results/VIP/category/ --scale_size 560 560 --pre_num 1 -d /DATA/VIP/VIP_Fine/Images/ --val_txt /DATA/VIP/VIP_Fine/lists/val_videos.txt -c weights/checkpoint_latest.pth.tar

and then:

python eval_vip.py -g /DATA/VIP/VIP_Fine/Annotations/Category_ids/ -p results/VIP/category/
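
The two VIP steps share one directory: the path passed to -o in the first command has to be the path passed to -p in the second. A small sketch that keeps them in sync is shown here; the helper name `build_vip_commands` and its argument names are hypothetical, while the flags and values are copied from the commands above.

```python
def build_vip_commands(image_dir, val_txt, annotation_dir, checkpoint,
                       results_dir="results/VIP/category/"):
    """Return (test_cmd, eval_cmd) argument lists that share one results directory."""
    test_cmd = [
        "python", "test_mask_vip.py",
        "-o", results_dir,
        "--scale_size", "560", "560",
        "--pre_num", "1",
        "-d", image_dir,
        "--val_txt", val_txt,
        "-c", checkpoint,
    ]
    # The evaluation reads predictions from the directory the test step wrote to.
    eval_cmd = ["python", "eval_vip.py", "-g", annotation_dir, "-p", results_dir]
    return test_cmd, eval_cmd
```

Running `subprocess.run(test_cmd, check=True)` followed by `subprocess.run(eval_cmd, check=True)` would execute the steps in order and stop if the first one fails.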

Testing on the JHMDB dataset

Please check out this branch. The code is borrowed from TimeCycle.

Training on Kinetics

Dataset

We use the Kinetics dataset for training.

Training command

python track_match_v1.py --wepoch 10 --nepoch 30 -c match_track_switch --batchsize 40 --coord_switch 0 --lc 0.3

Acknowledgements

This code is based on TPN and TimeCycle.
For any issues, please contact [email protected] or [email protected].

Owner

Sifei Liu
