UVO_Challenge

Team Alpes_runner Solutions

This is an official repo for our UVO Challenge solutions for Image/Video-based open-world segmentation. Our team "Alpes_runner" achieved the best performance on both Image/Video-based benchmarks. More details about the workshop can be found here.

Technical Reports

For Track 1: paper
For Track 2: paper

Models

Detection

Model	Pretrained datasets	Finetuned datasets	links
UVO_Detector	COCO	-	config/weights
UVO_Detector	COCO	UVO	config/weights

Segmentation

Model	Pretrained datasets	Finetuned datasets	links
UVO_Segementor	COCO	-	weights
UVO_Segmentor	COCO, PASCAL, OpenImage	-	config/weights
UVO_Segmentor	COCO, PASCAL, OpenImage	UVO	config/weights

Citation

If you find this project useful in your research, please consider cite:

@article{du20211st,
  title={1st Place Solution for the UVO Challenge on Image-based Open-World Segmentation 2021},
  author={Du, Yuming and Guo, Wen and Xiao, Yang and Lepetit, Vincent},
  journal={arXiv preprint arXiv:2110.10239},
  year={2021}
}

@article{du20211st,
  title={1st Place Solution for the UVO Challenge on Video-based Open-World Segmentation 2021},
  author={Du, Yuming and Guo, Wen and Xiao, Yang and Lepetit, Vincent},
  journal={arXiv preprint arXiv:2110.11661},
  year={2021}
}

Contact

Feel free to contact me or open a new issue if you have any questions.

Video-based open-world segmentation

Related tags

Overview

UVO_Challenge

Team Alpes_runner Solutions

Technical Reports

Models

Citation

Contact

Owner

Yuming Du

Supplementary code for the experiments described in the 2021 ISMIR submission: Leveraging Hierarchical Structures for Few Shot Musical Instrument Recognition.

An implementation of chunked, compressed, N-dimensional arrays for Python.

TorchXRayVision: A library of chest X-ray datasets and models.

SegNet including indices pooling for Semantic Segmentation with tensorflow and keras

Adversarial Learning for Semi-supervised Semantic Segmentation, BMVC 2018

Code, environments, and scripts for the paper: "How Private Is Your RL Policy? An Inverse RL Based Analysis Framework"

An imperfect information game is a type of game with asymmetric information

A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm

EDCNN: Edge enhancement-based Densely Connected Network with Compound Loss for Low-Dose CT Denoising

[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles

The official PyTorch implementation of Curriculum by Smoothing (NeurIPS 2020, Spotlight).

Deep Learning to Create StepMania SM FIles

Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

DeLighT: Very Deep and Light-Weight Transformers

Split your patch similarly to `git add -p` but supporting multiple buckets

Official implementation for NIPS'17 paper: PredRNN: Recurrent Neural Networks for Predictive Learning Using Spatiotemporal LSTMs.

This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis, accepted at EMNLP 2021.

Food Drinks and groceries Images Multi Lingual (FooDI-ML) dataset.

SGoLAM - Simultaneous Goal Localization and Mapping

Flexible-Modal Face Anti-Spoofing: A Benchmark