git《USD-Seg:Learning Universal Shape Dictionary for Realtime Instance Segmentation》(2020) GitHub: [fig2]

Last update: Nov 28, 2022

Related tags

Overview

USD-Seg

This project is an implement of paper USD-Seg:Learning Universal Shape Dictionary for Realtime Instance Segmentation, based on FCOS detector from MMDetection tool box.

Introduction

We present a novel explicit shape representation for instance segmentation. The proposed USD-Seg adopts a linear model, sparse coding with dictionary, for object shapes. First, it learns a dictionary from a large collection of shape datasets, making any shape being able to be decomposed into a linear combination through the dictionary. Hence the name "Universal Shape Dictionary". It adds a simple shape vector regression head to ordinary object detector, giving the detector segmentation ability with minimal overhead.

License

This project is released under the Apache 2.0 license.

Model

The overall pipeline of USD-Seg: an RGB image is input to the base detector, and the base detector will regress both detection related information (bounding box and class) and the shape vector. Then the mask will be decoded by simple multiplication between shape vector and dictionary atoms, followed by proper resize and threshold operations.

Installation

Please refer to INSTALL.md for installation and dataset preparation.

Get Started

Please see GETTING_STARTED.md for the basic usage of MMDetection.
We follow the original usage of mmdetection framework. You can use configs for usd-seg in /configs/usdseg/ to train from scratch.

Citation

If you use this toolbox or benchmark in your research, please cite this project and mmdetection.

@article{USD-Seg,
  title   = {Learning Universal Shape Dictionary for Realtime Instance Segmentation},
  author  = {Tang, Tutian and Xu, Wenqiang and Ye, Ruolin and Yang, Lixin and Lu, Cewu},
  journal= {arXiv preprint arXiv:2012.01050},
  year={2020}
}

Contact

This repo is currently maintained by Tutian tang (@ElectronicElephant)and Ruolin Ye (@YoruCathy). Other core developers include Wenqiang Xu (@WenqiangX). For technical details, please feel free to contact the authors directly via Email.

git《USD-Seg:Learning Universal Shape Dictionary for Realtime Instance Segmentation》(2020) GitHub: [fig2]

Related tags

Overview

USD-Seg

Introduction

License

Model

Installation

Get Started

Citation

Contact

Owner

Ruolin Ye

A flag generation AI created using DeepAIs API

[3DV 2021] A Dataset-Dispersion Perspective on Reconstruction Versus Recognition in Single-View 3D Reconstruction Networks

This is the repository for our paper Ditch the Gold Standard: Re-evaluating Conversational Question Answering

Pytorch implementation of One-Shot Affordance Detection

This is the official code of L2G, Unrolling and Recurrent Unrolling in Learning to Learn Graph Topologies.

A unified framework to jointly model images, text, and human attention traces.

PAMI stands for PAttern MIning. It constitutes several pattern mining algorithms to discover interesting patterns in transactional/temporal/spatiotemporal databases

[ICCV 2021] Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain

A PyTorch implementation of QANet.

This implements the learning and inference/proposal algorithm described in "Learning to Propose Objects, Krähenbühl and Koltun"

A simple Tensorflow based library for deep and/or denoising AutoEncoder.

This is an official implementation for "DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation"

Live Hand Tracking Using Python

PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision

A simple baseline for 3d human pose estimation in tensorflow. Presented at ICCV 17.

Implementation of the ICCV'21 paper Temporally-Coherent Surface Reconstruction via Metric-Consistent Atlases

Real-time VIBE: Frame by Frame Inference of VIBE (Video Inference for Human Body Pose and Shape Estimation)

Towards Improving Embedding Based Models of Social Network Alignment via Pseudo Anchors

FID calculation with proper image resizing and quantization steps

OpenIPDM is a MATLAB open-source platform that stands for infrastructures probabilistic deterioration model