Deep Watershed Transform for Instance Segmentation

Last update: Nov 20, 2022

Related tags

Overview

Deep Watershed Transform

Performs instance level segmentation detailed in the following paper:

Min Bai and Raquel Urtasun, Deep Watershed Transformation for Instance Segmentation, in CVPR 2017. Accessible at https://arxiv.org/abs/1611.08303.

This page is still under construction.

Dependencies

Developed and tested on Ubuntu 14.04 and 16.04.

TensorFlow www.tensorflow.org
Numpy, Scipy, and Skimage (sudo apt-get install python-numpy python-scipy python-skimage)

Inputs

Cityscapes images (www.cityscapes-dataset.com).
Semantic Segmentation for input images. In our case, we used the output from PSPNet (by H. Zhao et al. https://github.com/hszhao/PSPNet). These are uint8 images with pixel-wise semantic labels encoded with 'trainIDs' defined by Cityscapes. For more information, visit https://github.com/mcordts/cityscapesScripts/blob/master/cityscapesscripts/helpers/labels.py

Outputs

The model produces pixel-wise instance labels as a uint16 image with the same formatting as the Cityscapes instance segmentation challenge ground truth. In particular, each pixel is labeled as 'id' * 1000 + instance_id, where 'id' is as defined by Cityscapes (for more information, consult labels.py in the above link), and instance_id is an integer indexing the object instance.

Testing the Model

Clone repository into dwt/.
Download the model from www.cs.toronto.edu/~mbai/dwt_cityscapes_pspnet.mat and place into the "dwt/model" directory.
run "cd E2E"
run "python main.py"
The results will be available in "dwt/example/output".

Training the Model

Will be available soon.

Deep Watershed Transform for Instance Segmentation

Related tags

Overview

Deep Watershed Transform

Dependencies

Inputs

Outputs

Testing the Model

Training the Model

Owner

Full-featured Decision Trees and Random Forests learner.

PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INTERSPEECH 2020)

GPU-Accelerated Deep Learning Library in Python

Self-Supervised Collision Handling via Generative 3D Garment Models for Virtual Try-On

State-of-the-art language models can match human performance on many tasks

ObjDetApp deploys a pytorch model for object detection

MinkLoc++: Lidar and Monocular Image Fusion for Place Recognition

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers

Unofficial Alias-Free GAN implementation. Based on rosinality's version with expanded training and inference options.

COPA-SSE contains crowdsourced explanations for the Balanced COPA dataset

Reference code for the paper CAMS: Color-Aware Multi-Style Transfer.

FPSAutomaticAiming——基于YOLOV5的FPS类游戏自动瞄准AI

2021-MICCAI-Progressively Normalized Self-Attention Network for Video Polyp Segmentation

Streaming over lightweight data transformations

Generating Videos with Scene Dynamics

MISSFormer: An Effective Medical Image Segmentation Transformer

The official MegEngine implementation of the ICCV 2021 paper: GyroFlow: Gyroscope-Guided Unsupervised Optical Flow Learning

Convert Apple NeuralHash model for CSAM Detection to ONNX.

Semi-supervised semantic segmentation needs strong, varied perturbations

Data, model training, and evaluation code for "PubTables-1M: Towards a universal dataset and metrics for training and evaluating table extraction models".

Deep Watershed Transform for Instance Segmentation

Related tags

Overview

Deep Watershed Transform

Dependencies

Inputs

Outputs

Testing the Model

Training the Model

Owner

Full-featured Decision Trees and Random Forests learner.

PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INTERSPEECH 2020)

GPU-Accelerated Deep Learning Library in Python

Self-Supervised Collision Handling via Generative 3D Garment Models for Virtual Try-On

State-of-the-art language models can match human performance on many tasks

*ObjDetApp* deploys a pytorch model for object detection

MinkLoc++: Lidar and Monocular Image Fusion for Place Recognition

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers

Unofficial Alias-Free GAN implementation. Based on rosinality's version with expanded training and inference options.

COPA-SSE contains crowdsourced explanations for the Balanced COPA dataset

Reference code for the paper CAMS: Color-Aware Multi-Style Transfer.

FPSAutomaticAiming——基于YOLOV5的FPS类游戏自动瞄准AI

2021-MICCAI-Progressively Normalized Self-Attention Network for Video Polyp Segmentation

Streaming over lightweight data transformations

Generating Videos with Scene Dynamics

MISSFormer: An Effective Medical Image Segmentation Transformer

The official MegEngine implementation of the ICCV 2021 paper: GyroFlow: Gyroscope-Guided Unsupervised Optical Flow Learning

Convert Apple NeuralHash model for CSAM Detection to ONNX.

Semi-supervised semantic segmentation needs strong, varied perturbations

Data, model training, and evaluation code for "PubTables-1M: Towards a universal dataset and metrics for training and evaluating table extraction models".

ObjDetApp deploys a pytorch model for object detection