DeepLab is a state-of-art deep learning system for semantic image segmentation built on top of Caffe.

Last update: Nov 14, 2022

Related tags

Deep Learning DeepLab-Context

Overview

DeepLab

Introduction

DeepLab is a state-of-art deep learning system for semantic image segmentation built on top of Caffe.

It combines densely-computed deep convolutional neural network (CNN) responses with densely connected conditional random fields (CRF).

This distribution provides a publicly available implementation for the key model ingredients first reported in an arXiv paper, accepted in revised form as conference publication to the ICLR-2015 conference. It also contains implementations for methods supporting model learning using only weakly labeled examples, described in a second follow-up arXiv paper. Please consult and consider citing the following papers:

@inproceedings{chen14semantic,
  title={Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs},
  author={Liang-Chieh Chen and George Papandreou and Iasonas Kokkinos and Kevin Murphy and Alan L Yuille},
  booktitle={ICLR},
  url={http://arxiv.org/abs/1412.7062},
  year={2015}
}

@article{papandreou15weak,
  title={Weakly- and Semi-Supervised Learning of a DCNN for Semantic Image Segmentation},
  author={George Papandreou and Liang-Chieh Chen and Kevin Murphy and Alan L Yuille},
  journal={arxiv:1502.02734},
  year={2015}
}

Note that if you use the densecrf implementation, please consult and cite the following paper:

@inproceedings{KrahenbuhlK11,
  title={Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials},
  author={Philipp Kr{\"{a}}henb{\"{u}}hl and Vladlen Koltun},
  booktitle={NIPS},      
  year={2011}
}

Performance

DeepLab currently achieves 73.9% on the challenging PASCAL VOC 2012 image segmentation task -- see the leaderboard.

Pre-trained models

We have released several trained models and corresponding prototxt files at here. Please check it for more model details.

The best model among the released ones yields 73.6% on PASCAL VOC 2012 test set.

Python wrapper requirements

Install wget library for python

sudo pip install wget

Change DATA_ROOT to point to the PASCAL images
To use the mat_read_layer and mat_write_layer, please download and install matio.

Running the code

python run.py

FAQ

Check FAQ if you have some problems while using the code.

DeepLab is a state-of-art deep learning system for semantic image segmentation built on top of Caffe.

Related tags

Overview

DeepLab

Introduction

Performance

Pre-trained models

Python wrapper requirements

Running the code

FAQ

Owner

Ali

DatasetGAN: Efficient Labeled Data Factory with Minimal Human Effort

DeepLearning Anomalies Detection with Bluetooth Sensor Data

AI Face Mesh: This is a simple face mesh detection program based on Artificial intelligence.

Pytorch implementation for "Open Compound Domain Adaptation" (CVPR 2020 ORAL)

MEDS: Enhancing Memory Error Detection for Large-Scale Applications

Official Keras Implementation for UNet++ in IEEE Transactions on Medical Imaging and DLMIA 2018

Contrastive Learning for Many-to-many Multilingual Neural Machine Translation(mCOLT/mRASP2), ACL2021

TigerLily: Finding drug interactions in silico with the Graph.

Official implementation of the NRNS paper: No RL, No Simulation: Learning to Navigate without Navigating

Segmentation vgg16 fcn - cityscapes

[CVPR2022] Representation Compensation Networks for Continual Semantic Segmentation

Code for paper [ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot] (ICCV 2021, oral))

Differentiable Factor Graph Optimization for Learning Smoothers @ IROS 2021

A curated list of neural network pruning resources.

One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking

External Attention Network

[ICCV'21] Neural Radiance Flow for 4D View Synthesis and Video Processing

Implementation of "Generalizable Neural Performer: Learning Robust Radiance Fields for Human Novel View Synthesis"

Reviatalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation

Official Implementation of Few-shot Visual Relationship Co-localization