Keras Implementation of The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation by (Simon Jégou, Michal Drozdzal, David Vazquez, Adriana Romero, Yoshua Bengio)

Last update: Aug 30, 2022

Overview

The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation:

Work In Progress, Results can't be replicated yet with the models here

UPDATE: April 28th: Skip_Connection added thanks to the reviewers, check model model-tiramasu-67-func-api.py

feel free to open issues for suggestions:)

Keras2 + TF used for the recent updates, which might cause with some confilict from previous version I had in here

What is The One Hundred Layers Tiramisu?

A state of art (as in Jan 2017) Semantic Pixel-wise Image Segmentation model that consists of a fully deep convolutional blocks with downsampling, skip-layer then to Upsampling architecture.
An extension of DenseNets to deal with the problem of semantic segmentation.

Fully Convolutional DensNet = (Dense Blocks + Transition Down Blocks) + (Bottleneck Blocks) + (Dense Blocks + Transition Up Blocks) + Pixel-Wise Classification layer

The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation (Simon Jégou, Michal Drozdzal, David Vazquez, Adriana Romero, Yoshua Bengio) arXiv:1611.09326 cs.CV

Requirements:

Keras==2.0.2
tensorflow-gpu==1.0.1
or just go ahead and do: pip install -r requirements.txt

Model Strucure:

DenseBlock: BatchNormalization + Activation [ Relu ] + Convolution2D + Dropout
TransitionDown: BatchNormalization + Activation [ Relu ] + Convolution2D + Dropout + MaxPooling2D
TransitionUp: Deconvolution2D (Convolutions Transposed)

Model Params:

RMSprop is used with Learnining Rete of 0.001 and weight decay 0.995
- However, using those got me nowhere, I switched to SGD and started tweaking the LR + Decay myself.
There are no details given about BatchNorm params, again I have gone with what the Original DenseNet paper had suggested.
Things to keep in mind perhaps:
- the weight inti: he_uniform (maybe change it around?)
- the regualzrazation too agressive?

Repo (explanation):

Download the CamVid Dataset as explained below:
- Use the data_loader.py to crop images to 224, 224 as in the paper implementation.
run model-tiramasu-67-func-api.py or python model-tirmasu-56.py for now to generate each models file.
run python train-tirmasu.py to start training:
- Saves best checkpoints for the model and data_loader included for the CamVidDataset
helper.py contains two methods normalized and one_hot_it, currently for the CamVid Task

Dataset:

In a different directory run this to download the dataset from original Implementation.
- git clone [email protected]:alexgkendall/SegNet-Tutorial.git
- copy the /CamVid to here, or change the DataPath in data_loader.py to the above directory
The run python data_loader.py to generate these two files:
- /data/train_data.npz/ and /data/train_label.npz
- This will make it easy to process the model over and over, rather than waiting the data to be loaded into memory.

Experiments:

Models	Acc	Loss	Notes
FC-DenseNet 67			150 Epochs, RMSPROP

To Do:

[x] FC-DenseNet 103
[x] FC-DenseNet 56
[x] FC-DenseNet 67
[ ] Replicate Test Accuracy CamVid Task
[ ] Replicate Test Accuracy GaTech Dataset Task
[ ] Requirements

Original Results Table:

Keras Implementation of The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation by (Simon Jégou, Michal Drozdzal, David Vazquez, Adriana Romero, Yoshua Bengio)

Related tags

Overview

The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation:

The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation (Simon Jégou, Michal Drozdzal, David Vazquez, Adriana Romero, Yoshua Bengio) arXiv:1611.09326 cs.CV

Requirements:

Model Strucure:

Model Params:

Repo (explanation):

Dataset:

To Do:

Owner

Yad Konrad

Catch-all collection of generative art made using processing

Code to reproduce the results for Statistically Robust Neural Network Classification, published in UAI 2021

ObsPy: A Python Toolbox for seismology/seismological observatories.

Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network

UAV-Networks-Routing is a Python simulator for experimenting routing algorithms and mac protocols on unmanned aerial vehicle networks.

Datasets for new state-of-the-art challenge in disentanglement learning

This YoloV5 based model is fit to detect people and different types of land vehicles, and displaying their density on a fitted map, according to their coordinates and detected labels.

QKeras: a quantization deep learning library for Tensorflow Keras

Second-order Attention Network for Single Image Super-resolution (CVPR-2019)

Codebase for testing whether hidden states of neural networks encode discrete structures.

Code for the USENIX 2017 paper: kAFL: Hardware-Assisted Feedback Fuzzing for OS Kernels

Data Preparation, Processing, and Visualization for MoVi Data

[LREC] MMChat: Multi-Modal Chat Dataset on Social Media

Code for "The Intrinsic Dimension of Images and Its Impact on Learning" - ICLR 2021 Spotlight

PyTorch implementation of MulMON

Course materials for Fall 2021 "CIS6930 Topics in Computing for Data Science" at New College of Florida

Non-Official Pytorch implementation of "Face Identity Disentanglement via Latent Space Mapping" https://arxiv.org/abs/2005.07728 Using StyleGAN2 instead of StyleGAN

Accelerated NLP pipelines for fast inference on CPU and GPU. Built with Transformers, Optimum and ONNX Runtime.

Implementation of Nalbach et al. 2017 paper.

Physics-Aware Training (PAT) is a method to train real physical systems with backpropagation.