FCN-semantic-segmentation

Simple end-to-end semantic segmentation using fully convolutional networks [1]. Takes a pretrained 34-layer ResNet [2], removes the fully connected layers, and adds transposed convolution layers with skip connections from lower layers. Initialises upsampling convolutions with bilinear interpolation filters and zeros the final (classification) layer.

Uses an independent cross-entropy loss per class. Trained with SGD with momentum, plus weight decay only on convolutional weights. Calculates and plots class-wise and mean intersection-over-union. Checkpoints the network every epoch.

Note: This code does not achieve great results (achieves ~40 IoU fairly quickly, but converges there). Contributions to fix this are welcome! The goal of this repo is to provide strong, simple and efficient baselines for semantic segmentation using the FCN method, so this shouldn't be restricted to using ResNet 34 etc.

Requirements

Instructions

Install all of the required software. To feasibly run the training, CUDA is needed. The crop size and batch size can be tailored to your GPU memory (the default crop and batch sizes use ~10GB of GPU RAM).
Register on the Cityscapes website to access the dataset.
Download and extract the training/validation RGB data (leftImg8bit_trainvaltest) and ground truth data (gtFine_trainvaltest).
Run python main.py <options>.

First a Dataset object is set up, returning the RGB inputs, one-hot targets (for independent classification) and label targets. During training, the images are randomly cropped and horizontally flipped. Testing calculates IoU scores and produces a subset of coloured predictions that match the coloured ground truth.

References

[1] Fully convolutional networks for semantic segmentation
[2] Deep Residual Learning for Image Recognition

Fully convolutional networks for semantic segmentation

Related tags

Overview

FCN-semantic-segmentation

Requirements

Instructions

References

Owner

Kai Arulkumaran

Element selection for functional materials discovery by integrated machine learning of atomic contributions to properties

Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining

make ASCII Art by Deep Learning

Music Source Separation; Train & Eval & Inference piplines and pretrained models we used for 2021 ISMIR MDX Challenge.

Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!

A pytorch-based real-time segmentation model for autonomous driving

Code for the paper "Regularizing Variational Autoencoder with Diversity and Uncertainty Awareness"

Repo for our ICML21 paper Unsupervised Learning of Visual 3D Keypoints for Control

Clockwork Variational Autoencoder

Evolving neural network parameters in JAX.

Adaptive Prototype Learning and Allocation for Few-Shot Segmentation (CVPR 2021)

TumorInsight is a Brain Tumor Detection and Classification model built using RESNET50 architecture.

Toward Realistic Single-View 3D Object Reconstruction with Unsupervised Learning from Multiple Images (ICCV 2021)

PyTorch Implementation for AAAI'21 "Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection"

Official code release for "Learned Spatial Representations for Few-shot Talking-Head Synthesis" ICCV 2021

Code for our paper at ECCV 2020: Post-Training Piecewise Linear Quantization for Deep Neural Networks

VOS: Learning What You Don’t Know by Virtual Outlier Synthesis

This repository contains answers of the Shopify Summer 2022 Data Science Intern Challenge.

This git repo contains the implementation of my ML project on Heart Disease Prediction

Code basis for the paper "Camera Condition Monitoring and Readjustment by means of Noise and Blur" (2021)