Seg-Torch for Image Segmentation with Torch

This work was sparked by my personal research on simple segmentation methods based on deep learning. It is the harvest of two great predecessors;

https://github.com/e-lab/ENet-training
https://github.com/fedor-chervinskii/segnet-torch
- (real project page) http://mi.eng.cam.ac.uk/projects/segnet/

However this code includes radical differences (such as data loading, augmentation, memory optimization) and it has more generic type of implementation suitable for use in any custom project. You only need to modify data-loader files data/custom-gen.lua and data/custom.lua.

Be warned this is susceptible to bugs. Any pull request is appreciated.

Check train_scripts/ for example execution.

Models

SegNet: Very simple encoder-decoder network, segmenting end2end
EroNet: Very similar but it chops Batch-Normalization and uses ELU activation. It is lower in accuracy but faster in training.

Example Results

exp_model/ includes a proof of concept on CamVid dataset. If you compare the results with the real-project this implementation has higher values interestingly (at least for me) :) .

Model will be shared on Dropbox, as soon as I find some time to do so.

Seg-Torch for Image Segmentation with Torch

Related tags

Overview

Seg-Torch for Image Segmentation with Torch

Models

Example Results

Owner

Eren Gölge

SeqTR: A Simple yet Universal Network for Visual Grounding

A collection of pre-trained StyleGAN2 models trained on different datasets at different resolution.

3.8% and 18.3% on CIFAR-10 and CIFAR-100

A library for preparing, training, and evaluating scalable deep learning hybrid recommender systems using PyTorch.

Video-Music Transformer

This is official implementaion of paper "Token Shift Transformer for Video Classification".

An example of time series augmentation methods with Keras

unofficial pytorch implementation of RefineGAN

FcaNet: Frequency Channel Attention Networks

Latent Execution for Neural Program Synthesis

Official implementation of "Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks", NeurIPS 2021.

Hso-groupie - A pwnable challenge in Real World CTF 4th

Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at [email protected]

Code of Puregaze: Purifying gaze feature for generalizable gaze estimation, AAAI 2022.

Vignette is a face tracking software for characters using osu!framework.

This toolkit provides codes to download and pre-process the SLUE datasets, train the baseline models, and evaluate SLUE tasks.

BoxInst: High-Performance Instance Segmentation with Box Annotations

Creating predictive checklists from data using integer programming.

Python package to add text to images, textures and different backgrounds

A Flow-based Generative Network for Speech Synthesis