Models Supported: AlbUNet [18, 34, 50, 101, 152] (1D and 2D versions for Single and Multiclass Segmentation, Feature Extraction with supports for Deep Supervision and Guided Attention)

Last update: Nov 15, 2021

Overview

AlbUNet-1D-2D-Tensorflow-Keras

This repository contains 1D and 2D Signal Segmentation Model Builder for AlbUNet and several of its variants developed in Tensorflow-Keras. The code supports Deep Supervision, AutoEncoder mode, Guided Attention and other options. The segmentation models can be used for binary or multiclass segmentation, or for regression tasks.

Models supported [1]

AlbUNet18
AlbUNet34
AlbUNet50
AlbUNet101
AlbUNet152

AlbUNet

AlbUNet has a ResNet based Encoder and traditional UNet based Decoder, as shown in the Figure below for ALbUNet34, which uses ResNet34 as the Encoder.

AlbUNet Architecture

Supported Features

The speciality about this model is its flexibility, such as:

The user can choose any of the 5 available AlbUNet variants for either 1D or 2D Segmentation tasks.
The models can be used for Binary or Multi-Class Classification, or Regression type Segmentation tasks.
The models allow Deep Supervision [2] with flexibility during Segmentation.
The segmentation models can also be used as Autoencoders [3] for Feature Extraction.
The Segmentation Models can be Attention Guided [4].
Number of input kernel/filter, commonly known as the Width of the model can be varied.
Number of classes for Classification tasks and number of extracted features for Regression tasks can be varied.
Number of Channels in the Input Dataset can be varied.

Mentionable that the 2D version of AlbUNet can also be used in Transfer Learning from previously trained weights (e.g., ImageNet), just the encoder blocks should be replaced with the trained model layers.

References

[1] A. Shvets, V. Iglovikov, A. Rakhlin, and A. A. Kalinin, “Angiodysplasia detection and localization using deep convolutional neural networks,” arXiv.org, 21-Apr-2018. [Online]. Available: https://arxiv.org/abs/1804.08024. [2] Zhou, Z., Siddiquee, M., Tajbakhsh, N., & Liang, J. (2021). UNet++: Redesigning Skip Connections to Exploit Multiscale Features in Image Segmentation. Arxiv-vanity.com. Retrieved 30 August 2021, from https://www.arxiv-vanity.com/papers/1912.05074/.
[3] Zhou, Z., Siddiquee, M., Tajbakhsh, N., & Liang, J. (2021). UNet++: A Nested U-Net Architecture for Medical Image Segmentation. arXiv.org. Retrieved 30 August 2021, from https://arxiv.org/abs/1807.10165.
[4] M. Noori, A. Bahri, and K. Mohammadi, “Attention-guided version of 2D UNET for automatic brain tumor segmentation,” arXiv.org, 04-Apr-2020. [Online]. Available: https://arxiv.org/abs/2004.02009.

Models Supported: AlbUNet [18, 34, 50, 101, 152] (1D and 2D versions for Single and Multiclass Segmentation, Feature Extraction with supports for Deep Supervision and Guided Attention)

Related tags

Overview

AlbUNet-1D-2D-Tensorflow-Keras

Models supported [1]

AlbUNet

Supported Features

References

Owner

Sakib Mahmud

A configurable, tunable, and reproducible library for CTR prediction

TEDSummary is a speech summary corpus. It includes TED talks subtitle (Document), Title-Detail (Summary), speaker name (Meta info), MP4 URL, and utterance id

3D-aware GANs based on NeRF (arXiv).

The official implementation of Variable-Length Piano Infilling (VLI).

NAS-FCOS: Fast Neural Architecture Search for Object Detection (CVPR 2020)

Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

Learning and Building Convolutional Neural Networks using PyTorch

OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.

[CVPR 2021] Generative Hierarchical Features from Synthesizing Images

code for Grapadora research paper experimentation

A Learning-based Camera Calibration Toolbox

Computer Vision Script to recognize first person motion, developed as final project for the course "Machine Learning and Deep Learning"

A deep learning framework for historical document image analysis

Original code for "Zero-Shot Domain Adaptation with a Physics Prior"

The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.

PyTorch implementation of Super SloMo by Jiang et al.

A basic implementation of Layer-wise Relevance Propagation (LRP) in PyTorch.

Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation

Pairwise model for commonlit competition

Cookiecutter PyTorch Lightning