Implementation of SETR model, Original paper: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.

Last update: Dec 16, 2022

Overview

SETR - Pytorch

Since the original paper (Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.) has no official code,I implemented SETR-Progressive UPsampling(SETR-PUP) using pytorch.

Original paper: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.

Vit

The Vit model is also implemented, and you can use it for image classification.

Usage SETR

from SETR.transformer_seg import SETRModel
import torch 

if __name__ == "__main__":
    net = SETRModel(patch_size=(32, 32), 
                    in_channels=3, 
                    out_channels=1, 
                    hidden_size=1024, 
                    num_hidden_layers=8, 
                    num_attention_heads=16, 
                    decode_features=[512, 256, 128, 64])
    t1 = torch.rand(1, 3, 256, 256)
    print("input: " + str(t1.shape))
    
    # print(net)
    print("output: " + str(net(t1).shape))

If the output size is (1, 1, 256, 256), the code runs successfully.

Usage Vit

from SETR.transformer_seg import Vit
import torch 

if __name__ == "__main__":
    model = Vit(patch_size=(7, 7), 
                    in_channels=1, 
                    out_class=10, 
                    hidden_size=1024, 
                    num_hidden_layers=1, 
                    num_attention_heads=16)
    print(model)
    t1 = torch.rand(1, 1, 28, 28)
    print("input: " + str(t1.shape))

    print("output: " + str(model(t1).shape))

The output shape is (1, 10).

current examples

task_mnist: The simplest example, using the Vit model to classify the minst dataset.
task_car_seg: The example is sample segmentation task. data download: https://www.kaggle.com/c/carvana-image-masking-challenge/data

More examples will be updated later.

Implementation of SETR model, Original paper: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.

Related tags

Overview

SETR - Pytorch

Vit

Usage SETR

Usage Vit

current examples

more

Owner

zhaohu xing

abess: Fast Best-Subset Selection in Python and R

Ray tracing of a Schwarzschild black hole written entirely in TensorFlow.

PyTorch implementation of Rethinking Positional Encoding in Language Pre-training

CCPD: a diverse and well-annotated dataset for license plate detection and recognition

Python package for downloading ECMWF reanalysis data and converting it into a time series format.

TensorFlow implementation of "Variational Inference with Normalizing Flows"

Library for converting from RGB / GrayScale image to base64 and back.

The repo of Feedback Networks, CVPR17

Neighborhood Contrastive Learning for Novel Class Discovery

ncnn is a high-performance neural network inference framework optimized for the mobile platform

DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)

Use graph-based analysis to re-classify stocks and to improve Markowitz portfolio optimization

Code for "FGR: Frustum-Aware Geometric Reasoning for Weakly Supervised 3D Vehicle Detection", ICRA 2021

An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers"

YOLOv5 in PyTorch > ONNX > CoreML > TFLite

VIL-100: A New Dataset and A Baseline Model for Video Instance Lane Detection (ICCV 2021)

Implementation of Shape and Electrostatic similarity metric in deepFMPO.

Partial implementation of ODE-GAN technique from the paper Training Generative Adversarial Networks by Solving Ordinary Differential Equations

Playable Video Generation

Graph InfoClust: Leveraging cluster-level node information for unsupervised graph representation learning