Implementation of SETR model, Original paper: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.

Last update: Dec 16, 2022

Overview

SETR - Pytorch

Since the original paper (Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.) has no official code,I implemented SETR-Progressive UPsampling(SETR-PUP) using pytorch.

Original paper: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.

Vit

The Vit model is also implemented, and you can use it for image classification.

Usage SETR

from SETR.transformer_seg import SETRModel
import torch 

if __name__ == "__main__":
    net = SETRModel(patch_size=(32, 32), 
                    in_channels=3, 
                    out_channels=1, 
                    hidden_size=1024, 
                    num_hidden_layers=8, 
                    num_attention_heads=16, 
                    decode_features=[512, 256, 128, 64])
    t1 = torch.rand(1, 3, 256, 256)
    print("input: " + str(t1.shape))
    
    # print(net)
    print("output: " + str(net(t1).shape))

If the output size is (1, 1, 256, 256), the code runs successfully.

Usage Vit

from SETR.transformer_seg import Vit
import torch 

if __name__ == "__main__":
    model = Vit(patch_size=(7, 7), 
                    in_channels=1, 
                    out_class=10, 
                    hidden_size=1024, 
                    num_hidden_layers=1, 
                    num_attention_heads=16)
    print(model)
    t1 = torch.rand(1, 1, 28, 28)
    print("input: " + str(t1.shape))

    print("output: " + str(model(t1).shape))

The output shape is (1, 10).

current examples

task_mnist: The simplest example, using the Vit model to classify the minst dataset.
task_car_seg: The example is sample segmentation task. data download: https://www.kaggle.com/c/carvana-image-masking-challenge/data

More examples will be updated later.

Implementation of SETR model, Original paper: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.

Related tags

Overview

SETR - Pytorch

Vit

Usage SETR

Usage Vit

current examples

more

Owner

zhaohu xing

Nest - A flexible tool for building and sharing deep learning modules

Lung Pattern Classification for Interstitial Lung Diseases Using a Deep Convolutional Neural Network

MolRep: A Deep Representation Learning Library for Molecular Property Prediction

A library for preparing, training, and evaluating scalable deep learning hybrid recommender systems using PyTorch.

[LREC] MMChat: Multi-Modal Chat Dataset on Social Media

Official PyTorch implementation of "Rapid Neural Architecture Search by Learning to Generate Graphs from Datasets" (ICLR 2021)

Yolact-keras实例分割模型在keras当中的实现

A deep learning library that makes face recognition efficient and effective

code for our paper "Source Data-absent Unsupervised Domain Adaptation through Hypothesis Transfer and Labeling Transfer"

The project was to detect traffic signs, based on the Megengine framework.

Airborne magnetic data of the Osborne Mine and Lightning Creek sill complex, Australia

Code release for NeX: Real-time View Synthesis with Neural Basis Expansion

Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"

Code for the paper "Curriculum Dropout", ICCV 2017

Deep Q-network learning to play flappybird.

FastyAPI is a Stack boilerplate optimised for heavy loads.

BLEURT is a metric for Natural Language Generation based on transfer learning.

交互式标注软件，暂定名 iann

Code for the SIGGRAPH 2022 paper "DeltaConv: Anisotropic Operators for Geometric Deep Learning on Point Clouds."

Code for the tech report Toward Training at ImageNet Scale with Differential Privacy