SegFormer_Segmentation

The code uses SegFormer for Semantic Segmentation on Drone Dataset.
The details for the SegFormer can be obtained from the following cited paper and the drone dataset can be downloaded from the link below.
Alternatively, you can also download the dataset from Kaggle, the link is mentioned below.
Clone the repository and install all the packages mentioned in the requirement.txt file.
If you just want to infer the semantic segmentation, open the segformer_inf.py, change the image file name you want to test and run the code.
Make sure the trained model is in the model folder. You can download the model at https://drive.google.com/file/d/1zsHyMlGJCpPZrDB0v3ZeaogTcUULmUVB/view?usp=sharing.
Alternatively, you can train the model and save it, locally, by running segformer_train.py.

If you want to train the SegFormer on the drone dataset. Make sure that the directory structure is as follows:
root
| drone_dataset
|---images
|----|---test
|----|---train
|---mask
|----|---test
|----|---train
|---class_dict_seg.csv

Demo Inference

Citations and References

SegFormer
@article{xie2021segformer,
  title={SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers},
  author={Xie, Enze and Wang, Wenhai and Yu, Zhiding and Anandkumar, Anima and Alvarez, Jose M and Luo, Ping},
  journal={arXiv preprint arXiv:2105.15203},
  year={2021}
}

Drone Dataset
http://dronedataset.icg.tugraz.at/

https://www.kaggle.com/bulentsiyah/semantic-drone-dataset

The code uses SegFormer for Semantic Segmentation on Drone Dataset.

Related tags

Overview

SegFormer_Segmentation

Citations and References

Owner

Dr. Sander Ali Khowaja

PyTorch implementation of Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

Official PyTorch implementation of the paper "Self-Supervised Relational Reasoning for Representation Learning", NeurIPS 2020 Spotlight.

This repository consists of Blender python scripts and corresponding assets to generate variants of the CANDLE dataset

CLADE - Efficient Semantic Image Synthesis via Class-Adaptive Normalization (TPAMI 2021)

Implementation of "Efficient Regional Memory Network for Video Object Segmentation" (Xie et al., CVPR 2021).

PyTorch implementations of the paper: "DR.VIC: Decomposition and Reasoning for Video Individual Counting, CVPR, 2022"

Implementing Vision Transformer (ViT) in PyTorch

Source for the paper "Universal Activation Function for machine learning"

A TensorFlow implementation of DeepMind's WaveNet paper

Facial detection, landmark tracking and expression transfer library for Windows, Linux and Mac

In generative deep geometry learning, we often get many obj files remain to be rendered

NLG evaluation via Statistical Measures of Similarity: BaryScore, DepthScore, InfoLM

HINet: Half Instance Normalization Network for Image Restoration

RSC-Net: 3D Human Pose, Shape and Texture from Low-Resolution Images and Videos

Working demo of the Multi-class and Anomaly classification model using the CLIP feature space

Source code of "Hold me tight! Influence of discriminative features on deep network boundaries"

9th place solution in "Santa 2020 - The Candy Cane Contest"

The devkit of the nuScenes dataset.

Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point Clouds (CVPR 2022, Oral)

Code for the paper "A Study of Face Obfuscation in ImageNet"