SegFormer_Segmentation

The code uses SegFormer for Semantic Segmentation on Drone Dataset.
The details for the SegFormer can be obtained from the following cited paper and the drone dataset can be downloaded from the link below.
Alternatively, you can also download the dataset from Kaggle, the link is mentioned below.
Clone the repository and install all the packages mentioned in the requirement.txt file.
If you just want to infer the semantic segmentation, open the segformer_inf.py, change the image file name you want to test and run the code.
Make sure the trained model is in the model folder. You can download the model at https://drive.google.com/file/d/1zsHyMlGJCpPZrDB0v3ZeaogTcUULmUVB/view?usp=sharing.
Alternatively, you can train the model and save it, locally, by running segformer_train.py.

If you want to train the SegFormer on the drone dataset. Make sure that the directory structure is as follows:
root
| drone_dataset
|---images
|----|---test
|----|---train
|---mask
|----|---test
|----|---train
|---class_dict_seg.csv

Demo Inference

Citations and References

SegFormer
@article{xie2021segformer,
  title={SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers},
  author={Xie, Enze and Wang, Wenhai and Yu, Zhiding and Anandkumar, Anima and Alvarez, Jose M and Luo, Ping},
  journal={arXiv preprint arXiv:2105.15203},
  year={2021}
}

Drone Dataset
http://dronedataset.icg.tugraz.at/

https://www.kaggle.com/bulentsiyah/semantic-drone-dataset

The code uses SegFormer for Semantic Segmentation on Drone Dataset.

Related tags

Overview

SegFormer_Segmentation

Citations and References

Owner

Dr. Sander Ali Khowaja

pytorch implementation of the ICCV'21 paper "MVTN: Multi-View Transformation Network for 3D Shape Recognition"

Original Pytorch Implementation of FLAME: Facial Landmark Heatmap Activated Multimodal Gaze Estimation

🛠 All-in-one web-based IDE specialized for machine learning and data science.

This repository is for EMNLP 2021 paper: It is Not as Good as You Think! Evaluating Simultaneous Machine Translation on Interpretation Data

Code for ICMI2020 and ICMI2021 papers: "Studying Person-Specific Pointing and Gaze Behavior for Multimodal Referencing of Outside Objects from a Moving Vehicle" and "ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving Vehicle"

TensorFlow GNN is a library to build Graph Neural Networks on the TensorFlow platform.

'Aligned mixture of latent dynamical systems' (amLDS) for stimulus decoding probabilistic manifold alignment across animals. P. Herrero-Vidal et al. NeurIPS 2021 code.

neural image generation

Some bravo or inspiring research works on the topic of curriculum learning.

Revisiting Self-Training for Few-Shot Learning of Language Model.

Manipulation OpenAI Gym environments to simulate robots at the STARS lab

OptNet: Differentiable Optimization as a Layer in Neural Networks

A pytorch implementation of Detectron. Both training from scratch and inferring directly from pretrained Detectron weights are available.

Structure Information is the Key: Self-Attention RoI Feature Extractor in 3D Object Detection

NeuPy is a Tensorflow based python library for prototyping and building neural networks

Fully Convolutional Refined Auto Encoding Generative Adversarial Networks for 3D Multi Object Scenes

For holding anime-related object classification and detection models

A Pytorch implement of paper "Anomaly detection in dynamic graphs via transformer" (TADDY).

A library for Deep Learning Implementations and utils

Image Captioning on google cloud platform based on iot