MAE segmentation - Reproduction of semantic segmentation using a masked autoencoder (MAE)

Last update: Dec 17, 2022

Overview

ADE20k Semantic segmentation with MAE

Getting started

Install the mmsegmentation library and some required packages.

pip install mmcv-full==1.3.0 mmsegmentation==0.11.0
pip install scipy timm==0.3.2
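Because the versions above are pinned, a mismatched environment is an easy source of import or config errors. The following is a minimal sketch, not part of this repository, that compares installed versions against the pins from the commands above. It assumes Python 3.8+ (for `importlib.metadata`); the helper names `installed_version` and `find_mismatches` are hypothetical.

```python
from importlib import metadata

# Pins copied from the pip commands above.
PINNED = {
    "mmcv-full": "1.3.0",
    "mmsegmentation": "0.11.0",
    "timm": "0.3.2",
}


def installed_version(name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(pinned, lookup=installed_version):
    """Map each package whose installed version differs from its pin
    to a tuple (expected_version, found_version_or_None)."""
    mismatches = {}
    for name, expected in pinned.items():
        found = lookup(name)
        if found != expected:
            mismatches[name] = (expected, found)
    return mismatches


if __name__ == "__main__":
    for name, (expected, found) in find_mismatches(PINNED).items():
        print(f"{name}: expected {expected}, found {found or 'not installed'}")
```

Running the script prints one line per package that is missing or at a different version, and prints nothing when the environment matches the pins.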

Install apex for mixed-precision training.

git clone https://github.com/NVIDIA/apex
cd apex
pip install -v --disable-pip-version-check --no-cache-dir --global-option="--cpp_ext" --global-option="--cuda_ext" ./

Follow the dataset preparation guide in mmsegmentation to prepare the ADE20k dataset.

Fine-tuning for Reproducing Results of MAE ViT-Base

Command:

tools/dist_train.sh configs/mae/upernet_mae_base_12_512_slide_160k_ade20k.py 8 --seed 0 --options model.pretrained=https://dl.fbaipublicfiles.com/mae/pretrain/mae_pretrain_vit_base.pth

Expected results log (paper result: 48.1 mIoU):

+--------+-------+-------+-------+
| Scope  | mIoU  | mAcc  | aAcc  |
+--------+-------+-------+-------+
| global | 48.15 | 58.99 | 83.05 |
+--------+-------+-------+-------+
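The three columns in the table are standard segmentation metrics, reported as percentages: mIoU (mean intersection over union across classes), mAcc (mean per-class pixel accuracy), and aAcc (overall pixel accuracy). The sketch below shows how they are commonly derived from a confusion matrix. It is an illustration under the usual definitions, not the mmsegmentation implementation; the function name `segmentation_metrics` and the toy matrix are hypothetical.

```python
def segmentation_metrics(confusion):
    """Compute mIoU, mAcc and aAcc as fractions in [0, 1].

    confusion[i][j] is the number of pixels whose ground-truth class is i
    and whose predicted class is j.
    """
    num_classes = len(confusion)
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(num_classes))

    ious, accs = [], []
    for i in range(num_classes):
        true_pos = confusion[i][i]
        gt_pixels = sum(confusion[i])                      # row sum
        pred_pixels = sum(confusion[j][i] for j in range(num_classes))  # column sum
        union = gt_pixels + pred_pixels - true_pos
        if union > 0:          # skip classes that never appear at all
            ious.append(true_pos / union)
        if gt_pixels > 0:      # skip classes absent from the ground truth
            accs.append(true_pos / gt_pixels)

    return {
        "mIoU": sum(ious) / len(ious),
        "mAcc": sum(accs) / len(accs),
        "aAcc": correct / total,
    }


# Toy two-class example (made-up counts, unrelated to the results above).
metrics = segmentation_metrics([[3, 1], [1, 5]])
print({name: round(100 * value, 2) for name, value in metrics.items()})
```

Multiplying each value by 100 gives numbers on the same scale as the table.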

Evaluation

Command format:

tools/dist_test.sh <CONFIG_PATH> <CHECKPOINT_PATH> <NUM_GPUS> --eval mIoU
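As an illustration of how the placeholders are filled in, the sketch below assembles the command from its parts. The config path is the one used for fine-tuning above; the checkpoint path `work_dirs/example/latest.pth` is a hypothetical placeholder, and the helper `build_eval_command` is not part of this repository. It assumes Python 3.8+ for `shlex.join`.

```python
import shlex


def build_eval_command(config_path, checkpoint_path, num_gpus, metric="mIoU"):
    """Return the evaluation command as an argument list, in the order
    <CONFIG_PATH> <CHECKPOINT_PATH> <NUM_GPUS> --eval <metric>."""
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    return [
        "tools/dist_test.sh",
        str(config_path),
        str(checkpoint_path),
        str(num_gpus),
        "--eval",
        metric,
    ]


cmd = build_eval_command(
    "configs/mae/upernet_mae_base_12_512_slide_160k_ade20k.py",
    "work_dirs/example/latest.pth",  # hypothetical checkpoint location
    8,
)
print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` from the repository root once the checkpoint path points to a real file.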

Acknowledgment

This code is built using the mmsegmentation library, Timm library, the Swin repository, XCiT, SETR, BEiT and the MAE repository.
