Mae segmentation - Reproduction of semantic segmentation using masked autoencoder (mae)

Last update: Dec 17, 2022

Overview

ADE20k Semantic segmentation with MAE

Getting started

Install the mmsegmentation library and some required packages.

pip install mmcv-full==1.3.0 mmsegmentation==0.11.0
pip install scipy timm==0.3.2

Install apex for mixed-precision training

git clone https://github.com/NVIDIA/apex
cd apex
pip install -v --disable-pip-version-check --no-cache-dir --global-option="--cpp_ext" --global-option="--cuda_ext" ./

Follow the guide in mmseg to prepare the ADE20k dataset.

Fine-tuning for Reproducing Results of MAE ViT-Base

Command:

tools/dist_train.sh configs/mae/upernet_mae_base_12_512_slide_160k_ade20k.py 8 --seed 0  --options model.pretrained=https://dl.fbaipublicfiles.com/mae/pretrain/mae_pretrain_vit_base.pth

Expected results log(paper results: 48.1 mIoU):

+--------+-------+-------+-------+
| Scope  | mIoU  | mAcc  | aAcc  |
+--------+-------+-------+-------+
| global | 48.15 | 58.99 | 83.05 |
+--------+-------+-------+-------+

Evaluation

Command format:

tools/dist_test.sh  <CONFIG_PATH> <CHECKPOINT_PATH> <NUM_GPUS> --eval mIoU

Acknowledgment

This code is built using the mmsegmentation library, Timm library, the Swin repository, XCiT, SETR, BEiT and the MAE repository.

Mae segmentation - Reproduction of semantic segmentation using masked autoencoder (mae)

Related tags

Overview

ADE20k Semantic segmentation with MAE

Getting started

Fine-tuning for Reproducing Results of MAE ViT-Base

Evaluation

Acknowledgment

Owner

Official implementation of Representer Point Selection via Local Jacobian Expansion for Post-hoc Classifier Explanation of Deep Neural Networks and Ensemble Models at NeurIPS 2021

Discriminative Region Suppression for Weakly-Supervised Semantic Segmentation

[AAAI 2021] MVFNet: Multi-View Fusion Network for Efficient Video Recognition

Code repository for the paper "Tracking People with 3D Representations"

Code and data for the paper "Hearing What You Cannot See"

An open source machine learning library for performing regression tasks using RVM technique.

Code for paper entitled "Improving Novelty Detection using the Reconstructions of Nearest Neighbours"

DimReductionClustering - Dimensionality Reduction + Clustering + Unsupervised Score Metrics

Run containerized, rootless applications with podman

Experiments for distributed optimization algorithms

PyTorch implementation of PSPNet segmentation network

Normal Learning in Videos with Attention Prototype Network

This is a file about Unet implemented in Pytorch

A task-agnostic vision-language architecture as a step towards General Purpose Vision

PyTorch implementation of ICLR 2022 paper PiCO: Contrastive Label Disambiguation for Partial Label Learning

FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

Python and Julia in harmony.

Re-TACRED: Addressing Shortcomings of the TACRED Dataset

Tutorial in Python targeted at Epidemiologists. Will discuss the basics of analysis in Python 3

Code for the tech report Toward Training at ImageNet Scale with Differential Privacy