Unofficial Implementation of MLP-Mixer, Image Classification Model

Last update: Dec 05, 2022

Related tags

Deep Learning MLP-Mixer

Overview

MLP-Mixer

Unoffical Implementation of MLP-Mixer, easy to use with terminal. Train and test easly.

https://arxiv.org/abs/2105.01601

MLP-Mixer is an architecture based exclusively on multi-layer perceptrons (MLPs).

According to paper, Model offers:

Better accuracy than CNNs and Transformers
Lower time complexity than CNNs and Transformers
Lower parameters than CNNs and Transformers

Quick Start

Clone the repo and install the requirements.txt in a Python>=3.8 environment.

git clone https://github.com/Oguzhanercan/MLP-Mixer
cd MLP-Mixer
pip install -r requirements.txt

Dataset

There are 2 options for dataset. You can use pre-defined datasets listed below

CIFAR10
Mnist
Fashion Mnist

or you can use your own dataset. Organize your folder structure as:

      data---
            |
            --0
               |
                --img0.png
                .
                .
                --img9999.png
            |
            -- 1
                |
                --img0.png
                .
                .
                --img9999.png
            .
            .

0 and 1 represents folders that contains images belongs only one particular class. There is no limit for classes or images.

Train

Open a terminal at the same directory of clone. Then run the code below.

python main.py --mode train --dataset CIFAR10 --save True --device cuda --epochs 20 --valid_per 0.2

You can customize the model hyperparameters, all arguments listed below "Arguments:

dataset
train_path
test_path
batch_size
im_size
valid_per
epochs
learning_rate
beta1
beta2
n_classes
cuda
-eveluate_per_epoch
save_model
model_path

Unofficial Implementation of MLP-Mixer, Image Classification Model

Related tags

Overview

MLP-Mixer

Unoffical Implementation of MLP-Mixer, easy to use with terminal. Train and test easly.

Quick Start

Dataset

Train

Custom dataset mode should include following arguments: mode,dataset,train_path,n_classes,im_size

Owner

Oğuzhan Ercan

Winners of DrivenData's Overhead Geopose Challenge

Package for extracting emotions from social media text. Tailored for financial data.

Offcial repository for the IEEE ICRA 2021 paper Auto-Tuned Sim-to-Real Transfer.

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt

A flexible submap-based framework towards spatio-temporally consistent volumetric mapping and scene understanding.

BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis

You Only 👀 One Sequence

[NeurIPS 2021] The PyTorch implementation of paper "Self-Supervised Learning Disentangled Group Representation as Feature"

7th place solution of Human Protein Atlas - Single Cell Classification on Kaggle

Code for the paper "Reinforcement Learning as One Big Sequence Modeling Problem"

Categorizing comments on YouTube into different categories.

More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval

NeurIPS'21 Tractable Density Estimation on Learned Manifolds with Conformal Embedding Flows

Regression Metrics Calculation Made easy for tensorflow2 and scikit-learn

CoANet: Connectivity Attention Network for Road Extraction From Satellite Imagery

BlockUnexpectedPackets - Preventing BungeeCord CPU overload due to Layer 7 DDoS attacks by scanning BungeeCord's logs

EMNLP 2021 paper The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers.

Prompt Tuning with Rules

A PaddlePaddle implementation of Time Interval Aware Self-Attentive Sequential Recommendation.