Unofficial Implementation of MLP-Mixer, Image Classification Model

Last update: Dec 05, 2022

Related tags

Deep Learning MLP-Mixer

Overview

MLP-Mixer

Unoffical Implementation of MLP-Mixer, easy to use with terminal. Train and test easly.

https://arxiv.org/abs/2105.01601

MLP-Mixer is an architecture based exclusively on multi-layer perceptrons (MLPs).

According to paper, Model offers:

Better accuracy than CNNs and Transformers
Lower time complexity than CNNs and Transformers
Lower parameters than CNNs and Transformers

Quick Start

Clone the repo and install the requirements.txt in a Python>=3.8 environment.

git clone https://github.com/Oguzhanercan/MLP-Mixer
cd MLP-Mixer
pip install -r requirements.txt

Dataset

There are 2 options for dataset. You can use pre-defined datasets listed below

CIFAR10
Mnist
Fashion Mnist

or you can use your own dataset. Organize your folder structure as:

      data---
            |
            --0
               |
                --img0.png
                .
                .
                --img9999.png
            |
            -- 1
                |
                --img0.png
                .
                .
                --img9999.png
            .
            .

0 and 1 represents folders that contains images belongs only one particular class. There is no limit for classes or images.

Train

Open a terminal at the same directory of clone. Then run the code below.

python main.py --mode train --dataset CIFAR10 --save True --device cuda --epochs 20 --valid_per 0.2

You can customize the model hyperparameters, all arguments listed below "Arguments:

dataset
train_path
test_path
batch_size
im_size
valid_per
epochs
learning_rate
beta1
beta2
n_classes
cuda
-eveluate_per_epoch
save_model
model_path

Unofficial Implementation of MLP-Mixer, Image Classification Model

Related tags

Overview

MLP-Mixer

Unoffical Implementation of MLP-Mixer, easy to use with terminal. Train and test easly.

Quick Start

Dataset

Train

Custom dataset mode should include following arguments: mode,dataset,train_path,n_classes,im_size

Owner

Oğuzhan Ercan

ConvMixer unofficial implementation

Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection

PyTorch Implementation of Region Similarity Representation Learning (ReSim)

CondenseNet V2: Sparse Feature Reactivation for Deep Networks

The Codebase for Causal Distillation for Language Models.

The fundamental package for scientific computing with Python.

DziriBERT: a Pre-trained Language Model for the Algerian Dialect

[CVPR 2022 Oral] Versatile Multi-Modal Pre-Training for Human-Centric Perception

Fast and simple implementation of RL algorithms, designed to run fully on GPU.

FwordCTF 2021 Infrastructure and Source code of Web/Bash challenges

DR-GAN: Automatic Radial Distortion Rectification Using Conditional GAN in Real-Time

Real-time pose estimation accelerated with NVIDIA TensorRT

Using CNN to mimic the driver based on training data from Torcs

Predicting a person's gender based on their weight and height

Reinforcement learning models in ViZDoom environment

Keras-retinanet - Keras implementation of RetinaNet object detection.

Weighted QMIX: Expanding Monotonic Value Function Factorisation

Released code for Objects are Different: Flexible Monocular 3D Object Detection, CVPR21

Codes for CIKM'21 paper 'Self-Supervised Graph Co-Training for Session-based Recommendation'.

IOT: Instance-wise Layer Reordering for Transformer Structures