MAE-keras

Unofficial Keras (TensorFlow) implementation of the MAE model described in 'Masked Autoencoders Are Scalable Vision Learners'. This work builds on https://keras.io/examples/vision/image_classification_with_vision_transformer/ and https://www.tensorflow.org/text/tutorials/transformer#positional_encoding

Currently, only pre-training mode is supported, but you can easily fine-tune the model using its encoder part. This is not a complete, flawless implementation, so its performance may differ from the paper's. If there is anything to fix, please make it right :) Thanks.
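As a rough illustration of the fine-tuning idea, here is a minimal sketch of reusing a pre-trained encoder with a new classification head. All names here (`build_finetune_model`, the toy encoder) are hypothetical placeholders, not this repository's actual API:

```python
import tensorflow as tf
from tensorflow import keras

# Hypothetical sketch: attach a classification head to a pre-trained MAE
# encoder that maps images to patch embeddings of shape (batch, patches, dim).
def build_finetune_model(mae_encoder, num_classes, image_size=32):
    inputs = keras.Input(shape=(image_size, image_size, 3))
    features = mae_encoder(inputs)                      # (batch, patches, dim)
    x = keras.layers.GlobalAveragePooling1D()(features) # pool over patches
    x = keras.layers.LayerNormalization()(x)
    outputs = keras.layers.Dense(num_classes, activation="softmax")(x)
    return keras.Model(inputs, outputs)

# Stand-in "encoder" for demonstration only; a real run would load the
# encoder sub-model from a pre-trained MAE checkpoint instead.
toy_encoder = keras.Sequential([
    keras.layers.Reshape((64, 48)),  # 32x32x3 -> 64 "patches" of dim 48
    keras.layers.Dense(16),
])
model = build_finetune_model(toy_encoder, num_classes=10)
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")
```

A lower learning rate (or freezing the encoder for the first epochs) is the usual starting point when fine-tuning a pre-trained backbone like this.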

✔️ Supported (DONE)
▫️ Sinusoidal positional encoding at both encoder and decoder inputs
▫️ (Random)Mask Token, Patch, PatchesToImages Layer
▫️ ImageReconstruction callback
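For reference, the sinusoidal positional encoding applied at the encoder and decoder inputs can be sketched as follows, following the formulation in the linked TensorFlow transformer tutorial (sine on even feature indices, cosine on odd ones); this is an illustrative NumPy version, not the repository's exact layer:

```python
import numpy as np

def positional_encoding(length, depth):
    """Sinusoidal positional encoding of shape (length, depth)."""
    positions = np.arange(length)[:, np.newaxis]   # (length, 1)
    dims = np.arange(depth)[np.newaxis, :]         # (1, depth)
    # Each pair of dimensions (2i, 2i+1) shares one frequency.
    angle_rates = 1.0 / np.power(10000.0, (2 * (dims // 2)) / depth)
    angles = positions * angle_rates               # (length, depth)
    pe = np.zeros((length, depth))
    pe[:, 0::2] = np.sin(angles[:, 0::2])          # even indices: sin
    pe[:, 1::2] = np.cos(angles[:, 1::2])          # odd indices: cos
    return pe

# e.g. 14x14 = 196 patches with a 64-dimensional embedding
pe = positional_encoding(length=196, depth=64)
```

Because the encoding is deterministic, it is added to the patch embeddings rather than learned, and the decoder applies it again after the mask tokens are re-inserted.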

✖️ Not Supported yet (TO DO)
▫️ Pre-trained model
▫️ Model test

Contact: ga06033@yonsei.ac.kr
