MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

Last update: Jan 16, 2022

Related tags

Overview

MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

Using mixup data augmentation as reguliraztion and tuning the hyper parameters of ResNet 50 models to achieve 94.57% test accuracy on CIFAR-10 Dataset. Link to paper

network	error %
resnet-50	6.97
resnet-110	6.61
resnet-164	5.93
resnet-1001	7.61
This method	5.43

Overview

Change the wandb api key to valid api key.
Python 3.8 and pytorch 1.9 (works on older versions as well)
main.py is to train model
sweep.py and sweep_config.py are for hyperparameter optimization for experiment tracking wandb is used please change api key
pred.py is to run the trained model on the custom data. (Appropriately provide model paths)

Important

If you want to run sweep.py then you must use wandb apikey and if you want to run main.py use wandb to log the experiment for comparision else comment out wandb part.

Training


# Start training with:

python main.py (Added --run_name optional argument for better tracking experiments)

  

# You can manually resume the training with:

python main.py --resume --lr=0.01

Hyperparameters sweep


# Start sweep with:

python sweep.py

  

# Provide appropriate hyperparameters range in sweep_config.py (Config written in py file to use the power of math package for sweep configs)

Running on custom dataset


# Convert traget data of (N*32*32*3) into (N*3*32*32) shape and pass through the model:

python pred.py (Provide path of the saved models)

Other files

mixup.py contains functions to claculate loss of mixup predictions as you cant use nn.CrossEntropyLoss
utils.py contain somehelper functions
dataloader.py is a torch class based dataloader of our train data (CIFAR-10 data)
private_loader.py is a torch class based dataloader of our private data.
Transformations are done using torchtransforms in main.py and sweep.py files depending on usage.

MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

Related tags

Overview

MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

Overview

Important

Training

Hyperparameters sweep

Running on custom dataset

Other files

Owner

Bhanu

Generate text captions for images from their CLIP embeddings. Includes PyTorch model code and example training script.

YOLOv5 + ROS2 object detection package

Bringing Characters to Life with Computer Brains in Unity

It helps user to learn Pick-up lines and share if he has a better one

Minimal deep learning library written from scratch in Python, using NumPy/CuPy.

Source code for paper "Deep Diffusion Models for Robust Channel Estimation", TBA.

A3C LSTM Atari with Pytorch plus A3G design

Exponential Graph is Provably Efficient for Decentralized Deep Training

ADOP: Approximate Differentiable One-Pixel Point Rendering

Source code for CAST - Crisis Domain Adaptation Using Sequence-to-sequence Transformers (Accepted to ISCRAM 2021, CorePaper).

ROS support for Velodyne 3D LIDARs

Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection from Point Clouds (CVPR 2022)

Docker containers of baseline agents for the Crafter environment

[CVPR'21] Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-view Transformation

Official PyTorch implementation of "Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image", ICCV 2019

An open source python library for automated feature engineering

To SMOTE, or not to SMOTE?

Pure python implementations of popular ML algorithms.

RE3: State Entropy Maximization with Random Encoders for Efficient Exploration

A copy of Ares that costs 30 fucking dollars.