Code for the paper "Curriculum Dropout", ICCV 2017

Last update: Jan 02, 2022

Overview

Curriculum Dropout

Dropout is a very effective way of regularizing neural networks. Stochastically "dropping out" units with a certain probability discourages over-specific co-adaptations of feature detectors, preventing overfitting and improving network generalization. However, we show that using a fixed dropout probability during training is a suboptimal choice. We propose a time scheduling for the probability of retaining neurons in the network. This induces an adaptive regularization scheme that smoothly increases the difficulty of the optimization problem. This idea of "starting easy" and adaptively increasing the difficulty of the learning problem has its roots in curriculum learning and allows one to train better models. Indeed, we prove that our optimization strategy implements a very general curriculum scheme, by gradually adding noise to both the input and intermediate feature representations in the network architecture. The method, named Curriculum Dropout, yields to better generalization.

Code

Each sub-folder (...in progress...) is named after the dataset analyzed and equipped with its own README. The provided code runs with Python 2.7 (should run with Python 3 as well, not tested). For the installation of tensorflow-gpu please refer to the website.

The following command should install the main dependencies on most Linux (Ubuntu) machines

sudo apt-get install python-dev python-pip && sudo pip install -r requirements.txt

Download and extract MNIST

The script download.sh downloads and extracts mnist. Deafult storing directory is ~/mnist.

sudo chmod a+x download.sh
./download.sh

Move the mnist/ folder wherever you like (e.g. /mydata) and then tell the training scripts where to find it

echo /mydata >> data_dir.txt

Reference

If you use this code as part of any published research, please acknowledge the following paper:

"Curriculum Dropout"
Pietro Morerio, Jacopo Cavazza, Riccardo Volpi, René Vidal and Vittorio Murino pdf

@InProceedings{Morerio2017dropout,
    title={Curriculum Dropout},
    author={Morerio, Pietro and Cavazza, Jacopo and Volpi, Riccardo and Vidal, Ren\'e and Murino, Vittorio},
    booktitle = {ICCV},
    year={2017}
}

License

This repository is released under the GNU GENERAL PUBLIC LICENSE.

Code for the paper "Curriculum Dropout", ICCV 2017

Related tags

Overview

Curriculum Dropout

Code

Download and extract MNIST

Reference

License

Owner

Pietro Morerio

Creating a custom CNN hypertunned architeture for the Fashion MNIST dataset with Python, Keras and Tensorflow.

Creating Multi Task Models With Keras

Deep Multi-Magnification Network for multi-class tissue segmentation of whole slide images

Fully convolutional deep neural network to remove transparent overlays from images

Open source annotation tool for machine learning practitioners.

PyTorch implementation of "Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning"

Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]

magiCARP: Contrastive Authoring+Reviewing Pretraining

[cvpr22] Perturbed and Strict Mean Teachers for Semi-supervised Semantic Segmentation

DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision

Code for the ICCV2021 paper "Personalized Image Semantic Segmentation"

《LightXML: Transformer with dynamic negative sampling for High-Performance Extreme Multi-label Text Classiﬁcation》(AAAI 2021) GitHub:

Code for the paper Relation Prediction as an Auxiliary Training Objective for Improving Multi-Relational Graph Representations (AKBC 2021).

3D ResNets for Action Recognition (CVPR 2018)

Scalable Optical Flow-based Image Montaging and Alignment

[NeurIPS 2021] "G-PATE: Scalable Differentially Private Data Generator via Private Aggregation of Teacher Discriminators"

Sign Language Transformers (CVPR'20)

SweiNet is an uncertainty-quantifying shear wave speed (SWS) estimator for ultrasound shear wave elasticity (SWE) imaging.

Taming Transformers for High-Resolution Image Synthesis

NudeNet: Neural Nets for Nudity Classification, Detection and selective censoring