Code for the paper "Curriculum Dropout", ICCV 2017

Last update: Jan 02, 2022

Overview

Curriculum Dropout

Dropout is a very effective way of regularizing neural networks. Stochastically "dropping out" units with a certain probability discourages over-specific co-adaptations of feature detectors, preventing overfitting and improving network generalization. However, we show that using a fixed dropout probability throughout training is a suboptimal choice. We propose a time schedule for the probability of retaining neurons in the network. This induces an adaptive regularization scheme that smoothly increases the difficulty of the optimization problem. The idea of "starting easy" and adaptively increasing the difficulty of the learning problem has its roots in curriculum learning and allows one to train better models. Indeed, we prove that our optimization strategy implements a very general curriculum scheme by gradually adding noise to both the input and the intermediate feature representations in the network architecture. The method, named Curriculum Dropout, yields better generalization.
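As a rough illustration of such a schedule, the sketch below lets the retain probability start at 1.0 (no dropout) and decay exponentially toward a target value. The exponential form, the function name `retain_probability`, and the parameters `target_retain` and `gamma` are assumptions made for this example. They are not taken from the code in this repository, so treat them as placeholders and check the paper and the per-dataset code for the actual settings.

```python
import math


def retain_probability(step, target_retain=0.5, gamma=1e-3):
    """Hypothetical curriculum schedule for the retain probability.

    Starts at 1.0 (every unit kept) at step 0 and decays smoothly
    toward `target_retain` as `step` grows. `gamma` controls how fast
    the regularization is switched on (assumed value, not tuned).
    """
    return (1.0 - target_retain) * math.exp(-gamma * step) + target_retain


if __name__ == "__main__":
    # Print the schedule at a few training steps.
    for step in (0, 1000, 10000):
        print(step, retain_probability(step))
```

In a training loop, the value returned for the current step would be fed to the dropout layers in place of a fixed retain probability.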

Code

Each sub-folder (...in progress...) is named after the dataset analyzed and comes with its own README. The provided code runs with Python 2.7; it should also run with Python 3, but this has not been tested. To install tensorflow-gpu, please refer to the TensorFlow website.

The following command should install the main dependencies on most Linux (Ubuntu) machines:

sudo apt-get install python-dev python-pip && sudo pip install -r requirements.txt

Download and extract MNIST

The script download.sh downloads and extracts MNIST. The default storage directory is ~/mnist.

sudo chmod a+x download.sh
./download.sh

Move the mnist/ folder wherever you like (e.g. /mydata), then tell the training scripts where to find it:

echo /mydata >> data_dir.txt
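For reference, here is a minimal sketch of how a script could read that file back. The helper name `read_data_dir` is hypothetical, and the choice to use the last non-empty line is an assumption made because `>>` appends to the file rather than overwriting it. The training scripts in the sub-folders may parse data_dir.txt differently.

```python
import os


def read_data_dir(path="data_dir.txt"):
    """Return the data directory stored in `path` (hypothetical helper).

    Assumes one directory per line and uses the last non-empty line,
    so the most recently appended entry wins.
    """
    with open(path) as f:
        lines = [line.strip() for line in f if line.strip()]
    if not lines:
        raise ValueError("no data directory listed in %s" % path)
    # Expand a leading ~ so that entries such as ~/mnist also work.
    return os.path.expanduser(lines[-1])
```

If data_dir.txt has collected stale entries, overwriting it with `>` instead of `>>` keeps a single line in the file.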

Reference

If you use this code as part of any published research, please acknowledge the following paper:

"Curriculum Dropout"
Pietro Morerio, Jacopo Cavazza, Riccardo Volpi, René Vidal and Vittorio Murino

@InProceedings{Morerio2017dropout,
    title={Curriculum Dropout},
    author={Morerio, Pietro and Cavazza, Jacopo and Volpi, Riccardo and Vidal, Ren\'e and Murino, Vittorio},
    booktitle = {ICCV},
    year={2017}
}

License

This repository is released under the GNU GENERAL PUBLIC LICENSE.


Owner

Pietro Morerio
