Learning where to learn - Gradient sparsity in meta and continual learning

Last update: Dec 09, 2022

Related tags

Overview

Learning where to learn - Gradient sparsity in meta and continual learning

In this paper, we investigate gradient sparsity found by MAML in various continual and few-shot learning scenarios.
Instead of only learning the initialization of neural network parameters, we additionally meta-learn parameters underneath a step function that stops gradient descent when smaller then 0.

We term this version Sparse-MAML - Link to the paper here.

Interestingly, we see that structured sparsity emerges in both the classic 4-layer ConvNet as well as a ResNet-12 for few-shot learning. This is accompanied by improved robustness and generalisation across many hyperparameters.

Note that Sparse-MAML is an extremely simple variant of MAML that possesses only the possibility to shut on/off training of specific parameters compared to proper gradient modulation.

This codebase implents the few-shot learning experiments that are presented in the paper. To reproduce the results in the paper, please follow these instructions:

Installation

#1. Install a conda env:

conda create -n sparse-MAML

#2. Activate the env:

source activate sparse-MAML

#3. Install anaconda:

conda install anaconda

#4. Install extra requiremetns (make sure you use the correct pip3):

pip3 install -r requirements.txt

#5. Run:

chmod u+x run_sparse_MAML.sh

#6. Execute:

./run_sparse_MAML.sh

Results

MiniImageNet Few-Shot	MAML	ANIL	BOIL	sparse-MAML	sparse-ReLU-MAML
5-way 5-shot \| ConvNet	63.15	61.50	66.45	67.03	64.84
5-way 1-shot \| ConvNet	48.07	46.70	49.61	50.35	50.39
5-way 5-shot \| ResNet12	69.36	70.03	70.50	70.02	73.01
5-way 1-shot \| ResNet12	53.91	55.25	-	55.02	56.39

BOIL results are taken from the original paper.

This code based is heavily build on top of torchmeta.

Learning where to learn - Gradient sparsity in meta and continual learning

Related tags

Overview

Learning where to learn - Gradient sparsity in meta and continual learning

Installation

Results

Owner

Johannes Oswald

Charsiu: A transformer-based phonetic aligner

Pixel Consensus Voting for Panoptic Segmentation (CVPR 2020)

Implementing DropPath/StochasticDepth in PyTorch

Implementation of gaze tracking and demo

Official code for 'Pixel-wise Energy-biased Abstention Learning for Anomaly Segmentationon Complex Urban Driving Scenes'

learning and feeling SLAM together with hands-on-experiments

Contrastive Language-Image Pretraining

Facial recognition project

Global-Local Attention for Emotion Recognition

Diverse graph algorithms implemented using JGraphT library.

Neural Architecture Search Powered by Swarm Intelligence 🐜

Deep Distributed Control of Port-Hamiltonian Systems

The official implementation of A Unified Game-Theoretic Interpretation of Adversarial Robustness.

Evolution Strategies in PyTorch

[ICML 2021, Long Talk] Delving into Deep Imbalanced Regression

Imposter-detector-2022 - HackED 2022 Team 3IQ - 2022 Imposter Detector

[v1 (ISBI'21) + v2] MedMNIST: A Large-Scale Lightweight Benchmark for 2D and 3D Biomedical Image Classification

Sequential Model-based Algorithm Configuration

HNECV: Heterogeneous Network Embedding via Cloud model and Variational inference

Unsupervised Feature Ranking via Attribute Networks.