Hierarchical Few-Shot Generative Models

Last update: Dec 12, 2022

Overview

Hierarchical Few-Shot Generative Models

Giorgio Giannone, Ole Winther

This repo contains code and experiments for the paper Hierarchical Few-Shot Generative Models.

Website: https://georgosgeorgos.github.io/hierarchical-few-shot-generative-models/

Settings

Clone the repo:

git clone https://github.com/georgosgeorgos/hierarchical-few-shot-generative-models
cd hierarchical-few-shot-generative-models

Create and activate the conda env:

conda env create -f environment.yml
conda activate hfsgm

The code has been tested on Ubuntu 18.04, Python 3.6 and CUDA 11.3

We use wandb for visualization. The first time you run the code you will need to login.

Data

We provide preprocessed Omniglot dataset.

From the main folder, copy the data in data/omniglot_ns/:

wget https://github.com/georgosgeorgos/hierarchical-few-shot-generative-models/releases/download/Omniglot/omni_train_val_test.pkl

For CelebA you need to download the dataset from here.

Dataset

In dataset we provide utilities to process and augment datasets in the few-shot setting. Each dataset is a large collection of small sets. Sets can be created dynamically. The dataset/base.py file collects basic info about the datasets. For binary datasets (omniglot_ns.py) we augment using flipping and rotations. For RGB datasets (celeba.py) we use only flipping.

Experiment

In experiment we implement scripts for model evaluation, experiments and visualizations.

attention.py - visualize attention weights and heads for models with learnable aggregations (LAG).
cardinality.py - compute ELBOs for different input set size: [1, 2, 5, 10, 20].
classifier_mnist.py - few-shot classifiers on MNIST.
kl_layer.py - compute KL over z and c for each layer in latent space.
marginal.py - compute approximate log-marginal likelihood with 1K importance samples.
refine_vis.py - visualize refined samples.
sampling_rgb.py - reconstruction, conditional, refined, unconditional sampling for RGB datasets.
sampling_transfer.py - reconstruction, conditional, refined, unconditional sampling on transfer datasets.
sampling.py - reconstruction, conditional, refined, unconditional sampling for binary datasets.
transfer.py - compute ELBOs on MNIST, DoubleMNIST, TripleMNIST.

Model

In model we implement baselines and model variants.

base.py - base class for all the models.
vae.py - Variational Autoencoder (VAE).
ns.py - Neural Statistician (NS).
tns.py - NS with learnable aggregation (NS-LAG).
cns.py - NS with convolutional latent space (CNS).
ctns.py - CNS with learnable aggregation (CNS-LAG).
hfsgm.py - Hierarchical Few-Shot Generative Model (HFSGM).
thfsgm.py - HFSGM with learnable aggregation (HFSGM-LAG).
chfsgm.py - HFSGM with convolutional latent space (CHFSGM).
cthfsgm.py - CHFSGM with learnable aggregation (CHFSGM-LAG).

Script

Scripts used for training the models in the paper.

To run a CNS on Omniglot:

sh script/main_cns.sh GPU_NUMBER omniglot_ns

Train a model

To train a generic model run:

python main.py --name {VAE, NS, CNS, CTNS, CHFSGM, CTHFSGM} \
               --model {vae, ns, cns, ctns, chfsgm, cthfsgm} \
               --augment \
               --dataset omniglot_ns \
               --likelihood binary \
               --hidden-dim 128 \
               --c-dim 32 \
               --z-dim 32 \
               --output-dir /output \
               --alpha-step 0.98 \
               --alpha 2 \
               --adjust-lr \
               --scheduler plateau \
               --sample-size {2, 5, 10} \
               --sample-size-test {2, 5, 10} \
               --num-classes 1 \
               --learning-rate 1e-4 \
               --epochs 400 \
               --batch-size 100 \
               --tag (optional string)

If you do not want to save logs, use the flag --dry_run. This flag will call utils/trainer_dry.py instead of trainer.py.

Acknowledgments

A lot of code and ideas borrowed from:

You might also like...

Official PyTorch Implementation of Hypercorrelation Squeeze for Few-Shot Segmentation, arXiv 2021

Hypercorrelation Squeeze for Few-Shot Segmentation This is the implementation of the paper "Hypercorrelation Squeeze for Few-Shot Segmentation" by Juh

165 Dec 28, 2022

Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch

Cross Transformers - Pytorch (wip) Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch Install $ pip install cross-t

40 Dec 22, 2022

Official repository for Few-shot Image Generation via Cross-domain Correspondence (CVPR '21)

Few-shot Image Generation via Cross-domain Correspondence Utkarsh Ojha, Yijun Li, Jingwan Lu, Alexei A. Efros, Yong Jae Lee, Eli Shechtman, Richard Zh

251 Dec 11, 2022

[CVPR 2021] Few-shot 3D Point Cloud Semantic Segmentation

Few-shot 3D Point Cloud Semantic Segmentation Created by Na Zhao from National University of Singapore Introduction This repository contains the PyTor

117 Dec 27, 2022

Few-Shot Graph Learning for Molecular Property Prediction

Few-shot Graph Learning for Molecular Property Prediction Introduction This is the source code and dataset for the following paper: Few-shot Graph Lea

94 Dec 12, 2022

Few-shot Relation Extraction via Bayesian Meta-learning on Relation Graphs

Few-shot Relation Extraction via Bayesian Meta-learning on Relation Graphs This is an implemetation of the paper Few-shot Relation Extraction via Baye

36 Nov 22, 2022

The implementation of PEMP in paper "Prior-Enhanced Few-Shot Segmentation with Meta-Prototypes"

Prior-Enhanced network with Meta-Prototypes (PEMP) This is the PyTorch implementation of PEMP. Overview of PEMP Meta-Prototypes & Adaptive Prototypes

8 Oct 14, 2021

Code and data of the ACL 2021 paper: Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision

MetaAdaptRank This repository provides the implementation of meta-learning to reweight synthetic weak supervision data described in the paper Few-Shot

5 Jun 16, 2022

Adaptive Prototype Learning and Allocation for Few-Shot Segmentation (CVPR 2021)

ASGNet The code is for the paper "Adaptive Prototype Learning and Allocation for Few-Shot Segmentation" (accepted to CVPR 2021) [arxiv] Overview data/

91 Dec 23, 2022

Hierarchical Few-Shot Generative Models

Related tags

Overview

Hierarchical Few-Shot Generative Models

Settings

Data

Dataset

Experiment

Model

Script

Train a model

Acknowledgments

You might also like...

Official PyTorch Implementation of Hypercorrelation Squeeze for Few-Shot Segmentation, arXiv 2021

Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch

Official repository for Few-shot Image Generation via Cross-domain Correspondence (CVPR '21)

[CVPR 2021] Few-shot 3D Point Cloud Semantic Segmentation

Few-Shot Graph Learning for Molecular Property Prediction

Few-shot Relation Extraction via Bayesian Meta-learning on Relation Graphs

The implementation of PEMP in paper "Prior-Enhanced Few-Shot Segmentation with Meta-Prototypes"

Code and data of the ACL 2021 paper: Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision

Adaptive Prototype Learning and Allocation for Few-Shot Segmentation (CVPR 2021)

Releases(Omniglot)

Omniglot(Jun 9, 2021)

Owner

Giorgio Giannone

The Codebase for Causal Distillation for Language Models.

Barlow Twins and HSIC

The BCNet related data and inference model.

G-NIA model from "Single Node Injection Attack against Graph Neural Networks" (CIKM 2021)

ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks

TSDF++: A Multi-Object Formulation for Dynamic Object Tracking and Reconstruction

A scanpy extension to analyse single-cell TCR and BCR data.

This tool converts a Nondeterministic Finite Automata (NFA) into a Deterministic Finite Automata (DFA)

A learning-based data collection tool for human segmentation

Spectrum is an AI that uses machine learning to generate Rap song lyrics

Camera-caps - Examine the camera capabilities for V4l2 cameras

Introducing neural networks to predict stock prices

Benchmark for Answering Existential First Order Queries with Single Free Variable

eXPeditious Data Transfer

Hyperopt for solving CIFAR-100 with a convolutional neural network (CNN) built with Keras and TensorFlow, GPU backend

MATLAB codes of the book "Digital Image Processing Fourth Edition" converted to Python

A gesture recognition system powered by OpenPose, k-nearest neighbours, and local outlier factor.

The repo for reproducing Seed-driven Document Ranking for Systematic Reviews: A Reproducibility Study

codes for IKM (arXiv2021, Submitted to IEEE Trans)

Implementation of "Deep Implicit Templates for 3D Shape Representation"