Code for "The Intrinsic Dimension of Images and Its Impact on Learning" - ICLR 2021 Spotlight

Last update: Dec 10, 2022

`dimensions`

Estimating the intrinsic dimensionality of image datasets

Code for: The Intrinsic Dimension of Images and Its Impact on Learning - Phillip Pope, Chen Zhu, Ahmed Abdelkader, Micah Goldblum, and Tom Goldstein (ICLR 2021, spotlight)

https://openreview.net/forum?id=XJk19XzGq2J

Environment

This code was developed in the following environment:

conda create -n dimensions python=3.6 jupyter matplotlib scikit-learn pytorch==1.5.0 torchvision cudatoolkit=10.2 -c pytorch

To generate new data of controlled dimensionality with GANs, you must install:

pip install pytorch-pretrained-biggan

To use the shortest-path method (Granata and Carnevale 2016), you must also compile the fast graph shortest path code gsp (written by Jake VdP and scikit-learn):

cd estimators/gsp
python setup.py install

Generate data of controlled dimensionality

python generate_data/gen_images.py \
  --num_samples 1000 \
  --class_name basenji \
  --latent_dim 16 \
  --batch_size 100 \
  --save_dir samples/basenji_16
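
As a rough illustration of what "controlled dimensionality" can mean here, the sketch below draws latent vectors that vary in only `latent_dim` coordinates and are zero elsewhere, so a generator fed with them can produce images on a manifold of dimension at most `latent_dim`. This is an assumption about the general approach, not a copy of gen_images.py: the function name `sample_restricted_latents`, the `full_dim=128` default, and the `seed` argument are all hypothetical.

```python
import numpy as np


def sample_restricted_latents(num_samples, latent_dim, full_dim=128, seed=0):
    """Draw latent vectors that vary in only `latent_dim` coordinates.

    Hypothetical helper: `full_dim=128` is an assumed size for the
    generator's noise vector, not a value taken from this repository.
    """
    if not 0 < latent_dim <= full_dim:
        raise ValueError("latent_dim must be in (0, full_dim]")
    rng = np.random.default_rng(seed)
    z = np.zeros((num_samples, full_dim), dtype=np.float32)
    # Only the first `latent_dim` coordinates are random; the rest stay at
    # zero, so the latent samples span at most `latent_dim` directions.
    z[:, :latent_dim] = rng.standard_normal((num_samples, latent_dim))
    return z


# Example: 1000 latent vectors with 16 free coordinates, mirroring the
# --num_samples and --latent_dim values in the command above.
latents = sample_restricted_latents(num_samples=1000, latent_dim=16)
```

Each row of `latents` would then be passed to the generator together with a class vector; that step is omitted here because it depends on the GAN library in use.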

Estimate dimension of generated samples

To run the MLE (Levina and Bickel) estimator on the synthetic GAN data generated above:

python main.py \
    --estimator mle \
    --k1 25 \
    --single-k \
    --eval-every-k \
    --average-inverse \
    --dset  samples/basenji_16 \
    --max_num_samples 1000 \
    --save-path results/basenji_16.json

Use --estimator to try different estimators.
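
For readers who want to see what the MLE estimator computes, here is a minimal NumPy sketch of the Levina and Bickel estimate from k-nearest-neighbor distances. It is not the code in main.py: the function name `mle_intrinsic_dim` and its arguments are hypothetical, `k` plays the role that `--k1` appears to play in the command above, and `average_inverse` is assumed to correspond to averaging the inverse per-point estimates (the MacKay and Ghahramani aggregation). The sketch assumes the data contains no duplicate points, since a zero neighbor distance would make the log ratio infinite.

```python
import numpy as np


def mle_intrinsic_dim(points, k=25, average_inverse=True):
    """Levina-Bickel MLE of intrinsic dimension (hypothetical helper).

    points: array of shape (n_samples, n_features), flattened images
    for example. Assumes no duplicate rows.
    """
    x = np.asarray(points, dtype=np.float64)
    n = x.shape[0]
    if not 2 <= k < n:
        raise ValueError("k must satisfy 2 <= k < number of points")

    # Pairwise squared Euclidean distances via the Gram matrix.
    sq_norms = np.sum(x * x, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * (x @ x.T)
    np.maximum(sq_dists, 0.0, out=sq_dists)  # clip round-off negatives
    np.fill_diagonal(sq_dists, np.inf)       # a point is not its own neighbor

    # Distances T_1 <= ... <= T_k to the k nearest neighbors of each point.
    knn = np.sqrt(np.sort(sq_dists, axis=1)[:, :k])

    # Per-point inverse estimate: (1 / (k - 1)) * sum_{j<k} log(T_k / T_j).
    log_ratios = np.log(knn[:, -1:] / knn[:, :-1])
    inv_dim = log_ratios.sum(axis=1) / (k - 1)

    if average_inverse:
        # Average the inverse estimates over points, then invert once.
        return float(1.0 / inv_dim.mean())
    # Otherwise average the per-point dimension estimates directly.
    return float(np.mean(1.0 / inv_dim))


# Example: points on a 3-dimensional linear subspace of a 20-dimensional space.
rng = np.random.default_rng(0)
demo = rng.standard_normal((500, 3)) @ rng.standard_normal((3, 20))
demo_estimate = mle_intrinsic_dim(demo, k=20)
```

The dense distance matrix keeps the sketch short but needs memory quadratic in the number of samples; for larger datasets a nearest-neighbor index would be the more practical choice.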

Citation

If you find our paper or code useful, please cite our paper:

@inproceedings{DBLP:conf/iclr/PopeZAGG21,
  author    = {Phillip Pope and
               Chen Zhu and
               Ahmed Abdelkader and
               Micah Goldblum and
               Tom Goldstein},
  title     = {The Intrinsic Dimension of Images and Its Impact on Learning},
  booktitle = {9th International Conference on Learning Representations, {ICLR} 2021,
               Virtual Event, Austria, May 3-7, 2021},
  publisher = {OpenReview.net},
  year      = {2021},
  url       = {https://openreview.net/forum?id=XJk19XzGq2J},
  timestamp = {Wed, 23 Jun 2021 17:36:39 +0200},
  biburl    = {https://dblp.org/rec/conf/iclr/PopeZAGG21.bib},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}

Acknowledgements

We gratefully acknowledge use of the following codebases when developing our dimensionality estimators:

We also thank Prof. Vishnu Boddeti for clarifying comments on the graph-distance estimator.

Disclaimer

This code is released as is. We will do our best to address questions and bugs, but cannot guarantee support.