Code for generating the figures in the paper "Capacity of Group-invariant Linear Readouts from Equivariant Representations: How Many Objects can be Linearly Classified Under All Possible Views?"

Last update: Nov 22, 2022

Overview

Code for running simulations for the paper

"Capacity of Group-invariant Linear Readouts from Equivariant Representations:
How Many Objects can be Linearly Classified Under All Possible Views?",

by Matthew Farrell, Blake Bordelon, Shubhendu Trivedi, and Cengiz Pehlevan.


Note that the file models/vgg.py contains copyright statements for
the original authors and modifiers of the script.

The python packages used for the simulations are contained in
environment.yml (this may include extra packages that are not necessary).


To generate Figure 1, run

python manifold_plots.py

This script is fairly simple and self-explanatory.


To generate Figures 2 and 3, run

python plot_cnn_capacity.py

At the bottom of the plot_cnn_capacity.py script, the plotting function
is called for different panels. Comment out lines to generate specific
figures. This script searches for a match with sets of parameters defined
in cnn_capacity_params.py. To modify parameters used for simulations,
modify the dictionaries in cnn_capacity_params.py or define your own
parameter sets. For a description of different parameter options,
see the docstring for the function cnn_capacity.get_capacity.

The simulations take quite a lot of time to run, even
with parallelization. Also a word of warning that
the simulations take a lot of memory (~100GB for n_cores=5).
To speed things up and reduce memory usage, one can set
perceptron_style=efficient or pool_over_group=True, or reduce n_dichotomies.
One can also choose to set seeds to seeds = [3] in plot_cnn_capacity.py.


cnn_capacity_utils.py contains utility functions. The VGG model can be found
in models/vgg.py. The direct sum (aka "grid cell") convolutional network model
can be found in models/gridcellconv.py The code for generating datasets can be
found in datasets.py.


The code was modified and superficially refactored in preparation for
releasing to the public. The simulations haven't been thoroughly tested after
this refactoring so it's not 100% guaranteed that the code is correct (though
it doesn't appear to throw errors). Fingers crossed that everything works
the way it should.

The development of this code was supported by the Harvard Data Science Initiative.

Code for generating the figures in the paper "Capacity of Group-invariant Linear Readouts from Equivariant Representations: How Many Objects can be Linearly Classified Under All Possible Views?"

Related tags

Overview

Owner

Matthew Farrell

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

This is an official pytorch implementation of Lite-HRNet: A Lightweight High-Resolution Network.

A Kitti Road Segmentation model implemented in tensorflow.

Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.

DeepFaceLive - Live Deep Fake in python, Real-time face swap for PC streaming or video calls

MutualGuide is a compact object detector specially designed for embedded devices

Modular Gaussian Processes

Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System

Image Captioning on google cloud platform based on iot

This repository includes the code of the sequence-to-sequence model for discontinuous constituent parsing described in paper Discontinuous Grammar as a Foreign Language.

WTTE-RNN a framework for churn and time to event prediction

Users can free try their models on SIDD dataset based on this code

Implementation of "With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition, BMVC, 2021" in PyTorch

Official repository of the paper "GPR1200: A Benchmark for General-PurposeContent-Based Image Retrieval"

Code for AutoNL on ImageNet (CVPR2020)

Degree-Quant: Quantization-Aware Training for Graph Neural Networks.

The code for Expectation-Maximization Attention Networks for Semantic Segmentation (ICCV'2019 Oral)

Ready-to-use code and tutorial notebooks to boost your way into few-shot image classification.

Unrolled Generative Adversarial Networks

Bootstrapped Representation Learning on Graphs