Code for "Universal inference meets random projections: a scalable test for log-concavity"

Last update: Nov 21, 2021

Overview

How to use this repository

This repository contains code to replicate the results of "Universal inference meets random projections: a scalable test for log-concavity" by Robin Dunn, Larry Wasserman, and Aaditya Ramdas.

Folder contents

batch_scripts: Contains SLURM batch scripts to run the simulations. Scripts are labeled by the figure for which their simulations produce data. These scripts run the code in sim_code, using the parameters in sim_params.
data: Output of simulations.
plot_code: Reads simulation outputs from data and reproduces all figures in the paper. Plots are saved to plots folder.
plots: Contains all plots in paper.
sim_code: R code to run simulations. Simulation output is saved to data folder.
sim_params: Parameters for simulations. Each row contains a single choice of parameters. The scripts in sim_code read in these files, and the scripts in batch_scripts loop through all choices of parameters.

How do I ...

Produce the simulations for a given figure?

In the batch_scripts folder, scripts are labeled by the figure for which they simulate data. Run all batch scripts corresponding to the figure of interest. The allocated run time is estimated from the choice of parameters for which the code has the longest run time. Many scripts will run faster than this time. The files in sim_code each contain progress bars to estimate the remaining run time. You may wish to start running these files outside of a batch submission to understand the run time on your computing system.

Alternatively, to run the code without using a job submission system, click on any .sh file. The Rscript lines can be run on a terminal, replacing $SLURM_ARRAY_TASK_ID with all of the indices in the batch array.

The simulation output will be stored in the data folder, with one dataset per choice of parameters. To combine these datasets into a single dataset (as they currently appear in data), run the code in sim_code/combine_datasets.R.

Example: batch_scripts/fig01_fully_NP_randproj.sh

This script reproduces the universal test simulations for Figure 1. To do this, it runs the R script at sim_code/fig01_fully_NP_randproj.R. It reads in the parameters from sim_params/fig01_fully_NP_randproj_params.csv. There are 30 sets of parameters in total. The results will be stored in the data folder, with names such as fig01_fully_NP_randproj_1.csv, ..., fig01_fully_NP_randproj_30.csv. To combine these files into a single .csv file, run the code at sim_code/combine_datasets.R.

Examine the code for a given simulation?

The R code in sim_code is labeled by the figures for which they simulate data. Click on all files corresponding to a given figure.

Reproduce a figure without rerunning the simulations?

The R scripts in plot_code are labeled by their corresponding plots. They read in the necessary simulated data from the data folder and output the figures to the plots folder.

Code for "Universal inference meets random projections: a scalable test for log-concavity"

Related tags

Overview

How to use this repository

Folder contents

How do I ...

Produce the simulations for a given figure?

Examine the code for a given simulation?

Reproduce a figure without rerunning the simulations?

Owner

Robin Dunn

This repository contains the code for the paper "PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization"

A PyTorch based deep learning library for drug pair scoring.

Official repository for the paper, MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding.

Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".

Instantaneous Motion Generation for Robots and Machines.

A hifiasm fork for metagenome assembly using Hifi reads.

RoIAlign & crop_and_resize for PyTorch

This is the official implementation for "Do Transformers Really Perform Bad for Graph Representation?".

A PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks" (KDD 2019).

[TPDS'21] COSCO: Container Orchestration using Co-Simulation and Gradient Based Optimization for Fog Computing Environments

A tight inclusion function for continuous collision detection

Fast (simple) spectral synthesis and emission-line fitting of DESI spectra.

The audio-video synchronization of MKV Container Format is exploited to achieve data hiding

MAg: a simple learning-based patient-level aggregation method for detecting microsatellite instability from whole-slide images

Learning Multiresolution Matrix Factorization and its Wavelet Networks on Graphs

Self-Supervised Methods for Noise-Removal

PyTorch implementation of Tacotron speech synthesis model.

Make differentially private training of transformers easy for everyone

Neural Scene Flow Fields using pytorch-lightning, with potential improvements

This is an official implementation for "PlaneRecNet".