A lossless neural compression framework built on top of JAX.

Last update: Mar 14, 2022

Related tags

Overview

Kompressor

Branch	CI	Coverage
`main` (active)
`main`
`development`

A neural compression framework built on top of JAX.

Install

setup.py assumes a compatible version of JAX and JAXLib are already installed. Automated build is tested for a cuda:11.1-cudnn8-runtime-ubuntu20.04 environment with jaxlib==0.1.76+cuda11.cudnn82.

git clone https://github.com/rosalindfranklininstitute/kompressor.git
cd kompressor
pip install -e .

# Run tests
python -m pytest --cov=src/kompressor tests/

Install & Run through Docker environment

Docker image for the Kompressor dependencies are provided in the quay.io/rosalindfranklininstitute/kompressor:main Quay.io image.

# Run the container for the Kompressor environment
docker run --rm quay.io/rosalindfranklininstitute/kompressor:main \
    python -m pytest --cov=/usr/local/kompressor/src/kompressor /usr/local/kompressor/tests

Install & Run through Singularity environment

Singularity image for the Kompressor dependencies are provided in the rosalindfranklininstitute/kompressor/kompressor:main cloud.sylabs.io image.

singularity pull library://rosalindfranklininstitute/kompressor/kompressor:main
singularity run kompressor_main.sif \
    python -m pytest --cov=/usr/local/kompressor/src/kompressor /usr/local/kompressor/tests

Comments

Refactor map tuples to dicts

Closes #14. Functions which currently return an ordered tuple of maps (lrmap, udmap, cmap, ...) now return keyed dictionaries { 'lrmap': lrmap, 'udmap': udmap, 'cmap': cmap, ... } so that order/usage is explicitly enforced.

List comprehensions over the tuples now use jax.tree_map and jax.tree_multimap to ensure key safety.

@GMW99, this will break the current implementation of the Metrics Callback class which iterates over a zip of the hardcoded map names and the maps tuple. This iteration can be replaced by iterating over maps.items() since it is now a dict already.
enhancement

opened by JossWhittle 1
Ensure jax.jit static_argnums is refactored to static_argnames

Functions that currently mark static_argnums=(0, 1, 2) should be updated to use the safer static_argnames=('tom', 'dick', 'harry') that is now available.
enhancement high priority

opened by JossWhittle 1
Update development examples
Splits docker image into JAX base image and Kompressor dependency and install image

JAX image installs JAX from source to ensure correct CUDA / CUDNN versions

Adjust setup.py to install dependencies from requirement.txt

Refactors a how submodules are imported (within the kom.image submodule. Need to check volumes matches)

Add kom.image.data submodule for dealing with tensorflow data pipelines

Fixed pooling in the total variation losses (used as metrics in the example notebooks)

Move all the encoding/decoding functions for the maps into a kom.mapping submodule

Add within-k and run-length metrics to kom.image.metrics for example notebooks

Added example notebooks for interacting with the maps and training a basic Haiku compression model

feature
opened by JossWhittle 0
Add mapping encode/decode functions for float32 data

Will need a bit of thinking to get right. We probably need to consider similar tricks that we used for applying Radix Sort on float32 data to make the compression numerically stable and portable between machines.
enhancement low priority

opened by JossWhittle 0
Add mapping encode/decode functions for uint32 data

Some of our data is uint32 volumes.

Will need to trace through the full compression implementation and make sure intermediate value dtypes are large enough to avoid uint32 overflow when needed.
enhancement low priority

opened by JossWhittle 0
Modify core encode decode functions to pass a dict to the prediction function
Currently the lowres inputs are passed directly to the prediction_fn as the only input.

Modify to accept a dict that has at least one key for the lowres input.

Provide boolean flag to also pass a positional encoding tensor along with the lowres which the model can use if needed.

Chunked encode decode will need to generate the correct chunks of the positional encoding for the current chunk.

Model can choose how to use positional encodings.

Image case would receive (B, H, W, 2) tensor containing the Y and X coordinates of each pixel in the trailing axis.

Volume case would receive (B, D, H, W, 3) tensor containing the Z, Y, and X coordinates of each voxel in the trailing axis.

enhancement high priority
opened by JossWhittle 0
Look at decompressing sliced chunks
Decompress sliced chunk of image or volume without needing to decompress the entire data element.

May require applying secondary compression in blocks to avoid needing to decompress the full level maps, only to apply the predictor to the target slice.

Instead unpack just the blocks needed for the slice then trim.

A kompressor (or stack of) trained to secondary compress the maps from the primary kompressor (or stack of) would be able to naturally handle slice chunked decoding.

Could such a secondary compressor be shared between levels? Between multiple kompressors in the primary stack?

experiment low priority
opened by JossWhittle 0
Look at compressing timeseries data
Experiment with implementing the 1D case for compressing signals.

Video as sequence of 2D frames using the 3D volume code directly.

Look at compressing within timestep using information from neighbouring timesteps without actually compressing (dropping frames) the temporal axis.

experiment low priority
opened by JossWhittle 0

Releases(v0.0.0)

v0.0.0(Feb 14, 2022)

Base Kompressor release to test CI pipeline.
Source code(tar.gz)
Source code(zip)

Owner

Rosalind Franklin Institute

The Rosalind Franklin Institute is dedicated to transforming life science through interdisciplinary research and technology development

GitHub Repository

A Structured Self-attentive Sentence Embedding

Structured Self-attentive sentence embeddings Implementation for the paper A Structured Self-Attentive Sentence Embedding, which was published in ICLR

488 Nov 28, 2022

Point Cloud Registration using Representative Overlapping Points.

Point Cloud Registration using Representative Overlapping Points (ROPNet) Abstract 3D point cloud registration is a fundamental task in robotics and c

36 Dec 16, 2022

An official PyTorch implementation of the TKDE paper "Self-Supervised Graph Representation Learning via Topology Transformations".

Self-Supervised Graph Representation Learning via Topology Transformations This repository is the official PyTorch implementation of the following pap

2 Oct 31, 2022

On Evaluation Metrics for Graph Generative Models

On Evaluation Metrics for Graph Generative Models Authors: Rylee Thompson, Boris Knyazev, Elahe Ghalebi, Jungtaek Kim, Graham Taylor This is the offic

13 Jan 07, 2023

Graph-Refined Convolutional Network for Multimedia Recommendation with Implicit Feedback

Graph-Refined Convolutional Network for Multimedia Recommendation with Implicit Feedback This is our Pytorch implementation for the paper: Yinwei Wei,

17 Jun 10, 2022

Tensors and neural networks in Haskell

Hasktorch Hasktorch is a library for tensors and neural networks in Haskell. It is an independent open source community project which leverages the co

920 Jan 04, 2023

A tiny, friendly, strong baseline code for Person-reID (based on pytorch).

Pytorch ReID Strong, Small, Friendly A tiny, friendly, strong baseline code for Person-reID (based on pytorch). Strong. It is consistent with the new

3.5k Jan 08, 2023

Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286

Pytorch-DPPO Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286 Using PPO with clip loss (from https

163 Dec 26, 2022

Real-world Anomaly Detection in Surveillance Videos- pytorch Re-implementation

Real world Anomaly Detection in Surveillance Videos : Pytorch RE-Implementation This repository is a re-implementation of "Real-world Anomaly Detectio

62 Dec 08, 2022

Code for our ICCV 2021 Paper "OadTR: Online Action Detection with Transformers".

66 Dec 15, 2022

A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.

224 Jan 04, 2023

Step by Step on how to create an vision recognition model using LOBE.ai, export the model and run the model in an Azure Function

3 Mar 30, 2022

Official code for the paper "Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks".

Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks This repository contains the official code for the

11 Dec 16, 2022

Some toy examples of score matching algorithms written in PyTorch

toy_gradlogp This repo implements some toy examples of the following score matching algorithms in PyTorch: ssm-vr: sliced score matching with variance

21 Dec 26, 2022

Automate issue discovery for your projects against Lightning nightly and releases.

Automated Testing for Lightning EcoSystem Projects Automate issue discovery for your projects against Lightning nightly and releases. You get CPUs, Mu

41 Dec 24, 2022

Multi-Agent Reinforcement Learning (MARL) method to learn scalable control polices for multi-agent target tracking.

scalableMARL Scalable Reinforcement Learning Policies for Multi-Agent Control CD. Hsu, H. Jeong, GJ. Pappas, P. Chaudhari. "Scalable Reinforcement Lea

17 Nov 17, 2022

Pytorch Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic

Pytorch Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic [Paper] [Colab is coming soon] Approach Example Usage To r

170 Jan 03, 2023

Codes and models of NeurIPS2021 paper - DominoSearch: Find layer-wise fine-grained N:M sparse schemes from dense neural networks

DominoSearch This is repository for codes and models of NeurIPS2021 paper - DominoSearch: Find layer-wise fine-grained N:M sparse schemes from dense n

11 Sep 10, 2022

This is the official implementation of the paper "Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation".

ObjProp Introduction This is the official implementation of the paper "Object Propagation via Inter-Frame Attentions for Temporally Stable Video Insta

6 May 03, 2022

A PyTorch implementation of SIN: Superpixel Interpolation Network

SIN: Superpixel Interpolation Network This is is a PyTorch implementation of the superpixel segmentation network introduced in our PRICAI-2021 paper:

6 Sep 28, 2022