A lossless neural compression framework built on top of JAX.

Last update: Mar 14, 2022

Related tags

Overview

Kompressor

Branch	CI	Coverage
`main` (active)
`main`
`development`

A neural compression framework built on top of JAX.

Install

setup.py assumes a compatible version of JAX and JAXLib are already installed. Automated build is tested for a cuda:11.1-cudnn8-runtime-ubuntu20.04 environment with jaxlib==0.1.76+cuda11.cudnn82.

git clone https://github.com/rosalindfranklininstitute/kompressor.git
cd kompressor
pip install -e .

# Run tests
python -m pytest --cov=src/kompressor tests/

Install & Run through Docker environment

Docker image for the Kompressor dependencies are provided in the quay.io/rosalindfranklininstitute/kompressor:main Quay.io image.

# Run the container for the Kompressor environment
docker run --rm quay.io/rosalindfranklininstitute/kompressor:main \
    python -m pytest --cov=/usr/local/kompressor/src/kompressor /usr/local/kompressor/tests

Install & Run through Singularity environment

Singularity image for the Kompressor dependencies are provided in the rosalindfranklininstitute/kompressor/kompressor:main cloud.sylabs.io image.

singularity pull library://rosalindfranklininstitute/kompressor/kompressor:main
singularity run kompressor_main.sif \
    python -m pytest --cov=/usr/local/kompressor/src/kompressor /usr/local/kompressor/tests

Comments

Refactor map tuples to dicts

Closes #14. Functions which currently return an ordered tuple of maps (lrmap, udmap, cmap, ...) now return keyed dictionaries { 'lrmap': lrmap, 'udmap': udmap, 'cmap': cmap, ... } so that order/usage is explicitly enforced.

List comprehensions over the tuples now use jax.tree_map and jax.tree_multimap to ensure key safety.

@GMW99, this will break the current implementation of the Metrics Callback class which iterates over a zip of the hardcoded map names and the maps tuple. This iteration can be replaced by iterating over maps.items() since it is now a dict already.
enhancement

opened by JossWhittle 1
Ensure jax.jit static_argnums is refactored to static_argnames

Functions that currently mark static_argnums=(0, 1, 2) should be updated to use the safer static_argnames=('tom', 'dick', 'harry') that is now available.
enhancement high priority

opened by JossWhittle 1
Update development examples
Splits docker image into JAX base image and Kompressor dependency and install image

JAX image installs JAX from source to ensure correct CUDA / CUDNN versions

Adjust setup.py to install dependencies from requirement.txt

Refactors a how submodules are imported (within the kom.image submodule. Need to check volumes matches)

Add kom.image.data submodule for dealing with tensorflow data pipelines

Fixed pooling in the total variation losses (used as metrics in the example notebooks)

Move all the encoding/decoding functions for the maps into a kom.mapping submodule

Add within-k and run-length metrics to kom.image.metrics for example notebooks

Added example notebooks for interacting with the maps and training a basic Haiku compression model

feature
opened by JossWhittle 0
Add mapping encode/decode functions for float32 data

Will need a bit of thinking to get right. We probably need to consider similar tricks that we used for applying Radix Sort on float32 data to make the compression numerically stable and portable between machines.
enhancement low priority

opened by JossWhittle 0
Add mapping encode/decode functions for uint32 data

Some of our data is uint32 volumes.

Will need to trace through the full compression implementation and make sure intermediate value dtypes are large enough to avoid uint32 overflow when needed.
enhancement low priority

opened by JossWhittle 0
Modify core encode decode functions to pass a dict to the prediction function
Currently the lowres inputs are passed directly to the prediction_fn as the only input.

Modify to accept a dict that has at least one key for the lowres input.

Provide boolean flag to also pass a positional encoding tensor along with the lowres which the model can use if needed.

Chunked encode decode will need to generate the correct chunks of the positional encoding for the current chunk.

Model can choose how to use positional encodings.

Image case would receive (B, H, W, 2) tensor containing the Y and X coordinates of each pixel in the trailing axis.

Volume case would receive (B, D, H, W, 3) tensor containing the Z, Y, and X coordinates of each voxel in the trailing axis.

enhancement high priority
opened by JossWhittle 0
Look at decompressing sliced chunks
Decompress sliced chunk of image or volume without needing to decompress the entire data element.

May require applying secondary compression in blocks to avoid needing to decompress the full level maps, only to apply the predictor to the target slice.

Instead unpack just the blocks needed for the slice then trim.

A kompressor (or stack of) trained to secondary compress the maps from the primary kompressor (or stack of) would be able to naturally handle slice chunked decoding.

Could such a secondary compressor be shared between levels? Between multiple kompressors in the primary stack?

experiment low priority
opened by JossWhittle 0
Look at compressing timeseries data
Experiment with implementing the 1D case for compressing signals.

Video as sequence of 2D frames using the 3D volume code directly.

Look at compressing within timestep using information from neighbouring timesteps without actually compressing (dropping frames) the temporal axis.

experiment low priority
opened by JossWhittle 0

Releases(v0.0.0)

v0.0.0(Feb 14, 2022)

Base Kompressor release to test CI pipeline.
Source code(tar.gz)
Source code(zip)

Owner

Rosalind Franklin Institute

The Rosalind Franklin Institute is dedicated to transforming life science through interdisciplinary research and technology development

GitHub Repository

Covid-19 Test AI (Deep Learning - NNs) Software. Accuracy is the %96.5, loss is the 0.09 :)

Covid-19 Test AI (Deep Learning - NNs) Software I developed a segmentation algorithm to understand whether Covid-19 Test Photos are positive or negati

28 Dec 04, 2021

👨‍💻 run nanosaur in simulation with Gazebo/Ingnition

🦕 👨‍💻 nanosaur_gazebo nanosaur The smallest NVIDIA Jetson dinosaur robot, open-source, fully 3D printable, based on ROS2 & Isaac ROS. Designed & ma

9 Jul 19, 2022

A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer from NNAISENSE.

PGPElib A mini library for Policy Gradients with Parameter-based Exploration [1] and friends. This library serves as a clean re-implementation of the

56 Jan 01, 2023

Image-to-Image Translation in PyTorch

CycleGAN and pix2pix in PyTorch New: Please check out contrastive-unpaired-translation (CUT), our new unpaired image-to-image translation model that e

19k Jan 07, 2023

Python implementation of a live deep learning based age/gender/expression recognizer

TUT live age estimator Python implementation of a live deep learning based age/gender/smile/celebrity twin recognizer. All components use convolutiona

80 Nov 21, 2022

Official implementation of "Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks", NeurIPS 2021.

PHDimGeneralization Official implementation of "Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks", NeurIPS 2021. Overvie

13 Nov 08, 2022

Implementation of CVPR 2021 paper "Spatially-invariant Style-codes Controlled Makeup Transfer"

SCGAN Implementation of CVPR 2021 paper "Spatially-invariant Style-codes Controlled Makeup Transfer" Prepare The pre-trained model is avaiable at http

118 Dec 12, 2022

Pytorch implementation of Deep Recursive Residual Network for Super Resolution (DRRN)

DRRN-pytorch This is an unofficial implementation of "Deep Recursive Residual Network for Super Resolution (DRRN)", CVPR 2017 in Pytorch. [Paper] You

192 Dec 12, 2022

Power Core Simulator!

Power Core Simulator Power Core Simulator is a simulator based off the Roblox game "Pinewood Builders Computer Core". In this simulator, you can choos

1 Nov 13, 2021

Official PyTorch implementation of MAAD: A Model and Dataset for Attended Awareness

MAAD: A Model for Attended Awareness in Driving Install // Datasets // Training // Experiments // Analysis // License Official PyTorch implementation

7 Oct 16, 2022

Code for paper Adaptively Aligned Image Captioning via Adaptive Attention Time

Adaptively Aligned Image Captioning via Adaptive Attention Time This repository includes the implementation for Adaptively Aligned Image Captioning vi

45 Aug 27, 2022

Official Pytorch implementation of "DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network" (CVPR'21)

DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network Pytorch implementation for our DivCo. We propose a simple ye

64 Nov 22, 2022

[ICCV 2021] A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation

45 Dec 12, 2022

MetaAvatar: Learning Animatable Clothed Human Models from Few Depth Images

MetaAvatar: Learning Animatable Clothed Human Models from Few Depth Images This repository contains the implementation of our paper MetaAvatar: Learni

96 Dec 13, 2022

Pyeventbus: a publish/subscribe event bus

pyeventbus pyeventbus is a publish/subscribe event bus for Python 2.7. simplifies the communication between python classes decouples event senders and

15 Apr 21, 2022

A SAT-based sudoku solver

SAT Sudoku solver A SAT-based Sudoku solver made in the context of a small project in the "Logic Problem Solving" class in the first year at the Polyt

5 Apr 15, 2022

PyTorch implementation for the paper Visual Representation Learning with Self-Supervised Attention for Low-Label High-Data Regime

Visual Representation Learning with Self-Supervised Attention for Low-Label High-Data Regime Created by Prarthana Bhattacharyya. Disclaimer: This is n

5 Nov 08, 2022

A machine learning malware analysis framework for Android apps.

🕵️ A machine learning malware analysis framework for Android apps. ☢️ DroidDetective is a Python tool for analysing Android applications (APKs) for p

77 Dec 27, 2022

Behavioral "black-box" testing for recommender systems

RecList RecList Free software: MIT license Documentation: https://reclist.readthedocs.io. Overview RecList is an open source library providing behavio

375 Dec 30, 2022

This repo is official PyTorch implementation of MobileHumanPose: Toward real-time 3D human pose estimation in mobile devices(CVPRW 2021).

Github Code of "MobileHumanPose: Toward real-time 3D human pose estimation in mobile devices" Introduction This repo is official PyTorch implementatio

203 Jan 05, 2023