PyTorch implementation of DCT fast weight RNNs

Last update: Dec 24, 2022

Overview

DCT based fast weights

This repository contains the official code for the paper: Training and Generating Neural Networks in Compressed Weight Space.

The main code includes:

DCT LSTM: LSTMs whose weights are encoded by discrete cosine transform (DCT).
DCT fast weight RNN: RNNs whose weights are encoded by DCT, and the DCT coefficients are parameterized by LSTMs.

The language modeling experiments reported in the paper were produced by porting code (with minor changes due to some clean-up) of this repository in a fork of this toolkit.

Requirements

torch_dct (can be installed via pip install torch_dct)
PyTorch with a version compatible with torch_dct.

Our experiments were conducted using PyTorch version 1.6.0 . More recent versions are apparently not compatible with torch_dct (at least at the time of writing this file). We recommend to run python custom_layer.py to check the compatibility.

References

If you make use of this toolkit for your experiments, please cite:

@inproceedings{irie2021training,
  title={Training and Generating Neural Networks in Compressed Weight Space},
  author={Kazuki Irie and J{\"u}rgen Schmidhuber},
  booktitle={Neural Compression: From Information Theory to Applications -- Workshop @ ICLR 2021},
  year={2021},
  address={Virtual only},
  month=may
}

PyTorch implementation of DCT fast weight RNNs

Related tags

Overview

DCT based fast weights

Requirements

References

Owner

Kazuki Irie

This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision".

A library for augmentation of a YOLO-formated dataset

ACL'2021: LM-BFF: Better Few-shot Fine-tuning of Language Models

An abstraction layer for mathematical optimization solvers.

In this tutorial, you will perform inference across 10 well-known pre-trained object detectors and fine-tune on a custom dataset. Design and train your own object detector.

Face Depixelizer based on "PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models" repository.

YoloV5 implemented by TensorFlow2 , with support for training, evaluation and inference.

WRENCH: Weak supeRvision bENCHmark

Guided Internet-delivered Cognitive Behavioral Therapy Adherence Forecasting

[ICLR 2022] DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR

Code for the ECCV2020 paper "A Differentiable Recurrent Surface for Asynchronous Event-Based Data"

Code for the paper "Multi-task problems are not multi-objective"

This repository will be a summary and outlook on all our open, medical, AI advancements.

GPT, but made only out of gMLPs

PyTorch code of "SLAPS: Self-Supervision Improves Structure Learning for Graph Neural Networks"

This folder contains the python code of UR5E's advanced forward kinematics model.

Code for: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification

Code for Generating Disentangled Arguments with Prompts: A Simple Event Extraction Framework that Works

Source code for "Understanding Knowledge Integration in Language Models with Graph Convolutions"

Constrained Language Models Yield Few-Shot Semantic Parsers