Memoized coduals - Shows that it is possible to implement reverse mode autodiff using a variation on the dual numbers called the codual numbers

Last update: Dec 19, 2022

Overview

The dual numbers can do efficient autodiff!

The codual numbers are a simple method of doing automatic differentiation in reverse mode. They contrast with the dual numbers which provide an easy way of doing automatic differentiation in forward mode. The difference between the two modes is that sometimes one is faster than the other.

The folklore appears to be that forward mode autodiff is easy to implement because it can be done using the beautiful algebra of dual numbers, while the same is assumed to not be the case for reverse mode. This repository presents a counterargument that a variant of the dual numbers – called the codual numbers – can be used to represent an implementation of reverse mode autodiff that is just as elegant and terse as can be done for forward mode. This idea was first suggested by Sandro Magi (pseudonym: Naasking).

This implementation of the codual numbers differs from Sandro Magi’s by using simple memoisation to eliminate the exponential worst-case behaviour he encountered. In Magi’s original implementation, this idea seems obscured, largely because the code was more effectful and therefore the opportunity for memoisation was less apparent. The memoisation is achieved using only one additional line of code!

This implementation should be simpler and more transparent than Magi’s, I hope. It also suggests that Magi’s reasoning behind the term “codual numbers” is perhaps misleading.

Definition of dual number and codual number

The codual numbers are the set

$\mathbb R \times \mathbb R,$

while the codual numbers are a subset of

$\mathbb R \times \mathbb R ^ {\mathbb R}$

where the second component is always a linear map.

A notation that’s used to write a dual number is $a + b \varepsilon$ , which stands for . Formally, $\varepsilon^2 = 0$ while $\varepsilon \neq 0$ .

The codual numbers may be represented using exactly the same notation as the dual numbers. They are no different than the dual numbers, except in how they’re represented on a computer! Using lambda calculus notation (which I assume you are familiar with) any dual number can be turned into the codual number $(a, \lambda k. \,kb)$ , and conversely every codual number can be turned into the dual number . The difference is merely one of data structure; we need a closure to represent the codual numbers.

The definition of an operation on the codual numbers can be inferred from its definition on the dual numbers. We demonstrate this using multiplication. For dual numbers, we may define multiplication by:

$(a,a') \times (b,b') = (ab, ab' + ba').$

For the codual numbers, we may use the correspondence $(a,b') \mapsto (a, \lambda k. \,kb)$ to get:

$(a,A) \times (b,B) = (ab, \lambda k. \,k\cdot(a\cdot B(1) + b\cdot A(1))),$

where by “ $\cdot$ ”, we mean multiplication of real numbers. Using the fact that and are linear maps, we can rearrange this to:

$(a,A) \times (b,B) = (ab, \lambda k. \,B(ak) + A(bk))).$

This is precisely how we define multiplication of codual numbers in the code.

Relationship with other autodiff strategies

It appears that there are three ways of doing reverse-mode autodiff, which correspond directly to the three “stages” of solving a problem using dynamic programming. See the table below:

Idea	Example	Corresponding autodiff algorithm
Unmemoised recursion	Exhibit A	Unmemoised coduals
Memoised recursion, or top-down dynamic programming	Exhibit B	Memoised coduals
Bottom-up dynamic programming	Exhibit C	Tape-based autodiff

This suggests that the tape-based approach can be derived from the coduals.

Exhibit A:

def fib(n):
    if n == 0 or n == 1:
        return n
    else:
        return fib(n-1) + fib(n-2)

Exhibit B:

from functools import cache

@cache
def fib(n):
    if n == 0 or n == 1:
        return n
    else:
        return fib(n-1) + fib(n-2)

Exhibit C:

def fib(n):
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a

Memoized coduals - Shows that it is possible to implement reverse mode autodiff using a variation on the dual numbers called the codual numbers

Related tags

Overview

The dual numbers can do efficient autodiff!

Definition of dual number and codual number

Relationship with other autodiff strategies

Owner

wlad

GANTheftAuto is a fork of the Nvidia's GameGAN

[Preprint] "Bag of Tricks for Training Deeper Graph Neural Networks A Comprehensive Benchmark Study" by Tianlong Chen, Kaixiong Zhou, Keyu Duan, Wenqing Zheng, Peihao Wang, Xia Hu, Zhangyang Wang

Cross-media Structured Common Space for Multimedia Event Extraction (ACL2020)

Code and data for ACL2021 paper Cross-Lingual Abstractive Summarization with Limited Parallel Resources.

Low Complexity Channel estimation with Neural Network Solutions

A Python training and inference implementation of Yolov5 helmet detection in Jetson Xavier nx and Jetson nano

TensorFlow Ranking is a library for Learning-to-Rank (LTR) techniques on the TensorFlow platform

Updated for TTS(CE) = Also Known as TTN V3. The code requires the first server to be 'ttn' protocol.

Official code for our EMNLP2021 Outstanding Paper MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks

[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias

Code for the paper Hybrid Spectrogram and Waveform Source Separation

GNEE - GAT Neural Event Embeddings

An OpenAI Gym environment for Super Mario Bros

An image processing project uses Viola-jones technique to detect faces and then use SIFT algorithm for recognition.

Pathdreamer: A World Model for Indoor Navigation

ObjDetApp deploys a pytorch model for object detection

Multi-layer convolutional LSTM with Pytorch

Real life contra a deep learning project built using mediapipe and openc

RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Implementation of the Remixer Block from the Remixer paper, in Pytorch

Memoized coduals - Shows that it is possible to implement reverse mode autodiff using a variation on the dual numbers called the codual numbers

Related tags

Overview

The dual numbers can do efficient autodiff!

Definition of dual number and codual number

Relationship with other autodiff strategies

Owner

wlad

GANTheftAuto is a fork of the Nvidia's GameGAN

[Preprint] "Bag of Tricks for Training Deeper Graph Neural Networks A Comprehensive Benchmark Study" by Tianlong Chen*, Kaixiong Zhou*, Keyu Duan, Wenqing Zheng, Peihao Wang, Xia Hu, Zhangyang Wang

Cross-media Structured Common Space for Multimedia Event Extraction (ACL2020)

Code and data for ACL2021 paper Cross-Lingual Abstractive Summarization with Limited Parallel Resources.

Low Complexity Channel estimation with Neural Network Solutions

A Python training and inference implementation of Yolov5 helmet detection in Jetson Xavier nx and Jetson nano

TensorFlow Ranking is a library for Learning-to-Rank (LTR) techniques on the TensorFlow platform

Updated for TTS(CE) = Also Known as TTN V3. The code requires the first server to be 'ttn' protocol.

Official code for our EMNLP2021 Outstanding Paper MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks

[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias

Code for the paper Hybrid Spectrogram and Waveform Source Separation

GNEE - GAT Neural Event Embeddings

An OpenAI Gym environment for Super Mario Bros

An image processing project uses Viola-jones technique to detect faces and then use SIFT algorithm for recognition.

Pathdreamer: A World Model for Indoor Navigation

*ObjDetApp* deploys a pytorch model for object detection

Multi-layer convolutional LSTM with Pytorch

Real life contra a deep learning project built using mediapipe and openc

RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Implementation of the Remixer Block from the Remixer paper, in Pytorch

[Preprint] "Bag of Tricks for Training Deeper Graph Neural Networks A Comprehensive Benchmark Study" by Tianlong Chen, Kaixiong Zhou, Keyu Duan, Wenqing Zheng, Peihao Wang, Xia Hu, Zhangyang Wang

ObjDetApp deploys a pytorch model for object detection