Pure python implementation reverse-mode automatic differentiation

Last update: Sep 12, 2022

Related tags

Overview

MiniGrad

A minimal implementation of reverse-mode automatic differentiation (a.k.a. autograd / backpropagation) in pure Python.

Inspired by Andrej Karpathy's micrograd, but with more comments and less cleverness. Thanks for the wonderful reference implementation and tests!

Overview

Create a Scalar.

a = Scalar(1.5)

Do some calculations.

b = Scalar(-4.0)
c = a**3 / 5
d = c + (b**2).relu()

Compute the gradients.

d.backward()

Plot the computational graph.

draw_graph(d)

Repo Structure

demo.ipynb: Demo notebook of MiniGrad's functionality.
tests.ipynb: Test notebook to verify gradients against PyTorch and JAX. Install both to run tests.
minigrad/minigrad.py: The entire autograd logic in one (~100 loc) numeric class. See section below for details.
minigrad/visualize.py: This just draws nice-looking computational graphs. Install Graphviz to run it.
requirements.txt: MiniGrad requires no external modules to run. This file just sets up my dev environment.

Implementation

MiniGrad is implemented in one small (~100 loc) Python class, using no external modules.

The entirety of the auto-differentiation logic lives in the Scalar class in minigrad.py.

A Scalar wraps a float/int and overrides its arithmetic magic methods in order to:

Stitch together a define-by-run computational graph when doing arithmetic operations on a Scalar
Hard code the derivative functions of arithmetic operations
Keep track of ∂self/∂parent between adjacent nodes
Compute ∂output/∂self with the chain rule on demand (when .backward() is called)

This is called reverse-mode automatic differentiation. It's great when you have few outputs and many inputs, since it computes all derivatives of one output in one pass. This is also how TensorFlow and PyTorch normally compute gradients.

(Forward-mode automatic differentiation also exists, and has the opposite advantage.)

Not in Scope

This project is just for fun, so the following are not planned:

Vectorization
Higher order derivatives (i.e. Scalar.grad is a Scalar itself)
Forward-mode automatic differentiation
Neural network library on top of MiniGrad

Pure python implementation reverse-mode automatic differentiation

Related tags

Overview

MiniGrad

Overview

Repo Structure

Implementation

Not in Scope

Owner

Kenny Song

a baseline to practice

A Dataset of Python Challenges for AI Research

Official PyTorch implementation of "Meta-Learning with Task-Adaptive Loss Function for Few-Shot Learning" (ICCV2021 Oral)

Individual Tree Crown classification on WorldView-2 Images using Autoencoder -- Group 9 Weak learners - Final Project (Machine Learning 2020 Course)

Code from the paper "High-Performance Brain-to-Text Communication via Handwriting"

Gray Zone Assessment

Repo for "Physion: Evaluating Physical Prediction from Vision in Humans and Machines" submission to NeurIPS 2021 (Datasets & Benchmarks track)

An implementation for Neural Architecture Search with Random Labels (CVPR 2021 poster) on Pytorch.

Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning

Federated_learning codes used for the the paper "Evaluation of Federated Learning Aggregation Algorithms" and "A Federated Learning Aggregation Algorithm for Pervasive Computing: Evaluation and Comparison"

YOLOv2 in PyTorch

Benchmarks for Object Detection in Aerial Images

Multi-query Video Retreival

Moon-patrol - A faithful recreation of the 1983 hit classic Moon Patrol for the Atari 2600 created using the Pygame library for Python

TransZero++: Cross Attribute-guided Transformer for Zero-Shot Learning

A Java implementation of the experiments for the paper "k-Center Clustering with Outliers in Sliding Windows"

The official PyTorch implementation for NCSNv2 (NeurIPS 2020)

Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition

A sequence of Jupyter notebooks featuring the 12 Steps to Navier-Stokes

This is an open-source toolkit for Heterogeneous Graph Neural Network(OpenHGNN) based on DGL [Deep Graph Library] and PyTorch.