Pure Python implementation of reverse-mode automatic differentiation

Last update: Sep 12, 2022

MiniGrad

A minimal implementation of reverse-mode automatic differentiation (a.k.a. autograd / backpropagation) in pure Python.

Inspired by Andrej Karpathy's micrograd, but with more comments and less cleverness. Thanks for the wonderful reference implementation and tests!

Overview

Create a Scalar.

a = Scalar(1.5)

Do some calculations.

b = Scalar(-4.0)
c = a**3 / 5
d = c + (b**2).relu()

Compute the gradients.

d.backward()
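As a sanity check on what the backward pass should produce for this example, the same expression can be differentiated by hand and compared against central finite differences in plain Python. This is only a sketch and does not use MiniGrad itself; the helper names `relu`, `f`, and `central_difference` are made up for illustration, and it assumes `relu` means `max(x, 0)`.

```python
def relu(x):
    # Assumed definition: max(x, 0).
    return x if x > 0 else 0.0


def f(a, b):
    # Same expression as the example above, on plain floats.
    c = a**3 / 5
    return c + relu(b**2)


def central_difference(func, args, index, eps=1e-6):
    # Numerically estimate d func / d args[index].
    hi = list(args)
    lo = list(args)
    hi[index] += eps
    lo[index] -= eps
    return (func(*hi) - func(*lo)) / (2 * eps)


a, b = 1.5, -4.0
value = f(a, b)

# Hand-derived gradients: dd/da = 3a^2 / 5, dd/db = 2b (relu is active since b^2 > 0).
grad_a_analytic = 3 * a**2 / 5
grad_b_analytic = 2 * b

grad_a_numeric = central_difference(f, (a, b), 0)
grad_b_numeric = central_difference(f, (a, b), 1)

print(value, grad_a_analytic, grad_a_numeric, grad_b_analytic, grad_b_numeric)
```

If the gradients are stored on a `grad` attribute, `a.grad` and `b.grad` should agree with these values up to floating-point error.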

Plot the computational graph.

draw_graph(d)

Repo Structure

demo.ipynb: Demo notebook of MiniGrad's functionality.
tests.ipynb: Test notebook to verify gradients against PyTorch and JAX. Install both to run tests.
minigrad/minigrad.py: The entire autograd logic in one (~100 loc) numeric class. See the Implementation section below for details.
minigrad/visualize.py: This just draws nice-looking computational graphs. Install Graphviz to run it.
requirements.txt: MiniGrad requires no external modules to run. This file just sets up my dev environment.

Implementation

MiniGrad is implemented in one small (~100 loc) Python class, using no external modules.

The entirety of the auto-differentiation logic lives in the Scalar class in minigrad.py.

A Scalar wraps a float/int and overrides its arithmetic magic methods in order to:

Stitch together a define-by-run computational graph when doing arithmetic operations on a Scalar
Hard code the derivative functions of arithmetic operations
Keep track of ∂self/∂parent between adjacent nodes
Compute ∂output/∂self with the chain rule on demand (when .backward() is called)
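The four responsibilities listed above can be sketched in a few dozen lines. The class below is a hypothetical stand-in named `TinyScalar`, written for illustration only; it is not the code in minigrad.py, and its attribute names (`value`, `parents`) and its restriction to constant exponents are assumptions of this sketch.

```python
class TinyScalar:
    """Hypothetical stand-in for a Scalar-like class; not MiniGrad's actual code."""

    def __init__(self, value, parents=()):
        self.value = float(value)
        self.grad = 0.0
        # Each entry is (parent_node, d_self/d_parent), recorded when the node is created.
        self.parents = tuple(parents)

    @staticmethod
    def _wrap(x):
        return x if isinstance(x, TinyScalar) else TinyScalar(x)

    def __add__(self, other):
        other = self._wrap(other)
        return TinyScalar(self.value + other.value, [(self, 1.0), (other, 1.0)])

    __radd__ = __add__

    def __mul__(self, other):
        other = self._wrap(other)
        return TinyScalar(self.value * other.value,
                          [(self, other.value), (other, self.value)])

    __rmul__ = __mul__

    def __pow__(self, exponent):
        # Only constant int/float exponents are supported in this sketch.
        return TinyScalar(self.value ** exponent,
                          [(self, exponent * self.value ** (exponent - 1))])

    def __truediv__(self, other):
        return self * self._wrap(other) ** -1

    def relu(self):
        return TinyScalar(max(self.value, 0.0),
                          [(self, 1.0 if self.value > 0 else 0.0)])

    def backward(self):
        # Post-order DFS: every node is appended after all of its parents.
        order, seen = [], set()

        def visit(node):
            if id(node) in seen:
                return
            seen.add(id(node))
            for parent, _ in node.parents:
                visit(parent)
            order.append(node)

        visit(self)
        for node in order:
            node.grad = 0.0
        self.grad = 1.0  # d_output/d_output
        # Reversed order visits each node before its parents, so node.grad is
        # complete by the time it is pushed further back with the chain rule.
        for node in reversed(order):
            for parent, local in node.parents:
                parent.grad += node.grad * local


a = TinyScalar(1.5)
b = TinyScalar(-4.0)
d = a**3 / 5 + (b**2).relu()
d.backward()
print(d.value, a.grad, b.grad)
```

Storing the local derivative at graph-construction time keeps `backward()` free of any per-operation logic: it only multiplies and accumulates along edges.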

This is called reverse-mode automatic differentiation. It is efficient when you have few outputs and many inputs, since one backward pass computes the derivatives of one output with respect to every input. This is also how TensorFlow and PyTorch normally compute gradients.

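(Forward-mode automatic differentiation also exists and has the opposite advantage: it is efficient when you have few inputs and many outputs, since one forward pass computes the derivatives of every output with respect to one input.)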
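For contrast, forward mode can be sketched with dual numbers, which carry a derivative alongside each value. The `Dual` class below is hypothetical and is not part of MiniGrad; it handles just enough operations to evaluate the expression `a**3 / 5 + (b**2).relu()`, and it needs one pass per input to get the full gradient.

```python
class Dual:
    """Hypothetical dual number: a value plus its derivative w.r.t. one chosen input."""

    def __init__(self, value, deriv=0.0):
        self.value = float(value)
        self.deriv = float(deriv)

    @staticmethod
    def _wrap(x):
        return x if isinstance(x, Dual) else Dual(x)

    def __add__(self, other):
        other = self._wrap(other)
        return Dual(self.value + other.value, self.deriv + other.deriv)

    def __mul__(self, other):
        other = self._wrap(other)
        # Product rule.
        return Dual(self.value * other.value,
                    self.deriv * other.value + self.value * other.deriv)

    def __pow__(self, n):
        # Power rule, constant exponents only.
        return Dual(self.value ** n, n * self.value ** (n - 1) * self.deriv)

    def __truediv__(self, k):
        # Division by a plain number only, to keep the sketch short.
        return Dual(self.value / k, self.deriv / k)

    def relu(self):
        return Dual(max(self.value, 0.0), self.deriv if self.value > 0 else 0.0)


def f(a, b):
    return a**3 / 5 + (b**2).relu()


# One forward pass per input: seed that input's derivative with 1, the other with 0.
dd_da = f(Dual(1.5, 1.0), Dual(-4.0, 0.0)).deriv
dd_db = f(Dual(1.5, 0.0), Dual(-4.0, 1.0)).deriv
print(dd_da, dd_db)
```

With two inputs this takes two passes; with thousands of inputs and a single output, the reverse-mode approach needs only one backward pass.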

Not in Scope

This project is just for fun, so the following are not planned:

Vectorization
Higher-order derivatives (i.e. making Scalar.grad a Scalar itself)
Forward-mode automatic differentiation
Neural network library on top of MiniGrad

Owner

Kenny Song
