iftopt

An Implicit Function Theorem (IFT) optimizer for bi-level optimizations.

Requirements

Python 3.7+
PyTorch 1.x

Installation

$ pip install git+https://github.com/money-shredder/iftopt.git

Usage

Consider a bi-level optimization of the form:

y* = argmin_{y} val_loss(x*, y), where x* = argmin_{x} train_loss(x, y).
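For reference, the hyper-gradient that an IFT-based method aims to compute can be written as below. This is the standard textbook identity, not a formula taken from the iftopt source, and it assumes the Hessian of train_loss with respect to x is invertible at x*:

```latex
\frac{d\,\mathrm{val\_loss}}{dy}
  = \frac{\partial\,\mathrm{val\_loss}}{\partial y}
  - \frac{\partial\,\mathrm{val\_loss}}{\partial x}
    \left[ \frac{\partial^2\,\mathrm{train\_loss}}{\partial x\,\partial x^\top} \right]^{-1}
    \frac{\partial^2\,\mathrm{train\_loss}}{\partial x\,\partial y^\top}
  \quad \text{evaluated at } x = x^*(y).
```

The second term is the indirect dependence of the validation loss on y through x*(y). It follows from differentiating the stationarity condition ∂train_loss/∂x = 0 with respect to y.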

To solve for the optimal x* and y*, iftopt can be used as follows:

import torch
from iftopt import HyperOptimizer
train_lr = val_lr = 0.1
# parameter to minimize the training loss
x = torch.nn.Parameter(...)
# hyper-parameter to minimize the validation loss
y = torch.nn.Parameter(...)
# training loss optimizer
opt = torch.optim.SGD([x], lr=train_lr)
# validation loss optimizer
hopt = HyperOptimizer(
    [y], torch.optim.SGD([y], lr=val_lr), vih_lr=0.1, vih_iterations=5)
# outer optimization loop for y
for _ in range(...):
    # inner optimization loop for x
    for _ in range(...):
        z = train_loss(x, y)
        # inner optimization step for x
        opt.zero_grad()
        z.backward()
        opt.step()
    # outer optimization step for y
    hopt.set_train_parameters([x])
    z = train_loss(x, y)
    hopt.train_step(z)
    v = val_loss(x, y)
    hopt.val_step(v)
    hopt.grad()
    hopt.step()
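The argument names vih_lr and vih_iterations suggest an iterative approximation of a vector-inverse-Hessian product, but that reading is an assumption and is not confirmed by this README. One common way to approximate such a product is a truncated Neumann series; the sketch below illustrates that technique in plain Python. The function name neumann_inverse_hvp and its signature are hypothetical and are not part of iftopt.

```python
def neumann_inverse_hvp(hvp, v, lr, iterations):
    """Approximate H^{-1} v by lr * sum_{j=0}^{iterations} (I - lr*H)^j v.

    hvp is a function mapping a vector p (list of floats) to H p.
    The series converges when the eigenvalues of lr*H lie in (0, 2).
    """
    p = list(v)    # current term (I - lr*H)^j v, starting at j = 0
    acc = list(v)  # running sum of the terms
    for _ in range(iterations):
        hp = hvp(p)
        p = [pi - lr * hpi for pi, hpi in zip(p, hp)]
        acc = [ai + pi for ai, pi in zip(acc, p)]
    return [lr * ai for ai in acc]


# Toy check with H = diag(2, 4): the exact H^{-1} v is [0.5, 0.25] for v = [1, 1].
hvp = lambda p: [2.0 * p[0], 4.0 * p[1]]
approx = neumann_inverse_hvp(hvp, [1.0, 1.0], lr=0.1, iterations=200)
# A short series (5 iterations) gives only a rough estimate.
short = neumann_inverse_hvp(hvp, [1.0, 1.0], lr=0.1, iterations=5)
```

With few iterations the estimate is biased toward zero, so in this kind of scheme the step size and iteration count trade accuracy against the cost of extra Hessian-vector products.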

For a concrete simple example, please check out and run demo.py, where

train_loss = lambda x, y: (x + y) ** 2
val_loss = lambda x, y: x ** 2

with x = y = 1.0 initially. It generates a video, demo.mp4, showing the optimization trajectory. Note that although the validation loss has no direct gradient w.r.t. the hyper-parameter y, iftopt can still minimize the validation loss by computing the hyper-gradient via the implicit function theorem.
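To see why the hyper-gradient is non-zero here, the same toy problem can be worked by hand. The sketch below does not use iftopt or autograd. It hard-codes the derivatives of the two losses above and applies the IFT identity with an exact scalar inverse Hessian. The learning rate and loop counts are illustrative assumptions, not the values used in demo.py.

```python
def train_grad_x(x, y):
    # d/dx of train_loss = (x + y) ** 2
    return 2.0 * (x + y)


def hypergradient(x, y):
    # IFT: d val / dy = dval/dy - dval/dx * (d2train/dx2)^{-1} * d2train/dxdy
    dval_dy = 0.0      # val_loss = x ** 2 has no direct dependence on y
    dval_dx = 2.0 * x
    h_xx = 2.0         # second derivative of train_loss w.r.t. x
    h_xy = 2.0         # mixed second derivative of train_loss
    return dval_dy - dval_dx * (1.0 / h_xx) * h_xy


x, y = 1.0, 1.0
lr = 0.1               # assumed step size for both loops
history = []
for _ in range(50):            # outer loop over y
    for _ in range(20):        # inner loop: x approaches x*(y) = -y
        x -= lr * train_grad_x(x, y)
    y -= lr * hypergradient(x, y)
    history.append(x ** 2)     # validation loss at the current x
```

Since x*(y) = -y, the validation loss along the inner solution is y ** 2, so the hyper-gradient is approximately 2 * y and the outer steps drive both y and the validation loss toward zero.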


Owner

The Money Shredder Lab
