Application of the L2HMC algorithm to simulations in lattice QCD.

Last update: Dec 14, 2022

Overview

l2hmc-qcd

📊 Slides

Recent talk on Training Topological Samplers for Lattice Gauge Theory from the Machine Learning for High Energy Physics, on and off the Lattice @ ect* Trento (09/30/2021)

📒 Example Notebook

Accepted to the Deep Learning for Simulation (SimDL) Workshop at ICLR 2021
- 📚 : arXiv:2105.03418
- 📊 : poster

Overview

The L2HMC algorithm aims to improve upon HMC by optimizing a carefully chosen loss function which is designed to minimize autocorrelations within the Markov Chain, thereby improving the efficiency of the sampler.

This work is based on the original implementation: brain-research/l2hmc/.

A detailed description of the L2HMC algorithm can be found in the paper:

Generalizing Hamiltonian Monte Carlo with Neural Network

by Daniel Levy, Matt D. Hoffman and Jascha Sohl-Dickstein.

Broadly, given an analytically described target distribution, π(x), L2HMC provides a statistically exact sampler that:

Quickly converges to the target distribution (fast burn-in).
Quickly produces uncorrelated samples (fast mixing).
Is able to efficiently mix between energy levels.
Is capable of traversing low-density zones to mix between modes (often difficult for generic HMC).

L2HMC for LatticeQCD

Goal: Use L2HMC to efficiently generate gauge configurations for calculating observables in lattice QCD.

A detailed description of the (ongoing) work to apply this algorithm to simulations in lattice QCD (specifically, a 2D U(1) lattice gauge theory model) can be found in doc/main.pdf.

Organization

Dynamics / Network

The base class for the augmented L2HMC leapfrog integrator is implemented in the BaseDynamics (a tf.keras.Model object).

The GaugeDynamics is a subclass of BaseDynamics containing modifications for the 2D U(1) pure gauge theory.

The network is defined in l2hmc-qcd/network/functional_net.py.

Network Architecture

An illustration of the leapfrog layer updating (x, v) --> (x', v') can be seen below.

Lattice

Lattice code can be found in lattice.py, specifically the GaugeLattice object that provides the base structure on which our target distribution exists.

Additionally, the GaugeLattice object implements a variety of methods for calculating physical observables such as the average plaquette, ɸₚ, and the topological charge Q,

Training

The training loop is implemented in l2hmc-qcd/utils/training_utils.py .

To train the sampler on a 2D U(1) gauge model using the parameters specified in bin/train_configs.json:

$ python3 /path/to/l2hmc-qcd/l2hmc-qcd/train.py --json_file=/path/to/l2hmc-qcd/bin/train_configs.json

Or via the bin/train.sh script provided in bin/.

Features

Distributed training (via horovod): If horovod is installed, the model can be trained across multiple GPUs (or CPUs) by:

#!/bin/bash

TRAINER=/path/to/l2hmc-qcd/l2hmc-qcd/train.py
JSON_FILE=/path/to/l2hmc-qcd/bin/train_configs.json

horovodrun -np ${PROCS} python3 ${TRAINER} --json_file=${JSON_FILE}

Contact

Code author: Sam Foreman

Pull requests and issues should be directed to: saforem2

Citation

If you use this code or found this work interesting, please cite our work along with the original paper:

@misc{foreman2021deep,
      title={Deep Learning Hamiltonian Monte Carlo}, 
      author={Sam Foreman and Xiao-Yong Jin and James C. Osborn},
      year={2021},
      eprint={2105.03418},
      archivePrefix={arXiv},
      primaryClass={hep-lat}
}

@article{levy2017generalizing,
  title={Generalizing Hamiltonian Monte Carlo with Neural Networks},
  author={Levy, Daniel and Hoffman, Matthew D. and Sohl-Dickstein, Jascha},
  journal={arXiv preprint arXiv:1711.09268},
  year={2017}
}

Acknowledgement

This research used resources of the Argonne Leadership Computing Facility, which is a DOE Office of Science User Facility supported under contract DE_AC02-06CH11357. This work describes objective technical results and analysis. Any subjective views or opinions that might be expressed in the work do not necessarily represent the views of the U.S. DOE or the United States Government. Declaration of Interests - None.

Comments

Remove upper bound on python_requires

(I'm moving between meetings so can iterate on this more later, so excuse the very brief Issue for now).

At the moment the project has an upper bound on python_requires

https://github.com/saforem2/l2hmc-qcd/blob/2eb6ee63cc0c53b187e6d716f4c12f418c8b8515/setup.py#L165

Assuming that you're intending l2hmc to be a library and not an application, then I would highly recommend removing this for the reasons summarized in Henry's detailed blog post on the subject.

Congrats on getting l2hmc up on PyPI though! :snake: :rocket:

opened by matthewfeickert 2
Alpha
Pull upstream alpha branch into main

Major changes

new src/ hierarchical module organization

Contains skeleton implementation of 4D SU(3) lattice gauge model

src/l2hmc/lattice/gauge/lattice.py

Framework independent configuration

Unified configuration system simplifies logic, same configs used for both tensorflow and pytorch experiments

Plan to be able to specify which backend to use through config option

Unified (and framework independent) configurations between tensorflow and pytorch implementations

Definitions can be found in l2hmc-qcd/src/l2hmc/configs.py

Note: This is still very much a WIP. Many existing features still need to be re-implemented / updated into new code in src/.

Todo

[ ] Write unit tests

[ ] Use simple configs for end-to-end workflow test + integrate into CI

[ ] dynamic learning rate scheduling

[ ] Test 4D SU(3) numpy code

[ ] Write tensorflow and pytorch implementations of LatticeSU3 objects

[ ] Improved / simplified ( / trainable?) annealing schedule

[ ] Distributed training support

[ ] horovod

[ ] DDP for pytorch implementation

[ ] DeepSpeed from Microsoft??

[ ] Testing / inference logic

[ ] Automatic checkpointing

[ ] Metric logging

[ ] Tensorboard?

[ ] Sacred?

[ ] build custom dashboard? plot.ly?

[ ] Setup packaging / distribution through pip

[ ] Resolve issue
opened by saforem2 1
Alpha
Major upgrades to how training is initialized in l2hmc-qcd/utils/training_utils.py, particularly when trying to restore a model from an existing checkpoint.

Significant upgrades to logging mechanics in l2hmc-qcd/utils/logger.py and l2hmc-qcd/utils/logger_config.py which now use a RichHandler to nicely format log messages characterized by severity, including automatic file rotation, etc.

Improvements to test suite in l2hmc-qcd/tests/test_training.py, more robust tests on larger set of possible cases

TODO: Automate using github actions for CI

Improvements to l2hmc-qcd/dynamics/gauge_dynamics.py but still a WIP
opened by saforem2 1
Rich
General improvements, rewrote logging methods to use Rich for better formatting.

Adds dynamic (trainable) step size eps for each separate x and v updates, seems to generally increase the total energy towards the middle of the trajectory but it remains unclear if this corresponds to an improvement in the tunneling rate

Adds methods for calculating autocorrelations of the topological charge, as well as notebooks for generating the plots

Updates to the writeup in doc/main.pdf

Will likely be last changes to writeup before public release of official draft
opened by saforem2 1
Dev
Updates to README

Ability to load network with new training instance

Updates to doc/, removes old sections related to debugging the bias in the plaquette
opened by saforem2 1
Saveable model
Complete rewrite of dynamics.xnet and dynamics.vnet models to use tf.keras.functional Models.

Additional changes include:

Non-Compact Projection update for gauge fields

Ability to specify convolution structure to be prepended at beginning of gauge network
opened by saforem2 1
Dev

Removes models/gauge_model.py entirely.

Instead, a base dynamics class is implemented in dynamics/dynamics.py, and an example subclass is provided in dynamics/gauge_dynamics.py.

opened by saforem2 1
Split networks

Major rewrite of existing codebase.

This pull request updates everything to be compatible with tensorflow >= 2.2 and removes a bunch of redundant legacy code.

opened by saforem2 1
Dev
Dynamics object is now compatible with tf >= 2.0

Running inference on trained model with tensorflow now creates identical graphs and summary files to numpy inference code

Inference with numpy now uses object oriented structure

Adds LaTeX + PDF documentation in doc/
opened by saforem2 1
Cooley dev

Adds new GaugeNetwork architecture as the default for training GaugeModel

Additionally, replaces pickle with joblib for saving data as .z compressed files (as opposed to .pkl files).

opened by saforem2 1
Testing

Implemented nnehmc_loss calculation for an alternative loss function using the approach suggested in https://infoscience.epfl.ch/record/264887/files/robust_parameter_estimation.pdf.

This modified loss function can be chosen (instead of the standard loss described in the original paper) by passing --use_nnehmc_loss as a command line argument.

opened by saforem2 1

Packaging and PyPI distribution?

As you've made a library and are using it as such:

# snippet from toy_distributions.ipynb

# append parent directory to `sys.path`
# to load from modules in `../l2hmc-qcd/`
module_path = os.path.join('..')
if module_path not in sys.path:
    sys.path.append(module_path)

# Local imports
from utils.attr_dict import AttrDict
from utils.training_utils import train_dynamics
from dynamics.config import DynamicsConfig
from dynamics.base_dynamics import BaseDynamics
from dynamics.generic_dynamics import GenericDynamics
from network.config import LearningRateConfig
from config import (State, NetWeights, MonteCarloStates,
                    BASE_DIR, BIN_DIR, TF_FLOAT)

from utils.distributions import (plot_samples2D, contour_potential,
                                 two_moons_potential, sin_potential,
                                 sin_potential1, sin_potential2)

do you have any plans and/or interest in packaging it as a Python library so it can either be pip installed from GitHub or be distributed on PyPI?

opened by matthewfeickert 5

Releases(0.12.0)

0.12.0(Aug 9, 2022)

Source code(tar.gz)
Source code(zip)
0.8.0(Apr 14, 2022)

Full Changelog: https://github.com/saforem2/l2hmc-qcd/compare/0.7.0...0.8.0
Source code(tar.gz)
Source code(zip)
0.7.0(Apr 14, 2022)

pypi release: v0.7.0

Full Changelog: https://github.com/saforem2/l2hmc-qcd/compare/0.4.0...0.7.0
Source code(tar.gz)
Source code(zip)
0.4.0(Apr 8, 2022)

Full Changelog: https://github.com/saforem2/l2hmc-qcd/compare/0.3.0...0.4.0
Source code(tar.gz)
Source code(zip)

Owner

Sam Foreman

Computational science Postdoc at Argonne National Laboratory working on applying machine learning to simulations in lattice QCD.

GitHub Repository https://samforeman.me/l2hmc-qcd

Poisson Surface Reconstruction for LiDAR Odometry and Mapping

Poisson Surface Reconstruction for LiDAR Odometry and Mapping Surfels TSDF Our Approach Table: Qualitative comparison between the different mapping te

305 Dec 21, 2022

Yolo ros - YOLO-ROS for HUAWEI ATLAS200

YOLO-ROS YOLO-ROS for NVIDIA YOLO-ROS for HUAWEI ATLAS200, please checkout for b

5 Oct 18, 2022

TyXe: Pyro-based BNNs for Pytorch users

TyXe: Pyro-based BNNs for Pytorch users TyXe aims to simplify the process of turning Pytorch neural networks into Bayesian neural networks by leveragi

87 Jan 03, 2023

A Python module for parallel optimization of expensive black-box functions

blackbox: A Python module for parallel optimization of expensive black-box functions What is this? A minimalistic and easy-to-use Python module that e

426 Dec 08, 2022

YOLOX-RMPOLY

本算法为适应robomaster比赛，而改动自矩形识别的yolox算法。基于旷视科技YOLOX，实现对不规则四边形的目标检测 TODO 修改onnx推理模型更改/添加标注： 1.yolox/models/yolox_polyhead.py: 1.1继承yolox/models/yolo_

3 Feb 25, 2022

Regularizing Nighttime Weirdness: Efficient Self-supervised Monocular Depth Estimation in the Dark (ICCV 2021)

Regularizing Nighttime Weirdness: Efficient Self-supervised Monocular Depth Estimation in the Dark (ICCV 2021) Kun Wang, Zhenyu Zhang, Zhiqiang Yan, X

66 Nov 24, 2022

you can add any codes in any language by creating its respective folder (if already not available).

HACKTOBERFEST-2021-WEB-DEV Beginner-Hacktoberfest Need Your first pr for hacktoberfest 2k21 ? come on in About This is repository of Responsive Portfo

8 Oct 17, 2022

Wileless-PDGNet Implementation

Wileless-PDGNet Implementation This repo is related to the following paper: Boning Li, Ananthram Swami, and Santiago Segarra, "Power allocation for wi

6 Oct 04, 2022

SAS output to EXCEL converter for Cornell/MIT Language and acquisition lab

CORNELLSASLAB SAS output to EXCEL converter for Cornell/MIT Language and acquisition lab Instructions: This python code can be used to convert SAS out

2 Jan 26, 2022

Use evolutionary algorithms instead of gridsearch in scikit-learn

sklearn-deap Use evolutionary algorithms instead of gridsearch in scikit-learn. This allows you to reduce the time required to find the best parameter

709 Jan 03, 2023

🧮 Matrix Factorization for Collaborative Filtering is just Solving an Adjoint Latent Dirichlet Allocation Model after All

Accompanying source code to the paper "Matrix Factorization for Collaborative Filtering is just Solving an Adjoint Latent Dirichlet Allocation Model A

39 Dec 03, 2022

Application of the L2HMC algorithm to simulations in lattice QCD.

Related tags

Overview

l2hmc-qcd

📊 Slides

📒 Example Notebook

Overview

L2HMC for LatticeQCD

Organization

Dynamics / Network

Network Architecture

Lattice

Training

Features

Contact

Citation

Acknowledgement

Comments

Major changes

Todo

Releases(0.12.0)

0.12.0(Aug 9, 2022)

0.8.0(Apr 14, 2022)

0.7.0(Apr 14, 2022)

0.4.0(Apr 8, 2022)

Owner

Sam Foreman

Poisson Surface Reconstruction for LiDAR Odometry and Mapping

Yolo ros - YOLO-ROS for HUAWEI ATLAS200

TyXe: Pyro-based BNNs for Pytorch users

A Python module for parallel optimization of expensive black-box functions

YOLOX-RMPOLY

Regularizing Nighttime Weirdness: Efficient Self-supervised Monocular Depth Estimation in the Dark (ICCV 2021)

you can add any codes in any language by creating its respective folder (if already not available).

Wileless-PDGNet Implementation

SAS output to EXCEL converter for Cornell/MIT Language and acquisition lab

Use evolutionary algorithms instead of gridsearch in scikit-learn

🧮 Matrix Factorization for Collaborative Filtering is just Solving an Adjoint Latent Dirichlet Allocation Model after All

Code for "Continuous-Time Meta-Learning with Forward Mode Differentiation" (ICLR 2022)

The official repository for "Intermediate Layers Matter in Momentum Contrastive Self Supervised Learning" paper.

This project provides a stock market environment using OpenGym with Deep Q-learning and Policy Gradient.

Deep deconfounded recommender (Deep-Deconf) for paper "Deep causal reasoning for recommendations"

Object recognition using Azure Custom Vision AI and Azure Functions

Experiments with Fourier layers on simulation data.

Deep Learning Visuals contains 215 unique images divided in 23 categories

N-HiTS: Neural Hierarchical Interpolation for Time Series Forecasting

Code for Two-stage Identifier: "Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition"