Finite-temperature variational Monte Carlo calculation of uniform electron gas using neural canonical transformation.

Last update: Mar 03, 2022

Related tags

Overview

CoulombGas

This code implements the neural canonical transformation approach to the thermodynamic properties of uniform electron gas. Building on JAX, it utilizes (both forward- and backwark-mode) automatic differentiation and the pmap mechanism to achieve a large-scale single-program multiple-data (SPMD) training on multiple GPUs.

Requirements

JAX with Nvidia GPU support
A handful of GPUs. The more the better :P
haiku
optax
To analytically computing the thermal entropy of a non-interacting Fermi gas in the canonical ensemble based on arbitrary-precision arithmetic, we have used the python library mpmath.

Demo run

To start, try running the following commands to launch a training of 13 spin-polarized electrons in 2D with the dimensionless density parameter 10.0 and (reduced) temperature 0.15 on 8 GPUs:

export CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
python main.py --n 13 --dim 2 --rs 10.0 --Theta 0.15 --Emax 25 --sr --batch 4096 --num_devices 8 --acc_steps 2

Note that we effectively sample a batch of totally 8192 samples in each training step. However, such a batch size will result in too large a memory consumption to be accommodated by 8 GPUs. To overcome this problem, we choose to split the batch into two equal pieces, and accumulate the gradient and various observables for each piece in two sequential substeps. In other words, the argument batch in the command above actually stands for the batch per accumulation step.

If you have only, say, 4 GPUs, you can set batch, num_devices, acc_steps to be 2048, 4 and 4 respectively to launch the same training process, at the expense of doubling the running time. The GPU hours are nevertheless the same.

For the detail meaning of other command line arguments, run

python main.py --help

or directly refer to the source code.

Trained model and data

A training process from complete scratch actually contains two stages. In the first stage, a variational autoregressive network is pretrained to approximate the Boltzmann distribution of the corresponding non-interacting electron gas. The resulting model can be saved and then loaded later. In fact, we have provided such a model file for the parameter settings of the last section for your convenience, so you can quickly get a feeling of the second stage of training the truly interacting system of our interest. We encourage you to remove the file to pretrain the model by yourself; it is actually much faster than the training in the second stage.

To facilitate further developments, we also provide the training models and logged data for various calculations in the paper, which are located in the data directory.

To cite

arxiv

Finite-temperature variational Monte Carlo calculation of uniform electron gas using neural canonical transformation.

Related tags

Overview

CoulombGas

Requirements

Demo run

Trained model and data

To cite

Owner

FermiFlow

Efficiently computes derivatives of numpy code.

The source codes for ACL 2021 paper 'BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data'

A Research-oriented Federated Learning Library and Benchmark Platform for Graph Neural Networks. Accepted to ICLR'2021 - DPML and MLSys'21 - GNNSys workshops.

Paddle Graph Learning (PGL) is an efficient and flexible graph learning framework based on PaddlePaddle

Repo for the Tutorials of Day1-Day3 of the Nordic Probabilistic AI School 2021 (https://probabilistic.ai/)

Official PyTorch implementation of Less is More: Pay Less Attention in Vision Transformers.

Robotic Process Automation in Windows and Linux by using Driagrams.net BPMN diagrams.

Code for "Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo"

Code for models used in Bashiri et al., "A Flow-based latent state generative model of neural population responses to natural images".

Joint project of the duo Hacker Ninjas

Social Fabric: Tubelet Compositions for Video Relation Detection

A Deep Learning based project for creating line art portraits.

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021

Code for the ICCV 2021 paper "Pixel Difference Networks for Efficient Edge Detection" (Oral).

The codes of paper 'Active-LATHE: An Active Learning Algorithm for Boosting the Error exponent for Learning Homogeneous Ising Trees'

Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention

DeceFL: A Principled Decentralized Federated Learning Framework

Backend code to use MCPI's python API to make infinite worlds with custom generation

Multi-Modal Fingerprint Presentation Attack Detection: Evaluation On A New Dataset