Can we visualize a large scientific data set with a surrogate model? We're building a GAN for the Earth's Mantle Convection data set to see if we can!

Overview

EarthGAN - Earth Mantle Surrogate Modeling

Can a surrogate model of the Earth's Mantle Convection data set be built such that it can be readily run in a web browser and produce high-fidelity results? We're trying to do just that through the use of a generative adversarial network (GAN) -- we call ours EarthGAN. This is an ongoing research effort.

See how EarthGAN currently works! Open up the Colab notebook and create results from the preliminary generator: Open In Colab

(Figure: sample comparison from the preliminary generator, epoch 41, r-index 165, Mollweide projection)

Progress updates, along with my thoughts, can be found in the devlog. The preliminary results were presented at VIS 2021 as part of the SciVis contest. See the paper on arXiv.

This is active research. If you have any thoughts, suggestions, or would like to collaborate, please reach out! You can also post questions/ideas in the discussions section.


Current Approach

We're leveraging the excellent work of Li et al. who have implemented a GAN for creating super-resolution cosmological simulations. The general method is in their map2map repository. We've used their GAN implementation as it works on 3D data. Please cite their work if you find it useful!

The current approach is based on the StyleGAN2 model. In addition, a conditional GAN (cGAN) is used so that the results are partially deterministic; see the sketch below.
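To make the conditioning idea concrete, here is a minimal, hypothetical sketch of a conditional 3D super-resolution generator in PyTorch. It is not the actual EarthGAN/map2map architecture (which follows StyleGAN2 conventions); the layer sizes, the five-field channel count, and the simple noise injection are placeholder assumptions used only to show how a low-resolution 3D field can serve as the condition.

```python
# Hypothetical sketch only -- not the EarthGAN/map2map implementation.
import torch
import torch.nn as nn

class ConditionalGenerator3D(nn.Module):
    """Toy conditional generator: low-resolution 3D field in, upscaled 3D field out."""

    def __init__(self, channels=5, hidden=64, upscale=2):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv3d(channels, hidden, kernel_size=3, padding=1),
            nn.LeakyReLU(0.2),
            nn.Upsample(scale_factor=upscale, mode="trilinear", align_corners=False),
            nn.Conv3d(hidden, hidden, kernel_size=3, padding=1),
            nn.LeakyReLU(0.2),
            nn.Conv3d(hidden, channels, kernel_size=3, padding=1),
        )

    def forward(self, lowres, noise_std=0.1):
        # Condition on the low-resolution field; the injected noise keeps the
        # output only partially deterministic (the cGAN idea above).
        return self.body(lowres + noise_std * torch.randn_like(lowres))

# Usage: one low-resolution sample with 5 fields on a 16^3 grid is upscaled to 32^3.
lowres = torch.randn(1, 5, 16, 16, 16)
highres = ConditionalGenerator3D()(lowres)  # shape: (1, 5, 32, 32, 32)
```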

Setup

This works best in an HPC environment (I used Compute Canada). It has also been tested locally on Linux (macOS should also work). If you run Windows, you'll have to do much of the environment setup and data download/preprocessing manually.

To reproduce the data pipeline and begin training: *

  1. Clone this repo: `git clone https://github.com/tvhahn/EarthGAN.git`

  2. Create the virtual environment (assumes Conda is installed if you are on a local computer).

    • HPC: `make create_environment` will detect the HPC environment and automatically create the environment from make_hpc_venv.sh. Tested on Compute Canada. Modify make_hpc_venv.sh for your own HPC cluster.

    • Linux/MacOS: use the command from the Makefile: `make create_environment`

  3. Download raw data.

    • HPC: use `make download`. Will automatically detect the HPC environment.

    • Linux/MacOS: use `make download`. Will automatically download to the appropriate data/raw directory.

  4. Extract raw data.

    • HPC: use `make extract`. Will automatically detect the HPC environment. Again, modify for your HPC cluster.

    • Linux/MacOS: use `make extract`. Will automatically extract to the appropriate data/raw directory.

  5. Ensure the virtual environment is activated: `conda activate earth`

  6. From the root directory of EarthGAN, run `pip install -e .` -- this will give the Python scripts access to the `src` package (a quick import check is sketched after this list).

  7. Create the processed data that will be used for training.

    • HPC: use `make data`. Will automatically detect the HPC environment and create the processed data.

      📝 Note: You will have to modify make_hpc_data.sh in the ./bash_scripts/ folder to match the requirements of your HPC environment.

    • Linux/MacOS: use `make data`.

  8. Copy the processed data to the scratch folder if you're on the HPC. Modify copy_processed_data_to_scratch.sh in the ./bash_scripts/ folder.

  9. Train!

    • HPC: use `make train`. Again, modify for your HPC cluster. Not yet optimized for multi-GPU training, so be warned, it will be SLOW!

    • Linux/MacOS: use `make train`.

* Let me know if you run into any problems! This is still in development.
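Once step 6 is complete, the editable install should make the `src` package importable from anywhere in the active environment. Below is a minimal sanity check, assuming the `earth` environment is activated and `pip install -e .` succeeded:

```python
# Run inside the activated `earth` environment, from any directory.
# If the editable install worked, `src` resolves back to this repository.
import src

print(src.__file__)  # should point at .../EarthGAN/src/__init__.py
```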

Project Organization

├── Makefile           <- Makefile with commands like `make data` or `make train`
│
├── bash_scripts       <- Bash scripts used for training models or setting up the environment
│   ├── train_model_hpc.sh       <- Bash/SLURM script used to train models on HPC (you will need to modify this to work on your HPC). Called with `make train`
│   └── train_model_local.sh     <- Bash script used to train models locally. Called with `make train`
│
├── data
│   ├── interim        <- Intermediate data before we've applied any scaling.
│   ├── processed      <- The final, canonical data sets for modeling.
│   └── raw            <- Original data from the Earth Mantle Convection simulation.
│
├── models             <- Trained and serialized models, model predictions, or model summaries
│   ├── interim        <- Interim models and summaries
│   └── final          <- Final, canonical models
│
├── notebooks          <- Jupyter notebooks. Generally used for explaining various components
│   │                     of the code base.
│   └── scratch        <- Rough-draft notebooks, of questionable quality. Be warned!
│
├── references         <- Data dictionaries, manuals, and all other explanatory materials.
│
├── reports            <- Generated analysis as HTML, PDF, LaTeX, etc.
│   └── figures        <- Generated graphics and figures to be used in reporting
│
├── requirements.txt   <- Recommend using `make create_environment`. However, this file can be
│                         used to recreate the environment with pip
├── envearth.yml       <- Used to create the conda environment. Use `make create_environment`
│                         when on a local computer
│
├── setup.py           <- Makes the project pip installable (`pip install -e .`) so src can be imported
├── src                <- Source code for use in this project.
│   ├── __init__.py    <- Makes src a Python module
│   │
│   ├── data           <- Scripts to download or generate data
│   │   ├── make_dataset.py      <- Script for making downsampled data from the original
│   │   ├── data_prep_utils.py   <- Misc functions used in data prep
│   │   ├── download.sh          <- Bash script to download the entire Earth Mantle data set
│   │   │                           (used when `make download` is called)
│   │   └── download.sh          <- Bash script to extract all Earth Mantle data set files
│   │                               from zip (used when `make extract` is called)
│   │
│   ├── models         <- Scripts to train models and then use trained models to make
│   │   │                 predictions
│   │   │
│   │   └── train_model.py
│   │
│   └── visualization  <- Scripts to create exploratory and results-oriented visualizations
│       └── visualize.py
│
├── LICENSE
└── README.md          <- README describing the project.