Official implementation of "Generating 3D Molecules for Target Protein Binding"

Last update: Dec 07, 2022

Generating 3D Molecules for Target Protein Binding

This is the official implementation of the GraphBP method proposed in the following paper.

Meng Liu, Youzhi Luo, Kanji Uchino, Koji Maruhashi, and Shuiwang Ji. "Generating 3D Molecules for Target Protein Binding".

Requirements

Key dependencies are listed below, with the versions we used in parentheses. Our detailed environment setup is available in environment.yml.

PyTorch (1.9.0)
PyTorch Geometric (1.7.2)
rdkit-pypi (2021.9.3)
biopython (1.79)
openbabel (3.3.1)
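
To compare an existing environment against the versions listed above, a minimal sketch along the following lines may help. It assumes the packages are installed under the distribution names torch, torch-geometric, rdkit-pypi, biopython, and openbabel (these names are an assumption; environment.yml remains the authoritative setup). The helper check_versions is illustrative and not part of this repository.

```python
from importlib import metadata

# Versions listed in this README; the distribution names are assumptions.
EXPECTED = {
    "torch": "1.9.0",
    "torch-geometric": "1.7.2",
    "rdkit-pypi": "2021.9.3",
    "biopython": "1.79",
    "openbabel": "3.3.1",
}


def check_versions(expected=EXPECTED, lookup=metadata.version):
    """Return {name: (expected, installed_or_None)} for every mismatch."""
    mismatches = {}
    for name, wanted in expected.items():
        try:
            found = lookup(name)
        except metadata.PackageNotFoundError:
            found = None
        # Prefix comparison so local build tags such as "1.9.0+cu111" still match.
        if found is None or not found.startswith(wanted):
            mismatches[name] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    for name, (wanted, found) in check_versions().items():
        print(f"{name}: expected {wanted}, found {found or 'not installed'}")
```

An empty result means every listed package was found with a matching version prefix; it does not guarantee that the rest of the environment matches environment.yml.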

Preparing Data

Download and extract the CrossDocked2020 dataset:

wget https://bits.csb.pitt.edu/files/crossdock2020/CrossDocked2020_v1.1.tgz -P data/crossdock2020/
tar -C data/crossdock2020/ -xzf data/crossdock2020/CrossDocked2020_v1.1.tgz
wget https://bits.csb.pitt.edu/files/it2_tt_0_lowrmsd_mols_train0_fixed.types -P data/crossdock2020/
wget https://bits.csb.pitt.edu/files/it2_tt_0_lowrmsd_mols_test0_fixed.types -P data/crossdock2020/

Note: (1) Extracting the archive can take a long time; it is much faster on an SSD. (2) Several samples in the training set cannot be processed by our code. We therefore recommend replacing the it2_tt_0_lowrmsd_mols_train0_fixed.types file with a new one in which these samples are removed. The new file is available here.
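
If the cleaned file is not at hand, the filtering described in note (2) can be approximated with a sketch like the one below. It assumes that each line of the .types file describes one sample and that a problematic sample can be recognized by a substring of its line (for example, part of a file path). The function name filter_types_file and the idea of a marker list are hypothetical; the actual list of unprocessable samples is not given here.

```python
def filter_types_file(src, dst, bad_markers):
    """Copy src to dst, skipping lines that contain any marker in bad_markers.

    Returns the number of dropped lines.
    """
    dropped = 0
    with open(src) as fin, open(dst, "w") as fout:
        for line in fin:
            if any(marker in line for marker in bad_markers):
                dropped += 1
                continue
            fout.write(line)
    return dropped
```

Writing to a separate destination file keeps the downloaded original intact, so the filter can be rerun with a different marker list if needed.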

Split data files:

python scripts/split_sdf.py data/crossdock2020/it2_tt_0_lowrmsd_mols_train0_fixed.types data/crossdock2020
python scripts/split_sdf.py data/crossdock2020/it2_tt_0_lowrmsd_mols_test0_fixed.types data/crossdock2020

Run

Train GraphBP from scratch:

CUDA_VISIBLE_DEVICES=${your_gpu_id} python main.py

Note: GraphBP can be trained on a 48GB GPU with a batch size of 16. Our trained model is available here.

Generate atoms in the 3D space with the trained model:

CUDA_VISIBLE_DEVICES=${your_gpu_id} python main_gen.py

Postprocess and then save the generated molecules:

CUDA_VISIBLE_DEVICES=${your_gpu_id} python main_eval.py
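
The three commands above can also be chained from a small driver script. The sketch below assumes that each script is run from the repository root with its default settings, exactly as in the commands above; the names STAGES, build_env, and run_pipeline are illustrative and not part of this repository.

```python
import os
import subprocess
import sys

# Train, generate, then postprocess, in the order given in this README.
STAGES = ("main.py", "main_gen.py", "main_eval.py")


def build_env(gpu_id, base_env=None):
    """Return a copy of the environment with CUDA_VISIBLE_DEVICES set."""
    env = dict(os.environ if base_env is None else base_env)
    env["CUDA_VISIBLE_DEVICES"] = str(gpu_id)
    return env


def run_pipeline(gpu_id, stages=STAGES, runner=subprocess.run):
    """Run each stage in order on the chosen GPU."""
    env = build_env(gpu_id)
    for script in stages:
        # check=True raises CalledProcessError if a stage exits with a
        # non-zero status, so later stages do not run after a failure.
        runner([sys.executable, script], env=env, check=True)


if __name__ == "__main__":
    run_pipeline(gpu_id=0)
```

Running the stages separately, as in the commands above, remains the safer choice when training is long, since generation and postprocessing can then be repeated without retraining.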

Reference

@article{liu2022graphbp,
      title={Generating 3D Molecules for Target Protein Binding},
      author={Meng Liu and Youzhi Luo and Kanji Uchino and Koji Maruhashi and Shuiwang Ji},
      journal={arXiv preprint arXiv:2204.09410},
      year={2022},
}

Owner

DIVE Lab, Texas A&M University
