Extract atomic fingerprints from molecules using pretrained GROVER

This project uses a pretrained GROVER model to extract atomic fingerprints from molecules. The fingerprints can then be used as input features for downstream tasks.

GROVER stands for Graph Representation frOm self-superVised mEssage passing tRansformer. It is a Transformer-based, self-supervised message-passing neural network introduced by Rong and colleagues in the paper "Self-Supervised Graph Transformer on Large-Scale Molecular Data".

Install requirements

Create and activate a conda environment:

conda create --name grover python=3.6.8
conda activate grover

Install the requirements listed in requirements.txt, then install torchinfo:

conda install -c conda-forge -c pytorch -c acellera -c RMG --file=requirements.txt
pip install torchinfo

Download the pretrained models

There are two pretrained models provided by the original authors. Download, extract and save the .pt file in models_pretrained/.

Infer fingerprints

Run the main.py file:

python main.py

Details about the arguments can be found in the setup_parser() function in main.py, or by running:

python main.py -h

If no arguments are specified, then the default arguments will be used.

By default, the outputs are saved in extracted_fingerprint. There are three output files:

atom_fp.npy: contains the atomic fingerprints.
distance.npy: contains the pair-wise shortest relative distance matrices between nodes of the molecular graphs.
smiles.txt: contains the SMILES strings of the molecules.
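To make the contents of distance.npy more concrete, here is a minimal sketch of how a pair-wise shortest-path distance matrix can be computed for one molecular graph. It assumes that "distance" means the number of bonds on the shortest path between two atoms, and it uses a plain breadth-first search rather than whatever main.py does internally. The function name shortest_path_matrix and the bond-list input format are illustrative only.

```python
from collections import deque


def shortest_path_matrix(num_atoms, bonds):
    """Return an N x N list of shortest-path lengths, counted in bonds."""
    # Build an undirected adjacency list from the list of bonded atom pairs.
    neighbors = [[] for _ in range(num_atoms)]
    for a, b in bonds:
        neighbors[a].append(b)
        neighbors[b].append(a)

    # -1 marks atom pairs with no connecting path (disconnected fragments).
    dist = [[-1] * num_atoms for _ in range(num_atoms)]
    for start in range(num_atoms):
        dist[start][start] = 0
        queue = deque([start])
        # Breadth-first search visits atoms in order of increasing distance.
        while queue:
            current = queue.popleft()
            for nxt in neighbors[current]:
                if dist[start][nxt] == -1:
                    dist[start][nxt] = dist[start][current] + 1
                    queue.append(nxt)
    return dist


# Ethanol (SMILES "CCO"), heavy atoms only: C0 - C1 - O2
ethanol = shortest_path_matrix(3, [(0, 1), (1, 2)])
print(ethanol)  # [[0, 1, 2], [1, 0, 1], [2, 1, 0]]
```

The matrix is symmetric with zeros on the diagonal, so each molecule with N atoms contributes one N x N matrix.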

To read the .npy files, refer to the numpy.save documentation.
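As a rough illustration, the sketch below loads the three output files with numpy.load. It assumes the arrays were saved as object arrays holding one entry per molecule, since molecules have different numbers of atoms. That layout is an assumption, not something confirmed by the repository. The helpers load_outputs and write_demo_outputs are hypothetical, and the demo writes small synthetic stand-in files so the example runs without the real outputs.

```python
import os
import tempfile

import numpy as np


def load_outputs(output_dir):
    """Load atom_fp.npy, distance.npy and smiles.txt from output_dir."""
    # allow_pickle=True is required by numpy.load for object arrays.
    # Only enable it for files you trust: unpickling can run arbitrary code.
    atom_fp = np.load(os.path.join(output_dir, "atom_fp.npy"), allow_pickle=True)
    distance = np.load(os.path.join(output_dir, "distance.npy"), allow_pickle=True)
    with open(os.path.join(output_dir, "smiles.txt")) as f:
        smiles = [line.strip() for line in f if line.strip()]
    return atom_fp, distance, smiles


def write_demo_outputs(output_dir):
    """Write synthetic stand-in files (assumed layout, not real GROVER output)."""
    smiles = ["CCO", "C"]
    atom_counts = [3, 1]  # heavy atoms per molecule
    fp_dim = 4            # hypothetical fingerprint size, chosen for the demo
    atom_fp = np.empty(len(smiles), dtype=object)
    distance = np.empty(len(smiles), dtype=object)
    for i, n in enumerate(atom_counts):
        atom_fp[i] = np.zeros((n, fp_dim), dtype=np.float32)
        distance[i] = np.zeros((n, n), dtype=np.int64)
    np.save(os.path.join(output_dir, "atom_fp.npy"), atom_fp)
    np.save(os.path.join(output_dir, "distance.npy"), distance)
    with open(os.path.join(output_dir, "smiles.txt"), "w") as f:
        f.write("\n".join(smiles) + "\n")


with tempfile.TemporaryDirectory() as demo_dir:
    write_demo_outputs(demo_dir)
    atom_fp, distance, smiles = load_outputs(demo_dir)
    for smi, fp, dist in zip(smiles, atom_fp, distance):
        print(smi, fp.shape, dist.shape)
```

To try this on real outputs, call load_outputs("extracted_fingerprint") and inspect the dtype and shape of the returned arrays. If they turn out to be regular numeric arrays, allow_pickle=True can be dropped.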

Owner

Xuan Vu Nguyen
