Implementation of E(n)-Transformer, which extends the ideas of Welling's E(n)-Equivariant Graph Neural Network to attention

Last update: Jan 02, 2023

Overview

E(n)-Equivariant Transformer (wip)

Implementation of E(n)-Equivariant Transformer, which extends the ideas from Welling's E(n)-Equivariant Graph Neural Network with attention.

Install

$ pip install En-transformer

Usage

import torch
from en_transformer import EnTransformer

model = EnTransformer(
    dim = 512,
    depth = 4,
    dim_head = 64,
    heads = 8,
    edge_dim = 4,
    fourier_features = 2
)

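feats = torch.randn(1, 16, 512)     # node features: (batch, nodes, dim)
coors = torch.randn(1, 16, 3)       # coordinates: (batch, nodes, 3)
edges = torch.randn(1, 16, 16, 4)   # edge features: (batch, nodes, nodes, edge_dim)

feats, coors = model(feats, coors, edges)  # (1, 16, 512), (1, 16, 3)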
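
The property the model is built around is E(n) equivariance: rotating and translating the input coordinates should rotate and translate the output coordinates in the same way. The sketch below illustrates that property on a toy coordinate update in plain Python. It does not call EnTransformer; `toy_coord_update`, `rotate_z`, `transform`, and the weight function are hypothetical stand-ins chosen only so the check runs without dependencies.

```python
import math

def toy_coord_update(coors):
    """Toy EGNN-style update: x_i' = x_i + sum_j (x_i - x_j) * w(|x_i - x_j|^2).

    The weight w depends only on the squared distance, which does not change
    under rotations or translations, so the update is equivariant.
    """
    out = []
    for i, xi in enumerate(coors):
        new = list(xi)
        for j, xj in enumerate(coors):
            if i == j:
                continue
            diff = [a - b for a, b in zip(xi, xj)]
            sq_dist = sum(d * d for d in diff)
            w = 1.0 / (1.0 + sq_dist)  # hypothetical stand-in for a learned network
            new = [n + w * d for n, d in zip(new, diff)]
        out.append(new)
    return out

def rotate_z(point, angle):
    """Rotate a 3D point about the z axis."""
    x, y, z = point
    c, s = math.cos(angle), math.sin(angle)
    return [c * x - s * y, s * x + c * y, z]

def transform(coors, angle, shift):
    """Apply the same rotation and translation to every point."""
    return [[r + t for r, t in zip(rotate_z(p, angle), shift)] for p in coors]

# Toy coordinates, chosen arbitrarily for illustration.
coors = [[0.0, 0.0, 0.0], [1.0, 0.5, -0.3], [-0.7, 2.0, 1.1], [0.4, -1.2, 0.8]]
angle, shift = 0.9, [1.5, -2.0, 0.25]

transform_then_update = toy_coord_update(transform(coors, angle, shift))
update_then_transform = transform(toy_coord_update(coors), angle, shift)

# Equivariance means both orders agree up to floating point error.
max_err = max(
    abs(p - q)
    for pa, pb in zip(transform_then_update, update_then_transform)
    for p, q in zip(pa, pb)
)
```

The same comparison could in principle be run against a real model by transforming `coors` before and after the forward pass, with the model in eval mode.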

Todo

masking
neighborhoods by radius

Citations

@misc{satorras2021en,
    title   = {E(n) Equivariant Graph Neural Networks},
    author  = {Victor Garcia Satorras and Emiel Hoogeboom and Max Welling},
    year    = {2021},
    eprint  = {2102.09844},
    archivePrefix = {arXiv},
    primaryClass = {cs.LG}
}

Comments

Checkpoint sequential segments should equal number of layers instead of 1?

https://github.com/lucidrains/En-transformer/blob/a37e635d93a322cafdaaf829397c601350b23e5b/en_transformer/en_transformer.py#L527

Looking at the source code here: https://pytorch.org/docs/stable/_modules/torch/utils/checkpoint.html#checkpoint_sequential

opened by aced125 2
On rotary embeddings

Hi @lucidrains, thank you for your amazing work; big fan! I had a quick question on the usage of this repository.

Based on my understanding, rotary embeddings are a drop-in replacement for the original sinusoidal or learnt PEs in Transformers for sequential data, as in NLP or other temporal applications. If my application is not on sequential data, is there a reason why I should still use rotary embeddings?

E.g. for molecular datasets such as QM9 (from the En-GNNs paper), would it make sense to have rotary embeddings?

opened by chaitjo 1
Is this line required?

https://github.com/lucidrains/En-transformer/blob/7247e258fab953b2a8b5a73b8dfdfb72910711f8/en_transformer/en_transformer.py#L159

Is this line required? Does line 157, two lines above, make this line redundant?

opened by aced125 1
Performance drop with checkpointing update

I see a drop in performance (higher loss) when I update checkpointing from checkpoint_sequential(self.layers, 1, inp) to checkpoint_sequential(self.layers, len(self.layers), inp). Is this expected?

opened by heiidii 0
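
Both checkpointing threads depend on how checkpoint_sequential splits the layers into segments. The sketch below reproduces that splitting in plain Python, based on one reading of the PyTorch source linked in the first thread, in which the final segment runs without checkpointing. That reading is an assumption and should be checked against the installed PyTorch version; `segment_plan` is a hypothetical helper, not part of PyTorch or this repository.

```python
def segment_plan(num_layers, segments):
    """Return (checkpointed, final) as inclusive layer index ranges.

    Assumption: mirrors the chunking loop in the linked PyTorch source,
    where every segment except the last is wrapped in a checkpoint.
    """
    segment_size = num_layers // segments
    checkpointed = []
    end = -1
    for start in range(0, segment_size * (segments - 1), segment_size):
        end = start + segment_size - 1
        checkpointed.append((start, end))
    final = (end + 1, num_layers - 1)  # executed normally, activations kept
    return checkpointed, final

plan_one = segment_plan(4, 1)  # segments = 1
plan_all = segment_plan(4, 4)  # segments = number of layers
```

Under this reading, `segments = 1` leaves the checkpointed list empty, so all four layers fall into the final, non-checkpointed range, while `segments = 4` checkpoints the first three layers individually.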
varying number of nodes

@lucidrains Thank you for your efficient implementation. I was wondering how to use it on datasets where the number of nodes differs from graph to graph, for example datasets of small molecules?

opened by mohaiminul2810 1
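
One common way to batch graphs of different sizes is to pad them to a shared length and carry a boolean mask that marks the real nodes. The sketch below shows only that preprocessing step in plain Python. Masking is listed under Todo, so how (or whether) the model consumes such a mask is not assumed here; `pad_graphs` and the toy coordinates are hypothetical.

```python
def pad_graphs(graphs, pad_value=0.0):
    """Pad per-graph coordinate lists to a common length and build a mask.

    graphs: list of graphs, each a list of [x, y, z] coordinates.
    Returns (padded, mask), where mask[b][i] is True for real nodes
    and False for padding.
    """
    max_nodes = max(len(g) for g in graphs)
    padded, mask = [], []
    for g in graphs:
        n_pad = max_nodes - len(g)
        padded.append([list(p) for p in g] + [[pad_value] * 3 for _ in range(n_pad)])
        mask.append([True] * len(g) + [False] * n_pad)
    return padded, mask

# Two toy graphs with 3 and 2 nodes (illustrative values only).
graph_a = [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
graph_b = [[0.0, 0.0, 0.0], [0.5, 0.5, 0.0]]

padded, mask = pad_graphs([graph_a, graph_b])
```

Node features and edge features would need the same padding along their node dimensions so that all tensors in a batch line up.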
Edge model/rep

Hi,

Thank you for providing this version of the EnGNN model. This is not really an issue just a query. The original model as implemented here (https://github.com/vgsatorras/egnn) has 3 main steps per layer:

edge_feat = self.edge_model(h[row], h[col], radial, edge_attr)
coord = self.coord_model(coord, edge_index, coord_diff, edge_feat)
h, agg = self.node_model(h, edge_index, edge_feat, node_attr)

I am interested in the edge_feat and was wondering what would be an equivalent edge representation in your implementation. Line 335 in EnTransformer.py: qk = self.edge_mlp(qk) seems like the best candidate.

Thanks, Pooja

opened by heiidii 1
efficient implementation

Hi, I wonder if relative distances and coordinates can be handled more efficiently using memory-efficient attention, as in "Self-attention Does Not Need O(n^2) Memory". It is straightforward for the scalar part.

opened by amrhamedp 2
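
For the scalar part mentioned in this thread, the usual trick is to process keys and values in chunks while keeping a running maximum and a running normalizer, so the full row of attention scores is never stored. The sketch below shows that idea for a single query in plain Python and compares it with the direct computation. It covers only ordinary scalar attention, not the relative distances or coordinate updates the question asks about; all function names and the toy data are hypothetical.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def attend_full(q, keys, values):
    """Reference: softmax over all scores at once."""
    scores = [dot(q, k) for k in keys]
    top = max(scores)
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    return [sum(w * v[d] for w, v in zip(weights, values)) / total
            for d in range(len(values[0]))]

def attend_chunked(q, keys, values, chunk_size=2):
    """Same output, holding at most chunk_size scores at a time."""
    dim = len(values[0])
    running_max, denom, acc = float("-inf"), 0.0, [0.0] * dim
    for start in range(0, len(keys), chunk_size):
        k_chunk = keys[start:start + chunk_size]
        v_chunk = values[start:start + chunk_size]
        scores = [dot(q, k) for k in k_chunk]
        new_max = max(running_max, max(scores))
        # Rescale the partial sums accumulated under the old maximum.
        rescale = math.exp(running_max - new_max)
        weights = [math.exp(s - new_max) for s in scores]
        denom = denom * rescale + sum(weights)
        acc = [a * rescale + sum(w * v[d] for w, v in zip(weights, v_chunk))
               for d, a in enumerate(acc)]
        running_max = new_max
    return [a / denom for a in acc]

# Toy data, chosen arbitrarily for illustration.
q = [0.5, -1.0]
keys = [[1.0, 0.0], [0.0, 1.0], [2.0, 1.0], [-1.0, 0.5], [0.3, 0.3]]
values = [[1.0, 2.0], [0.0, -1.0], [3.0, 0.5], [-2.0, 4.0], [1.5, 1.5]]

full = attend_full(q, keys, values)
chunked = attend_chunked(q, keys, values, chunk_size=2)
max_diff = max(abs(a - b) for a, b in zip(full, chunked))
```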

Releases(1.0.2)

1.0.2(Jan 4, 2023)
1.0.1(Dec 30, 2022)
1.0.0(Dec 30, 2022)
0.6.0(Nov 24, 2022)
0.5.4(Mar 4, 2022)
0.5.3(Mar 4, 2022)
0.5.2(Mar 4, 2022)
0.5.1(Nov 27, 2021)
0.5.0(Aug 27, 2021)
0.4.0(Aug 25, 2021)
0.3.9(Aug 25, 2021)
0.3.8(Jun 10, 2021)
0.3.7(Jun 10, 2021)
0.3.6(Jun 8, 2021)
0.3.5(Jun 6, 2021)
0.3.4(Jun 5, 2021)
0.3.3(Jun 5, 2021)
0.3.2(Jun 5, 2021)
0.3.1(Jun 4, 2021)
0.3.0(Jun 4, 2021)
0.2.12(May 27, 2021)
0.2.11(May 27, 2021)
0.2.10(May 27, 2021)
0.2.8(May 17, 2021)
0.2.7(May 17, 2021)
0.2.6(May 16, 2021)
0.2.5(May 16, 2021)
0.2.4(May 16, 2021)
0.2.3(May 16, 2021)
0.2.2(May 15, 2021)

Owner

Phil Wang

Working with Attention. It's all we need.

