Implementation of E(n)-Transformer, which extends the ideas of Welling's E(n)-Equivariant Graph Neural Network to attention

Last update: Jan 02, 2023

Overview

E(n)-Equivariant Transformer (wip)

Implementation of E(n)-Equivariant Transformer, which extends the ideas from Welling's E(n)-Equivariant Graph Neural Network with attention.

Install

$ pip install En-transformer

Usage

import torch
from en_transformer import EnTransformer

model = EnTransformer(
    dim = 512,
    depth = 4,
    dim_head = 64,
    heads = 8,
    edge_dim = 4,
    fourier_features = 2
)

feats = torch.randn(1, 16, 512)
coors = torch.randn(1, 16, 3)
edges = torch.randn(1, 16, 16, 4)

feats, coors = model(feats, coors, edges)  # (1, 16, 512), (1, 16, 3)

Todo

masking
neighborhoods by radius

Citations

@misc{satorras2021en,
    title 	= {E(n) Equivariant Graph Neural Networks}, 
    author 	= {Victor Garcia Satorras and Emiel Hoogeboom and Max Welling},
    year 	= {2021},
    eprint 	= {2102.09844},
    archivePrefix = {arXiv},
    primaryClass = {cs.LG}
}

Comments

Checkpoint sequential segments should equal number of layers instead of 1?

https://github.com/lucidrains/En-transformer/blob/a37e635d93a322cafdaaf829397c601350b23e5b/en_transformer/en_transformer.py#L527

Looking at the source code here: https://pytorch.org/docs/stable/_modules/torch/utils/checkpoint.html#checkpoint_sequential

opened by aced125 2
On rotary embeddings

Hi @lucidrains, thank you for your amazing work; big fan! I had a quick question on the usage of this repository.

Based on my understanding, rotary embeddings are a drop-in replacement for the original sinusoidal or learnt PEs in Transformers for sequential data, as in NLP or other temporal applications. If my application is not on sequential data, is there a reason why I should still use rotary embeddings?

E.g. for molecular datasets such as QM9 (from the En-GNNs paper), would it make sense to have rotary embeddings?

opened by chaitjo 1
Is this line required?

https://github.com/lucidrains/En-transformer/blob/7247e258fab953b2a8b5a73b8dfdfb72910711f8/en_transformer/en_transformer.py#L159

Is this line required? Does line 157, two lines above, make this line redundant?

opened by aced125 1
Performance drop with checkpointing update

I see a drop in performance (higher loss) when I update checkpointing from checkpoint_sequential(self.layers, 1, inp) to checkpoint_sequential(self.layers, len(self.layers), inp). Is this expected?

opened by heiidii 0
varying number of nodes

@lucidrains Thank you for your efficient implementation. I was wondering how to use this implementation for the dataset when the number of nodes in each graph is not the same? For example, the datasets of small molecules.

opened by mohaiminul2810 1
Edge model/rep

Hi,

Thank you for providing this version of the EnGNN model. This is not really an issue just a query. The original model as implemented here (https://github.com/vgsatorras/egnn) has 3 main steps per layer: edge_feat = self.edge_model(h[row], h[col], radial, edge_attr) coord = self.coord_model(coord, edge_index, coord_diff, edge_feat) h, agg = self.node_model(h, edge_index, edge_feat, node_attr) I am interested in the edge_feat and was wondering what would be an equivalent edge representation in your implementation. Line 335 in EnTransformer.py: qk = self.edge_mlp(qk) seems like the best candidate. Thanks, Pooja

opened by heiidii 1
efficient implementation

Hi, I wonder if relative distances and coordinates can be handled more efficiently using memory efficient attention as in " Self-attention Does Not Need O(n^2) Memory". It is straightforward for the scalar part.

opened by amrhamedp 2

Releases(1.0.2)

1.0.2(Jan 4, 2023)

null
Source code(tar.gz)
Source code(zip)
1.0.1(Dec 30, 2022)

null
Source code(tar.gz)
Source code(zip)
1.0.0(Dec 30, 2022)

null
Source code(tar.gz)
Source code(zip)
0.6.0(Nov 24, 2022)

null
Source code(tar.gz)
Source code(zip)
0.5.4(Mar 4, 2022)

Source code(tar.gz)
Source code(zip)
0.5.3(Mar 4, 2022)

Source code(tar.gz)
Source code(zip)
0.5.2(Mar 4, 2022)

Source code(tar.gz)
Source code(zip)
0.5.1(Nov 27, 2021)

Source code(tar.gz)
Source code(zip)
0.5.0(Aug 27, 2021)

Source code(tar.gz)
Source code(zip)
0.4.0(Aug 25, 2021)

Source code(tar.gz)
Source code(zip)
0.3.9(Aug 25, 2021)

Source code(tar.gz)
Source code(zip)
0.3.8(Jun 10, 2021)

Source code(tar.gz)
Source code(zip)
0.3.7(Jun 10, 2021)

Source code(tar.gz)
Source code(zip)
0.3.6(Jun 8, 2021)

Source code(tar.gz)
Source code(zip)
0.3.5(Jun 6, 2021)

Source code(tar.gz)
Source code(zip)
0.3.4(Jun 5, 2021)

Source code(tar.gz)
Source code(zip)
0.3.3(Jun 5, 2021)

Source code(tar.gz)
Source code(zip)
0.3.2(Jun 5, 2021)

Source code(tar.gz)
Source code(zip)
0.3.1(Jun 4, 2021)

Source code(tar.gz)
Source code(zip)
0.3.0(Jun 4, 2021)

Source code(tar.gz)
Source code(zip)
0.2.12(May 27, 2021)

Source code(tar.gz)
Source code(zip)
0.2.11(May 27, 2021)

Source code(tar.gz)
Source code(zip)
0.2.10(May 27, 2021)

Source code(tar.gz)
Source code(zip)
0.2.8(May 17, 2021)

Source code(tar.gz)
Source code(zip)
0.2.7(May 17, 2021)

Source code(tar.gz)
Source code(zip)
0.2.6(May 16, 2021)

Source code(tar.gz)
Source code(zip)
0.2.5(May 16, 2021)

Source code(tar.gz)
Source code(zip)
0.2.4(May 16, 2021)

Source code(tar.gz)
Source code(zip)
0.2.3(May 16, 2021)

Source code(tar.gz)
Source code(zip)
0.2.2(May 15, 2021)

Source code(tar.gz)
Source code(zip)

Owner

Phil Wang

Working with Attention. It's all we need.

GitHub Repository

Transformer part of 12th place solution in Riiid! Answer Correctness Prediction

kaggle_riiid Transformer part of 12th place solution in Riiid! Answer Correctness Prediction. Please see here for more information. Execution You need

2 Apr 23, 2022

Semantically Contrastive Learning for Low-light Image Enhancement

Semantically Contrastive Learning for Low-light Image Enhancement Here, we propose an effective semantically contrastive learning paradigm for Low-lig

48 Dec 16, 2022

Pytorch cuda extension of grid_sample1d

Grid Sample 1d pytorch cuda extension of grid sample 1d. Since pytorch only supports grid sample 2d/3d, I extend the 1d version for efficiency. The fo

24 Dec 03, 2022

An All-MLP solution for Vision, from Google AI

MLP Mixer - Pytorch An All-MLP solution for Vision, from Google AI, in Pytorch. No convolutions nor attention needed! Yannic Kilcher video Install $ p

784 Jan 06, 2023

AITUS - An atomatic notr maker for CYTUS

AITUS an automatic note maker for CYTUS. 利用AI根据指定乐曲生成CYTUS游戏谱面。效果展示：https://www

6 Feb 24, 2022

CZU-MHAD: A multimodal dataset for human action recognition utilizing a depth camera and 10 wearable inertial sensors

CZU-MHAD: A multimodal dataset for human action recognition utilizing a depth camera and 10 wearable inertial sensors In order to facilitate the res

11 Dec 12, 2022

Download & Install mods for your favorit game with a few simple clicks

Husko's SteamWorkshop Downloader 🔴 IMPORTANT ❗ 🔴 The Tool is currently being rewritten so updates will be slow and only on the dev branch until it i

67 Nov 25, 2022

A repository for the paper "Improved Adversarial Systems for 3D Object Generation and Reconstruction".

Improved Adversarial Systems for 3D Object Generation and Reconstruction: This is a repository for the paper "Improved Adversarial Systems for 3D Obje

188 Dec 25, 2022

Learning Features with Parameter-Free Layers (ICLR 2022)

Learning Features with Parameter-Free Layers (ICLR 2022) Dongyoon Han, YoungJoon Yoo, Beomyoung Kim, Byeongho Heo | Paper NAVER AI Lab, NAVER CLOVA Up

65 Dec 07, 2022

Official PyTorch implementation of "Edge Rewiring Goes Neural: Boosting Network Resilience via Policy Gradient".

Edge Rewiring Goes Neural: Boosting Network Resilience via Policy Gradient This repository is the official PyTorch implementation of "Edge Rewiring Go

4 Dec 12, 2022

This is an implementation of PIFuhd based on Pytorch

Open-PIFuhd This is a unofficial implementation of PIFuhd PIFuHD: Multi-Level Pixel-Aligned Implicit Function forHigh-Resolution 3D Human Digitization

235 Dec 19, 2022

PAMI stands for PAttern MIning. It constitutes several pattern mining algorithms to discover interesting patterns in transactional/temporal/spatiotemporal databases

Introduction PAMI stands for PAttern MIning. It constitutes several pattern mining algorithms to discover interesting patterns in transactional/tempor

43 Jan 08, 2023

基于Flask开发后端、VUE开发前端框架，在WEB端部署YOLOv5目标检测模型

37 Jan 01, 2023

Pytorch implementations of Bayes By Backprop, MC Dropout, SGLD, the Local Reparametrization Trick, KF-Laplace, SG-HMC and more

Bayesian Neural Networks Pytorch implementations for the following approximate inference methods: Bayes by Backprop Bayes by Backprop + Local Reparame

1.4k Jan 07, 2023

Optimizing Deeper Transformers on Small Datasets

DT-Fixup Optimizing Deeper Transformers on Small Datasets Paper published in ACL 2021: arXiv Detailed instructions to replicate our results in the pap

16 Nov 14, 2022

Repo for "Benchmarking Robustness of 3D Point Cloud Recognition against Common Corruptions" https://arxiv.org/abs/2201.12296

Benchmarking Robustness of 3D Point Cloud Recognition against Common Corruptions This repo contains the dataset and code for the paper Benchmarking Ro

168 Dec 29, 2022

Implementation of E(n)-Transformer, which extends the ideas of Welling's E(n)-Equivariant Graph Neural Network to attention

Related tags

Overview

E(n)-Equivariant Transformer (wip)

Install

Usage

Todo

Citations

Comments

Checkpoint sequential segments should equal number of layers instead of 1?

On rotary embeddings

Is this line required?

Performance drop with checkpointing update

varying number of nodes

Edge model/rep

efficient implementation

Releases(1.0.2)

1.0.2(Jan 4, 2023)

1.0.1(Dec 30, 2022)

1.0.0(Dec 30, 2022)

0.6.0(Nov 24, 2022)

0.5.4(Mar 4, 2022)

0.5.3(Mar 4, 2022)

0.5.2(Mar 4, 2022)

0.5.1(Nov 27, 2021)

0.5.0(Aug 27, 2021)

0.4.0(Aug 25, 2021)

0.3.9(Aug 25, 2021)

0.3.8(Jun 10, 2021)

0.3.7(Jun 10, 2021)

0.3.6(Jun 8, 2021)

0.3.5(Jun 6, 2021)

0.3.4(Jun 5, 2021)

0.3.3(Jun 5, 2021)

0.3.2(Jun 5, 2021)

0.3.1(Jun 4, 2021)

0.3.0(Jun 4, 2021)

0.2.12(May 27, 2021)

0.2.11(May 27, 2021)

0.2.10(May 27, 2021)

0.2.8(May 17, 2021)

0.2.7(May 17, 2021)

0.2.6(May 16, 2021)

0.2.5(May 16, 2021)

0.2.4(May 16, 2021)

0.2.3(May 16, 2021)

0.2.2(May 15, 2021)