An attempt at the implementation of Glom, Geoffrey Hinton's new idea that integrates neural fields, predictive coding, top-down-bottom-up, and attention (consensus between columns)

Last update: Dec 14, 2022

Overview

GLOM - Pytorch (wip)

An attempt at the implementation of Glom, Geoffrey Hinton's new idea that integrates neural fields, predictive coding, top-down-bottom-up, and attention (consensus between columns) for emergent part-whole hierarchies from data.

Citations

@misc{hinton2021represent,
    title   = {How to represent part-whole hierarchies in a neural network}, 
    author  = {Geoffrey Hinton},
    year    = {2021},
    eprint  = {2102.12627},
    archivePrefix = {arXiv},
    primaryClass = {cs.CV}
}

Comments

help

Hello, when I tried to reproduce your model, I got this error. I'm not sure how to correct it. Can you help me?

Traceback (most recent call last):
  File "main.py", line 172, in <module>
    outputs = custom_model(images,iters = 12)
  File "/usr/local/lib/python3.8/dist-packages/torch/nn/modules/module.py", line 727, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/root/class/glom_pytorch/glom_pytorch.py", line 109, in forward
    consensus = self.attention(levels)
  File "/usr/local/lib/python3.8/dist-packages/torch/nn/modules/module.py", line 727, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/root/class/glom_pytorch/glom_pytorch.py", line 49, in forward
    sim.masked_fill(self_mask, TOKEN_ATTEND_SELF_VALUE)
RuntimeError: Expected object of scalar type Bool but got scalar type Float for argument #2 'mask' in call to th_masked_fill_bool
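The error message says that `masked_fill` received a float mask where it expected a boolean one. A minimal sketch of one possible workaround follows, assuming the mask is an identity matrix over tokens. The tensor sizes and the fill value here are placeholders, not values confirmed from the repository.

```python
import torch

# Placeholder fill value for illustration; the repository defines its own constant.
TOKEN_ATTEND_SELF_VALUE = -5e-4

num_tokens = 4
sim = torch.zeros(1, num_tokens, num_tokens)   # stand-in for the similarity matrix

# A float identity matrix is what the error message complains about.
float_mask = torch.eye(num_tokens)

# Converting the mask to bool gives masked_fill the dtype it expects.
self_mask = float_mask.bool()

# masked_fill is out-of-place: it returns a new tensor, so keep the result.
sim = sim.masked_fill(self_mask, TOKEN_ATTEND_SELF_VALUE)
```

Building the mask directly with `torch.eye(num_tokens, dtype=torch.bool)` avoids the conversion step.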

opened by DDxk369 1

Levels token

Hello, thank you for your good work. I was trying to implement the idea you shared in this todo:

https://github.com/lucidrains/glom-pytorch/projects/1#card-56284841

The text reads: allow each level to be represented by a list of tokens, updated with attention, similar to https://github.com/lucidrains/transformer-in-transformer

I was going to implement it with a simple token at each level, but I was wondering if you had any suggestion on how to implement it correctly. Thank you.

opened by zenos4mbu 0

Implementing geometric mean for consensus opinion/levels_mean

Hi, I'm trying to implement the consensus opinion (levels_mean) as a geometric mean of the top-down predictions, bottom-up predictions, attention-weighted average of same-level embeddings, and embeddings of the previous time step as described by the original paper. Any ideas on how the weights should be set?

At first I thought this could be a learnable parameter, but section 9.1 reads

For interpreting a static image with no temporal context, the weights used for this weighted geometric mean need to change during the iterations that occur after a new fixation.

which leads me to believe that these weights might need to be produced on the fly, a la vanilla attention, rather than learned as fixed parameters. Maybe an MLP that takes in the four source embeddings and outputs four scalars as weights?
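To make the question concrete, here is a minimal sketch of a weighted geometric mean over four sources. It assumes each source holds strictly positive values (for example, unnormalized probabilities), which is a simplification and not something the repository or the paper prescribes. The helper names and the example scores are hypothetical; in the idea above, the scores would come from an MLP at each iteration.

```python
import math

def softmax(scores):
    """Turn arbitrary scores into positive weights that sum to 1."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def weighted_geometric_mean(sources, weights):
    """Element-wise weighted geometric mean, computed in log space.

    sources: equal-length lists of strictly positive values.
    weights: one weight per source, assumed to sum to 1.
    """
    dim = len(sources[0])
    return [
        math.exp(sum(w * math.log(src[i]) for src, w in zip(sources, weights)))
        for i in range(dim)
    ]

# Hypothetical stand-ins for top-down, bottom-up, attention, and previous-step values.
sources = [[1.0, 4.0], [4.0, 1.0], [2.0, 2.0], [2.0, 2.0]]

# Equal scores give equal weights; an MLP could emit different scores per iteration.
weights = softmax([0.0, 0.0, 0.0, 0.0])
consensus = weighted_geometric_mean(sources, weights)
```

Working in log space turns the geometric mean into a weighted arithmetic mean of logs, so changing the weights between iterations only changes the coefficients of that sum.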

opened by ryan-caesar-ramos 0

Classification

Hi @lucidrains! Do you have any idea or insight on how to supervise classification (for example, MNIST digit classification) after having trained GLOM in an unsupervised way as a denoising autoencoder? In the paper that seems to be the final goal. However, it's not clear to me which columns and/or levels should be used for the classification. Also, since GLOM deals with patches, how can single black patches vote towards a certain digit?

In other words, after training GLOM as a denoising autoencoder on MNIST, what we have is:

p × p columns, where p is the number of patches per dimension (e.g. 7 × 7 = 49 patches)

6 levels for each column, where the top-most levels should in theory represent higher-level entities, so it seems natural to search for the digit information in these layers

6*2=12 iterations, to allow for information to be passed by both top-down and bottom-up networks

Simply applying dimensionality reduction to the top-most level at different iterations does not seem to be enough to make the digit clusters emerge. So I'm wondering if you (or anybody else) have some insights on this. Cheers!
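The numbers in the setup above can be checked with a few lines of arithmetic. This sketch assumes 28 × 28 MNIST images and a patch size of 4, which is inferred from the 7 × 7 grid and not stated in the post.

```python
image_size = 28   # MNIST images are 28 x 28 pixels
patch_size = 4    # assumption: chosen so that 28 / 4 gives 7 patches per side
levels = 6        # levels per column, as listed in the post

patches_per_side = image_size // patch_size   # p
num_columns = patches_per_side ** 2           # one column per patch
iterations = levels * 2                       # one pass up and one pass down

print(patches_per_side, num_columns, iterations)
```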

opened by A7ocin 1

Bug in forward?

Hello, thank you for making this code available! I think there could be a potential bug in the first line of the forward function:

b, h, w, _, device = *img.shape, img.device

but the input image has shape (b, c, h, w), so h and w pick up the channel and height values. It could be fixed by replacing that line with

b, _, h, w, device = *img.shape, img.device

Am I wrong?
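The difference between the two unpacking lines can be seen without PyTorch. The sketch below uses a hypothetical `FakeImage` class that models only the `.shape` and `.device` attributes of a tensor, with an example batch in (b, c, h, w) layout.

```python
from dataclasses import dataclass

@dataclass
class FakeImage:
    """Hypothetical stand-in for a tensor: only .shape and .device are modeled."""
    shape: tuple
    device: str = "cpu"

# Example batch: 2 images, 3 channels, height 224, width 112.
img = FakeImage(shape=(2, 3, 224, 112))

# Unpacking order from the issue: h and w receive the channel count and the height.
b, h, w, _, device = *img.shape, img.device
original_hw = (h, w)

# Proposed order: the channel dimension is skipped, so h and w are correct.
b, _, h, w, device = *img.shape, img.device
proposed_hw = (h, w)

print(original_hw, proposed_hw)
```

Using a non-square example makes the mix-up visible, since a square image would hide the difference between height and width.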

opened by A7ocin 9

Releases(0.0.14)

0.0.14 (Mar 27, 2021)
0.0.12 (Mar 6, 2021)
0.0.11 (Mar 5, 2021)
0.0.10a (Mar 5, 2021)
0.0.9a (Mar 5, 2021)
0.0.8 (Mar 5, 2021)
0.0.7 (Mar 5, 2021)
0.0.6 (Mar 5, 2021)
0.0.5 (Mar 5, 2021)
0.0.4 (Mar 5, 2021)
0.0.3 (Mar 5, 2021)
0.0.2 (Mar 5, 2021)

Owner

Phil Wang

Working with Attention. It's all we need.

