VOneNet: CNNs with a Primary Visual Cortex Front-End

Last update: Dec 22, 2022

Related tags

Deep Learning vonenet

Overview

VOneNet: CNNs with a Primary Visual Cortex Front-End

A family of biologically-inspired Convolutional Neural Networks (CNNs). VOneNets have the following features:

Fixed-weight neural network model of the primate primary visual cortex (V1) as the front-end.
Robust to image perturbations
Brain-mapped
Flexible: can be adapted to different back-end architectures

Available Models

(Click on model names to download the weights of ImageNet-trained models. Alternatively, you can use the function get_model in the vonenet package to download the weights.)

Name	Description
VOneResNet50	Our best performing VOneNet with a ResNet50 back-end
VOneCORnet-S	VOneNet with a recurrent neural network back-end based on the CORnet-S
VOneAlexNet	VOneNet with a back-end based on AlexNet

Quick Start

VOneNets was trained with images normalized with mean=[0.5,0.5,0.5] and std=[0.5,0.5,0.5]

More information coming soon...

Longer Motivation

Current state-of-the-art object recognition models are largely based on convolutional neural network (CNN) architectures, which are loosely inspired by the primate visual system. However, these CNNs can be fooled by imperceptibly small, explicitly crafted perturbations, and struggle to recognize objects in corrupted images that are easily recognized by humans. Recently, we observed that CNN models with a neural hidden layer that better matches primate primary visual cortex (V1) are also more robust to adversarial attacks. Inspired by this observation, we developed VOneNets, a new class of hybrid CNN vision models. Each VOneNet contains a fixed weight neural network front-end that simulates primate V1, called the VOneBlock, followed by a neural network back-end adapted from current CNN vision models. The VOneBlock is based on a classical neuroscientific model of V1: the linear-nonlinear-Poisson model, consisting of a biologically-constrained Gabor filter bank, simple and complex cell nonlinearities, and a V1 neuronal stochasticity generator. After training, VOneNets retain high ImageNet performance, but each is substantially more robust, outperforming the base CNNs and state-of-the-art methods by 18% and 3%, respectively, on a conglomerate benchmark of perturbations comprised of white box adversarial attacks and common image corruptions. Additionally, all components of the VOneBlock work in synergy to improve robustness. Read more: Dapello*, Marques*, et al. (biorxiv, 2020)

Requirements

Python 3.6+
PyTorch 0.4.1+
numpy
pandas
tqdm
scipy

Citation

Dapello, J., Marques, T., Schrimpf, M., Geiger, F., Cox, D.D., DiCarlo, J.J. (2020) Simulating a Primary Visual Cortex at the Front of CNNs Improves Robustness to Image Perturbations. biorxiv. doi.org/10.1101/2020.06.16.154542

License

GNU GPL 3+

FAQ

Soon...

Setup and Run

You need to clone it in your local repository $ git clone https://github.com/dicarlolab/vonenet.git
And when you setup its codes, you must need 'val' directory. so here is link. this link is from Korean's blog I refered as below https://seongkyun.github.io/others/2019/03/06/imagenet_dn/
```
** Download link**
```

https://academictorrents.com/collection/imagenet-2012

Once you download that large tar files, you must unzip that files -- all instructions below are refered above link, I only translate it

Unzip training dataset

$ mkdir train && mb ILSVRC2012_img_train.tar train/ && cd train $ tar -xvf ILSVRC2012_img_train.tar $ rm -f ILSVRC2012_img_train.tar (If you want to remove zipped file(tar)) $ find . -name "*.tar" | while read NAME ; do mkdir -p "${NAME%.tar}"; tar -xvf "${NAME}" -C "${NAME%.tar}"; rm -f "${NAME}"; done $ cd ..

Unzip validation dataset

$ mkdir val && mv ILSVRC2012_img_val.tar val/ && cd val && tar -xvf ILSVRC2012_img_val.tar $ wget -qO- https://raw.githubusercontent.com/soumith/imagenetloader.torch/master/valprep.sh | bash

when it's finished, you can see train directory, val directory that 'val' directory is needed when setting up

Caution!!!!

after all execution above, must remove directory or file not having name n0000 -> there will be fault in training -> ex) 'ILSVRC2012_img_train' in train directory, 'ILSVRC2012_img_val.tar' in val directory

if you've done getting data, then we can setting up go to local repository which into you cloned and open terminal (you must check your versions of python, pytorch, cudatoolkit if okay then,) $ python3 setup.py install $ python3 run.py --in_path {directory including above dataset, 'val' directory must be in!}

If you see any GPU related problem especially 'GPU is not available' although you already got

$ python3 run.py --in_path {directory including above dataset, 'val' directory must be in!} --ngpus 0

ngpus is 1 as default. if you don't care running on CPU you do so

Comments

GPU requirements

Hi! Thank you so much for releasing the code!

If I wanted to train the VOneResNet50 on a NVIDIA GeForce RTX 2070 how long should I expect it to take? I'm new to training neural networks this big and am working on a small project for a course, so it would be good to have an estimate.

Thank you so much!

Maria Inês

opened by mariainescravo 4
k_exc parameter

Hi,

Thanks for releasing your code! Quick question- what is the significance of the k_exc parameter used in the V1 block?

https://github.com/dicarlolab/vonenet/blob/master/vonenet/modules.py#L91

Norman

opened by normster 4
Robust Accuracy results not matching
Firstly, thank you for open sourcing the code for your paper. It has been really helpful !!

I had a small query regarding the robust evaluation of models. I tried to evaluate the pretrained VoneResNet50 model with standard PGD with EOT and I get the following results:

robust accuracy (top1):0.3666 robust accuracy (top5):0.635

My PGD parameters were as follows :

iterations : 64 norm : L inifity epsilon: 0.0009803921569 (= 1/1020) eot_iterations : 8 Library: advertorch

I used the code in this PR and also checked with another library

It seems like the top-5 accuracy is closer to the accuracy mentioned in the paper. I'm confused since the paper mentions that the accuracy is always top-1?
opened by code-Assasin 3
Can you provide the trained VOneNet model file onto google drive?

Can you provide the trained VOneNet model file onto google drive so that I can download for my experiments. CIFAR-10, CIFAR-100, ImageNet datasets, do you have the trained model file??

opened by machanic 2
Update README.md

There are problems in line 17, 18, 19 README.md. Because When I finished download, system tells me this is wrong extension.

and add setup and run instructions. please check it and if there some error, please correct it

opened by comeeasy 1
explaining neural variances

Thank you for the code for the V1Block. Interesting work!

I was wondering how you exactly compared regular convolutional features and the ones from VOneNet to explain the Neural Variances.

Since the paper stresses that this model is SoTA in explaining these, I would be really glad if you can include the code for that too / or if you could point me to existing repositories that do that (if you are aware of any), that'd be great too!

Thanks again!

opened by vinbhaskara 1
fix: added missing argument for restoring model training

For restoring the model training, the code already provided the logic but forgot to add the argument to the parser. Now it is able to restore the model training providing the epoch number and the path containing those files.

opened by ALLIESXO 0
How to test the top-scoring Brain Score model - vonenet-resnet50-non-stochastic?

Hi, I am trying to understand what's the correct way to test (using the pretrained model trained on ImageNet) the voneresnet-50-non_stochastic model that is currently scoring two on Brain Score.

I want the model to be pretrained on ImageNet. When loading the model through net = vonenet.get_model(model_arch='resnet50', pretrained=True) a state_dict file that already contains the noise_level, noise_scale and noise_mode parameter gets loaded (in vonenet/__init__.py line 38. Do the pretrained model performance depends on these values to be fixed at 'neuronal', 0.35 and 0.07? Or can set one of these to 0 (which one?) and just keep using the same pretrained model for testing?

Thanks, Valerio

opened by ValerioB88 0
Alignment of quadrutre pairs (q0 and q1) in terms of input channels?
Hi Tiago and Joel, this is a very cool project.

The initialize method of the GFB class doesn't set the random seed of randint:

def initialize(self, sf, theta, sigx, sigy, phase): random_channel = torch.randint(0, self.in_channels, (self.out_channels,))

Doesn't this cause the filters of simple_conv_q0 and simple_conv_q1 to be misaligned in terms of input channels?
opened by Tal-Golan 1
add example of adversarial evaluation

check out my attack example and let me know what you think.

I made it entirely self contained in adv_evaluate.py, and I added an example to the README.md

opened by dapello 0

Releases(v0.1)

v0.1(Jun 30, 2020)

Weights of the main models presented in the paper Dapello*, Marques*, et. al. Simulating a Primary Visual Cortex at the Front of CNNs Improves Robustness to Image Perturbations
Source code(tar.gz)
Source code(zip)
vonealexnet_e70.pth.tar(577.56 MB)
vonecornets_e70.pth.tar(414.95 MB)
voneresnet50_e70.pth.tar(202.76 MB)

Owner

The DiCarlo Lab at MIT

Working to discover the neuronal algorithms underlying visual object recognition

GitHub Repository

Companion code for "Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees"

Companion code for "Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees" Installa

0 Oct 13, 2021

A Decentralized Omnidirectional Visual-Inertial-UWB State Estimation System for Aerial Swar.

Omni-swarm A Decentralized Omnidirectional Visual-Inertial-UWB State Estimation System for Aerial Swarm Introduction Omni-swarm is a decentralized omn

99 Dec 23, 2022

Score refinement for confidence-based 3D multi-object tracking

Score refinement for confidence-based 3D multi-object tracking Our video gives a brief explanation of our Method. This is the official code for the pa

47 Dec 26, 2022

L-Verse: Bidirectional Generation Between Image and Text

Far beyond learning long-range interactions of natural language, transformers are becoming the de-facto standard for many vision tasks with their power and scalabilty

102 Dec 21, 2022

PyTorch common framework to accelerate network implementation, training and validation

pytorch-framework PyTorch common framework to accelerate network implementation, training and validation. This framework is inspired by works from MML

3 Dec 19, 2022

Virtual Dance Reality Stage is a feature that offers you to share a stage with another user virtually.

Virtual Dance Reality Stage is a feature that offers you to share a stage with another user virtually. It uses the concept of Image Background Removal using DeepLab Architecture (based on Semantic Se

5 Aug 24, 2022

This repository lets you interact with Lean through a REPL.

lean-gym This repository lets you interact with Lean through a REPL. See Formal Mathematics Statement Curriculum Learning for a presentation of lean-g

87 Dec 28, 2022

[CVPR 2021] Exemplar-Based Open-Set Panoptic Segmentation Network (EOPSN)

EOPSN: Exemplar-Based Open-Set Panoptic Segmentation Network (CVPR 2021) PyTorch implementation for EOPSN. We propose open-set panoptic segmentation t

49 Dec 30, 2022

Implementation for "Conditional entropy minimization principle for learning domain invariant representation features"

Implementation for "Conditional entropy minimization principle for learning domain invariant representation features". The code is reproduced from thi

1 Nov 02, 2022

This repository is an official implementation of the paper MOTR: End-to-End Multiple-Object Tracking with TRansformer.

MOTR: End-to-End Multiple-Object Tracking with TRansformer This repository is an official implementation of the paper MOTR: End-to-End Multiple-Object

348 Jan 07, 2023

A collection of papers about Transformer in the field of medical image analysis.

377 Jan 05, 2023

Official code for "Mean Shift for Self-Supervised Learning"

MSF Official code for "Mean Shift for Self-Supervised Learning" Requirements Python = 3.7.6 PyTorch = 1.4 torchvision = 0.5.0 faiss-gpu = 1.6.1 In

44 Nov 21, 2022

The dataset and source code for our paper: "Did You Ask a Good Question? A Cross-Domain Question IntentionClassification Benchmark for Text-to-SQL"

TriageSQL The dataset and source code for our paper: "Did You Ask a Good Question? A Cross-Domain Question Intention Classification Benchmark for Text

22 Nov 09, 2022

GalaXC: Graph Neural Networks with Labelwise Attention for Extreme Classification

GalaXC GalaXC: Graph Neural Networks with Labelwise Attention for Extreme Classification @InProceedings{Saini21, author = {Saini, D. and Jain,

28 Dec 05, 2022

Statsmodels: statistical modeling and econometrics in Python

About statsmodels statsmodels is a Python package that provides a complement to scipy for statistical computations including descriptive statistics an

8.1k Jan 02, 2023

🐤 Nix-TTS: An Incredibly Lightweight End-to-End Text-to-Speech Model via Non End-to-End Distillation

🐤 Nix-TTS An Incredibly Lightweight End-to-End Text-to-Speech Model via Non End-to-End Distillation Rendi Chevi, Radityo Eko Prasojo, Alham Fikri Aji

156 Jan 09, 2023

IRON Kaggle project done while doing IRONHACK Bootcamp where we had to analyze and use a Machine Learning Project to predict future sales

IRON Kaggle project done while doing IRONHACK Bootcamp where we had to analyze and use a Machine Learning Project to predict future sales. In this case, we ended up using XGBoost because it was the o

1 Jan 04, 2022

VOneNet: CNNs with a Primary Visual Cortex Front-End

Related tags

Overview

VOneNet: CNNs with a Primary Visual Cortex Front-End

Available Models

Quick Start

Longer Motivation

Requirements

Citation

License

FAQ

Setup and Run

Unzip training dataset

Unzip validation dataset

Caution!!!!

If you see any GPU related problem especially 'GPU is not available' although you already got

Comments

Releases(v0.1)

v0.1(Jun 30, 2020)

Owner

The DiCarlo Lab at MIT

Companion code for "Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees"

A Decentralized Omnidirectional Visual-Inertial-UWB State Estimation System for Aerial Swar.

Score refinement for confidence-based 3D multi-object tracking

L-Verse: Bidirectional Generation Between Image and Text

PyTorch common framework to accelerate network implementation, training and validation

Virtual Dance Reality Stage is a feature that offers you to share a stage with another user virtually.

This repository lets you interact with Lean through a REPL.

[CVPR 2021] Exemplar-Based Open-Set Panoptic Segmentation Network (EOPSN)

Implementation for "Conditional entropy minimization principle for learning domain invariant representation features"

This repository is an official implementation of the paper MOTR: End-to-End Multiple-Object Tracking with TRansformer.

A collection of papers about Transformer in the field of medical image analysis.

Official code for "Mean Shift for Self-Supervised Learning"

The dataset and source code for our paper: "Did You Ask a Good Question? A Cross-Domain Question IntentionClassification Benchmark for Text-to-SQL"

GalaXC: Graph Neural Networks with Labelwise Attention for Extreme Classification

Statsmodels: statistical modeling and econometrics in Python

🐤 Nix-TTS: An Incredibly Lightweight End-to-End Text-to-Speech Model via Non End-to-End Distillation

IRON Kaggle project done while doing IRONHACK Bootcamp where we had to analyze and use a Machine Learning Project to predict future sales

PyTorch implementation of HDN(Homography Decomposition Networks) for planar object tracking

A hyperparameter optimization framework

Benchmark datasets, data loaders, and evaluators for graph machine learning