
Are Convolutional Neural Networks or Transformers more like human vision?


This repository contains the code and fine-tuned models of popular Convolutional Neural Networks (CNNs) and the recently proposed Vision Transformer (ViT), fine-tuned on the augmented ImageNet dataset, along with the shape/texture bias tests run on the Stylized ImageNet dataset.
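
As a rough illustration (a minimal sketch, not the repository's own evaluation code), a shape/texture bias score on Stylized-ImageNet-style cue-conflict images is typically computed as the fraction of shape decisions among the trials where the model predicts either the image's shape class or its texture class:

```python
def shape_bias(predictions, shape_labels, texture_labels):
    """Fraction of shape-based decisions on cue-conflict images.

    predictions, shape_labels, texture_labels: equal-length sequences of
    class ids for each cue-conflict stimulus (hypothetical inputs).
    """
    shape_hits = texture_hits = 0
    for pred, shape, texture in zip(predictions, shape_labels, texture_labels):
        if pred == shape:
            shape_hits += 1
        elif pred == texture:
            texture_hits += 1
    decided = shape_hits + texture_hits
    # Only trials decided by either cue count toward the bias score.
    return shape_hits / decided if decided else float("nan")
```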

This work compares CNNs and the ViT against human behavior in terms of error consistency, going beyond traditional metrics. Through these tests, we show that the recently proposed self-attention-based Transformer models make more human-like errors than traditional CNNs.
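
For reference, error consistency (Geirhos et al., 2020) is Cohen's kappa computed on trial-by-trial correctness of two observers (model or human). A minimal sketch, assuming boolean per-trial correctness vectors and not reflecting the repository's exact implementation:

```python
import numpy as np

def error_consistency(correct_a, correct_b):
    """Cohen's kappa on trial-by-trial correctness of two observers."""
    a = np.asarray(correct_a, dtype=bool)
    b = np.asarray(correct_b, dtype=bool)
    p_a, p_b = a.mean(), b.mean()
    # Observed agreement: both correct or both wrong on the same trial.
    c_obs = np.mean(a == b)
    # Agreement expected by chance for independent observers with these accuracies.
    c_exp = p_a * p_b + (1 - p_a) * (1 - p_b)
    if c_exp == 1.0:
        return float("nan")  # both observers perfectly (in)accurate
    return (c_obs - c_exp) / (1 - c_exp)
```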


Colab

You can run tests on the results directly in Google Colaboratory, without installing anything on your local machine. Click "Open in Colab" below:

Open In Colab

Developer

Shikhar Tuli. For any questions, comments, or suggestions, please reach out to me at stuli@princeton.edu.

Cite this work

If you use our experimental results or fine-tuned models, please cite:

@article{tuli2021cogsci,
      title={Are Convolutional Neural Networks or Transformers more like human vision?}, 
      author={Shikhar Tuli and Ishita Dasgupta and Erin Grant and Thomas L. Griffiths},
      year={2021},
      eprint={2105.07197},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

About

[CogSci'21] Study of human inductive biases in CNNs and Transformers.
