[PNAS2021] The neural architecture of language: Integrative modeling converges on predictive processing

Last update: Dec 01, 2022

Overview

The neural architecture of language: Integrative modeling converges on predictive processing

Code accompanying the paper The neural architecture of language: Integrative modeling converges on predictive processing by Schrimpf, Blank, Tuckute, Kauf, Hosseini, Kanwisher, Tenenbaum, and Fedorenko.

Large-scale evaluation of neural network language models as predictive models of human language processing. This pipeline compares dozens of state-of-the-art models and 4 human datasets (3 neural, 1 behavioral). It builds on the Brain-Score framework and can easily be extended with new models and datasets.

Installation

git clone https://github.com/mschrimpf/neural-nlp.git
cd neural-nlp
pip install -e .

You might have to install nltk by hand / with conda.

Run

To score gpt2-xl on the Blank2014fROI-encoding benchmark:

python neural_nlp run --model gpt2-xl --benchmark Blank2014fROI-encoding --log_level DEBUG

Other available benchmarks are e.g. Pereira2018-encoding (takes a while to compute), and Fedorenko2016v3-encoding.

You can also specify different models to run -- note that some of them require additional download of weights (run ressources/setup.sh for automated download).

Data

When running a model on a benchmark, the data will automatically be downloaded from S3 (e.g. https://github.com/mschrimpf/neural-nlp/blob/master/neural_nlp/benchmarks/neural.py#L361 for the Pereira2018 benchmark). Costly ceiling estimates have also been precomputed and will be downloaded since they can take days to compute.

Precomputed scores

Scores for models run on the neural, behavioral, and computational-task benchmarks are also available, see the precomputed-scores.csv file. You can re-create the figures in the paper using the analyze scripts.

Citation

If you use this work, please cite

@article{Schrimpf2021,
	author = {Schrimpf, Martin and Blank, Idan and Tuckute, Greta and Kauf, Carina and Hosseini, Eghbal A. and Kanwisher, Nancy and Tenenbaum, Joshua and Fedorenko, Evelina},
	title = {The neural architecture of language: Integrative modeling converges on predictive processing},
	year = {2021},
	journal = {Proceedings of the National Academy of Sciences},
	url = {https://www.pnas.org/content/118/45/e2105646118}
}

[PNAS2021] The neural architecture of language: Integrative modeling converges on predictive processing

Related tags

Overview

The neural architecture of language: Integrative modeling converges on predictive processing

Installation

Run

Data

Precomputed scores

Citation

Owner

Martin Schrimpf

Vision-Language Transformer and Query Generation for Referring Segmentation (ICCV 2021)

OverFeat is a Convolutional Network-based image classifier and feature extractor.

ICSS - Interactive Continual Semantic Segmentation

ICRA 2021 - Robust Place Recognition using an Imaging Lidar

DeFMO: Deblurring and Shape Recovery of Fast Moving Objects (CVPR 2021)

PyTorch implementation of the paper Ultra Fast Structure-aware Deep Lane Detection

Flexible time series feature extraction & processing

Ladder Variational Autoencoders (LVAE) in PyTorch

SwinTrack: A Simple and Strong Baseline for Transformer Tracking

MPViT:Multi-Path Vision Transformer for Dense Prediction

Diverse Branch Block: Building a Convolution as an Inception-like Unit

A developer interface for creating Chat AIs for the Chai app.

Web-interface + rest API for classification and regression (https://jeff1evesque.github.io/machine-learning.docs)

PyTorch implementation of the Crafting Better Contrastive Views for Siamese Representation Learning

A script written in Python that returns a consensus string and profile matrix of a given DNA string(s) in FASTA format.

rliable is an open-source Python library for reliable evaluation, even with a handful of runs, on reinforcement learning and machine learnings benchmarks.

CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images

Artifacts for paper "MMO: Meta Multi-Objectivization for Software Configuration Tuning"

Convolutional neural network web app trained to track our infant’s sleep schedule using our Google Nest camera.

Compute descriptors for 3D point cloud registration using a multi scale sparse voxel architecture