[PNAS2021] The neural architecture of language: Integrative modeling converges on predictive processing

Last update: Dec 01, 2022

Overview

The neural architecture of language: Integrative modeling converges on predictive processing

Code accompanying the paper The neural architecture of language: Integrative modeling converges on predictive processing by Schrimpf, Blank, Tuckute, Kauf, Hosseini, Kanwisher, Tenenbaum, and Fedorenko.

Large-scale evaluation of neural network language models as predictive models of human language processing. This pipeline compares dozens of state-of-the-art models and 4 human datasets (3 neural, 1 behavioral). It builds on the Brain-Score framework and can easily be extended with new models and datasets.

Installation

git clone https://github.com/mschrimpf/neural-nlp.git
cd neural-nlp
pip install -e .

You might have to install nltk by hand / with conda.

Run

To score gpt2-xl on the Blank2014fROI-encoding benchmark:

python neural_nlp run --model gpt2-xl --benchmark Blank2014fROI-encoding --log_level DEBUG

Other available benchmarks are e.g. Pereira2018-encoding (takes a while to compute), and Fedorenko2016v3-encoding.

You can also specify different models to run -- note that some of them require additional download of weights (run ressources/setup.sh for automated download).

Data

When running a model on a benchmark, the data will automatically be downloaded from S3 (e.g. https://github.com/mschrimpf/neural-nlp/blob/master/neural_nlp/benchmarks/neural.py#L361 for the Pereira2018 benchmark). Costly ceiling estimates have also been precomputed and will be downloaded since they can take days to compute.

Precomputed scores

Scores for models run on the neural, behavioral, and computational-task benchmarks are also available, see the precomputed-scores.csv file. You can re-create the figures in the paper using the analyze scripts.

Citation

If you use this work, please cite

@article{Schrimpf2021,
	author = {Schrimpf, Martin and Blank, Idan and Tuckute, Greta and Kauf, Carina and Hosseini, Eghbal A. and Kanwisher, Nancy and Tenenbaum, Joshua and Fedorenko, Evelina},
	title = {The neural architecture of language: Integrative modeling converges on predictive processing},
	year = {2021},
	journal = {Proceedings of the National Academy of Sciences},
	url = {https://www.pnas.org/content/118/45/e2105646118}
}

[PNAS2021] The neural architecture of language: Integrative modeling converges on predictive processing

Related tags

Overview

The neural architecture of language: Integrative modeling converges on predictive processing

Installation

Run

Data

Precomputed scores

Citation

Owner

Martin Schrimpf

Compares various time-series feature sets on computational performance, within-set structure, and between-set relationships.

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

Barlow Twins and HSIC

Meli Data Challenge 2021 - First Place Solution

Code for DeepXML: A Deep Extreme Multi-Label Learning Framework Applied to Short Text Documents

TigerLily: Finding drug interactions in silico with the Graph.

Jittor 64*64 implementation of StyleGAN

Deep learning models for classification of 15 common weeds in the southern U.S. cotton production systems.

Code Release for ICCV 2021 (oral), "AdaFit: Rethinking Learning-based Normal Estimation on Point Clouds"

Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CVPR 2021)

[ACM MM2021] MGH: Metadata Guided Hypergraph Modeling for Unsupervised Person Re-identification

Official implementation for "Low-light Image Enhancement via Breaking Down the Darkness"

Demo for the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"

LTR_CrossEncoder: Legal Text Retrieval Zalo AI Challenge 2021

This is the official implementation for the paper "Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization" in NeurIPS 2021.

Educational API for 3D Vision using pose to control carton.

Official repository of the paper 'Essentials for Class Incremental Learning'

A GOOD REPRESENTATION DETECTS NOISY LABELS

Code for the CIKM 2019 paper "DSANet: Dual Self-Attention Network for Multivariate Time Series Forecasting".

Examples of how to create colorful, annotated equations in Latex using Tikz.