The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color

Overview

Code and dataset for The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color.

This repository is roughly split into two parts:

  • probing: The probing implementations, including code for generating CoDa.
  • mturk-survey: Instruction pages and materials used for crowdsourcing annotations.

How to use

Using CoDa

If you'd like to use CoDa, we highly recommend using the version hosted on the Huggingface Hub as it requires no additional dependencies.

from datasets import load_dataset

ds = load_dataset('corypaik/coda')

You can find more details about how to use Huggingface Datasets in the official documentation.
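
As a quick sanity check after loading, you can inspect the splits and a sample row. The snippet below is a minimal sketch: it does not assume specific split or column names, since those depend on CoDa's actual schema, which the printed output will show.

from datasets import load_dataset

# Load CoDa from the Huggingface Hub.
ds = load_dataset('corypaik/coda')

# Show the available splits and their features.
print(ds)

# Print one example from the first available split (split names may vary).
first_split = list(ds.keys())[0]
print(ds[first_split][0])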

Running experiments

This repository is developed and tested on Linux systems and uses Bazel. If you are on another platform, you might consider running Bazel in a Docker container. If you'd like more guidance on this, please open an issue on GitHub.

First, clone the project:

# clone project
git clone https://github.com/nala-cub/coda

# goto project
cd coda

You can run specific tasks as follows:

# run zeroshot
bazel run //projects/coda/probing/zeroshot
# representation probing
bazel run //projects/coda/probing/representations
# ngrams
bazel run //projects/coda/probing/ngram_stats
# generate dataset from annotations (relative to workspace root)
bazel run //projects/coda/probing/dataset:create_dataset -- \
  --coda_ds_export_dir=<export_dir>

To see help for any of the commands, use:

bazel run <target> -- --help
# for example:
# bazel run //projects/coda/probing/zeroshot -- --help

Annotation Instructions

Annotations were collected using an Angular app on Firebase. The included files contain all instructions, but not the app itself. If you're interested in the latter, please open an issue on GitHub.

Citation

If this code was useful, please cite the paper:

@misc{paik2021world,
      title={The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color},
      author={Cory Paik and Stéphane Aroca-Ouellette and Alessandro Roncone and Katharina Kann},
      year={2021},
      eprint={2110.08182},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

License

CoDa is licensed under the Apache 2.0 license. The full text of the license is included in the repository.
