RaceBERT -- A transformer-based model to predict race and ethnicity from names

Last update: Nov 02, 2022

Installation

pip install racebert

Using a virtual environment is highly recommended. You may need to install PyTorch first, as described here: https://pytorch.org/get-started/locally/

Paper

Todo

Usage

RaceBERT predicts race (using the U.S. Census race categories) and ethnicity from names.

from racebert import RaceBERT

model = RaceBERT()

# To predict race
model.predict_race("Barack Obama")

>>> {"label": "nh_black", "score": 0.5196923613548279}

The race categories are:

Race	Label
Non-Hispanic White	nh_white
Hispanic	hispanic
Non-Hispanic Black	nh_black
Asian & Pacific Islander	api
American Indian & Alaskan Native	aian
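When reporting results, it can help to translate the short labels back into the category names from the table above. The following is a minimal sketch; `RACE_LABELS` and `describe_race` are hypothetical helpers written for this illustration and are not part of the racebert package.

```python
# Hypothetical lookup table built from the race categories listed above.
RACE_LABELS = {
    "nh_white": "Non-Hispanic White",
    "hispanic": "Hispanic",
    "nh_black": "Non-Hispanic Black",
    "api": "Asian & Pacific Islander",
    "aian": "American Indian & Alaskan Native",
}


def describe_race(prediction):
    """Return the readable category name for a prediction dict.

    Falls back to the raw label if it is not in the table.
    """
    label = prediction["label"]
    return RACE_LABELS.get(label, label)


# Uses the example output shown earlier for "Barack Obama".
print(describe_race({"label": "nh_black", "score": 0.5196923613548279}))
```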

# Predict ethnicity
model.predict_ethnicity("Arjun Gupta")

>>> {"label": "Asian,IndianSubContinent", "score": 0.9612812399864197}

The ethnicity categories are:

Ethnicity
GreaterEuropean,British
GreaterEuropean,WestEuropean,French
GreaterEuropean,WestEuropean,Italian
GreaterEuropean,WestEuropean,Hispanic
GreaterEuropean,Jewish
GreaterEuropean,EastEuropean
Asian,IndianSubContinent
Asian,GreaterEastAsian,Japanese
GreaterAfrican,Muslim
Asian,GreaterEastAsian,EastAsian
GreaterEuropean,WestEuropean,Nordic
GreaterEuropean,WestEuropean,Germanic
GreaterAfrican,Africans
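The ethnicity labels above read as comma-separated paths from a broad group to a more specific one. Assuming that reading is correct, a label can be split into its levels as sketched below; `split_ethnicity` is a hypothetical helper, not a racebert function.

```python
def split_ethnicity(label):
    """Split a comma-separated ethnicity label into its levels.

    Assumes the label is ordered from the broadest group to the most
    specific one, as the category list suggests.
    """
    parts = label.split(",")
    return {"group": parts[0], "most_specific": parts[-1], "path": parts}


info = split_ethnicity("GreaterEuropean,WestEuropean,Nordic")
print(info["group"], info["most_specific"])
```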

GPU

If you have a GPU, you can speed up the computation by specifying the CUDA device when you instantiate the model.

from racebert import RaceBERT

model = RaceBERT(device=0)

# predict race in batch
model.predict_race(["Barack Obama", "George Bush"])

>>>
[
        {"label": "nh_black", "score": 0.5196923613548279},
        {"label": "nh_white", "score": 0.8365859389305115}
]

# predict ethnicity in batch
model.predict_ethnicity(["Barack Obama", "George Bush"])
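Batch results come back as a list of dicts, and some scores (such as 0.52 in the example above) indicate a fairly uncertain prediction. A small sketch of pairing names with their predictions and dropping low-scoring ones follows. `pair_predictions` and the `min_score` threshold are hypothetical, and the code assumes the output list is in the same order as the input names, as the example suggests.

```python
def pair_predictions(names, predictions, min_score=0.0):
    """Pair each name with its prediction, keeping scores >= min_score."""
    if len(names) != len(predictions):
        raise ValueError("names and predictions must have the same length")
    return [
        (name, pred["label"], pred["score"])
        for name, pred in zip(names, predictions)
        if pred["score"] >= min_score
    ]


# Values copied from the batch example above.
names = ["Barack Obama", "George Bush"]
predictions = [
    {"label": "nh_black", "score": 0.5196923613548279},
    {"label": "nh_white", "score": 0.8365859389305115},
]
print(pair_predictions(names, predictions, min_score=0.6))
```

With the 0.6 threshold chosen here for illustration, only the "George Bush" prediction is kept.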

HuggingFace

Alternatively, you can work directly with the transformers models hosted on the Hugging Face Hub.

Race Model: https://huggingface.co/pparasurama/raceBERT
Ethnicity Model: https://huggingface.co/pparasurama/raceBERT-ethnicity

Please refer to the transformers documentation.
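As a rough sketch, the hosted models could be loaded with the generic transformers text-classification pipeline. This is an assumption about how the checkpoints are meant to be used, not a documented recipe: the racebert package may normalize names before classification, so raw strings passed this way may not reproduce its results, and the label strings returned depend on the model configuration. `load_classifier` and `classify` are hypothetical helper names.

```python
def load_classifier(model_id="pparasurama/raceBERT", device=-1):
    """Build a text-classification pipeline for one of the hosted models.

    device=-1 runs on CPU; a non-negative integer selects a CUDA device.
    transformers is imported lazily so this module loads without it.
    """
    from transformers import pipeline

    return pipeline("text-classification", model=model_id, device=device)


def classify(names, classifier):
    """Run a classifier callable over a list of names.

    Assumption: names are passed as-is, with no preprocessing.
    """
    return classifier(list(names))


if __name__ == "__main__":
    # Downloads the model from the Hub on first use.
    race_classifier = load_classifier()
    print(classify(["Barack Obama", "George Bush"], race_classifier))
```

Passing `"pparasurama/raceBERT-ethnicity"` as `model_id` would load the ethnicity model in the same way.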

Owner

Prasanna Parasurama
