RaceBERT -- A transformer based model to predict race and ethnicty from names

Last update: Nov 02, 2022

Related tags

Overview

RaceBERT -- A transformer based model to predict race and ethnicty from names

Installation

pip install racebert

Using a virtual environment is highly recommended! You may need to install pytorch as instructed here: https://pytorch.org/get-started/locally/

Paper

Todo

Usage

raceBERT predicts race (U.S census race) and ethnicity from names.

from racebert import RaceBERT

model = RaceBERT()

# To predict race
model.predict_race("Barack Obama")

>>> {"label": "nh_black", "score": 0.5196923613548279}

The race categories are:

Race	Label
Non-hispanic White	nh_white
Hispanic	hispanic
Non-hispanic Black	nh_black
Asian & Pacific Islander	api
American Indian & Alaskan Native	aian

# Predict ethnicity
model.predict_ethnicty("Arjun Gupta")

>>> {"label": "Asian,IndianSubContinent", "score": 0.9612812399864197}

The ethnicity categories are:

Ethnicity
GreaterEuropean,British
GreaterEuropean,WestEuropean,French
GreaterEuropean,WestEuropean,Italian
GreaterEuropean,WestEuropean,Hispanic
GreaterEuropean,Jewish
GreaterEuropean,EastEuropean
Asian,IndianSubContinent
Asian,GreaterEastAsian,Japanese
GreaterAfrican,Muslim
Asian,GreaterEastAsian,EastAsian
GreaterEuropean,WestEuropean,Nordic
GreaterEuropean,WestEuropean,Germanic
GreaterAfrican,Africans

GPU

If you have a GPU, you can speed up the computation by specifying the CUDA device when you instantiate the model.

from racebert import RaceBERT

model = RaceBERT(device=0)

# predict race in batch
model.predict_race(["Barack Obama", "George Bush"])

>>>
[
        {"label": "nh_black", "score": 0.5196923613548279},
        {"label": "nh_white", "score": 0.8365859389305115}
]

# predict ethnicity in batch
model.predict_ethnicity(["Barack Obama", "George Bush"])

HuggingFace

Alternatively, you can work with the transformers models hosted on the huggingface hub directly.

Race Model: https://huggingface.co/pparasurama/raceBERT
Ethnicity Model: https://huggingface.co/pparasurama/raceBERT-ethnicity

Please refer to the transformers documentation.

RaceBERT -- A transformer based model to predict race and ethnicty from names

Related tags

Overview

RaceBERT -- A transformer based model to predict race and ethnicty from names

Installation

Paper

Usage

GPU

HuggingFace

Owner

Prasanna Parasurama

This is a repository for a semantic segmentation inference API using the OpenVINO toolkit

EqGAN - Improving GAN Equilibrium by Raising Spatial Awareness

Hyperbolic Hierarchical Clustering.

Hcpy - Interface with Home Connect appliances in Python

PyElecCL - Electron Monte Carlo Second Checks

SegNet model implemented using keras framework

Satellite labelling tool for manual labelling of storm top features such as overshooting tops, above-anvil plumes, cold U/Vs, rings etc.

Guided Internet-delivered Cognitive Behavioral Therapy Adherence Forecasting

E2EDNA2 - An automated pipeline for simulation of DNA aptamers complexed with small molecules and short peptides

ReferFormer - Official Implementation of ReferFormer

Can we visualize a large scientific data set with a surrogate model? We're building a GAN for the Earth's Mantle Convection data set to see if we can!

This repository implements WGAN_GP.

CoSMA: Convolutional Semi-Regular Mesh Autoencoder. From Paper "Mesh Convolutional Autoencoder for Semi-Regular Meshes of Different Sizes"

Benchmarking the robustness of Spatial-Temporal Models

Official Implementation of VAT

Sparse R-CNN: End-to-End Object Detection with Learnable Proposals, CVPR2021

Referring Video Object Segmentation

[ICCV 2021 (oral)] Planar Surface Reconstruction from Sparse Views

Back to the Feature: Learning Robust Camera Localization from Pixels to Pose (CVPR 2021)

Recurrent Neural Network Tutorial, Part 2 - Implementing a RNN in Python and Theano