Minimalist BERT implementation assignment for CS11-747

Last update: Jan 03, 2023

Related tags

Overview

minbert Assignment

by Zhengbao Jiang, Shuyan Zhou, and Ritam Dutt

This is an exercise in developing a minimalist version of BERT, part of Carnegie Mellon University's CS11-747: Neural Networks for NLP.

In this assignment, you will implement some important components of the BERT model to better understanding its architecture. You will then perform sentence classification on sst dataset and cfimdb dataset with the BERT model.

Assignment Details

Important Notes

Follow setup.sh to properly setup the environment and install dependencies.
There is a detailed description of the code structure in structure.md, including a description of which parts you will need to implement.
You are only allowed to use torch, no other external libraries are allowed (e.g., transformers).
We will run your code with the following commands, so make sure that whatever your best results are reproducible using these commands (where you replace ANDREWID with your andrew ID):

mkdir -p ANDREWID

python3 classifier.py --option [pretrain/finetune] --epochs NUM_EPOCHS --lr_pretrain LR_FOR_PRETRAINING --lr_finetune LR_FOR_FINETUNING --seed RANDOM_SEED

Reference accuracies:

Mean reference accuracies over 10 random seeds with their standard deviation shown in brackets.

Pretraining for SST: Dev Accuracy : 0.387 (0.008) Test Accuracy : 0.397 (0.013)

Finetuning for SST : Dev Accuracy : 0.520 (0.006) Test Accuracy : 0.525 (0.007)

Submission

The submission file should be a zip file with the following structure (assuming the andrew id is ANDREWID):

ANDREWID/
├── base_bert.py
├── bert.py
├── classifier.py
├── config.py
├── optimizer.py
├── sanity_check.py
├── tokenizer.py
├── utils.py
├── README.md
├── structure.md
├── sanity_check.data
├── sst-dev-output.txt 
├── sst-test-output.txt 
├── cfimdb-dev-output.txt 
├── cfimdb-test-output.txt 
└── setup.py

Grading

A+: You additionally implement something else on top of the requirements for A, and achieve significant accuracy improvements:
- perform continued pre-training using the MLM objective to do domain adaptation
- try alternative fine-tuning algorithms
- add other model components on top of the model
A: You implement all the missing pieces and the original classifier.py with --option finetune code that achieves comparable accuracy to our reference implementation
A-: You implement all the missing pieces and the original classifier.py with --option pretrain code that achieves comparable accuracy to our reference implementation
B+: All missing pieces are implemented and pass tests in sanity_check.py, but accuracy is not comparable to the reference.
B or below: Some parts of the missing pieces are not implemented.

Acknowledgement

Parts of the code are from the transformers library (Apache License 2.0).

Minimalist BERT implementation assignment for CS11-747

Related tags

Overview

minbert Assignment

Assignment Details

Important Notes

Reference accuracies:

Submission

Grading

Acknowledgement

Owner

Graham Neubig

Library management using python & MySQL

Shopping-card - Shopping Card Project With Python

An early stage integration of Hotwire Turbo with Django

Placeholders is a single-unit storage solution for your Frontend.

This program is meant to take the pain out of generating nice bash PS1 prompts.

Automatically give thanks to Pypi packages you use in your project!

This is a backport of the BaseExceptionGroup and ExceptionGroup classes from Python 3.11.

Different steganography methods with examples and my own small image database

A simple and usefull python calculator.

JupyterLite as a Datasette plugin

ROS Foxy + Raspi + Adafruit BNO055

dbt (data build tool) adapter for Oracle Autonomous Database

Demo Python project using Conda and Poetry

Given tool find related trending keywords of input keyword

Example python package with pybind11 cpp extension

Excel cell checker with python

Comprehensive OpenAPI schema generator for Django based on pydantic

The Zig programming language, packaged for PyPI

Automatic certificate unpinning for Android apps

This is a simple SV calling package for diploid assemblies.