This repository contains the implementation of the paper: Federated Distillation of Natural Language Understanding with Confident Sinkhorns

Last update: Nov 16, 2022

Overview

Federated Distillation of Natural Language Understanding with Confident Sinkhorns

This repository provides an alternative method for ensemble distillation of local models into a global model. The local models can be trained with either a cross-entropy or an optimal transport (OT) loss. We train the local (on-device) models with cross-entropy loss because OT has higher computational complexity. The global model is pretrained on a global dataset that is larger than the local datasets.
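
To make the OT loss mentioned above more concrete, here is a minimal sketch of an entropy-regularized OT (Sinkhorn) distance between two class distributions. It is not taken from this repository: the function name `sinkhorn_distance`, the parameters `reg` and `n_iters`, and the |i - j| cost matrix over three ordinal sentiment classes are all assumptions chosen for illustration.

```python
import math


def sinkhorn_distance(p, q, cost, reg=0.1, n_iters=200):
    """Entropy-regularized OT cost between two histograms p and q.

    p, q  : lists of non-negative floats, each summing to 1
    cost  : cost[i][j] is the price of moving mass from class i to class j
    reg   : entropic regularization strength (hypothetical default)
    """
    n, m = len(p), len(q)
    # Gibbs kernel K[i][j] = exp(-cost[i][j] / reg)
    K = [[math.exp(-cost[i][j] / reg) for j in range(m)] for i in range(n)]
    u = [1.0] * n
    v = [1.0] * m
    for _ in range(n_iters):
        # Rescale rows so the transport plan's row sums match p
        u = [p[i] / sum(K[i][j] * v[j] for j in range(m)) for i in range(n)]
        # Rescale columns so the plan's column sums match q
        v = [q[j] / sum(K[i][j] * u[i] for i in range(n)) for j in range(m)]
    # Transport plan and its total cost
    plan = [[u[i] * K[i][j] * v[j] for j in range(m)] for i in range(n)]
    return sum(plan[i][j] * cost[i][j] for i in range(n) for j in range(m))


# Assumed setup: three ordinal classes (negative, neutral, positive)
cost = [[abs(i - j) for j in range(3)] for i in range(3)]
global_pred = [0.7, 0.2, 0.1]
close_local = [0.6, 0.3, 0.1]
far_local = [0.1, 0.2, 0.7]
d_close = sinkhorn_distance(global_pred, close_local, cost)
d_far = sinkhorn_distance(global_pred, far_local, cost)
```

Unlike cross-entropy, a loss of this kind takes the cost matrix into account, so confusing negative with positive is penalized more than confusing negative with neutral. The iterative rescaling loop is also where the extra computational cost comes from.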

How to run?

For the sentiment task, work in the Sentiment directory.

Within the dataset directory:
- Follow the folder-specific readme to download and preprocess the datasets.

Within the src directory:
- To pretrain the local models: run the local-model scripts listed in the bash.sh file under the comment line #train local models.

- To pretrain the global model: run the global-model script listed in the bash.sh file under the comment line #pretrain global model.

- To create noisy labels: run the script listed in the bash.sh file under the comment line #create noisy labels from local models on transfer set.

- To estimate the bias of the pretrained local and global models: run the script listed in the bash.sh file under the comment line #distribution bias.

- To distil knowledge from the pretrained local and global models: run the script listed in the bash.sh file under the comment line #distill knowledge.
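
As a rough illustration of the noisy-label step above, the sketch below combines the class probabilities that several local models predict for one transfer-set example into a single soft label. It is an assumption about what such a step could look like, not the script in bash.sh: the function name `ensemble_soft_labels` and the optional per-model `weights` are hypothetical.

```python
def ensemble_soft_labels(local_probs, weights=None):
    """Combine per-model class probabilities for one transfer-set example.

    local_probs : list of probability lists, one per local model
    weights     : optional per-model weights (hypothetical, e.g. a
                  confidence score); defaults to a plain average
    """
    n_models = len(local_probs)
    if weights is None:
        weights = [1.0] * n_models
    total = sum(weights)
    n_classes = len(local_probs[0])
    # Weighted average of the models' probabilities for each class
    return [
        sum(w * probs[k] for w, probs in zip(weights, local_probs)) / total
        for k in range(n_classes)
    ]


# Two hypothetical local models scoring one example over two classes
uniform = ensemble_soft_labels([[0.8, 0.2], [0.4, 0.6]])
weighted = ensemble_soft_labels([[0.8, 0.2], [0.4, 0.6]], weights=[3.0, 1.0])
```

The resulting soft labels are noisy because each local model has seen only its own data; the global model can then be distilled against them.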

Citation

Please cite our paper if you find this repository useful. The latest version is available here.

@article{bhardwaj2021federated,
  title={Federated Distillation of Natural Language Understanding with Confident Sinkhorns},
  author={Bhardwaj, Rishabh and Vaidya, Tushar and Poria, Soujanya},
  journal={arXiv preprint arXiv:2110.02432},
  year={2021}
}

Contact

If you have any questions, please feel free to contact [email protected].

Owner

Deep Cognition and Language Research (DeCLaRe) Lab
