The implementation for "Comprehensive Knowledge Distillation with Causal Intervention".

Last update: Nov 03, 2022

Overview

Comprehensive Knowledge Distillation with Causal Intervention

This repository is a PyTorch implementation of "Comprehensive Knowledge Distillation with Causal Intervention", the method referred to below as CID. The code is adapted from CRD, and the pretrained teachers (except WRN-40-4) are also taken from CRD.

Requirements

The code was tested with:

Python 3.6
torch 1.2.0
torchvision 0.4.0
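
As a convenience, the short Python sketch below compares the running environment with the versions listed above. It is not part of the repository: the helper names `installed_version` and `report_environment` are hypothetical, and the sketch assumes that each package exposes a `__version__` attribute, which torch and torchvision do.

```python
import importlib
import sys

# Versions listed in the Requirements section above.
TESTED = {"torch": "1.2.0", "torchvision": "0.4.0"}
TESTED_PYTHON = (3, 6)


def installed_version(package):
    """Return the package's __version__ string, or None if it cannot be imported."""
    try:
        module = importlib.import_module(package)
    except ImportError:
        return None
    return getattr(module, "__version__", None)


def report_environment():
    """Return (name, tested version, found version) rows for Python and each package."""
    rows = [("python", "%d.%d" % TESTED_PYTHON, "%d.%d" % sys.version_info[:2])]
    for package, tested in TESTED.items():
        rows.append((package, tested, installed_version(package)))
    return rows


if __name__ == "__main__":
    for name, tested, found in report_environment():
        # A prefix match tolerates local build suffixes such as "+cu92".
        status = "ok" if found is not None and found.startswith(tested) else "differs"
        print("%-12s tested=%-6s found=%s (%s)" % (name, tested, found, status))
```

A "differs" line only means the environment is not the one the code was tested with; whether other versions work is not stated here.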

Evaluation

To evaluate our pretrained lightweight student networks, first download the folder "pretrained_student_model" from CID models into the "save" folder, then run the command below:

run evaluate_scripts.sh

Training

To train students from scratch by distilling knowledge from the teacher networks with CID, first download the pretrained teacher folder "models" from CID models into the "save" folder, then run the command below to compress the large models into smaller ones:

run train_scripts.sh
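
Both scripts depend on a download step, so a small pre-flight check can save a failed run. The sketch below is illustrative only and not part of the repository. It assumes that the "save" folder sits at the repository root, that the scripts are launched from there, and that they are plain shell scripts runnable with `sh`. The names `REQUIRED`, `missing_folder`, and `run_script` are hypothetical.

```python
import os
import subprocess

# Folder names come from the Evaluation and Training instructions above.
# Assumption: "save" lives at the repository root.
REQUIRED = {
    "evaluate_scripts.sh": os.path.join("save", "pretrained_student_model"),
    "train_scripts.sh": os.path.join("save", "models"),
}


def missing_folder(script, repo_root="."):
    """Return the folder the script needs if it is absent, otherwise None."""
    folder = os.path.join(repo_root, REQUIRED[script])
    return None if os.path.isdir(folder) else folder


def run_script(script, repo_root="."):
    """Run a script from the repository root after checking its download folder."""
    folder = missing_folder(script, repo_root)
    if folder is not None:
        raise FileNotFoundError("Download step not done: %s is missing" % folder)
    # Assumption: the scripts are plain shell scripts that can be run with `sh`.
    return subprocess.call(["sh", script], cwd=repo_root)
```

For example, `run_script("train_scripts.sh")` raises `FileNotFoundError` until "save/models" exists, and otherwise returns the exit code of the script.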

Citation

If you find this code helpful, you may consider citing this paper:

@inproceedings{deng2021comprehensive,
  title={Comprehensive Knowledge Distillation with Causal Intervention},
  author={Deng, Xiang and Zhang, Zhongfei},
  booktitle={Proceedings of the 35th Annual Conference on Neural Information Processing Systems},
  year={2021}
}

Owner

Xiang Deng
