Adaptive Multi-Teacher Multi-level Knowledge Distillation (AMTML-KD)

The paper has been accepted by Neurocomputing 415 (2020): 106–113.

Authors: Yuang Liu, Wei Zhang and Jun Wang.

Links: [ pdf ] [ code ]

Requirements

PyTorch >= 1.0.0
Jupyter
visdom

Introduction

Knowledge distillation (KD) is an effective learning paradigm for improving the performance of lightweight student networks by utilizing additional supervision knowledge distilled from teacher networks. Most pioneering studies either learn from only a single teacher, neglecting the potential for a student to learn from multiple teachers simultaneously, or simply treat each teacher as equally important, and so cannot reveal the different importance of teachers for specific examples. To bridge this gap, we propose a novel adaptive multi-teacher multi-level knowledge distillation learning framework (AMTML-KD), which consists of two novel insights: (i) associating each teacher with a latent representation to adaptively learn instance-level teacher importance weights, which are leveraged to acquire integrated soft-targets (high-level knowledge), and (ii) enabling intermediate-level hints (intermediate-level knowledge) to be gathered from multiple teachers through the proposed multi-group hint strategy. As such, a student model can learn multi-level knowledge from multiple teachers through AMTML-KD. Extensive results on publicly available datasets demonstrate that the proposed learning framework enables the student to achieve better performance than strong competitors.
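
To make insight (i) concrete, here is a minimal sketch of how instance-level teacher weights might be turned into an integrated soft-target. It is not the repository's implementation. It is written in dependency-free Python for illustration, and all function names, the dot-product scoring rule, the temperature value, and the toy numbers are assumptions rather than details taken from the paper or the code.

```python
import math


def softmax(scores, temperature=1.0):
    """Numerically stable softmax over a list of floats."""
    scaled = [s / temperature for s in scores]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def teacher_weights(teacher_latents, instance_repr):
    """Instance-level importance weights, one per teacher.

    Assumption: each teacher's latent vector is scored against the
    instance representation with a dot product, and the scores are
    normalised across teachers with a softmax.
    """
    scores = [sum(a * b for a, b in zip(latent, instance_repr))
              for latent in teacher_latents]
    return softmax(scores)


def integrated_soft_target(teacher_logits, weights, temperature=4.0):
    """Weighted average of the teachers' temperature-softened outputs."""
    soft = [softmax(logits, temperature) for logits in teacher_logits]
    num_classes = len(soft[0])
    return [sum(w * p[c] for w, p in zip(weights, soft))
            for c in range(num_classes)]


def distillation_loss(student_logits, target, temperature=4.0):
    """Cross-entropy between the integrated target and the softened student output."""
    student = softmax(student_logits, temperature)
    return -sum(t * math.log(q) for t, q in zip(target, student))


# Toy example (hypothetical values): three teachers, four classes, one instance.
teacher_latents = [[0.5, -0.2], [0.1, 0.9], [-0.3, 0.4]]
instance_repr = [1.0, 2.0]
teacher_logits = [[2.0, 0.5, 0.1, -1.0],
                  [1.5, 1.0, 0.0, -0.5],
                  [0.2, 2.5, 0.3, 0.0]]
student_logits = [1.0, 1.2, 0.1, -0.3]

weights = teacher_weights(teacher_latents, instance_repr)
target = integrated_soft_target(teacher_logits, weights)
loss = distillation_loss(student_logits, target)
```

In an actual training loop the latent vectors would be learnable parameters updated together with the student, so the weights can differ from one input to the next. Here they are fixed lists only to keep the sketch self-contained.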

Citation

@article{LIU2020106,
    title = {Adaptive multi-teacher multi-level knowledge distillation},
    author = {Yuang Liu and Wei Zhang and Jun Wang},
    journal = {Neurocomputing},
    volume = {415},
    pages = {106--113},
    year = {2020},
    issn = {0925-2312},
}


Owner

Frank Liu
