Codebase for "ProtoAttend: Attention-Based Prototypical Learning."

Authors: Sercan O. Arik and Tomas Pfister

Paper: Sercan O. Arik and Tomas Pfister, "ProtoAttend: Attention-Based Prototypical Learning." Link: https://arxiv.org/abs/1902.06292

We propose a novel, inherently interpretable machine learning method that bases decisions on a few relevant examples that we call prototypes. Our method, ProtoAttend, can be integrated into a wide range of neural network architectures, including pre-trained models. It uses an attention mechanism that relates the encoded representations to samples in order to determine prototypes. The resulting model outperforms the state of the art on three high-impact problems without sacrificing the accuracy of the original model: (1) it enables high-quality interpretability by outputting the samples most relevant to the decision (i.e., a sample-based interpretability method); (2) it achieves state-of-the-art confidence estimation by quantifying the mismatch across prototype labels; and (3) it achieves state-of-the-art distribution mismatch detection. All of this comes with minimal additional test time and a practically viable computational cost at training time.
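To make the idea of attention over candidate samples concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the code in this repository: it uses a softmax over scaled dot products as a stand-in for the attention mechanism, and the function names, toy encodings, and labels are hypothetical. Each candidate sample receives a weight, the prediction is the label with the largest total weight, and that total is used as a rough confidence score reflecting how much the prototype labels agree.

```python
import math


def attention_weights(query, keys):
    """Softmax over scaled dot products between one query and each key.

    Assumption: plain softmax attention, used here only for illustration.
    """
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def predict_with_prototypes(query, keys, labels):
    """Return (predicted label, confidence, per-candidate weights)."""
    weights = attention_weights(query, keys)
    # Accumulate attention mass per label; agreement among highly weighted
    # candidates yields a high confidence, disagreement a low one.
    class_mass = {}
    for weight, label in zip(weights, labels):
        class_mass[label] = class_mass.get(label, 0.0) + weight
    predicted = max(class_mass, key=class_mass.get)
    return predicted, class_mass[predicted], weights


# Hypothetical 2-D encodings of three candidate training samples.
candidate_encodings = [[2.0, 0.0], [1.8, 0.2], [0.0, 2.0]]
candidate_labels = ["sneaker", "sneaker", "bag"]

label, confidence, weights = predict_with_prototypes(
    [2.0, 0.1], candidate_encodings, candidate_labels
)
print(label, round(confidence, 3), [round(w, 3) for w in weights])
```

The candidates with the largest weights play the role of prototypes: they can be shown to the user as the training samples most relevant to the decision.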

This codebase exemplifies the ProtoAttend training and evaluation pipeline for the Fashion-MNIST dataset, using ResNet as the image encoder model.

To run the training pipeline, run python3 main_protoattend.py. The results and visualizations are written to TensorBoard.

To adapt the experiment to other datasets and models:

Implement data batching and preprocessing functions (modify input_data.py and data iterators like iter_train etc.).
Integrate the encoder model function suitable for the data type (modify cnn_encoder in model.py).
Reoptimize the learning hyperparameters for the new dataset.
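
As a rough illustration of the first step, the sketch below shows one way a batching iterator could look. It is a hypothetical helper written in plain Python, not the interface of input_data.py or iter_train: the name iter_batches and its parameters are assumptions made for this example, and a real replacement would need to match what the training script expects.

```python
import random


def iter_batches(features, labels, batch_size, shuffle=True, seed=0):
    """Yield (feature_batch, label_batch) pairs covering the dataset once.

    Hypothetical helper: the name and signature are illustrative only.
    """
    if len(features) != len(labels):
        raise ValueError("features and labels must have the same length")
    order = list(range(len(features)))
    if shuffle:
        # A seeded generator keeps the shuffling reproducible across runs.
        random.Random(seed).shuffle(order)
    for start in range(0, len(order), batch_size):
        indices = order[start:start + batch_size]
        yield [features[i] for i in indices], [labels[i] for i in indices]


# Toy data: ten "samples" whose label equals the sample index modulo 2.
toy_features = list(range(10))
toy_labels = [x % 2 for x in toy_features]

for feature_batch, label_batch in iter_batches(toy_features, toy_labels, 4):
    print(feature_batch, label_batch)
```

Preprocessing specific to the new data type (for example, normalization or resizing) would be applied to each batch before it is passed to the encoder.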
