Official implementation of PICASO: Permutation-Invariant Cascaded Attentional Set Operator

Last update: Dec 23, 2021

Overview

PICASO

Official PyTorch implementation of the paper "PICASO: Permutation-Invariant Cascaded Attentional Set Operator".

Requirements

Python 3
torch >= 1.0
numpy
matplotlib
scipy
tqdm

Abstract

Set-input deep networks have recently drawn much interest in computer vision and machine learning. This is in part due to the increasing number of important tasks such as meta-learning, clustering, and anomaly detection that are defined on set inputs. These networks must take an arbitrary number of input samples and produce an output that is invariant to permutations of the input set. Several algorithms have been recently developed to address this need. Our paper analyzes these algorithms using both synthetic and real-world datasets, and shows that they are not effective in dealing with common data variations such as image translation or viewpoint change. To address this limitation, we propose a permutation-invariant cascaded attentional set operator (PICASO). The gist of PICASO is a cascade of multihead attention blocks with dynamic templates. The proposed operator is a stand-alone module that can be adapted and extended to serve different machine learning tasks. We demonstrate the utility of PICASO in four diverse scenarios: (i) clustering, (ii) image classification under novel viewpoints, (iii) image anomaly detection, and (iv) state prediction. PICASO increases the SmallNORB image classification accuracy with novel viewpoints by about 10 percentage points. For set anomaly detection on the CelebA dataset, our model improves the areas under the ROC and PR curves by about 22% and 10%, respectively. For state prediction on the CLEVR dataset, it improves the AP by about 40%.
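
To make the two ideas in the abstract concrete (permutation invariance and a cascade of attention blocks whose template changes with the input), here is a minimal pure-Python sketch. It is not the code in this repository: it uses a single attention head, no learned projections, no layer normalization, and a plain residual update, and the names attention_pool and cascaded_pool are hypothetical. Reading "dynamic templates" as "the output of one block becomes the template of the next" is an assumption made for illustration.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(template, elements):
    """Pool a set of vectors into one vector with dot-product attention.

    The template acts as the query; each set element is both key and value.
    """
    dim = len(template)
    scale = math.sqrt(dim)
    scores = [sum(t * x for t, x in zip(template, elem)) / scale for elem in elements]
    weights = softmax(scores)
    return [sum(w * elem[d] for w, elem in zip(weights, elements)) for d in range(dim)]


def cascaded_pool(seed_template, elements, num_blocks=3):
    """Apply attention pooling repeatedly, feeding each result into the next template."""
    template = list(seed_template)
    for _ in range(num_blocks):
        pooled = attention_pool(template, elements)
        # Residual update (an assumption): the next template depends on the input set.
        template = [t + p for t, p in zip(template, pooled)]
    return template


rng = random.Random(0)
points = [[rng.gauss(0.0, 1.0) for _ in range(4)] for _ in range(6)]
seed = [0.1, -0.2, 0.3, 0.0]

out = cascaded_pool(seed, points)

# Reorder the set and pool again; the result should match up to rounding error.
shuffled = points[:]
rng.shuffle(shuffled)
out_shuffled = cascaded_pool(seed, shuffled)

print(all(math.isclose(a, b, abs_tol=1e-9) for a, b in zip(out, out_shuffled)))
```

Each block only ever sums over the set elements, so reordering the set cannot change the result beyond floating-point rounding, and stacking blocks preserves that property.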

Experiments

This repository implements the amortized clustering, classification, set anomaly detection, and state prediction experiments in the paper.

Amortized Clustering

Use run.py to run the experiment. To shift the data domain, edit mvn_diag.py and add a shift value to X.
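
As a rough illustration of what shifting the data domain means, the sketch below draws points from a Gaussian with diagonal covariance and translates them by a constant offset. The function names sample_diag_gaussian and shift_points are hypothetical and do not reflect the actual contents of mvn_diag.py; a scalar shift applied to every coordinate is also an assumption.

```python
import random


def sample_diag_gaussian(mean, std, n, rng):
    """Draw n points from a Gaussian with a diagonal covariance."""
    return [[rng.gauss(m, s) for m, s in zip(mean, std)] for _ in range(n)]


def shift_points(X, shift):
    """Translate every point in X by the same offset along each dimension."""
    return [[x + shift for x in point] for point in X]


rng = random.Random(0)
X = sample_diag_gaussian(mean=[0.0, 0.0], std=[1.0, 0.5], n=5, rng=rng)

# Same shape as X, but every coordinate is moved by 3.0.
X_shifted = shift_points(X, shift=3.0)
```

In the repository the same effect would presumably come from adding the offset to X where mvn_diag.py produces the samples.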

Classification

We used a preprocessed version of the smallNORB dataset for this experiment.

Set Anomaly Detection

In this experiment, we used the CelebA dataset. The preprocessing code is provided in the Set Anomaly Detection folder.

State Prediction

We used the same process employed in the Slot Attention paper. We recommend using multiple GPUs for this experiment.

Reference

If you find our code useful, please consider citing our work.

@misc{zare2021picaso,
      title={PICASO: Permutation-Invariant Cascaded Attentional Set Operator}, 
      author={Samira Zare and Hien Van Nguyen},
      year={2021},
      eprint={2107.08305},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Owner

Samira Zare
