Code for Paper "Evidential Softmax for Sparse MultimodalDistributions in Deep Generative Models"

Last update: Jun 06, 2022

Related tags

Overview

Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

Abstract

Many applications of generative models rely on the marginalization of their high-dimensional output probability distributions. Normalization functions that yield sparse probability distributions can make exact marginalization more computationally tractable. However, sparse normalization functions usually require alternative loss functions for training because the log-likelihood can be undefined for sparse probability distributions. Furthermore, many sparse normalization functions often collapse the multimodality of distributions. In this work, we present ev-softmax, a sparse normalization function that preserves the multimodality of probability distributions. We derive its properties, including its gradient in closed-form, and introduce a continuous family of approximations to ev-softmax that have full support and can thus be trained with probabilistic loss functions such as negative log-likelihood and Kullback-Leibler divergence. We evaluate our method on a variety of generative models, including variational autoencoders and auto-regressive models. Our method outperforms existing dense and sparse normalization techniques in distributional accuracy and classification performance. We demonstrate that ev-softmax successfully reduces the dimensionality of output probability distributions while maintaining multimodality.

Setup

Required packages are listed in requirements.txt.

Running

The implementation for the ev-softmax function and its loss function can be found in evsoftmax.py.

The MNIST CVAE and VQ-VAE experiments can be run using run_mnist_cvae.sh and run_vqvae.sh, respectively. Instructions for the SSVAE experiment can be found in mnist_ssvae/README.md, and scripts used for preprocessing, training, and evaluating can be found in mnist_ssvae/scripts. Instructions for the translation experiment can be found in translation/README.md, and scripts used for preprocessing, training, and evaluating can be found in translation/scripts/iwslt.

Code for Paper "Evidential Softmax for Sparse MultimodalDistributions in Deep Generative Models"

Related tags

Overview

Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

Abstract

Setup

Running

Owner

Stanford Intelligent Systems Laboratory

FastFace: Lightweight Face Detection Framework

利用yolov5和TensorRT从0到1实现目标检测的模型训练到模型部署全过程

Self-Supervised Image Denoising via Iterative Data Refinement

A template repository for submitting a job to the Slurm Cluster installed at the DISI - University of Bologna

Efficient electromagnetic solver based on rigorous coupled-wave analysis for 3D and 2D multi-layered structures with in-plane periodicity

This provides the R code and data to replicate results in "The USS Trustee’s risky strategy"

It is modified Tensorflow 2.x version of Mask R-CNN

Code Release for the paper "TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation"

Consistency Regularization for Adversarial Robustness

The official implementation of Theme Transformer

CLIP2Video: Mastering Video-Text Retrieval via Image CLIP

Precomputed Real-Time Texture Synthesis with Markovian Generative Adversarial Networks

Paddle-Adversarial-Toolbox (PAT) is a Python library for Deep Learning Security based on PaddlePaddle.

Cobalt Strike teamserver detection.

Official PyTorch implementation of the paper: Improving Graph Neural Network Expressivity via Subgraph Isomorphism Counting.

Network Enhancement implementation in pytorch

Official repository for the NeurIPS 2021 paper Get Fooled for the Right Reason: Improving Adversarial Robustness through a Teacher-guided curriculum Learning Approach

Monocular 3D pose estimation. OpenVINO. CPU inference or iGPU (OpenCL) inference.

Imbalanced Gradients: A Subtle Cause of Overestimated Adversarial Robustness

Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch