Repository for "Improving evidential deep learning via multi-task learning," published in AAAI2022

Last update: Nov 19, 2022

Overview

Improving evidential deep learning via multi-task learning

This is the repository for the AAAI 2022 paper "Improving evidential deep learning via multi-task learning" by Dongpin Oh and Bonggun Shin.

This repository contains the code to reproduce the multi-task evidential neural network (MT-ENet), which adds the Lipschitz MSE loss function as an additional loss for the evidential regression network (ENet). The Lipschitz MSE loss can improve the accuracy of the ENet while preserving its uncertainty estimation capability, because it avoids gradient conflict with the NLL loss function, the original loss function of the ENet.

Setup

Please refer to "requirements.txt" for the packages required by this repo.

pip install -r requirements.txt

Training the ENet with the Lipschitz-MSE loss: example

from mtevi.mtevi import EvidentialMarginalLikelihood, EvidenceRegularizer, modified_mse
...
net = EvidentialNetwork()  # evidential regression network (ENet)
nll_loss = EvidentialMarginalLikelihood()  # original ENet loss (NLL)
reg = EvidenceRegularizer()  # evidential regularizer
mmse_loss = modified_mse  # Lipschitz MSE loss
...
for inputs, labels in dataloader:
	optimizer.zero_grad()  # assumes an optimizer was created in the elided setup above
	gamma, nu, alpha, beta = net(inputs)
	loss = nll_loss(gamma, nu, alpha, beta, labels)
	loss += reg(gamma, nu, alpha, beta, labels)
	loss += mmse_loss(gamma, nu, alpha, beta, labels)
	loss.backward()
	optimizer.step()  # apply the combined gradient
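
To make the four network outputs in the example above more concrete, here is a minimal sketch of how they are commonly read in deep evidential regression. It assumes that gamma, nu, alpha, and beta parameterize a Normal-Inverse-Gamma distribution, as in standard evidential regression; the helper name `nig_summary` is hypothetical and is not part of `mtevi`.

```python
def nig_summary(gamma, nu, alpha, beta):
    """Summarize Normal-Inverse-Gamma parameters for one prediction.

    Assumption: standard evidential-regression formulas, which need
    alpha > 1 and nu > 0 for the variances to be finite.
    """
    if alpha <= 1.0 or nu <= 0.0:
        raise ValueError("need alpha > 1 and nu > 0")
    prediction = gamma                        # expected target value, E[mu]
    aleatoric = beta / (alpha - 1.0)          # expected data noise, E[sigma^2]
    epistemic = beta / (nu * (alpha - 1.0))   # model uncertainty, Var[mu]
    return prediction, aleatoric, epistemic


# Hypothetical parameter values, chosen only for illustration.
pred, alea, epis = nig_summary(gamma=0.5, nu=2.0, alpha=3.0, beta=4.0)
print(pred, alea, epis)
```

A larger nu (more "virtual observations" of the mean) shrinks the epistemic term while leaving the aleatoric term unchanged.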

Quick start

Synthetic data experiment.

python synthetic_exp.py

UCI regression benchmark experiments.

python uci_exp_norm.py -p energy

Drug target affinity (DTA) regression task on KIBA and Davis datasets.

python train_evinet.py -o test --type davis -f 0 --evi # ENet
python train_evinet.py -o test --type davis -f 0  # MT-ENet

Gradient conflict experiment on the DTA benchmarks

python check_conflict.py --type davis -f 0 # Conflict between the Lipschitz MSE (proposed) and NLL loss. 
python check_conflict.py --type davis -f 0 --abl # Conflict between the simple MSE loss and NLL loss.
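
As a rough illustration of what "gradient conflict" between two losses means, the sketch below uses one common measure: the cosine similarity between the two gradient vectors, with a negative value counted as a conflict. This is an assumption made for illustration; check_conflict.py may quantify conflict differently, and the function names here are hypothetical.

```python
import math


def gradient_cosine(g1, g2):
    """Cosine similarity between two flattened gradient vectors."""
    dot = sum(a * b for a, b in zip(g1, g2))
    norm1 = math.sqrt(sum(a * a for a in g1))
    norm2 = math.sqrt(sum(b * b for b in g2))
    if norm1 == 0.0 or norm2 == 0.0:
        return 0.0  # a zero gradient cannot conflict with anything
    return dot / (norm1 * norm2)


def is_conflicting(g1, g2):
    """Treat two gradients as conflicting when they point in opposing directions."""
    return gradient_cosine(g1, g2) < 0.0


# Hypothetical gradients of an MSE-style loss and the NLL loss w.r.t. shared weights.
grad_mse = [1.0, -2.0, 0.5]
grad_nll = [-1.0, 1.5, 0.0]
print(gradient_cosine(grad_mse, grad_nll), is_conflicting(grad_mse, grad_nll))
```

In a real model the two vectors would be obtained by backpropagating each loss separately and flattening the parameter gradients.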

Characteristic of the Lipschitz MSE loss

The Lipschitz MSE loss function helps the ENet predict target values more accurately.
It regularizes its own gradient so that it does not conflict with the NLL loss (the original loss function) when the NLL loss increases the predictive uncertainty of the ENet.
Please check our paper for details.
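
The idea of an MSE whose gradient is bounded can be sketched generically as follows. This is not the repository's `modified_mse`: in the paper the threshold is derived from the ENet outputs, whereas here `threshold` is a hypothetical input supplied by the caller, and both function names are made up for illustration. The loss is the squared error below the threshold and grows only linearly above it, so its gradient magnitude never exceeds 2 * sqrt(threshold).

```python
import math


def lipschitz_squared_error(pred, target, threshold):
    """Squared error that switches to linear growth once it reaches `threshold`."""
    sq = (target - pred) ** 2
    if sq < threshold:
        return sq
    # Linear continuation; equals `threshold` exactly at the switch point.
    return 2.0 * math.sqrt(threshold) * abs(target - pred) - threshold


def lipschitz_squared_error_grad(pred, target, threshold):
    """Derivative with respect to `pred`; magnitude is at most 2 * sqrt(threshold)."""
    bound = math.sqrt(threshold)
    diff = pred - target
    clipped = max(-bound, min(bound, diff))
    return 2.0 * clipped


# Small error: behaves like plain MSE. Large error: the gradient is capped.
print(lipschitz_squared_error(0.0, 1.0, 4.0), lipschitz_squared_error_grad(0.0, 1.0, 4.0))
print(lipschitz_squared_error(0.0, 3.0, 4.0), lipschitz_squared_error_grad(0.0, 3.0, 4.0))
```

Capping the gradient this way limits how strongly the accuracy term can pull the shared parameters when the prediction error is large.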
