Code for the Lovász-Softmax loss (CVPR 2018)

Overview

The Lovász-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks

Maxim Berman, Amal Rannen Triki, Matthew B. Blaschko

ESAT-PSI, KU Leuven, Belgium.

Published in CVPR 2018. See the project page, the arXiv paper, and the paper on CVF Open Access.

PyTorch implementation of the loss layer (pytorch folder)

Files included:

  • lovasz_losses.py: Standalone PyTorch implementation of the Lovász hinge and Lovász-Softmax for the Jaccard index
  • demo_binary.ipynb: Jupyter notebook showcasing binary training of a linear model, with the Lovász Hinge and with the Lovász-Sigmoid.
  • demo_multiclass.ipynb: Jupyter notebook showcasing multiclass training of a linear model with the Lovász-Softmax

The binary lovasz_hinge expects real-valued scores (positive scores correspond to foreground pixels).
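A minimal usage sketch, assuming the lovasz_hinge(logits, labels) call signature from lovasz_losses.py; the tensor shapes and values below are illustrative:

import torch
from lovasz_losses import lovasz_hinge  # from the pytorch folder

# logits: raw, unnormalized network outputs, shape [B, H, W]
# labels: binary ground-truth masks (0 = background, 1 = foreground), same shape
logits = torch.randn(4, 64, 64, requires_grad=True)
labels = (torch.rand(4, 64, 64) > 0.5).float()

loss = lovasz_hinge(logits, labels)  # positive logits are treated as foreground
loss.backward()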

The multiclass lovasz_softmax expects class probabilities (the maximum-scoring category is predicted). Apply a Softmax layer to the unnormalized scores first.
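For example (assuming the lovasz_softmax(probas, labels) call signature from lovasz_losses.py; the number of classes and shapes are placeholders):

import torch
import torch.nn.functional as F
from lovasz_losses import lovasz_softmax  # from the pytorch folder

num_classes = 21
scores = torch.randn(4, num_classes, 64, 64, requires_grad=True)  # unnormalized scores [B, C, H, W]
labels = torch.randint(0, num_classes, (4, 64, 64))               # ground-truth class indices [B, H, W]

probas = F.softmax(scores, dim=1)       # normalize the scores to class probabilities first
loss = lovasz_softmax(probas, labels)
loss.backward()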

TensorFlow implementation of the loss layer (tensorflow folder)

Files included:

  • lovasz_losses_tf.py: Standalone TensorFlow implementation of the Lovász hinge and Lovász-Softmax for the Jaccard index
  • demo_binary_tf.ipynb: Jupyter notebook showcasing binary training of a linear model, with the Lovász Hinge and with the Lovász-Sigmoid.
  • demo_multiclass_tf.ipynb: Jupyter notebook showcasing the application of the multiclass loss with the Lovász-Softmax

Warning: the loss values and gradients have been verified to match the PyTorch implementation (see notebooks); however, we have not used the TF implementation in a training setting.
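A minimal usage sketch, assuming the TensorFlow functions mirror the PyTorch call signature (TF1 graph-mode style, matching the era of this implementation; shapes are illustrative):

import tensorflow as tf
from lovasz_losses_tf import lovasz_softmax  # from the tensorflow folder

# probas: class probabilities in BHWC order; labels: ground-truth class indices
probas = tf.placeholder(tf.float32, [None, None, None, 21])
labels = tf.placeholder(tf.int32, [None, None, None])
loss = lovasz_softmax(probas, labels)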

Usage

See the demos for simple proofs of principle.

FAQ

  • How should I use the Lovász-Softmax loss?

The loss can be optimized on its own, but the best hyperparameters (learning rate, momentum) may differ from those that work best for cross-entropy. As discussed in the paper, optimizing the dataset-mIoU (Pascal VOC measure) depends on the batch size and the number of classes. Therefore you may get the best results by training with cross-entropy first and fine-tuning with our loss, or by combining the two losses (a sketch of such a combination appears after this answer).

See for example how the work Land Cover Classification From Satellite Imagery With U-Net and Lovasz-Softmax Loss by Alexander Rakhlin et al. used our loss in the CVPR 18 DeepGlobe challenge.
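A minimal sketch of combining cross-entropy with the Lovász-Softmax loss; the weighting alpha and the helper name are illustrative assumptions, not part of the released code:

import torch
import torch.nn.functional as F
from lovasz_losses import lovasz_softmax  # from the pytorch folder

def combined_loss(scores, labels, alpha=0.5):
    # scores: unnormalized network outputs [B, C, H, W]
    # labels: ground-truth class indices   [B, H, W]
    # alpha:  weight of the Lovasz-Softmax term (illustrative choice)
    ce = F.cross_entropy(scores, labels)
    lovasz = lovasz_softmax(F.softmax(scores, dim=1), labels)
    return (1 - alpha) * ce + alpha * lovasz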

  • Inference in Tensorflow is very slow...

Compiling from TensorFlow master (or using a later distribution that includes the relevant upstream commit) should solve this problem; see issue #6.

Citation

Please cite

@inproceedings{berman2018lovasz,
  title={The Lov{\'a}sz-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks},
  author={Berman, Maxim and Rannen Triki, Amal and Blaschko, Matthew B},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  pages={4413--4421},
  year={2018}
}