Implement the Pareto Optimizer and pcgrad to make a self-adaptive loss for multi-task

Last update: Dec 25, 2022

Related tags

Deep Learning multi-task_loss_optimizer

Overview

multi-task_losses_optimizer

Implement the Pareto Optimizer and pcgrad to make a self-adaptive loss for multi-task

已经实验过了，不会有cuda out of memory情况

##Pareto optimizer

from Pareto_fn import pareto_fn
w_list = [w1,w2,...]
c_list = [c1,c2,...]
[loss1,loss2,...] = model(inputs)
loss_list = [loss1,loss2,...]
# config is the superparameter for training
new_w_list = pareto_fn(w_list,c_list,config,loss_list)
loss = 0
for i in range(len(w_list)):
    loss += new_w_list[i]*loss_list[i]
model.zero_grad()

loss.backward()
optimizer.step()

##pcgrad optimizer

from pcgrad_fn import pcgrad_fn

[loss1,loss2,...] = model(inputs)
loss_list = [loss1,loss2,...]
# config is the superparameter for training

pcgrad_fn(model,loss_list,optimizer)

optimizer.step()

Reference

Please cite as:

@article{yu2020gradient,
  title={Gradient surgery for multi-task learning},
  author={Yu, Tianhe and Kumar, Saurabh and Gupta, Abhishek and Levine, Sergey and Hausman, Karol and Finn, Chelsea},
  journal={arXiv preprint arXiv:2001.06782},
  year={2020}
}

paper: "A Pareto-Efficient Algorithm for Multiple Objective Optimization in E-Commerce Recommendation". RecSys, 2019, Alibaba

Implement the Pareto Optimizer and pcgrad to make a self-adaptive loss for multi-task

Related tags

Overview

multi-task_losses_optimizer

Reference

Owner

A framework for analyzing computer vision models with simulated data

Code for the paper Language as a Cognitive Tool to Imagine Goals in Curiosity Driven Exploration

Syllabus del curso IIC2115 - Programación como Herramienta para la Ingeniería 2022/I

Human-Pose-and-Motion History

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Make your master artistic punk avatar through machine learning world famous paintings.

We present a regularized self-labeling approach to improve the generalization and robustness properties of fine-tuning.

Codes for our paper "SentiLARE: Sentiment-Aware Language Representation Learning with Linguistic Knowledge" (EMNLP 2020)

NEG loss implemented in pytorch

[3DV 2021] Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth Estimation

A large-scale video dataset for the training and evaluation of 3D human pose estimation models

The Codebase for Causal Distillation for Language Models.

DeepAL: Deep Active Learning in Python

Hierarchical Few-Shot Generative Models

Implementations of polygamma, lgamma, and beta functions for PyTorch

Implementation of ProteinBERT in Pytorch

IAST: Instance Adaptive Self-training for Unsupervised Domain Adaptation (ECCV 2020)

Compositional and Parameter-Efficient Representations for Large Knowledge Graphs

Tools for manipulating UVs in the Blender viewport.

Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]