Pareto optimizer and PCGrad implementations for a self-adaptive multi-task loss

Last update: Dec 25, 2022


Overview

multi-task_losses_optimizer

This repository implements the Pareto optimizer and PCGrad to build a self-adaptive loss for multi-task learning.

Both have been tested and do not run into CUDA out-of-memory errors.

## Pareto optimizer

from Pareto_fn import pareto_fn

# task weights and their per-task constraints
w_list = [w1, w2, ...]
c_list = [c1, c2, ...]
# the model returns one loss per task
loss_list = list(model(inputs))
# config holds the training hyperparameters
new_w_list = pareto_fn(w_list, c_list, config, loss_list)
# combine the task losses using the updated weights
loss = sum(w * l for w, l in zip(new_w_list, loss_list))
model.zero_grad()
loss.backward()
optimizer.step()
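The internals of `pareto_fn` are not shown above. As a rough illustration of the idea behind Pareto-efficient weighting, the sketch below handles only the two-task case, where the weights that minimize the norm of the weighted gradient sum have a closed form. It assumes the constraints `c1` and `c2` act as lower bounds on the weights, as in the paper listed under Reference. The function name `two_task_pareto_weights` and the use of plain Python lists for gradients are hypothetical choices made for this example, not the repository's API.

```python
def dot(a, b):
    """Inner product of two equal-length sequences of floats."""
    return sum(x * y for x, y in zip(a, b))


def two_task_pareto_weights(g1, g2, c1=0.0, c2=0.0):
    """Return (w1, w2) with w1 + w2 = 1, w1 >= c1, w2 >= c2 that
    minimizes ||w1 * g1 + w2 * g2||^2.

    g1 and g2 are the flattened gradients of the two task losses.
    This is an illustrative sketch, not the repository's pareto_fn.
    """
    if c1 < 0 or c2 < 0 or c1 + c2 > 1.0:
        raise ValueError("lower bounds must be non-negative and sum to at most 1")

    diff = [a - b for a, b in zip(g1, g2)]  # g1 - g2
    denom = dot(diff, diff)
    if denom == 0.0:
        # Identical gradients: every split gives the same norm.
        w1 = 0.5
    else:
        # Unconstrained minimizer of ||g2 + w1 * (g1 - g2)||^2.
        w1 = -dot(g2, diff) / denom

    # The objective is a convex quadratic in w1, so clipping the
    # unconstrained minimizer to the feasible interval is optimal.
    w1 = min(max(w1, c1), 1.0 - c2)
    return w1, 1.0 - w1


if __name__ == "__main__":
    # Orthogonal gradients of equal norm are weighted equally.
    print(two_task_pareto_weights([1.0, 0.0], [0.0, 1.0]))
```

With more than two tasks there is no such closed form in general, and the weights are found by solving a small constrained quadratic problem instead.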

## PCGrad optimizer

from pcgrad_fn import pcgrad_fn

# the model returns one loss per task
loss_list = list(model(inputs))
# no loss.backward() call here: pcgrad_fn is given the losses directly
pcgrad_fn(model, loss_list, optimizer)
optimizer.step()
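For readers unfamiliar with PCGrad, here is a minimal sketch of the projection step described in the "Gradient surgery" paper cited under Reference. Whenever two task gradients conflict (negative inner product), one is projected onto the normal plane of the other, and the projected gradients are then summed. The function name `pcgrad_combine` and the plain-list representation of flattened gradients are assumptions made for this example; the repository's `pcgrad_fn` operates on a model and an optimizer and may differ in detail.

```python
import random


def dot(a, b):
    """Inner product of two equal-length sequences of floats."""
    return sum(x * y for x, y in zip(a, b))


def pcgrad_combine(grads, rng=None):
    """Combine per-task gradients with PCGrad-style projection.

    grads: list of flattened task gradients (lists of floats).
    rng:   optional random.Random used to shuffle the task order.
    Illustrative sketch only, not the repository's pcgrad_fn.
    """
    if rng is None:
        rng = random.Random(0)

    projected = []
    for i, g_i in enumerate(grads):
        g = list(g_i)  # work on a copy of task i's gradient
        others = [j for j in range(len(grads)) if j != i]
        rng.shuffle(others)  # the other tasks are visited in random order
        for j in others:
            g_j = grads[j]
            inner = dot(g, g_j)
            if inner < 0:
                # Conflict: remove the component of g along g_j.
                scale = inner / dot(g_j, g_j)
                g = [a - scale * b for a, b in zip(g, g_j)]
        projected.append(g)

    # The final update direction is the sum of the projected gradients.
    return [sum(column) for column in zip(*projected)]


if __name__ == "__main__":
    print(pcgrad_combine([[1.0, 0.0], [-1.0, 1.0]]))
```

In a training loop, the combined vector would be written back into the parameters' `.grad` fields before calling `optimizer.step()`.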

Reference

Please cite as:

@article{yu2020gradient,
  title={Gradient surgery for multi-task learning},
  author={Yu, Tianhe and Kumar, Saurabh and Gupta, Abhishek and Levine, Sergey and Hausman, Karol and Finn, Chelsea},
  journal={arXiv preprint arXiv:2001.06782},
  year={2020}
}

Paper: "A Pareto-Efficient Algorithm for Multiple Objective Optimization in E-Commerce Recommendation", RecSys 2019 (Alibaba).

