Guiding evolutionary strategies by (inaccurate) differentiable robot simulators @ NeurIPS, 4th Robot Learning Workshop

Last update: Dec 14, 2021

Overview

Guiding Evolutionary Strategies by Differentiable Robot Simulators

In recent years, Evolutionary Strategies have been actively explored for policy search in robotic tasks, as they provide a simpler alternative to reinforcement learning algorithms. However, this class of algorithms is often claimed to be extremely sample-inefficient. At the same time, there is growing interest in Differentiable Robot Simulators (DRS), as they can potentially find successful policies with only a handful of trajectories. The resulting gradient, however, is not always useful for first-order optimization. In this work, we demonstrate how the DRS gradient can be used in conjunction with Evolutionary Strategies. Preliminary results suggest that this combination can reduce the sample complexity of Evolutionary Strategies by 3x-5x in both simulation and the real world.
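To make the idea of "guiding" concrete, here is a minimal sketch of one way an inaccurate gradient can bias an Evolutionary Strategies search. It is in the style of Guided ES with a one-dimensional guiding subspace and is not necessarily the exact procedure used in the paper. The function names (`guided_es_gradient`, `run_demo`), the toy quadratic loss, the fixed `bias` that stands in for simulator error, and all hyperparameter values are assumptions made for illustration.

```python
import numpy as np


def guided_es_gradient(f, x, surrogate_grad, rng,
                       sigma=0.1, alpha=0.5, beta=1.0, pairs=10):
    """Antithetic ES gradient estimate whose search is biased along surrogate_grad.

    Perturbations have covariance sigma^2 * (alpha / n * I + (1 - alpha) * u u^T),
    where u is the normalized surrogate gradient (hypothetical stand-in for a DRS gradient).
    """
    n = x.size
    norm = np.linalg.norm(surrogate_grad)
    u = surrogate_grad / norm if norm > 0 else np.zeros(n)
    grad = np.zeros(n)
    for _ in range(pairs):
        # Mix an isotropic direction with a direction along the surrogate gradient.
        eps = sigma * (np.sqrt(alpha / n) * rng.standard_normal(n)
                       + np.sqrt(1.0 - alpha) * rng.standard_normal() * u)
        # Antithetic pair: evaluate the objective at x + eps and x - eps.
        grad += eps * (f(x + eps) - f(x - eps))
    return beta / (2.0 * sigma**2 * pairs) * grad


def run_demo(steps=100, lr=0.5, dim=20, seed=0):
    """Minimize a toy quadratic using a deliberately biased surrogate gradient."""
    rng = np.random.default_rng(seed)
    loss = lambda x: 0.5 * float(np.dot(x, x))
    bias = rng.standard_normal(dim)  # fixed error, mimicking an inaccurate simulator
    x = np.ones(dim)
    initial = loss(x)
    for _ in range(steps):
        surrogate = x + 0.5 * bias  # true gradient of the toy loss plus the bias
        x = x - lr * guided_es_gradient(loss, x, surrogate, rng)
    return initial, loss(x)


if __name__ == "__main__":
    initial, final = run_demo()
    print(f"loss before: {initial:.3f}, after: {final:.3f}")
```

In this sketch each update costs `2 * pairs` evaluations of the objective, which would correspond to rollouts in a policy-search setting. The parameter `alpha` trades off isotropic exploration (`alpha = 1`) against searching along the surrogate gradient (`alpha = 0`).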

To appear in the 4th Robot Learning Workshop: Self-Supervised and Lifelong Learning

Paper -- Video -- Poster

Citation

Please use the following BibTeX entry:

@misc{kurenkov2021guiding,
      title={Guiding Evolutionary Strategies by Differentiable Robot Simulators}, 
      author={Vladislav Kurenkov and Bulat Maksudov},
      year={2021},
      eprint={2110.00438},
      archivePrefix={arXiv},
      primaryClass={cs.RO}
}

Owner

Vladislav Kurenkov
