A lightweight library for sequential learning agents, including reinforcement learning

Last update: Dec 17, 2022

Overview

SaLinA: A Flexible and Simple Library for Learning Sequential Agents (including Reinforcement Learning)

TL;DR

salina is a lightweight library extending PyTorch modules for developing sequential decision models. It can be used for Reinforcement Learning (including model-based with differentiable environments, multi-agent RL, ...), but also in supervised/unsupervised learning settings (for instance for NLP, Computer Vision, etc.).

It lets you write very complex sequential models (or policies) in a few lines
It works on multiple CPUs and GPUs

Quick Start

Just clone the repo

Documentation

For development, set up pre-commit hooks:

Run pip install pre-commit
- or conda install -c conda-forge pre-commit
- or brew install pre-commit
In the top directory of the repo, run pre-commit install to set up the git hook scripts
Now pre-commit will run automatically on git commit!
Currently isort, black and blacken-docs are used, in that order

Organization of the repo

salina is the core library
- salina.agents is the catalog of agents (the equivalent of torch.nn, but for agents)
salina_examples provides many examples (in different domains)

Dependencies

salina uses pytorch, hydra for configuring experiments, and gym for reinforcement learning algorithms.

Note on the Logger

We provide a simple Logger that logs both in tensorboard format and as pickle files that can be re-read to make tables and figures. See logger. This logger can easily be replaced by any other logger.
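To make the "pickle files that can be re-read" idea concrete, here is a minimal sketch in plain Python. It is not the Logger shipped with salina: the class name `PickleLogger`, its methods, and the helper `read_log` are all hypothetical, and the tensorboard output is left out entirely. It only illustrates the pattern of recording scalars during a run and reloading them later to compute a statistic or draw a figure.

```python
import os
import pickle
import tempfile
from typing import Dict, List, Tuple

# Hypothetical type: for each metric name, a list of (iteration, value) pairs.
Scalars = Dict[str, List[Tuple[int, float]]]


class PickleLogger:
    """Hypothetical logger (not salina's): stores scalars and dumps them to a pickle file."""

    def __init__(self, path: str) -> None:
        self.path = path
        self.scalars: Scalars = {}

    def add_scalar(self, name: str, value: float, iteration: int) -> None:
        # Append the new point to the curve associated with `name`.
        self.scalars.setdefault(name, []).append((iteration, value))

    def save(self) -> None:
        # Write every recorded curve to disk in one pickle file.
        with open(self.path, "wb") as f:
            pickle.dump(self.scalars, f)


def read_log(path: str) -> Scalars:
    """Reload the curves written by PickleLogger.save()."""
    with open(path, "rb") as f:
        return pickle.load(f)


# Record three values during a (fake) training run.
log_path = os.path.join(tempfile.mkdtemp(), "log.pkl")
logger = PickleLogger(log_path)
for iteration, reward in enumerate([0.5, 1.0, 1.5]):
    logger.add_scalar("reward", reward, iteration)
logger.save()

# Later, for instance in a notebook: re-read the file to build a table or a figure.
curves = read_log(log_path)
mean_reward = sum(value for _, value in curves["reward"]) / len(curves["reward"])
```

Keeping the raw (iteration, value) pairs, rather than only a rendered dashboard, is what makes it possible to aggregate several runs afterwards.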

Description

Sequential Decision Making is much more than Reinforcement learning

Sequential Decision Making is about interactions:
Interaction with data (e.g. attention models, decision trees, cascade models, active sensing, active learning, recommendation, etc.)
Interaction with an environment (e.g. games, control)
Interaction with humans (e.g. recommender systems, dialog systems, health systems, ...)
Interaction with a model of the world (e.g. simulation)
Interaction between multiple entities (e.g. multi-agent RL)

What `salina` is

A sandbox for developing sequential models at scale.
A small (300 lines) 'core' code that defines everything you will use to implement agents involved in sequential decision learning systems.
- It is easy to understand and to use since it keeps the main principles of pytorch, just extending nn.Module to Agent, which handles the temporal dimension.

A set of agents that can be combined (like pytorch modules) to obtain complex behaviors

A set of reference implementations and examples in different domains: Reinforcement Learning, Imitation Learning, Computer Vision, ... (more to come)
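The two ideas above, an agent that handles the temporal dimension and agents that can be combined like modules, can be illustrated with a small sketch. It is written in plain Python so that it runs without dependencies, and it does not use salina's actual API: `Workspace`, `Agent`, `Sequence`, `Doubler`, `Accumulator` and `run` are hypothetical names chosen for this illustration. In the library itself, agents extend nn.Module as described above.

```python
from typing import Dict, Tuple


class Workspace:
    """Hypothetical shared store: maps (variable name, time step) to a value."""

    def __init__(self) -> None:
        self._data: Dict[Tuple[str, int], float] = {}

    def get(self, name: str, t: int) -> float:
        return self._data[(name, t)]

    def set(self, name: str, t: int, value: float) -> None:
        self._data[(name, t)] = value


class Agent:
    """Hypothetical base class: like a module, but called with a time step t."""

    def __call__(self, workspace: Workspace, t: int) -> None:
        raise NotImplementedError


class Doubler(Agent):
    """Reads 'x' at time t and writes 'y' at time t."""

    def __call__(self, workspace: Workspace, t: int) -> None:
        workspace.set("y", t, 2.0 * workspace.get("x", t))


class Accumulator(Agent):
    """Reads 'y' at time t and 'total' at time t - 1, then writes 'total' at time t."""

    def __call__(self, workspace: Workspace, t: int) -> None:
        previous = workspace.get("total", t - 1) if t > 0 else 0.0
        workspace.set("total", t, previous + workspace.get("y", t))


class Sequence(Agent):
    """Combines agents: runs them one after another at the same time step."""

    def __init__(self, *agents: Agent) -> None:
        self.agents = agents

    def __call__(self, workspace: Workspace, t: int) -> None:
        for agent in self.agents:
            agent(workspace, t)


def run(agent: Agent, workspace: Workspace, n_steps: int) -> None:
    """Unrolls an agent over time steps 0 .. n_steps - 1."""
    for t in range(n_steps):
        agent(workspace, t)


# Fill the inputs, then unroll the combined agent over three time steps.
workspace = Workspace()
for t, x in enumerate([1.0, 2.0, 3.0]):
    workspace.set("x", t, x)

run(Sequence(Doubler(), Accumulator()), workspace, n_steps=3)
totals = [workspace.get("total", t) for t in range(3)]
```

Because agents only communicate through named variables in the shared store, `Accumulator` does not need to know which agent produced `y`. This is what allows agents to be swapped or recombined freely.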

What `salina` is not

Yet another reinforcement learning framework: salina is focused on sequential decision making in general. It can be used for RL (which is our main current use-case), but also for supervised learning, attention models, multi-agent learning, planning, control, cascade models, recommender systems, ...
A library: salina is just a small layer on top of pytorch that encourages good practices for implementing sequential models. It is thus very simple to understand and to use, but very powerful.

Citing `salina`

Link to the paper: SaLinA: Sequential Learning of Agents

Please use this bibtex if you want to cite this repository in your publications:

    @misc{salina,
        author = {Ludovic Denoyer and Alfredo de la Fuente and Song Duong and Jean-Baptiste Gaya and Pierre-Alexandre Kamienny and Daniel H. Thompson},
        title = {SaLinA: Sequential Learning of Agents},
        year = {2021},
        publisher = {Arxiv},
        howpublished = {\url{https://gitHub.com/facebookresearch/salina}},
    }

Papers using SaLinA:

Learning a subspace of policies for online adaptation in Reinforcement Learning. Jean-Baptiste Gaya, Laure Soulier, Ludovic Denoyer - Arxiv

License

salina is released under the MIT license. See LICENSE for additional details about it. See also our Terms of Use and Privacy Policy.

Owner

Facebook Research
