Admin Panels
Algorithms
Asset Management
Audio
Authentication
More Categories
Boilerplate Build Tools Caching CMS Code Analysis Code Refactoring Code review tool Command-line Interface Development Command-line Tools Communication Computer Vision Concurrency and Parallelism Configuration Cryptography Data Analysis Data Containers Data Serialization Data Structures Data Validation Data Visualization Database Database Drivers Date & Time Utilities Debugging Tools Deep Learning Deep Learning Model Explanation DevOps Tools Distributed Computing Distribution Django Documentation Downloader E-commerce Editor Plugins Email Environment Management FastAPI Projects FastAPI Utilities Feature Engineering File & Path Utilities Finance Flask Forms Functional Programming Game Development General Utilities Geolocation GPU Utilities GraphQL GUI Development Hardware HTML Manipulation HTTP Clients IDE Image Processing Implementations of Python Internationalization Interpreter Job Scheduler JSON Linters & Style Checkers Logging Machine Learning Markdown/YAML Microsoft Windows Miscellaneous Monitoring Network Virtualization Networking Office Files Processing Organization ORM Package Management Payment Processing PDF Files Processing Performance optimization Pipelines Process Utilities Productivity PyTorch Learning Resources Pytorch Utilities Recommender Systems Reinforcement Learning RESTful API RPC Servers Science SCM Search Security related resources Serialization Serverless Frameworks Sklearn Utilities Specific Formats Processing Static Site Generator Storage Task Queues Template Engine Testing Text Data & NLP Text Processing Third-party APIs Wrappers URL Manipulation Video Web Asset Management Web Content Extracting Web Crawling Web Frameworks WebSocket WSGI Servers
Popular Repo
Latest Repo
Resources
All Article News Book Tutorial

Overview
Comments 1
Releases

Reinforcement Learning Theory Book (rus)

Last update: Nov 27, 2022

Related tags

Deep Learning RL-Theory-book

Overview

Reinforcement Learning Theory Book (rus)

Full book on Arxiv: https://arxiv.org/abs/2201.09746

Ch. 1: Introduction
Ch. 2: Meta-heuristics
- NEAT, WANN
- CEM, OpenAI-ES, CMA-ES
Ch. 3: Classic theory
- Bellman equations
- RPI, policy improv. theorem
- Value Iteration, Generalized Policy Iteration
- Temporal Difference, Q-learning, SARSA
- Eligibility Traces, TD-lambda, Retrace
Ch. 4: Value-based
- DQN
- Double DQN, Dueling DQN, PER, Noisy DQN, Multi-step DQN
- c51, QR-DQN, IQN, Rainbow DQN
Ch. 5: Policy Gradient
- REINFORCE, A2C, GAE
- TRPO, PPO
Ch. 6: Continuous Control
- DDPG, TD3
- SAC
Ch. 7: Model-based
- Bandits
- MCTS, AlphaZero, MuZero
- LQR
Ch. 8: Next Stage
- Imitation Learning / Inverse Reinforcement Learning
- Intrinsic Motivation
- Multi-Task and Hindsight
- Hierarchical RL
- Partial observability
- Multi-Agent RL

Owner

qbrick

qbrick

GitHub Repository

DataCLUE: 国内首个以数据为中心的AI测评（含模型分析报告）

DataCLUE: A Benchmark Suite for Data-centric NLP You can get the english version of README. 以数据为中心的AI测评(DataCLUE) 内容导引章节描述简介介绍以数据为中心的AI测评(DataCLUE

135 Dec 22, 2022

Developing your First ML Workflow of the AWS Machine Learning Engineer Nanodegree Program

Exercises and project documentation for the 3. Developing your First ML Workflow of the AWS Machine Learning Engineer Nanodegree Program

1 Jan 13, 2022

Cockpit is a visual and statistical debugger specifically designed for deep learning.

Cockpit: A Practical Debugging Tool for Training Deep Neural Networks

421 Dec 29, 2022

learned_optimization: Training and evaluating learned optimizers in JAX

learned_optimization: Training and evaluating learned optimizers in JAX learned_optimization is a research codebase for training learned optimizers. I

533 Dec 30, 2022

Official repo for AutoInt: Automatic Integration for Fast Neural Volume Rendering in CVPR 2021

AutoInt: Automatic Integration for Fast Neural Volume Rendering CVPR 2021 Project Page | Video | Paper PyTorch implementation of automatic integration

149 Dec 22, 2022

Machine learning algorithms for many-body quantum systems

NetKet NetKet is an open-source project delivering cutting-edge methods for the study of many-body quantum systems with artificial neural networks and

413 Dec 31, 2022

We will see a basic program that is basically a hint to brute force attack to crack passwords. In other words, we will make a program to Crack Any Password Using Python. Show some ❤️ by starring this repository!

Crack Any Password Using Python We will see a basic program that is basically a hint to brute force attack to crack passwords. In other words, we will

11 Dec 03, 2022

The implemention of Video Depth Estimation by Fusing Flow-to-Depth Proposals

Flow-to-depth (FDNet) video-depth-estimation This is the implementation of paper Video Depth Estimation by Fusing Flow-to-Depth Proposals Jiaxin Xie,

32 Jun 14, 2022

This repository contains pre-trained models and some evaluation code for our paper Towards Unsupervised Dense Information Retrieval with Contrastive Learning

Contriever: Towards Unsupervised Dense Information Retrieval with Contrastive Learning This repository contains pre-trained models and some evaluation

207 Jan 08, 2023

Code repository for "Free View Synthesis", ECCV 2020.

Free View Synthesis Code repository for "Free View Synthesis", ECCV 2020. Setup Install the following Python packages in your Python environment - num

253 Dec 07, 2022

CARL provides highly configurable contextual extensions to several well-known RL environments.

CARL (context adaptive RL) provides highly configurable contextual extensions to several well-known RL environments.

51 Dec 28, 2022

Trains an agent with stochastic policy gradient ascent to solve the Lunar Lander challenge from OpenAI

Introduction This script trains an agent with stochastic policy gradient ascent to solve the Lunar Lander challenge from OpenAI. In order to run this

0 Jan 02, 2022

Differentiable Optimizers with Perturbations in Pytorch

Differentiable Optimizers with Perturbations in PyTorch This contains a PyTorch implementation of Differentiable Optimizers with Perturbations in Tens

54 Jun 22, 2022

This repo provides code for QB-Norm (Cross Modal Retrieval with Querybank Normalisation)

This repo provides code for QB-Norm (Cross Modal Retrieval with Querybank Normalisation) Usage example python dynamic_inverted_softmax.py --sims_train

36 Dec 29, 2022

Generate images from texts. In Russian

ruDALL-E Generate images from texts pip install rudalle==1.1.0rc0 🤗 HF Models: ruDALL-E Malevich (XL) ruDALL-E Emojich (XL) (readme here) ruDALL-E S

1.6k Dec 31, 2022

Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020

XDVioDet Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020. The proj

64 Dec 12, 2022

A visualization tool to show a TensorFlow's graph like TensorBoard

tfgraphviz tfgraphviz is a module to visualize a TensorFlow's data flow graph like TensorBoard using Graphviz. tfgraphviz enables to provide a visuali

44 Nov 09, 2022

ICSS - Interactive Continual Semantic Segmentation

Presentation This repository contains the code of our paper: Weakly-supervised c

9 Jul 23, 2022

Extending JAX with custom C++ and CUDA code

Extending JAX with custom C++ and CUDA code This repository is meant as a tutorial demonstrating the infrastructure required to provide custom ops in

237 Dec 23, 2022

Make your first PR. A beginner friendly repository made specifically for open source beginners. Add any program under any language (it can be anything from a simple program to a complex data structure algorithm). Happy coding...

Hacktober Fest 2021 Upload Different Types of Programs in any Language Use this project to make your first contribution to an open source project on G

40 Oct 11, 2022

2022.PythonRepo

About
Contact Us
DMCA
Disclaimer
Privacy Policy