Self-driving car env with PPO algorithm from stable baseline3

Last update: Dec 22, 2022

Related tags

Deep Learning Self-Driving-car

Overview

Self-driving car with RL stable baseline3

Most of the project develop from https://github.com/GerardMaggiolino/Gym-Medium-Post Please check it out!

This project focus on training self-driving car env by implementing PPO algorithm from stable baseline3

Installation

Clone the project

git clone https://github.com/SornsiriP/Self-Driving-car

Then run Gym-Medium-Post/main.py

Update

Wrap env to change observation space from box to RGB image

from simple_driving.resources.wrapper import ProcessFrame84

env = ProcessFrame84(env)

Using PPO with CNN policy instead of TRPO

from stable_baselines3 import PPO

model = PPO('CnnPolicy', env, verbose=1,learning_rate = 0.00025,tensorboard_log="./Simple-driving/",n_steps=10000,batch_size=1000,gamma=0.9995)
model.learn(total_timesteps=150000)

Normalize action space

def map_action(self, action):
  speed_range = [0,1]
  steer_range = [-0.6,0.6]
  new_speed = np.interp(action[0],[-1,1],speed_range)
  new_steer = np.interp(action[0],[-1,1],steer_range)
  return [new_speed, new_steer]

Add limited timestep reset condition

if self.current_step >1000:
  self.current_step = 0
  self.done = True

Normalize distance in reward function

previous_dist_to_goal = np.linalg.norm(tuple(map(lambda i, j: i - j, self.goal, self.prev_pos)))
current_dist_to_goal =  np.linalg.norm(tuple(map(lambda i, j: i - j, self.goal, car_ob[0:2])))

Reference

https://github.com/GerardMaggiolino/Gym-Medium-Post

https://www.etedal.net/2020/04/pybullet-panda_3.html

Contributing

Sornsiri Promma

Thanks original project from Gerard Maggiolino

Please make sure to update tests as appropriate.

Self-driving car env with PPO algorithm from stable baseline3

Related tags

Overview

Self-driving car with RL stable baseline3

Installation

Update

Reference

Contributing

Owner

Sornsiri.P

MRI reconstruction (e.g., QSM) using deep learning methods

This provides the R code and data to replicate results in "The USS Trustee’s risky strategy"

[CVPR 21] Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2021.

External Attention Network

a project for 3D multi-object tracking

MANO hand model porting for the GraspIt simulator

Pytorch tutorials for Neural Style transfert

HybridNets: End-to-End Perception Network

Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification

AFL binary instrumentation

MPLP: Metapath-Based Label Propagation for Heterogenous Graphs

Animal Sound Classification (Cats Vrs Dogs Audio Sentiment Classification)

MAg: a simple learning-based patient-level aggregation method for detecting microsatellite instability from whole-slide images

This is a model to classify Vietnamese sign language using Motion history image (MHI) algorithm and CNN.

Code for the paper: "On the Bottleneck of Graph Neural Networks and Its Practical Implications"

Language models are open knowledge graphs ( non official implementation )

Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset

Repositório para arquivos sobre o Módulo 1 do curso Top Coders da Let's Code + Safra

VOLO: Vision Outlooker for Visual Recognition

Code for paper "Context-self contrastive pretraining for crop type semantic segmentation"