Skip to content

This repository is related to the final project of Reinforcement Learning taught by Sarath Chandar.( Regularization and Feature Selection in Least Squares Temporal Difference Learning)

mina-parham/INF8953DE_FinalProject

Repository files navigation

Regularization and Feature Selection in Least Squares Temporal Difference Learning

Description

This is Python implementations of Least Angle Regression Temporal Difference (LARS-TD) algorithm and Least-Squares Temporal Difference (LSTD). For more information on the algorithm please refer to the paper

“Regularization and Feature Selection in Least Squares Temporal Difference Learning”

https://zicokolter.com/publications/kolter2009regularization.pdf

In this paper, the authors tried to propose a regularization framework for least-square temporal differences learning. Specifically, they presented an approach to find the fixed point by using l1 regularization framework. To evaluate the framework’s efficiency, they examined the framework by using two well-known problems, which means Mountain Car and Chain Domain. The results showed that the framework could deal with challenges well

Executing program

python main.py

About

This repository is related to the final project of Reinforcement Learning taught by Sarath Chandar.( Regularization and Feature Selection in Least Squares Temporal Difference Learning)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages