Regularization and Feature Selection in Least Squares Temporal Difference Learning

Description

This is Python implementations of Least Angle Regression Temporal Difference (LARS-TD) algorithm and Least-Squares Temporal Difference (LSTD). For more information on the algorithm please refer to the paper

“Regularization and Feature Selection in Least Squares Temporal Difference Learning”

https://zicokolter.com/publications/kolter2009regularization.pdf

In this paper, the authors tried to propose a regularization framework for least-square temporal differences learning. Specifically, they presented an approach to find the fixed point by using l1 regularization framework. To evaluate the framework’s efficiency, they examined the framework by using two well-known problems, which means Mountain Car and Chain Domain. The results showed that the framework could deal with challenges well

Executing program

python main.py

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commits
.DS_Store		.DS_Store
LARSTD.py		LARSTD.py
LSTDQ.py		LSTDQ.py
README.md		README.md
chainwalk.py		chainwalk.py
fig_a.pdf		fig_a.pdf
fig_a.png		fig_a.png
fig_b.pdf		fig_b.pdf
fig_b.png		fig_b.png
fig_c.pdf		fig_c.pdf
fig_c.png		fig_c.png
main.py		main.py
mountaincar.py		mountaincar.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.DS_Store

.DS_Store

LARSTD.py

LARSTD.py

LSTDQ.py

LSTDQ.py

README.md

README.md

chainwalk.py

chainwalk.py

fig_a.pdf

fig_a.pdf

fig_a.png

fig_a.png

fig_b.pdf

fig_b.pdf

fig_b.png

fig_b.png

fig_c.pdf

fig_c.pdf

fig_c.png

fig_c.png

main.py

main.py

mountaincar.py

mountaincar.py

utils.py

utils.py

Repository files navigation

Regularization and Feature Selection in Least Squares Temporal Difference Learning

Description

Executing program

About

Releases

Packages

Languages

mina-parham/INF8953DE_FinalProject

Folders and files

Latest commit

History

Repository files navigation

Regularization and Feature Selection in Least Squares Temporal Difference Learning

Description

Executing program

About

Resources

Stars

Watchers

Forks

Languages