Skip to content

Anca-Mt/TabularRL-StochasticWindyGridWorld

About

Q-value iteration algorithm & ON-policy vs OFF-policy learning, introducing SARSA and Q-learning algorithms in the Stochastic Windy Grid environment

Topics

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages