Home

Awesome

FiniteEpisodicRL.jl

Reinforcement Learning Algorithms for Episodic MDPs With Finite SA-Spaces Source code for experiments in

UBEV - A More Practical Algorithm for Episodic RL with Near-Optimal PAC and Regret Guarantees<br> Christoph Dann, Tor Lattimore, Emma Brunskill<br> https://arxiv.org/abs/1703.07710

For Python implementations of some of the algorithms see https://github.com/iosband/TabulaRL/