Random Network Distillation

Intrinsic Reward Graph with play

Venture | Montezuma's Revenge
~New model for Montezuma

1. Setup

Requirements


2. How to Train

Modify the parameters in config.conf as you like.

python train.py
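
As a rough illustration of the "modify config.conf" step, the sketch below reads a few training parameters with Python's standard configparser module. It assumes config.conf is an INI-style file. The section and key names (EnvID, LearningRate, NumEnv) and their values are hypothetical placeholders, not necessarily the ones this repository uses, so check the actual file before relying on them.

```python
import configparser

# Hypothetical example contents. The real config.conf may use
# different sections, keys, and values.
EXAMPLE_CONF = """
[DEFAULT]
EnvID = MontezumaRevengeNoFrameskip-v4
LearningRate = 0.0001
NumEnv = 32
"""


def parse_config(text):
    """Parse INI-style configuration text into a ConfigParser."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    return parser


def load_config(path="config.conf"):
    """Load an INI-style configuration file from disk."""
    parser = configparser.ConfigParser()
    with open(path) as handle:
        parser.read_file(handle)
    return parser


# Parse the in-memory example so the sketch runs without the repository.
cfg = parse_config(EXAMPLE_CONF)
defaults = cfg["DEFAULT"]

# Typed getters convert the raw strings stored in the file.
env_id = defaults.get("EnvID")
learning_rate = defaults.getfloat("LearningRate")
num_env = defaults.getint("NumEnv")

print(env_id, learning_rate, num_env)
```

To read the real file instead, call load_config("config.conf") from the repository root and look up whichever keys the file actually defines.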

3. How to Eval

python eval.py

4. Loss/Reward Graph

References

[1] Actor-Critic Algorithms
[2] Efficient Parallel Methods for Deep Reinforcement Learning
[3] Exploration by Random Network Distillation
[4] Proximal Policy Optimization Algorithms