Awesome
Double-DQN-MxNet-Gluon
DDQN.ipynb is the implimentation of DDQN.
DDQN-efficent-ReplayBuffer.ipynb is the same code with 1/8 more efficient cpu memory usage where instead of storing current_states + secessur_state at each times step, we just store the current frame.