Deep Reinforcement Learning

Implementing and showcasing reinforcement learning algorithms with deep networks. Following Open AI spinning up suggestions.


Deep Q learning and Improvements

  1. Deep Q learning
  2. Deep Recurrent Q learning
  3. Double Deep Q learning
  4. Dueling Deep Q learning
  5. Dueling Double Deep Q learning
  6. Prioritized Experience Replay

Policy Gradient Methods

  1. Vanilla Policy Gradient [Reinforce]
  2. Value Actor Critic
  3. Advantage Actor Crictic
  4. Proximal Policy Optimization
  5. GAE
  6. Soft Actor Critic


All implementations upto Advantage actor critic is done. Working on updating the comments.


  1. Add texts explaining the inner workings of the codes.
  2. Solve other environments like cartpole and atari 2600.
  3. Use CNNs.
  4. Actor critics are barebone. Will need to implement improvement methods for performance.

Deep Reinforcement Learning

To keep track and showcase

Deep Reinforcement Learning Info

⭐ Stars13
🔗 Source
🕒 Last Update10 months ago
🕒 Created2 years ago
🐞 Open Issues0
➗ Star-Issue RatioInfinity
😎 Authorabyaadrafid