Language Breakdown
Lines of code distribution across 89 owned repositories
I-Shaped Developer
I-shapedSpecialist — deep expertise in Python
Collaboration Network
Global Impact visualization
Repos
220
PRs
0
Growth
+18%
Top Collaborators
No collaborator data yet.
Coding Streak
Contribution activity over the past year
Tyler Romero
@tyler-romero
Omar Sanseviero
@osanseviero
Pete Walsh
@epwalsh
Edward Hu
@edwhu
Laurent Mazare
@LaurentMazare
Top Repositories
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
Create Encrypted Backups of Your Bitwarden Vault with Attachments
RLHF implementation details of OAI's 2019 codebase
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details
The source code for the gym-microrts paper.
A2C is a special case of PPO!
Open Source Impact
Contributions to external projects