Costa Huang

@vwxyzjn

Exploiting physical rewards @periodic. Prev: RL @allenai @huggingface.

@periodic Menlo Park, CA

Followers: 1,793

Following: 122

Public Repos: 213

Private Repos

Language Breakdown

Lines of code distribution across 89 owned repositories

4.3M Total LOC

Python: 3,698,974 lines (86.6%)

Shell: 149,932 lines (3.5%)

JavaScript: 109,021 lines (2.6%)

HCL: 58,552 lines (1.4%)

TypeScript: 50,942 lines (1.2%)

Other: 203,102 lines (4.8%)

I-Shaped Developer

Specialist — deep expertise in Python

Collaboration Network

Global Impact visualization

0 active collaborators

Repos

220

PRs

Growth

+18%

Top Collaborators

No collaborator data yet.

Coding Streak

Contribution activity over the past year

1 day

131

Contributions

Commits

Pull Requests

Based on GitHub activity

Followers 1,793

Max Marrone

@SyntaxColoring

nirajgtm

@nirajgtm

YunxingZuo

@YunxingZuo

ƬⲘ ⚔️

@TM23-sanji

Conang

@Conanguo0223

Following

122 total

Tyler Romero

@tyler-romero

Omar Sanseviero

@osanseviero

Pete Walsh

@epwalsh

Edward Hu

@edwhu

Laurent Mazare

@LaurentMazare

Synced via GitHub

Top Repositories

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

9956 1102

Python

ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

941 120

Python

portwarden

Create Encrypted Backups of Your Bitwarden Vault with Attachments

633 35

lm-human-preference-details

RLHF implementation details of OAI's 2019 codebase

197 12

Python

invalid-action-masking

Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

170 22

Python

summarize_from_feedback_details

165 22

Python

cleanba

CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL

125 10

Python

PPO-Implementation-Deep-Dive

DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details

46 3

Python

gym-microrts-paper

The source code for the gym-microrts paper.

43 4

Python

a2c_is_a_special_case_of_ppo

A2C is a special case of PPO!

23 2

Python

Open Source Impact

Contributions to external projects

596 merged PRs

SchedMD/slurm

4052

openrlbenchmark/openrlbenchmark

264

CederGroupHub/chgnet

386

allenai/open-instruct

TencentCloud/CubeSandbox

6336

Contributed to 9 repositories