Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
1
5
Jeewoo Kim
jdubkim
Follow
jdubkim
AI & ML interests
LLM (Reasoning, RLHF) Trust and Safety
Organizations
None yet
models
1
jdubkim/ppo-LunarLander-v2-TEST
Reinforcement Learning
•
Updated
Dec 20, 2022
•
1
datasets
None public yet