Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Michel's picture

1

Michel

MichelM2510

AI & ML interests

None yet

Organizations

None yet

Collections 1

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published 27 days ago • 131

models

None public yet

datasets

None public yet

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs