Hugging Face
nvidia's Collections
Cosmos Tokenizer
Llama-3.1-Nemotron-70B
NVLM 1.0
OpenMath-2
Nemotron 4 340B
SteerLM
Parakeet
Canary
InstructRetro
OpenMath
RLHF
NV-Embed
Llama3-ChatQA-1.5
SSMs
Nemotron 3 8B
BigVGAN
MambaVision
Minitron
RADIO
NIM Serverless Inference API
Model Optimizer
Llama3-ChatQA-2
NeMo Curator - Classifier Models
RLHF
Updated Oct 1
A collection of models trained with Reinforcement Learning from Human Feedback (RLHF).
Upvote
4
nvidia/NV-Llama2-70B-RLHF-Chat • Text Generation • Updated Mar 9 • 4
nvidia/NV-Llama2-13B-RLHF-RM • Text Generation • Updated Mar 9 • 18 • 1
nvidia/sft_datablend_v1 • Viewer • Updated Mar 9 • 128k • 108 • 12
nvidia/Daring-Anteater • Viewer • Updated Jun 17 • 99.5k • 352 • 19