RLHF - an NVIDIA Collection

RLHF

updated Oct 1

A collection of models trained with Reinforcement Learning from Human Feedback (RLHF).