Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
vincentmin
/
llama-2-13b-reward-oasst1
like
0
Text Classification
PEFT
TensorBoard
tasksource/oasst1_pairwise_rlhf_reward
Generated from Trainer
trl
Model card
Files
Files and versions
Metrics
Training metrics
Community
1
Use this model
main
llama-2-13b-reward-oasst1
/
README.md
Commit History
Update README.md
5e99797
vincentmin
commited on
Aug 3, 2023
Update README.md
9d26c10
vincentmin
commited on
Jul 27, 2023
Update README.md
4f94bf4
vincentmin
commited on
Jul 27, 2023
Update README.md
37a71b7
vincentmin
commited on
Jul 27, 2023
update model card README.md
f0d55ff
vincentmin
commited on
Jul 27, 2023
End of training
e2fc4dc
vincentmin
commited on
Jul 27, 2023
update model card README.md
d5ec288
vincentmin
commited on
Jul 27, 2023
End of training
5d0d1e5
vincentmin
commited on
Jul 27, 2023