Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
sfulay
/
zephyr-7b-dpo-full-gpt_consistent-high-curriculum
like
0
Safetensors
mistral
trl
dpo
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
main
zephyr-7b-dpo-full-gpt_consistent-high-curriculum
/
model-00003-of-00003.safetensors
Commit History
Training in progress, step 436
e6a9deb
verified
sfulay
commited on
Aug 28
Training in progress, step 400
246435e
verified
sfulay
commited on
Aug 28
Training in progress, step 300
558c702
verified
sfulay
commited on
Aug 28
Training in progress, step 200
ad0dbf7
verified
sfulay
commited on
Aug 28
Training in progress, step 100
8fe9367
verified
sfulay
commited on
Aug 28