Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
DrishtiSharma
/
dolphin-2.1-mistral-7b-dpo-ultrafeedback-binarized-preferences-sigmoid
like
0
PEFT
TensorBoard
Safetensors
trl
dpo
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Use this model
cecd7df
dolphin-2.1-mistral-7b-dpo-ultrafeedback-binarized-preferences-sigmoid
Commit History
Training in progress, step 500
cecd7df
verified
DrishtiSharma
commited on
Feb 22
Training in progress, step 500
284415a
verified
DrishtiSharma
commited on
Feb 22
initial commit
8d1be37
verified
DrishtiSharma
commited on
Feb 22