zephyr-7b-dpo-lora-pairrm / trainer_state.json

Commit History

Model save
125e648
verified

shenxq committed on