What's the difference between this and the DPO version?

#1
by QuantumState745837 - opened

What's the difference between this and the DPO version? Is DPO better? worse? What is DPO? Is one faster than the other?

Additional DPO training supposedly removes some of the alignment to uncensor. This model is just the miqu-70B model fine-tuned on various datasets.

Sign up or log in to comment