euclaise
/

crow-1b-attempt1

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

euclaise commited on Jan 10

Commit

f499a00

•

1 Parent(s): dbbcb88

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -7,6 +7,8 @@ datasets:
 Expirements in large-scale small-scale preference learning.
 falcon-rw-1b trained with PRO (preference ranking optimization, see https://arxiv.org/abs/2306.17492) on SuperMC and PRM800K (only stage 1) for 3 epochs, using my supertrainer2000 framework.
 This is an expiremental model.

 Expirements in large-scale small-scale preference learning.
+**This one was a failure, it benchmarks horribly, despite responding okay to trivia questions in testing**
 falcon-rw-1b trained with PRO (preference ranking optimization, see https://arxiv.org/abs/2306.17492) on SuperMC and PRM800K (only stage 1) for 3 epochs, using my supertrainer2000 framework.
 This is an expiremental model.