Too smart?
#4, opened by Daemontatox
I feel like it's smarter and better than base and Hermes, but it takes far more optimization and tinkering to get it to learn new data.
It tends to deviate from instructions and to generalize from the data it was trained on.
Hi, could you be more specific? Perhaps share a reproducible code piece?
After more testing, I am 90% sure the issue is with my fine-tuned model. I am using a custom fine-tune with 8-bit quantization.
Daemontatox changed discussion status to closed