Information on the model
Hey!
I am interested in this model.
Which dataset was it trained on?
Was it just SFT, or also some form of alignment?
Hi @anakin87, I follow you everywhere, what an honor!!! This is a failed experiment: the training was done before the Gemma fix landed in HF Transformers (and in many other fine-tuning libraries), so the Gemma integration was broken and the model does not work as expected. In any case, the dataset used was https://huggingface.co/datasets/mii-community/ultrafeedback-translated-ita, and there was no DPO or other alignment, just SFT.
I plan to redo the experiment in the coming weeks.
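For anyone wanting to try something similar once the fix is in, here is a minimal sketch of an SFT run on that dataset using TRL's SFTTrainer. The Gemma checkpoint, the dataset column names, and the hyperparameters are assumptions for illustration, not the author's actual training setup:

```python
# Minimal SFT sketch -- not the author's actual script. Checkpoint name,
# column names, and hyperparameters are illustrative assumptions.
from datasets import load_dataset
from transformers import AutoModelForCausalLM
from trl import SFTConfig, SFTTrainer

# Dataset linked in the thread (Italian translation of UltraFeedback)
dataset = load_dataset("mii-community/ultrafeedback-translated-ita", split="train")

# Assumption: string "prompt" and "chosen" columns; adapt to the real schema
def to_text(example):
    return {"text": example["prompt"] + "\n" + example["chosen"]}

dataset = dataset.map(to_text)

# Assumption: the exact Gemma checkpoint is not stated in the thread
model = AutoModelForCausalLM.from_pretrained("google/gemma-2b")

trainer = SFTTrainer(
    model=model,
    train_dataset=dataset,
    args=SFTConfig(output_dir="gemma-2b-sft-ita", dataset_text_field="text"),
)
trainer.train()
```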
Thank you!!!
Can you tell me what the problem with Gemma was?
Yes, it is explained very well here: https://unsloth.ai/blog/gemma-bugs, and partially addressed here: https://github.com/huggingface/transformers/pull/29402. I ran the SFT before the fix, and the result was a mess.