tokenizer_config.json is different from gemma-2-2b-it

by dahara1 - opened 17 days ago

17 days ago

Hello! Thank you for the great model.

By the way, regarding the title, I'm particularly concerned that the following doesn't exist. Is this okay?

"additional_special_tokens": [
"<start_of_turn>",
"<end_of_turn>"
],

ArthurZ

Google org 17 days ago

Don't worry, info about special tokens is stored at the AddedToken level itself, can be ignored!

dahara1

11 days ago

Thank you for checking, it worked.
Thanks to you, I was able to release a finetuned model for translation tasks. The quality varies greatly depending on the writing style, but I feel that the performance is close to that of the 7B model from a year ago.

https://huggingface.co/webbigdata/gemma-2-2b-jpn-it-translate

dahara1 changed discussion status to closed 5 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment