About the tokenizer - Why use the LLaMA tokenizer?

by shuyuej - opened Jul 17

I noticed that the model is based on Mistral, but its tokenizer is based on LLaMA.
This confuses me because the special token IDs of the two are different.
Could you please explain the reason for this choice?

https://huggingface.co/Salesforce/SFR-Embedding-2_R/blob/main/tokenizer_config.json#L42
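For reference, here is a minimal sketch of how the special token IDs of two tokenizers could be compared. The helper names (`special_token_ids`, `diff_special_tokens`, `load_and_compare`) are my own and hypothetical, and the base repo `mistralai/Mistral-7B-v0.1` in the usage comment is only my assumption about which Mistral checkpoint to compare against. The loader needs the `transformers` package and network access.

```python
def special_token_ids(tokenizer):
    """Collect the common special token IDs from a tokenizer-like object."""
    names = ("bos_token_id", "eos_token_id", "unk_token_id", "pad_token_id")
    # Missing attributes are reported as None rather than raising.
    return {name: getattr(tokenizer, name, None) for name in names}


def diff_special_tokens(a, b):
    """Return {name: (id_in_a, id_in_b)} for every entry that differs."""
    return {
        key: (a.get(key), b.get(key))
        for key in sorted(set(a) | set(b))
        if a.get(key) != b.get(key)
    }


def load_and_compare(repo_a, repo_b):
    """Download both tokenizers and report the differing special token IDs."""
    # Imported here so the helpers above work without transformers installed.
    from transformers import AutoTokenizer

    tok_a = AutoTokenizer.from_pretrained(repo_a)
    tok_b = AutoTokenizer.from_pretrained(repo_b)
    return diff_special_tokens(special_token_ids(tok_a), special_token_ids(tok_b))


# Hypothetical usage (second repo name is an assumption, not confirmed here):
# print(load_and_compare("Salesforce/SFR-Embedding-2_R", "mistralai/Mistral-7B-v0.1"))
```

An empty result would mean the four IDs checked here agree; any entry in the result shows the two IDs side by side.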

Thank you very much in advance!