Update config.json
#3
by Bakanayatsu - opened
No description provided.
Is there a reason it should be "mistral"? I tried to stick to the SOLAR models, which use "LlamaForCausalLM".
I'm still learning all these small things :3
It's for tokenizing and related things: since SOLAR is initialized from Mistral, it never hurts to be correct, as a mismatch might cause a bug later on (token trimming, counting, or other things).
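To illustrate why this field matters, here's a minimal sketch assuming the PR sets `model_type` to "mistral"; the repo id below is a placeholder, not taken from this thread:

```python
from transformers import AutoConfig, AutoTokenizer

# Placeholder repo id for illustration only (not the actual repo in this PR).
repo_id = "some-org/solar-based-model"

# model_type in config.json tells the Auto* classes which config/model/tokenizer
# mappings to use; SOLAR is initialized from Mistral, so "mistral" is the closer match.
config = AutoConfig.from_pretrained(repo_id)
print(config.model_type)      # assumed: "mistral" after this PR
print(config.architectures)   # stays ["LlamaForCausalLM"]

# When tokenizer_config.json doesn't pin a tokenizer_class, AutoTokenizer falls back
# to the model_type mapping, which is where a wrong value can surface later as
# subtle tokenization bugs (token counting, trimming, etc.).
tokenizer = AutoTokenizer.from_pretrained(repo_id)
print(type(tokenizer).__name__)
```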
I guess being based on Mistral explains the higher benchmarks compared to Llama 2 13B.
saishf changed pull request status to merged