Meta Llama org
No description provided.

Hi there! I noticed that the 8 kv heads PR was merged in for the other 405b checkpoints, is there an ETA on landing this one? Thanks for the help!

Meta Llama org

Merging now!

ArthurZ changed pull request status to open

Does num_key_value_heads in config.json need to be updated as well after this PR is merged?

TYSM @ArthurZ !! just a heads up tho that this is probably an upload issue, but it appears that model parts { 002, [ 107 - 109 ] } were missed from the list / diff above update: those 4 files are not affected by the 16 -> 8 kv head change.

Looks like this is not yet merged?

Meta Llama org

Let me update the value in the config to merge!

Meta Llama org

(I don't have rights yet 😿)

Looking forward to trying!

osanseviero changed pull request status to merged

Sign up or log in to comment