geez-asr / vocab.json
Samuael's picture
Upload tokenizer
e9caaab verified
raw
history blame
502 Bytes
{
"[PAD]": 0,
"[UNK]": 1,
"|": 2,
"ል": 37,
"αˆ•": 30,
"ም": 13,
"ር": 8,
"ሡ": 17,
"ሽ": 3,
"α‰…": 26,
"ቕ": 9,
"α‰₯": 29,
"α‰­": 35,
"ቡ": 34,
"ች": 20,
"αŠ•": 36,
"ኝ": 27,
"ኑ": 21,
"ኒ": 31,
"ኣ": 14,
"ኀ": 32,
"αŠ₯": 22,
"ኦ": 12,
"ኧ": 15,
"ክ": 28,
"ኽ": 6,
"ው": 25,
"ዝ": 7,
"α‹₯": 4,
"α‹­": 23,
"α‹΅": 5,
"αŒ…": 18,
"ግ": 38,
"αŒ₯": 10,
"ጭ": 33,
"ጡ": 11,
"ጽ": 16,
"ፍ": 19,
"ፕ": 24
}