Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
nroggendorff 
posted an update Jul 14

very good !
maybe a colab !
could this be used to extend a tokenizer model with training ?
as i would like to update my mistral tokenizer to include forign chars, such as hebrew and amaric, and hindi

·

Im pretty sure you can add additional tokens and special tokens, so I suppose