AI & ML interests

OCR, Unicode, character-awareness, vocab-less tokenizers, typography, calligraphy, orthography, unbiased multilingualism, text embeddings, teaching Stable Diffusion how to spell, layout & document understanding, symbols & graphemes in the wild

Datasets

None public yet