codemurt's picture

6 32

codemurt

codemurt

·

codemurt

AI & ML interests

NLP in uralic languages

Recent Activity

liked a model 18 days ago

tartuNLP/smugri3_14-finno-ugric-nmt

updated a dataset 22 days ago

udmurtNLP/udmurt-russian-english-labse

updated a dataset 22 days ago

udmurtNLP/flores-250-rus-udm

Organizations

codemurt's activity

upvoted a collection 8 months ago

Zerpal

The largest open-source Udmurt monolingual corpora and pre-trained language models • 14 items • Updated Jun 14 • 1

upvoted a paper 12 months ago

FinGPT: Large Generative Models for a Small Language

Paper • 2311.05640 • Published Nov 3, 2023 • 27

upvoted 4 papers about 1 year ago

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57

Scaling Speech Technology to 1,000+ Languages

Paper • 2305.13516 • Published May 22, 2023 • 10

SberQuAD -- Russian Reading Comprehension Dataset: Description and Analysis

Paper • 1912.09723 • Published Dec 20, 2019 • 2

MADLAD-400: A Multilingual And Document-Level Large Audited Dataset

Paper • 2309.04662 • Published Sep 9, 2023 • 22