Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. โข 39 items โข Updated Sep 18 โข 346
view article Article Image Similarity with Hugging Face Datasets and Transformers Jan 16, 2023 โข 16
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 โข 15 items โข Updated 12 days ago โข 440
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper โข 2406.17557 โข Published Jun 25 โข 86
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases โข 5 items โข Updated Sep 25 โข 680
Datasets for Pretrained Thai LLM Collection List Datasets for pretrained Thai LLM by PyThaiNLP โข 23 items โข Updated Sep 12 โข 9
BiPhone: Modeling Inter Language Phonetic Influences in Text Paper โข 2307.03322 โข Published Jul 6, 2023 โข 7