Cosmos Tokenizer Collection A suite of image and video tokenizers • 10 items • Updated 15 days ago • 18
Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S* ⚡ By xhluca • Jul 9 • 39
AMD-OLMo Collection AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinct™ MI250 GPUs based on OLMo. • 4 items • Updated 21 days ago • 16
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated about 5 hours ago • 172
C4AI Aya Expanse Collection Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated 28 days ago • 26
Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28 • 158
Awesome Document AI Collection A collection of open-source document AI • 27 items • Updated Mar 11 • 74
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding Paper • 2407.12594 • Published Jul 17 • 19
💻 Local SmolLMs Collection SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated Aug 20 • 46
MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts Paper • 2407.21770 • Published Jul 31 • 22
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Paper • 2408.11039 • Published Aug 20 • 56
Llama-3.1 Quantization Collection Neural Magic quantized Llama-3.1 models • 21 items • Updated 8 days ago • 39
Article ColPali: Efficient Document Retrieval with Vision Language Models By manu • Jul 5 • 161
Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 • 63
Article From cloud to developers: Hugging Face and Microsoft Deepen Collaboration May 21 • 8