Cosmos Tokenizer Collection A suite of image and video tokenizers β’ 10 items β’ Updated 3 days ago β’ 11
view article Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*β‘ By xhluca β’ Jul 9 β’ 36
AMD-OLMo Collection AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinctβ’ MI250 GPUs based on OLMo. β’ 4 items β’ Updated 9 days ago β’ 16
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M β’ 8 items β’ Updated 6 days ago β’ 160
C4AI Aya Expanse Collection Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. β’ 3 items β’ Updated 16 days ago β’ 25
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28 β’ 156
Awesome Document AI Collection A collection of open-source document AI π π π β’ 27 items β’ Updated Mar 11 β’ 73
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding Paper β’ 2407.12594 β’ Published Jul 17 β’ 19
π» Local SmolLMs Collection SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos β’ 14 items β’ Updated Aug 20 β’ 44
MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts Paper β’ 2407.21770 β’ Published Jul 31 β’ 22
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Paper β’ 2408.11039 β’ Published Aug 20 β’ 56
Llama-3.1 Quantization Collection Neural Magic quantized Llama-3.1 models β’ 21 items β’ Updated Sep 26 β’ 38
view article Article ColPali: Efficient Document Retrieval with Vision Language Models π By manu β’ Jul 5 β’ 153
view article Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 β’ 61
view article Article From cloud to developers: Hugging Face and Microsoft Deepen Collaboration May 21 β’ 8
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma β’ 16 items β’ Updated Jul 31 β’ 137