Ahmad's picture

Ahmad

AhmadHakami

·

AI & ML interests

None yet

Recent Activity

liked a model about 1 hour ago

jinaai/jina-clip-v2

liked a model about 1 hour ago

mistralai/Mistral-Large-Instruct-2411

liked a model about 1 hour ago

jinaai/xlm-roberta-flash-implementation

Organizations

AhmadHakami's activity

upvoted a collection 9 days ago

jina-embeddings-v3

Multilingual multi-task general text embedding model • 6 items • Updated Sep 19 • 18

upvoted 2 collections 18 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated about 7 hours ago • 172

AMD-OLMo

AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinct™ MI250 GPUs based on OLMo. • 4 items • Updated 21 days ago • 16

upvoted an article 25 days ago

Article

Visually Multilingual: Introducing mcdse-2b

By

•

25 days ago

• 37

upvoted a collection 27 days ago

Stable Diffusion 3.5

6 items • Updated 23 days ago • 98

upvoted a collection 28 days ago

C4AI Aya Expanse

Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated 28 days ago • 26

upvoted a paper 30 days ago

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Paper • 2410.16153 • Published Oct 21 • 42

upvoted an article about 1 month ago

Article

Open-source LLMs as LangChain Agents

Jan 24

• 36

upvoted a collection about 1 month ago

Model2Vec base models

These are the Minishlab Model2Vec base models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers • 7 items • Updated 23 days ago • 8

upvoted an article about 2 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25

• 169

upvoted a paper about 2 months ago

Robust Speech Recognition via Large-Scale Weak Supervision

Paper • 2212.04356 • Published Dec 6, 2022 • 23

upvoted 3 collections about 2 months ago

NVLM 1.0

A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. • 1 item • Updated Oct 1 • 48

Parakeet

NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 8 items • Updated Oct 1 • 20

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 28 days ago • 482

upvoted 3 collections 2 months ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated 3 days ago • 224

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 218

Core ML Segment Anything 2

8 items • Updated Oct 4 • 26

upvoted 3 papers 2 months ago

PiTe: Pixel-Temporal Alignment for Large Video-Language Model

Paper • 2409.07239 • Published Sep 11 • 11

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

Paper • 2408.07199 • Published Aug 13 • 20

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3 • 82