neo-9981 (Aadesh)

upvoted a collection about 1 month ago

Llama-3.1-Nemotron-70B

Collection

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Oct 15 • 136

upvoted an article about 1 month ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Oct 14

• 55

upvoted an article about 2 months ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Jul 5

• 158

upvoted a collection about 2 months ago

Moshi v0.1 Release

Collection

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 217

upvoted a paper 3 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8 • 155

upvoted an article 4 months ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 91

upvoted a paper 5 months ago

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 84

upvoted a collection 5 months ago

Florence

Collection

9 items • Updated Jul 11 • 160

upvoted 2 articles 6 months ago

Article

Let's talk about LLM evaluation

May 23

• 134

Article

License to Call: Introducing Transformers Agents 2.0

May 13

• 116

upvoted an article 7 months ago

Article

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

May 1

• 67

upvoted a paper 7 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 118

upvoted an article 7 months ago

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18

• 278

upvoted a paper 8 months ago

Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding

Paper • 2401.04398 • Published Jan 9 • 21

upvoted a paper 10 months ago

ChatQA: Building GPT-4 Level Conversational QA Models

Paper • 2401.10225 • Published Jan 18 • 34
