Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts Paper • 2411.10669 • Published 6 days ago • 9
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published 6 days ago • 87
Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities Paper • 2410.11190 • Published Oct 15 • 20
Article How to build a custom text classifier without days of human labeling By sdiazlor • Oct 17 • 55
Llama-3.1-Nemotron-70B Collection SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Oct 15 • 139
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models Paper • 2410.07985 • Published Oct 10 • 26
🍓 Ichigo v0.3 Collection The experimental family designed to train LLMs to understand sound natively. • 6 items • Updated 11 days ago • 17
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published Oct 1 • 144
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 models. • 15 items • Updated 28 days ago • 483
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Paper • 2409.12183 • Published Sep 18 • 36
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 372
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 218