Maziyar Panahi PRO

MaziyarPanahi

AI & ML interests

Fine-Tuning, RLHF, Merging, Quantizations, Leaderboards

Recent Activity

New activity about 4 hours ago

open-acc/README

New activity about 5 hours ago

MaziyarPanahi/calme-3.1-instruct-78b

updated a model about 5 hours ago

MaziyarPanahi/calme-3.1-instruct-78b

MaziyarPanahi's activity

upvoted an article about 23 hours ago

Article

Uncensor any LLM with abliteration

Jun 13 • 369

upvoted an article 1 day ago

Article

Halo: Open Source Health Tracking with Wearables

2 days ago • 60

upvoted 2 papers 5 days ago

Thinking LLMs: General Instruction Following with Thought Generation

Paper • 2410.10630 • Published Oct 14 • 16

TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Trees

Paper • 2410.12854 • Published Oct 10 • 1

upvoted a collection 6 days ago

Nov 15 Releases 🍂

15 items • Updated 6 days ago • 6

upvoted an article 7 days ago

Article

Synthetic dataset generation techniques: Self-Instruct

May 15 • 12

upvoted an article 8 days ago

Article

Releasing the largest multilingual open pretraining dataset

8 days ago • 94

upvoted a collection 14 days ago

🇫🇷 Calme-3

Here you can find all the new Calme-3 models • 26 items • Updated 2 days ago • 7

upvoted a paper 15 days ago

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Paper • 2410.02089 • Published Oct 2 • 12

upvoted a paper 21 days ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22 • 126

upvoted a collection 21 days ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 14 days ago • 95

upvoted a collection 27 days ago

C4AI Aya Expanse

Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated 28 days ago • 26

upvoted an article 29 days ago

Article

Deploying Speech-to-Speech on Hugging Face

about 1 month ago • 35

upvoted an article 30 days ago

Article

🧨 Diffusers welcomes Stable Diffusion 3.5 Large

about 1 month ago • 43

upvoted a paper about 1 month ago

AutoTrain: No-code training for state-of-the-art models

Paper • 2410.15735 • Published Oct 21 • 57

upvoted an article about 1 month ago

Article

Scaling AI-based Data Processing with Hugging Face + Dask

Oct 9 • 23

upvoted 2 articles about 2 months ago

Article

Introducing the Open FinLLM Leaderboard

Oct 4 • 64

Article

VLM Art Analysis

Oct 4 • 11

upvoted a paper about 2 months ago

MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages

Paper • 2410.01036 • Published Oct 1 • 14

upvoted an article about 2 months ago

Article

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

Sep 27 • 35