mmhamdy (Mohammed Hamdy)

upvoted a collection 9 days ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 8 items • Updated 5 days ago • 160

upvoted a collection about 2 months ago

LLM Reasoning Papers

Collection

Papers to improve reasoning capabilities of LLMs • 15 items • Updated 7 days ago • 73

upvoted a collection 2 months ago

"Physics of Language Models" series

Collection

6 items • Updated Aug 30 • 36

upvoted a paper 2 months ago

inftyBench: Extending Long Context Evaluation Beyond 100K Tokens

Paper • 2402.13718 • Published Feb 21 • 1

upvoted a paper 3 months ago

JPEG-LM: LLMs as Image Generators with Canonical Codec Representations

Paper • 2408.08459 • Published Aug 15 • 44

upvoted an article 3 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

Feb 23

• 54

upvoted 3 papers 4 months ago

Data Authenticity, Consent, & Provenance for AI are all broken: what will it take to fix them?

Paper • 2404.12691 • Published Apr 19 • 1

The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI

Paper • 2310.16787 • Published Oct 25, 2023 • 5

Consent in Crisis: The Rapid Decline of the AI Data Commons

Paper • 2407.14933 • Published Jul 20 • 11

upvoted a collection 5 months ago

MatMulfree LM

Collection

Pre-trined models for Matmulfree LM. • 4 items • Updated Jun 10 • 25

upvoted an article 6 months ago

Article

Synthetic dataset generation techniques: Self-Instruct

By

•

May 15

• 11

upvoted a paper 6 months ago

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published Apr 29 • 68

upvoted 2 collections 7 months ago

Bio Series

Collection

Embeddings and NLG related to biology / amino acid sequences • 10 items • Updated Sep 19 • 1

Function Calling Datasets & Models

Collection

3 items • Updated May 24 • 1

upvoted an article 7 months ago

Article

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

By

•

Apr 24

• 58

upvoted 3 papers 7 months ago

Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning

Paper • 2402.03046 • Published Feb 5 • 6

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Paper • 2402.09844 • Published Feb 15 • 20

BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations

Paper • 2310.07276 • Published Oct 11, 2023 • 5

upvoted 2 articles 7 months ago

Article

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Apr 22

• 78

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19

• 114

Mohammed Hamdy

AI & ML interests

Organizations

mmhamdy's activity

SmolLM2

LLM Reasoning Papers

"Physics of Language Models" series

inftyBench: Extending Long Context Evaluation Beyond 100K Tokens

JPEG-LM: LLMs as Image Generators with Canonical Codec Representations

🪆 Introduction to Matryoshka Embedding Models

Data Authenticity, Consent, & Provenance for AI are all broken: what will it take to fix them?

The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI

Consent in Crisis: The Rapid Decline of the AI Data Commons

MatMulfree LM

Synthetic dataset generation techniques: Self-Instruct

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Bio Series

Function Calling Datasets & Models

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare