efederici (Edoardo Federici)

upvoted an article 20 days ago

Article

Visually Multilingual: Introducing mcdse-2b

By

•

20 days ago

• 37

upvoted a paper 22 days ago

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Paper • 2404.05719 • Published Apr 8 • 80

upvoted 3 papers about 1 month ago

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

Paper • 2402.14740 • Published Feb 22 • 10

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1 • 143

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2 • 46

upvoted a paper about 2 months ago

Improve Mathematical Reasoning in Language Models by Automated Process Supervision

Paper • 2406.06592 • Published Jun 5 • 25

upvoted a paper 2 months ago

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Paper • 2409.05840 • Published Sep 9 • 45

upvoted an article 2 months ago

Article

Selective fine-tuning of Language Models with Spectrum

By

•

Sep 3

• 29

upvoted a paper 3 months ago

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Paper • 2408.12528 • Published Aug 22 • 50

upvoted a collection 3 months ago

Probably function calling datasets

Collection

Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17 • 36

upvoted 2 papers 4 months ago

Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion

Paper • 2407.01392 • Published Jul 1 • 39

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22 • 126

upvoted a paper 5 months ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10 • 65

upvoted 4 papers 6 months ago

Matryoshka Multimodal Models

Paper • 2405.17430 • Published May 27 • 31

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27 • 85

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23 • 37

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23 • 35

upvoted 3 papers 7 months ago

Edoardo Federici

AI & ML interests

Organizations

efederici's activity