Vegard Wærp's picture

1 11 4

Vegard Wærp

vegardw

·

AI & ML interests

None yet

Organizations

None yet

vegardw's activity

upvoted an article 7 months ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15

• 165

upvoted a paper 7 months ago

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Paper • 2404.08801 • Published Apr 12 • 63

upvoted 4 papers 8 months ago

LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

Paper • 2403.17919 • Published Mar 26 • 16

Simple and Scalable Strategies to Continually Pre-train Large Language Models

Paper • 2403.08763 • Published Mar 13 • 48

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 182

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 602

upvoted 3 papers 9 months ago

FuseChat: Knowledge Fusion of Chat Models

Paper • 2402.16107 • Published Feb 25 • 36

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15 • 99

How to Train Data-Efficient LLMs

Paper • 2402.09668 • Published Feb 15 • 38

upvoted a paper 10 months ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18 • 143