Releasing the largest multilingual open pretraining dataset Article By Pclanglais • 8 days ago • 94
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, and 135M parameters (a loading sketch follows this list) • 10 items • Updated about 1 hour ago • 172
QTIP Quantized Models Collection See https://github.com/Cornell-RelaxML/qtip • 27 items • Updated 25 days ago • 5
VILA-U-7B Collection VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation • 2 items • Updated about 1 month ago • 5
VPTQ Mistral Large Instruct 2407 without finetune Collection arxiv.org/abs/2409.17066 • 8 items • Updated Oct 18 • 1
Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models Paper • 2410.02416 • Published Oct 3 • 25
Gemma-APS Release Collection Gemma models for text-to-propositions segmentation. The models are distilled from a fine-tuned Gemini Pro model applied to multi-domain synthetic data. • 3 items • Updated Oct 15 • 19
Scalable and Domain-General Abstractive Proposition Segmentation Paper • 2406.19803 • Published Jun 28 • 2
Falcon Mamba: The First Competitive Attention-free 7B Language Model Paper • 2410.05355 • Published Oct 7 • 28
ProLong Collection ProLong is a family of long-context models, continually trained and supervised fine-tuned from Llama-3-8B, with a maximum context window of 512K tokens • 7 items • Updated 30 days ago • 4
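For context on the SmolLM2 entry above: a minimal sketch of loading one of these compact checkpoints with Hugging Face transformers. The repo id "HuggingFaceTB/SmolLM2-360M-Instruct" is an assumption inferred from the collection's naming; substitute the exact id from the collection page.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed repo id, inferred from the SmolLM2 collection naming; verify on the Hub.
model_id = "HuggingFaceTB/SmolLM2-360M-Instruct"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

prompt = "Explain what an on-device language model is in one sentence."
inputs = tokenizer(prompt, return_tensors="pt")

# Greedy-ish short generation; small models like these keep this fast on CPU.
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

On CPU-only hardware, omitting the torch_dtype argument (defaulting to float32) is the safer choice; bfloat16 mainly helps on accelerators with native support.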