SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 8 items • Updated 5 days ago • 160
The Geometry of Concepts: Sparse Autoencoder Feature Structure Paper • 2410.19750 • Published about 1 month ago • 1
Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders Paper • 2410.20526 • Published 13 days ago • 1
Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics Paper • 2410.21272 • Published 12 days ago • 1
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering Paper • 2410.15999 • Published 19 days ago • 17
Automatically Interpreting Millions of Features in Large Language Models Paper • 2410.13928 • Published 23 days ago • 1
How Do Multilingual Models Remember? Investigating Multilingual Factual Recall Mechanisms Paper • 2410.14387 • Published 22 days ago • 1
Towards Interpreting Visual Information Processing in Vision-Language Models Paper • 2410.07149 • Published Oct 9 • 1
Geometric Signatures of Compositionality Across a Language Model's Lifetime Paper • 2410.01444 • Published Oct 2 • 1
ITA-Bench: Italian Benchmarks for LLMs Collection A collection of Italian benchmarks for Large Language Models. See also our GitHub repo: https://github.com/SapienzaNLP/ita-bench • 19 items • Updated Sep 23 • 6
A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders Paper • 2409.14507 • Published Sep 22 • 1
Amuro & Char: Analyzing the Relationship between Pre-Training and Fine-Tuning of Large Language Models Paper • 2408.06663 • Published Aug 13 • 15
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2 Paper • 2408.05147 • Published Aug 9 • 37
Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP Paper • 2408.04303 • Published Aug 8 • 9
Transformer Explainer: Interactive Learning of Text-Generative Models Paper • 2408.04619 • Published Aug 8 • 154
The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability Paper • 2408.01416 • Published Aug 2 • 1