Model2Vec: Distill a Small Fast Model from any Sentence Transformer Article • By Pringled • 30 days ago • 54
Simple linear attention language models balance the recall-throughput tradeoff Paper • 2402.18668 • Published Feb 28 • 18
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 8 items • Updated Jul 31 • 34
Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization Paper • 2311.06243 • Published Nov 10, 2023 • 17
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling Paper • 2311.00430 • Published Nov 1, 2023 • 57
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining Paper • 2308.05734 • Published Aug 10, 2023 • 36
LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models Paper • 2308.16137 • Published Aug 30, 2023 • 39
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT) Paper • 2309.08968 • Published Sep 16, 2023 • 22
Contrastive Decoding Improves Reasoning in Large Language Models Paper • 2309.09117 • Published Sep 17, 2023 • 37