Paper: Investigating Regularization of Self-Play Language Models • arXiv:2404.04291 • Published Apr 4, 2024
Paper: Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models • arXiv:2401.01335 • Published Jan 2, 2024
Collection: Awesome SFT datasets • A curated list of interesting datasets for fine-tuning language models • 43 items • Updated Apr 12
Collection: Comparing DPO with IPO and KTO • Chat models for exploring the differences between three alignment techniques: DPO, IPO, and KTO • 56 items • Updated Jan 9
Paper: A General Theoretical Paradigm to Understand Learning from Human Preferences • arXiv:2310.12036 • Published Oct 18, 2023
Paper: LeanDojo: Theorem Proving with Retrieval-Augmented Language Models • arXiv:2306.15626 • Published Jun 27, 2023
Paper: Textbooks Are All You Need II: phi-1.5 Technical Report • arXiv:2309.05463 • Published Sep 11, 2023
Collection: read papers • Papers I've read in the past few months • 10 items • Updated Nov 21, 2023
Paper: The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only • arXiv:2306.01116 • Published Jun 1, 2023
Paper: Direct Preference Optimization: Your Language Model is Secretly a Reward Model • arXiv:2305.18290 • Published May 29, 2023