kang's picture

8 40

kang

qiyue

·

AI & ML interests

None yet

Recent Activity

upvoted an article 24 days ago

Organizations

None yet

qiyue's activity

upvoted an article 24 days ago

Article

Hugging Face welcomes the Aya Expanse family of multilingual models

By

•

28 days ago

• 10

upvoted a paper 2 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

upvoted an article 3 months ago

Article

Improving Hugging Face Training Efficiency Through Packing with Flash Attention

Aug 21

• 22

upvoted a paper 4 months ago

Understanding Reference Policies in Direct Preference Optimization

Paper • 2407.13709 • Published Jul 18 • 16

upvoted 2 articles 4 months ago

Article

RegMix: Data Mixture as Regression for Language Model Pre-training

By

•

Jul 11

• 10

Article

The Rise of Agentic Data Generation

By

•

Jul 15

• 78

upvoted an article 5 months ago

Article

Putting RL back in RLHF

Jun 12

• 62

upvoted a collection 11 months ago

Paloma

Dataset and baseline models for Paloma, a benchmark of language model fit to 546 textual domains • 8 items • Updated 7 days ago • 13