superpeng (peng)

upvoted a collection 14 days ago

Skywork-Reward-Data-Collection

Collection

Open-source preference datasets used to train the Skywork reward model series • 17 items • Updated Oct 12 • 8

upvoted 2 papers 3 months ago

HelpSteer2: Open-source dataset for training top-performing reward models

Paper • 2406.08673 • Published Jun 12 • 16

Xwin-LM: Strong and Scalable Alignment Practice for LLMs

Paper • 2405.20335 • Published May 30 • 17

upvoted a collection 4 months ago

Biomedical NLP papers

Collection

Papers posted on @[email protected] (Clinical, Healthcare & Biomedical NLP) • 166 items • Updated about 17 hours ago • 34

upvoted 2 papers 4 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 155

Inference Performance Optimization for Large Language Models on CPUs

Paper • 2407.07304 • Published Jul 10 • 52

upvoted a collection 5 months ago

Tulu 2 Llama 3 Update

Collection

Llama 3 models trained on the tulu dataset, following https://arxiv.org/abs/2311.10702 (tulu 2) and https://arxiv.org/abs/2406.09279 (tulu 2.5). • 12 items • Updated Aug 15 • 2

upvoted 2 papers 6 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15 • 87

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13 • 67

upvoted a paper 7 months ago

Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts

Paper • 2309.07430 • Published Sep 14, 2023 • 27

upvoted 2 articles 7 months ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19

• 114

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Mar 20

• 66

upvoted 2 papers 8 months ago

MathScale: Scaling Instruction Tuning for Mathematical Reasoning

Paper • 2403.02884 • Published Mar 5 • 15

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 182

upvoted a paper 9 months ago

StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

Paper • 2402.16671 • Published Feb 26 • 26

upvoted a collection 9 months ago

Gemma release

Collection

Groups the Gemma models released by the Google team. • 40 items • Updated Jul 31 • 325

upvoted 3 papers 9 months ago

LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models

Paper • 2402.10524 • Published Feb 16 • 21

Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 77

How to Train Data-Efficient LLMs

Paper • 2402.09668 • Published Feb 15 • 38

upvoted a collection 9 months ago

Sora参考论文

Collection

OpenAI "Video generation models as world simulators"技术报告后面的参考论文，总共32篇。OpenAI的ImageGPT和Dalle3这两篇缺失，链接已补充到note中。 • 32 items • Updated Feb 18 • 54

peng

AI & ML interests

Organizations

superpeng's activity

Skywork-Reward-Data-Collection

HelpSteer2: Open-source dataset for training top-performing reward models

Xwin-LM: Strong and Scalable Alignment Practice for LLMs

Biomedical NLP papers

Qwen2 Technical Report

Inference Performance Optimization for Large Language Models on CPUs

Tulu 2 Llama 3 Update

LoRA Learns Less and Forgets Less

RLHF Workflow: From Reward Modeling to Online RLHF

Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

MathScale: Scaling Instruction Tuning for Mathematical Reasoning

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

Gemma release

LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models

Adapting Large Language Models via Reading Comprehension

How to Train Data-Efficient LLMs

Sora参考论文