Yushi Bai's picture

Yushi Bai

bys0318

·

https://bys0318.github.io/

bys0318

AI & ML interests

None yet

Recent Activity

updated a model 1 day ago

upvoted a paper 24 days ago

liked a dataset 29 days ago

THU-KEG/RM-Bench

Organizations

bys0318's activity

upvoted a paper 24 days ago

LongReward: Improving Long-context Large Language Models with AI Feedback

Paper • 2410.21252 • Published 24 days ago • 16

upvoted a paper about 1 month ago

Pre-training Distillation for Large Language Models: A Design Space Exploration

Paper • 2410.16215 • Published Oct 21 • 15

upvoted 3 papers 3 months ago

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

Paper • 2409.02897 • Published Sep 4 • 44

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13 • 65

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Paper • 2408.06072 • Published Aug 12 • 35

upvoted 4 papers 5 months ago

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Paper • 2407.04620 • Published Jul 5 • 27

Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks

Paper • 2407.02855 • Published Jul 3 • 10

Simulating Classroom Education with LLM-Empowered Agents

Paper • 2406.19226 • Published Jun 27 • 29

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Paper • 2406.12793 • Published Jun 18 • 31

upvoted a collection 6 months ago

GLM-4

GLM-4 Open Models • 13 items • Updated 28 days ago • 111

upvoted a paper 9 months ago

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion

Paper • 2403.05121 • Published Mar 8 • 22

upvoted 2 papers 10 months ago

CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations

Paper • 2402.04236 • Published Feb 6 • 7

LongAlign: A Recipe for Long Context Alignment of Large Language Models

Paper • 2401.18058 • Published Jan 31 • 21

upvoted a paper about 1 year ago

DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics

Paper • 2310.13268 • Published Oct 20, 2023 • 17