Miakat's picture

1 4 71

Miakat

darthfalka

·

AI & ML interests

None yet

Recent Activity

liked a model about 1 hour ago

distilbert/distilroberta-base

upvoted a paper 3 days ago

liked a model 4 days ago

edbeeching/decision-transformer-gym-walker2d-expert

Organizations

None yet

darthfalka's activity

upvoted a paper 3 days ago

The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

Paper • 2403.17031 • Published Mar 24 • 3

upvoted an article 3 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

• 99

upvoted an article 7 months ago

Article

The Technology Behind BLOOM Training

Jul 14, 2022

• 17