Shubham Toshniwal's picture

5 8 12

Shubham Toshniwal

stoshniwal

·

https://shtoshni.github.io/

shtoshni

AI & ML interests

NLP, LLM

Organizations

stoshniwal's activity

upvoted an article 29 days ago

Article

Fixing Gradient Accumulation

Oct 16

• 40

upvoted 2 collections about 1 month ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Oct 15 • 135

OpenMath-2

A collection of models and datasets introduced in "OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data" • 7 items • Updated 18 days ago • 13

upvoted 3 papers about 1 month ago

OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data

Paper • 2410.01560 • Published Oct 2 • 3

Training Language Models on Synthetic Edit Sequences Improves Code Synthesis

Paper • 2410.02749 • Published Oct 3 • 12

HelpSteer2-Preference: Complementing Ratings with Preferences

Paper • 2410.01257 • Published Oct 2 • 19

upvoted a paper 9 months ago

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Paper • 2402.10176 • Published Feb 15 • 35

upvoted a collection 9 months ago

OpenMath

A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated Oct 1 • 37