Edit model card

This model is Llemma-7b model used in the paper "An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models". It's based on Llemma-7b and was further finetuned MetaMath with special format for reward. Each step starts with "Step" and ends with "\u043a\u0438".

Safetensors

Model size

6.74B params

Tensor type

BF16

Inference API

Unable to determine this model's library. Check the docs .