static quants of https://huggingface.co/xinlai/DeepSeekMath-RL-Step-DPO