akjindal53244 committed
Commit aff4c18 • 1 Parent(s): f43c596
Update README.md

README.md CHANGED
@@ -8,6 +8,15 @@ datasets:
 - akjindal53244/Arithmo-Data
 ---
 
+## [January 2024] New Model Release: Arithmo2-Mistral-7B
+
+The **Arithmo2-Mistral-7B** model improves on the initially released Arithmo-Mistral-7B model on both the GSM8K and MATH benchmarks. Specifically, there is an **absolute** improvement of:
+- +1.7% on GSM8K
+- +3.0% on GSM8K PoT
+- +1.9% on MATH
+
+<b>Note</b>: <span style="color:red"><b>It is recommended to use the Arithmo2-Mistral-7B model</b></span>. Here is the [merged model](https://huggingface.co/upaya07/Arithmo2-Mistral-7B) and the corresponding [LoRA Adapter](https://huggingface.co/upaya07/Arithmo2-Mistral-7B-adapter).
+
 
 # Model Card for Model ID
 
@@ -164,6 +173,9 @@ Building LLMs takes time and resources; if you find my work interesting, your su
 <a href="https://www.buymeacoffee.com/a_little_learner" target="_blank"><img src="https://cdn.buymeacoffee.com/buttons/v2/default-yellow.png" alt="Buy Me A Coffee" style="height: 60px !important;width: 217px !important;" ></a>
 
 
+### Citation
+Ashvini Jindal, "Arithmo-Mistral-7B", Oct 2023, https://huggingface.co/akjindal53244/Arithmo-Mistral-7B
+
 
 <h2 id="References">References</h2>
 