keminglu commited on
Commit
b7f9c4c
1 Parent(s): 23a099e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +34 -3
README.md CHANGED
@@ -1,3 +1,34 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+
5
+ see our paper in: [https://arxiv.org/abs/2310.05506](https://arxiv.org/abs/2310.05506)
6
+
7
+ ## Model Details
8
+
9
+ MuggleMATH is fully fine-tuned on the AugGSM8K and AugMATH datasets and based on the LLaMA-2 Models.
10
+
11
+ ## **Model Usage**
12
+
13
+ prompting template:
14
+ '''
15
+ "Below is an instruction that describes a task. " "Write a response that appropriately completes the request.\n\n" "### Instruction:\n{instruction}\n\n### Response:"
16
+ '''
17
+ We recommend using vllm to accelerate inference.
18
+
19
+ ## Experiment
20
+
21
+ | | GSM8K | MATH |
22
+ | --- | --- | --- |
23
+ | MuggleMATH-7B | 69.8 | 25.8 |
24
+ | MuggleMATH-13B | 74.3 | 30.7 |
25
+ | MuggleMATH-70B | 82.5 | 42.1 |
26
+
27
+ ## **Citation**
28
+
29
+ @misc{li2023query,
30
+ title={Query and Response Augmentation Cannot Help Out-of-domain Math Reasoning Generalization},
31
+ author={Chengpeng Li and Zheng Yuan and Hongyi Yuan and Guanting Dong and Keming Lu and Jiancan Wu and Chuanqi Tan and Xiang Wang and Chang Zhou},
32
+ journal={arXiv preprint arXiv: 2310.05506},
33
+ year={2023}
34
+ }