hanbin commited on
Commit
d871c6c
1 Parent(s): 01849b6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -0
README.md CHANGED
@@ -66,6 +66,12 @@ Simplify your answer as much as possible.
66
  {Instruction} [/INST]
67
  ```
68
 
 
 
 
 
 
 
69
  ## Citation
70
  ```
71
  @misc{yuan2024advancing,
 
66
  {Instruction} [/INST]
67
  ```
68
 
69
+ ## Evaluation
70
+ - Eurus, both the 7B and 70B variants, achieve the best overall performance among open-source models of similar sizes. Eurus even outperforms specialized models in corresponding domains in many cases. Notably, Eurus-7B outperforms baselines that are 5× larger, and Eurus-70B achieves better performance than GPT-3.5 Turbo.
71
+ - Preference learning with UltraInteract can further improve performance, especially in math and the multi-turn ability.
72
+ <img src="./figures/main_exp.png" alt="stats" style="zoom: 40%;" />
73
+
74
+
75
  ## Citation
76
  ```
77
  @misc{yuan2024advancing,