chrisliu298
commited on
Commit
•
f94f78b
1
Parent(s):
e69bfcf
Update README.md
Browse files
README.md
CHANGED
@@ -51,7 +51,7 @@ We evaluate our model on [RewardBench](https://huggingface.co/spaces/allenai/rew
|
|
51 |
|
52 |
| Rank | Model | Model Type | Score | Chat | Chat Hard | Safety | Reasoning |
|
53 |
| :---: | -------------------------------------------- | ----------------- | :---: | :---: | :-------: | :----: | :-------: |
|
54 |
-
| 1 | **Skywork/Skywork-Reward-Gemma-2-27B-v0.2** | Seq. Classifier | 94.
|
55 |
| 2 | nvidia/Llama-3.1-Nemotron-70B-Reward | Custom Classifier | 94.1 | 97.5 | 85.7 | 95.1 | 98.1 |
|
56 |
| 3 | Skywork/Skywork-Reward-Gemma-2-27B | Seq. Classifier | 93.8 | 95.8 | 91.4 | 91.9 | 96.1 |
|
57 |
| 4 | SF-Foundation/TextEval-Llama3.1-70B | Generative | 93.5 | 94.1 | 90.1 | 93.2 | 96.4 |
|
|
|
51 |
|
52 |
| Rank | Model | Model Type | Score | Chat | Chat Hard | Safety | Reasoning |
|
53 |
| :---: | -------------------------------------------- | ----------------- | :---: | :---: | :-------: | :----: | :-------: |
|
54 |
+
| 1 | **Skywork/Skywork-Reward-Gemma-2-27B-v0.2** | Seq. Classifier | 94.2 | 96.1 | 89.7 | 93.0 | 98.1 |
|
55 |
| 2 | nvidia/Llama-3.1-Nemotron-70B-Reward | Custom Classifier | 94.1 | 97.5 | 85.7 | 95.1 | 98.1 |
|
56 |
| 3 | Skywork/Skywork-Reward-Gemma-2-27B | Seq. Classifier | 93.8 | 95.8 | 91.4 | 91.9 | 96.1 |
|
57 |
| 4 | SF-Foundation/TextEval-Llama3.1-70B | Generative | 93.5 | 94.1 | 90.1 | 93.2 | 96.4 |
|