autoevaluator HF staff committed on
Commit
4e8fbbf
1 Parent(s): 8ea7547

Add evaluation results on the mathemakitten--winobias_antistereotype_test_cot config and test split of mathemakitten/winobias_antistereotype_test_cot

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the mathemakitten--winobias_antistereotype_test_cot config and test split of the [mathemakitten/winobias_antistereotype_test_cot](https://huggingface.co/datasets/mathemakitten/winobias_antistereotype_test_cot) dataset by @mathemakitten, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-mathemakitten__winobias_antistereotype_test_cot-mathema-f8e841-1882064214).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=mathemakitten/winobias_antistereotype_test_cot).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=mathemakitten/winobias_antistereotype_test_cot).

Files changed (1)
  1. README.md +20 -1
README.md CHANGED
@@ -4,9 +4,28 @@ inference: false
 tags:
 - text-generation
 - opt
-
 license: other
 commercial: false
+model-index:
+- name: facebook/opt-66b
+  results:
+  - task:
+      type: zero-shot-classification
+      name: Zero-Shot Text Classification
+    dataset:
+      name: mathemakitten/winobias_antistereotype_test_cot
+      type: mathemakitten/winobias_antistereotype_test_cot
+      config: mathemakitten--winobias_antistereotype_test_cot
+      split: test
+    metrics:
+    - name: Accuracy
+      type: accuracy
+      value: 0.33495145631067963
+      verified: true
+    - name: Loss
+      type: loss
+      value: 1.4457874361436158
+      verified: true
 ---
 
 # OPT : Open Pre-trained Transformer Language Models
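
For readers who want to consume the added `model-index` metadata programmatically, here is a minimal Python sketch of how the verified metrics could be looked up for one dataset split. It assumes the README front matter has already been parsed into plain Python lists and dicts (for example with a YAML parser). `MODEL_INDEX` is a hand-copied stand-in for that parsed block and `verified_metrics` is a hypothetical helper, not part of any Hugging Face library.

```python
# Hand-copied stand-in for the `model-index` block added by this commit,
# as it might look after parsing the README front matter (assumption).
MODEL_INDEX = [
    {
        "name": "facebook/opt-66b",
        "results": [
            {
                "task": {
                    "type": "zero-shot-classification",
                    "name": "Zero-Shot Text Classification",
                },
                "dataset": {
                    "name": "mathemakitten/winobias_antistereotype_test_cot",
                    "type": "mathemakitten/winobias_antistereotype_test_cot",
                    "config": "mathemakitten--winobias_antistereotype_test_cot",
                    "split": "test",
                },
                "metrics": [
                    {"name": "Accuracy", "type": "accuracy",
                     "value": 0.33495145631067963, "verified": True},
                    {"name": "Loss", "type": "loss",
                     "value": 1.4457874361436158, "verified": True},
                ],
            }
        ],
    }
]


def verified_metrics(model_index, dataset_type, split):
    """Return {metric type: value} for verified metrics on one dataset split.

    Hypothetical helper for illustration only.
    """
    found = {}
    for entry in model_index:
        for result in entry.get("results", []):
            dataset = result.get("dataset", {})
            # Skip results reported for other datasets or splits.
            if dataset.get("type") != dataset_type or dataset.get("split") != split:
                continue
            for metric in result.get("metrics", []):
                if metric.get("verified"):
                    found[metric["type"]] = metric["value"]
    return found


if __name__ == "__main__":
    metrics = verified_metrics(
        MODEL_INDEX, "mathemakitten/winobias_antistereotype_test_cot", "test"
    )
    for metric_type, value in metrics.items():
        print(f"{metric_type}: {value:.4f}")
```

Filtering on both the dataset `type` and the `split` keeps the lookup unambiguous if further evaluation results are later appended to the same `model-index` list.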