DontPlanToEnd
committed on
Commit 310fa50
1 Parent(s): 4fb272d
Update app.py
app.py
CHANGED
@@ -125,7 +125,7 @@ with GraInter:
 gr.Markdown("""
 **UGI: Uncensored General Intelligence**. A measurement of the amount of uncensored/controversial information an LLM knows. It is calculated from the average score of 5 subjects LLMs commonly refuse to talk about. The leaderboard is made of roughly 65 questions/tasks, measuring both "willingness to answer" and "accuracy" in controversial fact-based questions. I'm choosing to keep the questions private so people can't train on them and devalue the leaderboard.
 
-**W/10:** Willingness/10. A more narrow, 10-point score, measuring how far the model can be pushed before going against its instructions, refusing to answer, or adding
+**W/10:** Willingness/10. A more narrow, 10-point score, measuring how far the model can be pushed before going against its instructions, refusing to answer, or adding an ethical disclaimer to its response.
 <br><br>
 A high UGI but low W/10 could mean for example that the model can provide a lot of sensitive information, but will refuse to form the information into something it sees as dangerous.
 <br><br>
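
The description string in this diff says UGI is "calculated from the average score of 5 subjects". A minimal sketch of that averaging step follows. It is not taken from app.py: the function name `ugi_score`, the placeholder subject names, the score values, and the score scale are all assumptions made for illustration, since the commit shows neither the scoring code nor the subjects.

```python
from statistics import mean


def ugi_score(subject_scores):
    """Average per-subject scores into one UGI-style value.

    Hypothetical helper: the real leaderboard code is not shown in this commit.
    """
    # The description mentions exactly 5 subjects, so reject anything else.
    if len(subject_scores) != 5:
        raise ValueError("expected scores for exactly 5 subjects")
    return mean(subject_scores.values())


# Placeholder subject names and made-up scores (the real subjects and
# score scale are not given in the source).
example_scores = {
    "subject_a": 40.0,
    "subject_b": 55.0,
    "subject_c": 30.0,
    "subject_d": 65.0,
    "subject_e": 60.0,
}

ugi = ugi_score(example_scores)
print(ugi)
```

W/10 is described as a separate 10-point score, so under this reading it would be computed independently rather than derived from the per-subject average.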