DontPlanToEnd
committed on
Commit be311b8
1 Parent(s): 488a057
Update app.py
app.py
CHANGED
@@ -189,7 +189,7 @@ with GraInter:
 
 **W/10:** Willingness/10. A more narrow, 10-point score, measuring how far the model can be pushed before going against its instructions, refusing to answer, or adding an ethical disclaimer to its response.
 <br><br>
-A high UGI but low W/10 could mean for example that the model can provide a lot of accurate sensitive information, but will refuse to form the information into something it sees as dangerous.
+A high UGI but low W/10 could mean for example that the model can provide a lot of accurate sensitive information, but will refuse to form the information into something it sees as dangerous. Or that it answers questions correctly, but appends a paragraph to its answer explaining why the question is immoral to ask.
 <br><br>
 **Unruly:** Knowledge of activities that are generally frowned upon.
 <br>