The Curious Case of Neural Text Degeneration
Paper
•
1904.09751
•
Published
•
3
Figure 5: The probability mass assigned to partial human sentences. Flat distributions lead to many moderately probable tokens, while peaked distribut