A collection of recent papers on NLG evaluations, very applicable to components of LLM systems.
-
Can Large Language Models Be an Alternative to Human Evaluations?
Paper • 2305.01937 • Published • 2 -
Decontextualization: Making Sentences Stand-Alone
Paper • 2102.05169 • Published -
RARR: Researching and Revising What Language Models Say, Using Language Models
Paper • 2210.08726 • Published • 1 -
SummEval: Re-evaluating Summarization Evaluation
Paper • 2007.12626 • Published