---
language: en
license: apache-2.0
tags:
- summarization
datasets: arxiv-summarization
model-index:
- name: ArtifactAI/led_large_16384_arxiv_summarization
results:
- task:
type: summarization
name: Summarization
dataset:
name: ccdv/arxiv-summarization
type: ccdv/arxiv-summarization
config: section
split: test
metrics:
- type: rouge
value: 37.9472
name: ROUGE-1
verified: true
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZDFkMzY4YTk0NGUyNDJjYzc2MWFiMGJlNWUyYTM2YjlmNjlkY2VkYmVhMDk2YjIxMjE3MjE4M2ZkOTAwODE2ZSIsInZlcnNpb24iOjF9.t2x5mqi0xM9Q0K9MscHZ6v_5pc-MOw8KieFTvFMqh5K4UAvvvcVGOGfGQi_Qb57gQa2DkrW0cNrJADY0VA1tAQ
- type: rouge
value: 11.3138
name: ROUGE-2
verified: true
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNjdlYmQ4ZmRkNzc3YzE0NGQ2MTRhNDE4YTExNDYwYmNjODFhYjdmYTJlZWE4OTRhYWRiZmNmODZkMDZjMWY3NSIsInZlcnNpb24iOjF9.RPWY5CZMjaFaQ1vRQPoHyZxPD67dQdbXYL0UlJ53b_q1dMczXb7HtE_UmDNPi6F7thciVt6xWIzsckVmp9ZJCw
- type: rouge
value: 20.5557
name: ROUGE-L
verified: true
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYWEwNTQ5MWViZTYwM2EyNzI0OWEyZDNlY2ExOTJiMjI3MmNjM2I4YmJjMzljYTQ3NjhkNjAzYzM5MDQzYjVkOCIsInZlcnNpb24iOjF9.ZgSkTbiUDaQRJGBIXjlTZKbtKmrIljEJ6btwhyfBsaz5oS0qmI76-b_vDRswnx96OcGTqdxICIjma6jgNbKiBA
- type: rouge
value: 33.8336
name: ROUGE-LSUM
verified: true
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiY2EzNzNhMWVmYjM5ZWUwOTZkYjU0MGZjMWQ0YTQ1NzA1NWQ4MjBjNjNhM2FmMmE3MmM3NzQwMzVkN2QzMzQxZiIsInZlcnNpb24iOjF9.bhxtgWXjCEv5ZFY3F7Mp-r4EHrIU8BNZ8X2zhpjSoyVLmjbfdFB-lnJdoH3PfVZEa14T96SJqMSHa6yzlqGEAQ
- type: loss
value: 2.8064792156219482
name: loss
verified: true
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYzBhMTE0ZTdhOTRmYWE1Mjk5ZmViYjZiMjBmNzc2YzQ4YmNhYWM3NzRjYWUwYTEyZjU1NGI5MjVhODQwOTBlNCIsInZlcnNpb24iOjF9.l0nIJCcjoFyPF9M7MHiQxBQ3wtyk6jXURY0ZF6Xny3_DpkDh5YHs9kF494GJp5eYj6XG5HRGCgqhfmU7-fywAw
- type: gen_len
value: 157.4174
name: gen_len
verified: true
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNDY0ZmE4M2VmOTU1NWY5M2I4YTYxNjM3NTkxNWU4NDY3N2Y0MTM1YWNlNmNjMGQ4N2UzM2ZkZWJhZTVmMjQ2OCIsInZlcnNpb24iOjF9.sAp6g7nt1tKTdGfOlGm3fdxzH1jxjNOZO65BNnVJkxDhu86j8QP3ZvNPv7PpD2sK4p6yM_HlHPPeX4bgmDi2BQ
---
## Introduction
A led-large-16384 model for summarizing arXiv papers. Inputs are paper abstracts and full documents; outputs are summaries of the papers.
The model is based on [Allenai's Longformer Encoder-Decoder (LED)](https://github.com/allenai/longformer#longformer).
As described in [Longformer: The Long-Document Transformer](https://arxiv.org/pdf/2004.05150.pdf) by Iz Beltagy, Matthew E. Peters, and Arman Cohan,
*led-base-16384* was initialized from [*bart-base*](https://huggingface.co/facebook/bart-base) since both models share the exact same architecture. To
be able to process 16K tokens, *bart-base*'s position embedding matrix was simply copied 16 times.
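
The position-embedding copy described above can be illustrated with a small, dependency-free sketch. It is not the authors' conversion script: the helper name `extend_position_embeddings`, the toy hidden size, and the random values are assumptions made for illustration, and the 1024-position source length is BART's standard maximum rather than something stated in this card.

```python
import random


def extend_position_embeddings(pos_emb, copies):
    """Stack `copies` back-to-back copies of a position embedding matrix.

    `pos_emb` is a list of rows (one row per position). The result has
    len(pos_emb) * copies rows, so position p in the extended matrix
    reuses the vector of position p % len(pos_emb) in the original.
    """
    return [list(row) for _ in range(copies) for row in pos_emb]


# Assumed toy setup: 1024 source positions, tiny hidden size for readability.
BART_MAX_POSITIONS = 1024
HIDDEN_SIZE = 4
rng = random.Random(0)
bart_pos_emb = [
    [rng.uniform(-1.0, 1.0) for _ in range(HIDDEN_SIZE)]
    for _ in range(BART_MAX_POSITIONS)
]

# Copy the matrix 16 times to cover 16 * 1024 = 16384 positions.
led_pos_emb = extend_position_embeddings(bart_pos_emb, copies=16)
print(len(led_pos_emb), len(led_pos_emb[0]))
```

Because the copies are laid end to end, every block of 1024 positions starts from the same learned vectors; fine-tuning on long documents is then what lets the model adapt them to the longer context.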