pszemraj
/

long-t5-tglobal-large-booksum-WIP

text2text-generation

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

pszemraj commited on Nov 27, 2022

Commit

de92e8b

•

1 Parent(s): 017f950

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -16,6 +16,7 @@ model-index:
 # tglobal-large-booksum-WIP
 > this is a WIP checkpoint that has been fine-tuned from the vanilla (original) for 10ish epochs. It is **not ready to be used for inference**
 This model is a fine-tuned version of [google/long-t5-tglobal-large](https://huggingface.co/google/long-t5-tglobal-large) on the `kmfoda/booksum` dataset.
 It achieves the following results on the evaluation set:
 - Loss: 4.9519
@@ -36,6 +37,7 @@ this is a WIP checkpoint that has been fine-tuned from the vanilla (original) fo
 ## Training and evaluation data
 This is **only** fine-tuned on booksum (vs. previous large WIP checkpoint I made that started from a partially-trained `pubmed` checkpoint)
 ## Training procedure
 ### Training hyperparameters

 # tglobal-large-booksum-WIP
 > this is a WIP checkpoint that has been fine-tuned from the vanilla (original) for 10ish epochs. It is **not ready to be used for inference**
 This model is a fine-tuned version of [google/long-t5-tglobal-large](https://huggingface.co/google/long-t5-tglobal-large) on the `kmfoda/booksum` dataset.
 It achieves the following results on the evaluation set:
 - Loss: 4.9519
 ## Training and evaluation data
 This is **only** fine-tuned on booksum (vs. previous large WIP checkpoint I made that started from a partially-trained `pubmed` checkpoint)
 ## Training procedure
 ### Training hyperparameters