deepset/flan-t5-xl-squad2

sjrhuschlee committed on Oct 2, 2023

Commit 01a7127 • 1 Parent(s): 05c4045

Update README.md

Files changed (1)

README.md +14 -1

README.md CHANGED

@@ -1,6 +1,7 @@
 ---
 language: en
 license: cc-by-4.0
 tags:
 - flan
 - flan-t5
@@ -24,7 +25,16 @@ This is the [flan-t5-xl](https://huggingface.co/google/flan-t5-xl) model, fine-t
 ## Hyperparameters
 ```
-n_epochs = 4
 ```
 ## Usage
@@ -55,6 +65,9 @@ model = AutoModelForQuestionAnswering.from_pretrained(model_name)
 tokenizer = AutoTokenizer.from_pretrained(model_name)
 ```
 ## About us
 <div class="grid lg:grid-cols-2 gap-x-4 gap-y-3">

 ---
 language: en
 license: cc-by-4.0
+base_model: google/flan-t5-xl
 tags:
 - flan
 - flan-t5
 ## Hyperparameters
 ```
+learning_rate: 1e-05
+train_batch_size: 4
+eval_batch_size: 8
+seed: 42
+gradient_accumulation_steps: 16
+total_train_batch_size: 64
+optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+lr_scheduler_type: linear
+lr_scheduler_warmup_ratio: 0.1
+num_epochs: 4.0
 ```
 ## Usage
 tokenizer = AutoTokenizer.from_pretrained(model_name)
 ```
+## Authors
+Sebastian Husch Lee: sebastian.huschlee [at] deepset.ai
 ## About us
 <div class="grid lg:grid-cols-2 gap-x-4 gap-y-3">
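
Reviewer note on the hyperparameters added in this commit: the listed `total_train_batch_size: 64` is consistent with `train_batch_size: 4` multiplied by `gradient_accumulation_steps: 16`, assuming a single training device. The sketch below checks that arithmetic and illustrates what a linear schedule with a 0.1 warmup ratio typically looks like. The function `linear_lr` and the step count of 1000 are hypothetical and chosen only for illustration; the exact scheduler implementation used for training is not shown in the diff.

```python
# Values copied from the hyperparameter block in the diff.
train_batch_size = 4
gradient_accumulation_steps = 16
learning_rate = 1e-5
warmup_ratio = 0.1

# Assumption: one device, so the effective batch size is the per-step
# batch size times the number of accumulation steps.
total_train_batch_size = train_batch_size * gradient_accumulation_steps


def linear_lr(step: int, total_steps: int) -> float:
    """Hypothetical helper: linear warmup to the peak rate, then linear decay to zero."""
    warmup_steps = int(warmup_ratio * total_steps)
    if step < warmup_steps:
        # Warmup phase: scale the rate up proportionally to the step.
        return learning_rate * step / max(1, warmup_steps)
    # Decay phase: scale the rate down linearly over the remaining steps.
    remaining = total_steps - step
    return learning_rate * max(0.0, remaining / max(1, total_steps - warmup_steps))


if __name__ == "__main__":
    total_steps = 1000  # illustrative value, not taken from the model card
    print("effective batch size:", total_train_batch_size)
    for step in (0, 50, 100, 550, 1000):
        print(step, linear_lr(step, total_steps))
```

With these values the rate rises over the first 10% of steps, reaches `1e-05` at the end of warmup, and falls back to zero at the final step.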