add text-generation pipeline example with autocast (#47)

- add text-generation pipeline example with autocast (2faa761bc10cdff64021541e93fa9b2f67482bf6)

Co-authored-by: Vitaliy Chiley

Files changed (1): README.md

@@ -102,6 +102,22 @@ from transformers import AutoTokenizer
 tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")
 ```
+The model can then be used, for example, within a text-generation pipeline.
+Note: when running Torch modules in lower precision, it is best practice to use the [torch.autocast context manager](https://pytorch.org/docs/stable/amp.html).
+```python
+import torch  # needed for torch.autocast and torch.bfloat16 below
+from transformers import pipeline
+pipe = pipeline('text-generation', model=model, tokenizer=tokenizer, device='cuda:0')
+with torch.autocast('cuda', dtype=torch.bfloat16):
+    print(
+        pipe('Here is a recipe for vegan banana bread:\n',
+            max_new_tokens=100,
+            do_sample=True,
+            use_cache=True))
+```
 ### Formatting
 This model was trained on data formatted in the dolly-15k format: