π€ Optimum Neuron
π€ Optimum Neuron is the interface between the π€ Transformers library and AWS Accelerators including AWS Trainium and AWS Inferentia. It provides a set of tools enabling easy model loading, training and inference on single- and multi-Accelerator settings for different downstream tasks. The list of officially validated models and tasks is available here.
Learn the basics and become familiar with training & deploying transformers on AWS Trainium and AWS Inferentia. Start here if you are using π€ Optimum Neuron for the first time!
Practical guides to help you achieve a specific goal. Take a look at these guides to learn how to use π€ Optimum Neuron to solve real-world problems.
Technical descriptions of how the classes and methods of π€ Optimum Neuron work.