Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
aws-neuron
/
optimum-neuron-cache
like
12
Follow
AWS Inferentia and Trainium
62
License:
apache-2.0
Model card
Files
Files and versions
Community
242
6c8dc74
optimum-neuron-cache
/
neuronxcc-2.13.66.0+6dfecc895
/
0_REGISTRY
/
0.0.23.dev0
/
inference
/
llama
/
princeton-nlp
/
Sheared-LLaMA-1.3B
Commit History
Synchronizing local compiler cache.
6ae4846
verified
dacorvo
HF staff
commited on
May 31
Synchronizing local compiler cache.
4e24461
verified
dacorvo
HF staff
commited on
May 30
Synchronizing local compiler cache.
39811bc
verified
dacorvo
HF staff
commited on
May 30