Compressed LLMs from the Community
Collection
LLMs optimized by the community using Neural Magic's LLM Compressor for efficient deployment in vLLM. Contribute and help advance efficient AI!
3 items