Mahmud ElHuseyni
MElHuseyni
·
AI & ML interests
Computer Vision
NLP
Machine Learning
Organizations
Collections
2
-
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference
Paper • 2401.08671 • Published • 14 -
NanoFlow: Towards Optimal Large Language Model Serving Throughput
Paper • 2408.12757 • Published • 16 -
richard-park/llama3-deepspeed-v1.0
Text Generation • Updated • 2.23k • 1
models
None public yet
datasets
None public yet