dstack to manage clusters of on-prem servers for AI workloads with ease
awesome! going to add one more env var to switch mode then :)
gr.ChatInterface. However, it is not limited to chat usage; you can leverage the efficiency of TGI in any sort of app built with Gradio.
mamba
is now available in transformers. Thanks to @tridao and @albertgu for this brilliant model and the amazing mamba-ssm kernels powering this!