Compiled TensorRT-LLM engines for running Whisper with much faster inference.
baseten
company
Verified
Collections: 1
Models: 584
baseten/whisper_trt_large_v3_turbo_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_13_0
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-H100-80GB-HBM3-v0.13.0-TP2 • 30
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-H100-80GB-HBM3-v0.13.0-TP1 • 86
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-H100-80GB-HBM3-v0.12.0-TP2 • 2
baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-H100-80GB-HBM3-v0.12.0-TP1 • 6
baseten/whisper_trt_large_v3_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_13_0
baseten/btest-Mistral-Large-Instruct-2407-NVIDIA-H100-80GB-HBM3-v0.13.0-TP8 • 22
baseten/btest-Mistral-Large-Instruct-2407-NVIDIA-H100-80GB-HBM3-v0.13.0-TP4
baseten/whisper_trt_large_v3_NVIDIA_H100_80GB_HBM3_0_13_0
baseten/whisper_trt_large_v3_turbo_NVIDIA_H100_80GB_HBM3_0_13_0