Hosted Inference
#15 opened by synthetisoft
Does the model architecture mean it can't run on a hosted inference endpoint?
You can, but you need an Ampere-architecture GPU; otherwise, the output is gibberish.
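For context, a minimal loading sketch (assuming the `mosaicml/mpt-7b` checkpoint; the model ID is an assumption, as the thread doesn't name one). It is the `triton` attention implementation that requires an Ampere-class GPU; the default `torch` implementation runs on a wider range of hardware:

```python
import torch
import transformers

name = "mosaicml/mpt-7b"  # assumed checkpoint; any MPT variant follows the same pattern

# MPT uses custom modeling code, so trust_remote_code is required
config = transformers.AutoConfig.from_pretrained(name, trust_remote_code=True)
config.attn_config["attn_impl"] = "triton"  # triton kernel needs an Ampere+ GPU

model = transformers.AutoModelForCausalLM.from_pretrained(
    name,
    config=config,
    torch_dtype=torch.bfloat16,  # bf16 keeps memory use down on A100/A10-class cards
    trust_remote_code=True,
)
model.to("cuda:0")
```

Leaving `attn_impl` at its default (`"torch"`) should load on non-Ampere hardware, at the cost of the faster triton kernel.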
@synthetisoft We (MosaicML) have a competing inference product: https://www.mosaicml.com/inference
However, I have told our contacts at HuggingFace that there is community interest in inference endpoint examples for MPT, and they are working on adding them to the docs.
sam-mosaic changed discussion status to closed