Remove Auto-complete Remove Inference Engine Remove Metadata Remove ML
article thumbnail

Host ML models on Amazon SageMaker using Triton: TensorRT models

AWS Machine Learning Blog

SageMaker provides single model endpoints (SMEs), which allow you to deploy a single ML model, or multi-model endpoints (MMEs), which allow you to specify multiple models to host behind a logical endpoint for higher resource utilization. Input and output – These fields are required because NVIDIA Triton needs metadata about the model.

ML 88