Auto-complete, Inference Engine, Metadata and ML

Host ML models on Amazon SageMaker using Triton: TensorRT models

AWS Machine Learning Blog

MAY 8, 2023

SageMaker provides single-model endpoints (SMEs), which deploy a single ML model, and multi-model endpoints (MMEs), which host multiple models behind one logical endpoint for higher resource utilization. In the Triton model configuration, the input and output fields are required because NVIDIA Triton needs metadata about the model.
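To make the input and output metadata concrete, here is a minimal sketch in Python that renders the text of a Triton `config.pbtxt` for a TensorRT model. The model name `bert_trt`, the tensor names, the data types, and the shapes are hypothetical placeholders chosen for illustration, not values taken from the original post. The helper names `TensorSpec` and `render_config` are likewise invented for this sketch and are not part of any Triton or SageMaker API.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TensorSpec:
    """One input or output tensor as Triton describes it in config.pbtxt."""
    name: str        # hypothetical tensor name
    data_type: str   # Triton type string, e.g. "TYPE_INT32" or "TYPE_FP32"
    dims: List[int]  # shape without the batch dimension


def _render_tensors(field: str, tensors: List[TensorSpec]) -> str:
    """Render an `input [...]` or `output [...]` block."""
    entries = []
    for t in tensors:
        dims = ", ".join(str(d) for d in t.dims)
        entries.append(
            "  {\n"
            f'    name: "{t.name}"\n'
            f"    data_type: {t.data_type}\n"
            f"    dims: [ {dims} ]\n"
            "  }"
        )
    return f"{field} [\n" + ",\n".join(entries) + "\n]"


def render_config(name: str, platform: str, max_batch_size: int,
                  inputs: List[TensorSpec], outputs: List[TensorSpec]) -> str:
    """Build the config text; this sketch refuses to omit tensor metadata."""
    if not inputs or not outputs:
        raise ValueError("both input and output tensors must be declared")
    parts = [
        f'name: "{name}"',
        f'platform: "{platform}"',
        f"max_batch_size: {max_batch_size}",
        _render_tensors("input", inputs),
        _render_tensors("output", outputs),
    ]
    return "\n".join(parts) + "\n"


# Hypothetical BERT-style model compiled to a TensorRT plan.
example_config = render_config(
    name="bert_trt",
    platform="tensorrt_plan",
    max_batch_size=16,
    inputs=[
        TensorSpec("token_ids", "TYPE_INT32", [128]),
        TensorSpec("attn_mask", "TYPE_INT32", [128]),
    ],
    outputs=[TensorSpec("output", "TYPE_FP32", [128, 384])],
)
print(example_config)
```

The generated text would be saved as `config.pbtxt` in the model's directory inside the Triton model repository, alongside the versioned model file. The names and shapes must match what the compiled model actually exposes.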

