Create a Generative AI Gateway to allow secure and compliant consumption of foundation models

AWS Machine Learning Blog

The registry contains metadata about the generative AI service endpoints that an organization consumes, whether they are internally deployed FMs or externally provided generative AI APIs from a vendor. For each model, the registry table holds the endpoint, metadata, and configuration parameters.
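
As a rough illustration of the registry idea in this excerpt, the sketch below keeps one entry per model with its endpoint, metadata, and configuration parameters. The class names, field names, and the example URL are all hypothetical and are not taken from the article, which may use a managed database table instead of an in-memory dictionary.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Optional


@dataclass
class ModelEndpoint:
    """One registry row. Every field name here is a hypothetical choice."""
    model_id: str
    endpoint_url: str
    provider: str  # e.g. "internal" or a vendor name
    metadata: Dict[str, Any] = field(default_factory=dict)
    config: Dict[str, Any] = field(default_factory=dict)


class ModelRegistry:
    """In-memory stand-in for the registry table described above."""

    def __init__(self) -> None:
        self._entries: Dict[str, ModelEndpoint] = {}

    def register(self, entry: ModelEndpoint) -> None:
        # A later registration with the same model_id replaces the earlier one.
        self._entries[entry.model_id] = entry

    def lookup(self, model_id: str) -> Optional[ModelEndpoint]:
        # Returns None when the model is not registered.
        return self._entries.get(model_id)
```

A gateway built this way would look up the entry for a requested model and use its endpoint and configuration when forwarding the call, so that consumers never hard-code endpoints themselves.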

Learn how to build and deploy tool-using LLM agents using AWS SageMaker JumpStart Foundation Models

AWS Machine Learning Blog

Often, these LLMs require some metadata about the available tools (descriptions, plus a YAML or JSON schema for their input parameters) in order to output tool invocations.

About the Author: John Hwang is a Generative AI Architect at AWS with a special focus on Large Language Model (LLM) applications, vector databases, and generative AI product strategy.
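
To make the tool metadata mentioned in this excerpt concrete, here is a minimal sketch of a tool described by a name, a description, and a JSON schema for its inputs, along with a check of a model's JSON tool invocation against that schema's required fields. The tool name `get_weather`, the invocation format (`tool` and `arguments` keys), and both helper functions are assumptions made for illustration, not the format used in the article or by SageMaker JumpStart.

```python
import json
from typing import Any, Dict

# Hypothetical tool description: name, description, JSON schema for inputs.
WEATHER_TOOL: Dict[str, Any] = {
    "name": "get_weather",
    "description": "Look up the current weather for a city.",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}


def render_tool_prompt(tool: Dict[str, Any]) -> str:
    """Format the tool metadata as text that could be placed in an LLM prompt."""
    return (
        f"Tool: {tool['name']}\n"
        f"Description: {tool['description']}\n"
        f"Input schema: {json.dumps(tool['parameters'])}"
    )


def parse_invocation(raw: str, tool: Dict[str, Any]) -> Dict[str, Any]:
    """Parse a JSON tool call and check only that required arguments are present."""
    call = json.loads(raw)
    if call.get("tool") != tool["name"]:
        raise ValueError(f"unexpected tool: {call.get('tool')!r}")
    args = call.get("arguments", {})
    missing = [k for k in tool["parameters"].get("required", []) if k not in args]
    if missing:
        raise ValueError(f"missing required arguments: {missing}")
    return args
```

The check here covers required keys only; full JSON Schema validation (types, nested objects) would need a dedicated validator.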

LLM 101