Dialogue-guided visual language processing with Amazon SageMaker JumpStart
AWS Machine Learning Blog
NOVEMBER 1, 2023
Visual language models (VLMs) combine large language models (LLMs) with Contrastive Language-Image Pre-Training (CLIP) and are trained on large quantities of multimodal data, which makes them particularly adept at tasks such as image captioning, object detection and segmentation, and visual question answering.