Cross-Modal Retrieval: Image-to-Text and Text-to-Image Search
Heartbeat
FEBRUARY 8, 2024
Convolutional neural networks (CNNs) and recurrent neural networks (RNNs) are often employed to extract meaningful representations from images and text, respectively. Textual queries are transformed into embeddings using methods like word embeddings or recurrent neural networks.
Let's personalize your content