Feature Request: RAG support for multimodal models #612

Open

azaylamba opened this issue Dec 2, 2024 · 1 comment
Labels
enhancement New feature or request

Comments

@azaylamba
Contributor

Currently, we can upload an image and get answers about it using multimodal models, but this feature doesn't support the RAG workflow.
It would be good if the model first understood the image and then answered questions about it using the documents stored in the vector storage. That way, users could get answers by simply uploading images of errors.
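
To illustrate the kind of flow I mean (just a sketch, not how the chatbot works today): ask the multimodal model to describe the uploaded image, use that description as the retrieval query against the workspace's vector store, then answer the question grounded in the retrieved documents. The example below uses the Bedrock Converse API via boto3; `retrieve_documents`, the model ID, and the prompts are placeholders, not the project's actual code.

```python
import boto3

# Example model ID; any Bedrock model with vision support would work similarly.
MODEL_ID = "anthropic.claude-3-sonnet-20240229-v1:0"

bedrock = boto3.client("bedrock-runtime")


def describe_image(image_bytes: bytes, question: str) -> str:
    """Ask a multimodal model to turn the uploaded image into text."""
    response = bedrock.converse(
        modelId=MODEL_ID,
        messages=[{
            "role": "user",
            "content": [
                {"image": {"format": "png", "source": {"bytes": image_bytes}}},
                {"text": f"Describe this image so the description can help answer: {question}"},
            ],
        }],
    )
    return response["output"]["message"]["content"][0]["text"]


def retrieve_documents(query: str, top_k: int = 5) -> list[str]:
    """Placeholder for the workspace's existing vector-store retrieval (hypothetical)."""
    raise NotImplementedError("Wire this to the chatbot's retrieval API.")


def answer_with_rag(image_bytes: bytes, question: str) -> str:
    # 1. Turn the image into text the retriever can work with.
    image_description = describe_image(image_bytes, question)

    # 2. Retrieve relevant documents using the question plus the image description.
    documents = retrieve_documents(query=f"{question}\n{image_description}", top_k=5)

    # 3. Answer the original question grounded in the retrieved context.
    context = "\n\n".join(documents)
    response = bedrock.converse(
        modelId=MODEL_ID,
        messages=[{
            "role": "user",
            "content": [{
                "text": (
                    f"Context:\n{context}\n\n"
                    f"Image description:\n{image_description}\n\n"
                    f"Question: {question}"
                )
            }],
        }],
    )
    return response["output"]["message"]["content"][0]["text"]
```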

@azaylamba changed the title from "Feature Request: RAG support multimodal models" to "Feature Request: RAG support for multimodal models" on Dec 2, 2024
@charles-marion added the bug and enhancement labels and removed the bug label on Dec 4, 2024
@charles-marion
Collaborator

I agree with you. It would mean moving the multimodal logic (at least for Bedrock) to the LangChain interface, or adding RAG support to the Idefics one.

I will keep this issue updated if it gets added.
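
To make that concrete, here is a rough sketch of what an adapter-level change could look like (all class and method names are hypothetical, not the repository's actual interfaces): the LangChain-side adapter accepts optional image attachments, uses the multimodal model to fold them into the retrieval query, and then reuses the existing RAG chain.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class ChatRequest:
    """Hypothetical request shape: a text question plus optional image attachments."""
    question: str
    images: list[bytes] = field(default_factory=list)
    workspace_id: Optional[str] = None  # workspace whose vector store should be queried


class MultimodalRagAdapter:
    """Hypothetical adapter: folds image understanding into the existing RAG chain."""

    def __init__(self, multimodal_llm, retriever, text_llm):
        self.multimodal_llm = multimodal_llm  # model that can describe images
        self.retriever = retriever            # workspace vector-store retriever
        self.text_llm = text_llm              # model that writes the grounded answer

    def run(self, request: ChatRequest) -> str:
        query = request.question
        if request.images:
            # Turn the images into text so retrieval can work on them.
            description = self.multimodal_llm.describe(request.images, request.question)
            query = f"{request.question}\n{description}"

        # Only query the vector store when a workspace is selected.
        docs = self.retriever.retrieve(query) if request.workspace_id else []

        return self.text_llm.answer(question=request.question, context=docs)
```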
