-
Notifications
You must be signed in to change notification settings - Fork 265
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Lack of Knowledge Database Support in Vision Mode for Bedrock Claude 3 #436
Comments
Thank you for spotlighting the absence of knowledge database support in Bedrock Claude 3's Vision mode. I think many will want this! The LangChain blog post suggests three approaches for implementing multi-modal RAG:
Which approach best fits your Claude's Vision mode use case? Did you want option 1, 2, 3, or another variant? |
I'm handle the case for IT support system when users uploading image and asking how to fix the issue. The solution 3 will be the best choice at this time due to complexity and cost of multi-modal embedding. But, for retrieval, I will generate semantic query using LLM from histories + user's question + image. Thanks for advice. |
As you're aware, Bedrock Claude 3 is designed to support multi-modal capabilities, including Vision mode. However, during testing of the latest version, it appears that the system does not currently support accessing the knowledge database when operating in Vision mode (see attached image).
Many use cases involve customers uploading images and seeking solutions, with the expectation that the system can retrieve relevant documents from the internal knowledge base and provide appropriate responses based on the visual input and accompanying query.
Suggested Next Steps:
By addressing these points, we can enhance the functionality of Bedrock Claude 3 in Vision mode, enabling it to leverage the knowledge database effectively when processing visual inputs and queries from customers.
The text was updated successfully, but these errors were encountered: