-
Notifications
You must be signed in to change notification settings - Fork 769
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The Tweets with only images are not recognized when asked to LLM #156
Comments
There's ways to fix this right now, but I think that at scale it doesn't make sense to use a vision model or even OCR for that matter on every single image. |
so you have any other method method ? |
@Dhravya for the time being, if we really want the feature, we can use some OCR API? which are free/extremely cheap? |
https://ocr.space/ocrapi here is one |
This should be tackled eventually. If we're importing tweets into Supermemory, providing a first-class experience to the user is important. |
Yes. If the objective is truly to create a "super" memory, it needs to be able to extract useful information from all types of unstructured data in your tweets, including images containing text. This is especially important because tweets with images are more likely to be retweeted, which increases their chances of being valuable and, consequently, the probability of being bookmarked. |
Hmmm. this is gonna be a long discussion. We did create endpoints and stuff to make this work long ago but nothing after that |
yep that endpoint uses llava-1.5-7b-hf, nowhere on cloudflare's docs we can get an idea of how much it will cost but since we are already looking forward to adding pro plan, why not make it part of that ? |
@CodeTorso As a user I would definitely expect this to be part of the free plan since importing tweets right now is basically our USP |
Hmmmmm, llava right now is free. We can probably look into this and do a dual sort of import, should be not that hard when we have the image URL that we can fetch but we can only do this after the queues PR is merged, and then add an extra step in the add workflow to fetch any images and embed those too |
@Dhravya we can add paid version for saving images. Can integrate stripe. What say? |
Again, |
Hmm, that's right. |
When asked about a specific text from the database, it only recognizes text, but the text from the image isn't recognized:
Here the question:
Here the tweet mentioning the image with the text
The text was updated successfully, but these errors were encountered: