Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The Tweets with only images are not recognized when asked to LLM #156

Open
tomipardinas opened this issue Jul 24, 2024 · 13 comments
Open

The Tweets with only images are not recognized when asked to LLM #156

tomipardinas opened this issue Jul 24, 2024 · 13 comments
Labels
enhancement New feature or request

Comments

@tomipardinas
Copy link

When asked about a specific text from the database, it only recognizes text, but the text from the image isn't recognized:
Here the question:
image

Here the tweet mentioning the image with the text
image

@Dhravya
Copy link
Collaborator

Dhravya commented Jul 24, 2024

There's ways to fix this right now, but I think that at scale it doesn't make sense to use a vision model or even OCR for that matter on every single image.
We currently are reaching into 60,000 tweets imported into supermemory. a lot of them also have images. would get expeensive

@CodeTorso
Copy link
Contributor

so you have any other method method ?

@krakenftw
Copy link
Contributor

@Dhravya for the time being, if we really want the feature, we can use some OCR API? which are free/extremely cheap?

@krakenftw
Copy link
Contributor

https://ocr.space/ocrapi here is one

@Welding-Torch
Copy link
Contributor

This should be tackled eventually. If we're importing tweets into Supermemory, providing a first-class experience to the user is important.

@Welding-Torch Welding-Torch added the enhancement New feature or request label Jul 25, 2024
@tomipardinas
Copy link
Author

Yes. If the objective is truly to create a "super" memory, it needs to be able to extract useful information from all types of unstructured data in your tweets, including images containing text. This is especially important because tweets with images are more likely to be retweeted, which increases their chances of being valuable and, consequently, the probability of being bookmarked.

@Dhravya
Copy link
Collaborator

Dhravya commented Jul 26, 2024

Hmmm. this is gonna be a long discussion. We did create endpoints and stuff to make this work long ago but nothing after that

@CodeTorso
Copy link
Contributor

yep that endpoint uses llava-1.5-7b-hf,

nowhere on cloudflare's docs we can get an idea of how much it will cost

but since we are already looking forward to adding pro plan, why not make it part of that ?

@Welding-Torch
Copy link
Contributor

@CodeTorso As a user I would definitely expect this to be part of the free plan since importing tweets right now is basically our USP

@Dhravya
Copy link
Collaborator

Dhravya commented Aug 4, 2024

Hmmmmm, llava right now is free. We can probably look into this and do a dual sort of import, should be not that hard when we have the image URL that we can fetch

but we can only do this after the queues PR is merged, and then add an extra step in the add workflow to fetch any images and embed those too

@ameeetgaikwad
Copy link

@Dhravya we can add paid version for saving images. Can integrate stripe. What say?

@Welding-Torch
Copy link
Contributor

Again,
As a user I would definitely expect recognizing content in saved tweet images to be part of the free plan since importing tweets right now is basically our USP

@ameeetgaikwad
Copy link

Hmm, that's right.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

6 participants