Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing ann-benchmarks documentation? #32

Closed
loisaidasam opened this issue Oct 20, 2023 · 4 comments
Closed

Missing ann-benchmarks documentation? #32

loisaidasam opened this issue Oct 20, 2023 · 4 comments

Comments

@loisaidasam
Copy link

The README says:

like Annoy, but with much higher recall

Think of Voyager like [Sparkey](https://github.com/spotify/sparkey), but for vector/embedding data; or like [Annoy](https://github.com/spotify/annoy), but with [much higher recall](http://ann-benchmarks.com/). It got its name because it searches through (embedding) space(s), much like [the Voyager interstellar probes](https://en.wikipedia.org/wiki/Voyager_program) launched by NASA in 1977.

but I don't see any references to Voyager on the ann benchmarks page. Am I missing something?

@loretoparisi
Copy link

I think they were mentioning the fact that in respect to Annoy, Voyager is using hnsw lib as ANN, hence the base benchmark shows that Voyager has a higher Recall / Queries per seconds than Annoy. You can clearly see this from here.

image

@loisaidasam
Copy link
Author

Which "base benchmark" are you referring to?

Perhaps faiss's hnsw implementation? or perhaps hnswlib? Those don't seem like fair comparisons, since Voyager is an entirely separate implementation.

Also, I think "clearly" is being generous. The way that the README links to ann-benchmarks suggests that one would see "Voyager" on these charts.

@loretoparisi
Copy link

hnswlib

I think a reasonable reference (Base line) could be the bare hnswlib. That said, according to the announcement, Voyager has a customized hnswllib version (as you can see from the sources), so the tests should be done from scratch to be more accurate.

@ijanderso
Copy link

Closing as it has been added to ann-benchmarks: erikbern/ann-benchmarks#473

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants