Feat: Improved module search #3047

Ell1ott · 2024-05-17T21:48:21Z

Previously, if you made a typo while searching for a module, the module would not show up in the results. This is now fixed/improved using the typo-safe-search npm package.

Examples:

It is still not perfect, but I much prefer this behaviour.

1zun4 · 2024-05-24T22:36:31Z

Is there any reason it is still on draft?

Ell1ott · 2024-05-25T11:56:16Z

> Is there any reason it is still on draft?

Not anymore. I just wanted to make sure that there weren't any edge cases where it wouldn't work, but it is now ready to be merged.

SenkJu · 2024-05-25T19:42:37Z

I see the library you are using is made by you. What distance function did you implement? It doesn't seem to be one I know.

Ell1ott · 2024-05-25T21:19:00Z

Well, I chose to write my own library and distance function because I couldn't find any good libraries that did both of the following:

It would allow typos (that is, characters in the query that don't exist in the item). For example, I found command-score, which does something similar but doesn't allow incorrect characters.
It would expect the query to be incomplete. When users search for a module, they are very likely to type only the first few letters. The few distance functions that allowed typos expected the query string to already be complete. An example of this is the Levenshtein distance.

My distance function is slightly inspired by command-score but otherwise created by myself to solve the above-listed problems.
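For illustration only, one well-known way to meet both requirements is a prefix edit distance: fill the standard edit-distance table, then take the smallest value in the last row, so the query is compared against every prefix of the item rather than the whole item. This is a minimal sketch under that assumption and is not the typo-safe-search implementation; the name `prefixEditDistance` is hypothetical.

```typescript
// Hypothetical helper, NOT the typo-safe-search algorithm.
// Returns the smallest edit distance between `query` and any prefix of `item`.
function prefixEditDistance(query: string, item: string): number {
  const m = query.length;
  const n = item.length;
  // prev[j] = edit distance between the first i-1 chars of query
  // and the first j chars of item (row 0: j insertions).
  let prev: number[] = Array.from({ length: n + 1 }, (_, j) => j);
  for (let i = 1; i <= m; i++) {
    const curr: number[] = [i]; // i deletions against the empty prefix
    for (let j = 1; j <= n; j++) {
      const cost = query[i - 1] === item[j - 1] ? 0 : 1;
      curr.push(
        Math.min(
          prev[j] + 1, // delete a query character
          curr[j - 1] + 1, // insert an item character
          prev[j - 1] + cost // match or substitute
        )
      );
    }
    prev = curr;
  }
  // The minimum over the last row lets the item continue past the query.
  return Math.min(...prev);
}
```

With this sketch, `prefixEditDistance("poi", "poison")` is 0 because the query is an exact prefix, while a mistyped query such as `"poy"` still scores 1 against `"poison"`.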

SenkJu · 2024-05-26T13:38:55Z

Hm, basic distance functions are generally considered a solved problem (see Levenshtein distance). Your implementation appears rather inefficient to me. Consider using something like the Wagner-Fischer algorithm for much higher efficiency.
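For reference, the Wagner-Fischer algorithm fills a dynamic-programming table of edit distances between prefixes of the two strings, in O(m·n) time. The sketch below is a plain two-row version written for this discussion; it is not the js-levenshtein source, and the function name `levenshtein` is just a label.

```typescript
// Minimal Wagner-Fischer sketch: Levenshtein distance between a and b.
function levenshtein(a: string, b: string): number {
  // prev[j] = distance between the first i-1 chars of a and the first j chars of b.
  let prev: number[] = Array.from({ length: b.length + 1 }, (_, j) => j);
  for (let i = 1; i <= a.length; i++) {
    const curr: number[] = [i]; // distance from a[0..i) to the empty string
    for (let j = 1; j <= b.length; j++) {
      const cost = a[i - 1] === b[j - 1] ? 0 : 1;
      curr.push(
        Math.min(
          prev[j] + 1, // deletion
          curr[j - 1] + 1, // insertion
          prev[j - 1] + cost // match or substitution
        )
      );
    }
    prev = curr; // only two rows are kept, so memory stays O(n)
  }
  return prev[b.length];
}
```

For example, `levenshtein("kitten", "sitting")` returns 3.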

Ell1ott · 2024-05-26T15:03:48Z

I ran some tests with the js-levenshtein library and got results where I would prefer the old system (which just ranks items depending on whether the query exists in the item). Let's take the following example. The user wants to search for 'poison' in a list of words and starts by typing 'poi'. Here, js-levenshtein would recommend the following words as the best for the query 'poi':

pool
spit
war
poison
dose

Whereas my distance function would recommend these:

poison
productive
proportion
pool
proof

Regarding the algorithm's performance, js-levenshtein was a little under twice as fast as my implementation in my tests. It took js-levenshtein about 4364 ms to sort a list of 140 random words 60000 times. It took my implementation 7811 ms to do the same. In the end, this difference won't be noticeable to the end user because both are extremely fast, with mine being able to sort the list of 140 items about 7,700 times per second. And when testing it in the client, I did not feel any speed difference compared to the old system.
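A measurement like this can be reproduced with a small harness along the following lines. This is only a sketch of the general approach, not the script used for the numbers above; `timeSorts`, `sortsPerSecond`, and the `Score` type are hypothetical names, and `score` stands in for whichever distance function is being tested.

```typescript
// Hypothetical benchmark helpers; lower score means a better match.
type Score = (query: string, item: string) => number;

// Sorts a fresh copy of `words` by score, `runs` times, and returns elapsed ms.
function timeSorts(words: string[], query: string, score: Score, runs: number): number {
  const start = Date.now();
  for (let r = 0; r < runs; r++) {
    // Copy first so every run starts from the same unsorted input.
    [...words].sort((a, b) => score(query, a) - score(query, b));
  }
  return Date.now() - start;
}

// Converts a run count and elapsed milliseconds into sorts per second.
function sortsPerSecond(runs: number, elapsedMs: number): number {
  return (runs / elapsedMs) * 1000;
}
```

Plugging in the figures quoted above, `sortsPerSecond(60000, 7811)` comes out at roughly 7,680 and `sortsPerSecond(60000, 4364)` at roughly 13,750.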

If you know any alternatives to Levenshtein that would give better results, I would be happy to take a closer look at them 🙂

Ell1ott · 2024-05-29T18:55:04Z

After trying to further improve the performance of my algorithm, I have managed to bring it down to about 6000 ms to sort the list of 140 words 60000 times.

Ell1ott · 2024-06-12T17:32:44Z

@SenkJu Do you still have concerns or could we maybe merge?

Ell1ott added 5 commits May 17, 2024 22:52

Improve search using typo-safe-search

0595205

update package version

98ccba7

Filter out unrelated results

d789114

Merge branch 'nextgen' into improved-search

976a524

Update lib

79d3688

Ell1ott changed the title ~~Improved module search~~ Feat: Improved module search May 23, 2024

Ell1ott added 3 commits May 24, 2024 23:27

updating lib

b843f92

Merge branch 'nextgen' into improved-search

15f06bb

Change lib version

148ab84

Update lib

023997e

Ell1ott marked this pull request as ready for review May 25, 2024 11:54

1zun4 requested a review from SenkJu May 25, 2024 15:06

1zun4 added this to the 0.6.0 milestone May 25, 2024

Merge branch 'nextgen' into improved-search

e1665a0

1zun4 modified the milestones: 0.6.0, 0.7.0 May 26, 2024

Update library

f736514

Ell1ott marked this pull request as draft May 29, 2024 23:11

Update string comparing library

07c1445

Ell1ott marked this pull request as ready for review May 30, 2024 11:44

Ell1ott added 3 commits May 30, 2024 13:45

Merge branch 'nextgen' into improved-search

06054ec

Update string compare library

b97ebbb

Merge branch 'nextgen' into improved-search

fc2fd78
