feat(api): improve caching and place name matching #451

steveoh · 2024-12-24T05:23:59Z

This PR updates the caching layers, introduces a new cache, and adds Levenshtein distance to place name matching.

The BigQuery hosted service now runs on startup and every Sunday (the scheduled query that loads data from Sheets runs on Saturday) and hydrates an in-memory cache. There was no reason to keep more cache levels since it's static data. I also couldn't figure out Firebase Remote Config :(
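As a rough illustration of the "on startup and every Sunday" schedule, here is a minimal Python sketch (the API itself is not written in Python). `next_sunday_refresh`, `GridMappingCache`, and the `loader` callback are hypothetical names, and midnight is an assumed refresh time; none of this is taken from the PR's code.

```python
from datetime import datetime, timedelta
from typing import Callable, Dict, List


def next_sunday_refresh(now: datetime) -> datetime:
    """Return the next Sunday at 00:00 strictly after `now` (assumed refresh time)."""
    days_ahead = (6 - now.weekday()) % 7  # Monday is 0, Sunday is 6
    candidate = (now + timedelta(days=days_ahead)).replace(
        hour=0, minute=0, second=0, microsecond=0
    )
    if candidate <= now:
        # Already on or past this Sunday's midnight, so use next week's.
        candidate += timedelta(days=7)
    return candidate


class GridMappingCache:
    """In-memory cache that is hydrated once on construction (startup)."""

    def __init__(self, loader: Callable[[], Dict[str, List[str]]]) -> None:
        self._loader = loader  # hypothetical stand-in for the BigQuery query
        self._data: Dict[str, List[str]] = {}
        self.refresh()

    def refresh(self) -> None:
        # Replace the whole dictionary at once so readers never see a partial load.
        self._data = dict(self._loader())

    def get(self, key: str) -> List[str]:
        return self._data.get(key, [])
```

A scheduler would call `refresh()` again whenever the clock passes `next_sunday_refresh(...)`.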

I introduced FusionCache since Microsoft's HybridCache does not allow for named caches and we needed different settings for each cache. FusionCache allows for an L1 in-memory cache and an L2 Redis cache.
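To show the idea of named caches with separate settings over an L1/L2 pair, here is a small Python sketch. It is not FusionCache's API: `TwoTierCache`, the cache names, and the TTL values are all hypothetical, and a plain dictionary stands in for Redis.

```python
import time
from typing import Any, Callable, Dict, Tuple


class TwoTierCache:
    """Named cache with a per-instance L1 dictionary and a shared L2 store."""

    def __init__(
        self,
        name: str,
        l1_ttl_seconds: float,
        l2: Dict[str, Any],  # stand-in for a shared Redis instance
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.name = name
        self.l1_ttl_seconds = l1_ttl_seconds
        self._l2 = l2
        self._clock = clock
        self._l1: Dict[str, Tuple[Any, float]] = {}

    def get_or_set(self, key: str, factory: Callable[[str], Any]) -> Any:
        now = self._clock()
        hit = self._l1.get(key)
        if hit is not None and hit[1] > now:
            return hit[0]  # L1 hit: no network call at all
        l2_key = f"{self.name}:{key}"  # the cache name keeps entries separate
        if l2_key in self._l2:
            value = self._l2[l2_key]  # L2 hit: skips the expensive source
        else:
            value = factory(key)  # miss on both levels: read the source once
            self._l2[l2_key] = value
        self._l1[key] = (value, now + self.l1_ttl_seconds)
        return value


shared_l2: Dict[str, Any] = {}
api_keys = TwoTierCache("api-keys", l1_ttl_seconds=300, l2=shared_l2)
place_names = TwoTierCache("place-names", l1_ttl_seconds=3600, l2=shared_l2)
```

Each named cache carries its own expiration while sharing one L2 store, which is the property we needed.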

The Firestore fusion cache will help reduce the number of calls to read API keys. We sometimes incurred a read cost since there was a 1:1 relationship between requests and Firestore reads.

The place name fusion cache will allow us to correct minor place name mistakes. First I check the BigQuery memory cache for our standard names and, on a miss, check the fusion cache. On another miss, I get all our standard names and use Levenshtein distance to find the closest match. If one is found, I get the grids from the BigQuery memory cache and update the fusion cache with the misspelling. Otherwise I set the misspelling to empty so it's not run through Levenshtein again.
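The lookup order described above can be sketched in Python roughly as follows. This is an illustration rather than the code in this PR: `find_grids`, `MAX_DISTANCE`, the two dictionaries, and the sample grid names are hypothetical, and the distance threshold of 2 is an assumption.

```python
from typing import Dict, List

MAX_DISTANCE = 2  # assumed cutoff for a "minor" mistake


def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance (insert, delete, substitute)."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(
                min(previous[j] + 1, current[j - 1] + 1, previous[j - 1] + cost)
            )
        previous = current
    return previous[-1]


def find_grids(
    place: str,
    standard_names: Dict[str, List[str]],  # stand-in for the BigQuery memory cache
    fuzzy_cache: Dict[str, List[str]],  # stand-in for the place name fusion cache
) -> List[str]:
    key = place.strip().lower()
    if key in standard_names:  # 1. exact match on a standard name
        return standard_names[key]
    if key in fuzzy_cache:  # 2. a misspelling we have already resolved
        return fuzzy_cache[key]
    # 3. compare against every standard name and keep the closest one
    best = min(standard_names, key=lambda name: levenshtein(key, name), default=None)
    if best is not None and levenshtein(key, best) <= MAX_DISTANCE:
        fuzzy_cache[key] = standard_names[best]
    else:
        fuzzy_cache[key] = []  # remember the miss so it is not recomputed
    return fuzzy_cache[key]


names = {"salt lake city": ["grid-1"], "provo": ["grid-2"]}
fuzzy: Dict[str, List[str]] = {}
print(find_grids("Salt Lake Ctiy", names, fuzzy))
```

Caching the empty result matters as much as caching the correction, since a repeated unknown name would otherwise trigger a full scan on every request.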

I chose not to add this feature to zip codes and instead expect an exact match, since zip codes are very similar to one another (84101 and 84102 are a single edit apart) and Levenshtein could produce strange results. Zips use the BigQuery memory cache.

steveoh added 10 commits December 23, 2024 22:10

deps(api): add fusion cache with some cve dep updates

d58babf

chore(api): register fusion caches

46fcbf1

refactor(api): update memory cache of mappings every sunday

4db312b

feat(api): use fusion cache and levenshtein to improve place name finding

9554354

feat(api): place fusion cache in front of api key repository

12529ca

chore(api): handle startup path

b803587

chore(api): add grid mapping health check

dfd2a74

chore(api): remove redis cache and health check

3b3f4a2

chore(api): correct zip lookup response

790ac59

chore(api): update timeouts

797fdac

steveoh changed the base branch from main to dev December 27, 2024 19:41

steveoh marked this pull request as ready for review December 27, 2024 19:41

steveoh merged commit b85e7d7 into dev Dec 27, 2024
5 checks passed

steveoh deleted the feat/fusion-cache branch December 27, 2024 19:43

steveoh added a commit that referenced this pull request Dec 30, 2024

feat(api): improve caching and place name matching (#451)

1e6dabd

ugrc-release-bot bot mentioned this pull request Dec 30, 2024

chore: api release v1.17.0 #453

Merged
