Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat:core:#95 #97

Closed
wants to merge 2 commits into from
Closed

feat:core:#95 #97

wants to merge 2 commits into from

Conversation

louis030195
Copy link
Contributor

Description

Core step towards

results = (
  await embedbase
  .dataset(recipe_id, farm_id, user_id, locaton_id)
  .search(question, max_token=3000, ratio=[.7, .1, .1, .1])
)

Related Issue

#95

Type of Change

  • 📚 Examples / docs / tutorials / dependencies update
  • 🔧 Bug fix (non-breaking change which fixes an issue)
  • 🥂 Improvement (non-breaking change which improves an existing feature)
  • 🚀 New feature (non-breaking change which adds functionality)
  • 💥 Breaking change (fix or feature that would cause existing functionality to change)
  • 🔐 Security fix

Checklist

  • I've written tests for all new methods and classes that I created.
  • I've written the docstring in Google format for all the methods and classes that I used.

@louis030195 louis030195 self-assigned this May 15, 2023
@vercel
Copy link

vercel bot commented May 15, 2023

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Comments Updated (UTC)
embedbase ✅ Ready (Inspect) Visit Preview 💬 Add feedback May 15, 2023 2:51pm
embedbase-docs ✅ Ready (Inspect) Visit Preview 💬 Add feedback May 15, 2023 2:51pm

@louis030195
Copy link
Contributor Author

@hotkartoffel happy to receive feedback on how to best handle duplicate in this

example use case in tests:
image

Should we filter duplicate now or in a later version? (i.e. when the same data is present across multiple datasets of the same user)

@louis030195
Copy link
Contributor Author

louis030195 commented May 15, 2023

results = (
  await embedbase
  .dataset(recipe_id, farm_id, user_id, locaton_id)
  .search(question, max_token=3000, ratio=[.7, .1, .1, .1])
)

or

results = (
  await embedbase
  .dataset(recipe_id)
  .dataset(farm_id)
  .dataset(user_id)
  .dataset(locaton_id)
  .search(question)
  .limit(max_token=3000)
  .ratio([.7, .1, .1, .1])
)

or

results = (
  await embedbase
  .dataset(recipe_id, farm_id, user_id, locaton_id)
  .search(question)
  .limit(max_token=3000)
  .ratio([.7, .1, .1, .1])
)

or

results = (
  await embedbase
  .dataset(recipe_id, farm_id, user_id, locaton_id)
  .search(question)
  .limit(max_token=3000, ratio=[.7, .1, .1, .1])
)

or

results = (
  await embedbase
  .dataset(recipe_id, farm_id, user_id, locaton_id)
  .create_context(question, max_token=3000, ratio=[.7, .1, .1, .1])
)

or

results = (
  await embedbase
  .dataset(recipe_id, farm_id, user_id, locaton_id)
  .search(question, max_token=3000, ratio=[.7, .1, .1, .1])
  .get()
)

or something else? (ask same for TS)

@louis030195
Copy link
Contributor Author

imho should optimise UX for:

  • least amount of string
  • functional approach
  • while maintaining all degrees of freedom (the right abstraction)

@ccomkhj
Copy link

ccomkhj commented May 23, 2023

Looking forward to this update! @louis030195

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants