Feature: Embeddings Optimizer (Prompt Rephraser) #2075

Closed
spammenotinoz opened this issue May 8, 2024 · 2 comments

Comments

@spammenotinoz

Brilliant project, but I am currently using another project because my API costs are considerably higher with this one.

High API Usage Costs with embeddings.

A clear and concise description of what you want to happen.

  • Ability to send embeddings to a REPHRASER (a lower-cost model) before sending only the relevant tokens to the user's chosen model.
    This can typically yield a major speed improvement and cost reduction for premium / large models.
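
A minimal sketch of the flow requested above, assuming "embeddings" here means the text chunks retrieved via embedding search. Every name in it (`Model`, `build_rephrase_prompt`, `answer_with_rephraser`) is hypothetical and not part of Open WebUI or of the other project; the two models are plain callables so that any client can be plugged in.

```python
from typing import Callable, List

# Hypothetical type: any function mapping a prompt string to a completion string.
Model = Callable[[str], str]


def build_rephrase_prompt(question: str, chunks: List[str]) -> str:
    """Build the prompt that asks the low-cost model to condense the context."""
    context = "\n\n".join(f"[{i + 1}] {chunk}" for i, chunk in enumerate(chunks))
    return (
        "Condense the context below to only the facts needed to answer "
        "the question. Do not answer the question itself.\n\n"
        f"Question: {question}\n\nContext:\n{context}"
    )


def answer_with_rephraser(
    question: str,
    chunks: List[str],
    rephraser: Model,     # assumed: a cheap, fast model
    chosen_model: Model,  # assumed: the premium model picked by the user
) -> str:
    """Send the retrieved chunks through the rephraser first, so the
    chosen model only receives the condensed context."""
    condensed = rephraser(build_rephrase_prompt(question, chunks))
    final_prompt = f"Context:\n{condensed}\n\nQuestion: {question}"
    return chosen_model(final_prompt)
```

The premium model never sees the raw chunks, so its input token count depends on how well the rephraser condenses them. Whether that lowers total cost depends on the price gap between the two models.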

Describe alternatives you've considered

  • Using free high-performance services like Groq

Additional context
Example of a Rephraser and Reranker as used on another project. This pull request was not merged into the product, but it works extremely well.
I do not use the Reranker, just the Rephraser.
No modifications to the GUI.

@spammenotinoz changed the title from "Feature: Prompt Rephraser" to "Feature: Embeddings Optimizer (Prompt Rephraser)" on May 8, 2024
@Yanyutin753
Contributor

Yanyutin753 commented May 11, 2024

@spammenotinoz Hey dude, with a free high-performance service like Groq, you can use [one-api](https://github.com/songquanpeng/one-api) to sum it up and then use it in Open WebUI.

@spammenotinoz
Author

> @spammenotinoz Hey dude, with a free high-performance service like Groq, you can use [one-api](https://github.com/songquanpeng/one-api) to sum it up and then use it in Open WebUI.

Hi, I already use one-api for another project that already has the "sum it up" feature. I don't use one-api here, as LiteLLM already load-balances multiple API keys.
