New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature: Embeddings Optimizer (Prompt Rephraser) #2075
Comments
@spammenotinoz Hey dude, with a free high-performance service like Groq, you can use [one-api] (https://github.com/songquanpeng/one-api) to sum it up and then use it in open webui |
Hi I already use one-api for another project that already has the "sum it up feature". I don't use one-api here, as litellm already load-balances multiple API keys. |
Brilliant project, but currently using another project as when using this one my API costs are considerably higher.
High API Usage Costs with embeddings.
A clear and concise description of what you want to happen.
Typically this can have a major speed improvement and cost reduction for premium \ large models.
Describe alternatives you've considered
Additional context
Example of a Rephraser and Reranker as used on another project. This pull was not merged into the product, but works extremely well.
I do not use the Reranker jusr the Rephraser.
No modifications to GUI
The text was updated successfully, but these errors were encountered: