Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support budget/rate limit tiers for keys #7429

Merged
merged 22 commits into from
Dec 27, 2024
Merged

Conversation

krrishdholakia
Copy link
Contributor

Title

Support budget/rate limit tiers for keys

If it's hard to maintain mapping of model: tpm / rpm per key, use this to:

  • create a budget with tpm/rpm limits (including max_max_budget) - /budget/new
  • assign the budget to a key on /key/generate via a new budget_id field

This allows you to increase people's tpm/rpm limits, by just changing the budget_id associated with the key.

Relevant issues

Enables easier key / rate limit management in orgs

Type

🆕 New Feature

Changes

[REQUIRED] Testing - Attach a screenshot of any new tests passing locally

If UI changes, send a screenshot/GIF of working UI fixes

…ed_view for key

allows user to create rate limit tiers and associate those to keys
…x budget metrics with the associated budget table values if set

allows rate limit tiers to be easily applied to keys
…in key generate

make it easy for user to know associated budget on key creation
…nto separate file - makes it easier to run documentation testing against it
Copy link

vercel bot commented Dec 26, 2024

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Comments Updated (UTC)
litellm ✅ Ready (Inspect) Visit Preview 💬 Add feedback Dec 27, 2024 2:30am

* (Feat) Add logging for `POST v1/fine_tuning/jobs`  (#7426)

* init commit ft jobs logging

* add ft logging

* add logging for FineTuningJob

* simple FT Job create test

* (docs) - show all supported Azure OpenAI endpoints in overview  (#7428)

* azure batches

* update doc

* docs azure endpoints

* docs endpoints on azure

* docs azure batches api

* docs azure batches api

* fix(key_management_endpoints.py): fix key update to actually work

* test(test_key_management.py): add e2e test asserting ui key update call works

* fix: proxy/_types - fix linting erros

* test: update test

---------

Co-authored-by: Ishaan Jaff <[email protected]>
@krrishdholakia krrishdholakia merged commit 539f166 into main Dec 27, 2024
25 of 28 checks passed
@krrishdholakia krrishdholakia deleted the litellm_dev_12_26_2024_p1 branch December 27, 2024 05:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant