You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
After adding a few more models to our staging instance and testing them, looking to the LiteLLM_SpendLogs contents, I've noticed a few inconsistencies. I think that they are not critical but, maybe, it would be great if they can be done more consistently:
Some models, like dall-e-2 and dall-e-3 (OpenAI) don't get the api_base in the table filled ever.
Some cache_hit = True requests, for some models, like gpt-4o and gpt-4o-mini (from OpenAI), or llama-3.1-8b-instant and llama-3.3-70b-versatile (from Groq) don't get the 'api_base' either. Also, perplexity's ones. In the other side, some providers like ollama_chat get the api_base for cache hits filled. Maybe it's something by provider, it would be great to get it managed consistently over all them.
Ciao :-)
Relevant log output
Not logs, but a query and its results, showing the behaviour described above. Hope it's illustrative:
What happened?
After adding a few more models to our staging instance and testing them, looking to the LiteLLM_SpendLogs contents, I've noticed a few inconsistencies. I think that they are not critical but, maybe, it would be great if they can be done more consistently:
dall-e-2
anddall-e-3
(OpenAI) don't get theapi_base
in the table filled ever.cache_hit = True
requests, for some models, likegpt-4o
andgpt-4o-mini
(from OpenAI), orllama-3.1-8b-instant
andllama-3.3-70b-versatile
(from Groq) don't get the 'api_base' either. Also, perplexity's ones. In the other side, some providers like ollama_chat get theapi_base
for cache hits filled. Maybe it's something by provider, it would be great to get it managed consistently over all them.Ciao :-)
Relevant log output
Not logs, but a query and its results, showing the behaviour described above. Hope it's illustrative:
Are you a ML Ops Team?
No
What LiteLLM version are you on ?
v1.55.1
Twitter / LinkedIn details
No response
The text was updated successfully, but these errors were encountered: