Each of the currently supported LLM providers, such as OpenAI, Mistral, and Azure, has its own specific configuration and implementation. For instance, in the case of OpenAI, the model is specified in the request body, and the rate-limiting parameters (prompt tokens, completion tokens, and total tokens) are also carried in the body.
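For reference, a minimal sketch in Go (struct names are illustrative, field tags follow the public OpenAI chat-completions schema) showing that the model lives in the request body and the token counts used for rate limiting live in the response body's usage object:

```go
// Illustrative structs; names are hypothetical, JSON tags mirror the
// public OpenAI chat-completions request/response bodies.
package main

// The model is selected inside the request body, not via a header or path.
type ChatCompletionRequest struct {
	Model    string    `json:"model"`
	Messages []Message `json:"messages"`
}

type Message struct {
	Role    string `json:"role"`
	Content string `json:"content"`
}

// Token counts used for rate limiting arrive in the response body.
type ChatCompletionResponse struct {
	Model string `json:"model"`
	Usage Usage  `json:"usage"`
}

type Usage struct {
	PromptTokens     int `json:"prompt_tokens"`
	CompletionTokens int `json:"completion_tokens"`
	TotalTokens      int `json:"total_tokens"`
}
```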
The current approach handles these provider-specific configurations and logic (e.g., transformations) inside translators and external processors (extproc). While this works, it tightly couples the implementation to a predefined set of providers and may limit flexibility for supporting custom LLMs.
To make the implementation more flexible and extensible, can we explore generalizing the approach by storing provider-specific and rate-limiting information directly within the LLMRoute resource? And if we do, does the resource then effectively become an LLM Provider rather than an LLM Route?
Key considerations could be:
Generalization: How can LLMRoute support diverse providers (and different models) and rate-limiting use cases while remaining extensible?
Rate-limiting: Should LLMRoute include fields for prompt tokens, completion tokens, and total tokens, or should these remain provider-specific? (A rough sketch follows below.)
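To make the question concrete, here is a hypothetical sketch (Go CRD types; all names are invented for discussion and do not reflect the current API) of what a generalized spec could look like if provider-specific schema details and token-based rate-limit fields were lifted into the resource itself:

```go
// Hypothetical CRD types -- names are illustrative only.
package v1alpha1

import metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"

// LLMRouteSpec sketches a provider-agnostic shape: each backend declares
// the API schema it speaks, and rate limiting is expressed as generic
// token budgets rather than OpenAI-specific fields.
type LLMRouteSpec struct {
	// Backends lists the upstream LLM providers this route can target.
	Backends []LLMBackend `json:"backends"`
}

type LLMBackend struct {
	// Name identifies the backend (e.g. "openai", "mistral", or a custom LLM).
	Name string `json:"name"`
	// Schema declares the request/response schema of the backend, so
	// translators/extproc can be selected generically instead of
	// hard-coding per-provider logic.
	Schema APISchema `json:"schema"`
	// RateLimits expresses token-based limits in a provider-neutral way.
	RateLimits []TokenRateLimit `json:"rateLimits,omitempty"`
}

type APISchema struct {
	// Name of the API schema, e.g. "OpenAI" or "Custom".
	Name string `json:"name"`
	// Version of the schema, e.g. "v1".
	Version string `json:"version,omitempty"`
}

// TokenRateLimit counts one class of tokens against a budget per window.
type TokenRateLimit struct {
	// Type is one of "prompt", "completion", or "total".
	Type string `json:"type"`
	// Limit is the number of tokens allowed per window.
	Limit uint64 `json:"limit"`
	// Window is the refill interval.
	Window metav1.Duration `json:"window"`
}
```

Under a shape like this, the extproc would only need to know how to extract token usage for a declared schema, not for each named provider, which is one possible answer to the coupling concern above.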
Looking forward to thoughts and suggestions.