
Add support for Azure, llama2, palm, claude2, cohere command nightly (etc) #74

Closed
wants to merge 1 commit

Conversation

ishaan-jaff

@ishaan-jaff ishaan-jaff commented Aug 30, 2023

This PR adds support for models from all of the providers mentioned above via https://github.com/BerriAI/litellm/

Here's a sample of how it's used:

import os

from litellm import completion, acompletion  # acompletion is the async variant

## set ENV variables
# ENV variables can be set in a .env file, too. Example in .env.example
os.environ["OPENAI_API_KEY"] = "openai key"
os.environ["COHERE_API_KEY"] = "cohere key"

messages = [{"content": "Hello, how are you?", "role": "user"}]

# openai call
response = completion(model="gpt-3.5-turbo", messages=messages)

# llama2 call (via replicate)
model_name = "replicate/llama-2-70b-chat:2c1608e18606fad2812020dc541930f2d0495ce32eee50074220b87300bc16e1"
response = completion(model=model_name, messages=messages)

# cohere call
response = completion(model="command-nightly", messages=messages)

# anthropic call
response = completion(model="claude-instant-1", messages=messages)
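As a rough illustration only (a guess at the pattern, not LiteLLM's actual internals), the provider-prefixed model strings in the sample above suggest that a multi-provider router dispatches on the model name; `pick_provider` below is a hypothetical helper written for this sketch:

```python
def pick_provider(model: str) -> str:
    """Guess the backing provider from a model identifier.

    Illustrative sketch: explicit "provider/..." prefixes (as in the
    replicate example above) win, otherwise fall back to well-known
    model-name prefixes.
    """
    if "/" in model:
        return model.split("/", 1)[0]  # e.g. "replicate/llama-2-..." -> "replicate"
    if model.startswith("gpt-"):
        return "openai"
    if model.startswith("claude"):
        return "anthropic"
    if model.startswith("command"):
        return "cohere"
    return "unknown"
```

Routing on a string identifier is what lets a single `completion()` entry point cover many backends without the caller importing provider-specific SDKs.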

@ishaan-jaff
Author

@minimaxir can you please take a look at this PR when possible? 😊 Happy to add docs/tests too if this initial commit looks good.

@ishaan-jaff
Author

Addressing:
#73
#71
#70 (works with fine tuned models)
#43

@minimaxir
Owner

Looking into this now: it'll take a bit longer since it's adding a dependency.

@minimaxir
Owner

minimaxir commented Sep 4, 2023

After some investigation into LiteLLM, I will have to reject adding it, despite the high demand for alternative services, for a number of reasons:

  1. This PR is insufficient. Every instance of hitting the API would need to be updated, along with a full refactor. Additionally, the new behavior would have to be documented from scratch, which is partly why I'm intending to add new services one by one.
  2. LiteLLM's implementation is highly redundant with what simpleaichat already does, and it brings in its own extra dependencies.
  3. Your implementations for pinging the API and ETLing the I/O are inefficient and would result in a slowdown for simpleaichat. Additionally, it's not clear if your hacks used to interface with non-ChatGPT APIs are optimal.
  4. Your demo notebooks have undocumented behavior of automatically creating a dashboard, which is a complete nonstarter.

The design intent of simpleaichat is to be very clear, transparent, and consistent, even in its codebase.

@minimaxir minimaxir closed this Sep 4, 2023
@ishaan-jaff
Author

ishaan-jaff commented Sep 4, 2023

@minimaxir thanks for the feedback

Your implementations for pinging the API and ETLing the I/O are inefficient and would result in a slowdown for simpleaichat. Additionally, it's not clear if your hacks used to interface with non-ChatGPT APIs are optimal.

Was there something in particular that made it seem like it would result in a slowdown and sub-optimal results?

Your demos notebooks have undocumented behavior of automatically creating a dashboard, which is a complete nonstarter.

Thanks for pointing that out; it was an experimental feature that users can opt in to. We will clean that up.

@minimaxir
Owner

Was there something in particular that made it seem like it would result in a slowdown and sub-optimal results ?

More about optimization in general (e.g. minimizing serialization overhead, minimizing HTTP session creation).
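As a rough illustration of the session-creation point (not simpleaichat's or LiteLLM's actual code), the `Session` stub below counts constructions so the two call patterns can be compared; in a real client each construction would mean a fresh TCP/TLS handshake:

```python
class Session:
    """Stand-in for an HTTP session; counts how many are constructed."""
    created = 0

    def __init__(self):
        Session.created += 1

    def post(self, payload):
        # stand-in for a real HTTP POST to the API
        return {"echo": payload}

def call_per_request(payloads):
    # Anti-pattern: a new session (new connection setup) for every call
    return [Session().post(p) for p in payloads]

def call_shared(payloads):
    # Preferred: one session created once and reused across all calls
    session = Session()
    return [session.post(p) for p in payloads]
```

For N calls, the first pattern constructs N sessions while the second constructs one, which is the kind of overhead the comment above is pointing at.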
