(Docs) Add support for Llama2, Claude, Palm, Cohere, Replicate (100+ LLMs) #787

Closed · wants to merge 2 commits

Conversation

@krrishdholakia (Contributor) commented Oct 12, 2023

Hey @AntonOsika,

Based on your feedback on PR #660:

This PR shows users how to call custom LLM providers (Bedrock, TogetherAI, Hugging Face TGI, Replicate, AI21, Cohere, etc.) by pointing the API base at a local proxy they can use for experimentation with GPT-Engineer.

This makes no code changes to GPT-Engineer.

Happy to add additional tests/documentation if the initial PR looks good.
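
For illustration, here is a minimal sketch of what that redirection looks like from Python, assuming a LiteLLM-style proxy is already listening on localhost:8000 (the host, port, and model name are placeholders, and this uses the pre-1.0 `openai` SDK that gpt-engineer depended on at the time):

```python
import openai

# Point the OpenAI SDK at a local proxy instead of api.openai.com.
# The proxy translates OpenAI-format requests for the configured backend
# (Bedrock, TogetherAI, Replicate, ...), so no gpt-engineer code changes.
openai.api_base = "http://localhost:8000"  # placeholder proxy address
openai.api_key = "anything"  # real provider keys live in the proxy config

response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",  # the proxy maps this name to the backend model
    messages=[{"role": "user", "content": "Say hello from gpt-engineer"}],
)
print(response.choices[0].message["content"])
```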

@sweep-ai (bot) commented Oct 12, 2023

Apply Sweep Rules to your PR?

  • Apply: Ensure all new functions and classes have very clear, concise and up-to-date docstrings. Take gpt_engineer/ai.py as a good example.
  • Apply: Leftover TODOs in the code should be handled.
  • Apply: All new business logic should have corresponding unit tests in the tests/ directory.
  • Apply: Any clearly inefficient or repeated code should be optimized or refactored.

@krrishdholakia changed the title from "Add support for Llama2, Claude, Palm, Cohere, Replicate (100+ LLMs)" to "(Docs) Add support for Llama2, Claude, Palm, Cohere, Replicate (100+ LLMs)" Oct 12, 2023
@haseeb-heaven commented

Hi @krrishdholakia, adding LiteLLM to support the new models out there is a nice idea and a great suggestion; I really like it. But I have one question: GPT-Engineer already uses LangChain, which supports all of these models out of the box, so why do we need LiteLLM in this PR?
Can you describe which features LangChain is missing that LiteLLM covers?

@krrishdholakia (Contributor, Author) commented

@haseeb-heaven can you show me how I could use Anthropic with gpt-engineer without modifying the code?

@haseeb-heaven commented

> @haseeb-heaven can you show me how I could use Anthropic with gpt-engineer without modifying the code?

I don't know much about this codebase, as I have not contributed enough yet, so I checked the LangChain docs on Anthropic (https://python.langchain.com/docs/integrations/chat/anthropic) and asked GPT-4 about integrating LangChain into the codebase.

Here is a summary of the steps it gave for integrating the Anthropic model from LangChain into the GPT-Engineer codebase (a rough sketch follows the list):

  1. Import the ChatAnthropic class from langchain.chat_models and the necessary message classes from langchain.schema.
  2. Create an instance of the ChatAnthropic class for interacting with the Anthropic model.
  3. Modify the AI class in the gpt_engineer/core/ai.py file to use the ChatAnthropic instance instead of the current model.
  4. Modify the AI class methods start, next, and advance to use the HumanMessage and AIMessage classes for creating and handling messages.
  5. Test the integration by running the application and checking if the Anthropic model is being used correctly.
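
For concreteness, here is a rough, untested sketch of what steps 1, 2, and 4 might look like with the LangChain API of that era; the model name, temperature, and prompt are illustrative, and the actual wiring into gpt_engineer/core/ai.py would require more work:

```python
from langchain.chat_models import ChatAnthropic
from langchain.schema import AIMessage, HumanMessage

# Steps 1-2: build the Anthropic chat model that the AI class would hold
# in place of its OpenAI-backed model (requires ANTHROPIC_API_KEY).
chat = ChatAnthropic(model="claude-2", temperature=0.1)

# Step 4: exchange HumanMessage/AIMessage objects, as the AI class methods
# (start, next, advance) would need to do internally.
messages = [HumanMessage(content="Write a function that reverses a string.")]
reply = chat(messages)  # returns an AIMessage
assert isinstance(reply, AIMessage)
print(reply.content)
```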

@krrishdholakia (Contributor, Author) commented

@haseeb-heaven this is docs for gpt-engineer 😄, not LangChain. Specific to this repo, I don't believe I can use non-OpenAI/Azure models without making code changes or creating a proxy (as these docs indicate).

@haseeb-heaven commented

> @haseeb-heaven this is docs for gpt-engineer 😄, not LangChain. Specific to this repo, I don't believe I can use non-OpenAI/Azure models without making code changes or creating a proxy (as these docs indicate).

LangChain supports Anthropic, so you could use LangChain; but since GPT-Engineer uses OpenAI directly, I guess you would have to make more changes to the codebase first for it to work.

@TheoMcCabe (Collaborator) commented Oct 16, 2023

This seems sensible to me: it's just the README being changed, and it adds value for people who want to use an alternative or local model straight away without making any code changes. Shall we merge?

@krrishdholakia if you run `pre-commit run --all-files` locally, it will fix the failing pre-commit checks for you.

`pip install pre-commit` if it's not installed already.

@krrishdholakia (Contributor, Author) commented

@TheoMcCabe thanks! Will do, and I'll update the PR.

@TheoMcCabe (Collaborator) commented Oct 20, 2023

There is already an example at the top of this document showing how to link to local models by updating the OpenAI base URL variable. There's no need to duplicate this information in the docs. Closing for this reason.

@TheoMcCabe closed this Oct 20, 2023
@ATheorell (Collaborator) commented

I agree that it duplicates the WizardCoder example. However, I think the section with the WizardCoder example could usefully include a short instruction on how to alternatively set up a LiteLLM endpoint. The section on using local/open endpoints could give a (very) brief overview of potential frameworks for hosting a local server (in particular, with links to the relevant tutorials), followed by a unified gpt-engineer call showing how to use them. Feel free to have a shot at this in a new PR, @krrishdholakia.
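
For illustration, a unified setup along those lines might look like the following sketch. The proxy port, project path, and the way gpt-engineer is invoked here are assumptions for illustration, not tested against this repo:

```python
import os
import subprocess

# Assumes a LiteLLM proxy has been started separately (see the LiteLLM
# tutorials) and is listening on localhost:8000; the port is a placeholder.
os.environ["OPENAI_API_BASE"] = "http://localhost:8000"
os.environ["OPENAI_API_KEY"] = "anything"  # real keys live in the proxy config

# The same gpt-engineer invocation then works regardless of which backend
# the proxy routes to. The project path is illustrative.
subprocess.run(["gpt-engineer", "projects/my-project"], check=True)
```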
