
Added Azure AI Chat Completion Client #4723

Draft
wants to merge 7 commits into
base: main

Conversation

rohanthacker
Contributor

Related issue number

#4683 Adds initial support for Azure AI Chat Completion Client


@ekzhu
Collaborator

ekzhu commented Dec 16, 2024

@yanivvak can you review this?

@yanivvak

yanivvak commented Dec 17, 2024

@ekzhu @rohanthacker
Great work. I tried to deploy with the 3 different options offered by the Azure AI Inference SDK:

  1. Azure OpenAI - works well. I tried it with Magentic-One, but the WebSurfer got stuck; can you take a look?
  2. Serverless - it works, but I didn't get the full answer (the model was Phi-3.5).
  3. Managed compute - it didn't run for me. I assume it's an issue with the endpoint and not related to your code.

ekzhu linked an issue Dec 17, 2024 that may be closed by this pull request
@@ -50,6 +50,9 @@ video-surfer = [
grpc = [
"grpcio~=1.62.0", # TODO: update this once we have a stable version.
]
azure-ai-inference = [
"azure-ai-inference>=1.0.0b6",
Collaborator

Is there a specific reason for this version lower bound?

Contributor Author

No, this is not intentional; this is what gets added when I run uv add "azure-ai-inference" --optional azure-ai-inference.
Is there another command I should use? I'm not experienced with uv.
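For context, `uv add` records the currently resolved version as a `>=` lower bound by default. If a different constraint is wanted, the optional-dependency entry in `pyproject.toml` can also be edited by hand; a sketch (the upper bound here is a hypothetical choice, not something from the PR):

```toml
[project.optional-dependencies]
azure-ai-inference = [
    # ">=1.0.0b6" alone is what `uv add` emitted; the "<2.0.0" cap is a
    # hypothetical guard against future breaking releases.
    "azure-ai-inference>=1.0.0b6,<2.0.0",
]
```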

usage=usage,
cached=False,
)
yield result
Collaborator

Update object-level usage data.

usage=usage,
cached=False,
)
return response
Collaborator

Update object-level usage data.

raise ValueError("Model does not support JSON output")

if json_output is True:
create_args["response_format"] = ChatCompletionsResponseFormatJSON()
Collaborator

I think we need to check the existing response_format value to make sure we aren't overwriting it.
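A sketch of the guard being suggested: only set `response_format` when the caller has not already supplied one. The function name and the dict stand-in for `ChatCompletionsResponseFormatJSON()` are illustrative, not from the PR:

```python
from typing import Any, Dict


def apply_json_output(create_args: Dict[str, Any], json_output: bool) -> Dict[str, Any]:
    # Only set response_format if the caller has not already provided one,
    # so an explicit value in create_args is never silently overwritten.
    if json_output and "response_format" not in create_args:
        # Stand-in for ChatCompletionsResponseFormatJSON() from the SDK.
        create_args["response_format"] = {"type": "json_object"}
    return create_args
```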

return name


class AzureAIChatCompletionClient(ChatCompletionClient):

@ekzhu
Collaborator

ekzhu commented Dec 18, 2024

@lspinheiro could you help reviewing this PR?


class AzureAIChatCompletionClient(ChatCompletionClient):
def __init__(self, **kwargs: Unpack[AzureAIChatCompletionClientConfig]):
if "endpoint" not in kwargs:
Collaborator

I think this part could benefit from some better separation of concerns between config validation and instantiation. e.g.

class AzureAIChatCompletionClient(ChatCompletionClient):
    def __init__(self, **kwargs: Unpack[AzureAIClientConfiguration]):
        config = self._validate_config(kwargs)
        self._client = self._create_client(config)
        self._create_args = self._prepare_create_args(config)
        # ...

    @staticmethod
    def _validate_config(config: Mapping[str, Any]) -> AzureAIClientConfiguration:
        # Validation logic here
        return config
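Filling in the suggestion above, a runnable sketch of what `_validate_config` could do. The required keys here are an assumption for illustration; the real client would validate the full `AzureAIClientConfiguration` (endpoint, credential, model capabilities, and so on):

```python
from typing import Any, Mapping

# Hypothetical minimal set of required keys for illustration only.
_REQUIRED_KEYS = ("endpoint", "credential")


def validate_config(config: Mapping[str, Any]) -> Mapping[str, Any]:
    # Fail fast with a clear message instead of scattering checks
    # through __init__.
    missing = [key for key in _REQUIRED_KEYS if key not in config]
    if missing:
        raise ValueError(f"Missing required config keys: {missing}")
    return config
```

Keeping validation in a static helper like this makes it unit-testable without constructing a real client.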

@lspinheiro
Collaborator

@lspinheiro could you help reviewing this PR?

Looks quite good and is consistent with the OpenAI client. I have a minor comment about the config validation. @jackgerrits may have more opinions, since a lot of the design decisions here are driven by his original implementation of the OpenAI client. If anything doesn't make sense in this context, he would be the best person to evaluate it.

Development

Successfully merging this pull request may close these issues.

Adding Azure AI inference
4 participants