Add support for `/health` and `/info` endpoints for TGI #1819

Wauplin · 2023-11-13T08:45:41Z

Originally from @thomwolf on slack (private)

I think we could add wraping for health and info endpoints in TGI to the huggingface_hub client at some point

Docs:

Thought I'm not sure yet how/where to integrate those in InferenceClient. Fow now, let's see if there's more demand for it (=> anyone landing on this issue and that is interested, please let us know!)

The text was updated successfully, but these errors were encountered:

julien-c · 2023-11-13T09:24:48Z

let's see if there's more user demand maybe

MoritzLaurer · 2024-04-22T16:54:29Z

@Wauplin , this came up in a conversation with a customer recently and I think it would be great to support his. if I understand correctly, via /info users of serverless API endpoints could check which TGI version/sha an LLM like llama-3/dbrx/command-r is running on. That's quite important for debugging and understanding which recent features of TGI are supported (e.g. if guidance/function calling finally works or if the model runs on an old TGI version which is incompatible). At the moment, I'm trying to get guidance to work for llama-3 for example and I'm not sure if users can know which TGI version it is running with.

Based on this internal conversation, the TGI version/sha is only available in a private HF repo. would be great to enable users to query this information via our library.

MoritzLaurer · 2024-04-22T17:35:15Z

Another example for why it's very useful for users to know the exact TGI version an endpoint is running on: (internal conversation)

Wauplin added the enhancement New feature or request label Nov 13, 2023

Wauplin mentioned this issue May 3, 2024

Support /info and /health routes in InferenceClient #2269

Merged

Wauplin closed this as completed in #2269 May 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for `/health` and `/info` endpoints for TGI #1819

Add support for `/health` and `/info` endpoints for TGI #1819

Wauplin commented Nov 13, 2023

julien-c commented Nov 13, 2023

MoritzLaurer commented Apr 22, 2024

MoritzLaurer commented Apr 22, 2024

Add support for /health and /info endpoints for TGI #1819

Add support for /health and /info endpoints for TGI #1819

Comments

Wauplin commented Nov 13, 2023

julien-c commented Nov 13, 2023

MoritzLaurer commented Apr 22, 2024

MoritzLaurer commented Apr 22, 2024

Add support for `/health` and `/info` endpoints for TGI #1819

Add support for `/health` and `/info` endpoints for TGI #1819