
Support Nvidia GPU Feature Discovery #1219

Open
p53 opened this issue May 1, 2024 · 9 comments
Labels
kind/feature Categorizes issue or PR as related to a new feature. triage/accepted Indicates an issue or PR is ready to be actively worked on.

Comments

@p53

p53 commented May 1, 2024

Description

Original Title: Ignore node selector labels for provisioning

What problem are you trying to solve?

We have the NVIDIA GPU operator, which installs the NVIDIA runtime etc. on Karpenter nodes after they are provisioned; the operator runs feature discovery and applies the appropriate nvidia labels, and we need to place pods on these Karpenter nodes depending on those labels. The problem is that when I put nvidia labels in a pod's nodeSelector that are not in the NodePool (because they are only applied to nodes at runtime by the NVIDIA operator), Karpenter fails to provision nodes. A solution might be, e.g., placing an annotation on the pod such as karpenter.sh/ignore-label=somelabel so that Karpenter ignores that label during provisioning.
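A minimal sketch of what that could look like (the karpenter.sh/ignore-label annotation is the hypothetical mechanism proposed here, not an existing Karpenter API; the nvidia.com labels come from GFD output and the image is just an example):

apiVersion: v1
kind: Pod
metadata:
  name: cuda-workload
  annotations:
    # Hypothetical annotation from this proposal: tell Karpenter to skip
    # this label when simulating scheduling for provisioning.
    karpenter.sh/ignore-label: nvidia.com/gpu.family
spec:
  nodeSelector:
    # Applied by GPU Feature Discovery only after the node is running,
    # so the NodePool cannot know it at provisioning time.
    nvidia.com/gpu.family: ampere
  containers:
  - name: cuda
    image: nvcr.io/nvidia/cuda:12.3.2-base-ubuntu22.04  # example image
    resources:
      limits:
        nvidia.com/gpu: "1"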

How important is this feature to you?

  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • Please do not leave "+1" or "me too" comments, they generate extra noise for issue followers and do not help prioritize the request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment
@p53 p53 added kind/feature Categorizes issue or PR as related to a new feature. needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels May 1, 2024
@jonathan-innis
Member

> the operator runs feature discovery and applies the appropriate nvidia labels

What kind of feature discovery are you talking about here? Is it stuff related to the properties of the instance type that we are launching?

@Bryce-Soghigian
Member

https://github.com/NVIDIA/gpu-feature-discovery?tab=readme-ov-file#deploy-nvidia-gpu-feature-discovery-gfd

GFD adds labels after the nodes have already been created:

$ kubectl get nodes -o yaml
apiVersion: v1
items:
- apiVersion: v1
  kind: Node
  metadata:
    ...

    labels:
      nvidia.com/cuda.driver.major: "455"
      nvidia.com/cuda.driver.minor: "06"
      nvidia.com/cuda.driver.rev: ""
      nvidia.com/cuda.runtime.major: "11"
      nvidia.com/cuda.runtime.minor: "1"
      nvidia.com/gpu.compute.major: "8"
      nvidia.com/gpu.compute.minor: "0"
      nvidia.com/gfd.timestamp: "1594644571"
      nvidia.com/gpu.count: "1"
      nvidia.com/gpu.family: ampere
      nvidia.com/gpu.machine: NVIDIA DGX-2H
      nvidia.com/gpu.memory: "39538"
      nvidia.com/gpu.product: A100-SXM4-40GB
      ...
...

@Bryce-Soghigian
Member

Bryce-Soghigian commented May 3, 2024

Basically you are requesting that, for a workload requiring a node with those labels, we create a node with those labels, but the NodePool is not aware of these labels and we won't be aware of them. They aren't added until GFD goes and adds them, i.e. after the GPU nodes are provisioned?

How can Karpenter know these traits? This seems relevant to per-instance-type overrides: if you know particular instance types will have particular traits, then we could use a ConfigMap to say these instance types have these values for the overrides.

Do these values differ from node to node? It seems the CUDA runtime depends on the GPU drivers installed on the node, so we can't just cache them directly.
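A rough sketch of the per-instance-type override idea from above (the ConfigMap name and key layout are hypothetical; no such ConfigMap exists in Karpenter today):

apiVersion: v1
kind: ConfigMap
metadata:
  # Hypothetical name; Karpenter has no such ConfigMap today.
  name: karpenter-gpu-label-overrides
  namespace: kube-system
data:
  # Labels Karpenter would assume GFD applies to each instance type,
  # so its scheduling simulation can match pods against them.
  g5.xlarge: |
    nvidia.com/gpu.product: A10G
    nvidia.com/gpu.count: "1"
  p4d.24xlarge: |
    nvidia.com/gpu.product: A100-SXM4-40GB
    nvidia.com/gpu.count: "8"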

@p53
Author

p53 commented May 3, 2024

> Basically you are requesting that, for a workload requiring a node with those labels, we create a node with those labels, but the NodePool is not aware of these labels and we won't be aware of them. They aren't added until GFD goes and adds them, i.e. after the GPU nodes are provisioned?

Yup, that's right.

> How can Karpenter know these traits? This seems relevant to per-instance-type overrides: if you know particular instance types will have particular traits, then we could use a ConfigMap to say these instance types have these values for the overrides.

I don't know precisely how Karpenter works internally; it is probably possible to know these labels, or at least some of them, ahead of time and configure them statically. Best would be if we did not need to define them statically in config at all.

> Do these values differ from node to node? It seems the CUDA runtime depends on the GPU drivers installed on the node, so we can't just cache them directly.

We have, e.g., all AWS g5 instances in one NodePool, so for sure the labels will differ for each instance type, depending on the instance type's GPU; having each instance type in a separate NodePool would be quite impractical.
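For reference, a NodePool covering the whole g5 family might look like this (a sketch against the v1beta1 API that was current at the time; karpenter.k8s.aws/instance-family is the AWS provider's well-known label):

apiVersion: karpenter.sh/v1beta1
kind: NodePool
metadata:
  name: gpu
spec:
  template:
    spec:
      requirements:
      # One NodePool for every g5 size; GFD labels such as
      # nvidia.com/gpu.count still differ between sizes (g5.xlarge
      # has 1 GPU, g5.12xlarge has 4), so they cannot be listed
      # statically here.
      - key: karpenter.k8s.aws/instance-family
        operator: In
        values: ["g5"]
      nodeClassRef:
        apiVersion: karpenter.k8s.aws/v1beta1
        kind: EC2NodeClass
        name: default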

@p53
Author

p53 commented May 3, 2024

DRA (#1231) would probably solve the "knowing before" part, since third-party drivers would publish NodeResourceSlices when running on the cluster. I'm not sure about its flexibility, though: we are still assuming that something is there beforehand, and it is constrained to resources only.
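Very roughly, the idea is that a driver advertises per-node device properties through an object like the one below (an illustrative sketch only: DRA was alpha at the time and the ResourceSlice schema has changed across Kubernetes releases, so the exact fields should not be taken as any released API):

# Illustrative only; field names approximate the alpha DRA API
# and may not match any single released version exactly.
apiVersion: resource.k8s.io/v1alpha2
kind: ResourceSlice
nodeName: gpu-node-1
driverName: gpu.nvidia.com
namedResources:
  instances:
  - name: gpu-0
    attributes:
    - name: product
      string: A100-SXM4-40GB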

@p53
Author

p53 commented May 4, 2024

Also, Node Feature Discovery similarly adds labels to nodes, e.g. for CPU capabilities.
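For example, NFD publishes node labels like these (real NFD label keys; the values depend on the node's hardware):

labels:
  feature.node.kubernetes.io/cpu-cpuid.AVX2: "true"
  feature.node.kubernetes.io/cpu-cpuid.AVX512F: "true"
  feature.node.kubernetes.io/kernel-version.major: "6"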

@jonathan-innis
Member

jonathan-innis commented May 14, 2024

> Best would be if we did not need to define them statically in config at all.

I think the ideal state here is defining what the different configurations for the GPU feature discovery operator can be, and then seeing if we can surface first-class support for them in Karpenter directly.

Like you mentioned, having to statically configure all of these values is going to be a huge pain; ideally Karpenter could auto-discover them by matching its logic up with what Nvidia tells us should be on these instance types.


I'm wondering if it makes sense to retitle this issue to be more specific to the use-case. Something like: "Support Nvidia GPU Feature Discovery". @p53 What do you think?

@jonathan-innis
Member

/triage accepted

@k8s-ci-robot k8s-ci-robot added triage/accepted Indicates an issue or PR is ready to be actively worked on. and removed needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels May 14, 2024
@p53 p53 changed the title Ignore node selector labels for provisioning Support Nvidia GPU Feature Discovery May 14, 2024
@p53
Author

p53 commented May 14, 2024

@jonathan-innis renamed
