
Running executor pods with different nodeselectors #2329

Open
ujjawal-khare-27 opened this issue Nov 21, 2024 · 8 comments

Comments

@ujjawal-khare-27

What question do you want to ask?

  • [x] ✋ I have searched the open/closed issues and my issue is not listed.

I have a requirement where I want to give users the flexibility to choose the number of spot and on-demand executors. Is there any way I can achieve this?

Additional context

No response


@jacobsalway
Member

jacobsalway commented Nov 21, 2024

Hey, do you mean mixing executors between spot and on-demand nodes? For example, 40% on spot and 60% on on-demand?

@ujjawal-khare-27
Author

Yes @jacobsalway.

@jacobsalway
Member

jacobsalway commented Nov 23, 2024

The properties for executors in Spark on Kubernetes apply to all executors, so the answer to your question about different node selectors for different executors is that you can't. However, I think this could be done at the node provisioning and/or scheduling level. Here are some approaches that come to mind:

  • If on AWS, you could use a node group with the desired mix of spot and on-demand capacity.
  • If using Karpenter, you could try this guide to launch a mix of spot and on-demand nodes and force the scheduler to distribute the executor pods using topology spread constraints. However, neither Spark nor the operator supports topology spread constraints right now, so we'd need to add new functionality to support this.
  • Don't use the operator at all and run Spark in standalone mode with separate StatefulSets targeting different node selectors.
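As a sketch of the Karpenter approach, and assuming Karpenter's `karpenter.sh/capacity-type` node label, a NodePool could allow both capacity types while a topology spread constraint on the executor pod template spreads pods across them. The names, label values, and the pod-template injection point are all assumptions; the operator would need new functionality to pass the constraint through:

```yaml
# Hypothetical sketch: a Karpenter NodePool that may provision either
# spot or on-demand capacity. Resource names here are assumptions.
apiVersion: karpenter.sh/v1
kind: NodePool
metadata:
  name: spark-executors
spec:
  template:
    spec:
      requirements:
        - key: karpenter.sh/capacity-type
          operator: In
          values: ["spot", "on-demand"]
---
# Hypothetical executor pod-template fragment: spread executor pods
# evenly across the two capacity types (maxSkew: 1 gives roughly 50/50,
# not an arbitrary ratio like 40/60).
spec:
  topologySpreadConstraints:
    - maxSkew: 1
      topologyKey: karpenter.sh/capacity-type
      whenUnsatisfiable: DoNotSchedule
      labelSelector:
        matchLabels:
          spark-role: executor
```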

@ujjawal-khare-27
Author

Thanks for replying @jacobsalway. We can do that at the scheduler level, but our use case is more like creating Spark as a service, in which users can specify these properties.

I'm willing to submit a PR for this; in my opinion it will help others as well. Let me know your thoughts.

@jacobsalway
Member

Could you go into more detail on how this feature would look? Is it something akin to EMR instance fleets?

@ujjawal-khare-27
Author

I was thinking more in the direction of making the executor field an array type rather than a single executor. That would help extend other functionality as well.
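A purely hypothetical sketch of what such a spec change might look like. The current CRD has a single `executor` object; an `executorGroups` array like the one below does not exist today, and the field and label names are assumptions:

```yaml
# Hypothetical, NOT a real SparkApplication field: a sketch of an
# executor-groups array with per-group instance counts and node selectors.
apiVersion: sparkoperator.k8s.io/v1beta2
kind: SparkApplication
metadata:
  name: mixed-capacity-app
spec:
  executorGroups:              # hypothetical field; today only `executor` exists
    - instances: 4
      nodeSelector:
        karpenter.sh/capacity-type: spot
    - instances: 6
      nodeSelector:
        karpenter.sh/capacity-type: on-demand
```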

@jacobsalway
Member

jacobsalway commented Nov 23, 2024

Obviously we welcome all PRs and will happily review them, but I think you might find some difficulty in trying to implement this. Spark on Kubernetes doesn't support any concept of executor groups/fleets, so even if the SparkApplication spec supported this, I'm not sure how you'd construct the spark-submit arguments. I think this would require significant changes to the Kubernetes backend in Spark core.
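For context, Spark's Kubernetes backend exposes executor node selection as a single set of confs of the form `spark.kubernetes.executor.node.selector.[labelKey]` (available in recent Spark versions), with one value per label key. A sketch like the following (master URL, label key, and jar path are placeholder assumptions) therefore pins every executor to one capacity type rather than a mix:

```shell
# All executors share the same node selector; there is no per-group conf.
# The label key/value and cluster details below are assumptions.
spark-submit \
  --master k8s://https://kubernetes.example.com:6443 \
  --deploy-mode cluster \
  --conf spark.executor.instances=10 \
  --conf spark.kubernetes.executor.node.selector.karpenter.sh/capacity-type=spot \
  local:///opt/spark/examples/jars/spark-examples.jar
```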

@ujjawal-khare-27
Author

Will check and get back to you on this, @jacobsalway.
