Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

custom shell commands before fn_or_script execution #129

Open
malay-nagda opened this issue Dec 21, 2024 · 0 comments
Open

custom shell commands before fn_or_script execution #129

malay-nagda opened this issue Dec 21, 2024 · 0 comments

Comments

@malay-nagda
Copy link

Context: Running jobs in a multi-node, multi-gpu environment using SlurmExecutor.

Request: Functionality to execute custom shell commands on each GPU inside NeMo container before the main fn_or_script execution starts.

Why?:

  • When experimenting/testing new features, we set env vars (different from env vars added to the sbatch script), execute basic shell commands on each GPU for getting some info about the container
  • I understand we can do so by launching an interactive container session. But, doing so for multiple gpus at the same time is not feasible.

currently, I've updated scripts in NeMo-Run to add the following to srun command-
bash -c "<custom shell command(s)> && python -m nemo_run.core.runners.fdl_runner -n <exp_name> -p /nemo_run/configs/exp_1_packager /nemo_run/configs/<exp_name>_fn_or_script"

as opposed to the default being the following
python -m nemo_run.core.runners.fdl_runner -n <exp_name> -p /nemo_run/configs/exp_1_packager /nemo_run/configs/<exp_name>_fn_or_script

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant