We got into scaling issue with the tagging in prolog script
I understand the prolog is ran at every step and when many nodes are involved the job fails with timeouts
we need to find another place to do the tagging and I understand that the comment is job related but some other tags can be done only once when the instances are created, either because of the min value in the configuration or created by slurm
I am looking at places where this could be done.
maybe it can be done at the headnode instead in the PrologSlurmctld https://slurm.schedmd.com/prolog_epilog.html
We got into scaling issue with the tagging in prolog script
I understand the prolog is ran at every step and when many nodes are involved the job fails with timeouts
we need to find another place to do the tagging and I understand that the comment is job related but some other tags can be done only once when the instances are created, either because of the min value in the configuration or created by slurm
I am looking at places where this could be done.
maybe it can be done at the headnode instead in the PrologSlurmctld https://slurm.schedmd.com/prolog_epilog.html