Commit Graph

16 Commits

Author SHA1 Message Date
Rodrigo Arias Mallo
5c0f179830 stdexp: rename "name" to "clusterName" 2021-04-16 09:29:33 +02:00
Rodrigo Arias Mallo
71c06d02da stages: add baywatch stage to check the exit code
This workaround stage prevents srun from returning 0 to the upper stages
when a signal happens after MPI_Finalize. It writes the return code to a
file named .srun.rc.$rank and later checks that exists and contains a 0.

When the program is killed, exits with non-zero and the error is
propagated to the baywatch stage, which aborts immediately without
creating the rc file.
2021-04-16 09:29:26 +02:00
Rodrigo Arias Mallo
b0af9b8608 srun: add postSrun hook 2021-04-12 17:41:59 +02:00
Rodrigo Arias Mallo
872ad1a289 stdexp: allow preSrun attribute in the srun stage
This option allows an experiment to inject commands before srun starts,
while keeping the standard srun stage options.
2021-03-29 17:46:19 +02:00
Rodrigo Arias Mallo
0b95ea20b7 garlicd: allow manual experiment executions 2021-02-25 11:37:58 +01:00
Rodrigo Arias Mallo
e89139284a stdexp: add result and timetable targets
These targets allow one experiment to directly refer to another
experiment results, thus a dependency chain can be formed to ensure
execution order.

It also simplifies the dataset definition, as they can be automatically
fetched from the experiment directly.
2021-02-03 12:37:54 +01:00
Rodrigo Arias Mallo
5a8cc1e514 stdexp: Run python snippets and import the result 2020-12-10 15:41:49 +01:00
Rodrigo Arias Mallo
734d494d96 stdexp: Allow extra mounts 2020-11-20 15:30:47 +01:00
Rodrigo Arias Mallo
b4a3bb0ede New stdexp resource specification
Now the options	cpusPerTask ntasksPerNode nodes and jobName are required
for the sbatch stage. Also cpuBind has been removed and is always set to
"cores,verbose" in the srun stage.
2020-11-17 11:01:34 +01:00
Rodrigo Arias Mallo
18afcb1f44 Avoid nixpkgs reevaluation
The bsc attrset is now extensible: replacing a few bsc packages is very
fast. Also we allow the complete bscpkgs to be within other custom
overlays (not tested yet).
2020-11-17 10:49:45 +01:00
Rodrigo Arias Mallo
42f2227a9f sbatch: Use experiment reservation if given 2020-11-13 10:17:54 +01:00
Rodrigo Arias Mallo
7a37913b4e Set the ssh host from the machine config 2020-10-13 14:30:03 +02:00
Rodrigo Arias Mallo
a38ff31cca Introduce the runexp stage 2020-10-13 13:00:59 +02:00
Rodrigo Arias Mallo
298c7362b3 New config design 2020-10-09 19:33:06 +02:00
Rodrigo Arias Mallo
53dca32469 Simplify experiment 2020-10-09 17:19:00 +02:00
Rodrigo Arias Mallo
332b738889 Move apps into garlic/apps 2020-10-09 16:42:06 +02:00