Commit Graph

672 Commits

Author SHA1 Message Date
Rodrigo Arias Mallo
0cf35decc5 osu: add mtu and eager experiments 2021-04-16 09:29:33 +02:00
Rodrigo Arias Mallo
26e3a86c78 garlic tool: check the presence of all the units
This check prevents a user from removing units between the
execution of the experiment and the fetch.
2021-04-16 09:29:33 +02:00
Rodrigo Arias Mallo
b96c39e0ba noise: add srun signal bug to the list 2021-04-16 09:29:33 +02:00
Rodrigo Arias Mallo
f842f1e01d slurm: add sigsegv experiment
Ensure that we can catch a sigsegv signal before and after the
MPI_Finalize call.
2021-04-16 09:29:33 +02:00
Rodrigo Arias Mallo
71c06d02da stages: add baywatch stage to check the exit code
This workaround stage prevents srun from returning 0 to the upper stages
when a signal happens after MPI_Finalize. It writes the return code to a
file named .srun.rc.$rank and later checks that exists and contains a 0.

When the program is killed, exits with non-zero and the error is
propagated to the baywatch stage, which aborts immediately without
creating the rc file.
2021-04-16 09:29:26 +02:00
Rodrigo Arias Mallo
604cfd90a3 test: add sigsegv after MPI_Finalize test
The current srun version used in MN4 returns 0 if the program crashes
after MPI_Finalize, as shown by this test.
2021-04-16 09:28:02 +02:00
Rodrigo Arias Mallo
07253c3fa0 fwi: update figure index 2021-04-14 17:18:46 +02:00
Rodrigo Arias Mallo
eab323a13a fwi: update io figure 2021-04-14 17:18:24 +02:00
Rodrigo Arias Mallo
8ce2a68cd7 fwi: update strong scaling figure script 2021-04-14 17:16:12 +02:00
Rodrigo Arias Mallo
99c6196734 fwi: update granularity figure 2021-04-14 17:05:09 +02:00
Rodrigo Arias Mallo
dd75a840ce fwi: use enableIO instead of ioFreq 2021-04-12 20:09:17 +02:00
Rodrigo Arias Mallo
e49e3b087f fwi: rename big io experiment 2021-04-12 19:49:31 +02:00
Rodrigo Arias Mallo
59040d9355 fwi: fix inverted resources 2021-04-12 19:31:35 +02:00
Rodrigo Arias Mallo
6422741cb7 fwi: merge io experiments into one file
The enableExtended parameter control if the experiment runs with
multiple nodes or only one.
2021-04-12 19:27:45 +02:00
Rodrigo Arias Mallo
99beac9b23 fwi: generate the model in every node
As we are using local storage, we need a copy of the input in every
node. The current method is to run the generator only in the rank which
has assigned the cpu 0 in the mask.
2021-04-12 19:01:10 +02:00
Rodrigo Arias Mallo
58dc277d3d fwi: refactor ss-io with common.nix
Also, keep the names short and consistent.
2021-04-12 17:57:46 +02:00
Rodrigo Arias Mallo
47b326c646 fwi: generate the input at runtime 2021-04-12 17:46:07 +02:00
Rodrigo Arias Mallo
419e7f95cc fwi: avoid input generation
The ModelGenerator is now included in the fwi-params, so that the input
can be generated at runtime.
2021-04-12 17:43:30 +02:00
Rodrigo Arias Mallo
b0af9b8608 srun: add postSrun hook 2021-04-12 17:41:59 +02:00
Rodrigo Arias Mallo
4afda7dbfb fwi: use common.nix in sync_io experiment 2021-04-12 16:27:18 +02:00
Rodrigo Arias Mallo
02a103565c fwi: use common.nix in reuse experiment 2021-04-12 15:48:59 +02:00
Rodrigo Arias Mallo
788dd13ebd fwi: merge mpi pure experiment
The getResources function is used to assign the proper cpu binding
depending on the version. However, additional contraints are required to
ensure that we have enough points in Y.

By default the mpi+send+seq branch is disabled.
2021-04-12 15:37:39 +02:00
Rodrigo Arias Mallo
41665bc6fc fwi: refactor config generation into common.nix 2021-04-12 15:01:25 +02:00
Rodrigo Arias Mallo
9aa07993b2 fwi: refactor ss and granularity experiments
A common.nix file contains the shared stages
2021-04-12 14:41:26 +02:00
Rodrigo Arias Mallo
e0a68c077c fwi: merge forkjoin ss experiment into one file
Additional options are only active with enableExtended = true
2021-04-12 12:51:10 +02:00
Aleix Roca Nonell
989f6ee018 fwi: adjust input size to meet timing constraints
The previous iniput size for both granularity and strong scaling tests
where too big to meet the timing constrains needed for garlic. This
patch sets a new, smaller, input size.

Also, a minor cleanup is applied to the rest of the fwi experiments
and figures.
2021-04-07 12:44:14 +02:00
Aleix Roca Nonell
3e5a56ebdb fwi: add tampi non-blocking variant 2021-04-07 12:44:14 +02:00
Aleix Roca Nonell
3ef4a505d3 fwi: add strong scalability tests 2021-04-07 12:44:14 +02:00
Aleix Roca Nonell
aadce016e1 fwi: add granularity and data reuse experiments
The data reuse experiment shows the effect of poor data locality versus
task granularity.
2021-04-07 12:44:14 +02:00
Aleix Roca Nonell
1d9a5c4721 fwi: fix input derivation
The fwiInput derivation must be the same used when compiled the fwi app
as the fwi-input used in the experiment.
2021-04-07 12:44:14 +02:00
Rodrigo Arias Mallo
11e400abb5 fwi: remove old experiment 2021-04-07 12:44:14 +02:00
Rodrigo Arias Mallo
a8477b1b05 fwi: add test figure with the time 2021-04-07 12:44:14 +02:00
Rodrigo Arias Mallo
7a6cbd3a9e fwi: update test experiment 2021-04-07 12:44:14 +02:00
Rodrigo Arias Mallo
3de7b5a0b6 fwi: save the params and frequencies files 2021-04-07 12:44:14 +02:00
Rodrigo Arias Mallo
485b9150e5 fwi: add problem size parameters 2021-04-07 12:44:14 +02:00
Rodrigo Arias Mallo
fa0e9f591f fwi: update repo url to PM server 2021-04-07 12:44:13 +02:00
Rodrigo Arias Mallo
de175b2380 fwi: fix input name 2021-04-07 12:44:13 +02:00
Rodrigo Arias Mallo
bfbbc294ae fwi: split into input and solver
All branches compile with several hacks.
2021-04-07 12:44:13 +02:00
Rodrigo Arias Mallo
9bea3cc264 fwi: add oss experiment 2021-04-07 12:44:13 +02:00
Rodrigo Arias Mallo
f10f8472ac fwi: add seq test experiment 2021-04-07 12:44:13 +02:00
Rodrigo Arias Mallo
26ad3e49f7 fwi: add gitBranch and copy params 2021-04-07 12:44:13 +02:00
Rodrigo Arias Mallo
312656ce54 heat: rename granul -> granularity experiment 2021-04-06 18:42:49 +02:00
Rodrigo Arias Mallo
63aa07dad5 heat: update granularity plot with modern ggplot 2021-04-06 18:40:19 +02:00
Rodrigo Arias Mallo
d1c32869c1 heat: split granularity with extended mode
The HWC version is not yet complete.
2021-04-06 18:38:15 +02:00
Rodrigo Arias Mallo
3566cf0152 develop: add paraver package 2021-04-06 11:14:30 +02:00
Rodrigo Arias Mallo
0b7e92b6f9 heat: add bar plot with time distribution 2021-04-06 11:05:56 +02:00
Rodrigo Arias Mallo
f8122f3c8b heat: use the hcut tool to limit the cpus 2021-04-06 11:05:56 +02:00
Rodrigo Arias Mallo
699404bafe bsc: add cpuid program 2021-04-06 11:05:56 +02:00
Rodrigo Arias Mallo
d68ce914ba heat: use cut to partition the trace
The awk script doesn't take in consideration the events close to the cut
points, which are significative with low parallelism.
2021-04-06 11:05:51 +02:00
Rodrigo Arias Mallo
cb482fa3ea heat: remove perf from the ctf experiment
As we would be extracting perf stats from the trace processing steps.
2021-04-06 11:05:10 +02:00