********************************************************************************
MAQAO 2026.0.0 - 4e6d8b0c28471075b95f413bb65d50dc1633d642::20260327-180102 || 2026/03/27
/scratch_p/kevicamu/MAQAO/bin/maqao oneview -dbg=1 --replace --with-POP -S1 -c=config_throughput.json -xp=qmckl_large_c_o1_malloc-only_stab -- /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c /home/kevicamu/POP/qmckl/qmckl_bench/data/Alz_large.h5
Warning: dataset_handler must be "copy" when throughput_core is defined.
It has been automaticaly set to "copy"
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/binaries/bench_pop_c
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/libqmckl/__install/lib/libqmckl.so.0.0.0 --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/libs/libqmckl.so.0.0.0
Run 1c: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run 1c: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 35 functions cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 21 functions cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 20 functions cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 19 functions cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 31 functions cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 32 functions cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 34 functions cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 20 loops cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 30 loops cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 21 loops cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
CMD: /scratch_p/kevicamu/MAQAO/bin/maqao otter -input=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/input_manifest.csv -output=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/output_manifest.csv -dbg=1 --reduce-threads-files=on
Run run_25: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 34 functions cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 20 loops cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 30 loops cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 21 loops cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
CMD: /scratch_p/kevicamu/MAQAO/bin/maqao otter -input=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/input_manifest.csv -output=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/output_manifest.csv -dbg=1 --reduce-threads-files=on
********************************************************************************
MAQAO 2026.0.0 - 4e6d8b0c28471075b95f413bb65d50dc1633d642::20260327-180102 || 2026/03/27
/scratch_p/kevicamu/MAQAO/bin/maqao oneview -dbg=1 --replace --with-POP -S1 -c=config_throughput.json -xp=qmckl_large_c_o1_malloc-only_stab -- /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c /home/kevicamu/POP/qmckl/qmckl_bench/data/Alz_large.h5
Warning: dataset_handler must be "copy" when throughput_core is defined.
It has been automaticaly set to "copy"
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/binaries/bench_pop_c
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/libqmckl/__install/lib/libqmckl.so.0.0.0 --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/libs/libqmckl.so.0.0.0
Run 1c: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run 1c: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 35 functions cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 21 functions cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 20 functions cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 19 functions cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 31 functions cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 32 functions cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 34 functions cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 20 loops cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 30 loops cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 21 loops cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
CMD: /scratch_p/kevicamu/MAQAO/bin/maqao otter -input=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/input_manifest.csv -output=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/output_manifest.csv -dbg=1 --reduce-threads-files=on
********************************************************************************
MAQAO 2026.0.0 - 4e6d8b0c28471075b95f413bb65d50dc1633d642::20260327-180102 || 2026/03/27
/scratch_p/kevicamu/MAQAO/bin/maqao oneview -dbg=1 --replace --with-POP -S1 -c=config_throughput.json -xp=qmckl_large_c_o1_malloc-only_stab -- /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c /home/kevicamu/POP/qmckl/qmckl_bench/data/Alz_large.h5
Warning: dataset_handler must be "copy" when throughput_core is defined.
It has been automaticaly set to "copy"
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/binaries/bench_pop_c
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/libqmckl/__install/lib/libqmckl.so.0.0.0 --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/libs/libqmckl.so.0.0.0
Run 1c: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run 1c: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 35 functions cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 21 functions cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 20 functions cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 19 functions cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 31 functions cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 32 functions cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 34 functions cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 20 loops cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 30 loops cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 21 loops cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
CMD: /scratch_p/kevicamu/MAQAO/bin/maqao otter -input=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/input_manifest.csv -output=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/output_manifest.csv -dbg=1 --reduce-threads-files=on
********************************************************************************
MAQAO 2026.0.0 - 4e6d8b0c28471075b95f413bb65d50dc1633d642::20260327-180102 || 2026/03/27
/scratch_p/kevicamu/MAQAO/bin/maqao oneview -dbg=1 --replace --with-POP -S1 -c=config_throughput.json -xp=qmckl_large_c_o1_malloc-only_stab -- /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c /home/kevicamu/POP/qmckl/qmckl_bench/data/Alz_large.h5
Warning: dataset_handler must be "copy" when throughput_core is defined.
It has been automaticaly set to "copy"
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/binaries/bench_pop_c
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/libqmckl/__install/lib/libqmckl.so.0.0.0 --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/libs/libqmckl.so.0.0.0
Run 1c: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run 1c: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 35 functions cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 21 functions cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 20 functions cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 19 functions cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 31 functions cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 32 functions cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 34 functions cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 20 loops cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 30 loops cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 21 loops cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
CMD: /scratch_p/kevicamu/MAQAO/bin/maqao otter -input=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/input_manifest.csv -output=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/output_manifest.csv -dbg=1 --reduce-threads-files=on
********************************************************************************
MAQAO 2026.0.0 - 4e6d8b0c28471075b95f413bb65d50dc1633d642::20260327-180102 || 2026/03/27
/scratch_p/kevicamu/MAQAO/bin/maqao oneview -dbg=1 --replace --with-POP -S1 -c=config_throughput.json -xp=qmckl_large_c_o1_malloc-only_stab -- /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c /home/kevicamu/POP/qmckl/qmckl_bench/data/Alz_large.h5
Warning: dataset_handler must be "copy" when throughput_core is defined.
It has been automaticaly set to "copy"
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/binaries/bench_pop_c
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/libqmckl/__install/lib/libqmckl.so.0.0.0 --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/libs/libqmckl.so.0.0.0
Run 1c: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run 1c: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 35 functions cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 21 functions cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 20 functions cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 19 functions cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 31 functions cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 32 functions cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 34 functions cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 20 loops cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 30 loops cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 21 loops cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
CMD: /scratch_p/kevicamu/MAQAO/bin/maqao otter -input=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/input_manifest.csv -output=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/output_manifest.csv -dbg=1 --reduce-threads-files=on
********************************************************************************
MAQAO 2026.0.0 - 4e6d8b0c28471075b95f413bb65d50dc1633d642::20260327-180102 || 2026/03/27
/scratch_p/kevicamu/MAQAO/bin/maqao oneview -dbg=1 --replace --with-POP -S1 -c=config_throughput.json -xp=qmckl_large_c_o1_malloc-only_stab -- /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c /home/kevicamu/POP/qmckl/qmckl_bench/data/Alz_large.h5
Warning: dataset_handler must be "copy" when throughput_core is defined.
It has been automaticaly set to "copy"
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/binaries/bench_pop_c
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/libqmckl/__install/lib/libqmckl.so.0.0.0 --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/libs/libqmckl.so.0.0.0
Run 1c: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run 1c: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 35 functions cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 21 functions cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 20 functions cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 19 functions cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 31 functions cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 32 functions cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 34 functions cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 20 loops cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 30 loops cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 21 loops cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
CMD: /scratch_p/kevicamu/MAQAO/bin/maqao otter -input=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/input_manifest.csv -output=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/output_manifest.csv -dbg=1 --reduce-threads-files=on
********************************************************************************
MAQAO 2026.0.0 - 4e6d8b0c28471075b95f413bb65d50dc1633d642::20260327-180102 || 2026/03/27
/scratch_p/kevicamu/MAQAO/bin/maqao oneview -dbg=1 --replace --with-POP -S1 -c=config_throughput.json -xp=qmckl_large_c_o1_malloc-only_stab -- /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c /home/kevicamu/POP/qmckl/qmckl_bench/data/Alz_large.h5
Warning: dataset_handler must be "copy" when throughput_core is defined.
It has been automaticaly set to "copy"
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/binaries/bench_pop_c
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/libqmckl/__install/lib/libqmckl.so.0.0.0 --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/libs/libqmckl.so.0.0.0
Run 1c: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run 1c: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 35 functions cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 21 functions cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 20 functions cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 19 functions cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 31 functions cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 32 functions cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 34 functions cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 20 loops cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 30 loops cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 21 loops cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
CMD: /scratch_p/kevicamu/MAQAO/bin/maqao otter -input=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/input_manifest.csv -output=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/output_manifest.csv -dbg=1 --reduce-threads-files=on
********************************************************************************
MAQAO 2026.0.0 - 4e6d8b0c28471075b95f413bb65d50dc1633d642::20260327-180102 || 2026/03/27
/scratch_p/kevicamu/MAQAO/bin/maqao oneview -dbg=1 --replace --with-POP -S1 -c=config_throughput.json -xp=qmckl_large_c_o1_malloc-only_stab -- /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c /home/kevicamu/POP/qmckl/qmckl_bench/data/Alz_large.h5
Warning: dataset_handler must be "copy" when throughput_core is defined.
It has been automaticaly set to "copy"
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/binaries/bench_pop_c
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/libqmckl/__install/lib/libqmckl.so.0.0.0 --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/libs/libqmckl.so.0.0.0
Run 1c: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run 1c: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 35 functions cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 21 functions cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 20 functions cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 19 functions cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 31 functions cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 32 functions cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 34 functions cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 20 loops cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 30 loops cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 21 loops cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
CMD: /scratch_p/kevicamu/MAQAO/bin/maqao otter -input=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/input_manifest.csv -output=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/output_manifest.csv -dbg=1 --reduce-threads-files=on
********************************************************************************
MAQAO 2026.0.0 - 4e6d8b0c28471075b95f413bb65d50dc1633d642::20260327-180102 || 2026/03/27
/scratch_p/kevicamu/MAQAO/bin/maqao oneview -dbg=1 --replace --with-POP -S1 -c=config_throughput.json -xp=qmckl_large_c_o1_malloc-only_stab -- /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c /home/kevicamu/POP/qmckl/qmckl_bench/data/Alz_large.h5
Warning: dataset_handler must be "copy" when throughput_core is defined.
It has been automaticaly set to "copy"
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/binaries/bench_pop_c
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/libqmckl/__install/lib/libqmckl.so.0.0.0 --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/libs/libqmckl.so.0.0.0
Run 1c: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run 1c: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 35 functions cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 21 functions cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 20 functions cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 19 functions cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 31 functions cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 32 functions cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 34 functions cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 20 loops cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 30 loops cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 21 loops cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
CMD: /scratch_p/kevicamu/MAQAO/bin/maqao otter -input=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/input_manifest.csv -output=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/output_manifest.csv -dbg=1 --reduce-threads-files=on
********************************************************************************
MAQAO 2026.0.0 - 4e6d8b0c28471075b95f413bb65d50dc1633d642::20260327-180102 || 2026/03/27
/scratch_p/kevicamu/MAQAO/bin/maqao oneview -dbg=1 --replace --with-POP -S1 -c=config_throughput.json -xp=qmckl_large_c_o1_malloc-only_stab -- /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c /home/kevicamu/POP/qmckl/qmckl_bench/data/Alz_large.h5
Warning: dataset_handler must be "copy" when throughput_core is defined.
It has been automaticaly set to "copy"
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/binaries/bench_pop_c
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/libqmckl/__install/lib/libqmckl.so.0.0.0 --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/libs/libqmckl.so.0.0.0
Run 1c: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run 1c: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 35 functions cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 21 functions cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 20 functions cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 19 functions cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 31 functions cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 32 functions cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 34 functions cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 20 loops cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 30 loops cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 21 loops cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
CMD: /scratch_p/kevicamu/MAQAO/bin/maqao otter -input=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/input_manifest.csv -output=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/output_manifest.csv -dbg=1 --reduce-threads-files=on
********************************************************************************
MAQAO 2026.0.0 - 4e6d8b0c28471075b95f413bb65d50dc1633d642::20260327-180102 || 2026/03/27
/scratch_p/kevicamu/MAQAO/bin/maqao oneview -dbg=1 --replace --with-POP -S1 -c=config_throughput.json -xp=qmckl_large_c_o1_malloc-only_stab -- /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c /home/kevicamu/POP/qmckl/qmckl_bench/data/Alz_large.h5
Warning: dataset_handler must be "copy" when throughput_core is defined.
It has been automaticaly set to "copy"
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/binaries/bench_pop_c
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/libqmckl/__install/lib/libqmckl.so.0.0.0 --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/libs/libqmckl.so.0.0.0
Run 1c: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run 1c: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 35 functions cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 21 functions cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 20 functions cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 19 functions cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 31 functions cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 32 functions cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 34 functions cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 20 loops cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 30 loops cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 21 loops cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
CMD: /scratch_p/kevicamu/MAQAO/bin/maqao otter -input=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/input_manifest.csv -output=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/output_manifest.csv -dbg=1 --reduce-threads-files=on
********************************************************************************
MAQAO 2026.0.0 - 4e6d8b0c28471075b95f413bb65d50dc1633d642::20260327-180102 || 2026/03/27
/scratch_p/kevicamu/MAQAO/bin/maqao oneview -dbg=1 --replace --with-POP -S1 -c=config_throughput.json -xp=qmckl_large_c_o1_malloc-only_stab -- /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c /home/kevicamu/POP/qmckl/qmckl_bench/data/Alz_large.h5
Warning: dataset_handler must be "copy" when throughput_core is defined.
It has been automaticaly set to "copy"
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/binaries/bench_pop_c
CPY: [true] /home/kevicamu/POP/qmckl/qmckl_bench/build_march/libqmckl/__install/lib/libqmckl.so.0.0.0 --> /scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/libs/libqmckl.so.0.0.0
Run 1c: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run 1c: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 35 functions cumulating 0.0% of application profiled time
Run run_1: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_2: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 21 functions cumulating 0.0% of application profiled time
Run run_3: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_4: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_5: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_6: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_7: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_8: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 20 functions cumulating 0.0% of application profiled time
Run run_9: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_10: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_11: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_12: At cluster level, discarded 24 loops cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_13: At cluster level, discarded 29 loops cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 19 functions cumulating 0.0% of application profiled time
Run run_14: At cluster level, discarded 22 loops cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 27 functions cumulating 0.0% of application profiled time
Run run_15: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 30 functions cumulating 0.0% of application profiled time
Run run_16: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_17: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_18: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 31 functions cumulating 0.0% of application profiled time
Run run_19: At cluster level, discarded 26 loops cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_20: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_21: At cluster level, discarded 23 loops cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 32 functions cumulating 0.0% of application profiled time
Run run_22: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_23: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 26 functions cumulating 0.0% of application profiled time
Run run_24: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_25: At cluster level, discarded 25 loops cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 34 functions cumulating 0.0% of application profiled time
Run run_26: At cluster level, discarded 27 loops cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 24 functions cumulating 0.0% of application profiled time
Run run_27: At cluster level, discarded 20 loops cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 25 functions cumulating 0.0% of application profiled time
Run run_28: At cluster level, discarded 30 loops cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 29 functions cumulating 0.0% of application profiled time
Run run_29: At cluster level, discarded 21 loops cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 functions cumulating 0.0% of application profiled time
Run run_30: At cluster level, discarded 28 loops cumulating 0.0% of application profiled time
CMD: /scratch_p/kevicamu/MAQAO/bin/maqao otter -input=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/input_manifest.csv -output=/scratch_p/kevicamu/runs/qmckl/qmckl_large_c_o1_malloc-only_stab/OTTER/output_manifest.csv -dbg=1 --reduce-threads-files=on