Help is available by moving the cursor above any
symbol or by checking MAQAO website.
| Metric | r0 | r1 | r2 | r3 | |
|---|---|---|---|---|---|
| Total Time (s) | 5.07 | 5.10 | 4.78 | 4.62 | |
| Max (Thread Active Time) (s) | 5.05 | 5.07 | 4.76 | 4.56 | |
| Average Active Time (s) | 5.05 | 5.07 | 4.76 | 4.56 | |
| Activity Ratio (%) | 99.5 | 99.5 | 99.4 | 98.8 | |
| Average number of active threads | 0.995 | 0.995 | 0.994 | 0.988 | |
| Affinity Stability (%) | 99.6 | 99.5 | 99.7 | 98.7 | |
| Time in analyzed loops (%) | 90.9 | 92.8 | 98.3 | 98.4 | |
| Time in analyzed innermost loops (%) | 67.7 | 69.2 | 72.5 | 89.1 | |
| Time in user code (%) | 90.8 | 93.1 | 98.5 | 98.6 | |
| Compilation Options Score (%) | 97.2 | 97.9 | 97.8 | 16.9 | |
| Array Access Efficiency (%) | 50.8 | 36.4 | 36.7 | 39.0 | |
| Potential Speedups | |||||
| Perfect Flow Complexity | 1.28 | 1.30 | 1.34 | 1.36 | |
| Perfect OpenMP/MPI/Pthread/TBB | 1.00 | 1.00 | 1.00 | 1.00 | |
| Perfect OpenMP/MPI/Pthread/TBB + Perfect Load Distribution | 1.00 | 1.00 | 1.00 | 1.00 | |
| No Scalar Integer | Potential Speedup | 1.27 | 1.29 | 1.33 | 1.21 |
| Nb Loops to get 80% | 2 | 2 | 2 | 2 | |
| FP Vectorised | Potential Speedup | 1.01 | 1.01 | 1.01 | 1.33 |
| Nb Loops to get 80% | 1 | 1 | 1 | 4 | |
| Fully Vectorised | Potential Speedup | 1.39 | 1.40 | 1.46 | 3.29 |
| Nb Loops to get 80% | 2 | 2 | 2 | 6 | |
| Only FP Arithmetic | Potential Speedup | 3.18 | 3.32 | 3.79 | 2.40 |
| Nb Loops to get 80% | 5 | 4 | 4 | 4 | |
| Source Object | Issue |
|---|---|
| ▼bench_pop_c | |
| ▼ | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| ○ | -O2, -O3 or -Ofast is missing. |
| ○ | -march=(target) is missing. |
| ▼libqmckl.so.0.0.0 | |
| ▼qmckl_blas.c | |
| ○ | |
| ▼qmckl_jastrow_champ_f.F90 | |
| ○ | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| ▼qmckl_jastrow_champ.c | |
| ○ | |
| ▼qmckl_jastrow_champ_single_f.F90 | |
| ○ | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| ▼qmckl_distance_f.F90 | |
| ○ | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| ▼qmckl_mo.c | |
| ○ | |
| ▼qmckl_jastrow_champ_single.c | |
| ○ |
| r0 | r1 | r2 | r3 | |
|---|---|---|---|---|
| Experiment Name | ||||
| Application | /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_c | same as r0 | same as r0 | /home/kevicamu/POP/qmckl/qmckl_bench/build_march/bench_pop_fortran |
| Timestamp | 2026-03-31 14:44:50 | 2026-03-31 12:24:19 | 2026-03-31 12:08:39 | 2026-03-31 10:37:19 |
| Experiment Type | Throughput; | same as r0 | same as r0 | Sequential |
| Machine | romeo-c101 | same as r0 | same as r0 | same as r0 |
| Architecture | x86_64 | same as r0 | same as r0 | same as r0 |
| Micro Architecture | ZEN_V4 | same as r0 | same as r0 | same as r0 |
| Model Name | AMD EPYC 9654 96-Core Processor | same as r0 | same as r0 | same as r0 |
| Cache Size | 1024 KB | same as r0 | same as r0 | same as r0 |
| Number of Cores | 96 | same as r0 | same as r0 | same as r0 |
| Maximal Frequency | 3.707812 GHz | same as r0 | same as r0 | same as r0 |
| OS Version | Linux 5.14.0-427.42.1.el9_4.x86_64 #1 SMP PREEMPT_DYNAMIC Fri Oct 18 14:35:40 EDT 2024 | same as r0 | same as r0 | same as r0 |
| Architecture used during static analysis | x86_64 | same as r0 | same as r0 | same as r0 |
| Micro Architecture used during static analysis | ZEN_V4 | same as r0 | same as r0 | same as r0 |
| Compilation Options | bench_pop_c: N/A libqmckl.so.0.0.0: -Wl,-rpath,/apps/2025/spack_install/linux-rhel9-zen4/linux-rhel9-zen4/gcc-11.4.1/intel-oneapi-compilers-2024.0.2-mga5mzcxshliwfnylzlj6ycvtdnikcao/compiler/2024.0/lib --gcc-toolchain=/usr --intel -I . -I ./include -I ./src -I ./include -I ./src -I ./include -I /home/kevicamu/POP/qmckl/qmckl_bench/build_march/libqmckl/src/libqmckl/share/qmckl/test_data/ -I /home/kevicamu/POP/qmckl/qmckl_bench/build_march/libtrexio/__install/include -D HAVE_CONFIG_H -D QMCKL_TEST_DIR=\"/home/kevicamu/POP/qmckl/qmckl_bench/build_march/libqmckl/src/libqmckl/share/qmckl/test_data/\" -march=native -O3 -ftz -finline -g -qmkl=sequential -fopenmp -MT src/qmckl_jastrow_champ_single.lo -MD -MP -MF src/.deps/qmckl_jastrow_champ_single.Tpo -c src/qmckl_jastrow_champ_single.c -fPIC -D PIC -o src/.libs/qmckl_jastrow_champ_single.o -fveclib=SVML -fheinous-gnu-extensions | same as r0 | same as r0 | bench_pop_fortran: N/A libqmckl.so.0.0.0: Intel(R) Fortran 24.0-1238 |
| Number of processes observed | 1 | same as r0 | same as r0 | same as r0 |
| Number of threads observed | 1 | same as r0 | same as r0 | same as r0 |
| Frequency Driver | acpi-cpufreq | same as r0 | same as r0 | same as r0 |
| Frequency Governor | performance | same as r0 | same as r0 | same as r0 |
| Huge Pages | always | same as r0 | same as r0 | same as r0 |
| Hyperthreading | off | same as r0 | same as r0 | same as r0 |
| Number of sockets | 2 | same as r0 | same as r0 | same as r0 |
| Number of cores per socket | 96 | same as r0 | same as r0 | same as r0 |
| MAQAO version | 2026.0.0 | same as r0 | same as r0 | same as r0 |
| MAQAO build | 4e6d8b0c28471075b95f413bb65d50dc1633d642::20260327-180102 | same as r0 | same as r0 | same as r0 |
| Comments | same as r0 | same as r0 | same as r0 |