Help is available by moving the cursor above any symbol or by checking MAQAO website.
Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | |
---|---|---|---|---|---|---|---|---|---|
Total Time (s) | 386.74 | 198.46 | 103.28 | 55.98 | 32.81 | 21.22 | 15.58 | 15.47 | |
Profiled Time (s) | 64.36 | 33.69 | 18.18 | 10.50 | 6.72 | 4.87 | 3.98 | 4.01 | |
Time in analyzed loops (%) | 6.57 | 7.06 | 8.11 | 9.73 | 12.0 | 15.1 | 17.8 | 17.7 | |
Time in analyzed innermost loops (%) | 6.04 | 6.05 | 6.48 | 6.92 | 7.86 | 9.37 | 10.4 | 10.4 | |
Time in user code (%) | 6.59 | 7.10 | 8.15 | 9.74 | 12.1 | 15.1 | 17.8 | 17.7 | |
Compilation Options Score (%) | 43.9 | 37.0 | 30.0 | 22.0 | 13.9 | 7.88 | 4.15 | 3.29 | |
Array Access Efficiency (%) | Not Available | Not Available | Not Available | Not Available | Not Available | Not Available | Not Available | Not Available | |
Scalability - Gap | 1.00 | 1.03 | 1.07 | 1.16 | 1.36 | 1.76 | 2.58 | 3.20 | |
Potential Speedups | |||||||||
Perfect Flow Complexity | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
Perfect OpenMP + MPI + Pthread | 1.00 | 1.01 | 1.01 | 1.01 | 1.01 | 1.01 | 1.01 | 1.03 | |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | 1.03 | 1.09 | 1.22 | 1.42 | 1.72 | 2.06 | 2.19 | |
No Scalar Integer | Potential Speedup | 1.01 | 1.01 | 1.02 | 1.02 | 1.03 | 1.05 | 1.06 | 1.06 |
Nb Loops to get 80% | 4 | 3 | 2 | 2 | 1 | 1 | 1 | 1 | |
FP Vectorised | Potential Speedup | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
Nb Loops to get 80% | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
Fully Vectorised | Potential Speedup | 1.02 | 1.02 | 1.02 | 1.03 | 1.04 | 1.06 | 1.07 | 1.07 |
Nb Loops to get 80% | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | |
Only FP Arithmetic | Potential Speedup | 1.03 | 1.03 | 1.03 | 1.04 | 1.05 | 1.07 | 1.08 | 1.08 |
Nb Loops to get 80% | 3 | 3 | 2 | 2 | 2 | 2 | 2 | 2 |
r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | |
---|---|---|---|---|---|---|---|---|
Experiment Name | ||||||||
Application | /users/m23012/camus/code/qmckl/qmckl_bench/bench_jastrow | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Timestamp | 2024-02-26 17:39:44 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Experiment Type | Sequential | OpenMP; | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 |
Machine | turpancomp2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Architecture | aarch64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture | ARM_NEOVERSE_N1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Model Name | ||||||||
Cache Size | ||||||||
Number of Cores | ||||||||
Maximal Frequency | 3 GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
OS Version | Linux 4.18.0-477.27.1.el8_8.aarch64 #1 SMP Thu Aug 31 11:00:23 EDT 2023 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Architecture used during static analysis | aarch64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture used during static analysis | ARM_NEOVERSE_N1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Compilation Options | libqmckl.so.0.0.0: Arm C/C++/Fortran Compiler version 23.10 (build number 32) (based on LLVM 17.0.0) | same as r0 | + [vdso]: N/A libqmckl.so.0.0.0: Arm C/C++/Fortran Compiler version 23.10 (build number 32) (based on LLVM 17.0.0) | same as r2 | same as r2 | same as r2 | same as r2 | same as r2 |
Number of processes observed | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of threads observed | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 80 |
Frequency Driver | cppc_cpufreq | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Frequency Governor | performance | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Huge Pages | never | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Hyperthreading | off | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of sockets | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of cores per socket | 80 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO version | 2.19.2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO build | b4419cd98e02cf0e1ddec16d03c3ae3a99469c7b::20240223-153634 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Comments | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |