| Detailed Application Categorization |
| Detailed Function Times |
| Scalability - Coverage per Category |
| Scalability - Time per Category |
| Scalability - Efficiency |
| Function Based Profile |
| Scalability - Coverage per Parallel Efficiency |
| Scalability - Coverage per Parallel Speedup |
| Libraries |
Detailed Application Categorization
| ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | TBB(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | libqmckl.so.0.0.0 (%) | Others(%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ▼run_0 | 13.26 | 0.00 | 0.00 | 0.04 | 0.00 | 0.83 | 0.11 | 0.00 | 0.00 | 0.00 | 0.30 | 88.42 | 10.30 |
| ▼Node skylake | 13.26 | 0.00 | 0.00 | 0.04 | 0.00 | 0.83 | 0.11 | 0.00 | 0.00 | 0.00 | 0.30 | 88.42 | 10.30 |
| ▼Process 2955241 | 13.26 | 0.00 | 0.00 | 0.04 | 0.00 | 0.83 | 0.11 | 0.00 | 0.00 | 0.00 | 0.30 | 88.42 | 10.30 |
| ○Thread 2955241 | 13.26 | 0.00 | 0.00 | 0.04 | 0.00 | 0.83 | 0.11 | 0.00 | 0.00 | 0.00 | 0.30 | 88.42 | 10.30 |
| ▼run_1 | 13.22 | 0.00 | 0.00 | 0.11 | 0.00 | 1.02 | 0.15 | 0.00 | 0.00 | 0.00 | 0.19 | 89.11 | 9.42 |
| ▼Node skylake | 13.22 | 0.00 | 0.00 | 0.11 | 0.00 | 1.02 | 0.15 | 0.00 | 0.00 | 0.00 | 0.19 | 89.11 | 9.42 |
| ▼Process 2955296 | 13.22 | 0.00 | 0.00 | 0.11 | 0.00 | 1.02 | 0.15 | 0.00 | 0.00 | 0.00 | 0.19 | 89.11 | 9.42 |
| ○Thread 2955296 | 13.22 | 0.00 | 0.00 | 0.11 | 0.00 | 1.02 | 0.15 | 0.00 | 0.00 | 0.00 | 0.19 | 89.11 | 9.42 |
| ▼run_2 | 13.36 | 0.04 | 0.00 | 0.07 | 0.00 | 0.52 | 0.11 | 0.00 | 0.04 | 0.00 | 0.19 | 89.89 | 9.14 |
| ▼Node skylake | 13.36 | 0.04 | 0.00 | 0.07 | 0.00 | 0.52 | 0.11 | 0.00 | 0.04 | 0.00 | 0.19 | 89.89 | 9.14 |
| ▼Process 2955357 | 13.36 | 0.04 | 0.00 | 0.07 | 0.00 | 0.52 | 0.11 | 0.00 | 0.04 | 0.00 | 0.19 | 89.89 | 9.14 |
| ○Thread 2955357 | 13.36 | 0.04 | 0.00 | 0.07 | 0.00 | 0.52 | 0.11 | 0.00 | 0.04 | 0.00 | 0.19 | 89.89 | 9.14 |
| ▼run_3 | 13.48 | 0.00 | 0.00 | 0.04 | 0.00 | 0.71 | 0.11 | 0.00 | 0.00 | 0.00 | 0.11 | 90.02 | 9.02 |
| ▼Node skylake | 13.48 | 0.00 | 0.00 | 0.04 | 0.00 | 0.71 | 0.11 | 0.00 | 0.00 | 0.00 | 0.11 | 90.02 | 9.02 |
| ▼Process 2955430 | 13.48 | 0.00 | 0.00 | 0.04 | 0.00 | 0.71 | 0.11 | 0.00 | 0.00 | 0.00 | 0.11 | 90.02 | 9.02 |
| ○Thread 2955430 | 13.48 | 0.00 | 0.00 | 0.04 | 0.00 | 0.71 | 0.11 | 0.00 | 0.00 | 0.00 | 0.11 | 90.02 | 9.02 |
| ▼run_4 | 13.46 | 0.00 | 0.00 | 0.15 | 0.00 | 0.78 | 0.15 | 0.00 | 0.04 | 0.00 | 0.22 | 88.03 | 10.63 |
| ▼Node skylake | 13.46 | 0.00 | 0.00 | 0.15 | 0.00 | 0.78 | 0.15 | 0.00 | 0.04 | 0.00 | 0.22 | 88.03 | 10.63 |
| ▼Process 2955530 | 13.46 | 0.00 | 0.00 | 0.15 | 0.00 | 0.78 | 0.15 | 0.00 | 0.04 | 0.00 | 0.22 | 88.03 | 10.63 |
| ○Thread 2955530 | 13.46 | 0.00 | 0.00 | 0.15 | 0.00 | 0.78 | 0.15 | 0.00 | 0.04 | 0.00 | 0.22 | 88.03 | 10.63 |
| ▼run_5 | 13.96 | 0.00 | 0.00 | 0.04 | 0.00 | 0.75 | 0.07 | 0.00 | 0.00 | 0.00 | 0.32 | 88.03 | 10.78 |
| ▼Node skylake | 13.96 | 0.00 | 0.00 | 0.04 | 0.00 | 0.75 | 0.07 | 0.00 | 0.00 | 0.00 | 0.32 | 88.03 | 10.78 |
| ▼Process 2955663 | 13.96 | 0.00 | 0.00 | 0.04 | 0.00 | 0.75 | 0.07 | 0.00 | 0.00 | 0.00 | 0.32 | 88.03 | 10.78 |
| ○Thread 2955663 | 13.96 | 0.00 | 0.00 | 0.04 | 0.00 | 0.75 | 0.07 | 0.00 | 0.00 | 0.00 | 0.32 | 88.03 | 10.78 |
| ▼run_6 | 13.99 | 0.00 | 0.00 | 0.07 | 0.00 | 0.72 | 0.18 | 0.00 | 0.00 | 0.00 | 0.32 | 87.02 | 11.69 |
| ▼Node skylake | 13.99 | 0.00 | 0.00 | 0.07 | 0.00 | 0.72 | 0.18 | 0.00 | 0.00 | 0.00 | 0.32 | 87.02 | 11.69 |
| ▼Process 2955870 | 13.99 | 0.00 | 0.00 | 0.07 | 0.00 | 0.72 | 0.18 | 0.00 | 0.00 | 0.00 | 0.32 | 87.02 | 11.69 |
| ○Thread 2955870 | 13.99 | 0.00 | 0.00 | 0.07 | 0.00 | 0.72 | 0.18 | 0.00 | 0.00 | 0.00 | 0.32 | 87.02 | 11.69 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
| Run | Number of threads | Binary (%) | OMP (%) | Math (%) | System (%) | IO (%) | Memory (%) | Others (%) |
|---|---|---|---|---|---|---|---|---|
| run_0 | 1 | 0 | 0.04 | 0.83 | 0.11 | 0 | 0.3 | 10.3 |
| run_1 | 1 | 0 | 0.11 | 1.02 | 0.15 | 0 | 0.19 | 9.42 |
| run_2 | 1 | 0.04 | 0.07 | 0.52 | 0.11 | 0.04 | 0.19 | 9.14 |
| run_3 | 1 | 0 | 0.04 | 0.71 | 0.11 | 0 | 0.11 | 9.02 |
| run_4 | 1 | 0 | 0.15 | 0.78 | 0.15 | 0.04 | 0.22 | 10.63 |
| run_5 | 1 | 0 | 0.04 | 0.75 | 0.07 | 0 | 0.32 | 10.78 |
| run_6 | 1 | 0 | 0.07 | 0.72 | 0.18 | 0 | 0.32 | 11.69 |
Scalability - Time per Category
Detailed Time per Category
| Run | Number of threads | Total Time (s) | OMP (s) | Math (s) | System (s) | Memory (s) | Others (s) |
|---|---|---|---|---|---|---|---|
| run_0 | 1 | 13.26 | 0 | 0.11 | 0.01 | 0.04 | 1.37 |
| run_1 | 1 | 13.22 | 0.01 | 0.13 | 0.02 | 0.02 | 1.24 |
| run_2 | 1 | 13.36 | 0.01 | 0.07 | 0.01 | 0.02 | 1.22 |
| run_3 | 1 | 13.48 | 0.01 | 0.1 | 0.02 | 0.02 | 1.22 |
| run_4 | 1 | 13.46 | 0.02 | 0.1 | 0.02 | 0.03 | 1.43 |
| run_5 | 1 | 13.96 | 0.01 | 0.11 | 0.01 | 0.05 | 1.5 |
| run_6 | 1 | 13.99 | 0.01 | 0.1 | 0.03 | 0.05 | 1.64 |
Scalability - Efficiency
Detailed Efficiency
| Run | Number of observed threads | Efficiency (ideal is 1) |
|---|---|---|
| run_0 | 1 | 1 |
| run_1 | 1 | 1 |
| run_2 | 1 | 0.99 |
| run_3 | 1 | 0.98 |
| run_4 | 1 | 0.99 |
| run_5 | 1 | 0.95 |
| run_6 | 1 | 0.95 |
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Columns Filter
| Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| run_0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
| run_1 | 1 | 0 | 0 | 0 | 0 | 0 | 0.42 | 0 | 0 | 1.02 | 98.41 | 0.15 |
| run_2 | 1 | 0 | 0 | 0 | 0 | 0.07 | 0.26 | 1.16 | 1.12 | 0 | 97.23 | 0.15 |
| run_3 | 1 | 0 | 0 | 0 | 0.22 | 0.07 | 0 | 2.6 | 0 | 0 | 96.88 | 0.22 |
| run_4 | 1 | 0 | 0 | 0 | 0 | 0.56 | 0 | 1.15 | 0 | 0 | 97.96 | 0.33 |
| run_5 | 1 | 0 | 0 | 0 | 0.21 | 0 | 1.47 | 0 | 0.32 | 0 | 97.78 | 0.22 |
| run_6 | 1 | 0 | 0 | 0 | 0 | 0.21 | 1.43 | 0 | 0.32 | 12.55 | 85.2 | 0.29 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Columns Filter
| Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| run_0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
| run_1 | 1 | 0 | 0 | 0 | 0 | 0 | 0.42 | 0 | 0 | 1.02 | 87.86 | 10.7 | 0 |
| run_2 | 1 | 0 | 0 | 0 | 0 | 0.07 | 0.26 | 1.16 | 1.12 | 0 | 87.2 | 10.18 | 0 |
| run_3 | 1 | 0 | 0 | 0 | 0.22 | 0.07 | 0 | 2.6 | 0 | 0 | 87.31 | 9.8 | 0 |
| run_4 | 1 | 0 | 0 | 0 | 0 | 0.56 | 0 | 1.15 | 0 | 0 | 96.43 | 1.86 | 0 |
| run_5 | 1 | 0 | 0 | 0 | 0.21 | 0 | 1.47 | 0 | 0.32 | 0 | 96.35 | 1.65 | 0 |
| run_6 | 1 | 0 | 0 | 0 | 0 | 0.21 | 1.43 | 0 | 0.32 | 12.55 | 84.48 | 1 | 0 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
| Library | run_0 | run_1 | run_2 | run_3 | run_4 | run_5 | run_6 |
|---|---|---|---|---|---|---|---|
| /home/kcamus/Trex/qmckl/qmckl_bench/build_pop/libqmckl/__install/lib/libqmckl.so.0.0.0 | |||||||
| /home/kcamus/Trex/qmckl/qmckl_bench/build_pop/libtrexio/__install/lib/libtrexio.so.0.0.0 | |||||||
| /opt/intel/oneapi/compiler/2024.2/lib/libarcher.so | |||||||
| /opt/intel/oneapi/compiler/2024.2/lib/libifcoremt.so.5 | |||||||
| /opt/intel/oneapi/compiler/2024.2/lib/libifport.so.5 | |||||||
| /opt/intel/oneapi/compiler/2024.2/lib/libimf.so | |||||||
| /opt/intel/oneapi/compiler/2024.2/lib/libintlc.so.5 | |||||||
| /opt/intel/oneapi/compiler/2024.2/lib/libiomp5.so | |||||||
| /opt/intel/oneapi/compiler/2024.2/lib/libirng.so | |||||||
| /opt/intel/oneapi/compiler/2024.2/lib/libsvml.so | |||||||
| /usr/lib/ld-linux-x86-64.so.2 | |||||||
| /usr/lib/libblas.so.3.12.0 | |||||||
| /usr/lib/libc.so.6 | |||||||
| /usr/lib/libdl.so.2 | |||||||
| /usr/lib/libgcc_s.so.1 | |||||||
| /usr/lib/libgfortran.so.5.0.0 | |||||||
| /usr/lib/libhdf5.so.310.5.1 | |||||||
| /usr/lib/liblapack.so.3.12.0 | |||||||
| /usr/lib/libm.so.6 | |||||||
| /usr/lib/libpthread.so.0 | |||||||
| /usr/lib/librt.so.1 | |||||||
| /usr/lib/libsz.so.2.0.1 | |||||||
| /usr/lib/libz.so.1.3.1 |

