Run OMP1 | Number processes: 8Number nodes: 1Number processes per node: 8Run Command: <executable>MPI Command: mpirun -n <number_processes> --bind-to core --map-by ppr:<number_processes>:node:PE=<OMP_NUM_THREADS>Dataset: Run Directory: .OMP_PROC_BIND: closeOMP_PLACES: coresOMP_NUM_THREADS: 1 |
---|---|
Run OMP2 | Number processes: 8Number nodes: 1Number processes per node: 8Run Command: <executable>MPI Command: mpirun -n <number_processes> --bind-to core --map-by ppr:<number_processes>:node:PE=<OMP_NUM_THREADS>Dataset: Run Directory: .OMP_NUM_THREADS: 2OMP_PROC_BIND: closeOMP_PLACES: cores |
Run OMP4 | Number processes: 8Number nodes: 1Number processes per node: 8Run Command: <executable>MPI Command: mpirun -n <number_processes> --bind-to core --map-by ppr:<number_processes>:node:PE=<OMP_NUM_THREADS>Dataset: Run Directory: .OMP_NUM_THREADS: 4OMP_PROC_BIND: closeOMP_PLACES: cores |
Run OMP8 | Number processes: 8Number nodes: 1Number processes per node: 8Run Command: <executable>MPI Command: mpirun -n <number_processes> --bind-to core --map-by ppr:<number_processes>:node:PE=<OMP_NUM_THREADS>Dataset: Run Directory: .OMP_NUM_THREADS: 8OMP_PROC_BIND: closeOMP_PLACES: cores |
Run OMP16 | Number processes: 8Number nodes: 1Number processes per node: 8Run Command: <executable>MPI Command: mpirun -n <number_processes> --bind-to core --map-by ppr:<number_processes>:node:PE=<OMP_NUM_THREADS>Dataset: Run Directory: .OMP_NUM_THREADS: 16OMP_PROC_BIND: closeOMP_PLACES: cores |
Run OMP24 | Number processes: 8Number nodes: 1Number processes per node: 8Run Command: <executable>MPI Command: mpirun -n <number_processes> --bind-to core --map-by ppr:<number_processes>:node:PE=<OMP_NUM_THREADS>Dataset: Run Directory: .OMP_NUM_THREADS: 24OMP_PROC_BIND: closeOMP_PLACES: cores |
Name | Module | Coverage OMP1 (%) | Coverage OMP2 (%) | Coverage OMP4 (%) | Coverage OMP8 (%) | Coverage OMP16 (%) | Coverage OMP24 (%) | Coverage Excluding Loops OMP1 (%) | Coverage Excluding Loops OMP2 (%) | Coverage Excluding Loops OMP4 (%) | Coverage Excluding Loops OMP8 (%) | Coverage Excluding Loops OMP16 (%) | Coverage Excluding Loops OMP24 (%) | Max Inclusive Time Over Threads OMP1 (s) | Max Inclusive Time Over Threads OMP2 (s) | Max Inclusive Time Over Threads OMP4 (s) | Max Inclusive Time Over Threads OMP8 (s) | Max Inclusive Time Over Threads OMP16 (s) | Max Inclusive Time Over Threads OMP24 (s) | Max Exclusive Time Over Threads OMP1 (s) | Max Exclusive Time Over Threads OMP2 (s) | Max Exclusive Time Over Threads OMP4 (s) | Max Exclusive Time Over Threads OMP8 (s) | Max Exclusive Time Over Threads OMP16 (s) | Max Exclusive Time Over Threads OMP24 (s) | Inclusive Time w.r.t. Wall Time OMP1 (s) | Inclusive Time w.r.t. Wall Time OMP2 (s) | Inclusive Time w.r.t. Wall Time OMP4 (s) | Inclusive Time w.r.t. Wall Time OMP8 (s) | Inclusive Time w.r.t. Wall Time OMP16 (s) | Inclusive Time w.r.t. Wall Time OMP24 (s) | Exclusive Time w.r.t. Wall Time OMP1 (s) | Exclusive Time w.r.t. Wall Time OMP2 (s) | Exclusive Time w.r.t. Wall Time OMP4 (s) | Exclusive Time w.r.t. Wall Time OMP8 (s) | Exclusive Time w.r.t. Wall Time OMP16 (s) | Exclusive Time w.r.t. Wall Time OMP24 (s) | Nb Threads OMP1 | Nb Threads OMP2 | Nb Threads OMP4 | Nb Threads OMP8 | Nb Threads OMP16 | Nb Threads OMP24 | Deviation (coverage) OMP1 | Deviation (coverage) OMP2 | Deviation (coverage) OMP4 | Deviation (coverage) OMP8 | Deviation (coverage) OMP16 | Deviation (coverage) OMP24 | Deviation (walltime) OMP1 | Deviation (walltime) OMP2 | Deviation (walltime) OMP4 | Deviation (walltime) OMP8 | Deviation (walltime) OMP16 | Deviation (walltime) OMP24 | Categories OMP1 | Categories OMP2 | Categories OMP4 | Categories OMP8 | Categories OMP16 | Categories OMP24 | GFLOPS OMP1 | GFLOPS OMP2 | GFLOPS OMP4 | GFLOPS OMP8 | GFLOPS OMP16 | GFLOPS OMP24 | Compilation Options | (OMP1) Efficiency | (OMP1) Potential Speed-Up (%) | (OMP2) Efficiency | (OMP2) Potential Speed-Up (%) | (OMP4) Efficiency | (OMP4) Potential Speed-Up (%) | (OMP8) Efficiency | (OMP8) Potential Speed-Up (%) | (OMP16) Efficiency | (OMP16) Potential Speed-Up (%) | (OMP24) Efficiency | (OMP24) Potential Speed-Up (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
○bli_dgemm_avx512_asm_8x24_macro_kernel | libblis-mt.so.5.0.0 | 86.46 | 85.22 | 81.28 | 78.62 | 66.48 | 62.30 | 0.00 | 0.01 | 0.00 | 0.01 | 0.01 | 0.00 | 642.55 | 346.15 | 190.47 | 113.61 | 64.27 | 44.48 | 0.00 | 0.13 | 0.00 | 0.13 | 0.18 | 0.00 | 628.90 | 349.52 | 196.68 | 124.35 | 71.96 | 53.84 | 0.00 | 0.03 | 0.00 | 0.01 | 0.01 | 0.00 | 8 | 16 | 32 | 64 | 128 | 192 | 1.50 | 2.42 | 4.32 | 4.29 | 7.82 | 4.55 | 10.85 | 9.52 | 8.64 | 4.53 | 8.13 | 2.49 | 95.04 | 170.99 | 303.93 | 480.88 | 830.63 | 1109.61 | 1 | 0 | 0.9 | 8.55 | 0.8 | 16.31 | 0.63 | 28.91 | 0.55 | 30.17 | 0.49 | 31.98 | |||||||
○HPL_rand | xhpl | 2.74 | 2.53 | 2.24 | 1.78 | 1.44 | 1.22 | 2.74 | 2.53 | 2.24 | 1.78 | 1.44 | 1.22 | 20.59 | 20.26 | 20.23 | 20.32 | 20.29 | 20.19 | 20.59 | 20.26 | 20.23 | 20.32 | 20.29 | 20.19 | 19.92 | 10.39 | 5.43 | 2.81 | 1.55 | 1.06 | 19.92 | 10.39 | 5.43 | 2.81 | 1.55 | 1.06 | 8 | 8 | 8 | 8 | 8 | 8 | 0.14 | 0.01 | 0.04 | 0.19 | 0.09 | 0.35 | 1.00 | 0.06 | 0.06 | 0.28 | 0.07 | 0.29 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 11.62 | 22.98 | 44.01 | 83.75 | 153.56 | 220.15 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /cluster/comp/aocc/5.0.0/bin/clang-17 -o HPL_rand.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS_REPORT -I /beegfs/hackathon/users/eoseret/linpack... | 1 | 0 | 0.96 | 0.1 | 0.92 | 0.19 | 0.88 | 0.2 | 0.8 | 0.29 | 0.79 | 0.26 |
○bli_dgemm_avx512_asm_8x24 | libblis-mt.so.5.0.0 | 1.42 | 1.41 | 1.32 | 1.26 | 1.10 | 1.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 21.89 | 22.21 | 19.52 | 14.19 | 13.79 | 10.40 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 10.34 | 5.77 | 3.20 | 2.00 | 1.19 | 0.91 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 5 | 7 | 14 | 15 | 30 | 38 | 1.05 | 2.73 | 3.28 | 4.74 | 5.33 | 6.33 | 7.62 | 10.50 | 7.16 | 6.56 | 4.68 | 4.23 | 97.16 | 175.91 | 311.86 | 490.40 | 840.38 | 1133.99 | 1 | 0 | 0.9 | 0.15 | 0.81 | 0.26 | 0.65 | 0.45 | 0.54 | 0.5 | 0.47 | 0.55 | |||||||
○uct_mm_iface_progress | libuct.so.0.0.0 | 1.27 | 0.81 | 0.61 | 0.30 | 0.30 | 0.19 | 1.27 | 0.81 | 0.61 | 0.30 | 0.30 | 0.19 | 13.43 | 8.28 | 9.57 | 4.01 | 8.63 | 3.85 | 13.43 | 8.28 | 9.57 | 4.01 | 8.63 | 3.85 | 9.24 | 3.33 | 1.47 | 0.48 | 0.33 | 0.16 | 9.24 | 3.33 | 1.47 | 0.48 | 0.33 | 0.16 | 8 | 8 | 8 | 8 | 8 | 8 | 0.36 | 0.25 | 1.05 | 0.29 | 2.15 | 0.57 | 2.60 | 1.04 | 2.55 | 0.45 | 2.33 | 0.50 | 9.79 | 19.00 | 33.51 | 54.45 | 97.36 | 124.64 | 1 | 0 | 1.39 | 0 | 1.57 | 0 | 2.42 | 0 | 1.76 | 0 | 2.34 | 0 | |||||||
○bli_dpackm_zen4_asm_8xk | libblis-mt.so.5.0.0 | 1.00 | 0.99 | 0.86 | 0.74 | 0.70 | 0.66 | 1.00 | 0.99 | 0.86 | 0.74 | 0.70 | 0.66 | 7.64 | 4.50 | 2.38 | 1.25 | 0.80 | 0.60 | 7.64 | 4.50 | 2.38 | 1.25 | 0.80 | 0.60 | 7.31 | 4.08 | 2.08 | 1.17 | 0.75 | 0.57 | 7.31 | 4.08 | 2.08 | 1.17 | 0.75 | 0.57 | 8 | 16 | 32 | 64 | 128 | 192 | 0.03 | 0.06 | 0.11 | 0.09 | 0.08 | 0.10 | 0.22 | 0.23 | 0.24 | 0.10 | 0.07 | 0.06 | 13.81 | 25.50 | 51.57 | 92.84 | 154.50 | 205.02 | 1 | 0 | 0.9 | 0.1 | 0.88 | 0.1 | 0.78 | 0.16 | 0.61 | 0.27 | 0.53 | 0.31 | |||||||
○bli_dgemmtrsm_l_zen4_asm_8x24 | libblis-mt.so.5.0.0 | 0.92 | 0.87 | 0.76 | 0.73 | 0.61 | 0.55 | 0.92 | 0.87 | 0.76 | 0.73 | 0.61 | 0.55 | 6.92 | 3.62 | 1.85 | 1.15 | 0.67 | 0.44 | 6.92 | 3.62 | 1.85 | 1.15 | 0.67 | 0.44 | 6.72 | 3.57 | 1.83 | 1.15 | 0.66 | 0.47 | 6.72 | 3.57 | 1.83 | 1.15 | 0.66 | 0.47 | 8 | 16 | 32 | 64 | 128 | 192 | 0.02 | 0.04 | 0.09 | 0.06 | 0.08 | 0.06 | 0.13 | 0.10 | 0.19 | 0.07 | 0.08 | 0.04 | 53.33 | 100.68 | 196.06 | 311.48 | 540.37 | 757.64 | 1 | 0 | 0.94 | 0.05 | 0.92 | 0.06 | 0.73 | 0.2 | 0.63 | 0.22 | 0.59 | 0.22 | |||||||
○bli_dpackm_zen4_asm_24xk | libblis-mt.so.5.0.0 | 0.56 | 0.63 | 0.71 | 0.72 | 0.67 | 0.73 | 0.56 | 0.63 | 0.71 | 0.72 | 0.67 | 0.73 | 4.31 | 2.69 | 1.91 | 1.17 | 0.77 | 0.62 | 4.31 | 2.69 | 1.91 | 1.17 | 0.77 | 0.62 | 4.05 | 2.60 | 1.71 | 1.15 | 0.73 | 0.63 | 4.05 | 2.60 | 1.71 | 1.15 | 0.73 | 0.63 | 8 | 16 | 32 | 64 | 128 | 192 | 0.02 | 0.04 | 0.12 | 0.06 | 0.08 | 0.08 | 0.12 | 0.13 | 0.28 | 0.08 | 0.08 | 0.05 | 3.68 | 5.67 | 8.64 | 12.83 | 20.38 | 23.43 | 1 | 0 | 0.78 | 0.14 | 0.59 | 0.29 | 0.44 | 0.4 | 0.35 | 0.44 | 0.27 | 0.53 | |||||||
►HPL_dlaswp01N | xhpl | 0.53 | 0.37 | 0.24 | 0.18 | 0.14 | 0.12 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 4.29 | 3.39 | 2.67 | 2.51 | 2.49 | 2.32 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3.89 | 1.53 | 0.58 | 0.29 | 0.15 | 0.10 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 8 | 8 | 8 | 8 | 8 | 8 | 0.05 | 0.08 | 0.17 | 0.27 | 0.41 | 0.45 | 0.33 | 0.34 | 0.40 | 0.43 | 0.44 | 0.39 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 3.83 | 9.72 | 25.44 | 51.97 | 99.47 | 145.11 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /cluster/comp/aocc/5.0.0/bin/clang-17 -o HPL_dlaswp01N.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS_REPORT -I /beegfs/hackathon/users/eoseret/li... | 1 | 0 | 1.27 | 0 | 1.66 | 0 | 1.7 | 0 | 1.63 | 0 | 1.58 | 0 |
►Loop 253 - HPL_dlaswp01N.c:158-191 - xhpl | 0.53 | 0.37 | 0.24 | 0.18 | 0.14 | 0.12 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 4.30 | 3.39 | 2.67 | 2.51 | 2.50 | 2.32 | 0.01 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 3.89 | 1.53 | 0.58 | 0.29 | 0.15 | 0.10 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 4 | 5 | 7 | 7 | 8 | 6 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 6.99 | 0.00 | 63.05 | 32.09 | 207.39 | 0.00 | ||||||||||||||||||||
○Loop 252 - HPL_dlaswp01N.c:160-191 - xhpl | 0.53 | 0.37 | 0.24 | 0.18 | 0.14 | 0.12 | 0.53 | 0.37 | 0.24 | 0.18 | 0.14 | 0.12 | 4.29 | 3.39 | 2.66 | 2.50 | 2.49 | 2.32 | 4.29 | 3.39 | 2.66 | 2.50 | 2.49 | 2.32 | 3.89 | 1.53 | 0.58 | 0.29 | 0.15 | 0.10 | 3.89 | 1.53 | 0.58 | 0.29 | 0.15 | 0.10 | 8 | 8 | 8 | 8 | 8 | 8 | 0.05 | 0.08 | 0.17 | 0.27 | 0.41 | 0.45 | 0.33 | 0.34 | 0.40 | 0.43 | 0.44 | 0.39 | 3.83 | 9.71 | 25.42 | 51.99 | 99.40 | 145.00 | 1 | 0 | 1.27 | 0 | 1.66 | 0 | 1.7 | 0 | 1.63 | 0 | 1.58 | 0 | ||||||||
►Loop 251 - HPL_dlaswp01N.c:198-203 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 250 - HPL_dlaswp01N.c:203-203 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►HPL_dlaswp04N | xhpl | 0.52 | 0.40 | 0.30 | 0.17 | 0.12 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3.85 | 3.68 | 3.09 | 2.23 | 1.99 | 1.67 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3.77 | 1.63 | 0.72 | 0.26 | 0.13 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 8 | 8 | 8 | 8 | 8 | 8 | 0.01 | 0.10 | 0.21 | 0.17 | 0.24 | 0.25 | 0.08 | 0.41 | 0.49 | 0.27 | 0.26 | 0.22 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 2.63 | 6.08 | 13.82 | 37.41 | 79.03 | 128.25 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /cluster/comp/aocc/5.0.0/bin/clang-17 -o HPL_dlaswp04N.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS_REPORT -I /beegfs/hackathon/users/eoseret/li... | 1 | 0 | 1.15 | 0 | 1.31 | 0 | 1.78 | 0 | 1.88 | 0 | 2.03 | 0 |
►Loop 301 - HPL_dlaswp04N.c:177-260 - xhpl | 0.52 | 0.40 | 0.30 | 0.17 | 0.12 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3.85 | 3.69 | 3.09 | 2.23 | 1.99 | 1.67 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 3.77 | 1.63 | 0.72 | 0.26 | 0.13 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 3 | 6 | 7 | 7 | 7 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.94 | 25.93 | 34.23 | 77.83 | 131.64 | ||||||||||||||||||||
○Loop 303 - HPL_dlaswp04N.c:180-226 - xhpl | 0.52 | 0.40 | 0.30 | 0.17 | 0.12 | 0.09 | 0.52 | 0.40 | 0.30 | 0.17 | 0.12 | 0.09 | 3.85 | 3.68 | 3.09 | 2.22 | 1.99 | 1.67 | 3.85 | 3.68 | 3.09 | 2.22 | 1.99 | 1.67 | 3.77 | 1.63 | 0.72 | 0.26 | 0.13 | 0.08 | 3.77 | 1.63 | 0.72 | 0.26 | 0.13 | 0.08 | 8 | 8 | 8 | 8 | 8 | 8 | 0.01 | 0.10 | 0.21 | 0.17 | 0.24 | 0.25 | 0.08 | 0.41 | 0.49 | 0.26 | 0.26 | 0.22 | 2.63 | 6.08 | 13.81 | 37.43 | 79.03 | 128.29 | 1 | 0 | 1.15 | 0 | 1.31 | 0 | 1.78 | 0 | 1.88 | 0 | 2.03 | 0 | ||||||||
○Loop 302 - HPL_dlaswp04N.c:230-260 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 300 - HPL_dlaswp04N.c:267-273 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 1 | 0 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 299 - HPL_dlaswp04N.c:272-273 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 1 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 298 - HPL_dlaswp04N.c:275-279 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 297 - HPL_dlaswp04N.c:279-279 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○uct_rc_mlx5_iface_progress_cyclic | libuct_ib_mlx5.so.0.0.0 | 0.49 | 0.35 | 0.29 | 0.17 | 0.16 | 0.10 | 0.49 | 0.35 | 0.29 | 0.17 | 0.16 | 0.10 | 4.87 | 4.08 | 4.11 | 2.35 | 3.89 | 1.84 | 4.87 | 4.08 | 4.11 | 2.35 | 3.89 | 1.84 | 3.59 | 1.42 | 0.70 | 0.28 | 0.18 | 0.09 | 3.59 | 1.42 | 0.70 | 0.28 | 0.18 | 0.09 | 8 | 8 | 8 | 8 | 8 | 8 | 0.12 | 0.15 | 0.41 | 0.13 | 0.96 | 0.19 | 0.86 | 0.63 | 0.98 | 0.21 | 1.04 | 0.17 | 9.91 | 19.41 | 33.81 | 57.39 | 84.43 | 111.80 | 1 | 0 | 1.26 | 0 | 1.28 | 0 | 1.62 | 0 | 1.28 | 0 | 1.74 | 0 | |||||||
○HPL_bcast | xhpl | 0.43 | 0.23 | 0.22 | 0.11 | 0.16 | 0.09 | 0.43 | 0.23 | 0.22 | 0.11 | 0.16 | 0.09 | 7.49 | 2.95 | 6.02 | 1.54 | 5.80 | 1.84 | 7.49 | 2.95 | 6.02 | 1.54 | 5.80 | 1.84 | 3.13 | 0.93 | 0.54 | 0.18 | 0.18 | 0.08 | 3.13 | 0.93 | 0.54 | 0.18 | 0.18 | 0.08 | 8 | 8 | 8 | 8 | 8 | 8 | 0.26 | 0.18 | 0.74 | 0.25 | 1.92 | 0.25 | 1.90 | 0.75 | 1.80 | 0.39 | 2.08 | 0.22 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 7.54 | 12.92 | 27.75 | 24.67 | 105.79 | 138.86 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /cluster/comp/aocc/5.0.0/bin/clang-17 -o HPL_bcast.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS_REPORT -I /beegfs/hackathon/users/eoseret/linpac... | 1 | 0 | 1.69 | 0 | 1.44 | 0 | 2.17 | 0 | 1.1 | 0 | 1.6 | 0 |
○__GI___strcasecmp_l_sse2 | libc.so.6 | 0.41 | 0.27 | 0.19 | 0.13 | 0.12 | 0.10 | 0.41 | 0.27 | 0.19 | 0.13 | 0.12 | 0.10 | 3.11 | 2.36 | 1.90 | 1.51 | 2.09 | 1.89 | 3.11 | 2.36 | 1.90 | 1.51 | 2.09 | 1.89 | 3.00 | 1.12 | 0.47 | 0.20 | 0.13 | 0.09 | 3.00 | 1.12 | 0.47 | 0.20 | 0.13 | 0.09 | 8 | 12 | 16 | 14 | 16 | 19 | 0.01 | 0.26 | 0.37 | 0.46 | 0.82 | 1.00 | 0.07 | 1.08 | 0.90 | 0.73 | 0.88 | 0.86 | String (%): 100.00 | String (%): 100.00 | String (%): 100.00 | String (%): 100.00 | String (%): 100.00 | String (%): 100.00 | 2.07 | 5.36 | 12.45 | 29.45 | 45.90 | 67.19 | 1 | 0 | 1.34 | 0 | 1.59 | 0 | 1.87 | 0 | 1.44 | 0 | 1.38 | 0 | |
►HPL_pdlange | xhpl | 0.38 | 0.17 | 0.09 | 0.04 | 0.03 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2.78 | 1.39 | 0.84 | 0.50 | 0.47 | 0.43 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2.74 | 0.70 | 0.21 | 0.07 | 0.03 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 8 | 8 | 8 | 8 | 8 | 8 | 0.01 | 0.01 | 0.02 | 0.01 | 0.02 | 0.01 | 0.04 | 0.03 | 0.04 | 0.02 | 0.02 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 2.28 | 8.88 | 29.38 | 91.12 | 182.19 | 277.59 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /cluster/comp/aocc/5.0.0/bin/clang-17 -o HPL_pdlange.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS_REPORT -I /beegfs/hackathon/users/eoseret/linp... | 1 | 0 | 1.95 | 0 | 3.23 | 0 | 5 | 0 | 5 | 0 | 5.08 | 0 |
►Loop 114 - HPL_pdlange.c:170-174 - xhpl | 0.18 | 0.08 | 0.04 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.36 | 0.66 | 0.41 | 0.23 | 0.21 | 0.19 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.33 | 0.33 | 0.10 | 0.03 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2 | 1 | 2 | 2 | 1 | 3 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3.99 | 0.00 | 0.00 | 0.00 | 25.96 | 112.87 | ||||||||||||||||||||
○Loop 115 - HPL_pdlange.c:173-173 - xhpl | 0.18 | 0.08 | 0.04 | 0.02 | 0.01 | 0.01 | 0.18 | 0.08 | 0.04 | 0.02 | 0.01 | 0.01 | 1.36 | 0.66 | 0.41 | 0.23 | 0.21 | 0.19 | 1.36 | 0.66 | 0.41 | 0.23 | 0.21 | 0.19 | 1.33 | 0.33 | 0.10 | 0.03 | 0.01 | 0.01 | 1.33 | 0.33 | 0.10 | 0.03 | 0.01 | 0.01 | 8 | 8 | 8 | 8 | 8 | 8 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.04 | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 1.88 | 7.56 | 25.24 | 80.25 | 166.78 | 257.59 | 1 | 0 | 2.01 | 0 | 3.35 | 0 | 5.33 | 0 | 5.53 | 0 | 5.7 | 0 | ||||||||
○Loop 116 - HPL_pdlange.c:173-173 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 113 - HPL_pdlange.c:173-173 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 1 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 111 - HPL_pdlange.c:208-212 - xhpl | 0.19 | 0.09 | 0.05 | 0.02 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.42 | 0.73 | 0.44 | 0.27 | 0.26 | 0.24 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.42 | 0.37 | 0.11 | 0.04 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 112 - HPL_pdlange.c:210-211 - xhpl | 0.19 | 0.09 | 0.05 | 0.02 | 0.02 | 0.01 | 0.19 | 0.09 | 0.05 | 0.02 | 0.02 | 0.01 | 1.42 | 0.73 | 0.44 | 0.27 | 0.26 | 0.24 | 1.42 | 0.73 | 0.44 | 0.27 | 0.26 | 0.24 | 1.42 | 0.37 | 0.11 | 0.04 | 0.02 | 0.01 | 1.42 | 0.37 | 0.11 | 0.04 | 0.02 | 0.01 | 8 | 8 | 8 | 8 | 8 | 8 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.01 | 0.02 | 0.01 | 0.01 | 0.00 | 2.65 | 10.04 | 32.97 | 99.94 | 194.50 | 293.19 | 1 | 0 | 1.9 | 0 | 3.12 | 0 | 4.72 | 0 | 4.59 | 0 | 4.62 | 0 | ||||||||
○Loop 110 - HPL_pdlange.c:210-211 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 2 | 1 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 118 - HPL_pdlange.c:149-152 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 117 - HPL_pdlange.c:151-152 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○mca_pml_ucx_iprobe | mca_pml_ucx.so | 0.37 | 0.24 | 0.15 | 0.12 | 0.15 | 0.09 | 0.37 | 0.00 | 0.15 | 0.12 | 0.15 | 0.09 | 3.61 | 2.66 | 3.24 | 1.98 | 3.44 | 1.67 | 3.61 | 0.00 | 3.24 | 1.98 | 3.44 | 1.67 | 2.67 | 0.99 | 0.36 | 0.20 | 0.17 | 0.07 | 2.67 | 0.00 | 0.36 | 0.20 | 0.17 | 0.07 | 8 | 8 | 8 | 8 | 8 | 8 | 0.09 | 0.10 | 0.41 | 0.21 | 0.94 | 0.27 | 0.66 | 0.40 | 0.99 | 0.34 | 1.01 | 0.23 | 6.26 | 13.76 | 26.66 | 48.45 | 74.10 | 104.35 | 1 | 0 | 1.35 | 0 | 1.84 | 0 | 1.7 | 0 | 1.01 | -0 | 1.51 | 0 | |||||||
►HPL_dlaswp02N | xhpl | 0.33 | 0.20 | 0.12 | 0.08 | 0.06 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2.73 | 2.02 | 1.45 | 1.13 | 1.08 | 1.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2.40 | 0.81 | 0.29 | 0.13 | 0.07 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 8 | 8 | 8 | 8 | 8 | 8 | 0.03 | 0.08 | 0.11 | 0.11 | 0.15 | 0.17 | 0.19 | 0.33 | 0.27 | 0.18 | 0.17 | 0.14 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 2.05 | 6.12 | 17.10 | 37.05 | 74.15 | 108.59 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /cluster/comp/aocc/5.0.0/bin/clang-17 -o HPL_dlaswp02N.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS_REPORT -I /beegfs/hackathon/users/eoseret/li... | 1 | 0 | 1.48 | 0 | 2.07 | 0 | 2.25 | 0 | 2.26 | 0 | 2.2 | 0 |
○Loop 292 - HPL_dlaswp02N.c:151-152 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 291 - HPL_dlaswp02N.c:151-152 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 286 - HPL_dlaswp02N.c:151-152 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 288 - HPL_dlaswp02N.c:196-199 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 287 - HPL_dlaswp02N.c:199-199 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 1 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 290 - HPL_dlaswp02N.c:157-189 - xhpl | 0.33 | 0.20 | 0.12 | 0.08 | 0.06 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2.73 | 2.02 | 1.45 | 1.13 | 1.08 | 1.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2.40 | 0.81 | 0.29 | 0.13 | 0.07 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 1 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 289 - HPL_dlaswp02N.c:160-189 - xhpl | 0.33 | 0.20 | 0.12 | 0.08 | 0.06 | 0.05 | 0.33 | 0.20 | 0.12 | 0.08 | 0.06 | 0.05 | 2.73 | 2.02 | 1.45 | 1.13 | 1.08 | 1.01 | 2.73 | 2.02 | 1.45 | 1.13 | 1.08 | 1.01 | 2.40 | 0.81 | 0.29 | 0.13 | 0.07 | 0.05 | 2.40 | 0.81 | 0.29 | 0.13 | 0.07 | 0.05 | 8 | 8 | 8 | 8 | 8 | 8 | 0.03 | 0.08 | 0.11 | 0.11 | 0.15 | 0.17 | 0.19 | 0.33 | 0.27 | 0.18 | 0.17 | 0.14 | 2.05 | 6.12 | 17.10 | 37.05 | 74.13 | 108.59 | 1 | 0 | 1.48 | 0 | 2.07 | 0 | 2.25 | 0 | 2.26 | 0 | 2.2 | 0 | ||||||||
○bli_daxpyf_zen_int32_avx512 | libblis-mt.so.5.0.0 | 0.29 | 0.13 | 0.07 | 0.03 | 0.02 | 0.02 | 0.29 | 0.13 | 0.07 | 0.03 | 0.02 | 0.02 | 2.15 | 1.08 | 0.69 | 0.38 | 0.33 | 0.31 | 2.15 | 1.08 | 0.69 | 0.38 | 0.33 | 0.31 | 2.15 | 0.55 | 0.17 | 0.05 | 0.02 | 0.02 | 2.15 | 0.55 | 0.17 | 0.05 | 0.02 | 0.02 | 8 | 8 | 8 | 8 | 8 | 8 | 0.00 | 0.00 | 0.03 | 0.01 | 0.02 | 0.00 | 0.01 | 0.01 | 0.07 | 0.02 | 0.02 | 0.00 | 1.85 | 7.24 | 23.03 | 78.11 | 166.55 | 248.86 | 1 | 0 | 1.95 | 0 | 3.11 | 0 | 5.27 | 0 | 5.62 | 0 | 5.58 | 0 | |||||||
○HPL_bcast_1ring | xhpl | 0.27 | 0.16 | 0.12 | 0.08 | 0.11 | 0.06 | 0.27 | 0.16 | 0.12 | 0.08 | 0.11 | 0.06 | 2.46 | 1.59 | 2.51 | 1.27 | 2.97 | 1.21 | 2.46 | 1.59 | 2.51 | 1.27 | 2.97 | 1.21 | 1.96 | 0.65 | 0.29 | 0.13 | 0.12 | 0.05 | 1.96 | 0.65 | 0.29 | 0.13 | 0.12 | 0.05 | 8 | 8 | 8 | 8 | 8 | 8 | 0.06 | 0.06 | 0.29 | 0.10 | 0.81 | 0.16 | 0.45 | 0.23 | 0.70 | 0.16 | 0.88 | 0.13 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 6.07 | 9.65 | 26.25 | 32.40 | 84.60 | 114.28 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /cluster/comp/aocc/5.0.0/bin/clang-17 -o HPL_1ring.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS_REPORT -I /beegfs/hackathon/users/eoseret/linpac... | 1 | 0 | 1.52 | 0 | 1.68 | 0 | 1.88 | 0 | 1.06 | 0 | 1.53 | 0 |
○ucp_worker_progress | libucp.so.0.0.0 | 0.22 | 0.15 | 0.16 | 0.09 | 0.09 | 0.06 | 0.22 | 0.15 | 0.16 | 0.09 | 0.09 | 0.06 | 2.29 | 1.55 | 2.48 | 1.26 | 2.28 | 1.06 | 2.29 | 1.55 | 2.48 | 1.26 | 2.28 | 1.06 | 1.62 | 0.63 | 0.39 | 0.14 | 0.10 | 0.05 | 1.62 | 0.63 | 0.39 | 0.14 | 0.10 | 0.05 | 8 | 8 | 8 | 8 | 8 | 8 | 0.05 | 0.08 | 0.27 | 0.13 | 0.58 | 0.10 | 0.39 | 0.31 | 0.65 | 0.20 | 0.63 | 0.09 | 8.18 | 14.65 | 31.30 | 53.87 | 86.14 | 115.30 | 1 | 0 | 1.29 | 0 | 1.04 | 0 | 1.49 | 0 | 1.02 | -0 | 1.31 | 0 | |||||||
○bli_dgemmsup_rv_zen4_asm_24x8m | libblis-mt.so.5.0.0 | 0.17 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.17 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.33 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.33 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.27 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.27 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 8 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | NA | NA | NA | NA | NA | 67.84 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | ||
►HPL_dlaswp03N | xhpl | 0.16 | 0.11 | 0.06 | 0.04 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.22 | 0.94 | 0.73 | 0.44 | 0.39 | 0.36 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.17 | 0.45 | 0.16 | 0.06 | 0.03 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 8 | 8 | 8 | 8 | 8 | 8 | 0.00 | 0.02 | 0.04 | 0.02 | 0.03 | 0.02 | 0.03 | 0.07 | 0.10 | 0.03 | 0.03 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 4.39 | 11.38 | 32.39 | 91.72 | 193.23 | 287.30 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /cluster/comp/aocc/5.0.0/bin/clang-17 -o HPL_dlaswp03N.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS_REPORT -I /beegfs/hackathon/users/eoseret/li... | 1 | 0 | 1.29 | 0 | 1.85 | 0 | 2.62 | 0 | 2.74 | 0 | 2.72 | 0 |
►Loop 296 - HPL_dlaswp03N.c:144-177 - xhpl | 0.16 | 0.11 | 0.06 | 0.04 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.22 | 0.95 | 0.73 | 0.44 | 0.40 | 0.36 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 1.17 | 0.45 | 0.16 | 0.06 | 0.03 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 0 | 1 | 1 | 2 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 295 - HPL_dlaswp03N.c:147-177 - xhpl | 0.16 | 0.11 | 0.06 | 0.04 | 0.02 | 0.02 | 0.16 | 0.11 | 0.06 | 0.04 | 0.02 | 0.02 | 1.22 | 0.94 | 0.73 | 0.44 | 0.39 | 0.36 | 1.22 | 0.94 | 0.73 | 0.44 | 0.39 | 0.36 | 1.17 | 0.45 | 0.16 | 0.06 | 0.03 | 0.02 | 1.17 | 0.45 | 0.16 | 0.06 | 0.03 | 0.02 | 8 | 8 | 8 | 8 | 8 | 8 | 0.00 | 0.02 | 0.04 | 0.02 | 0.03 | 0.02 | 0.03 | 0.07 | 0.10 | 0.03 | 0.03 | 0.02 | 4.39 | 11.38 | 32.39 | 91.87 | 193.58 | 287.02 | 1 | 0 | 1.29 | 0 | 1.85 | 0 | 2.62 | 0 | 2.74 | 0 | 2.72 | 0 | ||||||||
►Loop 294 - HPL_dlaswp03N.c:184-188 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 293 - HPL_dlaswp03N.c:188-188 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 2 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○bli_dgemmsup_rv_zen5_asm_24x8m | libblis-mt.so.5.0.0 | 0.14 | 0.34 | 0.43 | 0.33 | 0.30 | 0.27 | 0.00 | 0.34 | 0.43 | 0.00 | 0.30 | 0.00 | 1.05 | 1.50 | 1.13 | 0.64 | 0.40 | 0.27 | 0.00 | 1.50 | 1.13 | 0.00 | 0.40 | 0.01 | 1.01 | 1.39 | 1.04 | 0.53 | 0.32 | 0.24 | 0.00 | 1.39 | 1.04 | 0.00 | 0.32 | 0.00 | 8 | 16 | 32 | 64 | 128 | 192 | 0.00 | 0.02 | 0.04 | 0.04 | 0.06 | 0.05 | 0.03 | 0.06 | 0.10 | 0.06 | 0.06 | 0.03 | 91.70 | 128.60 | 173.18 | 340.82 | 554.97 | 759.20 | 1 | 0 | 0.36 | 0.22 | 0.25 | 0.32 | 0.24 | 0.25 | 0.2 | 0.24 | 0.18 | 0.22 | |||||||
○mca_pml_ucx_send | mca_pml_ucx.so | 0.09 | 0.07 | 0.15 | 0.05 | 0.06 | 0.04 | 0.09 | 0.02 | 0.02 | 0.00 | 0.04 | 0.04 | 1.09 | 0.73 | 2.22 | 0.79 | 1.55 | 0.75 | 1.09 | 0.11 | 0.12 | 0.00 | 0.08 | 0.75 | 0.62 | 0.30 | 0.35 | 0.08 | 0.07 | 0.03 | 0.62 | 0.08 | 0.06 | 0.00 | 0.04 | 0.03 | 8 | 8 | 8 | 8 | 8 | 8 | 0.03 | 0.03 | 0.22 | 0.08 | 0.35 | 0.09 | 0.22 | 0.13 | 0.53 | 0.13 | 0.38 | 0.08 | 10.01 | 19.71 | 40.17 | 68.86 | 123.19 | 186.34 | 1 | 0 | 1.02 | -0 | 0.44 | 0.08 | 0.92 | 0 | 0.57 | 0.03 | 0.78 | 0.01 | |||||||
►HPL_dlacpy | xhpl | 0.08 | 0.08 | 0.07 | 0.05 | 0.04 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.68 | 0.70 | 0.72 | 0.59 | 0.58 | 0.55 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.58 | 0.33 | 0.17 | 0.08 | 0.04 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 8 | 8 | 8 | 8 | 8 | 8 | 0.01 | 0.01 | 0.04 | 0.02 | 0.03 | 0.03 | 0.07 | 0.04 | 0.09 | 0.03 | 0.03 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 17.60 | 31.17 | 59.89 | 132.62 | 247.46 | 373.34 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /cluster/comp/aocc/5.0.0/bin/clang-17 -o HPL_dlacpy.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS_REPORT -I /beegfs/hackathon/users/eoseret/linpa... | 1 | 0 | 0.88 | 0.01 | 0.85 | 0.01 | 0.94 | 0 | 0.87 | 0 | 0.88 | 0 |
►Loop 175 - HPL_dlacpy.c:167-304 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 176 - HPL_dlacpy.c:294-294 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 174 - HPL_dlacpy.c:289-294 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 171 - HPL_dlacpy.c:311-337 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 169 - HPL_dlacpy.c:313-321 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 172 - HPL_dlacpy.c:337-337 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 173 - HPL_dlacpy.c:313-321 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 170 - HPL_dlacpy.c:337-337 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 167 - HPL_dlacpy.c:311-321 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 168 - HPL_dlacpy.c:316-321 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 166 - HPL_dlacpy.c:313-321 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 164 - HPL_dlacpy.c:311-337 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 165 - HPL_dlacpy.c:337-337 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 163 - HPL_dlacpy.c:337-337 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 178 - HPL_dlacpy.c:167-304 - xhpl [...] | 0.08 | 0.08 | 0.07 | 0.05 | 0.04 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.66 | 0.75 | 0.75 | 0.62 | 0.61 | 0.57 | 0.11 | 0.12 | 0.11 | 0.11 | 0.10 | 0.11 | 0.57 | 0.33 | 0.17 | 0.08 | 0.04 | 0.03 | 0.08 | 0.04 | 0.02 | 0.01 | 0.01 | 0.00 | 8 | 8 | 8 | 8 | 8 | 8 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.02 | 23.17 | 43.70 | 102.25 | 167.15 | 293.29 | 441.07 | 1 | 0 | 0.96 | 0 | 1.1 | -0 | 0.88 | 0 | 0.79 | 0 | 0.8 | 0 | ||||||||
○Loop 177 - HPL_dlacpy.c:169-280 - xhpl [...] | 0.07 | 0.07 | 0.06 | 0.04 | 0.03 | 0.03 | 0.07 | 0.07 | 0.06 | 0.04 | 0.03 | 0.03 | 0.56 | 0.63 | 0.64 | 0.50 | 0.51 | 0.46 | 0.56 | 0.63 | 0.64 | 0.50 | 0.51 | 0.46 | 0.50 | 0.29 | 0.15 | 0.07 | 0.04 | 0.02 | 0.50 | 0.29 | 0.15 | 0.07 | 0.04 | 0.02 | 8 | 8 | 8 | 8 | 8 | 8 | 0.01 | 0.01 | 0.03 | 0.02 | 0.03 | 0.03 | 0.05 | 0.04 | 0.08 | 0.03 | 0.03 | 0.02 | 16.81 | 29.48 | 55.10 | 126.31 | 238.41 | 364.15 | 1 | 0 | 0.87 | 0.01 | 0.82 | 0.01 | 0.94 | 0 | 0.88 | 0 | 0.9 | 0 | ||||||||
○Loop 179 - HPL_dlacpy.c:174-195 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 182 - HPL_dlacpy.c:167-304 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3 | 3 | 3 | 2 | 2 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 6.99 | 7.76 | 18.54 | 0.00 | 0.00 | 45.17 | ||||||||||||||||||||
○Loop 184 - HPL_dlacpy.c:169-195 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 183 - HPL_dlacpy.c:294-294 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 181 - HPL_dlacpy.c:289-294 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2 | 1 | 2 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 180 - HPL_dlacpy.c:169-280 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 9.19 | 26.18 | 237.32 | 0.00 | 570.82 | 275.94 | ||||||||||||||||||||
○MPI_Iprobe | libmpi.so.40.30.7 | 0.08 | 0.05 | 0.03 | 0.03 | 0.03 | 0.02 | 0.08 | 0.05 | 0.03 | 0.03 | 0.03 | 0.00 | 0.70 | 0.68 | 0.73 | 0.41 | 0.80 | 0.44 | 0.70 | 0.68 | 0.73 | 0.41 | 0.80 | 0.00 | 0.55 | 0.22 | 0.08 | 0.04 | 0.03 | 0.02 | 0.55 | 0.22 | 0.08 | 0.04 | 0.03 | 0.00 | 8 | 8 | 8 | 8 | 8 | 8 | 0.02 | 0.03 | 0.10 | 0.04 | 0.23 | 0.10 | 0.17 | 0.12 | 0.23 | 0.06 | 0.25 | 0.09 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 7.67 | 10.62 | 29.32 | 36.51 | 89.31 | 109.47 | 1 | 0 | 1.27 | 0 | 1.64 | 0 | 1.62 | 0 | 1.17 | -0 | 1.42 | 0 | |
○unknown_kernel_region | kernel | 0.05 | 0.09 | 0.13 | 0.15 | 0.23 | 0.24 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.42 | 0.53 | 0.40 | 0.33 | 0.38 | 0.31 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.37 | 0.39 | 0.32 | 0.23 | 0.25 | 0.21 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 15 | 23 | 39 | 71 | 134 | 198 | 36.69 | 30.35 | 27.16 | 14.31 | 7.98 | 7.46 | 0.18 | 0.19 | 0.13 | 0.08 | 0.08 | 0.05 | System (%): 90.99 MPI (%): 9.01 | System (%): 51.99 OMP (%): 43.11 MPI (%): 4.73 Math (%): 0.08 Pthread (%): 0.08 | OMP (%): 60.14 System (%): 37.20 MPI (%): 2.51 Pthread (%): 0.16 | OMP (%): 67.34 System (%): 30.99 MPI (%): 1.60 Pthread (%): 0.08 | OMP (%): 76.00 System (%): 22.97 MPI (%): 0.90 Pthread (%): 0.13 | OMP (%): 77.55 System (%): 21.55 MPI (%): 0.65 Pthread (%): 0.25 | 17.85 | 17.50 | 22.73 | 35.11 | 35.22 | 46.03 | 1 | 0 | 0.48 | 0.05 | 0.29 | 0.09 | 0.2 | 0.12 | 0.09 | 0.21 | 0.07 | 0.23 | |
○unknown_function | xhpl | 0.05 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.56 | 0.29 | 0.37 | 0.13 | 0.39 | 0.16 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.33 | 0.10 | 0.04 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 8 | 8 | 8 | 8 | 8 | 8 | 0.02 | 0.02 | 0.04 | 0.02 | 0.12 | 0.02 | 0.15 | 0.06 | 0.10 | 0.03 | 0.13 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 8.05 | 16.01 | 29.12 | 51.31 | 115.34 | 137.85 | 1 | 0 | 1.73 | 0 | 1.94 | 0 | 3.35 | 0 | 1.59 | 0 | 1.93 | 0 | |
○unknown_function | mca_pml_ucx.so | 0.04 | 0.02 | 0.03 | 0.01 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.51 | 0.33 | 0.76 | 0.22 | 0.63 | 0.25 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.30 | 0.10 | 0.07 | 0.02 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 8 | 8 | 8 | 8 | 8 | 8 | 0.02 | 0.02 | 0.08 | 0.02 | 0.19 | 0.06 | 0.12 | 0.09 | 0.20 | 0.04 | 0.20 | 0.05 | 8.73 | 17.87 | 31.82 | 62.44 | 98.52 | 110.11 | 1 | 0 | 1.56 | 0 | 1.04 | -0 | 1.98 | 0 | 1.04 | -0 | 1.2 | -0 | |||||||
►HPL_pdgesv0 | xhpl | 0.04 | 0.02 | 0.02 | 0.01 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.61 | 0.36 | 0.62 | 0.28 | 0.44 | 0.18 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.29 | 0.07 | 0.05 | 0.02 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 8 | 8 | 8 | 8 | 8 | 8 | 0.02 | 0.02 | 0.08 | 0.05 | 0.12 | 0.04 | 0.18 | 0.10 | 0.19 | 0.08 | 0.13 | 0.04 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 3.12 | 6.47 | 25.73 | 22.18 | 72.34 | 87.73 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /cluster/comp/aocc/5.0.0/bin/clang-17 -o HPL_pdgesv0.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS_REPORT -I /beegfs/hackathon/users/eoseret/linp... | 1 | 0 | 2.07 | 0 | 1.34 | 0 | 1.7 | 0 | 0.93 | 0 | 1.74 | 0 |
►Loop 369 - HPL_pdgesv0.c:121-156 - xhpl [...] | 0.04 | 0.02 | 0.02 | 0.01 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.61 | 0.36 | 0.62 | 0.28 | 0.44 | 0.18 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.29 | 0.07 | 0.05 | 0.02 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 2 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 368 - HPL_pdgesv0.c:149-150 - xhpl | 0.04 | 0.02 | 0.02 | 0.01 | 0.02 | 0.01 | 0.04 | 0.02 | 0.02 | 0.01 | 0.02 | 0.01 | 0.61 | 0.35 | 0.62 | 0.28 | 0.44 | 0.18 | 0.61 | 0.35 | 0.62 | 0.28 | 0.44 | 0.18 | 0.29 | 0.07 | 0.05 | 0.02 | 0.02 | 0.01 | 0.29 | 0.07 | 0.05 | 0.02 | 0.02 | 0.01 | 8 | 8 | 8 | 8 | 8 | 8 | 0.02 | 0.02 | 0.08 | 0.05 | 0.12 | 0.04 | 0.18 | 0.10 | 0.19 | 0.08 | 0.13 | 0.04 | 3.12 | 6.53 | 25.73 | 22.18 | 72.34 | 87.73 | 1 | 0 | 2.08 | 0 | 1.34 | 0 | 1.7 | 0 | 0.93 | 0 | 1.74 | 0 | ||||||||
○opal_progress | libopen-pal.so.40.30.4 | 0.04 | 0.02 | 0.04 | 0.01 | 0.01 | 0.01 | 0.04 | 0.02 | 0.04 | 0.01 | 0.01 | 0.01 | 0.53 | 0.34 | 1.23 | 0.22 | 0.36 | 0.19 | 0.53 | 0.34 | 1.23 | 0.22 | 0.36 | 0.19 | 0.28 | 0.09 | 0.09 | 0.02 | 0.01 | 0.00 | 0.28 | 0.09 | 0.09 | 0.02 | 0.01 | 0.00 | 8 | 8 | 8 | 8 | 8 | 8 | 0.02 | 0.03 | 0.16 | 0.04 | 0.11 | 0.06 | 0.18 | 0.10 | 0.38 | 0.07 | 0.11 | 0.05 | 7.10 | 17.07 | 23.93 | 39.87 | 84.71 | 104.37 | 1 | 0 | 1.51 | 0 | 0.8 | 0.01 | 2.09 | 0 | 1.29 | -0 | 2.4 | 0 | |||||||
○ucp_tag_probe_nb | libucp.so.0.0.0 | 0.04 | 0.02 | 0.03 | 0.01 | 0.01 | 0.01 | 0.04 | 0.02 | 0.03 | 0.01 | 0.01 | 0.01 | 0.36 | 0.29 | 0.45 | 0.25 | 0.47 | 0.22 | 0.36 | 0.29 | 0.45 | 0.25 | 0.47 | 0.22 | 0.27 | 0.09 | 0.06 | 0.02 | 0.02 | 0.01 | 0.27 | 0.09 | 0.06 | 0.02 | 0.02 | 0.01 | 8 | 8 | 8 | 8 | 8 | 8 | 0.01 | 0.02 | 0.07 | 0.03 | 0.13 | 0.04 | 0.07 | 0.08 | 0.17 | 0.05 | 0.14 | 0.04 | 8.11 | 19.25 | 36.18 | 50.47 | 97.67 | 111.68 | 1 | 0 | 1.51 | 0 | 1.12 | -0 | 1.55 | 0 | 1.12 | -0 | 1.36 | -0 | |||||||
○HPL_lmul | xhpl | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.37 | 0.06 | 0.04 | 0.14 | 0.05 | 0.10 | 0.00 | 0.06 | 0.04 | 0.14 | 0.05 | 0.10 | 0.19 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 8 | 8 | 8 | 8 | 8 | 8 | 0.07 | 0.01 | 0.01 | 0.03 | 0.02 | 0.04 | 0.48 | 0.02 | 0.01 | 0.04 | 0.02 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 21.47 | 24.86 | 34.28 | 154.72 | 169.86 | 348.16 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /cluster/comp/aocc/5.0.0/bin/clang-17 -o HPL_lmul.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS_REPORT -I /beegfs/hackathon/users/eoseret/linpack... | 1 | 0 | 10.77 | 0 | 7.51 | 0 | 4.05 | 0 | 5.79 | 0 | 2.83 | 0 |
○bli_dtrsm_ll_ker_var2 | libblis-mt.so.5.0.0 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.21 | 0.06 | 0.04 | 0.03 | 0.02 | 0.02 | 0.21 | 0.06 | 0.04 | 0.03 | 0.02 | 0.02 | 0.15 | 0.04 | 0.02 | 0.01 | 0.01 | 0.00 | 0.15 | 0.04 | 0.02 | 0.01 | 0.01 | 0.00 | 8 | 16 | 32 | 64 | 128 | 191 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.03 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 7.15 | 15.05 | 46.94 | 128.45 | 266.84 | 430.16 | 1 | 0 | 2.02 | 0 | 2.51 | 0 | 2.22 | 0 | 1.75 | -0 | 1.99 | -0 | |||||||
○opal_timer_linux_get_cycles_sys_timer | libopen-pal.so.40.30.4 | 0.02 | 0.01 | 0.02 | 0.01 | 0.00 | 0.00 | 0.02 | 0.01 | 0.02 | 0.01 | 0.00 | 0.00 | 0.34 | 0.23 | 0.49 | 0.12 | 0.13 | 0.05 | 0.34 | 0.23 | 0.49 | 0.12 | 0.13 | 0.05 | 0.15 | 0.06 | 0.04 | 0.01 | 0.01 | 0.00 | 0.15 | 0.06 | 0.04 | 0.01 | 0.01 | 0.00 | 8 | 8 | 8 | 8 | 8 | 8 | 0.02 | 0.02 | 0.07 | 0.02 | 0.03 | 0.02 | 0.13 | 0.09 | 0.16 | 0.04 | 0.04 | 0.02 | 6.72 | 11.53 | 22.09 | 35.51 | 72.47 | 100.86 | 1 | 0 | 1.32 | -0 | 1 | 0 | 2.03 | 0 | 1.72 | -0 | 3.96 | 0 | |||||||
○HPL_ladd | xhpl | 0.02 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.02 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 1.08 | 0.12 | 0.07 | 0.52 | 0.08 | 0.51 | 1.08 | 0.12 | 0.07 | 0.52 | 0.08 | 0.51 | 0.15 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.15 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 8 | 8 | 8 | 8 | 8 | 8 | 0.05 | 0.01 | 0.01 | 0.11 | 0.02 | 0.22 | 0.38 | 0.04 | 0.03 | 0.18 | 0.03 | 0.19 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 17.43 | 10.80 | 13.29 | 160.97 | 67.94 | 410.07 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /cluster/comp/aocc/5.0.0/bin/clang-17 -o HPL_ladd.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS_REPORT -I /beegfs/hackathon/users/eoseret/linpack... | 1 | 0 | 6.17 | 0 | 5.08 | 0 | 1.36 | -0 | 4.55 | 0 | 0.53 | 0.01 |
○bli_dpackm_struc_cxk | libblis-mt.so.5.0.0 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.18 | 0.14 | 0.06 | 0.05 | 0.03 | 0.02 | 0.18 | 0.14 | 0.06 | 0.05 | 0.03 | 0.02 | 0.14 | 0.08 | 0.04 | 0.02 | 0.01 | 0.01 | 0.14 | 0.08 | 0.04 | 0.02 | 0.01 | 0.01 | 8 | 16 | 32 | 64 | 128 | 192 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.03 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 10.97 | 19.70 | 45.02 | 85.33 | 135.73 | 205.33 | 1 | 0 | 0.94 | 0 | 0.94 | 0 | 0.8 | 0 | 0.66 | 0 | 0.68 | 0 | |||||||
○bli_dpackm_cxk | libblis-mt.so.5.0.0 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.18 | 0.11 | 0.08 | 0.06 | 0.03 | 0.02 | 0.18 | 0.11 | 0.08 | 0.06 | 0.03 | 0.02 | 0.14 | 0.08 | 0.04 | 0.02 | 0.01 | 0.01 | 0.14 | 0.08 | 0.04 | 0.02 | 0.01 | 0.01 | 8 | 16 | 32 | 64 | 128 | 192 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.04 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 10.10 | 19.44 | 42.56 | 85.25 | 146.59 | 232.13 | 1 | 0 | 0.89 | 0 | 0.85 | 0 | 0.84 | 0 | 0.72 | 0 | 0.83 | 0 | |||||||
○HPL_setran | xhpl | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.49 | 0.11 | 0.10 | 0.24 | 0.10 | 0.19 | 0.49 | 0.11 | 0.10 | 0.24 | 0.10 | 0.19 | 0.14 | 0.04 | 0.02 | 0.02 | 0.01 | 0.01 | 0.14 | 0.04 | 0.02 | 0.02 | 0.01 | 0.01 | 8 | 8 | 8 | 8 | 8 | 8 | 0.02 | 0.01 | 0.01 | 0.04 | 0.01 | 0.06 | 0.14 | 0.03 | 0.02 | 0.06 | 0.02 | 0.05 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 14.74 | 22.21 | 37.85 | 81.48 | 130.98 | 253.33 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /cluster/comp/aocc/5.0.0/bin/clang-17 -o HPL_setran.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS_REPORT -I /beegfs/hackathon/users/eoseret/linpa... | 1 | 0 | 1.62 | 0 | 1.63 | 0 | 1.12 | -0 | 1.29 | -0 | 0.92 | 0 |
○bli_dgemmsup_rv_zen4_asm_24x2m | libblis-mt.so.5.0.0 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.14 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.14 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.12 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.12 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 8 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | NA | NA | NA | NA | NA | 38.15 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | ||
○ucc_pq_st_is_empty | libucc.so.1.0.0 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.25 | 0.17 | 0.55 | 0.10 | 0.12 | 0.09 | 0.25 | 0.17 | 0.55 | 0.10 | 0.12 | 0.09 | 0.11 | 0.04 | 0.03 | 0.01 | 0.00 | 0.00 | 0.11 | 0.04 | 0.03 | 0.01 | 0.00 | 0.00 | 8 | 8 | 8 | 8 | 8 | 8 | 0.01 | 0.02 | 0.08 | 0.02 | 0.03 | 0.03 | 0.08 | 0.06 | 0.18 | 0.03 | 0.04 | 0.03 | 7.13 | 11.32 | 21.95 | 36.54 | 81.95 | 90.49 | 1 | 0 | 1.4 | -0 | 0.83 | 0 | 2.43 | 0 | 1.74 | -0 | 3.25 | -0 | |||||||
○bli_dgemmsup_rv_zen4_asm_24x6m | libblis-mt.so.5.0.0 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.12 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.10 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 8 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | NA | NA | NA | NA | NA | 42.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | ||
○bli_dpackm_blk_var1 | libblis-mt.so.5.0.0 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.09 | 0.11 | 0.06 | 0.05 | 0.04 | 0.03 | 0.09 | 0.11 | 0.06 | 0.05 | 0.04 | 0.03 | 0.08 | 0.06 | 0.04 | 0.03 | 0.01 | 0.01 | 0.08 | 0.06 | 0.04 | 0.03 | 0.01 | 0.01 | 8 | 16 | 32 | 64 | 128 | 192 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 16.33 | 25.34 | 49.62 | 78.88 | 150.92 | 218.97 | 1 | 0 | 0.68 | 0 | 0.53 | 0.01 | 0.35 | 0.01 | 0.33 | 0.01 | 0.29 | 0.01 | |||||||
○ucc_context_progress | libucc.so.1.0.0 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.13 | 0.10 | 0.28 | 0.09 | 0.09 | 0.06 | 0.13 | 0.10 | 0.28 | 0.09 | 0.09 | 0.06 | 0.08 | 0.02 | 0.02 | 0.01 | 0.00 | 0.00 | 0.08 | 0.02 | 0.02 | 0.01 | 0.00 | 0.00 | 8 | 8 | 8 | 8 | 8 | 8 | 0.01 | 0.01 | 0.04 | 0.02 | 0.02 | 0.02 | 0.05 | 0.03 | 0.09 | 0.03 | 0.03 | 0.02 | 7.28 | 14.75 | 22.00 | 26.83 | 68.87 | 78.88 | 1 | 0 | 1.55 | -0 | 1.09 | -0 | 1.83 | -0 | 1.53 | -0 | 3.06 | -0 | |||||||
○bli_dgemmsup_rv_zen5_asm_24x6m | libblis-mt.so.5.0.0 | 0.00 | 0.02 | 0.02 | 0.01 | 0.01 | 0.02 | 0.00 | 0.02 | 0.02 | 0.01 | 0.01 | 0.02 | 0.00 | 0.11 | 0.08 | 0.05 | 0.04 | 0.04 | 0.00 | 0.11 | 0.08 | 0.05 | 0.04 | 0.04 | 0.00 | 0.07 | 0.05 | 0.02 | 0.01 | 0.02 | 0.00 | 0.07 | 0.05 | 0.02 | 0.01 | 0.02 | 0 | 16 | 32 | 64 | 128 | 192 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | NA | 0.00 | 56.92 | 76.31 | 167.80 | 246.42 | 209.19 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | ||||||
○__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*) | libomp.so | 0.00 | 2.00 | 4.44 | 4.62 | 11.35 | 11.90 | 0.00 | 2.00 | 4.44 | 4.62 | 11.35 | 11.90 | 0.00 | 20.64 | 27.11 | 15.87 | 26.23 | 16.92 | 0.00 | 20.64 | 27.11 | 15.87 | 26.23 | 16.92 | 0.00 | 8.19 | 10.74 | 7.31 | 12.29 | 10.28 | 0.00 | 8.19 | 10.74 | 7.31 | 12.29 | 10.28 | 0 | 12 | 29 | 64 | 128 | 192 | 0.00 | 2.19 | 3.48 | 3.12 | 8.72 | 5.45 | 0.00 | 8.46 | 7.57 | 4.32 | 6.95 | 3.65 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○__kmp_hyper_barrier_gather(barrier_type, kmp_info*, int, int, void (*)(void*, void*), void*) | libomp.so | 0.00 | 0.03 | 0.01 | 0.01 | 0.02 | 0.05 | 0.00 | 0.03 | 0.01 | 0.01 | 0.02 | 0.05 | 0.00 | 0.48 | 0.18 | 0.16 | 0.49 | 0.62 | 0.00 | 0.48 | 0.18 | 0.16 | 0.49 | 0.62 | 0.00 | 0.10 | 0.03 | 0.02 | 0.02 | 0.05 | 0.00 | 0.10 | 0.03 | 0.02 | 0.02 | 0.05 | 0 | 10 | 16 | 28 | 38 | 65 | 0.00 | 0.05 | 0.03 | 0.04 | 0.09 | 0.18 | 0.00 | 0.20 | 0.06 | 0.05 | 0.09 | 0.15 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.01 | 0.30 | 0.54 | 0.51 | 0.77 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○bli_dgemmsup_rv_zen5_asm_24x2m | libblis-mt.so.5.0.0 | 0.00 | 0.02 | 0.02 | 0.02 | 0.04 | 0.06 | 0.00 | 0.02 | 0.02 | 0.00 | 0.04 | 0.04 | 0.00 | 0.11 | 0.12 | 0.06 | 0.08 | 0.10 | 0.00 | 0.11 | 0.12 | 0.00 | 0.08 | 0.75 | 0.00 | 0.08 | 0.06 | 0.03 | 0.04 | 0.05 | 0.00 | 0.08 | 0.06 | 0.00 | 0.04 | 0.03 | 0 | 16 | 32 | 64 | 128 | 192 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.03 | 0.00 | 0.01 | 0.02 | 0.01 | 0.02 | 0.02 | NA | 0.00 | 49.62 | 69.19 | 110.20 | 99.61 | 74.08 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | ||||||
○__kmp_hardware_timestamp | libomp.so | 0.00 | 1.61 | 4.37 | 8.90 | 14.97 | 19.27 | 0.00 | 1.61 | 4.37 | 8.90 | 14.97 | 19.27 | 0.00 | 19.45 | 29.25 | 19.29 | 27.14 | 18.23 | 0.00 | 19.45 | 29.25 | 19.29 | 27.14 | 18.23 | 0.00 | 6.61 | 10.57 | 14.07 | 16.20 | 16.65 | 0.00 | 6.61 | 10.57 | 14.07 | 16.20 | 16.65 | 0 | 16 | 32 | 64 | 128 | 192 | 0.00 | 1.79 | 3.68 | 4.31 | 6.46 | 6.33 | 0.00 | 6.89 | 7.94 | 5.96 | 5.48 | 4.23 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |