Run OMP1 | Number processes: 4Number nodes: 1Number processes per node: 4Run Command: <executable>MPI Command: mpirun -n <number_processes> --bind-to core --map-by ppr:<number_processes>:node:PE=<OMP_NUM_THREADS>Dataset: Run Directory: .OMP_PROC_BIND: closeOMP_PLACES: coresOMP_NUM_THREADS: 1 |
---|---|
Run OMP2 | Number processes: 4Number nodes: 1Number processes per node: 4Run Command: <executable>MPI Command: mpirun -n <number_processes> --bind-to core --map-by ppr:<number_processes>:node:PE=<OMP_NUM_THREADS>Dataset: Run Directory: .OMP_NUM_THREADS: 2OMP_PROC_BIND: closeOMP_PLACES: cores |
Run OMP4 | Number processes: 4Number nodes: 1Number processes per node: 4Run Command: <executable>MPI Command: mpirun -n <number_processes> --bind-to core --map-by ppr:<number_processes>:node:PE=<OMP_NUM_THREADS>Dataset: Run Directory: .OMP_NUM_THREADS: 4OMP_PROC_BIND: closeOMP_PLACES: cores |
Run OMP8 | Number processes: 4Number nodes: 1Number processes per node: 4Run Command: <executable>MPI Command: mpirun -n <number_processes> --bind-to core --map-by ppr:<number_processes>:node:PE=<OMP_NUM_THREADS>Dataset: Run Directory: .OMP_NUM_THREADS: 8OMP_PROC_BIND: closeOMP_PLACES: cores |
Run OMP16 | Number processes: 4Number nodes: 1Number processes per node: 4Run Command: <executable>MPI Command: mpirun -n <number_processes> --bind-to core --map-by ppr:<number_processes>:node:PE=<OMP_NUM_THREADS>Dataset: Run Directory: .OMP_NUM_THREADS: 16OMP_PROC_BIND: closeOMP_PLACES: cores |
Run OMP24 | Number processes: 4Number nodes: 1Number processes per node: 4Run Command: <executable>MPI Command: mpirun -n <number_processes> --bind-to core --map-by ppr:<number_processes>:node:PE=<OMP_NUM_THREADS>Dataset: Run Directory: .OMP_NUM_THREADS: 24OMP_PROC_BIND: closeOMP_PLACES: cores |
Name | Module | Coverage OMP1 (%) | Coverage OMP2 (%) | Coverage OMP4 (%) | Coverage OMP8 (%) | Coverage OMP16 (%) | Coverage OMP24 (%) | Coverage Excluding Loops OMP1 (%) | Coverage Excluding Loops OMP2 (%) | Coverage Excluding Loops OMP4 (%) | Coverage Excluding Loops OMP8 (%) | Coverage Excluding Loops OMP16 (%) | Coverage Excluding Loops OMP24 (%) | Max Inclusive Time Over Threads OMP1 (s) | Max Inclusive Time Over Threads OMP2 (s) | Max Inclusive Time Over Threads OMP4 (s) | Max Inclusive Time Over Threads OMP8 (s) | Max Inclusive Time Over Threads OMP16 (s) | Max Inclusive Time Over Threads OMP24 (s) | Max Exclusive Time Over Threads OMP1 (s) | Max Exclusive Time Over Threads OMP2 (s) | Max Exclusive Time Over Threads OMP4 (s) | Max Exclusive Time Over Threads OMP8 (s) | Max Exclusive Time Over Threads OMP16 (s) | Max Exclusive Time Over Threads OMP24 (s) | Inclusive Time w.r.t. Wall Time OMP1 (s) | Inclusive Time w.r.t. Wall Time OMP2 (s) | Inclusive Time w.r.t. Wall Time OMP4 (s) | Inclusive Time w.r.t. Wall Time OMP8 (s) | Inclusive Time w.r.t. Wall Time OMP16 (s) | Inclusive Time w.r.t. Wall Time OMP24 (s) | Exclusive Time w.r.t. Wall Time OMP1 (s) | Exclusive Time w.r.t. Wall Time OMP2 (s) | Exclusive Time w.r.t. Wall Time OMP4 (s) | Exclusive Time w.r.t. Wall Time OMP8 (s) | Exclusive Time w.r.t. Wall Time OMP16 (s) | Exclusive Time w.r.t. Wall Time OMP24 (s) | Nb Threads OMP1 | Nb Threads OMP2 | Nb Threads OMP4 | Nb Threads OMP8 | Nb Threads OMP16 | Nb Threads OMP24 | Deviation (coverage) OMP1 | Deviation (coverage) OMP2 | Deviation (coverage) OMP4 | Deviation (coverage) OMP8 | Deviation (coverage) OMP16 | Deviation (coverage) OMP24 | Deviation (walltime) OMP1 | Deviation (walltime) OMP2 | Deviation (walltime) OMP4 | Deviation (walltime) OMP8 | Deviation (walltime) OMP16 | Deviation (walltime) OMP24 | Categories OMP1 | Categories OMP2 | Categories OMP4 | Categories OMP8 | Categories OMP16 | Categories OMP24 | GFLOPS OMP1 | GFLOPS OMP2 | GFLOPS OMP4 | GFLOPS OMP8 | GFLOPS OMP16 | GFLOPS OMP24 | Compilation Options | (OMP1) Efficiency | (OMP1) Potential Speed-Up (%) | (OMP2) Efficiency | (OMP2) Potential Speed-Up (%) | (OMP4) Efficiency | (OMP4) Potential Speed-Up (%) | (OMP8) Efficiency | (OMP8) Potential Speed-Up (%) | (OMP16) Efficiency | (OMP16) Potential Speed-Up (%) | (OMP24) Efficiency | (OMP24) Potential Speed-Up (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
○dgemm_vanilla_big | libarmpl_lp64_mp.so | 95.73 | 95.81 | 95.56 | 94.17 | 91.71 | 89.29 | 95.73 | 95.81 | 95.56 | 94.17 | 91.71 | 89.29 | 4512.92 | 2502.82 | 1478.91 | 831.46 | 483.30 | 364.66 | 4512.92 | 2502.82 | 1478.91 | 831.46 | 483.30 | 364.66 | 4510.85 | 2553.01 | 1547.45 | 908.10 | 562.47 | 439.52 | 4510.85 | 2553.01 | 1547.45 | 908.10 | 562.47 | 439.52 | 4 | 8 | 16 | 32 | 64 | 96 | 0.05 | 2.09 | 2.66 | 3.21 | 3.47 | 3.52 | 2.60 | 0.72 | 1.84 | 1.48 | 0.97 | 1.40 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 10.21 | 19.98 | 38.90 | 74.50 | 140.21 | 161.28 | 1 | 0 | 0.88 | 11.17 | 0.73 | 25.92 | 0.62 | 35.7 | 0.5 | 45.74 | 0.43 | 51.11 | |
○HPL_ladd | xhpl | 1.14 | 1.03 | 0.87 | 0.76 | 0.64 | 0.56 | 1.14 | 1.03 | 0.00 | 0.76 | 0.00 | 0.56 | 54.33 | 53.76 | 54.09 | 54.17 | 53.73 | 53.97 | 54.33 | 53.76 | 0.02 | 54.17 | 0.00 | 53.97 | 53.70 | 27.32 | 14.04 | 7.34 | 3.90 | 2.73 | 53.70 | 27.32 | 0.00 | 7.34 | 0.00 | 2.73 | 4 | 4 | 4 | 4 | 4 | 4 | 0.01 | 0.01 | 0.03 | 0.06 | 0.05 | 0.06 | 0.46 | 0.21 | 0.55 | 0.54 | 0.28 | 0.30 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.01 | 0.03 | 0.05 | 0.10 | 0.18 | 0.26 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -o HPL_ladd.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS_... | 1 | 0 | 0.98 | 0.02 | 0.96 | 0.04 | 0.91 | 0.07 | 0.86 | 0.09 | 0.82 | 0.1 |
○HPL_setran | xhpl | 0.58 | 0.52 | 0.44 | 0.39 | 0.33 | 0.28 | 0.58 | 0.52 | 0.44 | 0.39 | 0.33 | 0.28 | 27.62 | 27.75 | 27.41 | 27.37 | 27.83 | 27.68 | 27.62 | 27.75 | 27.41 | 27.37 | 27.83 | 27.68 | 27.26 | 13.95 | 7.14 | 3.72 | 2.01 | 1.40 | 27.26 | 13.95 | 7.14 | 3.72 | 2.01 | 1.40 | 4 | 4 | 4 | 4 | 4 | 4 | 0.01 | 0.01 | 0.02 | 0.03 | 0.03 | 0.06 | 0.36 | 0.29 | 0.26 | 0.26 | 0.20 | 0.27 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 2.65 | 5.16 | 10.08 | 19.75 | 35.69 | 51.20 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -o HPL_setran.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRES... | 1 | 0 | 0.98 | 0.01 | 0.95 | 0.02 | 0.92 | 0.03 | 0.85 | 0.05 | 0.81 | 0.05 |
○t_interleave_kernel_d8 | libarmpl_lp64_mp.so | 0.49 | 0.45 | 0.40 | 0.36 | 0.35 | 0.38 | 0.49 | 0.45 | 0.40 | 0.36 | 0.35 | 0.38 | 24.32 | 12.41 | 6.96 | 3.68 | 2.17 | 1.79 | 24.32 | 12.41 | 6.96 | 3.68 | 2.17 | 1.79 | 23.25 | 12.05 | 6.40 | 3.45 | 2.14 | 1.86 | 23.25 | 12.05 | 6.40 | 3.45 | 2.14 | 1.86 | 4 | 8 | 16 | 32 | 64 | 96 | 0.02 | 0.02 | 0.02 | 0.03 | 0.03 | 0.03 | 1.06 | 0.40 | 0.39 | 0.29 | 0.16 | 0.12 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.04 | 0.08 | 1 | 0 | 0.96 | 0.02 | 0.91 | 0.04 | 0.84 | 0.06 | 0.68 | 0.11 | 0.52 | 0.18 | |
○mca_btl_vader_component_progress | mca_btl_vader.so | 0.41 | 0.24 | 0.15 | 0.11 | 0.09 | 0.07 | 0.41 | 0.24 | 0.15 | 0.11 | 0.09 | 0.07 | 23.85 | 15.20 | 10.88 | 8.82 | 8.83 | 7.87 | 23.85 | 15.20 | 10.88 | 8.82 | 8.83 | 7.87 | 19.43 | 6.48 | 2.42 | 1.03 | 0.54 | 0.36 | 19.43 | 6.48 | 2.42 | 1.03 | 0.54 | 0.36 | 4 | 4 | 4 | 4 | 4 | 4 | 0.08 | 0.08 | 0.08 | 0.12 | 0.21 | 0.19 | 3.61 | 2.06 | 1.35 | 1.20 | 1.30 | 0.94 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.5 | 0 | 2.01 | 0 | 2.37 | 0 | 2.27 | 0 | 2.25 | 0 | |
○mca_pml_ob1_recv_req_start | mca_pml_ob1.so | 0.20 | 0.10 | 0.05 | 0.03 | 0.02 | 0.02 | 0.20 | 0.00 | 0.05 | 0.03 | 0.02 | 0.00 | 11.93 | 6.28 | 4.63 | 3.34 | 2.87 | 2.29 | 11.93 | 0.00 | 4.63 | 3.34 | 2.87 | 0.00 | 9.41 | 2.63 | 0.86 | 0.33 | 0.14 | 0.08 | 9.41 | 0.00 | 0.86 | 0.33 | 0.14 | 0.00 | 4 | 4 | 4 | 4 | 4 | 4 | 0.04 | 0.05 | 0.07 | 0.09 | 0.13 | 0.10 | 2.08 | 1.28 | 1.21 | 0.87 | 0.79 | 0.51 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.79 | 0 | 2.72 | 0 | 3.6 | 0 | 4.3 | 0 | 4.67 | 0 | |
○interleave_3vl_sve_kernel_d | libarmpl_lp64_mp.so | 0.17 | 0.15 | 0.12 | 0.10 | 0.09 | 0.09 | 0.17 | 0.15 | 0.12 | 0.10 | 0.09 | 0.09 | 7.95 | 4.13 | 2.06 | 1.09 | 0.53 | 0.48 | 7.95 | 4.13 | 2.06 | 1.09 | 0.53 | 0.48 | 7.82 | 4.00 | 1.94 | 0.96 | 0.52 | 0.42 | 7.82 | 4.00 | 1.94 | 0.96 | 0.52 | 0.42 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.13 | 0.14 | 0.11 | 0.09 | 0.04 | 0.04 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.98 | 0 | 1.01 | 0 | 1.02 | 0 | 0.93 | 0.01 | 0.78 | 0.02 | |
○opal_progress | libopen-pal.so.40.30.2 | 0.15 | 0.09 | 0.06 | 0.04 | 0.03 | 0.03 | 0.15 | 0.09 | 0.06 | 0.04 | 0.03 | 0.03 | 9.36 | 5.80 | 3.97 | 3.23 | 3.02 | 3.02 | 9.36 | 5.80 | 3.97 | 3.23 | 3.02 | 3.02 | 7.19 | 2.45 | 0.90 | 0.41 | 0.19 | 0.12 | 7.19 | 2.45 | 0.90 | 0.41 | 0.19 | 0.12 | 4 | 4 | 4 | 4 | 4 | 4 | 0.04 | 0.04 | 0.03 | 0.02 | 0.05 | 0.12 | 1.73 | 0.94 | 0.43 | 0.17 | 0.28 | 0.57 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.47 | 0 | 2.01 | 0 | 2.2 | 0 | 2.35 | 0 | 2.4 | 0 | |
►HPL_dlaswp01N | xhpl | 0.13 | 0.11 | 0.09 | 0.08 | 0.07 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 5.98 | 5.94 | 6.06 | 5.92 | 6.14 | 6.13 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 5.90 | 2.95 | 1.54 | 0.80 | 0.44 | 0.31 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.01 | 0.01 | 0.01 | 0.03 | 0.03 | 0.08 | 0.19 | 0.18 | 0.09 | 0.17 | 0.14 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -o HPL_dlaswp01N.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROG... | 1 | 0 | 1 | 0 | 0.96 | 0 | 0.92 | 0.01 | 0.85 | 0.01 | 0.8 | 0.01 |
►Loop 302 - HPL_dlaswp01N.c:158-191 - xhpl | 0.13 | 0.11 | 0.09 | 0.08 | 0.07 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 5.98 | 5.94 | 6.06 | 5.92 | 6.14 | 6.13 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 5.90 | 2.95 | 1.54 | 0.80 | 0.44 | 0.31 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 301 - HPL_dlaswp01N.c:160-191 - xhpl | 0.13 | 0.11 | 0.09 | 0.08 | 0.07 | 0.06 | 0.13 | 0.11 | 0.09 | 0.08 | 0.07 | 0.06 | 5.98 | 5.94 | 6.06 | 5.92 | 6.14 | 6.13 | 5.98 | 5.94 | 6.06 | 5.92 | 6.14 | 6.13 | 5.90 | 2.95 | 1.54 | 0.80 | 0.44 | 0.31 | 5.90 | 2.95 | 1.54 | 0.80 | 0.44 | 0.31 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.01 | 0.01 | 0.01 | 0.03 | 0.03 | 0.08 | 0.19 | 0.18 | 0.09 | 0.17 | 0.14 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 0.96 | 0 | 0.92 | 0.01 | 0.85 | 0.01 | 0.8 | 0.01 | ||||||||
►Loop 298 - HPL_dlaswp01N.c:198-203 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 300 - HPL_dlaswp01N.c:203-203 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 299 - HPL_dlaswp01N.c:203-203 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 1 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○HPL_lmul | xhpl | 0.12 | 0.11 | 0.09 | 0.08 | 0.07 | 0.06 | 0.12 | 0.11 | 0.09 | 0.08 | 0.07 | 0.06 | 5.66 | 5.62 | 5.87 | 5.76 | 5.67 | 5.73 | 5.66 | 5.62 | 5.87 | 5.76 | 5.67 | 5.73 | 5.52 | 2.83 | 1.46 | 0.77 | 0.41 | 0.28 | 5.52 | 2.83 | 1.46 | 0.77 | 0.41 | 0.28 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.02 | 0.02 | 0.01 | 0.03 | 0.10 | 0.13 | 0.24 | 0.22 | 0.07 | 0.15 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 15.72 | 30.80 | 59.86 | 111.33 | 213.27 | 310.97 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -o HPL_lmul.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS_... | 1 | 0 | 0.97 | 0 | 0.95 | 0 | 0.9 | 0.01 | 0.84 | 0.01 | 0.82 | 0.01 |
►HPL_dlaswp04N | xhpl | 0.09 | 0.08 | 0.07 | 0.06 | 0.05 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 4.24 | 4.23 | 4.29 | 4.42 | 4.45 | 4.52 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 4.16 | 2.13 | 1.11 | 0.60 | 0.32 | 0.23 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.07 | 0.07 | 0.07 | 0.06 | 0.04 | 0.08 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -o HPL_dlaswp04N.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROG... | 1 | 0 | 0.98 | 0 | 0.94 | 0 | 0.87 | 0.01 | 0.81 | 0.01 | 0.77 | 0.01 |
►Loop 331 - HPL_dlaswp04N.c:275-279 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 330 - HPL_dlaswp04N.c:279-279 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 332 - HPL_dlaswp04N.c:279-279 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 333 - HPL_dlaswp04N.c:267-273 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 335 - HPL_dlaswp04N.c:267-273 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 334 - HPL_dlaswp04N.c:272-273 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 329 - HPL_dlaswp04N.c:272-273 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 336 - HPL_dlaswp04N.c:177-260 - xhpl | 0.09 | 0.08 | 0.07 | 0.06 | 0.05 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 4.24 | 4.23 | 4.29 | 4.42 | 4.44 | 4.52 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 4.16 | 2.13 | 1.11 | 0.60 | 0.32 | 0.23 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 338 - HPL_dlaswp04N.c:180-226 - xhpl | 0.09 | 0.08 | 0.07 | 0.06 | 0.05 | 0.05 | 0.09 | 0.08 | 0.07 | 0.06 | 0.05 | 0.05 | 4.24 | 4.23 | 4.29 | 4.42 | 4.44 | 4.52 | 4.24 | 4.23 | 4.29 | 4.42 | 4.44 | 4.52 | 4.16 | 2.13 | 1.11 | 0.60 | 0.32 | 0.23 | 4.16 | 2.13 | 1.11 | 0.60 | 0.32 | 0.23 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.07 | 0.07 | 0.07 | 0.06 | 0.04 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.98 | 0 | 0.94 | 0 | 0.87 | 0.01 | 0.81 | 0.01 | 0.77 | 0.01 | ||||||||
○Loop 337 - HPL_dlaswp04N.c:230-260 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○mca_pml_ob1_iprobe | mca_pml_ob1.so | 0.08 | 0.04 | 0.02 | 0.01 | 0.01 | 0.01 | 0.08 | 0.00 | 0.02 | 0.01 | 0.01 | 0.00 | 5.04 | 2.64 | 1.83 | 1.34 | 1.09 | 0.88 | 5.04 | 0.00 | 1.83 | 1.34 | 1.09 | 0.00 | 3.94 | 1.12 | 0.34 | 0.14 | 0.06 | 0.03 | 3.94 | 0.00 | 0.34 | 0.14 | 0.06 | 0.00 | 4 | 4 | 4 | 4 | 4 | 4 | 0.02 | 0.01 | 0.03 | 0.04 | 0.04 | 0.05 | 0.80 | 0.36 | 0.47 | 0.35 | 0.26 | 0.22 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.75 | 0 | 2.88 | 0 | 3.52 | 0 | 4.12 | 0 | 5.08 | 0 | |
►HPL_dlaswp02N | xhpl | 0.08 | 0.07 | 0.06 | 0.05 | 0.04 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3.92 | 3.90 | 3.80 | 3.85 | 3.79 | 3.86 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3.84 | 1.91 | 0.98 | 0.52 | 0.27 | 0.19 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.07 | 0.12 | 0.05 | 0.10 | 0.07 | 0.10 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -o HPL_dlaswp02N.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROG... | 1 | 0 | 1.01 | 0 | 0.98 | 0 | 0.93 | 0 | 0.88 | 0.01 | 0.84 | 0.01 |
○Loop 317 - HPL_dlaswp02N.c:151-152 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 2 | 1 | 0 | 1 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 318 - HPL_dlaswp02N.c:151-152 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 316 - HPL_dlaswp02N.c:157-189 - xhpl | 0.08 | 0.07 | 0.06 | 0.05 | 0.04 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3.92 | 3.89 | 3.80 | 3.85 | 3.77 | 3.86 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3.84 | 1.90 | 0.98 | 0.52 | 0.27 | 0.19 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 315 - HPL_dlaswp02N.c:160-189 - xhpl | 0.08 | 0.07 | 0.06 | 0.05 | 0.04 | 0.04 | 0.08 | 0.07 | 0.06 | 0.05 | 0.04 | 0.04 | 3.92 | 3.89 | 3.80 | 3.85 | 3.77 | 3.86 | 3.92 | 3.89 | 3.80 | 3.85 | 3.77 | 3.86 | 3.84 | 1.90 | 0.98 | 0.52 | 0.27 | 0.19 | 3.84 | 1.90 | 0.98 | 0.52 | 0.27 | 0.19 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.07 | 0.12 | 0.05 | 0.10 | 0.07 | 0.10 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.01 | 0 | 0.98 | 0 | 0.93 | 0 | 0.88 | 0.01 | 0.84 | 0.01 | ||||||||
►Loop 312 - HPL_dlaswp02N.c:196-199 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 314 - HPL_dlaswp02N.c:196-199 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 313 - HPL_dlaswp02N.c:199-199 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 311 - HPL_dlaswp02N.c:199-199 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○HPL_rand | xhpl | 0.07 | 0.06 | 0.06 | 0.05 | 0.04 | 0.04 | 0.07 | 0.06 | 0.06 | 0.05 | 0.04 | 0.04 | 3.41 | 3.51 | 3.49 | 3.54 | 3.47 | 3.51 | 3.41 | 3.51 | 3.49 | 3.54 | 3.47 | 3.51 | 3.28 | 1.73 | 0.89 | 0.47 | 0.24 | 0.17 | 3.28 | 1.73 | 0.89 | 0.47 | 0.24 | 0.17 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.01 | 0.02 | 0.02 | 0.14 | 0.10 | 0.08 | 0.11 | 0.10 | 0.08 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.05 | 0.06 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -o HPL_rand.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS_... | 1 | 0 | 0.95 | 0 | 0.92 | 0 | 0.88 | 0.01 | 0.84 | 0.01 | 0.79 | 0.01 |
○void armpl::clag::(anonymous namespace)::trsm_kernel<double, true, true, true, false, false>(double const*, long, long, double*, long, long, long, long) | libarmpl_lp64_mp.so | 0.07 | 0.06 | 0.04 | 0.03 | 0.02 | 0.02 | 0.07 | 0.06 | 0.04 | 0.03 | 0.02 | 0.02 | 3.25 | 1.59 | 0.71 | 0.34 | 0.20 | 0.15 | 3.25 | 1.59 | 0.71 | 0.34 | 0.20 | 0.15 | 3.20 | 1.51 | 0.64 | 0.29 | 0.15 | 0.10 | 3.20 | 1.51 | 0.64 | 0.29 | 0.15 | 0.10 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.10 | 0.08 | 0.05 | 0.04 | 0.03 | 0.02 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 2.93 | 5.70 | 12.28 | 30.63 | 70.20 | 106.62 | 1 | 0 | 1.06 | 0 | 1.25 | 0 | 1.36 | 0 | 1.35 | 0 | 1.33 | 0 | |
►HPL_pdlange | xhpl | 0.04 | 0.03 | 0.03 | 0.02 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.77 | 1.60 | 1.59 | 1.58 | 1.58 | 1.59 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.75 | 0.81 | 0.42 | 0.22 | 0.12 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 10.11 | 19.72 | 38.44 | 73.65 | 137.60 | 157.99 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -o HPL_pdlange.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRE... | 1 | 0 | 1.08 | 0 | 1.05 | 0 | 1.01 | 0 | 0.95 | 0 | 0.9 | 0 |
►Loop 81 - HPL_pdlange.c:208-212 - xhpl | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.92 | 0.84 | 0.85 | 0.84 | 0.85 | 0.84 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.91 | 0.43 | 0.22 | 0.12 | 0.06 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 76 - HPL_pdlange.c:210-211 - xhpl | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.92 | 0.84 | 0.85 | 0.84 | 0.85 | 0.84 | 0.92 | 0.84 | 0.85 | 0.84 | 0.85 | 0.84 | 0.91 | 0.43 | 0.22 | 0.12 | 0.06 | 0.04 | 0.91 | 0.43 | 0.22 | 0.12 | 0.06 | 0.04 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 10.10 | 19.71 | 38.34 | 73.48 | 137.58 | 157.64 | 1 | 0 | 1.06 | 0 | 1.03 | 0 | 0.99 | 0 | 0.92 | 0 | 0.89 | 0 | ||||||||
►Loop 78 - HPL_pdlange.c:208-212 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 1 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 79 - HPL_pdlange.c:210-211 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 80 - HPL_pdlange.c:210-211 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 1 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 77 - HPL_pdlange.c:191-242 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 86 - HPL_pdlange.c:149-153 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 87 - HPL_pdlange.c:151-152 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 83 - HPL_pdlange.c:170-174 - xhpl | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.85 | 0.75 | 0.74 | 0.74 | 0.74 | 0.75 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.84 | 0.38 | 0.19 | 0.10 | 0.05 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 1 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 82 - HPL_pdlange.c:173-173 - xhpl | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.85 | 0.75 | 0.74 | 0.74 | 0.74 | 0.75 | 0.85 | 0.75 | 0.74 | 0.74 | 0.74 | 0.75 | 0.84 | 0.38 | 0.19 | 0.10 | 0.05 | 0.04 | 0.84 | 0.38 | 0.19 | 0.10 | 0.05 | 0.04 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 10.11 | 19.73 | 38.55 | 73.79 | 137.83 | 158.39 | 1 | 0 | 1.11 | 0 | 1.07 | 0 | 1.04 | 0 | 0.98 | 0 | 0.92 | 0 | ||||||||
►Loop 85 - HPL_pdlange.c:170-174 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 1 | 1 | 1 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 84 - HPL_pdlange.c:173-173 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○MPI_Iprobe | libmpi.so.40.30.2 | 0.03 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 0.03 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 2.19 | 1.17 | 0.78 | 0.70 | 0.42 | 0.40 | 2.19 | 1.17 | 0.78 | 0.70 | 0.42 | 0.40 | 1.64 | 0.49 | 0.15 | 0.07 | 0.02 | 0.01 | 1.64 | 0.49 | 0.15 | 0.07 | 0.02 | 0.01 | 4 | 4 | 4 | 4 | 4 | 4 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.38 | 0.17 | 0.20 | 0.20 | 0.10 | 0.10 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.68 | 0 | 2.67 | 0 | 3.13 | 0 | 4.75 | 0 | 4.59 | 0 | |
○opal_timer_linux_get_cycles_sys_timer | libopen-pal.so.40.30.2 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 2.02 | 1.25 | 0.82 | 0.73 | 0.71 | 0.68 | 2.02 | 1.25 | 0.82 | 0.73 | 0.71 | 0.68 | 1.64 | 0.54 | 0.19 | 0.09 | 0.04 | 0.03 | 1.64 | 0.54 | 0.19 | 0.09 | 0.04 | 0.03 | 4 | 4 | 4 | 4 | 4 | 4 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.38 | 0.16 | 0.08 | 0.07 | 0.08 | 0.06 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.51 | 0 | 2.12 | 0 | 2.25 | 0 | 2.35 | 0 | 2.21 | 0 | |
○opal_convertor_prepare_for_recv | libopen-pal.so.40.30.2 | 0.03 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.03 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 1.97 | 0.98 | 0.65 | 0.47 | 0.33 | 0.41 | 1.97 | 0.98 | 0.65 | 0.47 | 0.33 | 0.41 | 1.54 | 0.40 | 0.13 | 0.05 | 0.02 | 0.01 | 1.54 | 0.40 | 0.13 | 0.05 | 0.02 | 0.01 | 4 | 4 | 4 | 4 | 4 | 4 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.33 | 0.21 | 0.18 | 0.12 | 0.09 | 0.10 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.93 | 0 | 3.02 | 0 | 3.93 | 0 | 5.17 | 0 | 4.4 | 0 | |
○ompi_coll_libnbc_progress | mca_coll_libnbc.so | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 1.86 | 1.12 | 0.86 | 0.99 | 0.91 | 0.66 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.66 | 1.49 | 0.47 | 0.19 | 0.09 | 0.05 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.03 | 4 | 4 | 4 | 4 | 4 | 4 | 0.01 | 0.01 | 0.01 | 0.02 | 0.03 | 0.02 | 0.40 | 0.16 | 0.10 | 0.21 | 0.18 | 0.10 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.59 | 0 | 1.94 | 0 | 2 | 0 | 1.91 | 0 | 2.15 | 0 | |
►HPL_bcast_1ring | xhpl | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 1.70 | 0.85 | 0.61 | 0.40 | 0.30 | 0.32 | 1.18 | 0.63 | 0.41 | 0.27 | 0.20 | 0.21 | 1.26 | 0.35 | 0.11 | 0.04 | 0.02 | 0.01 | 0.85 | 0.24 | 0.08 | 0.03 | 0.01 | 0.01 | 4 | 4 | 4 | 4 | 4 | 4 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.02 | 0.30 | 0.13 | 0.17 | 0.09 | 0.06 | 0.08 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -o HPL_1ring.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS... | 1 | 0 | 1.78 | 0 | 2.75 | 0 | 3.92 | 0 | 4.55 | 0 | 5.07 | 0 |
○Loop 269 - HPL_1ring.c:152-184 - xhpl [...] | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.52 | 0.25 | 0.20 | 0.12 | 0.10 | 0.11 | 0.52 | 0.25 | 0.20 | 0.12 | 0.10 | 0.11 | 0.41 | 0.11 | 0.04 | 0.01 | 0.01 | 0.00 | 0.41 | 0.11 | 0.04 | 0.01 | 0.01 | 0.00 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.08 | 0.03 | 0.05 | 0.03 | 0.03 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.86 | 0 | 2.68 | 0 | 4.13 | 0 | 5.08 | 0 | 5.33 | 0 | ||||||||
○__GI___pthread_mutex_init | libc.so.6 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 1.59 | 0.96 | 0.55 | 0.43 | 0.33 | 0.32 | 1.59 | 0.96 | 0.55 | 0.43 | 0.33 | 0.32 | 1.24 | 0.38 | 0.10 | 0.05 | 0.02 | 0.01 | 1.24 | 0.38 | 0.10 | 0.05 | 0.02 | 0.01 | 4 | 4 | 4 | 4 | 4 | 4 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.24 | 0.20 | 0.16 | 0.12 | 0.07 | 0.08 | MPI (%): 100.00 Pthread (%): 0.00 | MPI (%): 100.00 | MPI (%): 100.00 Pthread (%): 0.00 | MPI (%): 100.00 | MPI (%): 100.00 Pthread (%): 0.00 | MPI (%): 100.00 Pthread (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.65 | 0 | 3.01 | 0 | 3.29 | 0 | 4.39 | 0 | 4.54 | 0 | |
○mca_pml_ob1_recv_request_construct | mca_pml_ob1.so | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 1.62 | 0.80 | 0.50 | 0.46 | 0.28 | 0.23 | 1.62 | 0.80 | 0.50 | 0.46 | 0.28 | 0.23 | 1.23 | 0.36 | 0.10 | 0.04 | 0.02 | 0.01 | 1.23 | 0.36 | 0.10 | 0.04 | 0.02 | 0.01 | 4 | 4 | 4 | 4 | 4 | 4 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.28 | 0.10 | 0.14 | 0.13 | 0.07 | 0.04 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.69 | 0 | 3.05 | 0 | 3.51 | 0 | 4.94 | 0 | 5.1 | 0 | |
○dgemv_n_sve_kernel | libarmpl_lp64_mp.so | 0.03 | 0.03 | 0.03 | 0.03 | 0.05 | 0.06 | 0.03 | 0.03 | 0.03 | 0.03 | 0.05 | 0.06 | 1.26 | 0.82 | 0.47 | 0.33 | 0.28 | 0.26 | 1.26 | 0.82 | 0.47 | 0.33 | 0.28 | 0.26 | 1.20 | 0.73 | 0.43 | 0.33 | 0.30 | 0.28 | 1.20 | 0.73 | 0.43 | 0.33 | 0.30 | 0.28 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.07 | 0.09 | 0.04 | 0.02 | 0.01 | 0.01 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 11.04 | 21.86 | 43.73 | 81.12 | 100.43 | 105.33 | 1 | 0 | 0.82 | 0 | 0.7 | 0.01 | 0.45 | 0.02 | 0.25 | 0.04 | 0.17 | 0.05 | |
►HPL_pdmatgen | xhpl | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.18 | 1.26 | 1.17 | 1.24 | 1.10 | 1.21 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.14 | 0.58 | 0.29 | 0.16 | 0.08 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.04 | 0.09 | 0.05 | 0.07 | 0.04 | 0.07 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.01 | 0.03 | 0.07 | 0.04 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -o HPL_pdmatgen.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGR... | 1 | 0 | 0.99 | 0 | 0.98 | 0 | 0.89 | 0 | 0.93 | 0 | 0.83 | 0 |
►Loop 128 - HPL_pdmatgen.c:173-198 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 129 - HPL_pdmatgen.c:176-188 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 131 - HPL_pdmatgen.c:173-193 - xhpl | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.20 | 1.27 | 1.17 | 1.25 | 1.10 | 1.22 | 0.02 | 0.01 | 0.01 | 0.01 | 0.00 | 0.02 | 1.14 | 0.58 | 0.29 | 0.16 | 0.08 | 0.06 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3 | 2 | 4 | 1 | 2 | 3 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 130 - HPL_pdmatgen.c:173-193 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 132 - HPL_pdmatgen.c:181-181 - xhpl | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 1.18 | 1.26 | 1.15 | 1.24 | 1.10 | 1.20 | 1.18 | 1.26 | 1.15 | 1.24 | 1.10 | 1.20 | 1.13 | 0.58 | 0.29 | 0.16 | 0.08 | 0.06 | 1.13 | 0.58 | 0.29 | 0.16 | 0.08 | 0.06 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.05 | 0.10 | 0.04 | 0.07 | 0.04 | 0.07 | 0.00 | 0.00 | 0.01 | 0.03 | 0.07 | 0.04 | 1 | 0 | 0.98 | 0 | 0.98 | 0 | 0.89 | 0 | 0.92 | 0 | 0.83 | 0 | ||||||||
►HPL_dlacpy | xhpl | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.17 | 1.14 | 1.15 | 1.24 | 1.40 | 1.58 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 1.06 | 0.56 | 0.28 | 0.16 | 0.10 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.02 | 0.07 | 0.04 | 0.05 | 0.05 | 0.08 | 0.08 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -o HPL_dlacpy.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRES... | 1 | 0 | 0.95 | 0 | 0.94 | 0 | 0.82 | 0 | 0.69 | 0 | 0.59 | 0.01 |
►Loop 162 - HPL_dlacpy.c:167-304 - xhpl [...] | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.15 | 1.20 | 1.20 | 1.24 | 1.39 | 1.61 | 0.14 | 0.11 | 0.15 | 0.09 | 0.13 | 0.16 | 1.05 | 0.56 | 0.28 | 0.16 | 0.10 | 0.08 | 0.11 | 0.04 | 0.03 | 0.01 | 0.01 | 0.01 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.02 | 0.03 | 0.01 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.39 | 0 | 0.95 | 0 | 1.19 | 0 | 0.86 | 0 | 0.73 | 0 | ||||||||
○Loop 163 - HPL_dlacpy.c:174-195 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 161 - HPL_dlacpy.c:167-304 - xhpl [...] | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 1.01 | 1.09 | 1.04 | 1.15 | 1.26 | 1.45 | 1.01 | 1.09 | 1.04 | 1.15 | 1.26 | 1.45 | 0.94 | 0.52 | 0.25 | 0.15 | 0.09 | 0.07 | 0.94 | 0.52 | 0.25 | 0.15 | 0.09 | 0.07 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.02 | 0.06 | 0.06 | 0.07 | 0.04 | 0.06 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.91 | 0 | 0.93 | 0 | 0.78 | 0 | 0.67 | 0 | 0.57 | 0.01 | ||||||||
►Loop 158 - HPL_dlacpy.c:167-304 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 159 - HPL_dlacpy.c:289-294 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 160 - HPL_dlacpy.c:289-294 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 155 - HPL_dlacpy.c:167-304 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2 | 0 | 1 | 1 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 153 - HPL_dlacpy.c:169-280 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2 | 1 | 1 | 2 | 2 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 154 - HPL_dlacpy.c:289-294 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 156 - HPL_dlacpy.c:294-294 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 157 - HPL_dlacpy.c:174-195 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 146 - HPL_dlacpy.c:311-337 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 147 - HPL_dlacpy.c:337-337 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 145 - HPL_dlacpy.c:337-337 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 144 - HPL_dlacpy.c:311-321 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 141 - HPL_dlacpy.c:316-321 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 142 - HPL_dlacpy.c:311-321 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 143 - HPL_dlacpy.c:294-294 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
►Loop 150 - HPL_dlacpy.c:311-337 - xhpl [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 152 - HPL_dlacpy.c:316-321 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 148 - HPL_dlacpy.c:313-321 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 151 - HPL_dlacpy.c:337-337 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 149 - HPL_dlacpy.c:337-337 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○mca_pml_base_recv_request_construct | libmpi.so.40.30.2 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 1.23 | 0.64 | 0.56 | 0.38 | 0.35 | 0.23 | 1.23 | 0.64 | 0.56 | 0.38 | 0.35 | 0.23 | 0.97 | 0.28 | 0.09 | 0.03 | 0.01 | 0.01 | 0.97 | 0.28 | 0.09 | 0.03 | 0.01 | 0.01 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.01 | 0.23 | 0.08 | 0.18 | 0.11 | 0.11 | 0.07 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.72 | 0 | 2.65 | 0 | 3.63 | 0 | 4.35 | 0 | 4.78 | 0 | |
►HPL_bcast | xhpl | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 1.24 | 0.66 | 0.44 | 0.29 | 0.25 | 0.20 | 0.84 | 0.37 | 0.25 | 0.18 | 0.16 | 0.12 | 0.92 | 0.26 | 0.06 | 0.03 | 0.01 | 0.01 | 0.56 | 0.14 | 0.04 | 0.02 | 0.01 | 0.00 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.22 | 0.11 | 0.14 | 0.08 | 0.07 | 0.06 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -o HPL_bcast.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROGRESS... | 1 | 0 | 1.78 | 0 | 3.67 | 0 | 3.88 | 0 | 5.24 | 0 | 5.36 | 0 |
○Loop 194 - HPL_bcast.c:101-118 - xhpl [...] | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.40 | 0.29 | 0.18 | 0.12 | 0.09 | 0.11 | 0.40 | 0.29 | 0.18 | 0.12 | 0.09 | 0.11 | 0.36 | 0.11 | 0.03 | 0.01 | 0.00 | 0.00 | 0.36 | 0.11 | 0.03 | 0.01 | 0.00 | 0.00 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.03 | 0.05 | 0.06 | 0.03 | 0.02 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.6 | 0 | 3.45 | 0 | 4.06 | 0 | 5.38 | 0 | 4.37 | 0 | ||||||||
○__memcpy | libastring.so | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 1.00 | 1.02 | 0.98 | 0.92 | 0.96 | 0.97 | 0.00 | 0.01 | 0.98 | 0.00 | 0.00 | 0.00 | 0.89 | 0.46 | 0.23 | 0.12 | 0.06 | 0.05 | 0.00 | 0.00 | 0.23 | 0.00 | 0.00 | 0.00 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.01 | 0.07 | 0.09 | 0.08 | 0.03 | 0.07 | 0.07 | MPI (%): 100.00 | MPI (%): 100.00 String (%): 0.00 | MPI (%): 100.00 String (%): 0.00 | MPI (%): 100.00 String (%): 0.00 | MPI (%): 100.00 String (%): 0.00 | MPI (%): 100.00 String (%): 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.97 | 0 | 0.96 | 0 | 0.92 | 0 | 0.87 | 0 | 0.82 | 0 | |
►HPL_dlaswp03N | xhpl | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.75 | 0.67 | 0.68 | 0.69 | 0.74 | 0.74 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.71 | 0.34 | 0.18 | 0.09 | 0.05 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.03 | 0.01 | 0.02 | 0.01 | 0.02 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.01 | 0.04 | 0.10 | 0.24 | 0.46 | 0.61 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -o HPL_dlaswp03N.o -c -D Add__ -D F77_INTEGER=int -D StringSunStyle -D HPL_DETAILED_TIMING -D HPL_PROG... | 1 | 0 | 1.05 | 0 | 1.01 | 0 | 0.95 | 0 | 0.85 | 0 | 0.79 | 0 |
►Loop 323 - HPL_dlaswp03N.c:144-177 - xhpl | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.75 | 0.67 | 0.68 | 0.69 | 0.74 | 0.74 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.71 | 0.34 | 0.18 | 0.09 | 0.05 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 322 - HPL_dlaswp03N.c:147-177 - xhpl | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.75 | 0.67 | 0.68 | 0.69 | 0.74 | 0.74 | 0.75 | 0.67 | 0.68 | 0.69 | 0.74 | 0.74 | 0.71 | 0.34 | 0.18 | 0.09 | 0.05 | 0.04 | 0.71 | 0.34 | 0.18 | 0.09 | 0.05 | 0.04 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.03 | 0.01 | 0.02 | 0.01 | 0.02 | 0.02 | 0.01 | 0.04 | 0.10 | 0.24 | 0.46 | 0.61 | 1 | 0 | 1.05 | 0 | 1.01 | 0 | 0.95 | 0 | 0.85 | 0 | 0.79 | 0 | ||||||||
►Loop 320 - HPL_dlaswp03N.c:184-188 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 319 - HPL_dlaswp03N.c:188-188 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○Loop 321 - HPL_dlaswp03N.c:188-188 - xhpl | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||||||
○__kmp_api_omp_get_thread_num | libomp.so | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.57 | 0.31 | 0.23 | 0.13 | 0.09 | 0.07 | 0.57 | 0.31 | 0.23 | 0.13 | 0.09 | 0.07 | 0.50 | 0.25 | 0.17 | 0.09 | 0.06 | 0.05 | 0.50 | 0.25 | 0.17 | 0.09 | 0.06 | 0.05 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.08 | 0.04 | 0.04 | 0.02 | 0.02 | 0.01 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.82 | 1.58 | 2.76 | 5.31 | 10.30 | 12.18 | 1 | 0 | 1.01 | 0 | 0.73 | 0 | 0.68 | 0 | 0.51 | 0 | 0.46 | 0 | |
○opal_mutex_construct | libopen-pal.so.40.30.2 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.58 | 0.38 | 0.22 | 0.15 | 0.13 | 0.10 | 0.58 | 0.38 | 0.22 | 0.15 | 0.13 | 0.10 | 0.48 | 0.15 | 0.04 | 0.02 | 0.01 | 0.00 | 0.48 | 0.15 | 0.04 | 0.02 | 0.01 | 0.00 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.07 | 0.08 | 0.06 | 0.04 | 0.03 | 0.03 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.64 | 0 | 2.93 | 0 | 3.72 | 0 | 4.09 | 0 | 5.27 | 0 | |
○@plt_start@ | mca_pml_ob1.so | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.53 | 0.28 | 0.21 | 0.15 | 0.14 | 0.13 | 0.16 | 0.00 | 0.06 | 0.00 | 0.00 | 0.04 | 0.48 | 0.14 | 0.04 | 0.02 | 0.01 | 0.01 | 0.13 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 4 | 4 | 4 | 4 | 4 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.04 | 0.01 | 0.04 | 0.01 | 0.03 | 0.02 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.75 | 0 | 2.79 | 0 | 3.1 | 0 | 3.7 | 0 | 3.37 | 0 | |
○__sched_yield | libc.so.6 | 0.00 | 0.02 | 0.05 | 0.10 | 0.16 | 0.22 | 0.00 | 0.00 | 0.05 | 0.10 | 0.16 | 0.00 | 0.00 | 1.25 | 1.33 | 1.16 | 1.16 | 1.15 | 0.00 | 0.00 | 1.33 | 1.16 | 1.16 | 0.00 | 0.00 | 0.54 | 0.81 | 0.92 | 0.95 | 1.06 | 0.00 | 0.00 | 0.81 | 0.92 | 0.95 | 0.00 | 0 | 8 | 15 | 32 | 64 | 96 | 0.00 | 0.02 | 0.03 | 0.04 | 0.04 | 0.05 | 0.00 | 0.55 | 0.44 | 0.33 | 0.22 | 0.18 | NA | OMP (%): 97.77 System (%): 0.00 Math (%): 2.23 | OMP (%): 99.76 System (%): 0.00 Math (%): 0.24 | OMP (%): 99.53 System (%): 0.00 Math (%): 0.47 | OMP (%): 99.72 System (%): 0.00 Math (%): 0.28 | OMP (%): 99.38 System (%): 0.00 Math (%): 0.62 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libomp.so | 0.00 | 0.03 | 0.06 | 0.14 | 0.28 | 0.40 | 0.00 | 0.03 | 0.06 | 0.14 | 0.28 | 0.40 | 0.00 | 1.52 | 1.44 | 1.66 | 1.83 | 2.62 | 0.00 | 1.52 | 1.44 | 1.66 | 1.83 | 2.62 | 0.00 | 0.71 | 1.00 | 1.36 | 1.71 | 1.96 | 0.00 | 0.71 | 1.00 | 1.36 | 1.71 | 1.96 | 0 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.03 | 0.04 | 0.05 | 0.08 | 0.09 | 0.00 | 0.70 | 0.56 | 0.46 | 0.39 | 0.36 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libomp.so | 0.00 | 0.67 | 1.53 | 3.14 | 5.61 | 7.82 | 0.00 | 0.67 | 1.53 | 3.14 | 5.61 | 7.82 | 0.00 | 39.18 | 34.57 | 35.50 | 34.64 | 36.56 | 0.00 | 39.18 | 34.57 | 35.50 | 34.64 | 36.56 | 0.00 | 17.88 | 24.70 | 30.23 | 34.43 | 38.47 | 0.00 | 17.88 | 24.70 | 30.23 | 34.43 | 38.47 | 0 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.72 | 0.92 | 1.20 | 1.49 | 1.60 | 0.00 | 18.32 | 14.05 | 10.41 | 7.77 | 6.32 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |