Run run_0 | Number processes: 1Number nodes: 1Run Command: <executable> -t 1 -c -f input-matrix/mat_dim_493039.txt -r 100000MPI Command: Dataset: Run Directory: /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main |
---|---|
Run o2 | Run Command: <executable> -t 2 -c -f input-matrix/mat_dim_493039.txt -r 100000OMP_NUM_THREADS: 2 |
Run o4 | Run Command: <executable> -t 4 -c -f input-matrix/mat_dim_493039.txt -r 100000OMP_NUM_THREADS: 4 |
Run o8 | Run Command: <executable> -t 8 -c -f input-matrix/mat_dim_493039.txt -r 100000OMP_NUM_THREADS: 8 |
Run o16 | Run Command: <executable> -t 16 -c -f input-matrix/mat_dim_493039.txt -r 100000OMP_NUM_THREADS: 16 |
Run o26 | Run Command: <executable> -t 26 -c -f input-matrix/mat_dim_493039.txt -r 100000OMP_NUM_THREADS: 26 |
Run o52 | Run Command: <executable> -t 52 -c -f input-matrix/mat_dim_493039.txt -r 100000OMP_NUM_THREADS: 52 |
Loop id | Source Location | Source Function | Level | Exclusive coverage run_0 (%) | Exclusive coverage o2 (%) | Exclusive coverage o4 (%) | Exclusive coverage o8 (%) | Exclusive coverage o16 (%) | Exclusive coverage o26 (%) | Exclusive coverage o52 (%) | Inclusive coverage run_0 (%) | Inclusive coverage o2 (%) | Inclusive coverage o4 (%) | Inclusive coverage o8 (%) | Inclusive coverage o16 (%) | Inclusive coverage o26 (%) | Inclusive coverage o52 (%) | Max Exclusive Time Over Threads run_0 (s) | Max Exclusive Time Over Threads o2 (s) | Max Exclusive Time Over Threads o4 (s) | Max Exclusive Time Over Threads o8 (s) | Max Exclusive Time Over Threads o16 (s) | Max Exclusive Time Over Threads o26 (s) | Max Exclusive Time Over Threads o52 (s) | Max Inclusive Time Over Threads run_0 (s) | Max Inclusive Time Over Threads o2 (s) | Max Inclusive Time Over Threads o4 (s) | Max Inclusive Time Over Threads o8 (s) | Max Inclusive Time Over Threads o16 (s) | Max Inclusive Time Over Threads o26 (s) | Max Inclusive Time Over Threads o52 (s) | Exclusive Time w.r.t. Wall Time run_0 (s) | Exclusive Time w.r.t. Wall Time o2 (s) | Exclusive Time w.r.t. Wall Time o4 (s) | Exclusive Time w.r.t. Wall Time o8 (s) | Exclusive Time w.r.t. Wall Time o16 (s) | Exclusive Time w.r.t. Wall Time o26 (s) | Exclusive Time w.r.t. Wall Time o52 (s) | Inclusive Time w.r.t. Wall Time run_0 (s) | Inclusive Time w.r.t. Wall Time o2 (s) | Inclusive Time w.r.t. Wall Time o4 (s) | Inclusive Time w.r.t. Wall Time o8 (s) | Inclusive Time w.r.t. Wall Time o16 (s) | Inclusive Time w.r.t. Wall Time o26 (s) | Inclusive Time w.r.t. Wall Time o52 (s) | Nb Threads run_0 | Nb Threads o2 | Nb Threads o4 | Nb Threads o8 | Nb Threads o16 | Nb Threads o26 | Nb Threads o52 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing run_0 | Speedup If Perfect Load Balancing o2 | Speedup If Perfect Load Balancing o4 | Speedup If Perfect Load Balancing o8 | Speedup If Perfect Load Balancing o16 | Speedup If Perfect Load Balancing o26 | Speedup If Perfect Load Balancing o52 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | (run_0) Efficiency | (run_0) Potential Speed-Up (%) | (o2) Efficiency | (o2) Potential Speed-Up (%) | (o4) Efficiency | (o4) Potential Speed-Up (%) | (o8) Efficiency | (o8) Potential Speed-Up (%) | (o16) Efficiency | (o16) Potential Speed-Up (%) | (o26) Efficiency | (o26) Potential Speed-Up (%) | (o52) Efficiency | (o52) Potential Speed-Up (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3 | spmxv.exe - main.cpp:72-74 | spmxv(ooo_options*, ooo_input*) [clone ._omp_fn.2] | Innermost | 95.31 | 94.76 | 90.44 | 90.38 | 91.02 | 89.97 | 79.61 | 95.31 | 94.76 | 90.44 | 90.38 | 91.02 | 89.97 | 79.61 | 1039.56 | 535.77 | 277.31 | 148.06 | 81.94 | 67.03 | 27.85 | 1039.56 | 535.77 | 277.31 | 148.06 | 81.94 | 67.03 | 27.85 | 1039.56 | 535.94 | 269.63 | 145.58 | 82.31 | 67.39 | 29.60 | 1039.56 | 535.94 | 269.63 | 145.58 | 82.31 | 67.39 | 29.60 | 1 | 2 | 4 | 8 | 16 | 26 | 52 | 0 | 12.5 | 1 | 2.91 | 8 | 1 | 1 | 1.04 | 1.04 | 1.04 | 1.05 | 1.06 | 0 | 2 | 0 | 0 | 1 | 1 | 0 | 0.97 | 2.86 | 0.96 | 3.27 | 0.89 | 9.7 | 0.79 | 19.17 | 0.59 | 36.59 | 0.68 | 25.85 |
1 | spmxv.exe - main.cpp:68-76 | spmxv(ooo_options*, ooo_input*) [clone ._omp_fn.2] | InBetween | 4.22 | 4.47 | 4.62 | 4.27 | 3.82 | 2.95 | 3.71 | 99.53 | 99.24 | 95.06 | 94.65 | 94.84 | 92.92 | 83.32 | 45.98 | 25.95 | 13.86 | 7.16 | 3.60 | 2.36 | 1.40 | 1085.54 | 560.21 | 291.17 | 155.22 | 85.21 | 69.07 | 29.23 | 45.98 | 25.30 | 13.76 | 6.88 | 3.46 | 2.21 | 1.38 | 1085.54 | 561.24 | 283.39 | 152.45 | 85.76 | 69.60 | 30.98 | 1 | 2 | 4 | 8 | 16 | 26 | 52 | 20 | 12.5 | 1.7 | 1 | 13.22 | 1 | 1.03 | 1.02 | 1.07 | 1.09 | 1.13 | 1.14 | 0.5 | 2 | 1 | 0 | 0 | 1 | 0 | 0.91 | 0.41 | 0.84 | 0.76 | 0.84 | 0.7 | 0.83 | 0.64 | 0.8 | 0.59 | 0.64 | 1.34 |
2 | spmxv.exe - main.cpp:65-76 | spmxv(ooo_options*, ooo_input*) [clone ._omp_fn.2] | Outermost | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.02 | 99.55 | 99.26 | 95.08 | 94.68 | 94.85 | 92.94 | 83.34 | 0.25 | 0.14 | 0.07 | 0.04 | 0.03 | 0.02 | 0.04 | 1085.79 | 560.34 | 291.24 | 155.27 | 84.81 | 69.08 | 29.24 | 0.25 | 0.14 | 0.05 | 0.04 | 0.01 | 0.01 | 0.01 | 1085.79 | 561.37 | 283.44 | 152.49 | 85.78 | 69.61 | 30.99 | 1 | 2 | 4 | 8 | 14 | 24 | 31 | 0 | 9.38 | 1 | 1 | 14.4 | 1 | 1.05 | 1.47 | 1.31 | 2.23 | 2.26 | 3.62 | 1 | 3 | 1 | 0 | 0 | 1 | 0 | 0.91 | 0 | 1.3 | -0 | 0.89 | 0 | 1.09 | -0 | 0.89 | 0 | 0.74 | 0 |
91 | spmxv.exe - ooo_cmdline.h:83-97 [...] | void load_drops_matlab_matrix<double, int>(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, int*&, int*&, double*&, int&, int&, int&) | Innermost | 0.00 | 0.01 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.04 | 0.07 | 0.13 | 0.04 | 0.07 | 0.05 | 0.04 | 0.04 | 0.07 | 0.13 | 0.04 | 0.07 | 0.05 | 0.04 | 0.04 | 0.04 | 0.03 | 0.01 | 0.00 | 0.00 | 0.00 | 0.04 | 0.04 | 0.03 | 0.01 | 0.00 | 0.00 | 0.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 11.7 | 4.71 | 1 | 12.97 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 3 | 1 | 2.5 | 0 | 0 | 1 | 0 | 0.6 | 0 | 0.35 | 0.01 | 0.98 | 0 | 0.61 | 0 | 0.85 | 0 | 0.89 | 0 |
81 | spmxv.exe - ooo_cmdline.cpp:171-173 | print_error_check(double*, ooo_input*, ooo_options*) | Innermost | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 2.91 | 8 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 0.99 | 0 | 1.46 | -0 | 0.96 | 0 | 2.84 | -0 | 0.89 | 0 |
7 | spmxv.exe - main.cpp:49-52 | spmxv(ooo_options*, ooo_input*) [clone ._omp_fn.1] | Innermost | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 1 | 1 | 3 | 1 | 100 | 50 | 1 | 1 | 2 | 1 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 4 | 0 | 0 | 0 | 1 | 0 | 1.97 | -0 | 0.63 | 0 | 1.78 | -0 | |||||||||||||||||||||
4 | spmxv.exe - main.cpp:45-52 | spmxv(ooo_options*, ooo_input*) [clone ._omp_fn.1] | InBetween | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 2 | 3 | 1 | 1 | 5 | 1 | 18.18 | 12.5 | 3.38 | 1 | 12.2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | NA | NA | NA | NA | NA | 1 | 0 | 0.25 | 0 | 0.33 | 0 | 0.98 | 0 | 0.96 | 0 | 0.19 | 0 | 0.89 | 0 |
11 | spmxv.exe - main.cpp:60-79 [...] | spmxv(ooo_options*, ooo_input*) | Single | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.03 | 0.00 | 0.00 | 0.02 | 0.00 | 0.03 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 1 | 1 | 1 | 1 | 1 | 50 | 22.5 | 1.75 | 1 | 5.89 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 2 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0.25 | 0 | 0.95 | 0 | 0.15 | 0 | ||||||||||||||
79 | spmxv.exe - ooo_cmdline.cpp:165-175 | print_error_check(double*, ooo_input*, ooo_options*) | Outermost | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 17.14 | 12.5 | 1 | 2.15 | 12.65 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0.5 | 1 | 1 | 0 | 0.5 | 1 | 0 | 1 | 0 | |||||||||||||||||||||||||||||||||||
9 | spmxv.exe - main.cpp:32-37 | spmxv(ooo_options*, ooo_input*) [clone ._omp_fn.0] | Outermost | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 18.6 | 12.5 | 4.41 | 1 | 12.4 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | NA | NA | NA | NA | NA | 1 | 0 | ||||||||||||||||||||||||||||||||||||||||||
80 | spmxv.exe - ooo_cmdline.cpp:180-181 | print_error_check(double*, ooo_input*, ooo_options*) | Single | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 12.5 | 1 | 2.67 | 8 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 |