| Loop id | Source Location | Source Function | Level | Max Thread Time / Walltime run_0 (%) | Max Thread Time / Walltime m2 (%) | Max Thread Time / Walltime m4 (%) | Max Thread Time / Walltime m8 (%) | Max Thread Time / Walltime m16 (%) | Max Thread Time / Walltime m32 (%) | Exclusive Coverage run_0 (%) | Exclusive Coverage m2 (%) | Exclusive Coverage m4 (%) | Exclusive Coverage m8 (%) | Exclusive Coverage m16 (%) | Exclusive Coverage m32 (%) | Inclusive Coverage run_0 (%) | Inclusive Coverage m2 (%) | Inclusive Coverage m4 (%) | Inclusive Coverage m8 (%) | Inclusive Coverage m16 (%) | Inclusive Coverage m32 (%) | Max Exclusive Time Over Threads run_0 (s) | Max Exclusive Time Over Threads m2 (s) | Max Exclusive Time Over Threads m4 (s) | Max Exclusive Time Over Threads m8 (s) | Max Exclusive Time Over Threads m16 (s) | Max Exclusive Time Over Threads m32 (s) | Max Inclusive Time Over Threads run_0 (s) | Max Inclusive Time Over Threads m2 (s) | Max Inclusive Time Over Threads m4 (s) | Max Inclusive Time Over Threads m8 (s) | Max Inclusive Time Over Threads m16 (s) | Max Inclusive Time Over Threads m32 (s) | Exclusive Time w.r.t. Wall Time run_0 (s) | Exclusive Time w.r.t. Wall Time m2 (s) | Exclusive Time w.r.t. Wall Time m4 (s) | Exclusive Time w.r.t. Wall Time m8 (s) | Exclusive Time w.r.t. Wall Time m16 (s) | Exclusive Time w.r.t. Wall Time m32 (s) | Inclusive Time w.r.t. Wall Time run_0 (s) | Inclusive Time w.r.t. Wall Time m2 (s) | Inclusive Time w.r.t. Wall Time m4 (s) | Inclusive Time w.r.t. Wall Time m8 (s) | Inclusive Time w.r.t. Wall Time m16 (s) | Inclusive Time w.r.t. Wall Time m32 (s) | Nb Threads run_0 | Nb Threads m2 | Nb Threads m4 | Nb Threads m8 | Nb Threads m16 | Nb Threads m32 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing run_0 | Speedup If Perfect Load Balancing m2 | Speedup If Perfect Load Balancing m4 | Speedup If Perfect Load Balancing m8 | Speedup If Perfect Load Balancing m16 | Speedup If Perfect Load Balancing m32 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | Array Access Efficiency | (run_0) Efficiency | (run_0) Potential Speed-Up (%) | (m2) Efficiency | (m2) Potential Speed-Up (%) | (m4) Efficiency | (m4) Potential Speed-Up (%) | (m8) Efficiency | (m8) Potential Speed-Up (%) | (m16) Efficiency | (m16) Potential Speed-Up (%) | (m32) Efficiency | (m32) Potential Speed-Up (%) |
|---|
| 18 | xy_model - simulation.cpp:27-56 [...] | metropolisHalfSweep(Lattice&, int, int, double, int, int, int) | InBetween | 13.33 | 13.68 | 13.53 | 13.36 | 12.49 | 11.09 | 13.35 | 13.64 | 13.22 | 12.62 | 11.56 | 10.09 | 32.85 | 33.88 | 32.90 | 31.41 | 28.73 | 24.65 | 40.78 | 20.62 | 10.45 | 5.47 | 2.81 | 1.50 | 100.34 | 51.27 | 25.36 | 13.35 | 6.62 | 3.50 | 40.78 | 20.50 | 10.15 | 5.11 | 2.55 | 1.32 | 100.34 | 50.90 | 25.26 | 12.71 | 6.33 | 3.22 | 1 | 2 | 4 | 8 | 16 | 32 | 8.87 | 13.81 | 1.66 | 1.74 | 9.81 | 1 | 1.01 | 1.03 | 1.07 | 1.1 | 1.14 | NA | NA | NA | NA | NA | 0.00 | 1 | 0 | 0.99 | 0.07 | 1 | 0 | 1 | 0.03 | 1 | 0 | 0.97 | 0.34 |
| 10 | xy_model - simulation.cpp:46-46 | metropolisHalfSweep(Lattice&, int, int, double, int, int, int) | Innermost | 4.08 | 4.23 | 4.32 | 4.18 | 4.32 | 4.14 | 4.08 | 4.23 | 4.17 | 3.85 | 3.71 | 3.04 | 4.08 | 4.23 | 4.17 | 3.85 | 3.71 | 3.04 | 12.46 | 6.38 | 3.33 | 1.71 | 0.97 | 0.56 | 12.46 | 6.38 | 3.33 | 1.71 | 0.97 | 0.56 | 12.46 | 6.36 | 3.20 | 1.56 | 0.82 | 0.40 | 12.46 | 6.36 | 3.20 | 1.56 | 0.82 | 0.40 | 1 | 2 | 4 | 8 | 16 | 32 | NA | NA | NA | NA | NA | 1 | 1 | 1.04 | 1.1 | 1.19 | 1.41 | NA | NA | NA | NA | NA | 0.00 | 1 | 0 | 0.98 | 0.08 | 0.97 | 0.11 | 1 | 0 | 0.95 | 0.17 | 0.98 | 0.06 |
| 9 | xy_model - random.tcc:458-3558 [...] | double std::__generate_canonical_pow2<double, 53ul, std::mersenne_twister_engine<unsigned long, 32ul, 624ul, 397ul, 31ul, 2567483615ul, 11ul, 4294967295ul, 7ul, 2636928640ul, 15ul, 4022730752ul, 18ul, 1812433253ul> >(std::mersenne_twister_engin... | Single | 3.63 | 3.73 | 3.76 | 3.76 | 3.58 | 3.25 | 3.63 | 3.72 | 3.59 | 3.44 | 3.27 | 2.72 | 3.63 | 3.72 | 3.59 | 3.44 | 3.27 | 2.72 | 11.10 | 5.62 | 2.90 | 1.54 | 0.81 | 0.44 | 11.10 | 5.62 | 2.90 | 1.54 | 0.81 | 0.44 | 11.10 | 5.59 | 2.75 | 1.39 | 0.72 | 0.36 | 11.10 | 5.59 | 2.75 | 1.39 | 0.72 | 0.36 | 1 | 2 | 4 | 8 | 16 | 32 | 2.38 | 11.9 | 5.02 | 1 | 11.83 | 1 | 1.01 | 1.05 | 1.11 | 1.12 | 1.24 | 1.5 | 0 | 0 | 0.5 | 0 | 87.50 | 1 | 0 | 0.99 | 0.03 | 1.01 | 0 | 1 | 0.01 | 0.96 | 0.12 | 0.98 | 0.06 |
| 14 | xy_model - simulation.cpp:46-46 | metropolisHalfSweep(Lattice&, int, int, double, int, int, int) | Innermost | 2.56 | 2.73 | 2.67 | 2.55 | 2.47 | 2.48 | 2.56 | 2.62 | 2.56 | 2.39 | 2.21 | 1.91 | 2.56 | 2.62 | 2.56 | 2.39 | 2.21 | 1.91 | 7.82 | 4.11 | 2.06 | 1.04 | 0.56 | 0.33 | 7.82 | 4.11 | 2.06 | 1.04 | 0.56 | 0.33 | 7.82 | 3.94 | 1.97 | 0.97 | 0.49 | 0.25 | 7.82 | 3.94 | 1.97 | 0.97 | 0.49 | 0.25 | 1 | 2 | 4 | 8 | 16 | 32 | NA | NA | NA | NA | NA | 1 | 1.04 | 1.05 | 1.08 | 1.14 | 1.34 | NA | NA | NA | NA | NA | 0.00 | 1 | 0 | 0.99 | 0.02 | 0.99 | 0.01 | 1.01 | 0 | 1 | 0 | 0.98 | 0.04 |
| 16 | xy_model - simulation.cpp:46-46 | metropolisHalfSweep(Lattice&, int, int, double, int, int, int) | Innermost | 2.56 | 2.92 | 2.78 | 2.60 | 2.76 | 2.48 | 2.56 | 2.79 | 2.64 | 2.48 | 2.32 | 1.96 | 2.56 | 2.79 | 2.64 | 2.48 | 2.32 | 1.96 | 7.81 | 4.39 | 2.14 | 1.06 | 0.62 | 0.34 | 7.81 | 4.39 | 2.14 | 1.06 | 0.62 | 0.34 | 7.82 | 4.20 | 2.02 | 1.01 | 0.51 | 0.26 | 7.82 | 4.20 | 2.02 | 1.01 | 0.51 | 0.26 | 1 | 2 | 4 | 8 | 16 | 32 | NA | NA | NA | NA | NA | 1 | 1.05 | 1.06 | 1.06 | 1.21 | 1.31 | NA | NA | NA | NA | NA | 0.00 | 1 | 0 | 0.93 | 0.19 | 0.97 | 0.09 | 0.97 | 0.07 | 0.95 | 0.11 | 0.96 | 0.09 |
| 12 | xy_model - simulation.cpp:46-46 | metropolisHalfSweep(Lattice&, int, int, double, int, int, int) | Innermost | 2.54 | 2.68 | 2.60 | 2.79 | 2.83 | 2.37 | 2.54 | 2.66 | 2.52 | 2.58 | 2.17 | 1.92 | 2.54 | 2.66 | 2.52 | 2.58 | 2.17 | 1.92 | 7.76 | 4.04 | 2.00 | 1.14 | 0.63 | 0.32 | 7.76 | 4.04 | 2.00 | 1.14 | 0.63 | 0.32 | 7.76 | 4.00 | 1.93 | 1.04 | 0.48 | 0.25 | 7.76 | 4.00 | 1.93 | 1.04 | 0.48 | 0.25 | 1 | 2 | 4 | 8 | 16 | 32 | NA | NA | NA | NA | NA | 1 | 1.01 | 1.04 | 1.09 | 1.33 | 1.28 | NA | NA | NA | NA | NA | 0.00 | 1 | 0 | 0.97 | 0.08 | 1 | 0 | 0.93 | 0.18 | 1.02 | 0 | 0.97 | 0.06 |
| 15 | xy_model - simulation.cpp:47-47 | metropolisHalfSweep(Lattice&, int, int, double, int, int, int) | Innermost | 2.15 | 2.12 | 2.23 | 2.10 | 2.18 | 2.11 | 2.15 | 2.07 | 2.11 | 1.92 | 1.80 | 1.60 | 2.15 | 2.07 | 2.11 | 1.92 | 1.80 | 1.60 | 6.58 | 3.20 | 1.72 | 0.86 | 0.49 | 0.28 | 6.58 | 3.20 | 1.72 | 0.86 | 0.49 | 0.28 | 6.58 | 3.12 | 1.62 | 0.78 | 0.40 | 0.21 | 6.58 | 3.12 | 1.62 | 0.78 | 0.40 | 0.21 | 1 | 2 | 4 | 8 | 16 | 32 | NA | NA | NA | NA | NA | 1 | 1.03 | 1.06 | 1.11 | 1.24 | 1.36 | NA | NA | NA | NA | NA | 0.00 | 1 | 0 | 1.06 | 0 | 1.02 | 0 | 1.06 | 0 | 1.04 | 0 | 0.98 | 0.03 |
| 11 | xy_model - simulation.cpp:47-47 | metropolisHalfSweep(Lattice&, int, int, double, int, int, int) | Innermost | 1.91 | 2.09 | 2.01 | 2.77 | 1.96 | 1.96 | 1.92 | 2.01 | 1.93 | 2.04 | 1.70 | 1.49 | 1.92 | 2.01 | 1.93 | 2.04 | 1.70 | 1.49 | 5.86 | 3.15 | 1.55 | 1.14 | 0.44 | 0.26 | 5.86 | 3.15 | 1.55 | 1.14 | 0.44 | 0.26 | 5.86 | 3.01 | 1.49 | 0.83 | 0.37 | 0.20 | 5.86 | 3.01 | 1.49 | 0.83 | 0.37 | 0.20 | 1 | 2 | 4 | 8 | 16 | 32 | NA | NA | NA | NA | NA | 1 | 1.05 | 1.04 | 1.37 | 1.17 | 1.36 | NA | NA | NA | NA | NA | 0.00 | 1 | 0 | 0.97 | 0.06 | 0.99 | 0.03 | 0.88 | 0.24 | 0.98 | 0.04 | 0.94 | 0.09 |
| 17 | xy_model - simulation.cpp:47-47 | metropolisHalfSweep(Lattice&, int, int, double, int, int, int) | Innermost | 1.91 | 2.03 | 2.03 | 2.07 | 1.85 | 1.66 | 1.92 | 1.97 | 1.98 | 1.84 | 1.59 | 1.34 | 1.92 | 1.97 | 1.98 | 1.84 | 1.59 | 1.34 | 5.86 | 3.06 | 1.57 | 0.85 | 0.42 | 0.22 | 5.86 | 3.06 | 1.57 | 0.85 | 0.42 | 0.22 | 5.85 | 2.96 | 1.52 | 0.75 | 0.35 | 0.18 | 5.85 | 2.96 | 1.52 | 0.75 | 0.35 | 0.18 | 1 | 2 | 4 | 8 | 16 | 32 | NA | NA | NA | NA | NA | 1 | 1.03 | 1.04 | 1.13 | 1.19 | 1.28 | NA | NA | NA | NA | NA | 0.00 | 1 | 0 | 0.99 | 0.02 | 0.97 | 0.07 | 0.98 | 0.04 | 1.05 | 0 | 1.04 | 0 |
| 13 | xy_model - simulation.cpp:47-47 | metropolisHalfSweep(Lattice&, int, int, double, int, int, int) | Innermost | 1.77 | 1.94 | 1.88 | 1.80 | 2.07 | 1.74 | 1.77 | 1.88 | 1.77 | 1.68 | 1.68 | 1.30 | 1.77 | 1.88 | 1.77 | 1.68 | 1.68 | 1.30 | 5.41 | 2.93 | 1.45 | 0.73 | 0.47 | 0.23 | 5.41 | 2.93 | 1.45 | 0.73 | 0.47 | 0.23 | 5.41 | 2.82 | 1.36 | 0.68 | 0.37 | 0.17 | 5.41 | 2.82 | 1.36 | 0.68 | 0.37 | 0.17 | 1 | 2 | 4 | 8 | 16 | 32 | NA | NA | NA | NA | NA | 1 | 1.04 | 1.07 | 1.08 | 1.26 | 1.38 | NA | NA | NA | NA | NA | 0.00 | 1 | 0 | 0.96 | 0.08 | 1 | 0.01 | 1 | 0 | 0.92 | 0.14 | 0.99 | 0.01 |
| 23 | xy_model - random.tcc:412-417 | std::mersenne_twister_engine<unsigned long, 32ul, 624ul, 397ul, 31ul, 2567483615ul, 11ul, 4294967295ul, 7ul, 2636928640ul, 15ul, 4022730752ul, 18ul, 1812433253ul>::_M_gen_rand() | Single | 0.27 | 0.33 | 0.33 | 0.42 | 0.49 | 0.37 | 0.27 | 0.29 | 0.30 | 0.34 | 0.29 | 0.23 | 0.27 | 0.29 | 0.30 | 0.34 | 0.29 | 0.23 | 0.82 | 0.49 | 0.25 | 0.17 | 0.11 | 0.05 | 0.82 | 0.49 | 0.25 | 0.17 | 0.11 | 0.05 | 0.82 | 0.44 | 0.23 | 0.14 | 0.06 | 0.03 | 0.82 | 0.44 | 0.23 | 0.14 | 0.06 | 0.03 | 1 | 2 | 4 | 8 | 16 | 31 | 100 | 50 | 1 | 1 | 2 | 1 | 1.11 | 1.11 | 1.25 | 1.75 | 1.62 | 0 | 0 | 1 | 0 | 0 | 75.00 | 1 | 0 | 0.94 | 0.02 | 0.9 | 0.03 | 0.76 | 0.08 | 0.82 | 0.05 | 0.86 | 0.03 |
| 22 | xy_model - random.tcc:404-409 | std::mersenne_twister_engine<unsigned long, 32ul, 624ul, 397ul, 31ul, 2567483615ul, 11ul, 4294967295ul, 7ul, 2636928640ul, 15ul, 4022730752ul, 18ul, 1812433253ul>::_M_gen_rand() | Single | 0.17 | 0.20 | 0.20 | 0.22 | 0.29 | 0.26 | 0.17 | 0.18 | 0.17 | 0.17 | 0.18 | 0.13 | 0.17 | 0.18 | 0.17 | 0.17 | 0.18 | 0.13 | 0.50 | 0.30 | 0.15 | 0.09 | 0.07 | 0.04 | 0.50 | 0.30 | 0.15 | 0.09 | 0.07 | 0.04 | 0.50 | 0.28 | 0.13 | 0.07 | 0.04 | 0.02 | 0.50 | 0.28 | 0.13 | 0.07 | 0.04 | 0.02 | 1 | 2 | 4 | 8 | 16 | 31 | 100 | 50 | 1 | 1 | 2 | 1 | 1.07 | 1.17 | 1.35 | 1.66 | 2.03 | 0 | 0 | 1 | 0 | 0 | 75.00 | 1 | 0 | 0.92 | 0.01 | 0.95 | 0.01 | 0.94 | 0.01 | 0.81 | 0.03 | 0.94 | 0.01 |
| 19 | xy_model - simulation.cpp:26-56 [...] | metropolisHalfSweep(Lattice&, int, int, double, int, int, int) | Outermost | 0.07 | 0.08 | 0.19 | 0.20 | 0.33 | 0.33 | 0.07 | 0.08 | 0.16 | 0.12 | 0.23 | 0.18 | 32.92 | 33.95 | 33.05 | 31.53 | 28.96 | 24.84 | 0.20 | 0.13 | 0.14 | 0.08 | 0.08 | 0.05 | 100.54 | 51.38 | 25.47 | 13.40 | 6.66 | 3.53 | 0.20 | 0.12 | 0.12 | 0.05 | 0.05 | 0.02 | 100.54 | 51.01 | 25.38 | 12.76 | 6.38 | 3.24 | 1 | 2 | 4 | 8 | 16 | 32 | 0 | 7.5 | 1 | 1 | 13 | 1 | 1.09 | 1.21 | 1.62 | 1.51 | 1.88 | NA | NA | NA | NA | NA | 0.00 | 1 | 0 | 0.87 | 0.01 | 0.42 | 0.09 | 0.51 | 0.06 | 0.25 | 0.17 | 0.26 | 0.14 |
| 33 | xy_model - | exchangeHalo(Lattice&, int) | Single | 0.02 | 0.04 | 0.06 | 0.04 | 0.09 | 0.15 | 0.02 | 0.03 | 0.04 | 0.02 | 0.05 | 0.04 | 0.02 | 0.03 | 0.04 | 0.02 | 0.05 | 0.04 | 0.06 | 0.06 | 0.05 | 0.02 | 0.02 | 0.02 | 0.06 | 0.06 | 0.05 | 0.02 | 0.02 | 0.02 | 0.06 | 0.05 | 0.03 | 0.01 | 0.01 | 0.01 | 0.06 | 0.05 | 0.03 | 0.01 | 0.01 | 0.01 | 1 | 2 | 4 | 6 | 12 | 20 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1.33 | 1.33 | 1.5 | 1.41 | 2.42 | 0 | 2 | 0 | 8 | 0 | 60.00 | 1 | 0 | 0.67 | 0.01 | 0.44 | 0.02 | 1 | 0 | 0.35 | 0.03 | 0.36 | 0.03 |
| 35 | xy_model - | exchangeHalo(Lattice&, int) | Single | 0.00 | 0.00 | 0.03 | 0.02 | 0.09 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.03 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.03 | 0.00 | 0.00 | 0.00 | 0.02 | 0.01 | 0.02 | 0.00 | 0.00 | 0.00 | 0.02 | 0.01 | 0.02 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0 | 0 | 2 | 6 | 11 | 0 | 33.33 | 16.67 | 1.24 | 1 | 5.25 | 0 | 0 | 1.43 | 1.5 | 2.32 | 0 | 0 | 2 | 0 | 8 | 0 | 60.00 | | | | | 1 | 0 | 1 | 0 | 1 | 0 | | |
| 34 | xy_model - | exchangeHalo(Lattice&, int) | Single | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.11 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0 | 0 | 0 | 0 | 14 | 0 | 12.5 | 1 | 1 | 8 | 0 | 0 | 0 | 0 | 0 | 1.83 | 0 | 2 | 0 | 8 | 0 | 60.00 | | | | | | | | | | | 1 | 0 |
| 3 | xy_model - main.cpp:76-96 [...] | main | Single | 0.00 | 0.00 | 0.00 | 0.04 | 0.04 | 0.07 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0 | 0 | 0 | 4 | 6 | 8 | 0 | 7.14 | 6.75 | 1 | 14.75 | 0 | 0 | 0 | 1.5 | 1.71 | 1.78 | 1 | 0 | 0 | 0 | 0 | 100.00 | | | | | | | 1 | 0 | 1 | 0 | 1 | 0 |