| ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Loop 43 | attention-avx512 | attention.cpp:30-31 | main | Innermost | 1.14 | 1.14 | 31.80 | 1.13 | 1.00 | 1.50 | 6.75 | 1 | 96.08 | 51.72 | 7.77 | 27.00 | 24.00 | 27.00 | 18.00 | 4.00 |
| Loop 30 | attention-avx512 | attention.cpp:30-31 | main | Innermost | 1.08 | 1.08 | 29.99 | 1.14 | 1.00 | 1.52 | 6.25 | 1 | 97.83 | 53.40 | 9.39 | 25.00 | 22.00 | 25.00 | 16.50 | 4.00 |
| Loop 56 | attention-avx512 | attention.cpp:30-31 | main | Innermost | 0.33 | 0.33 | 9.21 | 1.11 | 1.00 | 1.50 | 6.75 | 1 | 96.43 | 51.56 | 6.86 | 27.00 | 24.33 | 27.00 | 18.00 | 4.00 |
| Loop 63 | attention-avx512 | attention.cpp:30-31 | main | Innermost | 0.31 | 0.31 | 8.79 | 1.11 | 1.00 | 1.50 | 6.75 | 1 | 96.00 | 51.75 | 7.11 | 27.00 | 24.33 | 27.00 | 18.00 | 4.00 |
| Loop 70 | attention-avx512 | attention.cpp:30-31 | main | Innermost | 0.31 | 0.31 | 8.65 | 1.11 | 1.00 | 1.50 | 6.75 | 1 | 96.00 | 51.75 | 7.26 | 27.00 | 24.33 | 27.00 | 18.00 | 4.00 |
| Loop 42 | attention-avx512 | attention.cpp:27-27,attention.cpp:30-33 | main | InBetween | 0.18 | 0.18 | 5.02 | 1.06 | 1.18 | 3.35 | 2.29 | 3 | 47.47 | 22.10 | 19.84 | 27.83 | 26.17 | 23.67 | 8.30 | 12.17 |
| Loop 36 | attention-avx512 | attention.cpp:47-48 | main | Innermost | 0.06 | 0.06 | 1.81 | 1.00 | 1.00 | 16.00 | 1.00 | 1 | 0.00 | 6.25 | 7.04 | 32.00 | 32.00 | 32.00 | 2.00 | 32.00 |
| Loop 69 | attention-avx512 | attention.cpp:27-27,attention.cpp:30-33 | main | InBetween | 0.04 | 0.04 | 0.98 | 1.06 | 1.12 | 3.22 | 2.97 | 3 | 53.32 | 24.66 | 23.06 | 25.17 | 23.83 | 22.50 | 7.81 | 8.47 |
| Loop 62 | attention-avx512 | attention.cpp:27-27,attention.cpp:30-33 | main | InBetween | 0.04 | 0.04 | 0.98 | 1.06 | 1.12 | 3.21 | 2.97 | 3 | 52.83 | 24.58 | 23.37 | 25.17 | 23.83 | 22.50 | 7.83 | 8.47 |
| Loop 33 | attention-avx512 | attention.cpp:43-44,attention.cpp:47-47,attention.cpp:52-61,attention.cpp:98-98,attention.cpp:284-284 | main | InBetween | 0.03 | 0.03 | 0.84 | 1.52 | 1.02 | 1.40 | 4.81 | 72 | 37.65 | 31.62 | 44.97 | 28.83 | 19.00 | 28.25 | 20.56 | 6.00 |
| Loop 29 | attention-avx512 | attention.cpp:27-27,attention.cpp:30-33 | main | InBetween | 0.03 | 0.03 | 0.70 | 1.06 | 1.12 | 3.20 | 2.97 | 3 | 52.69 | 24.07 | 33.60 | 25.17 | 23.83 | 22.50 | 7.85 | 8.47 |
| Loop 55 | attention-avx512 | attention.cpp:27-27,attention.cpp:30-33 | main | InBetween | 0.02 | 0.02 | 0.42 | 1.06 | 1.10 | 3.18 | 3.01 | 3 | 54.39 | 24.63 | 53.10 | 25.50 | 24.17 | 23.17 | 8.01 | 8.47 |
| Loop 34 | attention-avx512 | attention.cpp:55-56 | main | Innermost | 0.02 | 0.02 | 0.42 | 1.50 | 1.00 | 1.00 | 1.50 | 1 | 100.00 | 100.00 | 10.67 | 1.50 | 1.00 | 1.50 | 1.50 | 1.00 |
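
In the rows above, each "Speedup if ..." value appears to equal the baseline "CQA cycles" divided by the matching "CQA cycles if ..." estimate, rounded to two decimals. This relationship is inferred from the table, not stated by the report, so treat the following as a minimal sketch for sanity-checking the columns. The helper name `speedup` is hypothetical; the numbers are copied from Loop 43, Loop 30, and Loop 36.

```python
def speedup(cycles: float, cycles_if: float) -> float:
    """Baseline CQA cycles over a what-if CQA cycle estimate, rounded to 2 decimals.

    Assumption: this ratio is how the speedup columns relate to the cycle
    columns; it matches the table rows but is not confirmed by the report.
    """
    return round(cycles / cycles_if, 2)


# (baseline cycles, cycles if fully vectorized, cycles if FP only), from the table
rows = {
    "Loop 43": (27.00, 18.00, 4.00),
    "Loop 30": (25.00, 16.50, 4.00),
    "Loop 36": (32.00, 2.00, 32.00),
}

for name, (base, full_vec, fp_only) in rows.items():
    print(name, speedup(base, full_vec), speedup(base, fp_only))
```

Read this way, Loop 36 (0% vectorization ratio) has the largest projected gain from full vectorization, while the five hottest innermost loops at attention.cpp:30-31 are already about 96 to 98% vectorized and show their largest projected gain in the "FP only" column.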