| ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ○Loop 27 | attention-avx512 | attention.cpp:30-31 | main | Innermost | 3.85 | 3.85 | 40.96 | 1.00 | 1.91 | 13.54 | 1.00 | 1 | 0.00 | 8.33 | 2.70 | 4.00 | 4.00 | 2.09 | 0.30 | 4.00 |
| ○Loop 43 | attention-avx512 | attention.cpp:30-31 | main | Innermost | 1.95 | 1.95 | 20.74 | 1.00 | 1.91 | 13.54 | 1.00 | 1 | 0.00 | 8.33 | 5.33 | 4.00 | 4.00 | 2.09 | 0.30 | 4.00 |
| ○Loop 52 | attention-avx512 | attention.cpp:30-31 | main | Innermost | 0.70 | 0.70 | 7.50 | 1.00 | 1.91 | 13.54 | 1.00 | 1 | 0.00 | 8.33 | 3.69 | 4.00 | 4.00 | 2.09 | 0.30 | 4.00 |
| ○Loop 62 | attention-avx512 | attention.cpp:30-31 | main | Innermost | 0.69 | 0.68 | 7.29 | 1.00 | 1.91 | 13.54 | 1.00 | 1 | 0.00 | 8.33 | 3.77 | 4.00 | 4.00 | 2.09 | 0.30 | 4.00 |
| ○Loop 57 | attention-avx512 | attention.cpp:30-31 | main | Innermost | 0.68 | 0.68 | 7.23 | 1.00 | 1.91 | 13.54 | 1.00 | 1 | 0.00 | 8.33 | 3.84 | 4.00 | 4.00 | 2.09 | 0.30 | 4.00 |
| ○Loop 44 | attention-avx512 | attention.cpp:26-27,attention.cpp:30-30,attention.cpp:33-33 | main | InBetween | 0.53 | 0.53 | 5.69 | 1.30 | 2.96 | 6.00 | 1.43 | 4 | 31.97 | 16.94 | 0.40 | 3.56 | 2.75 | 1.20 | 0.59 | 2.50 |
| ○Loop 35 | attention-avx512 | attention.cpp:55-56 | main | Innermost | 0.11 | 0.12 | 1.22 | 1.38 | 1.18 | 4.51 | 4.00 | 1 | 81.08 | 25.00 | 2.59 | 10.00 | 7.25 | 8.50 | 2.22 | 2.50 |
| ○Loop 33 | attention-avx512 | attention.cpp:52-53 | main | Innermost | 0.05 | 0.04 | 0.48 | 1.31 | 1.14 | 4.20 | 9.00 | 1 | 83.66 | 26.55 | 2.78 | 36.00 | 27.50 | 31.63 | 8.58 | 4.00 |
| ○Loop 65 | attention-avx512 | attention.cpp:26-27,attention.cpp:30-30,attention.cpp:33-33 | main | InBetween | 0.04 | 0.04 | 0.43 | 1.55 | 1.71 | 5.61 | 1.90 | 4 | 41.66 | 18.26 | 0.75 | 3.25 | 2.09 | 1.90 | 0.58 | 1.71 |
| ○Loop 30 | attention-avx512 | attention.cpp:26-27,attention.cpp:30-30,attention.cpp:33-33 | main | InBetween | 0.04 | 0.04 | 0.43 | 1.73 | 1.67 | 6.02 | 2.12 | 4 | 31.75 | 16.77 | 1.88 | 3.63 | 2.09 | 2.18 | 0.60 | 1.71 |
| ○Loop 58 | attention-avx512 | attention.cpp:26-27,attention.cpp:30-30,attention.cpp:33-33 | main | InBetween | 0.04 | 0.04 | 0.37 | 1.60 | 1.75 | 5.38 | 1.86 | 4 | 38.80 | 18.26 | 0.07 | 3.09 | 1.94 | 1.77 | 0.57 | 1.67 |
| ○Loop 55 | attention-avx512 | attention.cpp:26-27,attention.cpp:30-30,attention.cpp:33-33 | main | InBetween | 0.02 | 0.02 | 0.21 | 1.55 | 1.71 | 5.61 | 1.90 | 4 | 41.66 | 18.26 | 0.50 | 3.25 | 2.09 | 1.90 | 0.58 | 1.71 |
| ○Loop 48 | attention-avx512 | stl_vector.h:1046-1046,attention.cpp:240-241 | main | Innermost | 0.01 | 0.01 | 0.11 | 1.00 | 1.00 | 7.31 | 8.00 | 1 | 38.46 | 21.15 | 0.00 | 8.00 | 8.00 | 8.00 | 1.09 | 1.00 |
| ○Loop 40 | attention-avx512 | attention.cpp:52-53 | main | Innermost | 0.01 | 0.01 | 0.11 | 1.33 | 1.29 | 5.33 | 5.00 | 1 | 84.21 | 21.38 | 5.00 | 5.00 | 3.75 | 3.88 | 0.94 | 1.00 |
| ○Loop 39 | attention-avx512 | attention.cpp:55-56 | main | Innermost | 0.01 | 0.01 | 0.11 | 1.00 | 2.39 | 12.31 | 1.00 | 1 | 42.86 | 14.29 | 1.00 | 2.50 | 2.50 | 1.05 | 0.20 | 2.50 |
| ○Loop 37 | attention-avx512 | attention.cpp:43-44,attention.cpp:47-47,attention.cpp:52-52,attention.cpp:55-55,attention.cpp:58-61,attention.cpp:284-284 | main | InBetween | 0.01 | 0.01 | 0.11 | 2.14 | 2.01 | 7.70 | 4.41 | 490 | 44.44 | 20.09 | 5.00 | 23.50 | 11.00 | 11.67 | 3.05 | 5.33 |
| ○Loop 42 | attention-avx512 | attention.cpp:47-48 | main | Innermost | 0.00 | 0.00 | 0.05 | 1.00 | 1.00 | 4.00 | 1.00 | 1 | 100.00 | 25.00 | 0.00 | 2.00 | 2.00 | 2.00 | 0.50 | 2.00 |
| ○Loop 41 | attention-avx512 | attention.cpp:47-48 | main | Innermost | 0.00 | 0.00 | 0.05 | 1.00 | 1.00 | 16.00 | 1.50 | 1 | 0.00 | 6.25 | 0.00 | 0.50 | 0.50 | 0.50 | 0.03 | 0.33 |
| ○Loop 38 | attention-avx512 | attention.cpp:55-56 | main | Innermost | 0.00 | 0.00 | 0.05 | 1.44 | 1.35 | 5.75 | 2.30 | 1 | 78.95 | 20.72 | 2.50 | 5.75 | 4.00 | 4.25 | 1.00 | 2.50 |
| ○Loop 34 | attention-avx512 | attention.cpp:52-53 | main | Innermost | 0.00 | 0.00 | 0.05 | 1.33 | 1.94 | 8.53 | 2.00 | 1 | 57.14 | 16.96 | 2.00 | 2.00 | 1.50 | 1.03 | 0.23 | 1.00 |
| ○Loop 29 | attention-avx512 | attention.cpp:26-26,attention.cpp:306-306 | main | InBetween | 0.00 | 0.00 | 0.05 | 1.00 | 1.00 | 13.63 | 5.75 | 4 | 0.00 | 8.85 | 2.50 | 2.88 | 2.88 | 2.88 | 0.21 | 0.50 |