| ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ○Loop 42 | attention-clang-skl256 | attention_v2.cpp:30-31 | main | Innermost | 9.26 | 9.26 | 32.57 | 1.00 | 1.60 | 13.02 | 1.00 | 1 | 0.00 | 8.33 | 1.12 | 16.00 | 16.00 | 10.00 | 1.23 | 16.00 |
| ○Loop 53 | attention-clang-skl256 | attention_v2.cpp:30-31 | main | Innermost | 7.49 | 7.49 | 26.35 | 1.00 | 1.60 | 13.02 | 1.00 | 1 | 0.00 | 8.33 | 1.10 | 16.00 | 16.00 | 10.00 | 1.23 | 16.00 |
| ○Loop 52 | attention-clang-skl256 | attention_v2.cpp:26-27,attention_v2.cpp:30-30,attention_v2.cpp:33-33 | main | InBetween | 1.97 | 1.96 | 6.91 | 3.33 | 2.78 | 9.84 | 4.17 | 23 | 28.00 | 16.75 | 1.27 | 25.00 | 7.50 | 9.00 | 2.54 | 6.00 |
| ○Loop 31 | attention-clang-skl256 | attention_v2.cpp:30-31 | main | Innermost | 1.90 | 1.90 | 6.68 | 1.00 | 1.78 | 14.22 | 1.00 | 1 | 0.00 | 7.50 | 1.21 | 16.00 | 16.00 | 9.00 | 1.13 | 16.00 |
| ○Loop 32 | attention-clang-skl256 | attention_v2.cpp:30-31 | main | Innermost | 1.85 | 1.85 | 6.51 | 1.00 | 1.78 | 14.22 | 1.00 | 1 | 0.00 | 7.50 | 1.37 | 16.00 | 16.00 | 9.00 | 1.13 | 16.00 |
| ○Loop 33 | attention-clang-skl256 | attention_v2.cpp:30-31 | main | Innermost | 1.77 | 1.76 | 6.21 | 1.00 | 1.78 | 14.22 | 1.00 | 1 | 0.00 | 7.50 | 1.30 | 16.00 | 16.00 | 9.00 | 1.13 | 16.00 |
| ○Loop 38 | attention-clang-skl256 | attention_v2.cpp:55-56 | main | Innermost | 0.68 | 0.68 | 2.39 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 50.00 | 0.00 | 5.00 | 5.00 | 5.00 | 5.00 | 5.00 |
| ○Loop 77 | attention-clang-skl256 | random.tcc:404-409,random.tcc:412-412,random.tcc:420-423,random.tcc:458-458,random.tcc:462-466,random.tcc:3557-3558,attention_v2.cpp:163-163 | main | InBetween | 0.26 | 0.26 | 0.93 | 3.91 | 2.08 | 12.24 | 8.31 | 2 | 0.76 | 9.65 | 0.15 | 16.63 | 4.25 | 7.99 | 1.36 | 2.00 |
| ○Loop 41 | attention-clang-skl256 | attention_v2.cpp:27-27,attention_v2.cpp:30-30,attention_v2.cpp:33-33 | main | InBetween | 0.23 | 0.23 | 0.81 | 3.13 | 2.27 | 12.86 | 3.92 | 23 | 28.00 | 16.66 | 0.91 | 23.50 | 7.50 | 10.36 | 1.83 | 6.00 |
| ○Loop 66 | attention-clang-skl256 | attention_v2.cpp:27-27,attention_v2.cpp:30-30,attention_v2.cpp:33-33 | main | InBetween | 0.19 | 0.19 | 0.67 | 2.41 | 1.00 | 13.59 | 3.82 | 3 | 15.28 | 12.50 | 0.95 | 5.42 | 2.25 | 5.42 | 0.40 | 1.42 |
| ○Loop 63 | attention-clang-skl256 | attention_v2.cpp:27-27,attention_v2.cpp:30-30,attention_v2.cpp:33-33 | main | InBetween | 0.16 | 0.16 | 0.56 | 2.52 | 1.00 | 13.16 | 4.00 | 3 | 10.65 | 12.50 | 0.94 | 5.67 | 2.25 | 5.67 | 0.43 | 1.42 |
| ○Loop 47 | attention-clang-skl256 | attention_v2.cpp:55-56 | main | Innermost | 0.14 | 0.14 | 0.47 | 1.46 | 2.81 | 6.33 | 1.58 | 1 | 33.33 | 13.89 | 0.07 | 4.75 | 3.25 | 1.69 | 0.75 | 3.00 |
| ○Loop 60 | attention-clang-skl256 | attention_v2.cpp:27-27,attention_v2.cpp:30-30,attention_v2.cpp:33-33 | main | InBetween | 0.13 | 0.13 | 0.46 | 2.26 | 1.00 | 12.92 | 3.59 | 3 | 11.50 | 12.50 | 2.08 | 5.08 | 2.25 | 5.08 | 0.39 | 1.42 |
| ○Loop 46 | attention-clang-skl256 | attention_v2.cpp:43-44,attention_v2.cpp:47-47,attention_v2.cpp:52-52,attention_v2.cpp:55-55,attention_v2.cpp:58-61 | main | InBetween | 0.08 | 0.07 | 0.26 | 2.15 | 2.12 | 8.21 | 5.72 | 210 | 41.59 | 18.92 | 3.33 | 45.75 | 21.25 | 21.55 | 5.57 | 8.00 |
| ○Loop 34 | attention-clang-skl256 | attention_v2.cpp:237-238 | main | Innermost | 0.05 | 0.04 | 0.16 | 1.04 | 1.00 | 3.06 | 13.00 | 1 | 100.00 | 37.50 | 0.00 | 13.00 | 12.50 | 13.00 | 4.25 | 1.00 |
| ○Loop 72 | attention-clang-skl256 | random.tcc:404-409,random.tcc:412-412,random.tcc:420-423,random.tcc:458-458,random.tcc:462-466,random.tcc:3557-3558 | main | InBetween | 0.04 | 0.04 | 0.14 | 3.71 | 2.32 | 12.68 | 8.40 | 2 | 0.89 | 9.51 | 0.00 | 15.75 | 4.25 | 6.79 | 1.24 | 1.88 |
| ○Loop 36 | attention-clang-skl256 | attention_v2.cpp:52-53 | main | Innermost | 0.04 | 0.04 | 0.12 | 1.40 | 1.17 | 2.00 | 3.50 | 1 | 100.00 | 50.00 | 0.00 | 14.00 | 10.00 | 12.00 | 7.00 | 4.00 |
| ○Loop 79 | attention-clang-skl256 | random.tcc:412-417 | main | Innermost | 0.02 | 0.02 | 0.07 | 1.00 | 1.00 | 4.77 | 8.25 | 1 | 89.29 | 23.66 | 0.00 | 8.25 | 8.25 | 8.25 | 1.73 | 1.00 |
| ○Loop 68 | attention-clang-skl256 | random.tcc:404-409,random.tcc:412-412,random.tcc:420-423,random.tcc:458-458,random.tcc:462-466,random.tcc:3557-3558,attention_v2.cpp:164-167 | main | InBetween | 0.02 | 0.02 | 0.07 | 4.36 | 1.98 | 11.97 | 13.63 | 8 | 1.47 | 10.02 | 1.50 | 27.25 | 6.25 | 13.76 | 2.28 | 2.00 |
| ○Loop 80 | attention-clang-skl256 | random.h:585-585,random.tcc:333-339 | main | Innermost | 0.01 | 0.02 | 0.05 | 1.00 | 1.00 | 11.00 | 1.00 | 1 | 0.00 | 10.21 | 0.00 | 27.00 | 27.00 | 27.00 | 2.45 | 27.00 |
| ○Loop 35 | attention-clang-skl256 | attention_v2.cpp:47-48 | main | Innermost | 0.01 | 0.02 | 0.05 | 1.00 | 1.00 | 2.00 | 1.00 | 1 | 100.00 | 50.00 | 0.00 | 4.00 | 4.00 | 4.00 | 2.00 | 4.00 |
| ○Loop 71 | attention-clang-skl256 | random.tcc:412-417 | main | Innermost | 0.01 | 0.01 | 0.04 | 1.00 | 1.00 | 4.77 | 8.25 | 1 | 89.29 | 23.66 | 0.00 | 8.25 | 8.25 | 8.25 | 1.73 | 1.00 |
| ○Loop 59 | attention-clang-skl256 | attention_v2.cpp:26-26,attention_v2.cpp:213-213 | main | InBetween | 0.01 | 0.01 | 0.04 | 1.00 | 1.00 | 13.09 | 1.50 | 3 | 0.00 | 10.42 | 0.00 | 2.25 | 2.25 | 2.25 | 0.17 | 1.50 |
| ○Loop 48 | attention-clang-skl256 | attention_v2.cpp:52-53 | main | Innermost | 0.01 | 0.01 | 0.04 | 1.25 | 1.88 | 4.00 | 3.00 | 1 | 100.00 | 25.00 | 4.00 | 3.75 | 3.00 | 1.99 | 0.94 | 1.25 |
| ○Loop 37 | attention-clang-skl256 | attention_v2.cpp:52-53 | main | Innermost | 0.01 | 0.01 | 0.04 | 1.25 | 2.21 | 8.00 | 3.75 | 1 | 57.14 | 16.96 | 0.00 | 3.75 | 3.00 | 1.70 | 0.47 | 1.00 |
| ○Loop 78 | attention-clang-skl256 | random.tcc:404-409 | main | Innermost | 0.00 | 0.00 | 0.02 | 1.00 | 1.00 | 2.00 | 10.75 | 1 | 100.00 | 50.00 | 0.00 | 10.75 | 10.75 | 10.75 | 5.38 | 1.00 |
| ○Loop 69 | attention-clang-skl256 | random.tcc:404-409,random.tcc:412-412,random.tcc:420-423,random.tcc:458-458,random.tcc:462-466,random.tcc:3557-3558 | main | InBetween | 0.00 | 0.00 | 0.02 | 3.71 | 2.32 | 12.68 | 8.40 | 2 | 0.89 | 9.51 | 4.00 | 15.75 | 4.25 | 6.79 | 1.24 | 1.88 |
| ○Loop 62 | attention-clang-skl256 | attention_v2.cpp:26-26,attention_v2.cpp:194-194 | main | InBetween | 0.00 | 0.00 | 0.02 | 1.00 | 1.00 | 13.09 | 1.50 | 3 | 0.00 | 10.42 | 0.00 | 2.25 | 2.25 | 2.25 | 0.17 | 1.50 |
| ○Loop 51 | attention-clang-skl256 | attention_v2.cpp:26-26,attention_v2.cpp:254-254 | main | InBetween | 0.00 | 0.00 | 0.02 | 1.00 | 1.00 | 12.80 | 3.20 | 23 | 0.00 | 9.38 | 2.00 | 4.00 | 4.00 | 4.00 | 0.31 | 1.25 |
| ○Loop 50 | attention-clang-skl256 | attention_v2.cpp:47-48 | main | Innermost | 0.00 | 0.00 | 0.02 | 1.00 | 1.00 | 4.00 | 1.00 | 1 | 100.00 | 25.00 | 0.00 | 4.00 | 4.00 | 4.00 | 1.00 | 4.00 |
| ○Loop 49 | attention-clang-skl256 | attention_v2.cpp:47-48 | main | Innermost | 0.00 | 0.00 | 0.02 | 1.00 | 1.00 | 16.00 | 2.00 | 1 | 0.00 | 6.25 | 0.00 | 1.00 | 1.00 | 1.00 | 0.06 | 0.50 |
| ○Loop 40 | attention-clang-skl256 | attention_v2.cpp:26-26,attention_v2.cpp:292-292 | main | InBetween | 0.00 | 0.00 | 0.02 | 1.00 | 1.00 | 12.36 | 3.40 | 23 | 0.00 | 9.72 | 2.00 | 4.25 | 4.25 | 4.25 | 0.34 | 1.25 |