| ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ○Loop 29 | attention-gnr-512 | attention_v2.cpp:27-27,attention_v2.cpp:30-33,attention_v2.cpp:98-98 | main | InBetween | 1.20 | 1.20 | 22.00 | 1.07 | 1.08 | 1.72 | 3.73 | 3 | 63.33 | 36.33 | 9.33 | 8.72 | 8.68 | 5.44 | 2.50 |
| ○Loop 30 | attention-gnr-512 | attention_v2.cpp:30-31 | main | Innermost | 1.00 | 1.00 | 18.33 | 1.11 | 1.00 | 1.75 | 1.75 | 1 | 85.71 | 43.75 | 7.00 | 6.33 | 7.00 | 4.00 | 4.00 |
| ○Loop 22 | attention-gnr-512 | attention_v2.cpp:30-31 | main | Innermost | 0.99 | 1.00 | 18.24 | 1.07 | 1.00 | 1.79 | 1.29 | 1 | 90.00 | 48.13 | 5.17 | 4.83 | 5.17 | 2.89 | 4.00 |
| ○Loop 21 | attention-gnr-512 | attention_v2.cpp:27-27,attention_v2.cpp:30-33,attention_v2.cpp:98-98 | main | InBetween | 0.32 | 0.32 | 5.87 | 1.06 | 1.05 | 1.71 | 4.50 | 3 | 66.03 | 37.34 | 9.00 | 8.50 | 8.61 | 5.27 | 2.00 |
| ○Loop 41 | attention-gnr-512 | attention_v2.cpp:30-31 | main | Innermost | 0.30 | 0.30 | 5.50 | 1.11 | 1.00 | 1.75 | 1.75 | 1 | 85.71 | 43.75 | 7.00 | 6.33 | 7.00 | 4.00 | 4.00 |
| ○Loop 40 | attention-gnr-512 | attention_v2.cpp:27-27,attention_v2.cpp:30-33,attention_v2.cpp:98-98 | main | InBetween | 0.29 | 0.29 | 5.32 | 1.06 | 1.05 | 1.72 | 4.50 | 3 | 69.07 | 38.48 | 9.00 | 8.50 | 8.61 | 5.23 | 2.00 |
| ○Loop 44 | attention-gnr-512 | attention_v2.cpp:30-31 | main | Innermost | 0.28 | 0.28 | 5.13 | 1.11 | 1.00 | 1.75 | 1.75 | 1 | 85.71 | 43.75 | 7.00 | 6.33 | 7.00 | 4.00 | 4.00 |
| ○Loop 37 | attention-gnr-512 | attention_v2.cpp:27-27,attention_v2.cpp:30-33,attention_v2.cpp:98-98 | main | InBetween | 0.26 | 0.26 | 4.77 | 1.06 | 1.05 | 1.72 | 4.50 | 3 | 69.07 | 38.48 | 9.00 | 8.50 | 8.61 | 5.23 | 2.00 |
| ○Loop 38 | attention-gnr-512 | attention_v2.cpp:30-31 | main | Innermost | 0.22 | 0.22 | 4.12 | 1.11 | 1.00 | 1.75 | 1.75 | 1 | 85.71 | 43.75 | 7.00 | 6.33 | 7.00 | 4.00 | 4.00 |
| ○Loop 43 | attention-gnr-512 | attention_v2.cpp:27-27,attention_v2.cpp:30-33,attention_v2.cpp:98-98 | main | InBetween | 0.21 | 0.21 | 3.85 | 1.06 | 1.05 | 1.72 | 4.50 | 3 | 69.07 | 38.48 | 9.00 | 8.50 | 8.61 | 5.23 | 2.00 |
| ○Loop 52 | attention-gnr-512 | cmath:1661-1661,random.tcc:404-406,random.tcc:412-414,random.tcc:417-420,random.tcc:455-455,random.tcc:459-463,random.tcc:3366-3366,random.tcc:3369-3370,attention_v2.cpp:163-163 | main | InBetween | 0.08 | 0.08 | 1.47 | 2.13 | 1.91 | 7.53 | 9.58 | 5 | 33.33 | 18.95 | 19.17 | 9.00 | 10.01 | 2.54 | 2.00 |
| ○Loop 27 | attention-gnr-512 | attention_v2.cpp:47-48 | main | Innermost | 0.06 | 0.06 | 1.10 | 1.00 | 1.00 | 16.00 | 1.00 | 1 | 0.00 | 6.25 | 32.00 | 32.00 | 32.00 | 2.00 | 32.00 |
| ○Loop 47 | attention-gnr-512 | cmath:1661-1661,random.tcc:404-406,random.tcc:412-414,random.tcc:417-420,random.tcc:455-455,random.tcc:459-463,random.tcc:3366-3366,random.tcc:3369-3370,attention_v2.cpp:164-167 | main | InBetween | 0.04 | 0.04 | 0.64 | 2.15 | 1.92 | 7.58 | 11.60 | 85 | 33.16 | 18.85 | 58.00 | 27.00 | 30.21 | 7.65 | 5.00 |
| ○Loop 23 | attention-gnr-512 | attention_v2.cpp:43-44,attention_v2.cpp:47-47,attention_v2.cpp:52-61,attention_v2.cpp:98-98,attention_v2.cpp:273-273 | main | InBetween | 0.03 | 0.03 | 0.46 | 1.69 | 1.02 | 1.32 | 4.50 | 72 | 41.54 | 37.31 | 27.00 | 16.00 | 26.42 | 20.40 | 6.00 |
| ○Loop 18 | attention-gnr-512 | random.h:140-140,random.tcc:330-336 | main | Innermost | 0.03 | 0.03 | 0.46 | 1.00 | 1.00 | 16.00 | 35.20 | 1 | 0.00 | 9.13 | 17.60 | 17.60 | 17.60 | 1.10 | 0.50 |
| ○Loop 75 | attention-gnr-512 | __intel_avx_rep_memset | Single | 0.03 | 0.03 | 0.46 | 1.00 | 1.00 | 2.00 | 4.00 | 1 | 100.00 | 50.00 | 4.00 | 4.00 | 4.00 | 2.00 | 1.00 | |
| ○Loop 33 | attention-gnr-512 | attention_v2.cpp:237-238 | main | Innermost | 0.01 | 0.02 | 0.27 | 1.03 | 1.00 | 1.22 | 10.17 | 1 | 90.91 | 73.86 | 10.17 | 9.83 | 10.17 | 8.34 | 1.00 |
| ○Loop 25 | attention-gnr-512 | attention_v2.cpp:52-53 | main | Innermost | 0.01 | 0.01 | 0.18 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 100.00 | 3.00 | 3.00 | 3.00 | 3.00 | 3.00 |
| ○Loop 53 | attention-gnr-512 | random.tcc:401-406 | main | Innermost | 0.00 | 0.00 | 0.09 | 1.00 | 1.00 | 2.29 | 3.33 | 1 | 84.62 | 44.23 | 3.33 | 3.33 | 3.33 | 1.46 | 1.00 |
| ○Loop 26 | attention-gnr-512 | attention_v2.cpp:47-48 | main | Innermost | 0.00 | 0.00 | 0.09 | 1.00 | 1.00 | 16.00 | 1.00 | 1 | 0.00 | 6.25 | 4.00 | 4.00 | 4.00 | 0.25 | 4.00 |