| ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ○Loop 16 | attention-gcc-gnr-256 | attention_v2.cpp:30-31 | main | Innermost | 3.03 | 3.03 | 30.44 | 1.00 | 2.58 | 13.33 | 1.00 | 1 | 0.00 | 8.75 | 3.00 | 3.00 | 1.16 | 0.23 | 3.00 |
| ○Loop 13 | attention-gcc-gnr-256 | attention_v2.cpp:30-31 | main | Innermost | 2.90 | 2.90 | 29.13 | 1.00 | 2.58 | 13.33 | 1.00 | 1 | 0.00 | 8.75 | 3.00 | 3.00 | 1.16 | 0.23 | 3.00 |
| ○Loop 2 | attention-gcc-gnr-256 | attention_v2.cpp:30-31 | main | Innermost | 0.73 | 0.73 | 7.28 | 1.00 | 2.58 | 13.33 | 1.00 | 1 | 0.00 | 8.75 | 3.00 | 3.00 | 1.16 | 0.23 | 3.00 |
| ○Loop 28 | attention-gcc-gnr-256 | attention_v2.cpp:55-56 | softmax(float const*, float*, float*, int) | Innermost | 0.63 | 0.63 | 6.33 | 1.00 | 3.00 | 4.00 | 1.00 | 1 | 0.00 | 6.25 | 3.00 | 3.00 | 1.00 | 0.75 | 3.00 |
| ○Loop 8 | attention-gcc-gnr-256 | attention_v2.cpp:30-31 | main | Innermost | 0.61 | 0.61 | 6.08 | 1.00 | 2.58 | 13.33 | 1.00 | 1 | 0.00 | 8.75 | 3.00 | 3.00 | 1.16 | 0.23 | 3.00 |
| ○Loop 5 | attention-gcc-gnr-256 | attention_v2.cpp:30-31 | main | Innermost | 0.60 | 0.60 | 5.98 | 1.00 | 2.58 | 13.33 | 1.00 | 1 | 0.00 | 8.75 | 3.00 | 3.00 | 1.16 | 0.23 | 3.00 |
| ○Loop 14 | attention-gcc-gnr-256 | attention_v2.cpp:27-30,attention_v2.cpp:33-33,attention_v2.cpp:236-236 | main | InBetween | 0.31 | 0.31 | 3.06 | 1.86 | 2.17 | 14.86 | 2.17 | 1 | 20.00 | 11.25 | 2.17 | 1.17 | 1.00 | 0.15 | 1.00 |
| ○Loop 17 | attention-gcc-gnr-256 | attention_v2.cpp:26-30,attention_v2.cpp:33-33 | main | InBetween | 0.12 | 0.12 | 1.21 | 1.67 | 1.00 | 13.33 | 1.67 | 1 | 25.00 | 12.50 | 1.67 | 1.00 | 1.67 | 0.13 | 1.00 |
| ○Loop 3 | attention-gcc-gnr-256 | random.tcc:3374-3374,attention_v2.cpp:27-30,attention_v2.cpp:33-33 | main | InBetween | 0.09 | 0.09 | 0.85 | 2.00 | 1.00 | 14.77 | 2.00 | 1 | 25.00 | 12.50 | 2.00 | 1.00 | 2.00 | 0.14 | 1.00 |
| ○Loop 22 | attention-gcc-gnr-256 | random.tcc:458-466,random.tcc:3367-3374,attention_v2.cpp:163-163 | main | Innermost | 0.06 | 0.06 | 0.55 | 2.06 | 2.13 | 11.47 | 4.11 | 2 | 9.13 | 11.53 | 6.17 | 3.00 | 2.90 | 0.54 | 1.50 |
| ○Loop 21 | attention-gcc-gnr-256 | random.tcc:458-466,random.tcc:3367-3374,attention_v2.cpp:164-167 | main | Innermost | 0.06 | 0.06 | 0.55 | 1.98 | 2.77 | 13.64 | 4.96 | 8 | 18.18 | 11.36 | 19.83 | 10.00 | 7.16 | 1.45 | 4.00 |
| ○Loop 6 | attention-gcc-gnr-256 | attention_v2.cpp:26-30,attention_v2.cpp:33-33 | main | InBetween | 0.05 | 0.05 | 0.45 | 2.33 | 1.00 | 14.93 | 2.33 | 1 | 25.00 | 12.50 | 2.33 | 1.00 | 2.33 | 0.16 | 1.00 |
| ○Loop 27 | attention-gcc-gnr-256 | attention_v2.cpp:52-53 | softmax(float const*, float*, float*, int) | Innermost | 0.04 | 0.04 | 0.40 | 1.33 | 1.33 | 16.00 | 1.33 | 1 | 0.00 | 6.25 | 1.33 | 1.00 | 1.00 | 0.08 | 1.00 |
| ○Loop 9 | attention-gcc-gnr-256 | attention_v2.cpp:26-30,attention_v2.cpp:33-33 | main | InBetween | 0.04 | 0.04 | 0.35 | 1.50 | 1.00 | 12.00 | 1.50 | 1 | 25.00 | 12.50 | 1.50 | 1.00 | 1.50 | 0.13 | 1.00 |
| ○Loop 11 | attention-gcc-gnr-256 | attention_v2.cpp:237-238 | main | Innermost | 0.01 | 0.01 | 0.10 | 1.00 | 1.00 | 16.00 | 1.00 | 1 | 0.00 | 6.25 | 1.00 | 1.00 | 1.00 | 0.06 | 1.00 |
| ○Loop 31 | attention-gcc-gnr-256 | attention_v2.cpp:47-48 | softmax(float const*, float*, float*, int) | InBetween | 0.01 | 0.01 | 0.10 | 1.44 | 1.00 | 10.11 | 3.00 | 1 | 43.75 | 13.67 | 6.00 | 4.17 | 6.00 | 0.59 | 2.00 |
| ○Loop 38 | attention-gcc-gnr-256 | random.tcc:412-417 | std::mersenne_twister_engine | Single | 0.01 | 0.01 | 0.10 | 1.00 | 1.00 | 2.00 | 2.67 | 1 | 100.00 | 50.00 | 2.67 | 2.67 | 2.67 | 1.33 | 1.00 |
| ○Loop 18 | attention-gcc-gnr-256 | attention_v2.cpp:26-27 | main | InBetween | 0.00 | 0.01 | 0.05 | 1.00 | 1.00 | 16.00 | 1.00 | 1 | 0.00 | 8.33 | 2.00 | 2.00 | 2.00 | 0.13 | 2.00 |
| ○Loop 30 | attention-gcc-gnr-256 | attention_v2.cpp:43-43,attention_v2.cpp:46-47,attention_v2.cpp:50-52,attention_v2.cpp:58-61 | softmax(float const*, float*, float*, int) | InBetween | 0.00 | 0.01 | 0.05 | 3.69 | 1.00 | 10.80 | 6.56 | 1 | 21.88 | 12.30 | 9.83 | 2.67 | 9.83 | 0.91 | 1.50 |
| ○Loop 4 | attention-gcc-gnr-256 | attention_v2.cpp:26-27 | main | InBetween | 0.00 | 0.01 | 0.05 | 1.00 | 1.00 | 16.00 | 1.00 | 1 | 0.00 | 6.25 | 1.00 | 1.00 | 1.00 | 0.06 | 1.00 |
| ○Loop 29 | attention-gcc-gnr-256 | attention_v2.cpp:47-48 | softmax(float const*, float*, float*, int) | Innermost | 0.00 | 0.01 | 0.05 | 1.00 | 1.00 | 2.00 | 1.00 | 1 | 100.00 | 50.00 | 4.00 | 4.00 | 4.00 | 2.00 | 4.00 |
| ○Loop 1 | attention-gcc-gnr-256 | random.h:248-248,random.tcc:333-339 | main | Innermost | 0.00 | 0.01 | 0.05 | 1.00 | 1.00 | 8.00 | 1.00 | 1 | 0.00 | 12.50 | 6.00 - 10.00 | 6.00 - 10.00 | 6.00 - 10.00 | 0.75 - 1.25 | 6.00 - 10.00 |