| ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
| ○Loop 42 | attention-clang-skl512 | attention_v2.cpp:30-31 | main | Innermost | 9.11 | 9.11 | 31.68 | 1.00 | 1.60 | 13.02 | 1.00 | 1 | 0.00 | 8.33 | 1.12 | 16.00 | 16.00 | 10.00 | 1.23 | 16.00 |
| ○Loop 53 | attention-clang-skl512 | attention_v2.cpp:30-31 | main | Innermost | 7.35 | 7.35 | 25.56 | 1.00 | 1.60 | 13.02 | 1.00 | 1 | 0.00 | 8.33 | 1.14 | 16.00 | 16.00 | 10.00 | 1.23 | 16.00 |
| ○Loop 32 | attention-clang-skl512 | attention_v2.cpp:30-31 | main | Innermost | 1.98 | 1.98 | 6.87 | 1.00 | 1.78 | 14.22 | 1.00 | 1 | 0.00 | 7.50 | 1.22 | 16.00 | 16.00 | 9.00 | 1.13 | 16.00 |
| ○Loop 31 | attention-clang-skl512 | attention_v2.cpp:30-31 | main | Innermost | 1.98 | 1.98 | 6.87 | 1.00 | 1.78 | 14.22 | 1.00 | 1 | 0.00 | 7.50 | 1.18 | 16.00 | 16.00 | 9.00 | 1.13 | 16.00 |
| ○Loop 52 | attention-clang-skl512 | attention_v2.cpp:26-27,attention_v2.cpp:30-30,attention_v2.cpp:33-33 | main | InBetween | 1.84 | 1.84 | 6.38 | 3.19 | 1.04 | 1.42 | 4.64 | 23 | 30.77 | 21.88 | 1.31 | 25.50 | 8.00 | 24.56 | 17.97 | 5.50 |
| ○Loop 33 | attention-clang-skl512 | attention_v2.cpp:30-31 | main | Innermost | 1.81 | 1.81 | 6.29 | 1.00 | 1.78 | 14.22 | 1.00 | 1 | 0.00 | 7.50 | 1.34 | 16.00 | 16.00 | 9.00 | 1.13 | 16.00 |
| ○Loop 38 | attention-clang-skl512 | attention_v2.cpp:55-56 | main | Innermost | 0.94 | 0.94 | 3.25 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 50.00 | 0.00 | 5.00 | 5.00 | 5.00 | 5.00 | 5.00 |
| ○Loop 78 | attention-clang-skl512 | random.tcc:404-409,random.tcc:412-412,random.tcc:420-423,random.tcc:458-458,random.tcc:462-466,random.tcc:3557-3558,attention_v2.cpp:163-163 | main | InBetween | 0.28 | 0.28 | 0.97 | 3.91 | 2.08 | 12.24 | 8.31 | 2 | 0.76 | 9.65 | 0.07 | 16.63 | 4.25 | 7.99 | 1.36 | 2.00 |
| ○Loop 41 | attention-clang-skl512 | attention_v2.cpp:27-27,attention_v2.cpp:30-30,attention_v2.cpp:33-33 | main | InBetween | 0.23 | 0.23 | 0.80 | 3.00 | 1.04 | 1.50 | 4.36 | 23 | 30.77 | 21.78 | 1.48 | 24.00 | 8.00 | 23.18 | 16.01 | 5.50 |
| ○Loop 64 | attention-clang-skl512 | attention_v2.cpp:27-27,attention_v2.cpp:30-30,attention_v2.cpp:33-33 | main | InBetween | 0.15 | 0.15 | 0.52 | 2.52 | 1.00 | 13.16 | 4.00 | 3 | 10.65 | 12.50 | 1.27 | 5.67 | 2.25 | 5.67 | 0.43 | 1.42 |
| ○Loop 46 | attention-clang-skl512 | attention_v2.cpp:43-44,attention_v2.cpp:47-47,attention_v2.cpp:52-52,attention_v2.cpp:55-55,attention_v2.cpp:58-61 | main | InBetween | 0.13 | 0.13 | 0.45 | 2.11 | 2.11 | 1.62 | 5.75 | 210 | 43.97 | 22.20 | 1.23 | 46.00 | 21.75 | 21.80 | 28.33 | 8.00 |
| ○Loop 67 | attention-clang-skl512 | attention_v2.cpp:27-27,attention_v2.cpp:30-30,attention_v2.cpp:33-33 | main | InBetween | 0.13 | 0.12 | 0.43 | 2.41 | 1.00 | 13.59 | 3.82 | 3 | 15.28 | 12.50 | 2.08 | 5.42 | 2.25 | 5.42 | 0.40 | 1.42 |
| ○Loop 61 | attention-clang-skl512 | attention_v2.cpp:27-27,attention_v2.cpp:30-30,attention_v2.cpp:33-33 | main | InBetween | 0.11 | 0.11 | 0.38 | 2.26 | 1.00 | 12.92 | 3.59 | 3 | 11.50 | 12.50 | 1.55 | 5.08 | 2.25 | 5.08 | 0.39 | 1.42 |
| ○Loop 47 | attention-clang-skl512 | attention_v2.cpp:55-56 | main | Innermost | 0.07 | 0.07 | 0.24 | 1.46 | 2.81 | 6.33 | 1.58 | 1 | 33.33 | 13.89 | 0.43 | 4.75 | 3.25 | 1.69 | 0.75 | 3.00 |
| ○Loop 80 | attention-clang-skl512 | random.tcc:412-417 | main | Innermost | 0.05 | 0.05 | 0.17 | 1.00 | 1.00 | 4.77 | 8.25 | 1 | 89.29 | 23.66 | 0.00 | 8.25 | 8.25 | 8.25 | 1.73 | 1.00 |
| ○Loop 73 | attention-clang-skl512 | random.tcc:404-409,random.tcc:412-412,random.tcc:420-423,random.tcc:458-458,random.tcc:462-466,random.tcc:3557-3558 | main | InBetween | 0.04 | 0.04 | 0.16 | 3.71 | 2.32 | 12.68 | 8.40 | 2 | 0.89 | 9.51 | 0.00 | 15.75 | 4.25 | 6.79 | 1.24 | 1.88 |
| ○Loop 69 | attention-clang-skl512 | random.tcc:404-409,random.tcc:412-412,random.tcc:420-423,random.tcc:458-458,random.tcc:462-466,random.tcc:3557-3558,attention_v2.cpp:164-167 | main | InBetween | 0.04 | 0.04 | 0.12 | 4.36 | 1.98 | 11.97 | 13.63 | 8 | 1.47 | 10.02 | 0.29 | 27.25 | 6.25 | 13.76 | 2.28 | 2.00 |
| ○Loop 34 | attention-clang-skl512 | attention_v2.cpp:237-238 | main | Innermost | 0.04 | 0.04 | 0.12 | 1.03 | 1.00 | 1.18 | 17.00 | 1 | 100.00 | 75.00 | 0.00 | 17.00 | 16.50 | 17.00 | 14.38 | 1.00 |
| ○Loop 81 | attention-clang-skl512 | random.h:585-585,random.tcc:333-339 | main | Innermost | 0.03 | 0.03 | 0.10 | 1.00 | 1.00 | 11.00 | 1.00 | 1 | 0.00 | 10.21 | 0.00 | 27.00 | 27.00 | 27.00 | 2.45 | 27.00 |
| ○Loop 49 | attention-clang-skl512 | attention_v2.cpp:47-48 | main | Innermost | 0.02 | 0.03 | 0.09 | 1.00 | 1.00 | 16.00 | 2.00 | 1 | 0.00 | 6.25 | 0.00 | 1.00 | 1.00 | 1.00 | 0.06 | 0.50 |
| ○Loop 70 | attention-clang-skl512 | random.tcc:404-409,random.tcc:412-412,random.tcc:420-423,random.tcc:458-458,random.tcc:462-466,random.tcc:3557-3558 | main | InBetween | 0.02 | 0.02 | 0.07 | 3.71 | 2.32 | 12.68 | 8.40 | 2 | 0.89 | 9.51 | 0.00 | 15.75 | 4.25 | 6.79 | 1.24 | 1.88 |
| ○Loop 50 | attention-clang-skl512 | attention_v2.cpp:47-48 | main | Innermost | 0.02 | 0.02 | 0.07 | 1.00 | 1.00 | 2.00 | 1.00 | 1 | 100.00 | 50.00 | 0.00 | 4.00 | 4.00 | 4.00 | 2.00 | 4.00 |
| ○Loop 36 | attention-clang-skl512 | attention_v2.cpp:52-53 | main | Innermost | 0.02 | 0.02 | 0.07 | 1.40 | 1.17 | 2.00 | 3.50 | 1 | 100.00 | 50.00 | 0.00 | 14.00 | 10.00 | 12.00 | 7.00 | 4.00 |
| ○Loop 35 | attention-clang-skl512 | attention_v2.cpp:47-48 | main | Innermost | 0.02 | 0.01 | 0.05 | 1.00 | 1.00 | 1.00 | 1.00 | 1 | 100.00 | 100.00 | 0.00 | 4.00 | 4.00 | 4.00 | 4.00 | 4.00 |
| ○Loop 79 | attention-clang-skl512 | random.tcc:404-409 | main | Innermost | 0.00 | 0.00 | 0.02 | 1.00 | 1.00 | 1.00 | 14.00 | 1 | 100.00 | 100.00 | 0.00 | 14.00 | 14.00 | 14.00 | 14.00 | 1.00 |
| ○Loop 72 | attention-clang-skl512 | random.tcc:412-417 | main | Innermost | 0.00 | 0.00 | 0.02 | 1.00 | 1.00 | 4.77 | 8.25 | 1 | 89.29 | 23.66 | 0.00 | 8.25 | 8.25 | 8.25 | 1.73 | 1.00 |
| ○Loop 57 | attention-clang-skl512 | attention_v2.cpp:236-237 | main | InBetween | 0.00 | 0.00 | 0.02 | 2.37 | 1.00 | 1.58 | 3.75 | 7 | 17.86 | 24.78 | 0.00 | 11.25 | 4.75 | 11.25 | 7.13 | 3.00 |
| ○Loop 48 | attention-clang-skl512 | attention_v2.cpp:52-53 | main | Innermost | 0.00 | 0.00 | 0.02 | 1.27 | 1.80 | 4.00 | 3.50 | 1 | 100.00 | 25.00 | 0.00 | 3.50 | 2.75 | 1.95 | 0.88 | 1.00 |
| ○Loop 39 | attention-clang-skl512 | stl_construct.h:133-133,stl_vector.h:375-375,stl_vector.h:389-389,stl_vector.h:397-397,stl_vector.h:409-409,stl_vector.h:2205-2205,stl_uninitialized.h:946-946,random.h:2371-2371,random.h:2377-2377,random.h:2393-2393,random.tcc:330-330,random.tcc:342-342,ostream.h:739-739,stl_algobase.h:923-923,stl_algobase.h:1147-1147,chrono.h:218-218,chrono.h:631-631,chrono.h:720-720,attention_v2.cpp:43-43,attention_v2.cpp:144-144,attention_v2.cpp:160-160,attention_v2.cpp:163-164,attention_v2.cpp:175-175,attention_v2.cpp:185-185,attention_v2.cpp:194-194,attention_v2.cpp:204-204,attention_v2.cpp:213-213,attention_v2.cpp:223-223,attention_v2.cpp:233-233,attention_v2.cpp:245-245,attention_v2.cpp:254-254,attention_v2.cpp:264-264,attention_v2.cpp:273-273,attention_v2.cpp:283-283,attention_v2.cpp:292-292,attention_v2.cpp:302-302,attention_v2.cpp:306-307,attention_v2.cpp:310-310,new_allocator.h:162-162,new_allocator.h:183-183,char_traits.h:355-355,char_traits.h:396-396,char_traits.h:432-432,basic_string.h:204-204,basic_string.h:233-233,basic_string.h:238-238,basic_string.h:279-279,basic_string.h:298-298,basic_string.h:304-304 | main | Outermost | 0.00 | 0.00 | 0.02 | 5.87 | 1.00 | 17.89 | 8.64 | 1000012 | 4.46 | 10.95 | 0.00 | 151.25 | 25.75 | 151.25 | 8.45 | 17.50 |
| ○Loop 37 | attention-clang-skl512 | attention_v2.cpp:52-53 | main | Innermost | 0.00 | 0.00 | 0.02 | 1.25 | 2.21 | 8.00 | 3.75 | 1 | 57.14 | 16.96 | 0.00 | 3.75 | 3.00 | 1.70 | 0.47 | 1.00 |