| Name | Module | Max Thread Time / Walltime run_0 (%) | Coverage run_0 (%) | Coverage Excluding Loops run_0 (%) | Max Inclusive Time Over Threads run_0 (s) | Max Exclusive Time Over Threads run_0 (s) | Inclusive Time w.r.t. Wall Time run_0 (s) | Exclusive Time w.r.t. Wall Time run_0 (s) | Nb Threads run_0 | Deviation (coverage) run_0 | Deviation (walltime) run_0 | Categories run_0 | GFLOPS run_0 | Compilation Options |
| ►main+ | attention-avx512 | 95.95 | 96.38 | 0.00 | 9.06 | 0.00 | 9.06 | 0.00 | 1 | 0.00 | 0.00 | Exe (%): 100.00 | 3.26 | AMD clang version 17.0.6 (CLANG: AOCC_5.1.0-Build#1994 2025_12_23) /cluster/comp/aocc/5.1.0/bin/clang-17 --driver-mode=g++ -O3 -g -fno-omit-frame-pointer -grecord-command-line -march=native -mprefer-vector-width=512 -Rpass=loop-vectorize -Rpass-analysis=lo... |
| ►Loop 75 - attention.cpp:157-160 - attention-avx512 [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 82 - random.tcc:401-3367 - attention-avx512 [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 84 - random.tcc:412-414 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 83 - random.tcc:401-406 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 79 - random.tcc:401-3367 - attention-avx512 [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 80 - random.tcc:401-406 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 81 - random.tcc:412-414 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 76 - random.tcc:401-3367 - attention-avx512 [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 78 - random.tcc:412-414 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 77 - random.tcc:401-406 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 54 - attention.cpp:238-241 - attention-avx512+ | | 0.00 | 0.11 | 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 55 - attention.cpp:239-241 - attention-avx512+ | | 0.00 | 0.11 | 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 53 - attention.cpp:240-241 - attention-avx512 | | 0.11 | 0.11 | 0.11 | 0.01 | 0.01 | 0.01 | 0.01 | 1 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 56 - attention.cpp:240-241 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 58 - attention.cpp:26-215 - attention-avx512 [...]+ | | 0.00 | 7.71 | 0.00 | 0.72 | 0.00 | 0.72 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 59 - attention.cpp:26-215 - attention-avx512 [...]+ | | 0.00 | 7.71 | 0.00 | 0.72 | 0.00 | 0.72 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 60 - attention.cpp:26-33 - attention-avx512+ | | 0.48 | 7.71 | 0.48 | 0.72 | 0.05 | 0.72 | 0.04 | 1 | 0.00 | 0.00 | | 3.06 | |
| ○Loop 62 - attention.cpp:30-31 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 61 - attention.cpp:30-31 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 57 - attention.cpp:30-31 - attention-avx512 | | 7.20 | 7.23 | 7.23 | 0.68 | 0.68 | 0.68 | 0.68 | 1 | 0.00 | 0.00 | | 3.64 | |
| ►Loop 85 - attention.cpp:156-156 - attention-avx512 [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 86 - random.tcc:401-3367 - attention-avx512 [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 88 - random.tcc:412-414 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 87 - random.tcc:401-406 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 64 - attention.cpp:26-193 - attention-avx512 [...]+ | | 0.00 | 7.71 | 0.00 | 0.72 | 0.00 | 0.72 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 65 - attention.cpp:26-193 - attention-avx512 [...]+ | | 0.05 | 7.71 | 0.05 | 0.72 | 0.00 | 0.72 | 0.00 | 1 | 0.00 | 0.00 | | 2.00 | |
| ►Loop 66 - attention.cpp:26-33 - attention-avx512+ | | 0.11 | 7.66 | 0.11 | 0.72 | 0.01 | 0.72 | 0.01 | 1 | 0.00 | 0.00 | | 12.50 | |
| ○Loop 67 - attention.cpp:30-31 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 63 - attention.cpp:30-31 - attention-avx512 | | 7.52 | 7.55 | 7.55 | 0.71 | 0.71 | 0.71 | 0.71 | 1 | 0.00 | 0.00 | | 3.49 | |
| ○Loop 68 - attention.cpp:30-31 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 70 - attention.cpp:26-171 - attention-avx512 [...]+ | | 0.00 | 7.71 | 0.00 | 0.72 | 0.00 | 0.72 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 71 - attention.cpp:26-171 - attention-avx512 [...]+ | | 0.00 | 7.71 | 0.00 | 0.72 | 0.00 | 0.72 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 72 - attention.cpp:26-33 - attention-avx512+ | | 0.26 | 7.71 | 0.27 | 0.72 | 0.02 | 0.72 | 0.03 | 1 | 0.00 | 0.00 | | 7.90 | |
| ○Loop 74 - attention.cpp:30-31 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 73 - attention.cpp:30-31 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 69 - attention.cpp:30-31 - attention-avx512 | | 7.41 | 7.45 | 7.45 | 0.70 | 0.70 | 0.70 | 0.70 | 1 | 0.00 | 0.00 | | 3.45 | |
| ○Loop 29 - random.tcc:330-336 - attention-avx512 [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 40 - attention.cpp:43-284 - attention-avx512 [...]+ | | 0.00 | 0.69 | 0.00 | 0.06 | 0.00 | 0.06 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 41 - attention.cpp:43-284 - attention-avx512 [...]+ | | 0.16 | 0.69 | 0.16 | 0.06 | 0.01 | 0.06 | 0.01 | 1 | 0.00 | 0.00 | | 16.33 | |
| ○Loop 45 - attention.cpp:47-48 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 37 - attention.cpp:52-53 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 38 - attention.cpp:52-53 - attention-avx512 | | 0.11 | 0.11 | 0.11 | 0.01 | 0.01 | 0.01 | 0.01 | 1 | 0.00 | 0.00 | | 5.25 | |
| ○Loop 46 - attention.cpp:47-48 - attention-avx512 | | 0.05 | 0.05 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | 3.50 | |
| ○Loop 43 - attention.cpp:55-56 - attention-avx512 | | 0.11 | 0.11 | 0.11 | 0.01 | 0.01 | 0.01 | 0.01 | 1 | 0.00 | 0.00 | | 2.75 | |
| ○Loop 36 - attention.cpp:47-48 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 44 - attention.cpp:52-53 - attention-avx512 | | 0.11 | 0.11 | 0.11 | 0.01 | 0.01 | 0.01 | 0.01 | 1 | 0.00 | 0.00 | | 5.00 | |
| ○Loop 39 - attention.cpp:55-56 - attention-avx512 | | 0.16 | 0.16 | 0.16 | 0.01 | 0.01 | 0.01 | 0.01 | 1 | 0.00 | 0.00 | | 13.66 | |
| ○Loop 42 - attention.cpp:55-56 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 31 - attention.cpp:26-306 - attention-avx512 [...]+ | | 0.00 | 41.33 | 0.00 | 3.88 | 0.00 | 3.88 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 32 - attention.cpp:26-306 - attention-avx512 [...]+ | | 0.00 | 41.33 | 0.00 | 3.88 | 0.00 | 3.88 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 33 - attention.cpp:26-33 - attention-avx512+ | | 0.64 | 41.33 | 0.64 | 3.88 | 0.06 | 3.88 | 0.06 | 1 | 0.00 | 0.00 | | 1.04 | |
| ○Loop 30 - attention.cpp:30-31 - attention-avx512 | | 40.51 | 40.69 | 40.69 | 3.82 | 3.82 | 3.82 | 3.82 | 1 | 0.00 | 0.00 | | 2.72 | |
| ○Loop 34 - attention.cpp:30-31 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 35 - attention.cpp:30-31 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 48 - attention.cpp:26-262 - attention-avx512 [...]+ | | 0.00 | 31.12 | 0.00 | 2.92 | 0.00 | 2.92 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 49 - attention.cpp:26-262 - attention-avx512 [...]+ | | 0.00 | 31.12 | 0.00 | 2.92 | 0.00 | 2.92 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 50 - attention.cpp:27-33 - attention-avx512+ | | 1.11 | 31.12 | 1.12 | 2.92 | 0.10 | 2.92 | 0.10 | 1 | 0.00 | 0.00 | | 10.33 | |
| ○Loop 47 - attention.cpp:30-31 - attention-avx512 | | 29.87 | 30.00 | 30.00 | 2.82 | 2.82 | 2.82 | 2.82 | 1 | 0.00 | 0.00 | | 3.38 | |
| ○Loop 52 - attention.cpp:30-31 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 51 - attention.cpp:30-31 - attention-avx512 | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○amd_opt_expf | libalm.so | 0.79 | 0.80 | 0.80 | 0.07 | 0.07 | 0.07 | 0.07 | 1 | 0.00 | 0.00 | Math (%): 100.00 | 10.10 | |
| ○call_v16_f32 | libalm.so | 0.74 | 0.74 | 0.74 | 0.07 | 0.07 | 0.07 | 0.07 | 1 | 0.00 | 0.00 | Math (%): 100.00 | 7.60 | |
| ○amd_vrs16_expf_zn5 | libalm.so | 0.64 | 0.64 | 0.64 | 0.06 | 0.06 | 0.06 | 0.06 | 1 | 0.00 | 0.00 | Math (%): 100.00 | 9.33 | |
| ○alm_expf_special | libalm.so | 0.48 | 0.48 | 0.48 | 0.05 | 0.05 | 0.04 | 0.04 | 1 | 0.00 | 0.00 | Math (%): 100.00 | 10.94 | |
| ○amd_expf_zn5 | libalm.so | 0.42 | 0.43 | 0.43 | 0.04 | 0.04 | 0.04 | 0.04 | 1 | 0.00 | 0.00 | Math (%): 100.00 | 9.12 | |
| ○amd_vrs8_expf_zn5 | libalm.so | 0.37 | 0.37 | 0.37 | 0.04 | 0.04 | 0.04 | 0.04 | 1 | 0.00 | 0.00 | Math (%): 100.00 | 10.07 | |
| ○amd_expf_zn5@plt | libalm.so | 0.05 | 0.05 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | Math (%): 100.00 | 2.50 | |
| ○unknown_function | attention-avx512 | 0.05 | 0.05 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | Exe (%): 100.00 | 4.00 | |
| ○__memset_avx512_unaligned_erms | libc.so.6 | 0.05 | 0.05 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | Memory (%): 100.00 | 5.50 | |