| Run gcc | Run armclang |
| | | |
| ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
| 1128 | 0.05 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | 0.00 | -1 | 0.10 | 0.01 | 0.01 | 1 | 0.00 | 0.00 | 0.00 |
| 1299 | 0.39 | 0.04 | 0.04 | 1 | 0.00 | 0.00 | 0.81 | 443 | 0.05 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 2.50 |
| 443 | 0.58 | 0.06 | 0.06 | 1 | 0.00 | 0.00 | 5.92 | 338 | 0.30 | 0.03 | 0.03 | 1 | 0.00 | 0.00 | 12.42 |
| 338 | 8.13 | 0.84 | 0.84 | 1 | 0.00 | 0.00 | 4.22 | -1 | 0.20 | 0.02 | 0.02 | 1 | 0.00 | 0.00 | 7.88 |
| -1 | 0.05 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | 0.00 | 1299 | 0.30 | 0.03 | 0.03 | 1 | 0.00 | 0.00 | 1.29 |
| -1 | 0.15 | 0.01 | 0.02 | 1 | 0.00 | 0.00 | 7.92 | 21 | 0.30 | 0.03 | 0.03 | 1 | 0.00 | 0.00 | 21.50 |
| 20 | 0.70 | 0.07 | 0.07 | 1 | 0.00 | 0.00 | 27.88 |
| 248 | 0.25 | 0.02 | 0.03 | 1 | 0.00 | 0.00 | 27.55 |
| -1 | 0.05 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00 |
| Run gcc | Run armclang |
| - /home/eoseret/llm-attention/attention_v2.cpp: 42-63
| | |
| ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
| 10 | 0.92 | 0.09 | 0.09 | 1 | 0.00 | 0.00 | 2.76 | |
| Run gcc | Run armclang |
| - /usr/include/c++/14/bits/random.tcc: 397-397
- /usr/include/c++/14/bits/random.tcc: 404-425
| | |
| ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
| 12 | 0.48 | 0.05 | 0.05 | 1 | 0.00 | 0.00 | 0.03 | |
| Name | Module | Coverage (%) | Inclusive Time w.r.t. Wall Time(s) | Max Inc. Time over Threads(s) | Nb Threads | GFLOP/s | Deviation (coverage) | Deviation (time) |
| gcc | armclang | gcc | armclang | gcc | armclang | gcc | armclang | gcc | armclang | gcc | armclang | gcc | armclang |
| main | binary | 89.26 | 97.69 | 9.23 | 9.72 | 9.23 | 9.72 | 1 | 1 | 6.33 | 6.14 | 0.00 | 0.00 | 0.00 | 0.00 |
| __expf_finite | libm.so.6 | 8.13 | 0.30 | 0.84 | 0.03 | 0.84 | 0.03 | 1 | 1 | 4.22 | 12.42 | 0.00 | 0.00 | 0.00 | 0.00 |
| softmax(float const*, float*, float*, int) | binary | 0.92 | NA | 0.09 | NA | 0.09 | NA | 1 | NA | 2.76 | NA | 0.00 | NA | 0.00 | NA |
| _ZGVsMxv_expf | libamath.so | NA | 0.70 | NA | 0.07 | NA | 0.07 | NA | 1 | NA | 27.88 | NA | 0.00 | NA | 0.00 |
| __GI___memset_generic | libc.so.6 | 0.39 | 0.30 | 0.04 | 0.03 | 0.04 | 0.03 | 1 | 1 | 0.81 | 1.29 | 0.00 | 0.00 | 0.00 | 0.00 |
| __exp2f_finite | libm.so.6 | 0.58 | 0.05 | 0.06 | 0.00 | 0.06 | 0.00 | 1 | 1 | 5.92 | 2.50 | 0.00 | 0.00 | 0.00 | 0.00 |
| std::mersenne_twister_engine<unsigned long, 32ul, 624ul, 397ul, 31ul, 2567483615ul, 11ul, 4294967295ul, 7ul, 2636928640ul, 15ul, 4022730752ul, 18ul, 1812433253ul>::_M_gen_rand() | binary | 0.48 | NA | 0.05 | NA | 0.05 | NA | 1 | NA | 0.03 | NA | 0.00 | NA | 0.00 | NA |
| unknown_function | binary | 0.15 | 0.20 | 0.01 | 0.02 | 0.02 | 0.02 | 1 | 1 | 7.92 | 7.88 | 0.00 | 0.00 | 0.00 | 0.00 |
| special_case | libamath.so | NA | 0.30 | NA | 0.03 | NA | 0.03 | NA | 1 | NA | 21.50 | NA | 0.00 | NA | 0.00 |
| _ZGVnN4v_expf | libamath.so | NA | 0.25 | NA | 0.02 | NA | 0.03 | NA | 1 | NA | 27.55 | NA | 0.00 | NA | 0.00 |
| unknown_function | [vdso] | 0.05 | 0.10 | 0.01 | 0.01 | 0.00 | 0.01 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| crng_make_state | kernel | NA | 0.05 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
| el0_svc_common.constprop.0 | kernel | NA | 0.05 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
| _int_malloc | libc.so.6 | 0.05 | NA | 0.01 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA |