| Name | Module | Max Thread Time / Walltime gcc_0 (%) | Coverage gcc_0 (%) | Coverage Excluding Loops gcc_0 (%) | Max Inclusive Time Over Threads gcc_0 (s) | Max Exclusive Time Over Threads gcc_0 (s) | Inclusive Time w.r.t. Wall Time gcc_0 (s) | Exclusive Time w.r.t. Wall Time gcc_0 (s) | Nb Threads gcc_0 | Deviation (coverage) gcc_0 | Deviation (walltime) gcc_0 | Categories gcc_0 | Compilation Options |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ►kai_run_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm | libggml-cpu.so | 34.22 | 55.18 | 0.02 | 3.41 | 0.01 | 3.83 | 0.00 | 64 | 1.94 | 0.10 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C11 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu11 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ►Loop 2204 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 2203 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 2202 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 2207 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so | 0.00 | 55.16 | 0.00 | 3.44 | 0.00 | 3.83 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 2206 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so | 0.35 | 55.16 | 0.23 | 3.44 | 0.04 | 3.83 | 0.02 | 61 | 0.13 | 0.01 | |||
| ○Loop 2205 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so | 34.17 | 54.94 | 54.94 | 3.40 | 3.40 | 3.82 | 3.82 | 64 | 1.92 | 0.10 | |||
| ○gomp_team_barrier_wait_end | libgomp.so.1.0.0 | 14.45 | 19.94 | 19.94 | 1.44 | 1.44 | 1.38 | 1.38 | 64 | 2.29 | 0.13 | OMP (%): 100.00 | |
| ►ggml_vec_dot_q6_K_q8_K | libggml-cpu.so | 10.49 | 16.98 | 0.01 | 1.04 | 0.01 | 1.18 | 0.00 | 64 | 0.66 | 0.03 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C11 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu11 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ►Loop 2071 - quants.c:2486-2923 - libggml-cpu.so [...] | 0.25 | 16.97 | 0.08 | 1.15 | 0.03 | 1.18 | 0.01 | 37 | 0.07 | 0.00 | |||
| ○Loop 2070 - quants.c:2683-2758 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 2073 - quants.c:2492-2654 - libggml-cpu.so [...] | 2.16 | 16.90 | 2.74 | 1.12 | 0.22 | 1.17 | 0.19 | 64 | 0.38 | 0.02 | |||
| ○Loop 2072 - quants.c:2506-2575 - libggml-cpu.so [...] | 9.13 | 14.16 | 14.16 | 0.91 | 0.91 | 0.98 | 0.98 | 64 | 0.76 | 0.04 | |||
| ○Loop 2074 - quants.c:2683-2812 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►ggml_compute_forward_flash_attn_ext | libggml-cpu.so | 1.15 | 1.18 | 0.01 | 0.12 | 0.00 | 0.08 | 0.00 | 64 | 0.36 | 0.02 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C++17 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu++17 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ►Loop 1534 - ops.cpp:8778-8939 - libggml-cpu.so [...] | 0.00 | 1.17 | 0.00 | 0.18 | 0.00 | 0.08 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1530 - vec.h:282-725 - libggml-cpu.so [...] | 0.65 | 1.17 | 0.54 | 0.18 | 0.06 | 0.08 | 0.04 | 63 | 0.25 | 0.01 | |||
| ►Loop 1533 - ops.cpp:8778-8920 - libggml-cpu.so [...] | 0.05 | 0.15 | 0.01 | 0.06 | 0.00 | 0.01 | 0.00 | 4 | 0.00 | 0.00 | |||
| ○Loop 1531 - ops.cpp:8817-8819 - libggml-cpu.so [...] | 0.25 | 0.10 | 0.10 | 0.03 | 0.03 | 0.01 | 0.01 | 44 | 0.09 | 0.00 | |||
| ►Loop 1532 - ops.cpp:8778-8920 - libggml-cpu.so [...] | 0.10 | 0.04 | 0.02 | 0.03 | 0.01 | 0.00 | 0.00 | 12 | 0.03 | 0.00 | |||
| ○Loop 1537 - ops.cpp:8885-8886 - libggml-cpu.so [...] | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | |||
| ○Loop 1536 - vec.h:646-653 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1535 - vec.h:646-653 - libggml-cpu.so | 0.10 | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 14 | 0.03 | 0.00 | |||
| ○Loop 1546 - vec.h:646-653 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1543 - vec.h:290-338 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1540 - vec.h:411-458 - libggml-cpu.so | 0.55 | 0.45 | 0.45 | 0.05 | 0.05 | 0.03 | 0.03 | 64 | 0.20 | 0.01 | |||
| ○Loop 1539 - vec.h:461-466 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1541 - vec.h:710-717 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1538 - ops.cpp:8825-8826 - libggml-cpu.so | 0.10 | 0.03 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 17 | 0.03 | 0.00 | |||
| ○Loop 1544 - vec.h:343-348 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1542 - vec.h:343-348 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1545 - vec.h:290-338 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○__aarch64_ldadd4_acq_rel | libgomp.so.1.0.0 | 1.10 | 0.99 | 0.99 | 0.11 | 0.11 | 0.07 | 0.07 | 64 | 0.38 | 0.02 | OMP (%): 100.00 | |
| ►ggml_compute_forward_rope_f32(ggml_compute_params const*, ggml_tensor*, bool) | libggml-cpu.so | 0.85 | 0.95 | 0.01 | 0.09 | 0.00 | 0.07 | 0.00 | 64 | 0.26 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C++17 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu++17 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ►Loop 781 - ops.cpp:6210-6490 - libggml-cpu.so [...] | 0.05 | 0.94 | 0.01 | 0.12 | 0.01 | 0.07 | 0.00 | 5 | 0.00 | 0.00 | |||
| ○Loop 789 - ops.cpp:6446-6457 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 782 - ops.cpp:6210-6484 - libggml-cpu.so [...] | 0.00 | 0.88 | 0.00 | 0.10 | 0.00 | 0.06 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 775 - ops.cpp:6210-6481 - libggml-cpu.so [...] | 0.05 | 0.88 | 0.01 | 0.10 | 0.00 | 0.06 | 0.00 | 9 | 0.00 | 0.00 | |||
| ►Loop 777 - ops.cpp:6210-6481 - libggml-cpu.so [...] | 0.15 | 0.87 | 0.03 | 0.10 | 0.01 | 0.06 | 0.00 | 18 | 0.04 | 0.00 | |||
| ►Loop 776 - ops.cpp:6210-6481 - libggml-cpu.so [...] | 0.05 | 0.84 | 0.02 | 0.08 | 0.01 | 0.06 | 0.00 | 13 | 0.00 | 0.00 | |||
| ○Loop 778 - ops.cpp:6210-6245 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 780 - ops.cpp:6210-6245 - libggml-cpu.so [...] | 0.80 | 0.82 | 0.82 | 0.08 | 0.08 | 0.06 | 0.06 | 64 | 0.24 | 0.01 | |||
| ○Loop 779 - ops.cpp:6220-6245 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 783 - ops.cpp:6462-6475 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 784 - ops.cpp:6413-6426 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 788 - ops.cpp:6479-6483 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 787 - ops.cpp:6479-6484 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 790 - ops.cpp:6446-6457 - libggml-cpu.so [...] | 0.15 | 0.06 | 0.06 | 0.01 | 0.01 | 0.00 | 0.00 | 31 | 0.05 | 0.00 | |||
| ►Loop 785 - ops.cpp:6429-6479 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 786 - ops.cpp:6429-6442 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○$x | libc.so.6 | 0.90 | 0.84 | 0.84 | 0.09 | 0.09 | 0.06 | 0.06 | 64 | 0.30 | 0.02 | System (%): 100.00 | |
| ○__pthread_mutex_lock | libc.so.6 | 0.75 | 0.55 | 0.55 | 0.08 | 0.08 | 0.04 | 0.04 | 63 | 0.29 | 0.02 | Pthread (%): 100.00 | |
| ►ggml_vec_dot_f16 | libggml-cpu.so | 0.60 | 0.49 | 0.06 | 0.06 | 0.02 | 0.03 | 0.00 | 64 | 0.18 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C++17 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu++17 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ○Loop 759 - vec.cpp:231-262 - libggml-cpu.so | 0.50 | 0.40 | 0.40 | 0.05 | 0.05 | 0.03 | 0.03 | 64 | 0.18 | 0.01 | |||
| ►Loop 757 - vec.cpp:224-337 - libggml-cpu.so [...] | 0.10 | 0.03 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 21 | 0.03 | 0.00 | |||
| ○Loop 758 - vec.cpp:266-269 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►ggml_vec_swiglu_f32 | libggml-cpu.so | 1.15 | 0.32 | 0.00 | 0.12 | 0.00 | 0.02 | 0.00 | 16 | 0.33 | 0.02 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C++17 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu++17 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ○Loop 761 - vec.cpp:385-387 - libggml-cpu.so [...] | 1.10 | 0.32 | 0.32 | 0.11 | 0.11 | 0.02 | 0.02 | 16 | 0.31 | 0.02 | |||
| ○sincosf | libm.so.6 | 0.45 | 0.29 | 0.29 | 0.05 | 0.05 | 0.02 | 0.02 | 63 | 0.16 | 0.01 | Math (%): 100.00 | |
| ○gomp_barrier_wait_end | libgomp.so.1.0.0 | 0.40 | 0.27 | 0.27 | 0.04 | 0.04 | 0.02 | 0.02 | 63 | 0.15 | 0.01 | OMP (%): 100.00 | |
| ○__expf_finite | libm.so.6 | 0.30 | 0.23 | 0.23 | 0.03 | 0.03 | 0.02 | 0.02 | 59 | 0.13 | 0.01 | Math (%): 100.00 | |
| ►kai_run_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0 | libggml-cpu.so | 7.68 | 0.21 | 0.00 | 0.76 | 0.00 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C11 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu11 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ►Loop 2187 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-158 - libggml-cpu.so | 0.00 | 0.21 | 0.00 | 0.76 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 2189 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:135-141 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 2185 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...] | 0.90 | 0.21 | 0.02 | 0.76 | 0.09 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | |||
| ○Loop 2183 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-134 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 2188 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-139 - libggml-cpu.so [...] | 3.51 | 0.09 | 0.09 | 0.35 | 0.35 | 0.01 | 0.01 | 1 | 0.00 | 0.00 | |||
| ►Loop 2184 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...] | 0.75 | 0.09 | 0.02 | 0.32 | 0.07 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | |||
| ○Loop 2186 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so | 2.51 | 0.07 | 0.07 | 0.25 | 0.25 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | |||
| ►kai_run_lhs_quant_pack_qsi8d32p4x8sb_f32_neon | libggml-cpu.so | 2.06 | 0.18 | 0.00 | 0.21 | 0.00 | 0.01 | 0.00 | 4 | 0.71 | 0.03 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C11 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu11 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ►Loop 2170 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:90-340 - libggml-cpu.so [...] | 0.00 | 0.18 | 0.00 | 0.21 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 2169 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:267-340 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 2172 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:268-336 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 2171 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:271-332 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 2173 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:93-264 - libggml-cpu.so [...] | 0.00 | 0.18 | 0.00 | 0.21 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 2174 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:96-262 - libggml-cpu.so [...] | 2.06 | 0.18 | 0.18 | 0.21 | 0.21 | 0.01 | 0.01 | 4 | 0.71 | 0.03 | |||
| ►ggml_graph_compute_thread | libggml-cpu.so | 0.30 | 0.13 | 0.01 | 0.03 | 0.01 | 0.01 | 0.00 | 50 | 0.10 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C11 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu11 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ►Loop 73 - ggml-cpu.c:1424-1642 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 74 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 79 - ggml-cpu.c:1437-1465 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 78 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 77 - ggml-cpu.c:1437-1465 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 76 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 75 - ggml-cpu.c:1461-1462 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 72 - ggml-cpu.c:1585-1587 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 81 - ggml-cpu.c:1572-1579 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 80 - ggml-cpu.c:1572-1579 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 71 - ggml-cpu.c:1664-2898 - libggml-cpu.so [...] | 0.25 | 0.12 | 0.11 | 0.03 | 0.03 | 0.01 | 0.01 | 48 | 0.08 | 0.00 | |||
| ○Loop 70 - ggml-cpu.c:2879-2898 - libggml-cpu.so [...] | 0.05 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 7 | 0.00 | 0.00 | |||
| ○Loop 85 - ggml-cpu.c:2087-2088 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 84 - ggml-cpu.c:1552-1560 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 83 - ggml-cpu.c:1553-1560 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 82 - ggml-cpu.c:1554-1560 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►ggml_compute_forward_rms_norm | libggml-cpu.so | 0.50 | 0.13 | 0.00 | 0.05 | 0.01 | 0.01 | 0.00 | 18 | 0.22 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C++17 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu++17 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ►Loop 1149 - ops.cpp:4319-4343 - libggml-cpu.so [...] | 0.00 | 0.12 | 0.00 | 0.06 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1148 - ops.cpp:4320-4343 - libggml-cpu.so [...] | 0.00 | 0.12 | 0.00 | 0.06 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1147 - ops.cpp:4321-4338 - libggml-cpu.so [...] | 0.05 | 0.12 | 0.00 | 0.06 | 0.00 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | |||
| ○Loop 1150 - vec.h:646-653 - libggml-cpu.so | 0.10 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 4 | 0.04 | 0.00 | |||
| ○Loop 1151 - ops.cpp:4325-4326 - libggml-cpu.so | 0.45 | 0.12 | 0.12 | 0.05 | 0.05 | 0.01 | 0.01 | 16 | 0.17 | 0.01 | |||
| ►ggml_compute_forward_add_non_quantized | libggml-cpu.so | 0.40 | 0.12 | 0.02 | 0.04 | 0.01 | 0.01 | 0.00 | 27 | 0.19 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C++17 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu++17 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ►Loop 421 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | 0.05 | 0.10 | 0.00 | 0.04 | 0.00 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | |||
| ►Loop 423 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | 0.00 | 0.10 | 0.00 | 0.04 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 422 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 424 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | 0.40 | 0.10 | 0.10 | 0.04 | 0.04 | 0.01 | 0.01 | 16 | 0.15 | 0.01 | |||
| ○Loop 425 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 411 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 415 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 413 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 414 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 412 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 389 - binary-ops.cpp:10-146 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 396 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 400 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 398 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 397 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 399 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 390 - binary-ops.cpp:10-146 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 391 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 395 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 393 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 394 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 392 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 401 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 403 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 402 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 404 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 405 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 406 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 408 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 407 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 409 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 410 - binary-ops.cpp:42-45 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 416 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 418 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 417 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 419 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 420 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►ggml_cpu_fp32_to_fp16 | libggml-cpu.so | 0.25 | 0.11 | 0.00 | 0.02 | 0.00 | 0.01 | 0.00 | 41 | 0.10 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C11 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu11 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ○Loop 3 - ggml-cpu.c:3228-3229 - libggml-cpu.so [...] | 0.25 | 0.11 | 0.11 | 0.02 | 0.02 | 0.01 | 0.01 | 41 | 0.10 | 0.01 | |||
| ►ggml_compute_forward_mul_mat | libggml-cpu.so | 0.20 | 0.09 | 0.00 | 0.02 | 0.00 | 0.01 | 0.00 | 40 | 0.07 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C11 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu11 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ►Loop 61 - ggml-cpu.c:1125-1397 - libggml-cpu.so [...] | 0.05 | 0.09 | 0.00 | 0.04 | 0.00 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | |||
| ○Loop 63 - ggml-cpu.c:1132-1165 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 62 - ggml-cpu.c:1125-1395 - libggml-cpu.so [...] | 0.10 | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 13 | 0.02 | 0.00 | |||
| ○Loop 58 - ggml-cpu.c:1193-1397 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 56 - ggml-cpu.c:1193-1194 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 60 - ggml-cpu.c:1162-1198 - libggml-cpu.so [...] | 0.10 | 0.07 | 0.04 | 0.03 | 0.01 | 0.00 | 0.00 | 25 | 0.04 | 0.00 | |||
| ○Loop 57 - ggml-cpu.c:1197-1198 - libggml-cpu.so | 0.05 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | |||
| ►Loop 65 - ggml-cpu.c:1163-1198 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 64 - ggml-cpu.c:1164-1198 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 66 - ggml-cpu.c:1197-1198 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 59 - ggml-cpu.c:1193-1194 - libggml-cpu.so | 0.10 | 0.03 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 17 | 0.03 | 0.00 | |||
| ►Loop 69 - ggml-cpu.c:1289-1297 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 68 - ggml-cpu.c:1290-1297 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 67 - ggml-cpu.c:1291-1297 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○$x | libc.so.6 | 0.65 | 0.08 | 0.08 | 0.06 | 0.06 | 0.01 | 0.01 | 31 | 0.16 | 0.01 | System (%): 100.00 | |
| ○$x | libc.so.6 | 0.30 | 0.08 | 0.08 | 0.03 | 0.03 | 0.01 | 0.01 | 32 | 0.11 | 0.01 | Pthread (%): 100.00 | |
| ►ggml_compute_forward_mul | libggml-cpu.so | 0.25 | 0.08 | 0.01 | 0.03 | 0.01 | 0.01 | 0.00 | 25 | 0.12 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C++17 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu++17 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ►Loop 463 - binary-ops.cpp:18-154 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 475 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 479 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 477 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 478 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 476 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 470 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 474 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 472 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 473 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 471 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 464 - binary-ops.cpp:18-154 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 465 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 469 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 467 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 468 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 466 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 490 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 494 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 492 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 491 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 493 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 480 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 482 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 481 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 483 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 484 - binary-ops.cpp:42-45 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 485 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 487 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 486 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 488 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 489 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 495 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | 0.00 | 0.06 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 497 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | 0.00 | 0.06 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 496 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 498 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | 0.25 | 0.06 | 0.06 | 0.03 | 0.03 | 0.00 | 0.00 | 16 | 0.09 | 0.01 | |||
| ○Loop 499 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►ggml::cpu::kleidiai::extra_buffer_type::get_tensor_traits(ggml_tensor const*) | libggml-cpu.so | 0.15 | 0.07 | 0.05 | 0.02 | 0.01 | 0.00 | 0.00 | 33 | 0.06 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C++17 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu++17 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ○Loop 2144 - kleidiai.cpp:535-547 - libggml-cpu.so [...] | 0.15 | 0.01 | 0.01 | 0.02 | 0.02 | 0.00 | 0.00 | 9 | 0.06 | 0.00 | |||
| ○__GI___lll_lock_wake | libc.so.6 | 0.20 | 0.06 | 0.06 | 0.02 | 0.02 | 0.00 | 0.00 | 35 | 0.06 | 0.00 | Pthread (%): 86.96 System (%): 13.04 | |
| ○__GI___lll_lock_wait | libc.so.6 | 0.15 | 0.04 | 0.04 | 0.01 | 0.01 | 0.00 | 0.00 | 22 | 0.05 | 0.00 | Pthread (%): 100.00 | |
| ○unknown_function | libggml-cpu.so | 0.15 | 0.04 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 23 | 0.05 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | |
| ►ggml_cpu_extra_compute_forward | libggml-cpu.so | 0.15 | 0.03 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 20 | 0.05 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C++17 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu++17 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ○Loop 387 - traits.cpp:13-17 - libggml-cpu.so | 0.15 | 0.03 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 19 | 0.04 | 0.00 | |||
| ○$x | libc.so.6 | 0.15 | 0.03 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 20 | 0.05 | 0.00 | System (%): 100.00 | |
| ○ggml_is_empty | libggml-base.so | 0.10 | 0.03 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 21 | 0.03 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml-blas.so (%): 100.00 | GNU C11 14.2.0 -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu11 -fno-omit-frame-pointer -fcf-protection=none -fPIC |
| ○$x | libc.so.6 | 0.85 | 0.03 | 0.03 | 0.09 | 0.09 | 0.00 | 0.00 | 5 | 0.51 | 0.04 | System (%): 100.00 | |
| ►ggml::cpu::kleidiai::tensor_traits::compute_forward_q4_0(ggml_compute_params*, ggml_tensor*) [clone .isra.0] | libggml-cpu.so | 0.10 | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 16 | 0.02 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C++17 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu++17 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ○Loop 2134 - kleidiai.cpp:92-382 - libggml-cpu.so [...] | 0.10 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 4 | 0.04 | 0.00 | |||
| ○__log2_finite | libm.so.6 | 0.05 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 12 | 0.00 | 0.00 | Math (%): 100.00 | |
| ○gomp_team_barrier_wait | libgomp.so.1.0.0 | 0.05 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 11 | 0.00 | 0.00 | OMP (%): 100.00 | |
| ►ggml_compute_forward_set_rows | libggml-cpu.so | 0.10 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 10 | 0.03 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C++17 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu++17 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ►Loop 1220 - ops.cpp:5550-5563 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1219 - ops.cpp:5551-5563 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1221 - ops.cpp:5552-5563 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►ggml_compute_forward_add | libggml-cpu.so | 0.05 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 10 | 0.00 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../gcc/bin/libggml.so (%): 100.00 | GNU C++17 14.2.0 -mcpu=neoverse-v1+sm4+crc+aes+sha3+nossbs+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu++17 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ►Loop 1004 - vec.h:80-80 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1002 - ops.cpp:1395-1424 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1000 - ops.cpp:1395-1424 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1003 - vec.h:80-80 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1001 - ops.cpp:1422-1422 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 999 - vec.h:80-80 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○GOMP_barrier | libgomp.so.1.0.0 | 0.10 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 6 | 0.05 | 0.00 | OMP (%): 100.00 | |
| ○__GI___pthread_mutex_unlock_usercnt | libc.so.6 | 0.10 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 7 | 0.03 | 0.00 | Pthread (%): 100.00 |