| Name | Module | Max Thread Time / Walltime armclang_4 (%) | Coverage armclang_4 (%) | Coverage Excluding Loops armclang_4 (%) | Max Inclusive Time Over Threads armclang_4 (s) | Max Exclusive Time Over Threads armclang_4 (s) | Inclusive Time w.r.t. Wall Time armclang_4 (s) | Exclusive Time w.r.t. Wall Time armclang_4 (s) | Nb Threads armclang_4 | Deviation (coverage) armclang_4 | Deviation (walltime) armclang_4 | Categories armclang_4 | Compilation Options |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ►kai_run_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm | libggml-cpu.so | 34.13 | 55.24 | 0.02 | 3.33 | 0.01 | 3.75 | 0.00 | 64 | 2.26 | 0.09 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLE... |
| ►Loop 2478 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2477 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2476 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2481 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so | | 0.00 | 55.22 | 0.00 | 3.34 | 0.00 | 3.75 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2480 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so | | 0.41 | 55.22 | 0.28 | 3.34 | 0.04 | 3.75 | 0.02 | 62 | 0.14 | 0.01 | | |
| ○Loop 2479 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so | | 33.82 | 54.94 | 54.94 | 3.30 | 3.30 | 3.73 | 3.73 | 64 | 2.27 | 0.09 | | |
| ○kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libomp.so | 14.07 | 17.53 | 17.53 | 1.38 | 1.38 | 1.19 | 1.19 | 64 | 1.94 | 0.12 | OMP (%): 100.00 | |
| ►ggml_vec_dot_q6_K_q8_K | libggml-cpu.so | 10.03 | 16.37 | 0.04 | 0.98 | 0.01 | 1.11 | 0.00 | 64 | 0.62 | 0.02 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLE... |
| ○Loop 2375 - quants.c:2835-2913 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2377 - quants.c:2492-2660 - libggml-cpu.so [...] | | 1.84 | 16.32 | 2.18 | 1.05 | 0.18 | 1.11 | 0.15 | 64 | 0.42 | 0.02 | | |
| ○Loop 2376 - quants.c:2506-2590 - libggml-cpu.so [...] | | 8.85 | 14.14 | 14.14 | 0.87 | 0.87 | 0.96 | 0.96 | 64 | 0.67 | 0.03 | | |
| ○kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libomp.so | 1.94 | 1.97 | 1.97 | 0.19 | 0.19 | 0.13 | 0.13 | 64 | 0.50 | 0.03 | OMP (%): 100.00 | |
| ►ggml_compute_forward_flash_attn_ext | libggml-cpu.so | 1.33 | 1.39 | 0.02 | 0.13 | 0.01 | 0.09 | 0.00 | 64 | 0.33 | 0.02 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ►Loop 1838 - ops.cpp:8778-8920 - libggml-cpu.so [...] | | 0.05 | 1.36 | 0.00 | 0.17 | 0.00 | 0.09 | 0.00 | 1 | 0.00 | 0.00 | | |
| ►Loop 1837 - vec.h:375-751 - libggml-cpu.so [...] | | 0.10 | 1.36 | 0.02 | 0.17 | 0.01 | 0.09 | 0.00 | 13 | 0.03 | 0.00 | | |
| ○Loop 1858 - vec.h:677-682 - libggml-cpu.so | | 0.10 | 0.03 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 16 | 0.04 | 0.00 | | |
| ○Loop 1861 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1856 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1857 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1860 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1859 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1840 - vec.h:375-751 - libggml-cpu.so [...] | | 0.87 | 1.31 | 0.82 | 0.15 | 0.09 | 0.09 | 0.06 | 64 | 0.24 | 0.01 | | |
| ○Loop 1843 - vec.h:375-381 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1853 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1849 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1848 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1855 - vec.h:740-745 - libggml-cpu.so | | 0.05 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | | |
| ○Loop 1845 - vec.h:387-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1846 - vec.h:375-381 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1852 - vec.h:491-497 - libggml-cpu.so | | 0.56 | 0.49 | 0.49 | 0.05 | 0.05 | 0.03 | 0.03 | 64 | 0.19 | 0.01 | | |
| ○Loop 1841 - vec.h:386-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1854 - vec.h:751-751 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1851 - vec.h:503-503 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1842 - vec.h:387-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1844 - vec.h:386-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1847 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1850 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1839 - ops.cpp:8885-8886 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1836 - ops.cpp:8885-8886 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○$x | libc.so.6 | 1.07 | 1.03 | 1.03 | 0.10 | 0.10 | 0.07 | 0.07 | 64 | 0.44 | 0.02 | System (%): 100.00 | |
| ►ggml_compute_forward_rope_f32(ggml_compute_params const*, ggml_tensor*, bool) | libggml-cpu.so | 0.97 | 0.84 | 0.02 | 0.09 | 0.01 | 0.06 | 0.00 | 64 | 0.24 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ►Loop 1528 - ops.cpp:6210-6484 - libggml-cpu.so [...] | | 0.05 | 0.83 | 0.02 | 0.13 | 0.01 | 0.06 | 0.00 | 14 | 0.00 | 0.00 | | |
| ►Loop 1527 - ops.cpp:6210-6462 - libggml-cpu.so [...] | | 0.15 | 0.70 | 0.04 | 0.08 | 0.01 | 0.05 | 0.00 | 28 | 0.04 | 0.00 | | |
| ○Loop 1537 - ops.cpp:6210-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1536 - ops.cpp:6220-6245 - libggml-cpu.so [...] | | 0.61 | 0.66 | 0.66 | 0.06 | 0.06 | 0.04 | 0.04 | 64 | 0.22 | 0.01 | | |
| ○Loop 1534 - ops.cpp:6210-6303 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1535 - ops.cpp:6220-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1529 - ops.cpp:6462-6475 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1533 - ops.cpp:6429-6442 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1530 - ops.cpp:6479-6484 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1531 - ops.cpp:6446-6456 - libggml-cpu.so [...] | | 0.46 | 0.11 | 0.11 | 0.05 | 0.05 | 0.01 | 0.01 | 45 | 0.11 | 0.01 | | |
| ○Loop 1532 - ops.cpp:6413-6426 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○__sched_yield | libc.so.6 | 0.77 | 0.65 | 0.65 | 0.08 | 0.08 | 0.04 | 0.04 | 64 | 0.25 | 0.01 | OMP (%): 100.00 | |
| ○__pthread_mutex_lock | libc.so.6 | 0.87 | 0.59 | 0.59 | 0.09 | 0.09 | 0.04 | 0.04 | 56 | 0.37 | 0.02 | Pthread (%): 100.00 | |
| ►ggml_vec_dot_f16 | libggml-cpu.so | 0.72 | 0.57 | 0.02 | 0.07 | 0.01 | 0.04 | 0.00 | 64 | 0.24 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ○Loop 855 - vec.cpp:325-325 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 856 - vec.cpp:324-325 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 857 - vec.cpp:311-316 - libggml-cpu.so | | 0.72 | 0.54 | 0.54 | 0.07 | 0.07 | 0.04 | 0.04 | 64 | 0.24 | 0.01 | | |
| ►ggml_vec_swiglu_f32 | libggml-cpu.so | 1.18 | 0.36 | 0.00 | 0.12 | 0.00 | 0.02 | 0.00 | 16 | 0.32 | 0.02 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ►Loop 863 - vec.cpp:402-405 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 864 - vec.cpp:403-403 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 862 - vec.cpp:402-403 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 865 - vec.h:1045-1072 - libggml-cpu.so [...] | | 1.18 | 0.35 | 0.35 | 0.12 | 0.12 | 0.02 | 0.02 | 16 | 0.33 | 0.02 | | |
| ○unknown_function | [vdso] | 0.61 | 0.32 | 0.00 | 0.06 | 0.00 | 0.02 | 0.00 | 59 | 0.19 | 0.01 | OMP (%): 100.00 | |
| ○__sincosf_finite | libamath.so | 0.36 | 0.27 | 0.27 | 0.04 | 0.04 | 0.02 | 0.02 | 61 | 0.13 | 0.01 | Math (%): 100.00 | |
| ○__aarch64_ldadd8_acq_rel | libomp.so | 0.67 | 0.26 | 0.26 | 0.06 | 0.06 | 0.02 | 0.02 | 53 | 0.23 | 0.01 | OMP (%): 100.00 | |
| ○__expf_finite | libamath.so | 0.36 | 0.24 | 0.24 | 0.04 | 0.04 | 0.02 | 0.02 | 61 | 0.13 | 0.01 | Math (%): 100.00 | |
| ○__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*) | libomp.so | 0.36 | 0.21 | 0.21 | 0.04 | 0.04 | 0.01 | 0.01 | 59 | 0.13 | 0.01 | OMP (%): 100.00 | |
| ►kai_run_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0 | libggml-cpu.so | 7.16 | 0.19 | 0.00 | 0.70 | 0.00 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLE... |
| ►Loop 2455 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2457 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2456 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-142 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2453 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2452 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2454 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-148 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2458 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2460 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2459 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-134 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2461 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...] | | 0.56 | 0.19 | 0.02 | 0.70 | 0.06 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | | |
| ○Loop 2462 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-139 - libggml-cpu.so [...] | | 4.14 | 0.11 | 0.11 | 0.41 | 0.41 | 0.01 | 0.01 | 1 | 0.00 | 0.00 | | |
| ○Loop 2463 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so | | 2.46 | 0.07 | 0.07 | 0.24 | 0.24 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | |
| ►ggml_graph_compute_thread | libggml-cpu.so | 0.36 | 0.18 | 0.01 | 0.03 | 0.01 | 0.01 | 0.00 | 50 | 0.15 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLE... |
| ○Loop 76 - ggml-cpu.c:1592-1601 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 88 - ggml-cpu.c:1572-1579 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 87 - ggml-cpu.c:1573-1579 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 75 - ggml-cpu.c:533-2897 - libggml-cpu.so [...] | | 0.36 | 0.16 | 0.16 | 0.03 | 0.03 | 0.01 | 0.01 | 50 | 0.14 | 0.01 | | |
| ○Loop 74 - ggml-cpu.c:533-2897 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 92 - ggml-cpu.c:2087-2088 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 91 - ggml-cpu.c:1552-1560 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 90 - ggml-cpu.c:1552-1560 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 89 - ggml-cpu.c:1552-1560 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 77 - ggml-cpu.c:1436-1642 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 81 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 80 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 79 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 78 - ggml-cpu.c:1454-1462 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 85 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 84 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 83 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 82 - ggml-cpu.c:1461-1462 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 86 - ggml-cpu.c:1585-1587 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►kai_run_lhs_quant_pack_qsi8d32p4x8sb_f32_neon | libggml-cpu.so | 1.74 | 0.17 | 0.00 | 0.17 | 0.00 | 0.01 | 0.00 | 4 | 0.22 | 0.02 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLE... |
| ►Loop 2445 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:268-335 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2446 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:271-332 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2448 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:93-264 - libggml-cpu.so [...] | | 0.00 | 0.17 | 0.00 | 0.17 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2447 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:96-258 - libggml-cpu.so [...] | | 1.74 | 0.17 | 0.17 | 0.17 | 0.17 | 0.01 | 0.01 | 4 | 0.22 | 0.02 | | |
| ►ggml_compute_forward_mul | libggml-cpu.so | 0.41 | 0.13 | 0.04 | 0.04 | 0.01 | 0.01 | 0.00 | 35 | 0.17 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ►Loop 495 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 497 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 496 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 498 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 488 - binary-ops.cpp:18-110 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 489 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 487 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 490 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 461 - binary-ops.cpp:18-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 462 - binary-ops.cpp:18-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 460 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 473 - binary-ops.cpp:18-110 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 468 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 470 - binary-ops.cpp:18-110 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 471 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 472 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 469 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 480 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 481 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 479 - binary-ops.cpp:18-101 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 478 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 477 - binary-ops.cpp:84-101 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 494 - binary-ops.cpp:18-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 493 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 483 - binary-ops.cpp:18-110 - libggml-cpu.so [...] | | 0.00 | 0.09 | 0.00 | 0.04 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 485 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 486 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 484 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.41 | 0.09 | 0.09 | 0.04 | 0.04 | 0.01 | 0.01 | 16 | 0.16 | 0.01 | | |
| ○Loop 482 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 475 - binary-ops.cpp:18-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 474 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 476 - binary-ops.cpp:18-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 454 - binary-ops.cpp:18-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 455 - binary-ops.cpp:18-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 453 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 491 - binary-ops.cpp:18-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 492 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 500 - binary-ops.cpp:18-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 499 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 501 - binary-ops.cpp:18-110 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 503 - binary-ops.cpp:18-110 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 502 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 504 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 456 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 457 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 459 - binary-ops.cpp:42-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 458 - binary-ops.cpp:42-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 464 - binary-ops.cpp:18-110 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 463 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 466 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 467 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 465 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►ggml_cpu_fp32_to_fp16 | libggml-cpu.so | 0.31 | 0.13 | 0.00 | 0.03 | 0.01 | 0.01 | 0.00 | 47 | 0.10 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLE... |
| ○Loop 0 - ggml-cpu.c:3228-3229 - libggml-cpu.so [...] | | 0.31 | 0.12 | 0.12 | 0.03 | 0.03 | 0.01 | 0.01 | 46 | 0.10 | 0.01 | | |
| ○Loop 1 - ggml-cpu.c:3228-3229 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►ggml_compute_forward_add_non_quantized | libggml-cpu.so | 0.46 | 0.12 | 0.02 | 0.04 | 0.01 | 0.01 | 0.00 | 28 | 0.20 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ►Loop 394 - binary-ops.cpp:10-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 393 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 379 - binary-ops.cpp:10-110 - libggml-cpu.so [...] | | 0.00 | 0.09 | 0.00 | 0.04 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 381 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 382 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 378 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 380 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.46 | 0.09 | 0.09 | 0.04 | 0.04 | 0.01 | 0.01 | 16 | 0.19 | 0.01 | | |
| ►Loop 369 - binary-ops.cpp:10-110 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 366 - binary-ops.cpp:10-110 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 365 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 367 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 368 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 364 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 395 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 397 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 396 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 398 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 360 - binary-ops.cpp:10-110 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 362 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 359 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 363 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 361 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 400 - binary-ops.cpp:10-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 399 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 391 - binary-ops.cpp:10-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 392 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 350 - binary-ops.cpp:10-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 351 - binary-ops.cpp:10-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 349 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 371 - binary-ops.cpp:10-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 372 - binary-ops.cpp:10-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 370 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 357 - binary-ops.cpp:10-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 358 - binary-ops.cpp:10-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 356 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 376 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 377 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 375 - binary-ops.cpp:10-101 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 374 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 373 - binary-ops.cpp:84-101 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 387 - binary-ops.cpp:10-110 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 389 - binary-ops.cpp:10-110 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 388 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 390 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 352 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 353 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 355 - binary-ops.cpp:42-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 354 - binary-ops.cpp:42-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 384 - binary-ops.cpp:10-110 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 386 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 383 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 385 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►ggml_compute_forward_mul_mat | libggml-cpu.so | 0.20 | 0.10 | 0.00 | 0.02 | 0.00 | 0.01 | 0.00 | 46 | 0.06 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLE... |
| ►Loop 62 - ggml-cpu.c:1289-1297 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 61 - ggml-cpu.c:1289-1297 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 60 - ggml-cpu.c:1289-1297 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 55 - ggml-cpu.c:1125-1395 - libggml-cpu.so [...] | 0.00 | 0.10 | 0.00 | 0.05 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 53 - ggml-cpu.c:1125-1395 - libggml-cpu.so [...] | 0.10 | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 12 | 0.04 | 0.00 | |||
| ►Loop 56 - ggml-cpu.c:1162-1198 - libggml-cpu.so [...] | 0.00 | 0.08 | 0.00 | 0.04 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 57 - ggml-cpu.c:1162-1198 - libggml-cpu.so [...] | 0.05 | 0.08 | 0.00 | 0.04 | 0.00 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | |||
| ►Loop 58 - ggml-cpu.c:1163-1198 - libggml-cpu.so [...] | 0.15 | 0.08 | 0.04 | 0.03 | 0.02 | 0.01 | 0.00 | 26 | 0.04 | 0.00 | |||
| ○Loop 59 - ggml-cpu.c:1197-1198 - libggml-cpu.so | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3 | 0.01 | 0.00 | |||
| ○Loop 54 - ggml-cpu.c:1183-1194 - libggml-cpu.so [...] | 0.10 | 0.03 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 21 | 0.03 | 0.00 | |||
| ○__kmp_hyper_barrier_gather(barrier_type, kmp_info*, int, int, void (*)(void*, void*), void*) | libomp.so | 0.26 | 0.09 | 0.09 | 0.03 | 0.03 | 0.01 | 0.01 | 39 | 0.09 | 0.01 | OMP (%): 100.00 | |
| ○__memcpy | libastring.so | 0.77 | 0.08 | 0.08 | 0.08 | 0.08 | 0.01 | 0.01 | 32 | 0.18 | 0.01 | String (%): 100.00 | |
| ○kmp_flag_native<unsigned long long, (flag_type)1, true>::done_check() | libomp.so | 0.46 | 0.08 | 0.08 | 0.05 | 0.05 | 0.01 | 0.01 | 17 | 0.19 | 0.01 | OMP (%): 100.00 | |
| ○ggml::cpu::kleidiai::extra_buffer_type::get_tensor_traits(ggml_tensor const*) | libggml-cpu.so | 0.26 | 0.07 | 0.07 | 0.02 | 0.02 | 0.00 | 0.00 | 33 | 0.08 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ○$x | libc.so.6 | 0.20 | 0.07 | 0.07 | 0.02 | 0.02 | 0.00 | 0.00 | 33 | 0.08 | 0.00 | Pthread (%): 100.00 | |
| ○__kmp_barrier | libomp.so | 0.26 | 0.07 | 0.07 | 0.03 | 0.03 | 0.00 | 0.00 | 35 | 0.06 | 0.00 | OMP (%): 100.00 | |
| ○__GI___lll_lock_wait | libc.so.6 | 0.15 | 0.06 | 0.06 | 0.02 | 0.02 | 0.00 | 0.00 | 33 | 0.05 | 0.00 | Pthread (%): 100.00 | |
| ○unknown_function | libggml-cpu.so | 0.20 | 0.06 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 30 | 0.07 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | |
| ○@plt_start@ | libomp.so | 0.10 | 0.05 | 0.05 | 0.01 | 0.01 | 0.00 | 0.00 | 31 | 0.04 | 0.00 | OMP (%): 100.00 | |
| ►ggml_compute_forward_rms_norm | libggml-cpu.so | 0.20 | 0.05 | 0.00 | 0.02 | 0.01 | 0.00 | 0.00 | 16 | 0.09 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ►Loop 1266 - ops.cpp:4319-4333 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1267 - ops.cpp:4319-4333 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1268 - vec.h:687-688 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1269 - vec.h:688-688 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1265 - vec.h:687-688 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1293 - ops.cpp:4319-4333 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1292 - ops.cpp:4319-4333 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1291 - ops.cpp:4319-4333 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1290 - vec.h:677-682 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1280 - ops.cpp:4319-4333 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1284 - ops.cpp:4320-4333 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1282 - ops.cpp:4321-4333 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1283 - ops.cpp:4321-4333 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1281 - ops.cpp:4321-4333 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1285 - ops.cpp:4319-4333 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1286 - ops.cpp:4321-4333 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1289 - ops.cpp:4320-4333 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1288 - ops.cpp:4321-4333 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1287 - ops.cpp:4321-4333 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1276 - ops.cpp:4319-4338 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1277 - vec.h:687-688 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1275 - vec.h:687-688 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1274 - ops.cpp:4325-4326 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1279 - ops.cpp:4325-4326 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1278 - vec.h:688-688 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1272 - vec.h:677-688 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1273 - vec.h:688-688 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1270 - vec.h:677-682 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1271 - vec.h:687-688 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1259 - ops.cpp:4319-4365 - libggml-cpu.so [...] | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | |||
| ►Loop 1260 - ops.cpp:4319-4338 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1261 - ops.cpp:4319-4338 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1262 - vec.h:677-688 - libggml-cpu.so [...] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1258 - vec.h:687-688 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1256 - ops.cpp:4325-4326 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1264 - ops.cpp:4325-4326 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1263 - vec.h:688-688 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1257 - vec.h:677-682 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1298 - ops.cpp:4319-4338 - libggml-cpu.so [...] | 0.00 | 0.04 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1297 - ops.cpp:4319-4338 - libggml-cpu.so [...] | 0.00 | 0.04 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1296 - ops.cpp:4319-4338 - libggml-cpu.so [...] | 0.05 | 0.04 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | |||
| ○Loop 1295 - vec.h:677-682 - libggml-cpu.so | 0.10 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 7 | 0.03 | 0.00 | |||
| ○Loop 1299 - ops.cpp:4325-4326 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1294 - ops.cpp:4325-4326 - libggml-cpu.so | 0.15 | 0.03 | 0.03 | 0.02 | 0.02 | 0.00 | 0.00 | 11 | 0.07 | 0.00 | |||
| ○__GI___lll_lock_wake | libc.so.6 | 0.15 | 0.05 | 0.05 | 0.02 | 0.02 | 0.00 | 0.00 | 26 | 0.05 | 0.00 | Pthread (%): 91.18 System (%): 8.82 | |
| ○$x | libc.so.6 | 0.20 | 0.04 | 0.04 | 0.02 | 0.02 | 0.00 | 0.00 | 20 | 0.06 | 0.00 | System (%): 100.00 | |
| ○ggml_is_empty | libggml-base.so | 0.15 | 0.03 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 17 | 0.05 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 -D GGML_BUILD -D GGML_COMMIT="unknown" -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLEIDIAI ... |
| ○__kmpc_barrier | libomp.so | 0.20 | 0.02 | 0.02 | 0.02 | 0.02 | 0.00 | 0.00 | 14 | 0.07 | 0.00 | OMP (%): 100.00 | |
| ○__kmp_yield | libomp.so | 0.10 | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 15 | 0.03 | 0.00 | OMP (%): 100.00 | |
| ○__memset | libastring.so | 0.41 | 0.02 | 0.02 | 0.04 | 0.04 | 0.00 | 0.00 | 7 | 0.19 | 0.01 | String (%): 100.00 | |
| ○ggml::cpu::kleidiai::tensor_traits::compute_forward_q4_0(ggml_compute_params*, ggml_tensor*) | libggml-cpu.so | 0.10 | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 10 | 0.04 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ►ggml_compute_forward_set_rows | libggml-cpu.so | 0.15 | 0.02 | 0.02 | 0.02 | 0.02 | 0.00 | 0.00 | 10 | 0.06 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ►Loop 1451 - ops.cpp:5550-5563 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ►Loop 1450 - ops.cpp:5551-5563 - libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | |||
| ○Loop 1449 - ops.cpp:5552-5563 - libggml-cpu.so | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | |||
| ○ggml_is_contiguous_1 | libggml-base.so | 0.10 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 9 | 0.03 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 -D GGML_BUILD -D GGML_COMMIT="unknown" -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLEIDIAI ... |
| ○__fs_pow_1 | libamath.so | 0.05 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 10 | 0.00 | 0.00 | Math (%): 100.00 | |
| ○__kmp_now_nsec | libomp.so | 0.05 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 9 | 0.00 | 0.00 | OMP (%): 100.00 | |
| ○__GI___pthread_mutex_unlock_usercnt | libc.so.6 | 0.05 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 9 | 0.00 | 0.00 | Pthread (%): 100.00 | |
| ►ggml_cpu_extra_compute_forward | libggml-cpu.so | 0.15 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 7 | 0.07 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-18-66/176-138-1719/llama.cpp/build/llama.cpp/../armclang_4/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ○Loop 347 - traits.cpp:13-17 - libggml-cpu.so [...] | 0.15 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 6 | 0.07 | 0.00 | |||
| ○__ieee754_log2 | libamath.so | 0.05 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 9 | 0.00 | 0.00 | Math (%): 100.00 | |
| ○__exp2f_finite | libamath.so | 0.05 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 8 | 0.01 | 0.00 | Math (%): 100.00 | |