Runs compared: GCC = Neoverse V2 GCC O3 Manual Unroll (250 iterations, 96 threads); ACFL = Neoverse V2 ACFL Ofast Manual Unroll (250 iterations, 96 threads).
Name | Module | Coverage (%) [GCC] | Coverage (%) [ACFL] | Inclusive Time w.r.t. Wall Time (s) [GCC] | Inclusive Time w.r.t. Wall Time (s) [ACFL] | Max Inc. Time over Threads (s) [GCC] | Max Inc. Time over Threads (s) [ACFL] | Nb Threads [GCC] | Nb Threads [ACFL] | GFLOP/s [GCC] | GFLOP/s [ACFL] | Deviation (coverage) [GCC] | Deviation (coverage) [ACFL] | Deviation (time) [GCC] | Deviation (time) [ACFL] |
k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | binary | 58.54 | NA | 13.35 | NA | 17.72 | NA | 96 | NA | 101.55 | NA | 11.84 | NA | 3.02 | NA |
k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined] | binary | NA | 50.51 | NA | 5.22 | NA | 5.18 | NA | 96 | NA | 109.59 | NA | 4.57 | NA | 0.15 |
kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libomp.so | NA | 40.82 | NA | 4.22 | NA | 4.45 | NA | 96 | NA | 0.00 | NA | 4.34 | NA | 0.48 |
gomp_team_barrier_wait_end | libgomp.so.1.0.0 | 30.43 | NA | 6.94 | NA | 9.06 | NA | 96 | NA | 0.00 | NA | 6.26 | NA | 1.19 | NA |
gomp_barrier_wait_end | libgomp.so.1.0.0 | 10.01 | NA | 2.28 | NA | 3.86 | NA | 95 | NA | 0.00 | NA | 6.02 | NA | 1.18 | NA |
k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined.3] | binary | NA | 4.56 | NA | 0.47 | NA | 0.52 | NA | 96 | NA | 45.18 | NA | 0.42 | NA | 0.04 |
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libomp.so | NA | 2.59 | NA | 0.27 | NA | 0.37 | NA | 95 | NA | 0.00 | NA | 0.20 | NA | 0.02 |
__sched_yield | libc.so.6 | NA | 1.32 | NA | 0.14 | NA | 0.17 | NA | 95 | NA | 0.00 | NA | 0.13 | NA | 0.01 |
k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.1] | binary | 1.01 | NA | 0.23 | NA | 0.67 | NA | 95 | NA | 45.77 | NA | 0.96 | NA | 0.19 | NA |
@plt_start@ | libomp.so | NA | 0.14 | NA | 0.01 | NA | 0.02 | NA | 95 | NA | 0.00 | NA | 0.04 | NA | 0.00 |
__kmp_yield | libomp.so | NA | 0.05 | NA | 0.01 | NA | 0.01 | NA | 93 | NA | 0.00 | NA | 0.02 | NA | 0.00 |
__aarch64_ldadd4_acq_rel | libgomp.so.1.0.0 | 0.00 | NA | 0.00 | NA | 0.00 | NA | 22 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA |
__aarch64_ldadd8_acq_rel | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 6 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
__kmpc_for_static_fini | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 5 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
__kmp_invoke_task_func | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 5 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*) | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 5 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined_debug__.2] [clone .omp] [clone .reduction] [clone .reduction_func] | binary | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 4 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
__kmp_task_team_sync | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 3 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
kmp_flag_native<unsigned long long, (flag_type)1, true>::done_check() | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
__kmp_hyper_barrier_gather(barrier_type, kmp_info*, int, int, void (*)(void*, void*), void*) | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 4 | NA | 7.55 | NA | 0.01 | NA | 0.00 |
__kmp_launch_thread | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 2 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
__kmp_invoke_microtask | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 2 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
__kmp_finish_implicit_task | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 2 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
gomp_barrier_wait | libgomp.so.1.0.0 | 0.00 | NA | 0.00 | NA | 0.00 | NA | 3 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA |
__kmp_fork_barrier(int, int) | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
__kmp_join_barrier(int) | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
__kmp_barrier | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
__kmpc_for_static_init_4 | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
__kmpc_reduce_nowait | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
__memset | libastring.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
__kmp_determine_reduction_method | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
gomp_team_barrier_wait_final | libgomp.so.1.0.0 | 0.00 | NA | 0.00 | NA | 0.00 | NA | 2 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA |
gomp_thread_start | libgomp.so.1.0.0 | 0.00 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA |
unknown_kernel_region | kernel | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 24 | 26 | NA | NA | 0.00 | 0.00 | 0.00 | 0.00 |
unknown_function | binary | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 2 | NA | NA | NA | 0.00 | NA | 0.00 |