Run Neoverse V1 ACFL Ofast | Run Neoverse V2 ACFL Ofast |
/home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 114-123 | /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 113-122 |
ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
15 | 70.87 | 1.49 | 1.75 | 64 | 8.16 | 0.18 | 17.04 | 14 | 78.71 | 1.83 | 2.18 | 64 | 6.98 | 0.18 | 460.38 |
Run Neoverse V1 ACFL Ofast | Run Neoverse V2 ACFL Ofast |
ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
1 | 0.05 | 0.00 | 0.01 | 12 | 0.02 | 0.00 | 0.00 | 1 | 0.09 | 0.00 | 0.01 | 23 | 0.08 | 0.00 | 0.00 |
528 | 0.99 | 0.02 | 0.05 | 62 | 0.56 | 0.01 | 0.00 | 529 | 0.97 | 0.02 | 0.05 | 62 | 0.60 | 0.01 | 0.00 |
1241 | 0.04 | 0.00 | 0.01 | 8 | 0.11 | 0.00 | 0.00 | 1902 | 0.00 | 0.00 | 0.01 | 1 | 0.00 | 0.00 | 0.00 |
854 | 19.33 | 0.41 | 0.63 | 64 | 6.21 | 0.11 | 0.00 | 1242 | 0.03 | 0.00 | 0.01 | 8 | 0.08 | 0.00 | 0.00 |
437 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00 | 855 | 15.08 | 0.35 | 0.75 | 64 | 6.07 | 0.13 | 0.00 |
1137 | 0.83 | 0.02 | 0.04 | 59 | 0.53 | 0.01 | 0.00 | 492 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | NA |
-1 | 0.00 | 0.00 | 0.00 | 3 | 0.00 | 0.00 | NA | 71 | 0.46 | 0.01 | 0.03 | 54 | 0.32 | 0.01 | 0.00 |
| | | | | | | | 16 | 4.65 | 0.11 | 0.13 | 64 | 1.16 | 0.03 | 41.56 |
| | | | | | | | -1 | 0.00 | 0.00 | 0.00 | 21 | 0.00 | 0.00 | NA |
Run Neoverse V1 ACFL Ofast | Run Neoverse V2 ACFL Ofast |
/home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 139-144 | |
ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
17 | 7.87 | 0.17 | 0.19 | 64 | 2.42 | 0.05 | 1.20 | | | | | | | | |
Name | Module | Neoverse V1 ACFL Ofast: Coverage (%) | Neoverse V2 ACFL Ofast: Coverage (%) | Neoverse V1 ACFL Ofast: Inclusive Time w.r.t. Wall Time (s) | Neoverse V2 ACFL Ofast: Inclusive Time w.r.t. Wall Time (s) | Neoverse V1 ACFL Ofast: Max Inc. Time over Threads (s) | Neoverse V2 ACFL Ofast: Max Inc. Time over Threads (s) | Neoverse V1 ACFL Ofast: Nb Threads | Neoverse V2 ACFL Ofast: Nb Threads | Neoverse V1 ACFL Ofast: GFLOP/s | Neoverse V2 ACFL Ofast: GFLOP/s | Neoverse V1 ACFL Ofast: Deviation (coverage) | Neoverse V2 ACFL Ofast: Deviation (coverage) | Neoverse V1 ACFL Ofast: Deviation (time) | Neoverse V2 ACFL Ofast: Deviation (time) |
k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined] | binary | 70.87 | 78.71 | 1.49 | 1.83 | 1.75 | 2.18 | 64 | 64 | 17.04 | 460.38 | 8.16 | 6.98 | 0.18 | 0.18 |
kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libomp.so | 19.33 | 15.08 | 0.41 | 0.35 | 0.63 | 0.75 | 64 | 64 | 0.00 | 0.00 | 6.21 | 6.07 | 0.11 | 0.13 |
k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined.3] | binary | 7.87 | 4.65 | 0.17 | 0.11 | 0.19 | 0.13 | 64 | 64 | 1.20 | 41.56 | 2.42 | 1.16 | 0.05 | 0.03 |
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libomp.so | 0.99 | 0.97 | 0.02 | 0.02 | 0.05 | 0.05 | 62 | 62 | 0.00 | 0.00 | 0.56 | 0.60 | 0.01 | 0.01 |
__sched_yield | libc.so.6 | 0.83 | NA | 0.02 | NA | 0.04 | NA | 59 | NA | 0.00 | NA | 0.53 | NA | 0.01 | NA |
__sched_yield | libc.so.6 | NA | 0.46 | NA | 0.01 | NA | 0.03 | NA | 54 | NA | 0.00 | NA | 0.32 | NA | 0.01 |
@plt_start@ | libomp.so | 0.05 | 0.09 | 0.00 | 0.00 | 0.01 | 0.01 | 12 | 23 | 0.00 | 0.00 | 0.02 | 0.08 | 0.00 | 0.00 |
__kmp_yield | libomp.so | 0.04 | 0.03 | 0.00 | 0.00 | 0.01 | 0.01 | 8 | 8 | 0.00 | 0.00 | 0.11 | 0.08 | 0.00 | 0.00 |
__kmp_resume_if_soft_paused | libomp.so | 0.00 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA |
__aarch64_ldadd8_acq_rel | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.01 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
unknown_kernel_region | kernel | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3 | 21 | NA | NA | 0.00 | 0.00 | 0.00 | 0.00 |
__kmp_invoke_task_func | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | NA | NA | 0.00 | NA | 0.00 |