Run | Source lines | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads (s) | Nb Threads | Deviation (coverage) | Deviation (time) | GFLOP/s |
Skylake ICPX Ofast AoS (base) | /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 114-126 | 13 | 79.28 | 10.82 | 10.11 | 26 | 8.34 | 0.88 | 78.68 |
Skylake ICPX Ofast SoA | | | | | | | | | |
Skylake ICPX Ofast Manual Unroll | /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 113-142 | 13 | 88.79 | 5.37 | 5.06 | 26 | 6.51 | 0.58 | 386.25 |

Run | Source lines | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads (s) | Nb Threads | Deviation (coverage) | Deviation (time) | GFLOP/s |
Skylake ICPX Ofast AoS (base) | | | | | | | | | |
Skylake ICPX Ofast SoA | /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 67-79 | 13 | 95.41 | 12.36 | 12.72 | 26 | 1.98 | 0.68 | 87.65 |
Skylake ICPX Ofast Manual Unroll | | | | | | | | | |

Run | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads (s) | Nb Threads | Deviation (coverage) | Deviation (time) | GFLOP/s |
Skylake ICPX Ofast AoS (base) | 1164 | 0.20 | 0.03 | 0.08 | 23 | 0.16 | 0.02 | 0.00 |
Skylake ICPX Ofast AoS (base) | 1106 | 0.20 | 0.03 | 0.05 | 24 | 0.11 | 0.01 | 0.00 |
Skylake ICPX Ofast AoS (base) | 1163 | 0.00 | 0.00 | 0.01 | 1 | 0.00 | 0.00 | 0.00 |
Skylake ICPX Ofast AoS (base) | 2030 | 0.00 | 0.00 | 0.01 | 1 | 0.00 | 0.00 | 0.00 |
Skylake ICPX Ofast AoS (base) | 2813 | 0.00 | 0.00 | 0.01 | 2 | 0.00 | 0.00 | 0.00 |
Skylake ICPX Ofast AoS (base) | 1102 | 18.04 | 2.46 | 3.76 | 26 | 7.88 | 0.88 | 0.00 |
Skylake ICPX Ofast AoS (base) | 12 | 2.27 | 0.31 | 0.48 | 26 | 1.24 | 0.13 | 9.51 |
Skylake ICPX Ofast AoS (base) | -1 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | NA |
Skylake ICPX Ofast SoA | 1164 | 0.03 | 0.00 | 0.01 | 11 | 0.02 | 0.00 | 0.00 |
Skylake ICPX Ofast SoA | 1106 | 0.01 | 0.00 | 0.01 | 7 | 0.00 | 0.00 | 0.00 |
Skylake ICPX Ofast SoA | 1102 | 1.38 | 0.18 | 0.18 | 26 | 0.31 | 0.03 | 0.00 |
Skylake ICPX Ofast SoA | 12 | 3.17 | 0.41 | 0.62 | 26 | 2.01 | 0.20 | 9.57 |
Skylake ICPX Ofast Manual Unroll | 1164 | 0.02 | 0.00 | 0.01 | 2 | 0.01 | 0.00 | 0.00 |
Skylake ICPX Ofast Manual Unroll | 1106 | 0.02 | 0.00 | 0.01 | 2 | 0.01 | 0.00 | 0.00 |
Skylake ICPX Ofast Manual Unroll | 1102 | 0.81 | 0.05 | 0.04 | 24 | 0.42 | 0.01 | 0.00 |

Run | Source lines | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads (s) | Nb Threads | Deviation (coverage) | Deviation (time) | GFLOP/s |
Skylake ICPX Ofast AoS (base) | | | | | | | | | |
Skylake ICPX Ofast SoA | | | | | | | | | |
Skylake ICPX Ofast Manual Unroll | /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 155-161 | 12 | 10.37 | 0.63 | 0.94 | 26 | 6.43 | 0.21 | 8.80 |

Run names in the column headers below are abbreviated; each carries the full prefix "Skylake ICPX Ofast".
Name | Module | Coverage (%): AoS (base) | Coverage (%): SoA | Coverage (%): Manual Unroll | Inclusive Time w.r.t. Wall Time (s): AoS (base) | Inclusive Time w.r.t. Wall Time (s): SoA | Inclusive Time w.r.t. Wall Time (s): Manual Unroll | Max Inc. Time over Threads (s): AoS (base) | Max Inc. Time over Threads (s): SoA | Max Inc. Time over Threads (s): Manual Unroll | Nb Threads: AoS (base) | Nb Threads: SoA | Nb Threads: Manual Unroll | GFLOP/s: AoS (base) | GFLOP/s: SoA | GFLOP/s: Manual Unroll | Deviation (coverage): AoS (base) | Deviation (coverage): SoA | Deviation (coverage): Manual Unroll | Deviation (time): AoS (base) | Deviation (time): SoA | Deviation (time): Manual Unroll |
k_means(int, point_t*, point_t*, int*, int, int) [clone .extracted.18] | binary | 79.28 | NA | 88.79 | 10.82 | NA | 5.37 | 10.11 | NA | 5.06 | 26 | NA | 26 | 78.68 | NA | 386.25 | 8.34 | NA | 6.51 | 0.88 | NA | 0.58 |
k_means(int, point_t&, point_t&, int*, int, int) [clone .extracted.18] | binary | NA | 95.41 | NA | NA | 12.36 | NA | NA | 12.72 | NA | NA | 26 | NA | NA | 87.65 | NA | NA | 1.98 | NA | NA | 0.68 | NA |
kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libiomp5.so | 18.04 | 1.38 | 0.81 | 2.46 | 0.18 | 0.05 | 3.76 | 0.18 | 0.04 | 26 | 26 | 24 | 0.00 | 0.00 | 0.00 | 7.88 | 0.31 | 0.42 | 0.88 | 0.03 | 0.01 |
k_means(int, point_t*, point_t*, int*, int, int) [clone .extracted] | binary | 2.27 | NA | 10.37 | 0.31 | NA | 0.63 | 0.48 | NA | 0.94 | 26 | NA | 26 | 9.51 | NA | 8.80 | 1.24 | NA | 6.43 | 0.13 | NA | 0.21 |
k_means(int, point_t&, point_t&, int*, int, int) [clone .extracted] | binary | NA | 3.17 | NA | NA | 0.41 | NA | NA | 0.62 | NA | NA | 26 | NA | NA | 9.57 | NA | NA | 2.01 | NA | NA | 0.20 | NA |
__sched_yield | libc.so.6 | 0.20 | 0.03 | 0.02 | 0.03 | 0.00 | 0.00 | 0.08 | 0.01 | 0.01 | 23 | 11 | 2 | 0.00 | 0.00 | 0.00 | 0.16 | 0.02 | 0.01 | 0.02 | 0.00 | 0.00 |
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libiomp5.so | 0.20 | 0.01 | 0.02 | 0.03 | 0.00 | 0.00 | 0.05 | 0.01 | 0.01 | 24 | 7 | 2 | 0.00 | 0.00 | 0.00 | 0.11 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 |
__kmp_yield | libiomp5.so | 0.00 | NA | NA | 0.00 | NA | NA | 0.01 | NA | NA | 2 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA |
__kmpc_for_static_fini | libiomp5.so | 0.00 | NA | NA | 0.00 | NA | NA | 0.01 | NA | NA | 1 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA |
__kmpc_for_static_init_4 | libiomp5.so | 0.00 | NA | NA | 0.00 | NA | NA | 0.01 | NA | NA | 1 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA |
unknown_kernel_region | kernel | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 1 | NA | NA | NA | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA |