options

Loops Index

Columns Filter

Level Exclusive Coverage run_1_thread (%) Exclusive Coverage run_2_threads (%) Exclusive Coverage run_4_threads (%) Exclusive Coverage run_8_threads (%) Exclusive Coverage run_10_threads (%) Inclusive Coverage run_1_thread (%) Inclusive Coverage run_2_threads (%) Inclusive Coverage run_4_threads (%) Inclusive Coverage run_8_threads (%) Inclusive Coverage run_10_threads (%) Max Exclusive Time Over Threads run_1_thread (s) Max Exclusive Time Over Threads run_2_threads (s) Max Exclusive Time Over Threads run_4_threads (s) Max Exclusive Time Over Threads run_8_threads (s) Max Exclusive Time Over Threads run_10_threads (s) Max Inclusive Time Over Threads run_1_thread (s) Max Inclusive Time Over Threads run_2_threads (s) Max Inclusive Time Over Threads run_4_threads (s) Max Inclusive Time Over Threads run_8_threads (s) Max Inclusive Time Over Threads run_10_threads (s) Exclusive Time w.r.t. Wall Time run_1_thread (s) Exclusive Time w.r.t. Wall Time run_2_threads (s) Exclusive Time w.r.t. Wall Time run_4_threads (s) Exclusive Time w.r.t. Wall Time run_8_threads (s) Exclusive Time w.r.t. Wall Time run_10_threads (s) Inclusive Time w.r.t. Wall Time run_1_thread (s) Inclusive Time w.r.t. Wall Time run_2_threads (s) Inclusive Time w.r.t. Wall Time run_4_threads (s) Inclusive Time w.r.t. Wall Time run_8_threads (s) Inclusive Time w.r.t. Wall Time run_10_threads (s) Nb Threads run_1_thread Nb Threads run_2_threads Nb Threads run_4_threads Nb Threads run_8_threads Nb Threads run_10_threads GFLOPS run_1_thread GFLOPS run_2_threads GFLOPS run_4_threads GFLOPS run_8_threads GFLOPS run_10_threads Vectorization Ratio (%) Vector Length Use (%) Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing run_1_thread Speedup If Perfect Load Balancing run_2_threads Speedup If Perfect Load Balancing run_4_threads Speedup If Perfect Load Balancing run_8_threads Speedup If Perfect Load Balancing run_10_threads Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect Array Access Efficiency (run_1_thread) Efficiency (run_1_thread) Potential Speed-Up (%) (run_2_threads) Efficiency (run_2_threads) Potential Speed-Up (%) (run_4_threads) Efficiency (run_4_threads) Potential Speed-Up (%) (run_8_threads) Efficiency (run_8_threads) Potential Speed-Up (%) (run_10_threads) Efficiency (run_10_threads) Potential Speed-Up (%)
Loop idSource LocationSource FunctionLevelExclusive Coverage run_1_thread (%)Exclusive Coverage run_2_threads (%)Exclusive Coverage run_4_threads (%)Exclusive Coverage run_8_threads (%)Exclusive Coverage run_10_threads (%)Inclusive Coverage run_1_thread (%)Inclusive Coverage run_2_threads (%)Inclusive Coverage run_4_threads (%)Inclusive Coverage run_8_threads (%)Inclusive Coverage run_10_threads (%)Max Exclusive Time Over Threads run_1_thread (s)Max Exclusive Time Over Threads run_2_threads (s)Max Exclusive Time Over Threads run_4_threads (s)Max Exclusive Time Over Threads run_8_threads (s)Max Exclusive Time Over Threads run_10_threads (s)Max Inclusive Time Over Threads run_1_thread (s)Max Inclusive Time Over Threads run_2_threads (s)Max Inclusive Time Over Threads run_4_threads (s)Max Inclusive Time Over Threads run_8_threads (s)Max Inclusive Time Over Threads run_10_threads (s)Exclusive Time w.r.t. Wall Time run_1_thread (s)Exclusive Time w.r.t. Wall Time run_2_threads (s)Exclusive Time w.r.t. Wall Time run_4_threads (s)Exclusive Time w.r.t. Wall Time run_8_threads (s)Exclusive Time w.r.t. Wall Time run_10_threads (s)Inclusive Time w.r.t. Wall Time run_1_thread (s)Inclusive Time w.r.t. Wall Time run_2_threads (s)Inclusive Time w.r.t. Wall Time run_4_threads (s)Inclusive Time w.r.t. Wall Time run_8_threads (s)Inclusive Time w.r.t. Wall Time run_10_threads (s)Nb Threads run_1_threadNb Threads run_2_threadsNb Threads run_4_threadsNb Threads run_8_threadsNb Threads run_10_threadsGFLOPS run_1_threadGFLOPS run_2_threadsGFLOPS run_4_threadsGFLOPS run_8_threadsGFLOPS run_10_threadsVectorization Ratio (%)Vector Length Use (%)Speedup If No Scalar IntegerSpeedup If FP VectorizedSpeedup If Fully VectorizedSpeedup If Perfect Load Balancing run_1_threadSpeedup If Perfect Load Balancing run_2_threadsSpeedup If Perfect Load Balancing run_4_threadsSpeedup If Perfect Load Balancing run_8_threadsSpeedup If Perfect Load Balancing run_10_threadsStride 0Stride 1Stride nStride UnknownStride IndirectArray Access Efficiency(run_1_thread) Efficiency(run_1_thread) Potential Speed-Up (%)(run_2_threads) Efficiency(run_2_threads) Potential Speed-Up (%)(run_4_threads) Efficiency(run_4_threads) Potential Speed-Up (%)(run_8_threads) Efficiency(run_8_threads) Potential Speed-Up (%)(run_10_threads) Efficiency(run_10_threads) Potential Speed-Up (%)
9kmeans-clang-O3-all - main.cpp:73-79k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone .omp_outlined]Innermost91.6088.4582.8573.5069.5791.6088.4582.8573.5069.57113.9558.4530.8315.7612.56113.9558.4530.8315.7612.56113.9559.6032.5717.6114.42113.9559.6032.5717.6114.421248105.3610.2118.6734.5442.225518.591.242.67.71111.01111000100.00100.963.890.8710.380.8114.050.7914.58
7kmeans-clang-O3-all - main.cpp:93-96k_means(int, point_t*, point_t*, int*, point_t*, int, int)Innermost5.925.645.214.624.355.925.645.214.624.357.367.457.737.877.837.367.457.737.877.837.363.802.051.110.907.363.802.051.110.90111110.681.312.444.525.55011.611.219.6111110200340.00100.970.180.90.530.830.780.820.8
8kmeans-clang-O3-all - main.cpp:71-79k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone .omp_outlined]Outermost2.482.272.292.071.9794.0890.7285.1475.5771.543.091.520.910.550.40117.0459.9231.6116.1012.893.091.530.900.490.41117.0461.1333.4718.1114.821248104.6910.8418.8333.3639.6919.0512.052.11112.9811.011.071.261.13NANANANANA83.33101.0100.860.330.780.450.760.47
×