Run Neoverse V1 ACFL Ofast | Run Neoverse V2 ACFL Ofast |
Loop Source Regions | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 117-123 | Loop Source Regions | |
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
7 | 1.74 | 1.47 | 70.23 | 41.67 | 37.5 | 17.02 | | | | | | | |
Sum on 1 analyzed binary loop (kmeans-acfl-Ofast - 7) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. |
Analysis | Count | Analysis | Count |
Loop Computation Issues | | | |
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | | |
Control Flow Issues | | | |
Presence of 2 to 4 paths | 1 | | |
Vectorization Roadblocks | | | |
Presence of 2 to 4 paths | 1 | | |
Run Neoverse V1 ACFL Ofast | Run Neoverse V2 ACFL Ofast |
Loop Source Regions | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 140-144 | Loop Source Regions | |
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
25 | 0.19 | 0.17 | 7.87 | 0 | 20.83 | 1.2 | | | | | | | |
Sum on 1 analyzed binary loop (kmeans-acfl-Ofast - 25) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. |
Analysis | Count | Analysis | Count |
Loop Computation Issues | | | |
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | | |
Presence of a large number of scalar integer instructions | 1 | | |
Data Access Issues | | | |
Presence of indirect access | 1 | | |
Vectorization Roadblocks | | | |
Presence of indirect access | 1 | | |
Run Neoverse V1 ACFL Ofast | Run Neoverse V2 ACFL Ofast |
Loop Source Regions | | Loop Source Regions | |
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
| | | | | | | 37 | 0.13 | 0.11 | 4.65 | 0 | 0 | 41.56 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. |
Analysis | Count | Analysis | Count |
Run Neoverse V1 ACFL Ofast | Run Neoverse V2 ACFL Ofast |
Loop Source Regions | | Loop Source Regions | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 114-116 |
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
| | | | | | | 7 | 0.03 | 0.01 | 0.27 | 0 | 37.5 | 498.61 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (kmeans-acfl-Ofast - 7) |
Analysis | Count | Analysis | Count |
| | Control Flow Issues | |
| | Vectorization Roadblocks | |
| | Presence of more than 4 paths | 1 |