Loops
quants.c: 322 - 273.02 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_3 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
310 | 8.68 | 8.18 | 68.63 | 66.67 | 59.52 | 354 | 8.33 | 8.22 | 67.42 | 70.83 | 59.38 | 379 | 8.47 | 8.18 | 70.86 | 38.46 | 53.85 | 351 | 8.04 | 7.92 | 66.11 | 53.33 | 61.67 |
Sum on 1 analyzed binary loop (libggml-cpu.so - 310) | Sum on 1 analyzed binary loop (libggml-cpu.so - 354) | Sum on 1 analyzed binary loop (libggml-cpu.so - 379) | Sum on 1 analyzed binary loop (libggml-cpu.so - 351) | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||
Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | Presence of a large number of scalar integer instructions | ||||||||||||||||||
Data Access Issues | Data Access Issues | Data Access Issues | Data Access Issues | ||||||||||||||||||||
Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | |||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||
Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 |
vec.cpp: 311 - 0.60 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_3 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
760 | 0.05 | 0.01 | 0.08 | 100 | 75 | 762 | 0.04 | 0.01 | 0.07 | 100 | 75 | 908 | 0.16 | 0.03 | 0.30 | 100 | 100 | 788 | 0.12 | 0.02 | 0.16 | 100 | 100 |
Sum on 1 analyzed binary loop (libggml-cpu.so - 760) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 908) | Sum on 1 analyzed binary loop (libggml-cpu.so - 788) | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Data Access Issues | Data Access Issues | Data Access Issues | |||||||||||||||||||||
Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | |||||||||||||||||||||
Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 |
vec.h: 491 - 0.56 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_3 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1706 | 0.09 | 0.02 | 0.16 | 100 | 81.82 | 1561 | 0.13 | 0.03 | 0.22 | 69.23 | 69.23 | 1908 | 0.09 | 0.02 | 0.14 | 100 | 100 | 1622 | 0.04 | 0.01 | 0.05 | 100 | 100 |
Sum on 1 analyzed binary loop (libggml-cpu.so - 1706) | Sum on 1 analyzed binary loop (libggml-cpu.so - 1561) | Sum on 1 analyzed binary loop (libggml-cpu.so - 1908) | Sum on 1 analyzed binary loop (libggml-cpu.so - 1622) | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Data Access Issues | Data Access Issues | Data Access Issues | Data Access Issues | ||||||||||||||||||||
Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | |||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||
Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 |
ggml-cpu.c: 3228 - 0.04 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_3 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 0.02 | 0.00 | 0.01 | 97.5 | 98.75 | 6 | 0.06 | 0.00 | 0.02 | 0 | 24.4 | 0 | 0.04 | 0.00 | 0.01 | 97.22 | 98.61 | 1 | 0.01 | 0.00 | 0.01 | 100 | 100 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
ops.cpp: 6446 - 0.03 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_3 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1390 | 0.02 | 0.00 | 0.01 | 40 | 45 | 808 | 0.01 | 0.00 | 0.01 | 42.86 | 57.14 | 1591 | 0.01 | 0.00 | 0.01 | 66.67 | 47.22 | 830 | 0.01 | 0.00 | 0.01 | 42.86 | 57.14 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
binary-ops.cpp: 18 - 0.03 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_3 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
429 | 0.09 | 0.00 | 0.01 | 0 | 25 | 505 | 0.05 | 0.00 | 0.00 | 100 | 100 | 541 | 0.10 | 0.00 | 0.01 | 0 | 28.95 | 515 | 0.07 | 0.00 | 0.01 | 100 | 100 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
binary-ops.cpp: 10 - 0.03 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_3 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
343 | 0.11 | 0.00 | 0.01 | 0 | 30 | 429 | 0.03 | 0.00 | 0.00 | 100 | 100 | 437 | 0.10 | 0.00 | 0.01 | 0 | 30 | 431 | 0.05 | 0.00 | 0.00 | 100 | 100 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
ops.cpp: 4325 - 0.03 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_3 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1170 | 0.11 | 0.00 | 0.01 | 35 | 62.5 | 1172 | 0.13 | 0.00 | 0.01 | 0 | 31.25 | 1353 | 0.02 | 0.00 | 0.00 | 91.67 | 87.5 | 1193 | 0.03 | 0.00 | 0.00 | 100 | 91.11 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
vec.h: 677 - 0.02 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_3 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1171 | 0.02 | 0.00 | 0.00 | 100 | 100 | 1170 | 0.01 | 0.00 | 0.00 | 100 | 100 | 1914 | 0.01 | 0.00 | 0.00 | 100 | 100 | 1197 | 0.01 | 0.00 | 0.00 | 100 | 100 |
1688 | 0.00 | 0.00 | 0.00 | 100 | 100 | 1555 | 0.01 | 0.00 | 0.00 | 100 | 100 | 1354 | 0.02 | 0.00 | 0.00 | 100 | 100 | 1617 | 0.01 | 0.00 | 0.00 | 100 | 100 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
sampling.cpp: 125 - 0.02 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_3 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3048 | 0.03 | 0.00 | 0.00 | 75 | 87.5 | 4137 | 0.05 | 0.00 | 0.00 | 25 | 37.5 | 3346 | 0.04 | 0.00 | 0.00 | 75 | 87.5 | 4449 | 0.06 | 0.00 | 0.01 | 16.13 | 32.26 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
hashtable.h: 2627 - 0.00 %
Run orig_default | Run gcc_default | Run armclang_3 | Run gcc_3 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2744 | 0.00 | 0.00 | 0.00 | 0 | 50 | 4135 | 0.01 | 0.00 | 0.00 | 0 | 50 | 3084 | 0.01 | 0.00 | 0.00 | 0 | 50 | 4238 | 0.00 | 0.00 | 0.00 | 0 | 50 |
4164 | 0.01 | 0.00 | 0.00 | 0 | 50 | ||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |