Loops
quants.c: 910 - 0.63 %
Run orig_default | Run gcc_default | Run armclang_4 | Run gcc_4 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2276 | 0.04 | 0.03 | 0.15 | 72.22 | 38.54 | 2058 | 0.04 | 0.03 | 0.18 | 70 | 36.88 | 2366 | 0.04 | 0.03 | 0.17 | 81.25 | 42.19 | 2103 | 0.04 | 0.02 | 0.14 | 70 | 36.88 |
Sum on 1 analyzed binary loop (libggml-cpu.so - 2276) | Sum on 1 analyzed binary loop (libggml-cpu.so - 2058) | Sum on 1 analyzed binary loop (libggml-cpu.so - 2366) | Sum on 1 analyzed binary loop (libggml-cpu.so - 2103) | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | ||||||||||||||||||||
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||||
Presence of a large number of scalar integer instructions | Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||||
Data Access Issues | Data Access Issues | Data Access Issues | Data Access Issues | ||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | ||||||||||||||||||
Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||||||
Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access |
ggml-cpu.c: 1125 - 0.13 %
Run orig_default | Run gcc_default | Run armclang_4 | Run gcc_4 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
64 | 0.02 | 0.00 | 0.02 | 0 | 26.52 | 62 | 0.02 | 0.01 | 0.04 | 0 | 30.47 | 53 | 0.02 | 0.01 | 0.04 | 0 | 26.52 | 66 | 0.02 | 0.01 | 0.03 | 0 | 28.06 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
ops.cpp: 6446 - 0.04 %
Run orig_default | Run gcc_default | Run armclang_4 | Run gcc_4 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1437 | 0.02 | 0.00 | 0.01 | 47.06 | 22.06 | 790 | 0.01 | 0.00 | 0.01 | 42.86 | 35.71 | 1531 | 0.02 | 0.00 | 0.01 | 54.55 | 23.86 | 830 | 0.01 | 0.00 | 0.01 | 42.86 | 35.71 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
ggml-cpu.c: 3228 - 0.03 %
Run orig_default | Run gcc_default | Run armclang_4 | Run gcc_4 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 0.03 | 0.00 | 0.01 | 90.48 | 95.39 | 3 | 0.04 | 0.00 | 0.01 | 95.35 | 93.9 | 0 | 0.02 | 0.00 | 0.01 | 97.22 | 50.69 | 1 | 0.05 | 0.00 | 0.01 | 100 | 51.72 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
ops.cpp: 4325 - 0.02 %
Run orig_default | Run gcc_default | Run armclang_4 | Run gcc_4 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1282 | 0.09 | 0.00 | 0.01 | 94.12 | 95.59 | 1151 | 0.08 | 0.00 | 0.01 | 0 | 15.63 | 1294 | 0.01 | 0.00 | 0.00 | 91.67 | 47.92 | 1194 | 0.05 | 0.00 | 0.00 | 100 | 48.89 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
binary-ops.cpp: 18 - 0.02 %
Run orig_default | Run gcc_default | Run armclang_4 | Run gcc_4 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
541 | 0.08 | 0.00 | 0.01 | 0 | 14.47 | 498 | 0.05 | 0.00 | 0.00 | 100 | 50 | 484 | 0.03 | 0.00 | 0.00 | 0 | 14.47 | 515 | 0.06 | 0.00 | 0.01 | 100 | 50 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
binary-ops.cpp: 10 - 0.02 %
Run orig_default | Run gcc_default | Run armclang_4 | Run gcc_4 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
431 | 0.09 | 0.00 | 0.01 | 0 | 15 | 424 | 0.05 | 0.00 | 0.00 | 100 | 50 | 380 | 0.03 | 0.00 | 0.00 | 0 | 15 | 431 | 0.04 | 0.00 | 0.00 | 100 | 50 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
sampling.cpp: 125 - 0.01 %
Run orig_default | Run gcc_default | Run armclang_4 | Run gcc_4 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3048 | 0.02 | 0.00 | 0.00 | 75 | 62.5 | 4137 | 0.03 | 0.00 | 0.00 | 25 | 18.75 | 3346 | 0.00 | 0.00 | 0.00 | 75 | 62.5 | 4449 | 0.09 | 0.00 | 0.01 | 16.13 | 16.13 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
hashtable.h: 2627 - 0.00 %
Run orig_default | Run gcc_default | Run armclang_4 | Run gcc_4 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2728 | 0.00 | 0.00 | 0.00 | 0 | 25 | 4164 | 0.02 | 0.00 | 0.00 | 0 | 25 | 3081 | 0.00 | 0.00 | 0.00 | 0 | 25 | 4239 | 0.01 | 0.00 | 0.00 | 0 | 25 |
2744 | 0.00 | 0.00 | 0.00 | 0 | 25 | ||||||||||||||||||
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
llama-vocab.cpp: 2148 - 0.00 %
Run orig_default | Run gcc_default | Run armclang_4 | Run gcc_4 | ||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2568 | 0.00 | 0.00 | 0.00 | 0 | 22.39 | 3988 | 0.00 | 0.00 | 0.00 | 6.67 | 18.59 | 2870 | 0.00 | 0.00 | 0.00 | 0 | 22.39 | 4060 | 0.01 | 0.00 | 0.00 | 6.2 | 18.14 |
No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |