Loops
▶quants.c: 1089 - 6.96 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | | | | | | | 1976 | 0.05 | 2.95 | 6.96 | 48.48 | 71.4 |
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 1976) | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
| Loop Computation Issues | |||||||||||||||||
| Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||
| Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||
| Data Access Issues | |||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||
| Vectorization Roadblocks | |||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||
▶quants.c: 979 - 5.49 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2447 | 0.05 | 0.03 | 0.27 | 56.25 | 68.36 | 1940 | 0.07 | 0.06 | 5.22 | 58.06 | 69.56 | ||||||
| Sum on 1 analyzed binary loop (libggml-cpu.so - 2447) | Sum on 1 analyzed binary loop (libggml-cpu.so - 1940) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
| Loop Computation Issues | Loop Computation Issues | ||||||||||||||||
| Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | ||||||||||||||
| Data Access Issues | Data Access Issues | ||||||||||||||||
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||
| Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||||
▶quants.c: 910 - 4.94 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2450 | 0.04 | 0.02 | 0.20 | 10.53 | 49.34 | 1938 | 0.04 | 0.01 | 0.60 | 16.22 | 51.35 | 1974 | 0.04 | 1.76 | 4.15 | 16.22 | 51.35 |
| Sum on 1 analyzed binary loop (libggml-cpu.so - 2450) | Sum on 1 analyzed binary loop (libggml-cpu.so - 1938) | Sum on 1 analyzed binary loop (libggml-cpu.so - 1974) | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
| Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | |||||||||||||||
| Presence of a large number of scalar integer instructions | Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | |||||||||||||
| Data Access Issues | Data Access Issues | Data Access Issues | |||||||||||||||
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||
| Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | |||||||||||||||
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 1 | ||||||||||||
▶ops.cpp: 6210 - 0.85 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | 790 | 0.02 | 0.01 | 0.46 | 2.33 | 20.2 | 814 | 0.01 | 0.17 | 0.39 | 0 | 25.52 |
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 790) | Sum on 1 analyzed binary loop (libggml-cpu.so - 814) | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
| Loop Computation Issues | Loop Computation Issues | ||||||||||||||||
| Presence of expensive FP instructions | 1 | Presence of expensive FP instructions | 1 | ||||||||||||||
| Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 1 | ||||||||||||||
| Control Flow Issues | Control Flow Issues | ||||||||||||||||
| Presence of calls | 1 | Presence of calls | 1 | ||||||||||||||
| Presence of more than 4 paths | 1 | Presence of more than 4 paths | 0 | ||||||||||||||
| Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||
| Presence of calls | 1 | Presence of calls | 1 | ||||||||||||||
| Presence of more than 4 paths | 1 | Presence of more than 4 paths | 1 | ||||||||||||||
▶ops.cpp: 6238 - 0.46 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | | | | | | | 838 | 0.01 | 0.20 | 0.46 | 0 | 28.37 |
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 838) | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
| Loop Computation Issues | |||||||||||||||||
| Presence of expensive FP instructions | 1 | ||||||||||||||||
| Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||
| Control Flow Issues | |||||||||||||||||
| Presence of calls | 1 | ||||||||||||||||
| Data Access Issues | |||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||
| Presence of indirect access | 1 | ||||||||||||||||
| Vectorization Roadblocks | |||||||||||||||||
| Presence of calls | 1 | ||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||
| Presence of indirect access | 1 | ||||||||||||||||
▶vec.cpp: 385 - 0.22 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1015 | 0.02 | 0.01 | 0.04 | 68.18 | 82.24 | 767 | 0.01 | 0.00 | 0.18 | 80 | 97.68 | ||||||
| Sum on 1 analyzed binary loop (libggml-cpu.so - 1015) | Sum on 1 analyzed binary loop (libggml-cpu.so - 767) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
| Loop Computation Issues | Loop Computation Issues | ||||||||||||||||
| Presence of expensive FP instructions | 1 | Presence of expensive FP instructions | 1 | ||||||||||||||
| Data Access Issues | Data Access Issues | ||||||||||||||||
| Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | |||||||||||||||
| Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||
| Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | |||||||||||||||
▶ggml-cpu.c: 3228 - 0.22 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | 6 | 0.01 | 0.00 | 0.09 | 0 | 18.47 | 1 | 0.01 | 0.06 | 0.13 | 72.6 | 83.56 |
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 6) | Sum on 1 analyzed binary loop (libggml-cpu.so - 1) | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
| Loop Computation Issues | Loop Computation Issues | ||||||||||||||||
| Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | |||||||||||||||
| Data Access Issues | Data Access Issues | ||||||||||||||||
| Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | |||||||||||||||
| Vectorization Roadblocks | Vectorization Roadblocks | ||||||||||||||||
| Presence of constant non-unit stride data access | Presence of constant non-unit stride data access | 1 | |||||||||||||||
▶<unknown>: 0 - 0.18 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2483 | 0.00 | 0.00 | 0.00 | 0 | 0 | 3907 | 0.00 | 0.00 | 0.01 | 0 | 0 | 4210 | 0.00 | 0.00 | 0.01 | 0 | 0 |
| 2480 | 0.00 | 0.00 | 0.00 | 0 | 0 | 4301 | 0.00 | 0.00 | 0.01 | 0 | 0 | 3801 | 0.00 | 0.00 | 0.01 | 0 | 0 |
| 2886 | 0.01 | 0.00 | 0.00 | 0 | 0 | 4320 | 0.00 | 0.00 | 0.01 | 0 | 0 | 1677 | 0.00 | 0.00 | 0.01 | 0 | 0 |
| 2514 | 0.00 | 0.00 | 0.00 | 0 | 0 | 1626 | 0.00 | 0.00 | 0.01 | 0 | 0 | 3797 | 0.00 | 0.00 | 0.01 | 0 | 0 |
| 2767 | 0.01 | 0.00 | 0.00 | 0 | 0 | 4127 | 0.00 | 0.00 | 0.01 | 0 | 0 | 3973 | 0.00 | 0.00 | 0.01 | 0 | 0 |
| 2650 | 0.01 | 0.00 | 0.00 | 0 | 0 | 2010 | 0.00 | 0.00 | 0.01 | 0 | 0 | 3971 | 0.00 | 0.00 | 0.01 | 0 | 0 |
| 0 | 0.00 | 0.00 | 0.00 | 0 | 0 | 51 | 0.01 | 0.00 | 0.01 | 0 | 0 | 4211 | 0.00 | 0.00 | 0.01 | 0 | 0 |
| 77 | 0.00 | 0.00 | 0.00 | 0 | 0 | 64 | 0.00 | 0.00 | 0.01 | 0 | 0 | ||||||
| 96 | 0.00 | 0.00 | 0.00 | 0 | 0 | 63 | 0.00 | 0.00 | 0.01 | 0 | 0 | ||||||
| 1900 | 0.01 | 0.00 | 0.00 | 0 | 0 | 1537 | 0.00 | 0.00 | 0.01 | 0 | 0 | ||||||
| 434 | 0.02 | 0.00 | 0.01 | 0 | 0 | 1956 | 0.00 | 0.00 | 0.01 | 0 | 0 | ||||||
| 1570 | 0.01 | 0.00 | 0.00 | 0 | 0 | ||||||||||||
| 1401 | 0.00 | 0.00 | 0.00 | 0 | 0 | ||||||||||||
| 2427 | 0.00 | 0.00 | 0.00 | 0 | 0 | ||||||||||||
| 82 | 0.00 | 0.00 | 0.00 | 0 | 0 | ||||||||||||
| 78 | 0.01 | 0.00 | 0.00 | 0 | 0 | ||||||||||||
| 1007 | 0.01 | 0.00 | 0.00 | 0 | 0 | ||||||||||||
| 551 | 0.00 | 0.00 | 0.00 | 0 | 0 | ||||||||||||
| 1398 | 0.01 | 0.00 | 0.01 | 0 | 0 | ||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
▶vec.h: 486 - 0.17 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | | | | | | | 1526 | 0.01 | 0.07 | 0.17 | 10 | 39.38 |
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 1526) | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
| Loop Computation Issues | |||||||||||||||||
| Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||
| Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||
| Control Flow Issues | |||||||||||||||||
| Presence of calls | 1 | ||||||||||||||||
| Vectorization Roadblocks | |||||||||||||||||
| Presence of calls | 1 | ||||||||||||||||
| Presence of more than 4 paths | 1 | ||||||||||||||||
▶vec.cpp: 390 - 0.17 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | | | | | | | 790 | 0.01 | 0.07 | 0.17 | 85.29 | 97.06 |
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 790) | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
| Loop Computation Issues | |||||||||||||||||
| Presence of expensive FP instructions | 1 | ||||||||||||||||
| Data Access Issues | |||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||
| Vectorization Roadblocks | |||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||
▶ops.cpp: 4325 - 0.15 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | 1127 | 0.01 | 0.00 | 0.08 | 0 | 26.56 | 1170 | 0.00 | 0.03 | 0.08 | 17.39 | 56.52 |
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 1127) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
| Loop Computation Issues | |||||||||||||||||
| Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||
▶binary-ops.cpp: 18 - 0.14 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 495 | 0.01 | 0.00 | 0.05 | 25 | 100 | 502 | 0.00 | 0.04 | 0.09 | 25 | 100 | ||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
▶binary-ops.cpp: 10 - 0.11 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | 411 | 0.01 | 0.00 | 0.06 | 25 | 100 | 416 | 0.00 | 0.02 | 0.05 | 25 | 100 |
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 411) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
| Loop Computation Issues | |||||||||||||||||
| Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||
| Data Access Issues | |||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||
| Vectorization Roadblocks | |||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||
▶vec.h: 491 - 0.11 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | | | | | | | 1518 | 0.00 | 0.05 | 0.11 | 27.59 | 100 |
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 1518) | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
| Data Access Issues | |||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||
| Vectorization Roadblocks | |||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||
▶ops.cpp: 8787 - 0.11 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | | | | | | | 1533 | 0.01 | 0.05 | 0.11 | 0 | 49.75 |
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 1533) | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
| Loop Computation Issues | |||||||||||||||||
| Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||
| Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||
| Control Flow Issues | |||||||||||||||||
| Vectorization Roadblocks | |||||||||||||||||
| Presence of more than 4 paths | 1 | ||||||||||||||||
▶vec.cpp: 231 - 0.09 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | 764 | 0.01 | 0.00 | 0.09 | 100 | 100 | | | | | | |
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 764) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
| Data Access Issues | |||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||
| Vectorization Roadblocks | |||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||
▶vec.cpp: 311 - 0.06 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 782 | 0.00 | 0.03 | 0.06 | 52.63 | 100 | ||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
▶hashtable.h: 2386 - 0.05 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4180 | 0.01 | 0.00 | 0.01 | 0 | 50 | 4197 | 0.01 | 0.01 | 0.02 | 0 | 50 | ||||||
| 4247 | 0.01 | 0.01 | 0.02 | 0 | 50 | ||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
▶vec.h: 411 - 0.05 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1454 | 0.01 | 0.00 | 0.05 | 100 | 100 | ||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
▶ops.cpp: 6446 - 0.04 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 800 | 0.00 | 0.00 | 0.01 | 37.5 | 81.25 | 848 | 0.00 | 0.01 | 0.03 | 37.5 | 81.25 | ||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
▶ops.cpp: 6220 - 0.03 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1565 | 0.02 | 0.00 | 0.03 | 0 | 26.34 | ||||||||||||
| Sum on 1 analyzed binary loop (libggml-cpu.so - 1565) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
| Loop Computation Issues | |||||||||||||||||
| Presence of expensive FP instructions | 1 | ||||||||||||||||
| Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||
| Control Flow Issues | |||||||||||||||||
| Presence of calls | 1 | ||||||||||||||||
| Data Access Issues | |||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||
| Vectorization Roadblocks | |||||||||||||||||
| Presence of calls | 1 | ||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||
▶basic_string.h: 6237 - 0.03 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4472 | 0.01 | 0.00 | 0.02 | 0 | 50 | ||||||||||||
| 4141 | 0.01 | 0.00 | 0.02 | 0 | 50 | ||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
▶ops.cpp: 8825 - 0.02 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1447 | 0.01 | 0.00 | 0.02 | 0 | 0 | ||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
▶vec.h: 677 - 0.02 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1169 | 0.00 | 0.01 | 0.02 | 50 | 100 | ||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
▶quants.c: 49 - 0.02 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1915 | 0.01 | 0.00 | 0.02 | 26.25 | 39.66 | ||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
▶llama-vocab.cpp: 3216 - 0.02 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3732 | 0.01 | 0.00 | 0.02 | 0 | 50 | ||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
▶ggml-cpu.c: 1193 - 0.01 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 55 | 0.00 | 0.00 | 0.01 | 0 | 46.88 | ||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||
▶vec.h: 646 - 0.01 %
| Run orig_default | Run gcc_default | Run gcc_3 | |||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1126 | 0.01 | 0.00 | 0.01 | 100 | 100 | ||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | |||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||

