- r_1 - gcc_o3_ov1_o52/ - 4 analyzed loop(s)- Loop 23 - spmxv.exe
- Loop 21 - spmxv.exe
- Loop 20 - spmxv.exe
- Loop 22 - spmxv.exe
 
- r_2 - icx_o3_ov1_o52/ - 5 analyzed loop(s)- Loop 16 - spmxv.exe
- Loop 14 - spmxv.exe
- Loop 15 - spmxv.exe
- Loop 13 - spmxv.exe
- Loop 12 - spmxv.exe
 
| Analysis | Count | Percentage | Weighted Count | 
|---|
| ▼Loop Computation Issues– | 6 |  |  | 
| ○Presence of a large number of scalar integer instructions | 4 | 44.44 | 0.22 | 
| ○Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | 11.11 | 0.19 | 
| ○Low iteration count | 1 | 11.11 | 0.16 | 
| ▼Control Flow Issues– | 15 |  |  | 
| ○Non-innermost loop | 6 | 66.67 | 0.22 | 
| ○Presence of 2 to 4 paths | 5 | 55.56 | 0.22 | 
| ○Presence of calls | 2 | 22.22 | 0.00 | 
| ○Low iteration count | 1 | 11.11 | 0.16 | 
| ○Presence of more than 4 paths | 1 | 11.11 | 0.00 | 
| ▼Data Access Issues– | 13 |  |  | 
| ○Presence of indirect access | 5 | 55.56 | 1.52 | 
| ○More than 20% of the loads are accessing the stack | 4 | 44.44 | 0.19 | 
| ○Presence of constant non-unit stride data access | 2 | 22.22 | 0.04 | 
| ○Presence of expensive instructions: scatter/gather | 1 | 11.11 | 0.51 | 
| ○Presence of special instructions executing on a single port | 1 | 11.11 | 0.19 | 
| ▼Vectorization Roadblocks– | 21 |  |  | 
| ○Non-innermost loop | 6 | 66.67 | 0.22 | 
| ○Presence of indirect access | 5 | 55.56 | 1.52 | 
| ○Presence of 2 to 4 paths | 5 | 55.56 | 0.22 | 
| ○Presence of calls | 2 | 22.22 | 0.00 | 
| ○Presence of constant non-unit stride data access | 2 | 22.22 | 0.04 | 
| ○Presence of more than 4 paths | 1 | 11.11 | 0.00 | 
| ▼Inefficient Vectorization– | 2 |  |  | 
| ○Presence of expensive instructions: scatter/gather | 1 | 11.11 | 0.51 | 
| ○Presence of special instructions executing on a single port | 1 | 11.11 | 0.19 | 
| Analysis | r_1 | r_2 | 
|---|
| Loop Computation Issues | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 0 | 1 | 
|---|
| Presence of a large number of scalar integer instructions | 2 | 2 | 
| Low iteration count | 0 | 1 | 
| Control Flow Issues | Presence of calls | 1 | 1 | 
|---|
| Presence of 2 to 4 paths | 3 | 2 | 
| Presence of more than 4 paths | 0 | 1 | 
| Non-innermost loop | 3 | 3 | 
| Low iteration count | 0 | 1 | 
| Data Access Issues | Presence of constant non-unit stride data access | 2 | 0 | 
|---|
| Presence of indirect access | 3 | 2 | 
| Presence of expensive instructions: scatter/gather | 0 | 1 | 
| Presence of special instructions executing on a single port | 0 | 1 | 
| More than 20% of the loads are accessing the stack | 1 | 3 | 
| Vectorization Roadblocks | Presence of calls | 1 | 1 | 
|---|
| Presence of 2 to 4 paths | 3 | 2 | 
| Presence of more than 4 paths | 0 | 1 | 
| Non-innermost loop | 3 | 3 | 
| Presence of constant non-unit stride data access | 2 | 0 | 
| Presence of indirect access | 3 | 2 | 
| Inefficient Vectorization | Presence of expensive instructions: scatter/gather | 0 | 1 | 
|---|
| Presence of special instructions executing on a single port | 0 | 1 |