options

Statistics

AnalysisCountPercentageWeighted Count
Loop Computation Issues15
Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA1155.000.98
Presence of a large number of scalar integer instructions420.000.70
Control Flow Issues8
Presence of calls420.000.22
Presence of 2 to 4 paths210.000.06
Presence of more than 4 paths210.000.04
Data Access Issues19
More than 20% of the loads are accessing the stack525.000.16
Presence of indirect access420.000.10
Presence of special instructions executing on a single port315.000.07
Presence of constant non-unit stride data access315.000.07
More than 10% of the vector loads instructions are unaligned315.000.07
Presence of expensive instructions: scatter/gather15.000.03
Vectorization Roadblocks17
Presence of calls420.000.22
Presence of indirect access420.000.10
Presence of more than 4 paths420.000.22
Presence of constant non-unit stride data access315.000.07
Presence of 2 to 4 paths210.000.06
Inefficient Vectorization4
Presence of special instructions executing on a single port315.000.07
Presence of expensive instructions: scatter/gather15.000.03

Details

Analysisr_1r_2
Loop Computation IssuesLess than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA65
Presence of a large number of scalar integer instructions13
Control Flow IssuesPresence of calls22
Presence of 2 to 4 paths11
Presence of more than 4 paths11
Data Access IssuesPresence of constant non-unit stride data access12
Presence of indirect access22
More than 10% of the vector loads instructions are unaligned30
Presence of expensive instructions: scatter/gather01
Presence of special instructions executing on a single port30
More than 20% of the loads are accessing the stack23
Vectorization RoadblocksPresence of calls22
Presence of 2 to 4 paths11
Presence of more than 4 paths22
Presence of constant non-unit stride data access12
Presence of indirect access22
Inefficient VectorizationPresence of expensive instructions: scatter/gather01
Presence of special instructions executing on a single port30
×