options

Loops Index

13 loops have been discarded from the report because their coverage is lower than the threshold set by object_coverage_threshold (0.01%). It represents about 0.02% of the application. To include them, change the value of object_coverage_threshold in the experiment directory configuration file, then rerun the command with the additionnal parameter --force-static-analysis

Columns Filter

Level Exclusive Coverage icx_7 (%) Inclusive Coverage icx_7 (%) Max Exclusive Time Over Threads icx_7 (s) Max Inclusive Time Over Threads icx_7 (s) Exclusive Time w.r.t. Wall Time icx_7 (s) Inclusive Time w.r.t. Wall Time icx_7 (s) Nb Threads icx_7 GFLOPS icx_7 Vectorization Ratio (%) Vector Length Use (%) Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing icx_7 Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect
Loop idSource LocationSource FunctionLevelExclusive Coverage icx_7 (%)Inclusive Coverage icx_7 (%)Max Exclusive Time Over Threads icx_7 (s)Max Inclusive Time Over Threads icx_7 (s)Exclusive Time w.r.t. Wall Time icx_7 (s)Inclusive Time w.r.t. Wall Time icx_7 (s)Nb Threads icx_7GFLOPS icx_7Vectorization Ratio (%)Vector Length Use (%)Speedup If No Scalar IntegerSpeedup If FP VectorizedSpeedup If Fully VectorizedSpeedup If Perfect Load Balancing icx_7Stride 0Stride 1Stride nStride UnknownStride Indirect
81exec - ljForce.c:191-216 [...]ljForce.extractedInnermost56.6356.6353.7553.7547.4547.4572213.6835.9316.9912.495.781.141.33100.670
87exec - timestep.c:74-78advanceVelocity.extractedInnermost1.741.741.601.601.461.467216.25012.51181.102000
54exec - haloExchange.c:621-629sortAtomsInCellSingle1.261.261.201.201.051.05720.005022.921.613.661.1405000
83exec - ljForce.c:157-158 [...]ljForce.extracted.27Single1.161.161.121.120.970.97720.005015.63213.21.1601000
89exec - timestep.c:88-94advancePosition.extractedInnermost0.840.840.850.850.710.717215.4996.4376.34111.011.201003
80exec - ljForce.c:187-216 [...]ljForce.extractedInBetween0.6957.320.8454.470.5848.0372285.41012.51181.4501010
75exec - linkCells.c:295-378 [...]updateLinkCellsInnermost0.340.3410.2510.250.290.29230.4721.8812.71.71.7410.361NANANANANA
79exec - ljForce.c:178-216 [...]ljForce.extractedInBetween0.2457.560.2754.690.2048.2372309.23010.4211141.36NANANANANA
88exec - timestep.c:85-94advancePosition.extractedOutermost0.181.030.240.990.150.867239.1380651.3211.121.56NANANANANA
44exec - haloExchange.c:380-389loadAtomsBufferInnermost0.050.051.491.490.040.04216.3930.7713.941.581.274.34105000
55exec - haloExchange.c:633-642sortAtomsInCellSingle0.040.040.070.070.030.03720.0033.3314.581.613.662.0105000
94exec - timestep.c:110-116kineticEnergy.extractedInnermost0.040.040.060.060.030.037235.3495.2479.17111.022.1301002
45exec - haloExchange.c:414-424unloadAtomsBufferSingle0.020.020.600.600.020.0226.40011.251.3319.141.0801000
93exec - timestep.c:107-116kineticEnergy.extractedOutermost0.020.050.030.080.010.047264.1972.1351.841.261.061.142.29NANANANANA
78exec - ljForce.c:172-216 [...]ljForce.extractedOutermost0.0257.570.0454.700.0148.2472344.7509.381113.333.5NANANANANA
×