Expert Summary

| ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Loop 18 | attention-gcc-skl256 | attention_v2.cpp:30-31 | main | Innermost | 9.25 | 9.25 | 33.64 | 1.00 | 2.58 | 14.55 | 1.00 | 1 | 20.00 | 11.25 | 1.12 | 4.00 | 4.00 | 1.55 | 0.28 | 4.00 |
| Loop 15 | attention-gcc-skl256 | attention_v2.cpp:30-31 | main | Innermost | 7.33 | 7.33 | 26.68 | 1.00 | 2.58 | 14.55 | 1.00 | 1 | 20.00 | 11.25 | 1.32 | 4.00 | 4.00 | 1.55 | 0.28 | 4.00 |
| Loop 10 | attention-gcc-skl256 | attention_v2.cpp:30-31 | main | Innermost | 2.02 | 2.02 | 7.35 | 1.00 | 2.58 | 14.55 | 1.00 | 1 | 20.00 | 11.25 | 1.22 | 4.00 | 4.00 | 1.55 | 0.28 | 4.00 |
| Loop 7 | attention-gcc-skl256 | attention_v2.cpp:30-31 | main | Innermost | 2.00 | 2.00 | 7.27 | 1.00 | 2.58 | 14.55 | 1.00 | 1 | 20.00 | 11.25 | 1.35 | 4.00 | 4.00 | 1.55 | 0.28 | 4.00 |
| Loop 4 | attention-gcc-skl256 | attention_v2.cpp:30-31 | main | Innermost | 1.99 | 1.99 | 7.24 | 1.00 | 2.58 | 14.55 | 1.00 | 1 | 20.00 | 11.25 | 1.23 | 4.00 | 4.00 | 1.55 | 0.28 | 4.00 |
| Loop 32 | attention-gcc-skl256 | attention_v2.cpp:55-56 | softmax(float const*, float*, float*, int) | Innermost | 1.26 | 1.26 | 4.58 | 1.00 | 2.00 | 4.00 | 1.00 | 1 | 0.00 | 6.25 | 0.39 | 3.00 | 3.00 | 1.50 | 0.75 | 3.00 |
| Loop 16 | attention-gcc-skl256 | attention_v2.cpp:27-30, attention_v2.cpp:33-33, attention_v2.cpp:236-236 | main | InBetween | 0.69 | 0.69 | 2.53 | 1.57 | 2.75 | 14.67 | 2.75 | 1 | 20.00 | 11.25 | 1.37 | 2.75 | 1.75 | 1.00 | 0.19 | 1.00 |
| Loop 2 | attention-gcc-skl256 | attention_v2.cpp:163-163, random.tcc:458-466, random.tcc:3557-3558 | main | Innermost | 0.22 | 0.22 | 0.78 | 3.08 | 2.12 | 12.83 | 4.63 | 2 | 2.08 | 10.40 | 0.14 | 9.25 | 3.00 | 4.36 | 0.72 | 2.00 |
| Loop 11 | attention-gcc-skl256 | attention_v2.cpp:26-30, attention_v2.cpp:33-33 | main | InBetween | 0.20 | 0.20 | 0.73 | 1.67 | 1.00 | 14.55 | 2.50 | 1 | 25.00 | 12.50 | 0.30 | 2.50 | 1.50 | 2.50 | 0.17 | 1.00 |
| Loop 19 | attention-gcc-skl256 | attention_v2.cpp:26-30, attention_v2.cpp:33-33 | main | InBetween | 0.17 | 0.17 | 0.62 | 1.67 | 1.00 | 14.55 | 2.50 | 1 | 25.00 | 12.50 | 0.71 | 2.50 | 1.50 | 2.50 | 0.17 | 1.00 |
| Loop 8 | attention-gcc-skl256 | attention_v2.cpp:26-30, attention_v2.cpp:33-33 | main | InBetween | 0.15 | 0.14 | 0.53 | 1.67 | 1.00 | 14.55 | 2.50 | 1 | 25.00 | 12.50 | 1.10 | 2.50 | 1.50 | 2.50 | 0.17 | 1.00 |
| Loop 31 | attention-gcc-skl256 | attention_v2.cpp:52-53 | softmax(float const*, float*, float*, int) | Innermost | 0.15 | 0.14 | 0.53 | 1.33 | 1.33 | 16.00 | 2.00 | 1 | 0.00 | 6.25 | 1.17 | 2.00 | 1.50 | 1.50 | 0.13 | 1.00 |
| Loop 3 | attention-gcc-skl256 | attention_v2.cpp:164-167, random.tcc:406-409, random.tcc:458-459, random.tcc:462-466, random.tcc:3519-3519, random.tcc:3557-3558 | main | InBetween | 0.12 | 0.12 | 0.44 | 2.68 | 1.33 | 10.62 | 8.33 | 8 | 13.33 | 15.00 | 0.17 | 18.75 | 7.00 | 14.06 | 1.77 | 2.25 |
| Loop 5 | attention-gcc-skl256 | attention_v2.cpp:27-30, attention_v2.cpp:33-33, random.tcc:422-422 | main | InBetween | 0.11 | 0.11 | 0.40 | 1.67 | 1.00 | 14.55 | 2.50 | 1 | 25.00 | 12.50 | 0.91 | 2.50 | 1.50 | 2.50 | 0.17 | 1.00 |
| Loop 13 | attention-gcc-skl256 | attention_v2.cpp:237-238 | main | Innermost | 0.06 | 0.06 | 0.24 | 1.25 | 1.00 | 16.00 | 1.25 | 1 | 0.00 | 6.25 | 0.00 | 1.25 | 1.00 | 1.25 | 0.08 | 1.00 |
| Loop 23 | attention-gcc-skl256 | random.tcc:404-409, random.tcc:420-423, random.tcc:458-458, random.tcc:462-466, random.tcc:3557-3558 | main | InBetween | 0.05 | 0.05 | 0.20 | 2.33 | 2.17 | 11.00 | 4.43 | 2 | 10.00 | 12.03 | 0.18 | 11.63 | 5.00 | 5.36 | 1.06 | 2.63 |
| Loop 26 | attention-gcc-skl256 | random.tcc:458-466, random.tcc:3557-3558 | main | Innermost | 0.05 | 0.05 | 0.18 | 2.78 | 2.34 | 11.87 | 5.33 | 2 | 7.89 | 11.63 | 0.60 | 8.00 | 2.88 | 3.42 | 0.67 | 1.50 |
| Loop 34 | attention-gcc-skl256 | attention_v2.cpp:43-43, attention_v2.cpp:46-47, attention_v2.cpp:50-52, attention_v2.cpp:58-61 | softmax(float const*, float*, float*, int) | InBetween | 0.05 | 0.05 | 0.18 | 3.39 | 1.00 | 10.67 | 10.17 | 1 | 22.58 | 12.50 | 0.00 | 15.25 | 4.50 | 15.25 | 1.43 | 1.50 |
| Loop 25 | attention-gcc-skl256 | random.tcc:412-417 | main | Innermost | 0.04 | 0.03 | 0.13 | 1.00 | 1.00 | 2.00 | 3.50 | 1 | 100.00 | 50.00 | 0.00 | 3.50 | 3.50 | 3.50 | 1.75 | 1.00 |
| Loop 42 | attention-gcc-skl256 | random.tcc:412-417 | std::mersenne_twister_engine::_M_gen_rand() | Single | 0.02 | 0.02 | 0.07 | 1.00 | 1.00 | 2.00 | 3.50 | 1 | 100.00 | 50.00 | 0.00 | 3.50 | 3.50 | 3.50 | 1.75 | 1.00 |
| Loop 35 | attention-gcc-skl256 | attention_v2.cpp:47-48 | softmax(float const*, float*, float*, int) | InBetween | 0.01 | 0.01 | 0.04 | 1.43 | 1.00 | 10.55 | 4.13 | 1 | 40.00 | 12.92 | 0.00 | 8.25 | 5.75 | 8.25 | 0.78 | 2.00 |
| Loop 33 | attention-gcc-skl256 | attention_v2.cpp:47-48 | softmax(float const*, float*, float*, int) | Innermost | 0.01 | 0.01 | 0.04 | 1.00 | 1.00 | 2.00 | 1.00 | 1 | 100.00 | 50.00 | 1.00 | 4.00 | 4.00 | 4.00 | 2.00 | 4.00 |
| Loop 41 | attention-gcc-skl256 | random.tcc:404-409 | std::mersenne_twister_engine::_M_gen_rand() | Single | 0.01 | 0.01 | 0.04 | 1.00 | 1.00 | 2.00 | 3.50 | 1 | 100.00 | 50.00 | 0.00 | 3.50 | 3.50 | 3.50 | 1.75 | 1.00 |
| Loop 1 | attention-gcc-skl256 | random.h:585-585, random.tcc:333-339 | main | Innermost | 0.01 | 0.01 | 0.04 | 1.00 | 1.00 | 8.00 | 1.00 | 1 | 0.00 | 12.50 | 0.00 | 6.00 | 6.00 | 6.00 | 0.75 | 6.00 |
| Loop 6 | attention-gcc-skl256 | attention_v2.cpp:26-27 | main | InBetween | 0.00 | 0.00 | 0.02 | 1.00 | 1.00 | 16.00 | 1.25 | 1 | 0.00 | 6.25 | 0.00 | 1.25 | 1.25 | 1.25 | 0.08 | 1.00 |