Loop id | Source Location | Source Function | Level | Coverage OMP1x13 (%) | Coverage OMP2x13 (%) | Coverage OMP1x26 (%) | Coverage OMP2x26 (%) | Max Time Over Threads OMP1x13 (s) | Max Time Over Threads OMP2x13 (s) | Max Time Over Threads OMP1x26 (s) | Max Time Over Threads OMP2x26 (s) | Time w.r.t. Wall Time OMP1x13 (s) | Time w.r.t. Wall Time OMP2x13 (s) | Time w.r.t. Wall Time OMP1x26 (s) | Time w.r.t. Wall Time OMP2x26 (s) | Nb Threads OMP1x13 | Nb Threads OMP2x13 | Nb Threads OMP1x26 | Nb Threads OMP2x26 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing OMP1x13 | Speedup If Perfect Load Balancing OMP2x13 | Speedup If Perfect Load Balancing OMP1x26 | Speedup If Perfect Load Balancing OMP2x26 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | (OMP1x13) Efficiency | (OMP1x13) Potential Speed-Up (%) | (OMP2x13) Efficiency | (OMP2x13) Potential Speed-Up (%) | (OMP1x26) Efficiency | (OMP1x26) Potential Speed-Up (%) | (OMP2x26) Efficiency | (OMP2x26) Potential Speed-Up (%) |
---|
768 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Single | 0.49 | 0.45 | 0.46 | 0.39 | 7.68 | 4.2 | 4.3 | 2.53 | 7.3 | 3.67 | 3.71 | 1.86 | 13 | 26 | 26 | 52 | 5.56 | 10.76 | 1 | 1 | 12.08 | 1.05 | 1.15 | 1.16 | 1.37 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0.99 | 0 | 0.98 | 0.01 | 0.98 | 0.01 |
778 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Single | 0.46 | 0.42 | 0.43 | 0.38 | 7.24 | 3.82 | 3.92 | 2.23 | 6.79 | 3.4 | 3.47 | 1.79 | 13 | 26 | 26 | 52 | 5.56 | 10.76 | 1 | 1 | 12.08 | 1.07 | 1.13 | 1.14 | 1.25 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.98 | 0.01 | 0.95 | 0.02 |
258 | picongpu - ForEach.hpp:202-279 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Outermost | 0.39 | 0.36 | 0.36 | 0.31 | 6.04 | 3.04 | 3.14 | 1.6 | 5.81 | 2.94 | 2.93 | 1.47 | 13 | 26 | 26 | 52 | 6.67 | 11.67 | 1 | 1 | 11.69 | 1.04 | 1.04 | 1.08 | 1.1 | NA | NA | NA | NA | NA | 1 | 0 | 0.99 | 0 | 0.99 | 0 | 0.99 | 0 |
403 | picongpu - ForEach.hpp:202-279 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Outermost | 0.39 | 0.36 | 0.36 | 0.31 | 6.15 | 3.09 | 3.12 | 1.64 | 5.84 | 2.93 | 2.89 | 1.48 | 13 | 26 | 26 | 52 | 6.67 | 11.67 | 1 | 1 | 11.69 | 1.05 | 1.06 | 1.08 | 1.12 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1.01 | -0 | 0.99 | 0 |
257 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Single | 0.34 | 0.29 | 0.36 | 0.28 | 5.41 | 2.8 | 3.33 | 1.56 | 5.02 | 2.4 | 2.91 | 1.34 | 13 | 26 | 26 | 52 | 12.5 | 10.94 | 1 | 1 | 14.64 | 1.08 | 1.17 | 1.15 | 1.17 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 1.05 | 0 | 0.86 | 0.05 | 0.94 | 0.02 |
402 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Single | 0.32 | 0.29 | 0.35 | 0.29 | 5.53 | 3.07 | 3.09 | 1.55 | 4.8 | 2.39 | 2.82 | 1.36 | 13 | 26 | 26 | 52 | 12.5 | 10.94 | 1 | 1 | 14.64 | 1.15 | 1.29 | 1.1 | 1.15 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 1 | -0 | 0.85 | 0.05 | 0.88 | 0.03 |
770 | picongpu - ForEach.hpp:202-279 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Outermost | 0.31 | 0.27 | 0.28 | 0.24 | 4.78 | 2.6 | 2.69 | 1.62 | 4.55 | 2.22 | 2.23 | 1.12 | 13 | 26 | 26 | 52 | 9.09 | 11.93 | 3 | 1 | 12.44 | 1.05 | 1.18 | 1.21 | 1.45 | NA | NA | NA | NA | NA | 1 | 0 | 1.02 | 0 | 1.02 | 0 | 1.02 | -0 |
780 | picongpu - ForEach.hpp:202-279 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Outermost | 0.31 | 0.27 | 0.28 | 0.24 | 5.01 | 2.69 | 2.67 | 1.57 | 4.57 | 2.23 | 2.25 | 1.14 | 13 | 26 | 26 | 52 | 9.09 | 11.93 | 3 | 1 | 12.44 | 1.1 | 1.21 | 1.19 | 1.39 | NA | NA | NA | NA | NA | 1 | 0 | 1.02 | 0 | 1.02 | -0 | 1 | -0 |
769 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Single | 0.26 | 0.29 | 0.28 | 0.29 | 4.31 | 3.06 | 2.61 | 1.92 | 3.89 | 2.36 | 2.24 | 1.38 | 13 | 26 | 26 | 52 | 6.67 | 8.33 | 1 | 3.81 | 14.9 | 1.11 | 1.3 | 1.17 | 1.4 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0.82 | 0.05 | 0.87 | 0.04 | 0.7 | 0.09 |
779 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Single | 0.25 | 0.29 | 0.28 | 0.29 | 4.08 | 2.88 | 2.59 | 1.77 | 3.74 | 2.35 | 2.22 | 1.35 | 13 | 26 | 26 | 52 | 6.67 | 8.33 | 1 | 3.81 | 14.9 | 1.09 | 1.23 | 1.17 | 1.32 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0.8 | 0.06 | 0.84 | 0.04 | 0.69 | 0.09 |
401 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Single | 0.23 | 0.2 | 0.22 | 0.19 | 4.15 | 2.01 | 2.11 | 1.13 | 3.42 | 1.61 | 1.79 | 0.89 | 13 | 26 | 26 | 52 | 5.56 | 7.99 | 4.83 | 1 | 16 | 1.21 | 1.26 | 1.19 | 1.27 | 1 | 0 | 0 | 2 | 0 | 1 | 0 | 1.06 | 0 | 0.96 | 0.01 | 0.96 | 0.01 |
291 | picongpu - ParticlesBase.kernel:552-563 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.22 | 0.22 | 0.2 | 0.2 | 3.44 | 2.2 | 2.01 | 1.32 | 3.22 | 1.83 | 1.61 | 0.93 | 13 | 26 | 26 | 52 | 0 | 9.05 | 3.35 | 1 | 16 | 1.07 | 1.2 | 1.26 | 1.43 | 0 | 2.33 | 3.33 | 0.67 | 2 | 1 | 0 | 0.88 | 0.03 | 1 | 0 | 0.87 | 0.03 |
290 | picongpu - ForEach.hpp:278-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.2 | 0.18 | 0.19 | 0.17 | 3.29 | 1.86 | 1.76 | 1.04 | 3.04 | 1.51 | 1.51 | 0.78 | 13 | 26 | 26 | 52 | 0 | 11.61 | 1 | 1 | 12.76 | 1.09 | 1.23 | 1.17 | 1.33 | NA | NA | NA | NA | NA | 1 | 0 | 1.01 | -0 | 1.01 | -0 | 0.97 | 0 |
256 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Single | 0.2 | 0.18 | 0.18 | 0.17 | 4.69 | 2.14 | 2.14 | 1.07 | 2.99 | 1.45 | 1.48 | 0.78 | 13 | 26 | 26 | 52 | 5.56 | 7.99 | 4.83 | 1 | 16 | 1.57 | 1.48 | 1.46 | 1.37 | 1 | 0 | 0 | 2 | 0 | 1 | 0 | 1.03 | 0 | 1.01 | -0 | 0.96 | 0.01 |
771 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Innermost | 0.2 | 0.18 | 0.18 | 0.16 | 3.24 | 1.72 | 1.78 | 1 | 2.9 | 1.44 | 1.45 | 0.74 | 13 | 26 | 26 | 52 | 0 | 11.16 | 1 | 1 | 18.16 | 1.12 | 1.19 | 1.23 | 1.37 | NA | NA | NA | NA | NA | 1 | 0 | 1.01 | -0 | 1 | 0 | 0.98 | 0 |
293 | picongpu - ParticlesBase.kernel:487-490 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.19 | 0.18 | 0.18 | 0.16 | 2.97 | 1.82 | 1.73 | 1.13 | 2.76 | 1.51 | 1.43 | 0.78 | 13 | 26 | 26 | 52 | 0 | 6.77 | 1 | 1 | 15.73 | 1.08 | 1.21 | 1.21 | 1.47 | NA | NA | NA | NA | NA | 1 | 0 | 0.91 | 0.02 | 0.97 | 0.01 | 0.88 | 0.02 |
884 | picongpu - TaskSetValue.hpp:79-89 [...] | void alpaka::detail::ParallelForImpl<cupla::cupla_omp2_seq_sync::CuplaKernel<pmacc::KernelSetValue<256u> >, alpaka::trait::OmpSchedule<cupla::cupla_omp2_seq_sync::CuplaKernel<pmacc::KernelSetValue<256u> >, alpaka::AccCpuOmp2Bl... | Innermost | 0.18 | 0.16 | 0.17 | 0.14 | 2.77 | 1.5 | 1.45 | 0.79 | 2.63 | 1.34 | 1.33 | 0.67 | 13 | 26 | 26 | 52 | 6.62 | 9.08 | 3.29 | 1 | 13.18 | 1.05 | 1.12 | 1.1 | 1.18 | 2.5 | 0 | 0 | 0.5 | 0 | 1 | 0 | 0.98 | 0 | 0.99 | 0 | 0.98 | 0 |
259 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Innermost | 0.17 | 0.16 | 0.16 | 0.14 | 2.66 | 1.54 | 1.5 | 0.8 | 2.53 | 1.32 | 1.28 | 0.67 | 13 | 26 | 26 | 52 | 0 | 11.16 | 1 | 1 | 18.16 | 1.06 | 1.17 | 1.17 | 1.19 | NA | NA | NA | NA | NA | 1 | 0 | 0.96 | 0.01 | 0.99 | 0 | 0.94 | 0.01 |
781 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Innermost | 0.17 | 0.16 | 0.16 | 0.14 | 2.79 | 1.51 | 1.56 | 0.92 | 2.59 | 1.31 | 1.28 | 0.67 | 13 | 26 | 26 | 52 | 0 | 11.16 | 1 | 1 | 18.16 | 1.08 | 1.16 | 1.23 | 1.39 | NA | NA | NA | NA | NA | 1 | 0 | 0.99 | 0 | 1.01 | -0 | 0.97 | 0 |
404 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Innermost | 0.17 | 0.15 | 0.16 | 0.13 | 2.63 | 1.48 | 1.38 | 0.8 | 2.52 | 1.25 | 1.27 | 0.62 | 13 | 26 | 26 | 52 | 0 | 11.16 | 1 | 1 | 18.16 | 1.05 | 1.18 | 1.09 | 1.29 | NA | NA | NA | NA | NA | 1 | 0 | 1.01 | -0 | 0.99 | 0 | 1.02 | -0 |
279 | picongpu - ParticlesBase.kernel:265-269 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.16 | 0.15 | 0.15 | 0.13 | 2.68 | 1.52 | 1.48 | 0.86 | 2.43 | 1.23 | 1.23 | 0.62 | 13 | 26 | 26 | 52 | 0 | 4.79 | 1 | 1 | 24 | 1.11 | 1.25 | 1.21 | 1.39 | NA | NA | NA | NA | NA | 1 | 0 | 0.99 | 0 | 0.99 | 0 | 0.98 | 0 |
281 | picongpu - Op.hpp:24-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.12 | 0.12 | 0.11 | 0.11 | 2 | 1.15 | 1.17 | 0.69 | 1.85 | 0.99 | 0.92 | 0.5 | 13 | 26 | 26 | 52 | 0 | 5.6 | 1 | 1 | 21.5 | 1.08 | 1.16 | 1.29 | 1.38 | NA | NA | NA | NA | NA | 1 | 0 | 0.93 | 0.01 | 1.01 | -0 | 0.93 | 0.01 |
434 | picongpu - ForEach.hpp:278-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.12 | 0.1 | 0.1 | 0.09 | 1.86 | 1.01 | 1.05 | 0.6 | 1.73 | 0.83 | 0.84 | 0.43 | 13 | 26 | 26 | 52 | 0 | 11.61 | 1 | 1 | 12.76 | 1.08 | 1.22 | 1.25 | 1.4 | NA | NA | NA | NA | NA | 1 | 0 | 1.04 | -0 | 1.03 | -0 | 1.01 | -0 |
278 | picongpu - ParticlesBase.kernel:279-291 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.11 | 0.1 | 0.1 | 0.09 | 1.79 | 0.96 | 1.01 | 0.61 | 1.58 | 0.8 | 0.78 | 0.4 | 13 | 26 | 26 | 52 | 0 | 10.16 | 2.53 | 1 | 15.29 | 1.13 | 1.2 | 1.29 | 1.52 | 2 | 1 | 0 | 3 | 1.5 | 1 | 0 | 0.99 | 0 | 1.01 | -0 | 0.99 | 0 |
437 | picongpu - ParticlesBase.kernel:487-490 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.1 | 0.09 | 0.09 | 0.08 | 1.56 | 0.86 | 0.86 | 0.55 | 1.41 | 0.72 | 0.71 | 0.38 | 13 | 26 | 26 | 52 | 0 | 6.77 | 1 | 1 | 15.73 | 1.11 | 1.19 | 1.23 | 1.45 | NA | NA | NA | NA | NA | 1 | 0 | 0.98 | 0 | 0.99 | 0 | 0.93 | 0.01 |
292 | picongpu - ParticlesBase.kernel:514-629 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.08 | 0.07 | 0.08 | 0.07 | 1.39 | 0.75 | 0.72 | 0.48 | 1.23 | 0.59 | 0.62 | 0.31 | 13 | 26 | 26 | 52 | 38.1 | 19.64 | 2.15 | 1 | 4.37 | 1.13 | 1.27 | 1.18 | 1.55 | NA | NA | NA | NA | NA | 1 | 0 | 1.04 | -0 | 0.99 | 0 | 0.99 | 0 |
435 | picongpu - ParticlesBase.kernel:552-563 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.08 | 0.07 | 0.07 | 0.07 | 1.23 | 0.81 | 0.82 | 0.47 | 1.13 | 0.6 | 0.6 | 0.32 | 13 | 26 | 26 | 52 | 0 | 9.05 | 3.35 | 1 | 16 | 1.09 | 1.35 | 1.37 | 1.52 | 0 | 2.33 | 3.33 | 0.67 | 2 | 1 | 0 | 0.94 | 0 | 0.94 | 0 | 0.88 | 0.01 |
667 | picongpu - Op.hpp:24-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Innermost | 0.07 | 0.06 | 0.06 | 0.05 | 1.13 | 0.56 | 0.57 | 0.33 | 0.98 | 0.47 | 0.47 | 0.25 | 13 | 26 | 26 | 52 | 0 | 5.6 | 1 | 1 | 21.5 | 1.15 | 1.19 | 1.21 | 1.38 | NA | NA | NA | NA | NA | 1 | 0 | 1.04 | -0 | 1.04 | -0 | 0.98 | 0 |
590 | picongpu - Op.hpp:24-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Innermost | 0.07 | 0.06 | 0.06 | 0.06 | 1.13 | 0.63 | 0.62 | 0.35 | 1.04 | 0.53 | 0.5 | 0.28 | 13 | 26 | 26 | 52 | 0 | 5.6 | 1 | 1 | 21.5 | 1.09 | 1.21 | 1.24 | 1.3 | NA | NA | NA | NA | NA | 1 | 0 | 0.98 | 0 | 1.04 | -0 | 0.93 | 0 |
425 | picongpu - Op.hpp:24-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.06 | 0.05 | 0.06 | 0.05 | 0.95 | 0.64 | 0.55 | 0.32 | 0.84 | 0.45 | 0.45 | 0.22 | 13 | 26 | 26 | 52 | 0 | 5.6 | 1 | 1 | 21.5 | 1.13 | 1.42 | 1.22 | 1.45 | NA | NA | NA | NA | NA | 1 | 0 | 0.93 | 0 | 0.93 | 0 | 0.95 | 0 |
752 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.03 | 0.03 | 0.03 | 0.03 | 0.56 | 0.33 | 0.32 | 0.25 | 0.47 | 0.25 | 0.25 | 0.15 | 13 | 26 | 26 | 52 | 7.14 | 8.48 | 1 | 1 | 14.79 | 1.22 | 1.32 | 1.28 | 1.67 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0.94 | 0 | 0.94 | 0 | 0.78 | 0.01 |
916 | picongpu - CopyGuardToExchange.hpp:92-121 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<pmacc::fields::operations::KernelCopyGuardToExchange, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsigned l... | Single | 0.03 | 0.03 | 0.03 | 0.03 | 0.56 | 0.3 | 0.33 | 0.22 | 0.4 | 0.23 | 0.23 | 0.13 | 13 | 26 | 26 | 52 | 10.42 | 9.64 | 4.21 | 1 | 14.34 | 1.4 | 1.3 | 1.43 | 1.69 | NA | NA | NA | NA | NA | 1 | 0 | 0.87 | 0 | 0.87 | 0 | 0.77 | 0.01 |
3562 | picongpu - | __intel_avx_rep_memcpy | Single | 0.03 | 0.03 | 0.03 | 0.03 | 2.15 | 2.29 | 1.92 | 2.43 | 0.44 | 0.24 | 0.23 | 0.13 | 13 | 26 | 26 | 52 | 100 | 50 | 1 | 1 | 2 | 4.89 | 9.54 | 8.35 | 18.69 | 0 | 2 | 0 | 0 | 0 | 1 | 0 | 0.92 | 0 | 0.96 | 0 | 0.85 | 0 |
726 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.03 | 0.04 | 0.03 | 0.04 | 0.56 | 0.46 | 0.33 | 0.25 | 0.48 | 0.33 | 0.26 | 0.17 | 13 | 26 | 26 | 52 | 25 | 11.13 | 1.82 | 3.06 | 11.95 | 1.17 | 1.44 | 1.27 | 1.47 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0.73 | 0.01 | 0.92 | 0 | 0.71 | 0.01 |
910 | picongpu - AddExchangeToBorder.hpp:95-126 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<pmacc::fields::operations::KernelAddExchangeToBorder, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsigned l... | Single | 0.03 | 0.03 | 0.03 | 0.03 | 0.56 | 0.36 | 0.32 | 0.26 | 0.44 | 0.25 | 0.24 | 0.16 | 13 | 26 | 26 | 52 | 6.85 | 8.56 | 3.55 | 2.52 | 14.21 | 1.27 | 1.44 | 1.33 | 1.63 | NA | NA | NA | NA | NA | 1 | 0 | 0.88 | 0 | 0.92 | 0 | 0.69 | 0.01 |
725 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.03 | 0.03 | 0.03 | 0.03 | 0.58 | 0.31 | 0.35 | 0.17 | 0.52 | 0.25 | 0.26 | 0.13 | 13 | 26 | 26 | 52 | 0 | 7.03 | 1 | 1 | 16 | 1.14 | 1.24 | 1.35 | 1.31 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 1.04 | -0 | 1 | 0 | 1 | 0 |
423 | picongpu - ParticlesBase.kernel:265-269 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.03 | 0.03 | 0.03 | 0.02 | 0.51 | 0.3 | 0.29 | 0.18 | 0.44 | 0.22 | 0.22 | 0.11 | 13 | 26 | 26 | 52 | 0 | 4.79 | 1 | 1 | 24 | 1.16 | 1.36 | 1.32 | 1.64 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
789 | picongpu - ForEach.hpp:202-202 [...] | void alpaka::detail::ParallelForImpl<pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::KernelAddCurrentDensity, pmacc::lockstep::WorkerCfg<256u> >, alpaka::trait::OmpSchedule<pmacc::lockstep::exec::detail::Loc... | Innermost | 0.02 | 0.02 | 0.02 | 0.02 | 0.3 | 0.23 | 0.18 | 0.13 | 0.24 | 0.17 | 0.14 | 0.08 | 13 | 26 | 26 | 52 | 20 | 11 | 2.6 | 1.18 | 12.06 | 1.25 | 1.35 | 1.29 | 1.63 | 3 | 0 | 1 | 1 | 0 | 1 | 0 | 0.71 | 0.01 | 0.86 | 0 | 0.75 | 0.01 |
761 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.02 | 0.02 | 0.02 | 0.02 | 0.29 | 0.31 | 0.22 | 0.23 | 0.25 | 0.16 | 0.13 | 0.11 | 13 | 26 | 26 | 52 | 27.03 | 11.66 | 1.79 | 2.6 | 10.85 | 1.16 | 1.94 | 1.69 | 2.09 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0.78 | 0 | 0.96 | 0 | 0.57 | 0.01 |
843 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.02 | 0.02 | 0.02 | 0.02 | 0.33 | 0.21 | 0.19 | 0.14 | 0.29 | 0.16 | 0.15 | 0.08 | 13 | 26 | 26 | 52 | 0 | 7.03 | 1 | 1 | 16 | 1.14 | 1.31 | 1.27 | 1.75 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0.91 | 0 | 0.97 | 0 | 0.91 | 0 |
855 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.02 | 0.02 | 0.01 | 0.02 | 0.31 | 0.33 | 0.19 | 0.18 | 0.23 | 0.14 | 0.11 | 0.08 | 13 | 26 | 26 | 52 | 0 | 8.93 | 1 | 1 | 14.77 | 1.35 | 2.36 | 1.73 | 2.25 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0.82 | 0 | 1.05 | -0 | 0.72 | 0.01 |
3563 | picongpu - | __intel_avx_rep_memset | Single | 0.02 | 0.02 | 0.02 | 0.02 | 0.36 | 0.23 | 0.2 | 0.14 | 0.25 | 0.16 | 0.15 | 0.08 | 13 | 26 | 26 | 52 | 100 | 50 | 1 | 1 | 2 | 1.44 | 1.53 | 1.33 | 1.75 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0.78 | 0 | 0.83 | 0 | 0.78 | 0 |
856 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.02 | 0.02 | 0.02 | 0.02 | 0.43 | 0.41 | 0.26 | 0.2 | 0.27 | 0.17 | 0.13 | 0.1 | 13 | 26 | 26 | 52 | 25 | 11.13 | 1.82 | 3.06 | 11.95 | 1.59 | 2.41 | 2 | 2 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0.79 | 0 | 1.04 | -0 | 0.68 | 0.01 |
753 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.02 | 0.03 | 0.02 | 0.03 | 0.36 | 0.32 | 0.25 | 0.26 | 0.3 | 0.24 | 0.18 | 0.13 | 13 | 26 | 26 | 52 | 31.25 | 12.3 | 1.79 | 2.96 | 10.98 | 1.2 | 1.39 | 1.39 | 2 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0.63 | 0.01 | 0.83 | 0 | 0.58 | 0.01 |
844 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.02 | 0.02 | 0.02 | 0.02 | 0.33 | 0.24 | 0.19 | 0.14 | 0.29 | 0.19 | 0.15 | 0.1 | 13 | 26 | 26 | 52 | 25 | 11.13 | 1.82 | 3.06 | 11.95 | 1.14 | 1.33 | 1.27 | 1.56 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0.76 | 0 | 0.97 | 0 | 0.72 | 0.01 |
760 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.02 | 0.02 | 0.02 | 0.02 | 0.42 | 0.46 | 0.24 | 0.31 | 0.35 | 0.2 | 0.17 | 0.11 | 13 | 26 | 26 | 52 | 7.14 | 8.48 | 1 | 1 | 14.79 | 1.2 | 2.3 | 1.41 | 2.82 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0.87 | 0 | 1.03 | -0 | 0.8 | 0 |
422 | picongpu - ParticlesBase.kernel:279-291 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.02 | 0.02 | 0.02 | 0.02 | 0.39 | 0.24 | 0.19 | 0.14 | 0.29 | 0.16 | 0.14 | 0.09 | 13 | 26 | 26 | 52 | 0 | 10.16 | 2.53 | 1 | 15.29 | 1.34 | 1.5 | 1.36 | 1.56 | 2 | 1 | 0 | 3 | 1.5 | 1 | 0 | 0.91 | 0 | 1.04 | -0 | 0.81 | 0 |
433 | picongpu - ParticlesBase.kernel:487-629 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Outermost | 0.01 | 0.01 | 0.01 | 0.01 | 0.15 | 0.11 | 0.11 | 0.06 | 0.12 | 0.07 | 0.06 | 0.03 | 13 | 26 | 26 | 52 | 10.16 | 11.53 | 3.23 | 1 | 7.7 | 1.25 | 1.57 | 1.83 | 2 | NA | NA | NA | NA | NA | 1 | 0 | 0.86 | 0 | 1 | 0 | 1 | 0 |
270 | picongpu - ParticlesBase.kernel:124-128 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0.01 | 0.01 | 0.01 | 0.01 | 0.18 | 0.12 | 0.13 | 0.08 | 0.14 | 0.07 | 0.08 | 0.04 | 13 | 26 | 26 | 52 | 0 | 7.95 | 1 | 1 | 23.19 | 1.29 | 1.71 | 1.63 | 2 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 0.88 | 0 | 0.88 | 0 |
1486 | picongpu - Kernel.hpp:161-164 [...] | void std::__invoke_impl<void, pmacc::exec::detail::KernelWithDynSharedMem<pmacc::lockstep::exec::detail::LockStepKernel<pmacc::device::reduce::Kernel<pmacc::math::Vector<double, 3u, pmacc::math::StandardAccessor, pmacc::math::StandardNavigat... | Innermost | 0.01 | 0.01 | 0.01 | 0.01 | 0.14 | 0.11 | 0.08 | 0.07 | 0.1 | 0.06 | 0.05 | 0.03 | 13 | 26 | 26 | 42 | 12.5 | 10.16 | 3.29 | 1.17 | 4.67 | 1.4 | 1.83 | 1.6 | 1.75 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0.83 | 0 | 1 | 0 | 0.83 | 0 |
277 | picongpu - FramePointer.hpp:55-77 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Outermost | 0.01 | 0.01 | 0.01 | 0.01 | 0.13 | 0.08 | 0.12 | 0.07 | 0.1 | 0.06 | 0.05 | 0.03 | 13 | 26 | 26 | 52 | 12.82 | 11.9 | 4.47 | 1 | 21.19 | 1.3 | 1.33 | 2.4 | 2.33 | NA | NA | NA | NA | NA | 1 | 0 | 0.83 | 0 | 1 | 0 | 0.83 | 0 |
289 | picongpu - ParticlesBase.kernel:487-629 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Outermost | 0.01 | 0.01 | 0.01 | 0.01 | 0.28 | 0.17 | 0.14 | 0.11 | 0.22 | 0.12 | 0.1 | 0.06 | 13 | 26 | 26 | 52 | 10.16 | 11.53 | 3.23 | 1 | 7.7 | 1.27 | 1.42 | 1.4 | 1.83 | NA | NA | NA | NA | NA | 1 | 0 | 0.92 | 0 | 1.1 | -0 | 0.92 | 0 |
414 | picongpu - ParticlesBase.kernel:124-128 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0.01 | 0 | 0 | 0 | 0.1 | 0.07 | 0.07 | 0.04 | 0.08 | 0.03 | 0.04 | 0.02 | 13 | 26 | 26 | 50 | 0 | 7.95 | 1 | 1 | 23.19 | 1.25 | 2.33 | 1.75 | 2 | NA | NA | NA | NA | NA | 1 | 0 | 1.33 | -0 | 1 | 0 | 1 | 0 |
788 | picongpu - ForEach.hpp:202-202 [...] | void alpaka::detail::ParallelForImpl<pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::KernelAddCurrentDensity, pmacc::lockstep::WorkerCfg<256u> >, alpaka::trait::OmpSchedule<pmacc::lockstep::exec::detail::Loc... | Innermost | 0.01 | 0.01 | 0.01 | 0.01 | 0.23 | 0.19 | 0.12 | 0.08 | 0.18 | 0.1 | 0.1 | 0.05 | 13 | 26 | 26 | 52 | 0 | 8.04 | 1 | 1 | 14.93 | 1.28 | 1.9 | 1.2 | 1.6 | 3 | 0 | 0 | 2 | 0 | 1 | 0 | 0.9 | 0 | 0.9 | 0 | 0.9 | 0 |
413 | picongpu - ParticlesBase.kernel:138-153 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0.01 | 0.01 | 0.01 | 0.01 | 0.15 | 0.1 | 0.11 | 0.07 | 0.11 | 0.07 | 0.06 | 0.03 | 13 | 26 | 26 | 52 | 0 | 7.62 | 2.32 | 1 | 20.44 | 1.36 | 1.43 | 1.83 | 2.33 | 3 | 1 | 0 | 4.5 | 0 | 1 | 0 | 0.79 | 0 | 0.92 | 0 | 0.92 | 0 |
1123 | picongpu - Op.hpp:96-96 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu27KernelFillGridWithParticlesINS5_9ParticlesINS0_4meta6StringIJLc101EEEEN5boost4mp117mp_listIJNS5_14particlePusherINS5_9particles6pusher5BorisENS0_13pmacc_isAliasEEENS5_5shapeINS... | Innermost | 0.01 | 0 | 0 | 0 | 0.08 | 0.05 | 0.04 | 0.03 | 0.08 | 0.04 | 0.04 | 0.02 | 13 | 26 | 26 | 52 | 16.8 | 9.11 | 1.86 | 1.03 | 9.32 | 1 | 1.25 | 1 | 1.5 | 2.67 | 0 | 1 | 0.67 | 5.67 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
294 | picongpu - ParticlesBase.kernel:440-629 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0.01 | 0.01 | 0.01 | 0.01 | 0.17 | 0.1 | 0.09 | 0.09 | 0.11 | 0.06 | 0.06 | 0.04 | 13 | 26 | 26 | 52 | 4 | 8.25 | 3.58 | 1 | 14.86 | 1.55 | 1.67 | 1.5 | 3 | NA | NA | NA | NA | NA | 1 | 0 | 0.92 | 0 | 0.92 | 0 | 0.69 | 0 |
803 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::KernelAddCurrentDensity, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.01 | 0.01 | 0.01 | 0.01 | 0.39 | 0.31 | 0.22 | 0.19 | 0.18 | 0.12 | 0.09 | 0.06 | 13 | 26 | 26 | 52 | 18.75 | 11.72 | 2.38 | 2.85 | 12.16 | 2.17 | 2.58 | 2.44 | 3.17 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0.75 | 0 | 1 | 0 | 0.75 | 0 |
802 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::KernelAddCurrentDensity, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.01 | 0.01 | 0.01 | 0.01 | 0.21 | 0.2 | 0.1 | 0.16 | 0.13 | 0.08 | 0.07 | 0.05 | 13 | 26 | 26 | 52 | 0 | 9.38 | 1 | 1 | 15.48 | 1.62 | 2.5 | 1.43 | 3.2 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0.81 | 0 | 0.93 | 0 | 0.65 | 0 |
436 | picongpu - ParticlesBase.kernel:514-629 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.01 | 0.01 | 0.01 | 0.01 | 0.22 | 0.14 | 0.15 | 0.1 | 0.19 | 0.09 | 0.09 | 0.05 | 13 | 26 | 26 | 52 | 38.1 | 19.64 | 2.15 | 1 | 4.37 | 1.16 | 1.56 | 1.67 | 2 | NA | NA | NA | NA | NA | 1 | 0 | 1.06 | -0 | 1.06 | -0 | 0.95 | 0 |
269 | picongpu - ParticlesBase.kernel:138-153 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0.01 | 0.02 | 0.01 | 0.01 | 0.24 | 0.2 | 0.14 | 0.12 | 0.22 | 0.13 | 0.11 | 0.07 | 13 | 26 | 26 | 52 | 0 | 7.62 | 2.32 | 1 | 20.44 | 1.09 | 1.54 | 1.27 | 1.71 | 3 | 1 | 0 | 4.5 | 0 | 1 | 0 | 0.85 | 0 | 1 | 0 | 0.79 | 0 |
271 | picongpu - Op.hpp:30-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0.01 | 0.01 | 0.01 | 0.01 | 0.18 | 0.11 | 0.11 | 0.07 | 0.12 | 0.06 | 0.06 | 0.03 | 13 | 26 | 26 | 52 | 0 | 4.3 | 1 | 1 | 24.57 | 1.5 | 1.83 | 1.83 | 2.33 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
665 | picongpu - ParticlesBase.kernel:265-269 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Innermost | 0 | 0 | 0 | 0 | 0.02 | 0.03 | 0.02 | 0.01 | 0.01 | 0 | 0 | 0 | 8 | 9 | 10 | 11 | 0 | 4.79 | 1 | 1 | 24 | 2 | 3 | 2 | 1 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
3548 | picongpu - stl_tree.h:1951-1952 [...] | cupla_omp2_seq_sync::cuplaEventRecord(void*, void*) | Single | 0 | 0 | 0 | 0 | 0.02 | 0.03 | 0.02 | 0.03 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
724 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | void alpaka::detail::ParallelForImpl<pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> >, alpaka::trait::OmpSchedule<pmacc::lockstep::exec::detail::Loc... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 10 | 18 | 15 | 19 | 25 | 14.73 | 6.67 | 1 | 4.44 | 1 | 2 | 1 | 1 | 1 | 0 | 1 | 0 | 3 | 1 | 0 | 0.5 | 0 | 1 | 0 | 1 | 0 |
804 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::KernelAddCurrentDensity, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0 | 0 | 0 | 0 | | | | 0 | | | | 0 | | | | 1 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | | | | | | | 1 | 0 |
1478 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::exec::detail::KernelWithDynSharedMem<pmacc::lockstep::exec::detail::LockStepKernel<pmacc::device::reduce::Kernel<pmacc::math::Vector<double, 3u, pmacc::math::StandardAccessor, pmacc::math::StandardNavigat... | Single | 0 | 0 | 0 | 0 | | | 0 | | | | 0 | | | | 1 | | 11.11 | 9.29 | 1.36 | 1.36 | 17.5 | 0 | 0 | 0 | 0 | 0.67 | 2 | 0.33 | 0.33 | 0 | | | | | 1 | 0 | | |
264 | picongpu - Stream.hpp:53-53 [...] | void pmacc::ParticlesBase<pmacc::ParticleDescription<pmacc::meta::String<(char)101>, pmacc::math::CT::Vector<std::integral_constant<int, 8>, std::integral_constant<int, 8>, std::integral_constant<int, 4> >, boost::mp11::... | Outermost | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0.02 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 14.54 | 14 | 5.27 | 1 | 11.75 | 1 | 0 | 0 | 1 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1186 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Outermost | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 12 | 15 | 15 | 17 | 25 | 10.16 | 1.71 | 2.56 | 12.67 | 1 | 1 | 1 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
647 | picongpu - CopyIdentifier.hpp:34-34 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_21KernelInsertParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj... | Single | 0 | 0 | 0 | 0 | 0.04 | 0.04 | 0.03 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 13 | 20 | 26 | 44 | 0 | 10.4 | 2.75 | 1 | 13.2 | 2 | 4 | 3 | 2 | 2 | 0 | 0 | 1 | 2 | 1 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0 |
593 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0 | 0 | | | 0 | 0 | | | 1 | 2 | | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | | | | |
581 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 4 | 6 | 5 | 7 | 0 | 6.25 | 1 | 1 | 16 | 0 | 1 | 0 | 1 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1192 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Outermost | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0 | 13 | 20 | 21 | 22 | 25 | 10.16 | 1.71 | 2.56 | 12.67 | 1 | 1 | 1 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 0.5 | 0 | 0.5 | 0 | 1 | 0 |
1168 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0 | | | | 0 | | | | 1 | | | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | | | | | | |
426 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 4 | 7 | 6 | 7 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 1 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
274 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 7 | 7 | 10 | 9 | 0 | 6.25 | 1 | 1 | 16 | 1 | 1 | 1 | 1 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
288 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0.02 | 0.01 | 0.01 | 0 | 0.01 | 0 | 13 | 15 | 18 | 26 | 100 | 50 | 1 | 1 | 2 | 2 | 1 | 2 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0 | 1 | 0 |
1205 | picongpu - tinymt32.h:105-136 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Innermost | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 11 | 17 | 20 | 21 | 0 | 6.56 | 2.41 | 4.89 | 15.59 | 2 | 1 | 1 | 1 | 0.5 | 0 | 0 | 0 | 0 | 1 | 0 | 0.5 | 0 | 0.5 | 0 | 1 | 0 |
412 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | _ZN6alpaka6detail15ParallelForImplIN5pmacc8lockstep4exec6detail14LockStepKernelINS2_20KernelShiftParticlesENS3_9WorkerCfgILj256EEEEENS_5trait11OmpScheduleISA_NS_16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEEvE19TraitNotSpecializedELNS_3omp8Schedule4Kin... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.02 | 0.01 | 0.03 | 0.01 | 0.01 | 0 | 0 | 12 | 20 | 16 | 31 | 24.14 | 17.89 | 6.67 | 1 | 4.44 | 1 | 2 | 1 | 3 | 1 | 0 | 1 | 0 | 3 | 1 | 0 | 0.5 | 0 | 1 | 0 | 1 | 0 |
1477 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | void alpaka::detail::ParallelForImpl<pmacc::exec::detail::KernelWithDynSharedMem<pmacc::lockstep::exec::detail::LockStepKernel<pmacc::device::reduce::Kernel<pmacc::math::Vector<double, 3u, pmacc::math::StandardAccessor, pmacc::math::Standard... | Single | 0 | 0 | 0 | 0 | | 0 | | | | 0 | | | | 1 | | | 17.24 | 15.3 | 7.7 | 1 | 4.28 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 3 | | | 1 | 0 | | | | |
911 | picongpu - Manager.hpp:98-109 [...] | pmacc::TaskFieldSend<picongpu::FieldJ>::init() | Single | 0 | 0 | 0 | 0 | | 0 | | 0 | | 0 | | 0 | | 1 | | 1 | 0 | 7.96 | 1 | 1 | 17.92 | 0 | 0 | 0 | 0 | NA | NA | NA | NA | NA | | | 1 | 0 | | | 1 | 0 |
1124 | picongpu - ParticlesInit.kernel:148-164 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu27KernelFillGridWithParticlesINS5_9ParticlesINS0_4meta6StringIJLc101EEEEN5boost4mp117mp_listIJNS5_14particlePusherINS5_9particles6pusher5BorisENS0_13pmacc_isAliasEEENS5_5shapeINS... | Outermost | 0 | 0 | 0 | 0 | | | | 0 | | | | 0 | | | | 2 | 26.09 | 13.86 | 2.4 | 1 | 7.38 | 0 | 0 | 0 | 0 | NA | NA | NA | NA | NA | | | | | | | 1 | 0 |
1132 | picongpu - ParticlesBase.kernel:138-153 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | | | | 0 | | | | 0 | | | | 2 | 0 | 6.64 | 2.38 | 1 | 22.01 | 0 | 0 | 0 | 0 | 2.5 | 1 | 0 | 4 | 0.5 | | | | | | | 1 | 0 |
1162 | picongpu - ParticlesBase.kernel:124-128 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | | | | 0 | | | | 0 | | | | 1 | 0 | 6.48 | 1 | 1 | 33.66 | 0 | 0 | 0 | 0 | NA | NA | NA | NA | NA | | | | | | | 1 | 0 |
606 | picongpu - Op.hpp:24-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_25KernelCopyGuardToExchangeENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArra... | Innermost | 0 | 0 | 0 | 0 | 0.04 | 0.03 | 0.02 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 13 | 22 | 22 | 42 | 0 | 5.27 | 1 | 1 | 34.4 | 2 | 1.5 | 2 | 3 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0 |
1182 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | _ZN6alpaka6detail15ParallelForImplIN5pmacc8lockstep4exec6detail14LockStepKernelINS2_9particles9algorithm3acc6detail21KernelForEachParticleENS3_9WorkerCfgILj256EEEEENS_5trait11OmpScheduleISE_NS_16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEEvE19TraitNotS... | Single | 0 | 0 | 0 | 0 | 0 | | | | 0 | | | | 1 | | | | 25 | 16.52 | 6.25 | 1 | 4.17 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 3 | 1 | 0 | | | | | | |
1018 | picongpu - TaskSetValue.hpp:83-89 [...] | void alpaka::detail::ParallelForImpl<cupla::cupla_omp2_seq_sync::CuplaKernel<pmacc::KernelSetValue<256u> >, alpaka::trait::OmpSchedule<cupla::cupla_omp2_seq_sync::CuplaKernel<pmacc::KernelSetValue<256u> >, alpaka::AccCpuOmp2Bl... | Innermost | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 0 | | 1 | 1 | 2 | | 53.33 | 16.25 | 1.48 | 1 | 8.5 | 0 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | | |
648 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_21KernelInsertParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj... | Single | 0 | 0 | 0 | 0 | | | 0 | | | | 0 | | | | 1 | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | | | | | 1 | 0 | | |
604 | picongpu - ParticlesBase.kernel:768-833 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_25KernelCopyGuardToExchangeENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArra... | Outermost | 0 | 0 | 0 | 0 | 0.02 | 0.03 | 0.02 | 0.04 | 0.01 | 0.01 | 0.01 | 0.01 | 10 | 23 | 21 | 45 | 40.43 | 24 | 1.88 | 1 | 3.45 | 2 | 3 | 2 | 2 | NA | NA | NA | NA | NA | 1 | 0 | 0.5 | 0 | 0.5 | 0 | 0.25 | 0 |
591 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | | 0 | 0 | 0 | | 0 | 0 | 0 | | 1 | 1 | 1 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | | | 1 | 0 | 1 | 0 | 1 | 0 |
416 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 8 | 10 | 7 | 11 | 0 | 6.25 | 1 | 1 | 16 | 1 | 1 | 0 | 1 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1008 | picongpu - AlpakaRand.hpp:45-45 [...] | void alpaka::detail::ParallelForImpl<pmacc::lockstep::exec::detail::LockStepKernel<pmacc::random::kernel::InitRNGProvider<256u, pmacc::random::methods::AlpakaRand<alpaka::AccCpuOmp2Blocks<std::integral_constant<unsigned long, 3ul>, uns... | Innermost | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 0 | | 0 | 0 | 12 | | 1 | 3 | 0 | 6.32 | 1 | 1 | 15.88 | 0 | 0 | 0 | 0 | 1.5 | 0 | 0 | 1.5 | 0 | 1 | 0 | | | 1 | 0 | 1 | 0 |
609 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_25KernelCopyGuardToExchangeENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArra... | Single | 0 | 0 | 0 | 0 | | | 0 | 0 | | | 0 | 0 | | | 2 | 2 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 2 | 0 | | | | | 1 | 0 | 1 | 0 |
440 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | | | 0 | | | | 0 | | | | 2 | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | | | | | 1 | 0 | | |
845 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0 | 0 | 0 | 0 | | 0 | | | | 0 | | | | 1 | | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | | | 1 | 0 | | | | |
1480 | picongpu - Kernel.hpp:190-193 [...] | void std::__invoke_impl<void, pmacc::exec::detail::KernelWithDynSharedMem<pmacc::lockstep::exec::detail::LockStepKernel<pmacc::device::reduce::Kernel<pmacc::math::Vector<double, 3u, pmacc::math::StandardAccessor, pmacc::math::StandardNavigat... | Single | 0 | 0 | 0 | 0 | | | 0 | | | | 0 | | | | 2 | | 16.67 | 11.35 | 1.53 | 1.65 | 12.9 | 0 | 0 | 0 | 0 | 0.75 | 2 | 0.5 | 0.5 | 0 | | | | | 1 | 0 | | |
909 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | void alpaka::detail::ParallelForImpl<pmacc::lockstep::exec::detail::LockStepKernel<pmacc::fields::operations::KernelAddExchangeToBorder, pmacc::lockstep::WorkerCfg<256u> >, alpaka::trait::OmpSchedule<pmacc::lockstep::exec::detail::LockSte... | Single | 0 | 0 | 0 | 0 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 13 | 15 | 16 | 22 | 20.59 | 13.97 | 7.33 | 1 | 4.89 | 2 | 2 | 1 | 1 | 1 | 0 | 1 | 1 | 2 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
997 | picongpu - TaskSetValue.hpp:79-89 [...] | void alpaka::detail::ParallelForImpl<cupla::cupla_omp2_seq_sync::CuplaKernel<pmacc::KernelSetValue<256u> >, alpaka::trait::OmpSchedule<cupla::cupla_omp2_seq_sync::CuplaKernel<pmacc::KernelSetValue<256u> >, alpaka::AccCpuOmp2Bl... | Innermost | 0 | 0 | 0 | 0 | | 0 | 0 | | | 0 | 0 | | | 22 | 18 | | 17.74 | 10.82 | 2.33 | 1 | 10.18 | 0 | 0 | 0 | 0 | 2.5 | 0 | 0 | 0.5 | 0 | | | 1 | 0 | 1 | 0 | | |
669 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | | | | 0 | | | | 0 | | | | 1 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | | | | | | | 1 | 0 |
3445 | picongpu - stl_tree.h:1951-1952 [...] | pmacc::Manager::addTask(pmacc::ITask*) | Single | 0 | 0 | 0 | 0 | 0.26 | 0.19 | 0.24 | 0.2 | 0.02 | 0.01 | 0.01 | 0 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1154 | picongpu - ForEach.hpp:202-241 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu21KernelDeriveParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticAr... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0 | 12 | 22 | 20 | 25 | 25 | 19.21 | 2.8 | 1 | 4.69 | 1 | 1 | 1 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 0.5 | 0 | 0.5 | 0 | 1 | 0 |
1203 | picongpu - tinymt32.h:105-136 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Innermost | 0 | 0 | 0 | 0 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0 | 11 | 15 | 19 | 20 | 0 | 6.56 | 2.41 | 4.89 | 15.59 | 2 | 2 | 1 | 1 | 0.5 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0 | 1 | 0 |
3439 | picongpu - stl_tree.h:1967-1968 [...] | pmacc::Manager::waitForFinished(unsigned long) | Single | 0 | 0 | 0 | 0 | 0.03 | 0.05 | 0.03 | 0.04 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
677 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | _ZN6alpaka6detail15ParallelForImplIN5pmacc8lockstep4exec6detail14LockStepKernelINS2_25KernelCopyGuardToExchangeENS3_9WorkerCfgILj256EEEEENS_5trait11OmpScheduleISA_NS_16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEEvE19TraitNotSpecializedELNS_3omp8Schedul... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 6 | 14 | 6 | 17 | 31.03 | 20.69 | 4.69 | 1 | 4.17 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 3 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
579 | picongpu - ParticlesBase.kernel:124-128 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0.06 | 0.03 | 0.05 | 0.02 | 0.05 | 0.02 | 0.02 | 0.01 | 13 | 26 | 26 | 48 | 0 | 7.95 | 1 | 1 | 23.19 | 1.2 | 1.5 | 2.5 | 2 | NA | NA | NA | NA | NA | 1 | 0 | 1.25 | -0 | 1.25 | -0 | 1.25 | -0 |
751 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | void alpaka::detail::ParallelForImpl<pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> >, alpaka::trait::OmpSchedule<pmacc::lockstep::exec::detail::Loc... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 7 | 9 | 6 | 12 | 25 | 14.73 | 6.67 | 1 | 4.44 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 3 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
419 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 5 | 4 | 10 | 7 | 0 | 6.25 | 1 | 1 | 16 | 1 | 0 | 1 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
273 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 8 | 11 | 12 | 14 | 0 | 6.25 | 1 | 1 | 16 | 1 | 1 | 1 | 1 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
663 | picongpu - FramePointer.hpp:55-77 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Outermost | 0 | 0 | 0 | 0 | 0.03 | 0.03 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0 | 11 | 23 | 18 | 31 | 16.33 | 13.04 | 4.59 | 1 | 16.8 | 3 | 3 | 2 | 2 | NA | NA | NA | NA | NA | 1 | 0 | 0.5 | 0 | 0.5 | 0 | 1 | 0 |
767 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | _ZN6alpaka6detail15ParallelForImplIN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS2_20SuperCellDescriptionINS2_4math2CT6VectorISt17integral_constantIiLi8EESF_SE_IiLi4EEEENSD_ISE_IiLi2EESI_SI_EENSD_ISE_IiLi3EE... | Single | 0 | 0 | 0 | 0 | 0.04 | 0.03 | 0.03 | 0.02 | 0.02 | 0.01 | 0.01 | 0 | 13 | 24 | 22 | 32 | 29.03 | 18.35 | 5 | 1 | 4.44 | 2 | 3 | 3 | 2 | 1 | 0 | 1 | 0 | 3 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
75 | picongpu - stl_tree.h:1951-1952 [...] | cupla::cupla_omp2_seq_sync::manager::Device<alpaka::DevCpu>::device(int) | Single | 0 | 0 | 0 | 0 | 0 | 0 | | | 0 | 0 | | | 1 | 1 | | | 0 | 9.38 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | | | | |
759 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | void alpaka::detail::ParallelForImpl<pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> >, alpaka::trait::OmpSchedule<pmacc::lockstep::exec::detail::Loc... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 5 | 8 | 5 | 12 | 25 | 14.73 | 6.67 | 1 | 4.44 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 3 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
283 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 8 | 5 | 9 | 9 | 0 | 6.25 | 1 | 1 | 16 | 1 | 0 | 1 | 1 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
3450 | picongpu - stl_tree.h:376-2509 [...] | std::map<unsigned long, pmacc::ITask*, std::less<unsigned long>, std::allocator<std::pair<unsigned long const, pmacc::ITask*> > >::erase(unsigned long const&) | Single | 0 | 0 | 0 | 0 | 0.01 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
261 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 7 | 9 | 13 | 7 | 0 | 6.25 | 1 | 1 | 16 | 1 | 1 | 1 | 1 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1199 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Innermost | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 3 | 5 | 4 | 0 | 11.16 | 1 | 1 | 18.16 | 0 | 1 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
32 | picongpu - internals.hpp:178-195 [...] | boost::basic_format<char, std::char_traits<char>, std::allocator<char> >::parse(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) | Single | 0 | 0 | 0 | 0 | 0 | | 0.01 | | 0 | | 0 | | 1 | | 1 | | 0 | 7.6 | 1 | 1 | 32.74 | 0 | 0 | 1 | 0 | NA | NA | NA | NA | NA | 1 | 0 | | | 1 | 0 | | |
584 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 2 | 5 | 0 | 6.25 | 1 | 1 | 16 | 1 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1200 | picongpu - random.tcc:1827-3368 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | InBetween | 0 | 0 | 0 | 0 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 11 | 20 | 19 | 25 | 27.03 | 12.25 | 1.57 | 1.36 | 12.85 | 2 | 2 | 2 | 1 | NA | NA | NA | NA | NA | 1 | 0 | 0.5 | 0 | 0.5 | 0 | 1 | 0 |
103 | picongpu - Mask.hpp:132-158 [...] | pmacc::Mask::getMirroredExchangeType(unsigned int) | Single | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | | 0 | 0 | 0 | | 1 | 1 | 1 | | 0 | 7.14 | 1 | 1 | 14.42 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | | |
3440 | picongpu - stl_tree.h:1967-1968 [...] | pmacc::Manager::waitForFinished(unsigned long) | Innermost | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | | 0 | 0 | 0 | | 1 | 1 | 1 | | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | | |
295 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 3 | 1 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
284 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 4 | 7 | 12 | 13 | 0 | 6.25 | 1 | 1 | 16 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1232 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Innermost | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 | 5 | 7 | 4 | 0 | 11.16 | 1 | 1 | 18.16 | 1 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
646 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | _ZN6alpaka6detail15ParallelForImplIN5pmacc8lockstep4exec6detail14LockStepKernelINS2_21KernelInsertParticlesENS3_9WorkerCfgILj256EEEEENS_5trait11OmpScheduleISA_NS_16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEEvE19TraitNotSpecializedELNS_3omp8Schedule4Ki... | Single | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 0 | | 0 | 0 | 2 | | 1 | 1 | 25.93 | 16.67 | 6.08 | 1 | 4.06 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 3 | 1 | 0 | | | 1 | 0 | 1 | 0 |
1207 | picongpu - tinymt32.h:105-136 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Innermost | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 10 | 10 | 11 | 12 | 0 | 6.62 | 2.41 | 5.2 | 15.59 | 1 | 0 | 1 | 0 | 0.5 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1484 | picongpu - Kernel.hpp:190-193 [...] | void std::__invoke_impl<void, pmacc::exec::detail::KernelWithDynSharedMem<pmacc::lockstep::exec::detail::LockStepKernel<pmacc::device::reduce::Kernel<pmacc::math::Vector<double, 3u, pmacc::math::StandardAccessor, pmacc::math::StandardNavigat... | Single | 0 | 0 | 0 | 0 | 0 | | | | 0 | | | | 2 | | | | 15 | 10.91 | 1.62 | 1.68 | 13.18 | 0 | 0 | 0 | 0 | 0.75 | 2 | 0.5 | 0.5 | 0 | 1 | 0 | | | | | | |
1122 | picongpu - SetAttributeToDefault.hpp:47-47 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu27KernelFillGridWithParticlesINS5_9ParticlesINS0_4meta6StringIJLc101EEEEN5boost4mp117mp_listIJNS5_14particlePusherINS5_9particles6pusher5BorisENS0_13pmacc_isAliasEEENS5_5shapeINS... | Outermost | 0 | 0 | 0 | 0 | | 0 | 0.01 | 0 | | 0 | 0 | 0 | | 3 | 3 | 1 | 19.3 | 17 | 3.88 | 1 | 4.82 | 0 | 0 | 1 | 0 | NA | NA | NA | NA | NA | | | 1 | 0 | 1 | 0 | 1 | 0 |
1252 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu9particles11debyeLength25DebyeLengthEstimateKernelENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail2... | Single | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0.01 | 0 | 0.02 | 0.01 | 0.01 | 0 | 13 | 26 | 26 | 50 | 23.33 | 13.85 | 1.87 | 2.32 | 7 | 1 | 1 | 1 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
659 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 5 | 4 | 3 | 3 | 0 | 6.25 | 1 | 1 | 16 | 1 | 1 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
104 | picongpu - Mask.hpp:132-158 [...] | pmacc::Mask::getMirroredExchangeType(unsigned int) | Single | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | | 0 | 0 | 0 | | 1 | 1 | 1 | | 0 | 6.25 | 1 | 1 | 16 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | | |
569 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | _ZN6alpaka6detail15ParallelForImplIN5pmacc8lockstep4exec6detail14LockStepKernelINS2_21KernelInsertParticlesENS3_9WorkerCfgILj256EEEEENS_5trait11OmpScheduleISA_NS_16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEEvE19TraitNotSpecializedELNS_3omp8Schedule4Ki... | Single | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 2 | 9 | 2 | 14 | 25.93 | 16.67 | 6.08 | 1 | 4.06 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 3 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
428 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 | 2 | 4 | 3 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
3437 | picongpu - stl_tree.h:1967-1968 [...] | pmacc::Manager::getITaskIfNotFinished(unsigned long) const | Single | 0 | 0 | 0 | 0 | 0.86 | 0.82 | 0.76 | 0.89 | 0.07 | 0.03 | 0.03 | 0.02 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1.17 | -0 | 1.17 | -0 | 0.88 | 0 |
915 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | void alpaka::detail::ParallelForImpl<pmacc::lockstep::exec::detail::LockStepKernel<pmacc::fields::operations::KernelCopyGuardToExchange, pmacc::lockstep::WorkerCfg<256u> >, alpaka::trait::OmpSchedule<pmacc::lockstep::exec::detail::LockSte... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 7 | 17 | 13 | 17 | 20.59 | 13.97 | 7.33 | 1 | 4.89 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 2 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
583 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 5 | 5 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
681 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_25KernelCopyGuardToExchangeENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArra... | Single | 0 | 0 | 0 | 0 | 0.01 | 0 | | 0 | 0 | 0 | | 0 | 4 | 2 | | 2 | 0 | 6.25 | 1 | 1 | 16 | 1 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | | | 1 | 0 |
571 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_21KernelInsertParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj... | Single | 0 | 0 | 0 | 0 | 0 | | | 0 | 0 | | | 0 | 1 | | | 2 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | | | | | 1 | 0 |
801 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | void alpaka::detail::ParallelForImpl<pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::KernelAddCurrentDensity, pmacc::lockstep::WorkerCfg<256u> >, alpaka::trait::OmpSchedule<pmacc::lockstep::exec::detail::Loc... | Single | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 3 | 10 | 12 | 8 | 26.47 | 14.71 | 4.3 | 1 | 4.78 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 1 | 2 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
679 | picongpu - ParticlesBase.kernel:815-820 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_25KernelCopyGuardToExchangeENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArra... | Innermost | 0 | 0 | 0 | 0 | 0.05 | 0.03 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0 | 12 | 24 | 23 | 31 | 0 | 10.4 | 2.25 | 1 | 16.3 | 2.5 | 3 | 2 | 2 | 1 | 1 | 0 | 1.67 | 0.33 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
3082 | picongpu - Copy.hpp:114-114 [...] | void alpaka::memcpy<alpaka::Vec<std::integral_constant<unsigned long, 3ul>, unsigned long>, alpaka::ViewSubView<alpaka::DevCpu, unsigned char, std::integral_constant<unsigned long, 3ul>, unsigned long>, alpaka::ViewSubView<alp... | Outermost | 0 | 0 | 0 | 0 | 0 | 0.03 | 0.03 | 0.04 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 12.08 | 1 | 1 | 9.68 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1184 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Innermost | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 5 | 7 | 3 | 0 | 11.16 | 1 | 1 | 18.16 | 0 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
671 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0 | | | | 0 | | | | 1 | | | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | | | | | | |
657 | picongpu - Op.hpp:30-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0.08 | 0.05 | 0.05 | 0.04 | 0.06 | 0.03 | 0.03 | 0.01 | 13 | 26 | 26 | 49 | 0 | 4.3 | 1 | 1 | 24.57 | 1.33 | 1.67 | 1.67 | 4 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1.5 | -0 |
3434 | picongpu - stl_tree.h:1967-1968 [...] | pmacc::Manager::execute(unsigned long) | Innermost | 0 | 0 | 0 | 0 | 0.07 | 0.11 | 0.07 | 0.09 | 0.01 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
660 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 | 1 | 3 | 3 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
664 | picongpu - ParticlesBase.kernel:279-291 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Innermost | 0 | 0 | 0 | 0 | 0.11 | 0.07 | 0.03 | 0.04 | 0.03 | 0.01 | 0.01 | 0.01 | 12 | 16 | 17 | 25 | 0 | 10.16 | 2.5 | 1 | 15.32 | 3.67 | 3.5 | 3 | 4 | 2 | 1 | 0 | 2.5 | 1.5 | 1 | 0 | 1.5 | -0 | 1.5 | -0 | 0.75 | 0 |
661 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 4 | 1 | 3 | 5 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 1 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1198 | picongpu - ForEach.hpp:202-205 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Outermost | 0 | 0 | 0 | 0 | 0.07 | 0.05 | 0.04 | 0.03 | 0.05 | 0.03 | 0.02 | 0.01 | 13 | 26 | 26 | 48 | 12.5 | 9.77 | 5.94 | 1.14 | 2.25 | 1.4 | 1.67 | 2 | 3 | NA | NA | NA | NA | NA | 1 | 0 | 0.83 | 0 | 1.25 | -0 | 1.25 | -0 |
1610 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | void alpaka::detail::ParallelForImpl<pmacc::exec::detail::KernelWithDynSharedMem<pmacc::lockstep::exec::detail::LockStepKernel<pmacc::device::reduce::Kernel<pmacc::math::Vector<double, 3u, pmacc::math::StandardAccessor, pmacc::math::Standard... | Single | 0 | 0 | 0 | 0 | | | 0 | | | | 0 | | | | 1 | | 0 | 9.25 | 1 | 1 | 4 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 3 | | | | | 1 | 0 | | |
1044 | picongpu - TaskSetValue.hpp:83-89 [...] | _ZN6alpaka6detail15ParallelForImplIN5cupla19cupla_omp2_seq_sync11CuplaKernelIN5pmacc14KernelSetValueILj256EEEEENS_5trait11OmpScheduleIS8_NS_16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEEvE19TraitNotSpecializedELNS_3omp8Schedule4KindE0EEclIRZNKS_23TaskK... | Innermost | 0 | 0 | 0 | 0 | 0 | | | 0 | 0 | | | 0 | 1 | | | 1 | 69.57 | 19.29 | 1.31 | 1 | 5.25 | 0 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | | | | | 1 | 0 |
680 | picongpu - Op.hpp:24-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_25KernelCopyGuardToExchangeENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArra... | Innermost | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 8 | 13 | 15 | 16 | 0 | 5.27 | 1 | 1 | 34.4 | 1 | 1 | 1 | 1 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
673 | picongpu - TaskParticlesSend.hpp:57-69 [...] | pmacc::TaskParticlesSend<picongpu::Particles<pmacc::meta::String<(char)105>, boost::mp11::mp_list<picongpu::particlePusher<picongpu::particles::pusher::Boris, pmacc::pmacc_isAlias>, picongpu::shape<picongpu::particles::shapes::TSC, ... | Single | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 0 | | 1 | 1 | 1 | | 4.17 | 8.66 | 15.67 | 1 | 16.44 | 0 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | | |
683 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_25KernelCopyGuardToExchangeENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArra... | Single | 0 | 0 | 0 | 0 | | | 0 | 0 | | | 0 | 0 | | | 1 | 2 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 2 | 0 | | | | | 1 | 0 | 1 | 0 |
406 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Single | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 7 | 9 | 12 | 8 | 0 | 6.25 | 1 | 1 | 16 | 2 | 1 | 1 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1196 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Innermost | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 1 | 6 | 4 | 0 | 11.16 | 1 | 1 | 18.16 | 0 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
933 | picongpu - TaskSetValue.hpp:79-89 [...] | void alpaka::detail::ParallelForImpl<cupla::cupla_omp2_seq_sync::CuplaKernel<pmacc::KernelSetValue<256u> >, alpaka::trait::OmpSchedule<cupla::cupla_omp2_seq_sync::CuplaKernel<pmacc::KernelSetValue<256u> >, alpaka::AccCpuOmp2Bl... | Innermost | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0 | 13 | 24 | 24 | 23 | 7.18 | 8.94 | 3.06 | 1 | 13.87 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 0.5 | 0 | 1 | 0 | 0.5 | 0 | 0.5 | 0 | 1 | 0 |
913 | picongpu - charconv.h:94-100 | void pmacc::fields::operations::CopyGuardToExchange::operator()<pmacc::GridBuffer<pmacc::math::Vector<float, 3u, pmacc::math::StandardAccessor, pmacc::math::StandardNavigator, pmacc::math::detail::Vector_components<float, 3u> >, 3u, pmacc... | Single | 0 | 0 | 0 | 0 | | | | 0 | | | | 0 | | | | 1 | 0 | 10.71 | 1 | 1 | 12.95 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | | | | | | | 1 | 0 |
1181 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Innermost | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 7 | 4 | 4 | 0 | 11.16 | 1 | 1 | 18.16 | 0 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1481 | picongpu - Kernel.hpp:190-193 [...] | void std::__invoke_impl<void, pmacc::exec::detail::KernelWithDynSharedMem<pmacc::lockstep::exec::detail::LockStepKernel<pmacc::device::reduce::Kernel<pmacc::math::Vector<double, 3u, pmacc::math::StandardAccessor, pmacc::math::StandardNavigat... | Single | 0 | 0 | 0 | 0 | | | | 0 | | | | 0 | | | | 1 | 16.67 | 11.35 | 1.53 | 1.65 | 12.9 | 0 | 0 | 0 | 0 | 0.75 | 2 | 0.5 | 0.5 | 0 | | | | | | | 1 | 0 |
3414 | picongpu - stl_algobase.h:2055-2055 [...] | pmacc::DataConnector::findId(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.02 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 9.65 | 1 | 1 | 26.43 | 1 | 1 | 1 | 1 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
727 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0 | 0 | 0 | 0 | 0 | | | | 0 | | | | 1 | | | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | | | | | | |
1163 | picongpu - Op.hpp:30-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | | | | 0 | | | | 0 | | | | 1 | 0 | 4.3 | 1 | 1 | 24.57 | 0 | 0 | 0 | 0 | NA | NA | NA | NA | NA | | | | | | | 1 | 0 |
3081 | picongpu - NdLoop.hpp:42-42 [...] | void alpaka::memcpy<alpaka::Vec<std::integral_constant<unsigned long, 3ul>, unsigned long>, alpaka::ViewSubView<alpaka::DevCpu, unsigned char, std::integral_constant<unsigned long, 3ul>, unsigned long>, alpaka::ViewSubView<alp... | Innermost | 0 | 0 | 0 | 0 | 0.33 | 0.72 | 0.44 | 0.51 | 0.03 | 0.03 | 0.02 | 0.01 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0.5 | 0 | 0.75 | 0 | 0.75 | 0 |
854 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | void alpaka::detail::ParallelForImpl<pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> >, alpaka::trait::OmpSchedule<pmacc::lockstep::exec::detail::Loc... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 4 | 8 | 6 | 8 | 25 | 14.73 | 6.67 | 1 | 4.44 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 3 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1189 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Outermost | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 12 | 13 | 15 | 16 | 25 | 10.16 | 1.71 | 2.56 | 12.67 | 1 | 1 | 1 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
267 | picongpu - StrideMapping.hpp:119-127 [...] | pmacc::StrideMapping<3u, 3u, pmacc::MappingDescription<3u, pmacc::math::CT::Vector<std::integral_constant<int, 8>, std::integral_constant<int, 8>, std::integral_constant<int, 4> > > >::next() | Single | 0 | 0 | 0 | 0 | 0 | 0 | | | 0 | 0 | | | 1 | 1 | | | 52.94 | 17.65 | 1 | 1 | 9.81 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | | | | |
1173 | picongpu - Op.hpp:24-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Innermost | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 5 | 9 | 7 | 6 | 0 | 5.6 | 1 | 1 | 21.5 | 1 | 0 | 0 | 1 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
255 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | _ZN6alpaka6detail15ParallelForImplIN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS2_20SuperCellDescriptionINS2_4math2CT6VectorISt17integral_constantIiLi8EESE_SD_IiLi4EEEENSC_ISD_IiLi1EESH_SH_EENSC_ISD_IiLi2EESJ_SJ_EEE... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0 | 9 | 16 | 18 | 21 | 27.27 | 16.48 | 5.38 | 1 | 4.78 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 2 | 1 | 0 | 1 | 0 | 0.5 | 0 | 1 | 0 |
1133 | picongpu - ParticlesBase.kernel:124-128 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 0 | | 0 | 1 | 1 | | 2 | 0 | 6.48 | 1 | 1 | 33.66 | 0 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | | | 1 | 0 |
1183 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Outermost | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 12 | 19 | 18 | 25 | 25 | 10.16 | 1.71 | 2.56 | 12.67 | 1 | 1 | 1 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
101 | picongpu - Mask.hpp:132-158 [...] | pmacc::Mask::getMirroredExchangeType(unsigned int) | Single | 0 | 0 | 0 | 0 | | 0 | | 0.01 | | 0 | | 0 | | 1 | | 1 | 0 | 7.14 | 1 | 1 | 14.42 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | | | 1 | 0 | | | 1 | 0 |
48 | picongpu - optional.hpp:204-411 [...] | boost::io::detail::format_item<char, std::char_traits<char>, std::allocator<char> >* std::__uninitialized_fill_n_a<boost::io::detail::format_item<char, std::char_traits<char>, std::allocator<char> >*, unsigned long, b... | Single | 0 | 0 | 0 | 0 | | | 0.01 | | | | 0 | | | | 1 | | 16.28 | 13.84 | 3 | 1 | 12.41 | 0 | 0 | 1 | 0 | NA | NA | NA | NA | NA | | | | | 1 | 0 | | |
605 | picongpu - ParticlesBase.kernel:815-820 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_25KernelCopyGuardToExchangeENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArra... | Innermost | 0 | 0 | 0 | 0 | 0.05 | 0.05 | 0.04 | 0.03 | 0.04 | 0.03 | 0.02 | 0.01 | 13 | 26 | 25 | 46 | 0 | 10.4 | 2.25 | 1 | 16.3 | 1.25 | 1.67 | 2 | 3 | 1 | 1 | 0 | 1.67 | 0.33 | 1 | 0 | 0.67 | 0 | 1 | 0 | 1 | 0 |
427 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 3 | 3 | 7 | 3 | 0 | 6.25 | 1 | 1 | 16 | 1 | 1 | 0 | 1 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1479 | picongpu - Kernel.hpp:190-193 [...] | void std::__invoke_impl<void, pmacc::exec::detail::KernelWithDynSharedMem<pmacc::lockstep::exec::detail::LockStepKernel<pmacc::device::reduce::Kernel<pmacc::math::Vector<double, 3u, pmacc::math::StandardAccessor, pmacc::math::StandardNavigat... | Single | 0 | 0 | 0 | 0 | | 0 | | | | 0 | | | | 1 | | | 15 | 11.33 | 1.64 | 1.69 | 13.08 | 0 | 0 | 0 | 0 | 0.75 | 2 | 0.5 | 0.5 | 0 | | | 1 | 0 | | | | |
415 | picongpu - Op.hpp:30-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.11 | 0.07 | 0.06 | 0.05 | 0.07 | 0.04 | 0.04 | 0.02 | 13 | 26 | 26 | 50 | 0 | 4.3 | 1 | 1 | 24.57 | 1.57 | 1.75 | 1.5 | 2.5 | NA | NA | NA | NA | NA | 1 | 0 | 0.88 | 0 | 0.88 | 0 | 0.88 | 0 |
3553 | picongpu - stl_tree.h:1951-1952 [...] | cupla::cupla_omp2_seq_sync::manager::Event<alpaka::DevCpu, alpaka::QueueGenericThreadsBlocking<alpaka::DevCpu> >::event(void*) | Single | 0 | 0 | 0 | 0 | 0.59 | 0.48 | 0.53 | 0.62 | 0.05 | 0.02 | 0.02 | 0.01 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1.25 | -0 | 1.25 | -0 | 1.25 | -0 |
572 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_21KernelInsertParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj... | Single | 0 | 0 | 0 | 0 | | | | 0 | | | | 0 | | | | 1 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | | | | | | | 1 | 0 |
588 | picongpu - ParticlesBase.kernel:265-269 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Innermost | 0 | 0 | 0 | 0 | 0.02 | 0.04 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 12 | 20 | 19 | 31 | 0 | 4.79 | 1 | 1 | 24 | 2 | 4 | 3 | 2 | NA | NA | NA | NA | NA | 1 | 0 | 0.5 | 0 | 0.5 | 0 | 0.25 | 0 |
582 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 5 | 4 | 4 | 8 | 0 | 6.25 | 1 | 1 | 16 | 1 | 0 | 1 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
607 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_25KernelCopyGuardToExchangeENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArra... | Single | 0 | 0 | 0 | 0 | 0 | | 0 | | 0 | | 0 | | 2 | | 2 | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | | | 1 | 0 | | |
592 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | | 0 | | | | 0 | | | | 1 | | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | | | 1 | 0 | | | | |
678 | picongpu - ParticlesBase.kernel:768-833 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_25KernelCopyGuardToExchangeENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArra... | Outermost | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 6 | 8 | 9 | 19 | 40.43 | 24 | 1.88 | 1 | 3.45 | 0 | 1 | 1 | 1 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
578 | picongpu - ParticlesBase.kernel:138-153 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0.08 | 0.05 | 0.04 | 0.04 | 0.05 | 0.03 | 0.03 | 0.01 | 13 | 26 | 26 | 46 | 0 | 7.62 | 2.32 | 1 | 20.44 | 1.6 | 1.67 | 1.33 | 4 | 3 | 1 | 0 | 4.5 | 0 | 1 | 0 | 0.83 | 0 | 0.83 | 0 | 1.25 | -0 |
790 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | void alpaka::detail::ParallelForImpl<pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::KernelAddCurrentDensity, pmacc::lockstep::WorkerCfg<256u> >, alpaka::trait::OmpSchedule<pmacc::lockstep::exec::detail::Loc... | Outermost | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0.01 | 0.02 | 0.01 | 0 | 0 | 0 | 10 | 13 | 14 | 20 | 14.04 | 12.75 | 4.82 | 1 | 8.75 | 2 | 1 | 1 | 2 | 1 | 1 | 2.75 | 9.5 | 2 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1075 | picongpu - TaskSetValue.hpp:79-89 [...] | _ZN6alpaka6detail15ParallelForImplIN5cupla19cupla_omp2_seq_sync11CuplaKernelIN5pmacc14KernelSetValueILj256EEEEENS_5trait11OmpScheduleIS8_NS_16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEEvE19TraitNotSpecializedELNS_3omp8Schedule4KindE0EEclIRZNKS_23TaskK... | Innermost | 0 | 0 | 0 | 0 | | 0 | | 0 | | 0 | | 0 | | 1 | | 1 | 12.18 | 10.12 | 2.95 | 1 | 11.2 | 0 | 0 | 0 | 0 | 2.5 | 0 | 0 | 0.5 | 0 | | | 1 | 0 | | | 1 | 0 |
670 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | | 0 | 0 | | | 0 | 0 | | | 1 | 1 | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | | | 1 | 0 | 1 | 0 | | |
1083 | picongpu - TaskSetValue.hpp:83-89 [...] | _ZN6alpaka6detail15ParallelForImplIN5cupla19cupla_omp2_seq_sync11CuplaKernelIN5pmacc14KernelSetValueILj256EEEEENS_5trait11OmpScheduleIS8_NS_16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEEvE19TraitNotSpecializedELNS_3omp8Schedule4KindE0EEclIRZNKS_23TaskK... | Innermost | 0 | 0 | 0 | 0 | | 0 | 0 | | | 0 | 0 | | | 2 | 1 | | 69.57 | 19.29 | 1.31 | 1 | 5.25 | 0 | 0 | 0 | 0 | NA | NA | NA | NA | NA | | | 1 | 0 | 1 | 0 | | |
1193 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Innermost | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 | 2 | 3 | 0 | 11.16 | 1 | 1 | 18.16 | 0 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
907 | picongpu - charconv.h:94-100 | void pmacc::fields::operations::AddExchangeToBorder::operator()<pmacc::GridBuffer<pmacc::math::Vector<float, 3u, pmacc::math::StandardAccessor, pmacc::math::StandardNavigator, pmacc::math::detail::Vector_components<float, 3u> >, 3u, pmacc... | Single | 0 | 0 | 0 | 0 | | | 0 | | | | 0 | | | | 1 | | 0 | 10.71 | 1 | 1 | 12.95 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | | | | | 1 | 0 | | |
439 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 4 | 3 | 2 | 0 | 6.25 | 1 | 1 | 16 | 0 | 1 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
587 | picongpu - ParticlesBase.kernel:279-291 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Innermost | 0 | 0 | 0 | 0 | 0.13 | 0.12 | 0.07 | 0.05 | 0.04 | 0.03 | 0.02 | 0.01 | 13 | 26 | 19 | 36 | 0 | 10.16 | 2.5 | 1 | 15.32 | 3.25 | 4 | 2.33 | 2.5 | 2 | 1 | 0 | 2.5 | 1.5 | 1 | 0 | 0.67 | 0 | 1 | 0 | 1 | 0 |
1202 | picongpu - random.tcc:1827-3368 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | InBetween | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0.01 | 0.02 | 0.01 | 0 | 0.01 | 0 | 13 | 14 | 17 | 22 | 20.69 | 10.85 | 1.45 | 1.36 | 13.27 | 2 | 1 | 1 | 2 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 0.5 | 0 | 1 | 0 |
539 | picongpu - TaskParticlesReceive.hpp:58-70 [...] | pmacc::TaskParticlesReceive<picongpu::Particles<pmacc::meta::String<(char)101>, boost::mp11::mp_list<picongpu::particlePusher<picongpu::particles::pusher::Boris, pmacc::pmacc_isAlias>, picongpu::shape<picongpu::particles::shapes::TS... | Single | 0 | 0 | 0 | 0 | 0.01 | 0 | | | 0 | 0 | | | 1 | 1 | | | 0 | 7.91 | 1 | 1 | 17.59 | 1 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | | | | |
577 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | _ZN6alpaka6detail15ParallelForImplIN5pmacc8lockstep4exec6detail14LockStepKernelINS2_14KernelFillGapsENS3_9WorkerCfgILj256EEEEENS_5trait11OmpScheduleISA_NS_16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEEvE19TraitNotSpecializedELNS_3omp8Schedule4KindE0EEc... | Single | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 7 | 7 | 8 | 9 | 20 | 16 | 7.2 | 1 | 4 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 3 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
3442 | picongpu - stl_tree.h:1967-1968 [...] | pmacc::Manager::waitForFinished(unsigned long) | Single | 0 | 0 | 0 | 0 | 0.12 | 0.09 | 0.1 | 0.13 | 0.01 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1144 | picongpu - Op.hpp:24-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Innermost | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 | 6 | 4 | 12 | 0 | 5.6 | 1 | 1 | 21.5 | 1 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
3446 | picongpu - stl_tree.h:1951-1952 [...] | pmacc::Manager::addPassiveTask(pmacc::ITask*) | Single | 0 | 0 | 0 | 0 | 0.03 | 0.05 | 0.06 | 0.07 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
585 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 0 | | 0 | 2 | 2 | | 2 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | | | 1 | 0 |
1222 | picongpu - ForEach.hpp:202-279 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Outermost | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 | 15 | 15 | 9 | 4.17 | 7.36 | 1.92 | 1 | 24.86 | 1 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
3436 | picongpu - stl_tree.h:1967-1968 [...] | pmacc::Manager::getITaskIfNotFinished(unsigned long) const | Single | 0 | 0 | 0 | 0 | 0.21 | 0.2 | 0.18 | 0.25 | 0.02 | 0.01 | 0.01 | 0 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
3452 | picongpu - stl_tree.h:1951-1952 [...] | std::map<unsigned long, pmacc::ITask*, std::less<unsigned long>, std::allocator<std::pair<unsigned long const, pmacc::ITask*> > >::erase(unsigned long const&) | Single | 0 | 0 | 0 | 0 | 0.02 | 0.04 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
265 | picongpu - charconv.h:94-100 | void pmacc::ParticlesBase<pmacc::ParticleDescription<pmacc::meta::String<(char)101>, pmacc::math::CT::Vector<std::integral_constant<int, 8>, std::integral_constant<int, 8>, std::integral_constant<int, 4> >, boost::mp11::... | Innermost | 0 | 0 | 0 | 0 | | | | 0 | | | | 0 | | | | 1 | 0 | 12.5 | 1 | 1 | 8 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | | | | | | | 1 | 0 |
63 | picongpu - stl_tree.h:782-1936 [...] | std::_Rb_tree<pmacc::IEvent*, pmacc::IEvent*, std::_Identity<pmacc::IEvent*>, std::less<pmacc::IEvent*>, std::allocator<pmacc::IEvent*> >::_M_erase(std::_Rb_tree_node<pmacc::IEvent*>*) | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
92 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-898 [...] | void alpaka::detail::ParallelForImpl<cupla::cupla_omp2_seq_sync::CuplaKernel<pmacc::KernelSetValueOnDeviceMemory>, alpaka::trait::OmpSchedule<cupla::cupla_omp2_seq_sync::CuplaKernel<pmacc::KernelSetValueOnDeviceMemory>, alpaka::AccCpuOmp2... | Single | 0 | 0 | 0 | 0 | | | 0.01 | | | | 0 | | | | 1 | | 0 | 7.81 | 1 | 1 | 4 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 4 | | | | | 1 | 0 | | |
275 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 6 | 12 | 13 | 9 | 0 | 6.25 | 1 | 1 | 16 | 1 | 1 | 2 | 1 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
603 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | _ZN6alpaka6detail15ParallelForImplIN5pmacc8lockstep4exec6detail14LockStepKernelINS2_25KernelCopyGuardToExchangeENS3_9WorkerCfgILj256EEEEENS_5trait11OmpScheduleISA_NS_16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEEvE19TraitNotSpecializedELNS_3omp8Schedul... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.03 | 0 | 0 | 0 | 0 | 4 | 10 | 11 | 12 | 31.03 | 20.69 | 4.69 | 1 | 4.17 | 1 | 1 | 1 | 3 | 1 | 0 | 1 | 0 | 3 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
287 | picongpu - SuperCell.hpp:89-94 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.04 | 0.04 | 0.02 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 13 | 24 | 21 | 43 | 5.88 | 8.09 | 3.35 | 1 | 15.18 | 2 | 4 | 2 | 3 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0 |
409 | picongpu - Stream.hpp:53-53 [...] | void pmacc::ParticlesBase<pmacc::ParticleDescription<pmacc::meta::String<(char)105>, pmacc::math::CT::Vector<std::integral_constant<int, 8>, std::integral_constant<int, 8>, std::integral_constant<int, 4> >, boost::mp11::... | Outermost | 0 | 0 | 0 | 0 | | 0.01 | 0.02 | 0.01 | | 0 | 0 | 0 | | 1 | 1 | 1 | 14.54 | 14 | 5.27 | 1 | 11.75 | 0 | 1 | 1 | 1 | NA | NA | NA | NA | NA | | | 1 | 0 | 1 | 0 | 1 | 0 |
655 | picongpu - ParticlesBase.kernel:138-153 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0.07 | 0.05 | 0.04 | 0.03 | 0.05 | 0.03 | 0.02 | 0.01 | 13 | 26 | 26 | 46 | 0 | 7.62 | 2.32 | 1 | 20.44 | 1.4 | 1.67 | 2 | 3 | 3 | 1 | 0 | 4.5 | 0 | 1 | 0 | 0.83 | 0 | 1.25 | -0 | 1.25 | -0 |
431 | picongpu - SuperCell.hpp:89-94 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0.03 | 0.02 | 0.01 | 0 | 0 | 0 | 9 | 13 | 16 | 24 | 5.88 | 8.09 | 3.35 | 1 | 15.18 | 2 | 1 | 3 | 2 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
421 | picongpu - FramePointer.hpp:55-77 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Outermost | 0 | 0 | 0 | 0 | 0.07 | 0.04 | 0.04 | 0.03 | 0.05 | 0.02 | 0.02 | 0.01 | 13 | 25 | 26 | 44 | 12.82 | 11.9 | 4.47 | 1 | 21.19 | 1.4 | 2 | 2 | 3 | NA | NA | NA | NA | NA | 1 | 0 | 1.25 | -0 | 1.25 | -0 | 1.25 | -0 |
285 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 9 | 7 | 7 | 6 | 0 | 6.25 | 1 | 1 | 16 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
842 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | void alpaka::detail::ParallelForImpl<pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> >, alpaka::trait::OmpSchedule<pmacc::lockstep::exec::detail::Loc... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 9 | 10 | 9 | 7 | 25 | 14.73 | 6.67 | 1 | 4.44 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 3 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
777 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | _ZN6alpaka6detail15ParallelForImplIN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS2_20SuperCellDescriptionINS2_4math2CT6VectorISt17integral_constantIiLi8EESF_SE_IiLi4EEEENSD_ISE_IiLi2EESI_SI_EENSD_ISE_IiLi3EE... | Single | 0 | 0 | 0 | 0 | 0.05 | 0.04 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 12 | 24 | 20 | 37 | 29.03 | 18.35 | 5 | 1 | 4.44 | 2.5 | 2 | 2 | 2 | 1 | 0 | 1 | 0 | 3 | 1 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0 |
429 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 3 | 6 | 3 | 2 | 0 | 6.25 | 1 | 1 | 16 | 1 | 1 | 1 | 1 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
297 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 0 | | 1 | 1 | 1 | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | | |
1180 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Outermost | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 12 | 13 | 16 | 17 | 25 | 10.16 | 1.71 | 2.56 | 12.67 | 1 | 1 | 1 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
682 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_25KernelCopyGuardToExchangeENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArra... | Single | 0 | 0 | 0 | 0 | | 0 | 0 | 0 | | 0 | 0 | 0 | | 2 | 1 | 1 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 2 | 0 | | | 1 | 0 | 1 | 0 | 1 | 0 |
417 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 7 | 8 | 8 | 12 | 0 | 6.25 | 1 | 1 | 16 | 1 | 1 | 1 | 1 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
774 | picongpu - Stream.hpp:53-53 [...] | _ZNK8picongpu13currentSolver7DepositINS0_8strategy23StridedCachedSupercellsEvE7executeILj3EN5pmacc18MappingDescriptionILj3ENS6_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEEEENS0_20KernelComputeCurrentINS6_20SuperCellDescriptionISE_NSA_ISB_IiL... | Outermost | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 18.69 | 15.61 | 4.87 | 1 | 10.16 | 1 | 1 | 1 | 1 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1201 | picongpu - random.tcc:1827-3368 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | InBetween | 0 | 0 | 0 | 0 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 11 | 19 | 20 | 23 | 23.81 | 11.61 | 1.54 | 1.23 | 13 | 2 | 2 | 1 | 1 | NA | NA | NA | NA | NA | 1 | 0 | 0.5 | 0 | 0.5 | 0 | 1 | 0 |
296 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | 1 | 1 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
420 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.02 | 0.02 | 0.01 | 0.01 | 0 | 0.01 | 0 | 10 | 12 | 18 | 19 | 0 | 6.25 | 1 | 1 | 16 | 1 | 2 | 2 | 1 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0 | 1 | 0 |
432 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0.02 | 0.01 | 0.01 | 0 | 0 | 0 | 10 | 11 | 16 | 18 | 100 | 50 | 1 | 1 | 2 | 2 | 1 | 2 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
418 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 5 | 5 | 6 | 4 | 0 | 6.25 | 1 | 1 | 16 | 1 | 0 | 1 | 1 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
260 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Single | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.02 | 0 | 0.01 | 0 | 0 | 0 | 7 | 6 | 7 | 11 | 0 | 6.25 | 1 | 1 | 16 | 1 | 0 | 2 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
102 | picongpu - Mask.hpp:132-158 [...] | pmacc::Mask::getMirroredExchangeType(unsigned int) | Single | 0 | 0 | 0 | 0 | | 0.01 | | | | 0 | | | | 1 | | | 0 | 7.14 | 1 | 1 | 14.42 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | | 1 | 0 | | | | |
586 | picongpu - FramePointer.hpp:55-77 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Outermost | 0 | 0 | 0 | 0 | 0.03 | 0.02 | 0.03 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 13 | 18 | 24 | 35 | 16.33 | 13.04 | 4.59 | 1 | 16.8 | 1.5 | 2 | 3 | 2 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0 |
430 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 4 | 5 | 7 | 6 | 0 | 6.25 | 1 | 1 | 16 | 1 | 0 | 1 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
3441 | picongpu - Manager.cpp:158-158 [...] | pmacc::Manager::waitForFinished(unsigned long) | Outermost | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 2 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
608 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_25KernelCopyGuardToExchangeENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArra... | Single | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | 1 | 3 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 2 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
684 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_25KernelCopyGuardToExchangeENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArra... | Single | 0 | 0 | 0 | 0 | 0 | | | | 0 | | | | 1 | | | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | | | | | | |
610 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_25KernelCopyGuardToExchangeENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArra... | Single | 0 | 0 | 0 | 0 | | 0 | | | | 0 | | | | 1 | | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | | | 1 | 0 | | | | |
594 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | | | 0 | | | | 0 | | | | 1 | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | | | | | 1 | 0 | | |
1247 | picongpu - AtomicOmpBuiltIn.hpp:43-43 | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu9particles11debyeLength25DebyeLengthEstimateKernelENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail2... | Single | 0 | 0 | 0 | 0 | | | 0 | | | | 0 | | | | 1 | | 0 | 12.5 | 1 | 8 | 8 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | | | | | 1 | 0 | | |
100 | picongpu - Mask.hpp:132-158 [...] | pmacc::Mask::getMirroredExchangeType(unsigned int) | Single | 0 | 0 | 0 | 0 | | 0 | 0 | 0 | | 0 | 0 | 0 | | 1 | 1 | 1 | 0 | 7.14 | 1 | 1 | 14.42 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | | 1 | 0 | 1 | 0 | 1 | 0 |
656 | picongpu - ParticlesBase.kernel:124-128 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0.07 | 0.06 | 0.04 | 0.03 | 0.05 | 0.03 | 0.02 | 0.01 | 13 | 24 | 26 | 42 | 0 | 7.95 | 1 | 1 | 23.19 | 1.4 | 2 | 2 | 3 | NA | NA | NA | NA | NA | 1 | 0 | 0.83 | 0 | 1.25 | -0 | 1.25 | -0 |
400 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | _ZN6alpaka6detail15ParallelForImplIN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS2_20SuperCellDescriptionINS2_4math2CT6VectorISt17integral_constantIiLi8EESE_SD_IiLi4EEEENSC_ISD_IiLi1EESH_SH_EENSC_ISD_IiLi2EESJ_SJ_EEE... | Single | 0 | 0 | 0 | 0 | 0.05 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0 | 13 | 25 | 22 | 31 | 27.27 | 16.48 | 5.38 | 1 | 4.78 | 2.5 | 2 | 2 | 2 | 1 | 0 | 1 | 1 | 2 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1190 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Innermost | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 5 | 7 | 7 | 4 | 0 | 11.16 | 1 | 1 | 18.16 | 1 | 0 | 1 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
3457 | picongpu - EventNotify.cpp:30-33 [...] | pmacc::EventNotify::notify(unsigned long, pmacc::eventSystem::EventType, pmacc::IEventData*) | Single | 0 | 0 | 0 | 0 | 0.1 | 0.12 | 0.16 | 0.16 | 0.01 | 0 | 0.01 | 0 | 1 | 1 | 1 | 1 | 0 | 11.36 | 1 | 1 | 13.25 | 1 | 1 | 1 | 1 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 0.5 | 0 | 1 | 0 |
764 | picongpu - Stream.hpp:53-53 [...] | _ZNK8picongpu13currentSolver7DepositINS0_8strategy23StridedCachedSupercellsEvE7executeILj3EN5pmacc18MappingDescriptionILj3ENS6_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEEEENS0_20KernelComputeCurrentINS6_20SuperCellDescriptionISE_NSA_ISB_IiL... | Outermost | 0 | 0 | 0 | 0 | 0.02 | 0.02 | 0.03 | 0.01 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 18.69 | 15.61 | 4.87 | 1 | 10.16 | 1 | 1 | 1 | 1 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
857 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0 | 0 | 0 | 0 | | 0 | | | | 0 | | | | 2 | | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | | | 1 | 0 | | | | |
276 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0 | 13 | 22 | 19 | 26 | 0 | 6.25 | 1 | 1 | 16 | 2 | 1 | 2 | 2 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 0.5 | 0 | 0.5 | 0 | 1 | 0 |
286 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 5 | 14 | 12 | 6 | 0 | 6.25 | 1 | 1 | 16 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
754 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 0 | | 2 | 1 | 1 | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | | |
654 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | _ZN6alpaka6detail15ParallelForImplIN5pmacc8lockstep4exec6detail14LockStepKernelINS2_14KernelFillGapsENS3_9WorkerCfgILj256EEEEENS_5trait11OmpScheduleISA_NS_16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEEvE19TraitNotSpecializedELNS_3omp8Schedule4KindE0EEc... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 5 | 12 | 8 | 12 | 20 | 16 | 7.2 | 1 | 4 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 3 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
629 | picongpu - TaskParticlesReceive.hpp:58-70 [...] | pmacc::TaskParticlesReceive<picongpu::Particles<pmacc::meta::String<(char)105>, boost::mp11::mp_list<picongpu::particlePusher<picongpu::particles::pusher::Boris, pmacc::pmacc_isAlias>, picongpu::shape<picongpu::particles::shapes::TS... | Single | 0 | 0 | 0 | 0 | | 0.01 | | 0 | | 0 | | 0 | | 1 | | 1 | 0 | 7.91 | 1 | 1 | 17.59 | 0 | 1 | 0 | 0 | NA | NA | NA | NA | NA | | | 1 | 0 | | | 1 | 0 |
668 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 0 | | 0 | 0 | 2 | | 2 | 5 | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | | | 1 | 0 | 1 | 0 |
1209 | picongpu - tinymt32.h:105-136 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Innermost | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 11 | 14 | 16 | 18 | 0 | 6.62 | 2.41 | 5.2 | 15.59 | 2 | 1 | 1 | 1 | 0.5 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1211 | picongpu - tinymt32.h:105-136 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Innermost | 0 | 0 | 0 | 0 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 9 | 9 | 16 | 13 | 0 | 6.56 | 2.41 | 4.89 | 15.59 | 3 | 1 | 1 | 1 | 0.5 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
282 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 7 | 8 | 6 | 13 | 0 | 6.25 | 1 | 1 | 16 | 1 | 0 | 1 | 1 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
407 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Single | 0 | 0 | 0 | 0 | | 0 | 0 | | | 0 | 0 | | | 1 | 1 | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | | | 1 | 0 | 1 | 0 | | |
662 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 3 | 2 | 1 | 0 | 6.25 | 1 | 1 | 16 | 1 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
762 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0 | 0 | 0 | 0 | 0 | | | | 0 | | | | 1 | | | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | | | | | | |
272 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 9 | 13 | 17 | 16 | 0 | 6.25 | 1 | 1 | 16 | 1 | 1 | 1 | 1 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1485 | picongpu - Kernel.hpp:155-166 [...] | void std::__invoke_impl<void, pmacc::exec::detail::KernelWithDynSharedMem<pmacc::lockstep::exec::detail::LockStepKernel<pmacc::device::reduce::Kernel<pmacc::math::Vector<double, 3u, pmacc::math::StandardAccessor, pmacc::math::StandardNavigat... | Outermost | 0 | 0 | 0 | 0 | 0.02 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 7 | 5 | 4 | 6 | 6.46 | 11.12 | 3.7 | 1.72 | 7.07 | 2 | 0 | 1 | 0 | 0.67 | 1.67 | 0 | 2.33 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
1195 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Outermost | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 10 | 17 | 17 | 18 | 25 | 10.16 | 1.71 | 2.56 | 12.67 | 1 | 1 | 1 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
658 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 5 | 6 | 5 | 4 | 0 | 6.25 | 1 | 1 | 16 | 1 | 0 | 1 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
782 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Single | 0 | 0 | 0 | 0 | 0 | | 0.01 | | 0 | | 0 | | 2 | | 2 | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 1 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | | | 1 | 0 | | |
1187 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Innermost | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 4 | 4 | 4 | 0 | 11.16 | 1 | 1 | 18.16 | 0 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
405 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Single | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 | 3 | 3 | 6 | 0 | 6.25 | 1 | 1 | 16 | 1 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
66 | picongpu - stl_tree.h:1951-1952 [...] | cupla::cupla_omp2_seq_sync::manager::Stream<alpaka::DevCpu, alpaka::QueueGenericThreadsBlocking<alpaka::DevCpu> >::stream(void*) | Single | 0 | 0 | 0 | 0 | 0 | 0.03 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
772 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Single | 0 | 0 | 0 | 0 | 0 | | 0 | | 0 | | 0 | | 1 | | 1 | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | | | 1 | 0 | | |
438 | picongpu - ParticlesBase.kernel:440-629 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0 | 0 | 0 | 0 | 0.09 | 0.06 | 0.06 | 0.05 | 0.07 | 0.04 | 0.03 | 0.02 | 13 | 26 | 26 | 50 | 4 | 8.25 | 3.58 | 1 | 14.86 | 1.29 | 1.5 | 2 | 2.5 | NA | NA | NA | NA | NA | 1 | 0 | 0.88 | 0 | 1.17 | -0 | 0.88 | 0 |
580 | picongpu - Op.hpp:30-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | 0.08 | 0.05 | 0.05 | 0.03 | 0.05 | 0.02 | 0.02 | 0.01 | 13 | 25 | 26 | 47 | 0 | 4.3 | 1 | 1 | 24.57 | 1.6 | 2.5 | 2.5 | 3 | NA | NA | NA | NA | NA | 1 | 0 | 1.25 | -0 | 1.25 | -0 | 1.25 | -0 |
570 | picongpu - CopyIdentifier.hpp:34-34 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_21KernelInsertParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj... | Single | 0 | 0 | 0 | 0 | 0.04 | 0.04 | 0.04 | 0.04 | 0.03 | 0.02 | 0.02 | 0.01 | 13 | 25 | 24 | 43 | 0 | 10.4 | 2.75 | 1 | 13.2 | 1.33 | 2 | 2 | 4 | 2 | 0 | 0 | 1 | 2 | 1 | 0 | 0.75 | 0 | 0.75 | 0 | 0.75 | 0 |
596 | picongpu - TaskParticlesSend.hpp:57-69 [...] | pmacc::TaskParticlesSend<picongpu::Particles<pmacc::meta::String<(char)101>, boost::mp11::mp_list<picongpu::particlePusher<picongpu::particles::pusher::Boris, pmacc::pmacc_isAlias>, picongpu::shape<picongpu::particles::shapes::TSC, ... | Single | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0 | | 0 | 0 | 0 | | 1 | 1 | 1 | | 4.17 | 8.66 | 15.67 | 1 | 16.44 | 1 | 1 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | | |
1213 | picongpu - tinymt32.h:105-136 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Innermost | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0 | 0 | 0 | 9 | 15 | 15 | 19 | 0 | 6.62 | 2.41 | 5.2 | 15.59 | 1 | 1 | 1 | 2 | 0.5 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
3451 | picongpu - stl_tree.h:1983-1984 [...] | std::map<unsigned long, pmacc::ITask*, std::less<unsigned long>, std::allocator<std::pair<unsigned long const, pmacc::ITask*> > >::erase(unsigned long const&) | Single | 0 | 0 | 0 | 0 | 0.05 | 0.05 | 0.04 | 0.04 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
3433 | picongpu - Manager.cpp:64-132 [...] | pmacc::Manager::execute(unsigned long) | Outermost | 0 | 0 | 0 | 0 | 0.22 | 0.34 | 0.27 | 0.34 | 0.02 | 0.01 | 0.01 | 0.01 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1 | 1 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0 |
904 | picongpu - Manager.hpp:98-109 [...] | pmacc::TaskFieldReceiveAndInsert<picongpu::FieldJ>::init() | Single | 0 | 0 | 0 | 0 | 0.01 | | 0 | | 0 | | 0 | | 1 | | 1 | | 0 | 7.96 | 1 | 1 | 17.92 | 1 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | | | 1 | 0 | | |
40 | picongpu - locale_facets.h:941-945 [...] | bool boost::io::detail::parse_printf_directive<char, std::char_traits<char>, std::allocator<char>, __gnu_cxx::__normal_iterator<char const*, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >... | Single | 0 | 0 | 0 | 0 | | 0 | | | | 0 | | | | 1 | | | 0 | 7.32 | 1 | 1 | 42.12 | 0 | 0 | 0 | 0 | 3 | 1 | 1.67 | 0 | 0.67 | | | 1 | 0 | | | | |
1165 | picongpu - BlockSharedMemStMemberImpl.hpp:85-95 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Single | 0 | 0 | 0 | 0 | | 0 | | | | 0 | | | | 1 | | | 0 | 6.25 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | | | 1 | 0 | | | | |
3459 | picongpu - stl_tree.h:2115-2119 [...] | pmacc::TaskLogicalAnd::TaskLogicalAnd(pmacc::ITask*, pmacc::ITask*) | Single | 0 | 0 | 0 | 0 | 0.01 | 0.01 | | | 0 | 0 | | | 1 | 1 | | | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 1 | 0 | 1 | 0 | | | | |
1223 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Innermost | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 | 4 | 4 | 9 | 0 | 11.16 | 1 | 1 | 18.16 | 0 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
883 | picongpu - TaskKernelCpuOmp2Blocks.hpp:85-900 [...] | void alpaka::detail::ParallelForImpl<cupla::cupla_omp2_seq_sync::CuplaKernel<pmacc::KernelSetValue<256u> >, alpaka::trait::OmpSchedule<cupla::cupla_omp2_seq_sync::CuplaKernel<pmacc::KernelSetValue<256u> >, alpaka::AccCpuOmp2Bl... | Outermost | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 10 | 16 | 12 | 14 | 0 | 9.47 | 14.63 | 1 | 4.88 | 2 | 1 | 1 | 1 | 1 | 0 | 2 | 5 | 3 | 1 | 0 | 0.5 | 0 | 1 | 0 | 1 | 0 |
1231 | picongpu - ForEach.hpp:202-279 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_9particles9algorithm3acc6detail21KernelForEachParticleENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_7functor9InterfaceINSL_8FilteredINS0_6fi... | Outermost | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 | 15 | 11 | 10 | 4.17 | 7.36 | 1.92 | 1 | 24.86 | 1 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
3453 | picongpu - stl_tree.h:2018-2022 [...] | std::map<unsigned long, pmacc::ITask*, std::less<unsigned long>, std::allocator<std::pair<unsigned long const, pmacc::ITask*> > >::erase(unsigned long const&) | Single | 0 | 0 | 0 | 0 | 0.07 | 0.06 | 0.07 | 0.06 | 0.01 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
268 | picongpu - TaskKernelCpuOmp2Blocks.hpp:84-900 [...] | _ZN6alpaka6detail15ParallelForImplIN5pmacc8lockstep4exec6detail14LockStepKernelINS2_20KernelShiftParticlesENS3_9WorkerCfgILj256EEEEENS_5trait11OmpScheduleISA_NS_16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEEvE19TraitNotSpecializedELNS_3omp8Schedule4Kin... | Single | 0 | 0 | 0 | 0 | 0.03 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 13 | 24 | 22 | 38 | 24.14 | 17.89 | 6.67 | 1 | 4.44 | 3 | 2 | 2 | 2 | 1 | 0 | 1 | 0 | 3 | 1 | 0 | 0.5 | 0 | 0.5 | 0 | 0.25 | 0 |