Loop id | Source Location | Source Function | Level | Coverage run_0 (%) | Max Time Over Threads run_0 (s) | Time w.r.t. Wall Time run_0 (s) | Nb Threads run_0 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing run_0 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | Speedup If Data in L1 run_0 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
745 | miniqmc - einspline_spo_omp.cpp:259-259 [...] | qmcplusplus::einspline_spo_omp<double>::evaluateDetRatios(qmcplusplus::VirtualParticleSet const&, qmcplusplus::Vector<double, std::allocator<double> >&, qmcplusplus::Vector<double, std::allocator<double> > const&, ... | Innermost | 47.89 | 17.31 | 16.72 | 16 | 90.53 | 39.08 | 1.12 | 1.23 | 2.45 | 1.04 | 1 | 1 | 0 | 0 | 3 | 12.76 |
805 | miniqmc - MultiBsplineVGLH_OMPoffload.hpp:321-351 | void spline2offload::evaluate_vgh_impl_v2<double>(qmcplusplus::bspline_traits<double, 3u>::SplineType const*, int, int, int, int, double const*, double const*, double const*, double const*, double const*, double const*, double const*, double co... | Single | 28.15 | 10 | 9.83 | 16 | 74.47 | 31.38 | 1.06 | 1.32 | 3.07 | 1.02 | 1 | 3 | 0 | 0 | 2 | 1.41 |
1466 | miniqmc - SoaDistanceTableAAOMPTarget.h:440-442 [...] | qmcplusplus::SoaDistanceTableAAOMPTarget<double, 3u, 40>::update(int) | Single | 4.2 | 1.63 | 1.47 | 16 | 54.55 | 15.91 | 1.19 | 1 | 8 | 1.12 | 0 | 4 | 2 | 1 | 2 | 55.63 |
1683 | miniqmc - SoaDistanceTableABOMPTarget.h:228-228 [...] | qmcplusplus::SoaDistanceTableABOMPTarget<double, 3u, 40>::evaluate(qmcplusplus::ParticleSet&) | Innermost | 2.04 | 0.86 | 0.71 | 16 | 100 | 100 | 1 | 1 | 1 | 1.21 | 0 | 7 | 0 | 0 | 0 | 1.37 |
1048 | miniqmc - ParticleBConds3DSoa.h:234-255 | void qmcplusplus::DTD_BConds<double, 3u, 40>::computeDistances<qmcplusplus::TinyVector<double, 3u>, qmcplusplus::VectorSoAContainer<double, 3u, qmcplusplus::Mallocator<double, 64ul> >, qmcplusplus::VectorSoAContainer<double, 3... | Single | 0.76 | 0.33 | 0.26 | 16 | 94 | 94.75 | 1 | 1 | 1 | 1.27 | 1 | 7 | 0 | 0 | 0 | 0.04 |
218 | miniqmc - BsplineFunctor.h:236-241 | qmcplusplus::BsplineFunctor<double>::evaluateV(int, int, int, double const*, double*) const | Single | 0.67 | 0.3 | 0.23 | 16 | 87.5 | 63.93 | 1.07 | 1 | 1.29 | 1.3 | 0 | 1 | 2 | 0 | 0 | 0.03 |
807 | miniqmc - einspline_spo_omp.cpp:353-358 [...] | qmcplusplus::einspline_spo_omp<double>::evaluate_build_vgl(qmcplusplus::Vector<double, std::allocator<double> >&, qmcplusplus::Vector<qmcplusplus::TinyVector<double, 3u>, std::allocator<qmcplusplus::TinyVector<double, 3... | Innermost | 0.5 | 0.22 | 0.17 | 16 | 0 | 11.16 | 1.55 | 1 | 12.17 | 1.29 | 4 | 0 | 0 | 8 | 0 | 1.26 |
804 | miniqmc - einspline_spo_omp.cpp:323-324 | qmcplusplus::einspline_spo_omp<double>::evaluate_vgh(qmcplusplus::ParticleSet const&, int, bool) | Innermost | 0.38 | 0.17 | 0.13 | 16 | 0 | 11.16 | 1 | 1 | 16 | 1.31 | 1 | 0 | 0 | 0 | 0 | NA |
1918 | miniqmc - | __intel_avx_rep_memset | Single | 0.35 | 0.19 | 0.12 | 16 | 100 | 50 | 1 | 1 | 2 | 1.58 | 0 | 1 | 0 | 0 | 0 | NA |
512 | miniqmc - TwoBodyJastrow.h:343-348 | qmcplusplus::TwoBodyJastrow<qmcplusplus::BsplineFunctor<double> >::acceptMove(qmcplusplus::ParticleSet&, int) | Innermost | 0.31 | 0.15 | 0.11 | 16 | 100 | 100 | 1 | 1 | 1 | 1.36 | 0 | 5 | 0 | 0 | 0 | 4.46 |
688 | miniqmc - inner_product.hpp:82-83 [...] | qmcplusplus::DiracDeterminant<qmcplusplus::DelayedUpdate<double, double> >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib<qmcplusplus::TinyVector<double, 3u>, std::allocator<qmcplusplus::TinyVector<double,... | Innermost | 0.28 | 0.1 | 0.1 | 16 | 25 | 15.63 | 1 | 2 | 6.86 | 1 | 0 | 2 | 1 | 0 | 0 | 2.09 |
683 | miniqmc - inner_product.hpp:155-155 [...] | qmcplusplus::DiracDeterminant<qmcplusplus::DelayedUpdate<double, double> >::evalGrad(qmcplusplus::ParticleSet&, int) | Single | 0.23 | 0.12 | 0.08 | 16 | 33.33 | 16.67 | 1 | 1.73 | 6.4 | 1.5 | 0 | 1 | 1 | 0 | 0 | NA |
215 | miniqmc - BsplineFunctor.h:291-297 | qmcplusplus::BsplineFunctor<double>::evaluateVGL(int, int, int, double const*, double*, double*, double*, double*, int*) const | Single | 0.22 | 0.11 | 0.08 | 16 | 83.33 | 60.1 | 1.13 | 1 | 1.33 | 1.38 | 0 | 1 | 2 | 0 | 0 | NA |
508 | miniqmc - TwoBodyJastrow.h:155-156 | qmcplusplus::TwoBodyJastrow<qmcplusplus::BsplineFunctor<double> >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector<double, 3u>&) | Innermost | 0.17 | 0.08 | 0.06 | 16 | 100 | 100 | 1 | 1 | 1 | 1.33 | 0 | 2 | 0 | 0 | 0 | NA |
744 | miniqmc - einspline_spo_omp.cpp:264-265 | qmcplusplus::einspline_spo_omp<double>::evaluateDetRatios(qmcplusplus::VirtualParticleSet const&, qmcplusplus::Vector<double, std::allocator<double> >&, qmcplusplus::Vector<double, std::allocator<double> > const&, ... | Innermost | 0.14 | 0.07 | 0.05 | 16 | 100 | 100 | 1 | 1 | 1 | 1.4 | 0 | 2 | 0 | 0 | 0 | NA |
707 | miniqmc - inner_product.hpp:155-155 [...] | qmcplusplus::DiracDeterminant<qmcplusplus::DelayedUpdate<double, double> >::ratioGrad_compute(int, qmcplusplus::TinyVector<double, 3u>&) | Single | 0.14 | 0.07 | 0.05 | 16 | 33.33 | 16.67 | 1 | 1.73 | 6.4 | 1.4 | 0 | 1 | 1 | 0 | 0 | NA |
514 | miniqmc - TwoBodyJastrow.h:325-332 | qmcplusplus::TwoBodyJastrow<qmcplusplus::BsplineFunctor<double> >::acceptMove(qmcplusplus::ParticleSet&, int) | Single | 0.11 | 0.06 | 0.04 | 16 | 100 | 100 | 1 | 1 | 1 | 1.5 | 0 | 8 | 0 | 0 | 0 | NA |
706 | miniqmc - inner_product.hpp:211-212 | qmcplusplus::DiracMatrix<double, double>::invert_transpose(qmcplusplus::Matrix<double, std::allocator<double> > const&, qmcplusplus::Matrix<double, std::allocator<double> >&, double&, double&) | Innermost | 0.1 | 0.04 | 0.03 | 16 | 85.71 | 76.79 | 1.13 | 1 | 1.09 | 1.33 | 0 | 1 | 0 | 0 | 1 | NA |
213 | miniqmc - BsplineFunctor.h:302-335 | qmcplusplus::BsplineFunctor<double>::evaluateVGL(int, int, int, double const*, double*, double*, double*, double*, int*) const | Single | 0.05 | 0.03 | 0.02 | 15 | 100 | 96.07 | 1.04 | 1 | 1.01 | 1.5 | 1 | 2 | 0 | 0 | 4 | NA |
681 | miniqmc - inner_product.hpp:83-155 [...] | qmcplusplus::DiracDeterminant<qmcplusplus::DelayedUpdate<double, double> >::evaluateLog(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib<qmcplusplus::TinyVector<double, 3u>, std::allocator<qmcplusplus::TinyVector<double... | Innermost | 0.05 | 0.02 | 0.02 | 16 | 25 | 15.63 | 1 | 2 | 6.86 | 1 | 0 | 2 | 1 | 0 | 0 | NA |
1917 | miniqmc - | __intel_avx_rep_memcpy | Single | 0.05 | 0.04 | 0.02 | 16 | 100 | 50 | 1 | 1 | 2 | 2 | 0 | 2 | 0 | 0 | 0 | NA |
216 | miniqmc - BsplineFunctor.h:246-260 | qmcplusplus::BsplineFunctor<double>::evaluateV(int, int, int, double const*, double*) const | Single | 0.05 | 0.02 | 0.02 | 16 | 100 | 89.39 | 1.03 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | NA |
200 | miniqmc - stl_algobase.h:921-922 | qmcplusplus::Vector<double, std::allocator<double> >::resize(unsigned long, double) | Single | 0.04 | 0.02 | 0.01 | 16 | 100 | 100 | 1 | 1 | 1 | 2 | 0 | 1 | 0 | 0 | 0 | NA |
743 | miniqmc - einspline_spo_omp.cpp:207-266 [...] | qmcplusplus::einspline_spo_omp<double>::evaluateDetRatios(qmcplusplus::VirtualParticleSet const&, qmcplusplus::Vector<double, std::allocator<double> >&, qmcplusplus::Vector<double, std::allocator<double> > const&, ... | InBetween | 0.03 | 0.02 | 0.01 | 15 | 54.02 | 37.07 | 1.74 | 1.02 | 1.4 | 2 | NA | NA | NA | NA | NA | NA |
52 | miniqmc - NonLocalPP.hpp:126-135 [...] | qmcplusplus::NonLocalPP<double>::evaluate(qmcplusplus::ParticleSet const&, qmcplusplus::WaveFunction&) | InBetween | 0.03 | 0.02 | 0.01 | 13 | 0 | 12.17 | 3.15 | 1 | 12.41 | 2 | 2.33 | 0.33 | 3 | 2 | 1 | NA |
700 | miniqmc - stl_algobase.h:909-910 | qmcplusplus::DiracDeterminant<qmcplusplus::DelayedUpdate<double, double> >::resize(int, int) | Single | 0.03 | 0.01 | 0.01 | 15 | 100 | 100 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | NA |
529 | miniqmc - TwoBodyJastrow.h:382-383 | qmcplusplus::TwoBodyJastrow<qmcplusplus::BsplineFunctor<double> >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.03 | 0.01 | 0.01 | 16 | 100 | 100 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 0 | NA |
709 | miniqmc - inner_product.hpp:82-83 | qmcplusplus::DiracDeterminant<qmcplusplus::DelayedUpdate<double, double> >::ratioGrad_compute(int, qmcplusplus::TinyVector<double, 3u>&) | Single | 0.03 | 0.02 | 0.01 | 15 | 100 | 100 | 1 | 1 | 1 | 2 | 0 | 2 | 0 | 0 | 0 | NA |
519 | miniqmc - TwoBodyJastrow.h:107-132 [...] | qmcplusplus::TwoBodyJastrow<qmcplusplus::BsplineFunctor<double> >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector<double, std::allocator<double> >&) | Outermost | 0.02 | 0.01 | 0.01 | 9 | 8.01 | 12.12 | 2.21 | 2.21 | 17.65 | 1 | 2 | 0 | 0 | 5.5 | 0 | NA |
1682 | miniqmc - SoaDistanceTableABOMPTarget.h:61-228 [...] | qmcplusplus::SoaDistanceTableABOMPTarget<double, 3u, 40>::evaluate(qmcplusplus::ParticleSet&) | InBetween | 0.02 | 0.03 | 0.01 | 12 | 37.97 | 44.27 | 1.46 | 1 | 1.44 | 3 | 1.25 | 0 | 2 | 5.75 | 0 | NA |
525 | miniqmc - TwoBodyJastrow.h:398-399 | qmcplusplus::TwoBodyJastrow<qmcplusplus::BsplineFunctor<double> >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.01 | 0 | 0 | 6 | 100 | 100 | 1 | 1 | 1 | 0 | 0 | 3 | 0 | 0 | 0 | NA |
83 | miniqmc - WaveFunction.cpp:263-274 [...] | qmcplusplus::WaveFunction::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector<double, std::allocator<double> >&) | Outermost | 0.01 | 0.01 | 0 | 7 | 60.87 | 58.97 | 2.09 | 1 | 1.15 | 1 | NA | NA | NA | NA | NA | NA |
509 | miniqmc - algorithm.hpp:26-28 | qmcplusplus::TwoBodyJastrow<qmcplusplus::BsplineFunctor<double> >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector<double, 3u>&) | Single | 0.01 | 0.01 | 0 | 12 | 100 | 100 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | NA |
742 | miniqmc - einspline_spo_omp.cpp:207-266 [...] | qmcplusplus::einspline_spo_omp<double>::evaluateDetRatios(qmcplusplus::VirtualParticleSet const&, qmcplusplus::Vector<double, std::allocator<double> >&, qmcplusplus::Vector<double, std::allocator<double> > const&, ... | InBetween | 0.01 | 0.01 | 0 | 6 | 30.3 | 17.3 | 1.21 | 1.77 | 6.52 | 1 | NA | NA | NA | NA | NA | NA |
530 | miniqmc - TwoBodyJastrow.h:376-377 | qmcplusplus::TwoBodyJastrow<qmcplusplus::BsplineFunctor<double> >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.01 | 0.01 | 0 | 6 | 100 | 100 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 0 | NA |
527 | miniqmc - TwoBodyJastrow.h:389-392 | qmcplusplus::TwoBodyJastrow<qmcplusplus::BsplineFunctor<double> >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.01 | 0.01 | 0 | 11 | 100 | 100 | 1 | 1 | 1 | 1 | 0 | 5 | 0 | 0 | 0 | NA |
523 | miniqmc - TwoBodyJastrow.h:398-399 | qmcplusplus::TwoBodyJastrow<qmcplusplus::BsplineFunctor<double> >::recompute(qmcplusplus::ParticleSet&) | Innermost | 0.01 | 0 | 0 | 6 | 100 | 100 | 1 | 1 | 1 | 0 | 0 | 3 | 0 | 0 | 0 | NA |
1681 | miniqmc - SoaDistanceTableABOMPTarget.h:61-228 [...] | qmcplusplus::SoaDistanceTableABOMPTarget<double, 3u, 40>::evaluate(qmcplusplus::ParticleSet&) | Outermost | 0.01 | 0.01 | 0 | 9 | 0 | 12.3 | 1.5 | 1 | 8.31 | 1 | 1 | 0 | 0 | 1 | 0 | NA |
77 | miniqmc - WaveFunction.cpp:198-201 [...] | qmcplusplus::WaveFunction::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector<double, 3u>&) | Single | 0.01 | 0.01 | 0 | 4 | 0 | 11.72 | 1.86 | 2.67 | 12.69 | 1 | 2 | 0 | 1 | 0 | 2 | NA |
781 | miniqmc - stl_algobase.h:921-922 | qmcplusplus::Vector<double, qmcplusplus::OMPallocator<double, qmcplusplus::Mallocator<double, 64ul> > >::resize(unsigned long, double) | Single | 0.01 | 0 | 0 | 6 | 100 | 100 | 1 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | NA |
505 | miniqmc - OneBodyJastrow.h:214-219 [...] | qmcplusplus::OneBodyJastrow<qmcplusplus::BsplineFunctor<double> >::computeU3(qmcplusplus::ParticleSet&, int, double const*) | Single | 0.01 | 0 | 0 | 7 | 0 | 11.16 | 1 | 1 | 12.13 | 0 | 1 | 1 | 0.5 | 0 | 0.5 | NA |
36 | miniqmc - miniqmc.cpp:429-458 [...] | main.extracted.106 | Innermost | 0.01 | 0.01 | 0 | 8 | 7.61 | 11.79 | 7.12 | 1 | 12.88 | 1 | 2 | 0 | 2 | 0 | 0 | NA |
927 | miniqmc - ParticleSet.cpp:242-243 [...] | qmcplusplus::ParticleSet::update(bool) | Single | 0.01 | 0 | 0 | 6 | 0 | 12.5 | 1 | 1 | 8 | 0 | 1 | 1 | 0 | 1 | 1 | NA |
501 | miniqmc - OneBodyJastrow.h:134-155 [...] | qmcplusplus::OneBodyJastrow<qmcplusplus::BsplineFunctor<double> >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector<double, std::allocator<double> >&) | Outermost | 0.01 | 0.02 | 0 | 6 | 15.38 | 12.98 | 2.92 | 2.91 | 13.09 | 2 | NA | NA | NA | NA | NA | NA |