* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 56981)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 56986)miniqmc not built from git repository
number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 1
Number of walkers per rank = 1
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.1216 0.1216 1 0.121563821
ParticleSet:::update 0.0000 0.0000 1 0.000003488
Total 41.0901 0.0002 1 41.090138003
Diffusion 21.3957 0.0310 5 4.279149120
Complete Updates 0.1687 0.0000 5 0.033747691
DeterminantRef::update 0.1687 0.1687 10 0.016872364
Current Gradient 1.1224 0.0300 30720 0.000036535
DeterminantRef::ratio 1.0766 1.0766 30720 0.000035046
OneBodyJastrowRef 0.0086 0.0086 30720 0.000000279
TwoBodyJastrowRef 0.0072 0.0072 30720 0.000000235
Kinetic Energy 0.2910 0.2908 5 0.058204901
OneBodyJastrowRef 0.0001 0.0001 5 0.000028718
TwoBodyJastrowRef 0.0001 0.0001 5 0.000017759
New Gradient 5.3153 0.0452 30720 0.000173024
DeterminantRef::ratio 0.1623 0.1623 30720 0.000005284
DeterminantRef::spovgl 4.4317 0.2651 30720 0.000144260
Single-Particle Orbitals 4.1666 4.1666 30720 0.000135631
OneBodyJastrowRef 0.1085 0.1085 30720 0.000003533
TwoBodyJastrowRef 0.5675 0.5675 30720 0.000018475
ParticleSet:::acceptMove 1.7243 0.0167 15371 0.000112178
DTAAOMPTarget::update_e_e 1.6892 1.6892 15371 0.000109897
DTABOMPTarget::update_ion_e 0.0184 0.0184 15371 0.000001197
ParticleSet:::computeNewPosDT 0.7248 0.0222 30720 0.000023595
DTAAOMPTarget::move_e_e 0.6224 0.6224 30720 0.000020259
DTABOMPTarget::move_ion_e 0.0803 0.0803 30720 0.000002614
ParticleSet:::donePbyP 0.0000 0.0000 5 0.000002534
Update 12.0182 0.0180 15371 0.000781876
DeterminantRef::update 11.4449 11.4449 15371 0.000744580
OneBodyJastrowRef 0.0035 0.0035 15371 0.000000229
TwoBodyJastrowRef 0.5518 0.5518 15371 0.000035896
Initialization 1.9651 0.1664 1 1.965094128
DeterminantRef::inverse 0.7916 0.7916 2 0.395802906
DeterminantRef::spovgl 0.8355 0.0602 2 0.417732546
Single-Particle Orbitals 0.7753 0.7753 6144 0.000126186
OneBodyJastrowRef 0.0084 0.0084 1 0.008350388
ParticleSet:::update 0.0619 0.0074 2 0.030927490
DTAAOMPTarget::evaluate_e_e 0.0389 0.0389 1 0.038885437
DTABOMPTarget::evaluate_ion_e 0.0155 0.0001 1 0.015531206
DTABOMPTarget::offload_ion_e 0.0155 0.0155 1 0.015480491
TwoBodyJastrowRef 0.1014 0.1014 1 0.101421842
Pseudopotential 17.7291 0.0447 5 3.545818692
DeterminantRef::spoval 13.1126 0.2675 10215 0.001283662
Single-Particle Orbitals 12.8452 12.8452 122580 0.000104790
OneBodyJastrowRef 0.0198 0.0198 10215 0.000001938
ParticleSet:::update 4.1142 0.0104 10215 0.000402764
DTABOMPTarget::evaluate_e_virtual 3.7733 0.0051 10215 0.000369391
DTABOMPTarget::offload_e_virtual 3.7682 3.7682 10215 0.000368889
DTABOMPTarget::evaluate_ion_virtual 0.3305 0.0051 10215 0.000032354
DTABOMPTarget::offload_ion_virtual 0.3254 0.3254 10215 0.000031859
TwoBodyJastrowRef 0.4377 0.4377 10215 0.000042850
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 1.12888e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 2.16798e+10
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 4.25839e+06
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 56981)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 56986)
Info: 1/2 lprof instances finished
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0
To display your profiling results:
###################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_0 #
###################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57031)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57036)miniqmc not built from git repository
number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 2
Number of walkers per rank = 2
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.0630 0.0630 1 0.062997826
ParticleSet:::update 0.0000 0.0000 1 0.000003593
Total 42.4384 0.0008 1 42.438362982
Diffusion 22.1619 0.0300 5 4.432383870
Complete Updates 0.1720 0.0000 5 0.034395042
DeterminantRef::update 0.1720 0.1720 10 0.017195653
Current Gradient 1.1230 0.0297 30720 0.000036555
DeterminantRef::ratio 1.0792 1.0792 30720 0.000035129
OneBodyJastrowRef 0.0079 0.0079 30720 0.000000259
TwoBodyJastrowRef 0.0061 0.0061 30720 0.000000200
Kinetic Energy 0.2915 0.2912 5 0.058295246
OneBodyJastrowRef 0.0002 0.0002 5 0.000031439
TwoBodyJastrowRef 0.0001 0.0001 5 0.000017312
New Gradient 6.2191 0.0418 30720 0.000202444
DeterminantRef::ratio 0.1608 0.1608 30720 0.000005235
DeterminantRef::spovgl 5.3515 0.2578 30720 0.000174202
Single-Particle Orbitals 5.0937 5.0937 30720 0.000165811
OneBodyJastrowRef 0.1031 0.1031 30720 0.000003355
TwoBodyJastrowRef 0.5620 0.5620 30720 0.000018293
ParticleSet:::acceptMove 1.7077 0.0170 15371 0.000111096
DTAAOMPTarget::update_e_e 1.6718 1.6718 15371 0.000108760
DTABOMPTarget::update_ion_e 0.0189 0.0189 15371 0.000001231
ParticleSet:::computeNewPosDT 0.6753 0.0200 30720 0.000021981
DTAAOMPTarget::move_e_e 0.5847 0.5847 30720 0.000019035
DTABOMPTarget::move_ion_e 0.0705 0.0705 30720 0.000002295
ParticleSet:::donePbyP 0.0000 0.0000 5 0.000001800
Update 11.9435 0.0168 15371 0.000777015
DeterminantRef::update 11.3758 11.3758 15371 0.000740085
OneBodyJastrowRef 0.0035 0.0035 15371 0.000000227
TwoBodyJastrowRef 0.5474 0.5474 15371 0.000035613
Initialization 2.2090 0.1656 1 2.208973781
DeterminantRef::inverse 0.7729 0.7729 2 0.386457514
DeterminantRef::spovgl 1.0873 0.0759 2 0.543631869
Single-Particle Orbitals 1.0113 1.0113 6144 0.000164607
OneBodyJastrowRef 0.0083 0.0083 1 0.008324765
ParticleSet:::update 0.0734 0.0074 2 0.036699550
DTAAOMPTarget::evaluate_e_e 0.0503 0.0503 1 0.050321382
DTABOMPTarget::evaluate_ion_e 0.0156 0.0000 1 0.015643469
DTABOMPTarget::offload_ion_e 0.0156 0.0156 1 0.015594035
TwoBodyJastrowRef 0.1015 0.1015 1 0.101457980
Pseudopotential 18.0667 0.0429 5 3.613339434
DeterminantRef::spoval 13.4404 0.2459 10215 0.001315755
Single-Particle Orbitals 13.1945 13.1945 122580 0.000107640
OneBodyJastrowRef 0.0183 0.0183 10215 0.000001787
ParticleSet:::update 4.1292 0.0094 10215 0.000404226
DTABOMPTarget::evaluate_e_virtual 3.7902 0.0049 10215 0.000371046
DTABOMPTarget::offload_e_virtual 3.7853 3.7853 10215 0.000370566
DTABOMPTarget::evaluate_ion_virtual 0.3295 0.0047 10215 0.000032257
DTABOMPTarget::offload_ion_virtual 0.3248 0.3248 10215 0.000031801
TwoBodyJastrowRef 0.4359 0.4359 10215 0.000042676
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 2.18602e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 4.18607e+10
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 8.35764e+06
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57036)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57031)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1
To display your profiling results:
###################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_1 #
###################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57108)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57113)miniqmc not built from git repository
number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 4
Number of walkers per rank = 4
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.0357 0.0357 1 0.035687058
ParticleSet:::update 0.0000 0.0000 1 0.000003257
Total 40.9043 0.0003 1 40.904336385
Diffusion 21.4597 0.0293 5 4.291946918
Complete Updates 0.1718 0.0000 5 0.034354547
DeterminantRef::update 0.1718 0.1718 10 0.017175555
Current Gradient 1.1166 0.0272 30720 0.000036347
DeterminantRef::ratio 1.0756 1.0756 30720 0.000035014
OneBodyJastrowRef 0.0076 0.0076 30720 0.000000247
TwoBodyJastrowRef 0.0062 0.0062 30720 0.000000201
Kinetic Energy 0.2917 0.2915 5 0.058346392
OneBodyJastrowRef 0.0002 0.0002 5 0.000030081
TwoBodyJastrowRef 0.0001 0.0001 5 0.000017607
New Gradient 5.4380 0.0412 30720 0.000177017
DeterminantRef::ratio 0.1561 0.1561 30720 0.000005082
DeterminantRef::spovgl 4.5895 0.2485 30720 0.000149399
Single-Particle Orbitals 4.3411 4.3411 30720 0.000141311
OneBodyJastrowRef 0.0979 0.0979 30720 0.000003188
TwoBodyJastrowRef 0.5532 0.5532 30720 0.000018007
ParticleSet:::acceptMove 1.7217 0.0155 15371 0.000112007
DTAAOMPTarget::update_e_e 1.6882 1.6882 15371 0.000109830
DTABOMPTarget::update_ion_e 0.0180 0.0180 15371 0.000001170
ParticleSet:::computeNewPosDT 0.6663 0.0192 30720 0.000021690
DTAAOMPTarget::move_e_e 0.5797 0.5797 30720 0.000018871
DTABOMPTarget::move_ion_e 0.0674 0.0674 30720 0.000002194
ParticleSet:::donePbyP 0.0000 0.0000 5 0.000001938
Update 12.0244 0.0171 15371 0.000782281
DeterminantRef::update 11.4501 11.4501 15371 0.000744915
OneBodyJastrowRef 0.0035 0.0035 15371 0.000000231
TwoBodyJastrowRef 0.5537 0.5537 15371 0.000036021
Initialization 2.1632 0.2804 1 2.163225937
DeterminantRef::inverse 0.7722 0.7722 2 0.386113349
DeterminantRef::spovgl 0.9353 0.0783 2 0.467653551
Single-Particle Orbitals 0.8570 0.8570 6144 0.000139483
OneBodyJastrowRef 0.0084 0.0084 1 0.008379650
ParticleSet:::update 0.0639 0.0078 2 0.031934930
DTAAOMPTarget::evaluate_e_e 0.0404 0.0404 1 0.040396612
DTABOMPTarget::evaluate_ion_e 0.0156 0.0001 1 0.015631537
DTABOMPTarget::offload_ion_e 0.0156 0.0156 1 0.015580766
TwoBodyJastrowRef 0.1030 0.1030 1 0.103048386
Pseudopotential 17.2811 0.0418 5 3.456217333
DeterminantRef::spoval 12.6512 0.2525 10215 0.001238497
Single-Particle Orbitals 12.3988 12.3988 122580 0.000101148
OneBodyJastrowRef 0.0180 0.0180 10215 0.000001759
ParticleSet:::update 4.1246 0.0093 10215 0.000403778
DTABOMPTarget::evaluate_e_virtual 3.7841 0.0048 10215 0.000370444
DTABOMPTarget::offload_e_virtual 3.7793 3.7793 10215 0.000369978
DTABOMPTarget::evaluate_ion_virtual 0.3312 0.0044 10215 0.000032422
DTABOMPTarget::offload_ion_virtual 0.3268 0.3268 10215 0.000031991
TwoBodyJastrowRef 0.4455 0.4455 10215 0.000043608
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 4.53601e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 8.64608e+10
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.74752e+07
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57113)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57108)
Info: 1/2 lprof instances finished
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2
To display your profiling results:
###################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_2 #
###################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57172)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57177)miniqmc not built from git repository
number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 8
Number of walkers per rank = 8
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.0350 0.0350 1 0.035020667
ParticleSet:::update 0.0000 0.0000 1 0.000003426
Total 41.4529 0.1836 1 41.452877377
Diffusion 21.6889 0.0270 5 4.337770257
Complete Updates 0.1748 0.0000 5 0.034966078
DeterminantRef::update 0.1748 0.1748 10 0.017481214
Current Gradient 1.1010 0.0259 30720 0.000035841
DeterminantRef::ratio 1.0632 1.0632 30720 0.000034609
OneBodyJastrowRef 0.0065 0.0065 30720 0.000000210
TwoBodyJastrowRef 0.0055 0.0055 30720 0.000000179
Kinetic Energy 0.2922 0.2920 5 0.058440011
OneBodyJastrowRef 0.0002 0.0002 5 0.000031158
TwoBodyJastrowRef 0.0001 0.0001 5 0.000018451
New Gradient 5.3688 0.0370 30720 0.000174766
DeterminantRef::ratio 0.1517 0.1517 30720 0.000004937
DeterminantRef::spovgl 4.5590 0.2541 30720 0.000148406
Single-Particle Orbitals 4.3049 4.3049 30720 0.000140134
OneBodyJastrowRef 0.0914 0.0914 30720 0.000002974
TwoBodyJastrowRef 0.5298 0.5298 30720 0.000017247
ParticleSet:::acceptMove 1.6896 0.0153 15371 0.000109922
DTAAOMPTarget::update_e_e 1.6565 1.6565 15371 0.000107766
DTABOMPTarget::update_ion_e 0.0178 0.0178 15371 0.000001160
ParticleSet:::computeNewPosDT 0.6637 0.0181 30720 0.000021606
DTAAOMPTarget::move_e_e 0.5777 0.5777 30720 0.000018805
DTABOMPTarget::move_ion_e 0.0679 0.0679 30720 0.000002210
ParticleSet:::donePbyP 0.0000 0.0000 5 0.000002050
Update 12.3716 0.0167 15371 0.000804868
DeterminantRef::update 11.8034 11.8034 15371 0.000767900
OneBodyJastrowRef 0.0027 0.0027 15371 0.000000177
TwoBodyJastrowRef 0.5488 0.5488 15371 0.000035703
Initialization 2.2138 0.3079 1 2.213822780
DeterminantRef::inverse 0.8015 0.8015 2 0.400739877
DeterminantRef::spovgl 0.9200 0.0731 2 0.459982100
Single-Particle Orbitals 0.8469 0.8469 6144 0.000137834
OneBodyJastrowRef 0.0084 0.0084 1 0.008358757
ParticleSet:::update 0.0735 0.0082 2 0.036747305
DTAAOMPTarget::evaluate_e_e 0.0496 0.0496 1 0.049635030
DTABOMPTarget::evaluate_ion_e 0.0157 0.0001 1 0.015690543
DTABOMPTarget::offload_ion_e 0.0156 0.0156 1 0.015573308
TwoBodyJastrowRef 0.1026 0.1026 1 0.102580777
Pseudopotential 17.3666 0.0428 5 3.473317265
DeterminantRef::spoval 12.6500 0.2384 10215 0.001238375
Single-Particle Orbitals 12.4116 12.4116 122580 0.000101253
OneBodyJastrowRef 0.0194 0.0194 10215 0.000001895
ParticleSet:::update 4.1242 0.0093 10215 0.000403739
DTABOMPTarget::evaluate_e_virtual 3.7846 0.0050 10215 0.000370493
DTABOMPTarget::offload_e_virtual 3.7795 3.7795 10215 0.000369999
DTABOMPTarget::evaluate_ion_virtual 0.3303 0.0042 10215 0.000032334
DTABOMPTarget::offload_ion_virtual 0.3261 0.3261 10215 0.000031924
TwoBodyJastrowRef 0.5302 0.5302 10215 0.000051905
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 8.95198e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 1.71095e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 3.47783e+07
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57177)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57172)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3
To display your profiling results:
###################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_3 #
###################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57269)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57274)miniqmc not built from git repository
number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 16
Number of walkers per rank = 16
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.0470 0.0470 1 0.046969219
ParticleSet:::update 0.0000 0.0000 1 0.000003158
Total 43.4415 0.2984 1 43.441528864
Diffusion 23.0528 0.0283 5 4.610556262
Complete Updates 0.1858 0.0000 5 0.037150277
DeterminantRef::update 0.1857 0.1857 10 0.018573472
Current Gradient 1.1579 0.0258 30720 0.000037692
DeterminantRef::ratio 1.1193 1.1193 30720 0.000036435
OneBodyJastrowRef 0.0069 0.0069 30720 0.000000225
TwoBodyJastrowRef 0.0059 0.0059 30720 0.000000191
Kinetic Energy 0.2988 0.2986 5 0.059765917
OneBodyJastrowRef 0.0002 0.0002 5 0.000032310
TwoBodyJastrowRef 0.0001 0.0001 5 0.000017885
New Gradient 5.7080 0.0394 30720 0.000185809
DeterminantRef::ratio 0.1576 0.1576 30720 0.000005129
DeterminantRef::spovgl 4.8661 0.2678 30720 0.000158402
Single-Particle Orbitals 4.5983 4.5983 30720 0.000149684
OneBodyJastrowRef 0.0898 0.0898 30720 0.000002925
TwoBodyJastrowRef 0.5551 0.5551 30720 0.000018070
ParticleSet:::acceptMove 1.8946 0.0165 15371 0.000123255
DTAAOMPTarget::update_e_e 1.8595 1.8595 15371 0.000120977
DTABOMPTarget::update_ion_e 0.0186 0.0186 15371 0.000001208
ParticleSet:::computeNewPosDT 0.6985 0.0189 30720 0.000022738
DTAAOMPTarget::move_e_e 0.6079 0.6079 30720 0.000019788
DTABOMPTarget::move_ion_e 0.0717 0.0717 30720 0.000002335
ParticleSet:::donePbyP 0.0000 0.0000 5 0.000001833
Update 13.0808 0.0171 15371 0.000851006
DeterminantRef::update 12.4740 12.4740 15371 0.000811530
OneBodyJastrowRef 0.0031 0.0031 15371 0.000000204
TwoBodyJastrowRef 0.5865 0.5865 15371 0.000038158
Initialization 2.3551 0.4078 1 2.355138083
DeterminantRef::inverse 0.8358 0.8358 2 0.417898280
DeterminantRef::spovgl 0.9266 0.0638 2 0.463305512
Single-Particle Orbitals 0.8629 0.8629 6144 0.000140439
OneBodyJastrowRef 0.0084 0.0084 1 0.008419974
ParticleSet:::update 0.0749 0.0091 2 0.037457610
DTAAOMPTarget::evaluate_e_e 0.0500 0.0500 1 0.049961866
DTABOMPTarget::evaluate_ion_e 0.0158 0.0002 1 0.015823971
DTABOMPTarget::offload_ion_e 0.0156 0.0156 1 0.015580366
TwoBodyJastrowRef 0.1016 0.1016 1 0.101569782
Pseudopotential 17.7352 0.0473 5 3.547047781
DeterminantRef::spoval 12.8147 0.2700 10215 0.001254501
Single-Particle Orbitals 12.5448 12.5448 122580 0.000102339
OneBodyJastrowRef 0.0240 0.0240 10215 0.000002350
ParticleSet:::update 4.2144 0.0114 10215 0.000412573
DTABOMPTarget::evaluate_e_virtual 3.8654 0.0056 10215 0.000378408
DTABOMPTarget::offload_e_virtual 3.8598 3.8598 10215 0.000377856
DTABOMPTarget::evaluate_ion_virtual 0.3376 0.0043 10215 0.000033052
DTABOMPTarget::offload_ion_virtual 0.3334 0.3334 10215 0.000032635
TwoBodyJastrowRef 0.6348 0.6348 10215 0.000062143
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 1.70844e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 3.21944e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 6.81107e+07
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57269)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57274)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4
To display your profiling results:
###################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_4 #
###################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57380)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57385)miniqmc not built from git repository
number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 32
Number of walkers per rank = 32
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.0495 0.0495 1 0.049491403
ParticleSet:::update 0.0000 0.0000 1 0.000003765
Total 53.7105 0.3550 1 53.710494028
Diffusion 28.8915 0.0348 5 5.778298715
Complete Updates 0.2211 0.0000 5 0.044217617
DeterminantRef::update 0.2211 0.2211 10 0.022106851
Current Gradient 1.4740 0.0302 30720 0.000047980
DeterminantRef::ratio 1.4292 1.4292 30720 0.000046523
OneBodyJastrowRef 0.0081 0.0081 30720 0.000000262
TwoBodyJastrowRef 0.0065 0.0065 30720 0.000000211
Kinetic Energy 0.3387 0.3385 5 0.067748930
OneBodyJastrowRef 0.0002 0.0002 5 0.000036828
TwoBodyJastrowRef 0.0001 0.0001 5 0.000020831
New Gradient 6.9904 0.0464 30720 0.000227552
DeterminantRef::ratio 0.2164 0.2164 30720 0.000007043
DeterminantRef::spovgl 5.8891 0.3616 30720 0.000191703
Single-Particle Orbitals 5.5275 5.5275 30720 0.000179932
OneBodyJastrowRef 0.1052 0.1052 30720 0.000003423
TwoBodyJastrowRef 0.7334 0.7334 30720 0.000023874
ParticleSet:::acceptMove 2.5594 0.0196 15371 0.000166511
DTAAOMPTarget::update_e_e 2.5155 2.5155 15371 0.000163654
DTABOMPTarget::update_ion_e 0.0244 0.0244 15371 0.000001586
ParticleSet:::computeNewPosDT 0.9132 0.0220 30720 0.000029726
DTAAOMPTarget::move_e_e 0.8043 0.8043 30720 0.000026182
DTABOMPTarget::move_ion_e 0.0868 0.0868 30720 0.000002827
ParticleSet:::donePbyP 0.0000 0.0000 5 0.000002040
Update 16.3599 0.0200 15371 0.001064333
DeterminantRef::update 15.5962 15.5962 15371 0.001014650
OneBodyJastrowRef 0.0032 0.0032 15371 0.000000206
TwoBodyJastrowRef 0.7405 0.7405 15371 0.000048174
Initialization 2.8612 0.4130 1 2.861200708
DeterminantRef::inverse 1.0264 1.0264 2 0.513193276
DeterminantRef::spovgl 1.2158 0.1264 2 0.607883092
Single-Particle Orbitals 1.0894 1.0894 6144 0.000177310
OneBodyJastrowRef 0.0094 0.0094 1 0.009396581
ParticleSet:::update 0.0819 0.0134 2 0.040939113
DTAAOMPTarget::evaluate_e_e 0.0498 0.0498 1 0.049842724
DTABOMPTarget::evaluate_ion_e 0.0187 0.0002 1 0.018675217
DTABOMPTarget::offload_ion_e 0.0185 0.0185 1 0.018519542
TwoBodyJastrowRef 0.1148 0.1148 1 0.114750116
Pseudopotential 21.6028 0.0634 5 4.320554829
DeterminantRef::spoval 15.3045 0.3520 10215 0.001498239
Single-Particle Orbitals 14.9525 14.9525 122580 0.000121982
OneBodyJastrowRef 0.0361 0.0361 10215 0.000003533
ParticleSet:::update 5.3399 0.0145 10215 0.000522750
DTABOMPTarget::evaluate_e_virtual 4.9022 0.0071 10215 0.000479900
DTABOMPTarget::offload_e_virtual 4.8951 4.8951 10215 0.000479207
DTABOMPTarget::evaluate_ion_virtual 0.4233 0.0053 10215 0.000041436
DTABOMPTarget::offload_ion_virtual 0.4180 0.4180 10215 0.000040921
TwoBodyJastrowRef 0.8589 0.8589 10215 0.000084082
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 2.7636e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 5.13764e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.11834e+08
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57380)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57385)
Info: 1/2 lprof instances finished
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5
To display your profiling results:
###################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_5 #
###################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57584)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 57589)miniqmc not built from git repository
number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 48
Number of walkers per rank = 48
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.0426 0.0426 1 0.042590638
ParticleSet:::update 0.0000 0.0000 1 0.000003432
Total 72.2472 0.0093 1 72.247245661
Diffusion 39.8850 0.0473 5 7.977004592
Complete Updates 0.2947 0.0000 5 0.058943054
DeterminantRef::update 0.2947 0.2947 10 0.029469036
Current Gradient 2.1686 0.0474 30720 0.000070594
DeterminantRef::ratio 2.1036 2.1036 30720 0.000068478
OneBodyJastrowRef 0.0108 0.0108 30720 0.000000351
TwoBodyJastrowRef 0.0068 0.0068 30720 0.000000221
Kinetic Energy 0.4264 0.4259 5 0.085279103
OneBodyJastrowRef 0.0003 0.0003 5 0.000055613
TwoBodyJastrowRef 0.0002 0.0002 5 0.000037898
New Gradient 9.5807 0.0580 30720 0.000311873
DeterminantRef::ratio 0.3102 0.3102 30720 0.000010097
DeterminantRef::spovgl 8.0289 0.5336 30720 0.000261356
Single-Particle Orbitals 7.4953 7.4953 30720 0.000243988
OneBodyJastrowRef 0.1403 0.1403 30720 0.000004566
TwoBodyJastrowRef 1.0435 1.0435 30720 0.000033968
ParticleSet:::acceptMove 3.5986 0.0286 15371 0.000234115
DTAAOMPTarget::update_e_e 3.5130 3.5130 15371 0.000228546
DTABOMPTarget::update_ion_e 0.0570 0.0570 15371 0.000003711
ParticleSet:::computeNewPosDT 1.3616 0.0286 30720 0.000044324
DTAAOMPTarget::move_e_e 1.2082 1.2082 30720 0.000039328
DTABOMPTarget::move_ion_e 0.1249 0.1249 30720 0.000004066
ParticleSet:::donePbyP 0.0000 0.0000 5 0.000002356
Update 22.4070 0.0280 15371 0.001457742
DeterminantRef::update 21.3719 21.3719 15371 0.001390404
OneBodyJastrowRef 0.0044 0.0044 15371 0.000000283
TwoBodyJastrowRef 1.0027 1.0027 15371 0.000065233
Initialization 4.0700 0.7762 1 4.070008238
DeterminantRef::inverse 1.3612 1.3612 2 0.680621773
DeterminantRef::spovgl 1.6645 0.1316 2 0.832269128
Single-Particle Orbitals 1.5329 1.5329 6144 0.000249499
OneBodyJastrowRef 0.0100 0.0100 1 0.009994917
ParticleSet:::update 0.1251 0.0486 2 0.062535036
DTAAOMPTarget::evaluate_e_e 0.0477 0.0477 1 0.047743709
DTABOMPTarget::evaluate_ion_e 0.0287 0.0006 1 0.028741747
DTABOMPTarget::offload_ion_e 0.0281 0.0281 1 0.028128506
TwoBodyJastrowRef 0.1330 0.1330 1 0.132993742
Pseudopotential 28.2829 0.0820 5 5.656585443
DeterminantRef::spoval 19.7880 0.5058 10215 0.001937152
Single-Particle Orbitals 19.2822 19.2822 122580 0.000157303
OneBodyJastrowRef 0.0459 0.0459 10215 0.000004489
ParticleSet:::update 7.3149 0.0217 10215 0.000716099
DTABOMPTarget::evaluate_e_virtual 6.6902 0.0090 10215 0.000654938
DTABOMPTarget::offload_e_virtual 6.6812 6.6812 10215 0.000654054
DTABOMPTarget::evaluate_ion_virtual 0.6031 0.0076 10215 0.000059039
DTABOMPTarget::offload_ion_virtual 0.5955 0.5955 10215 0.000058293
TwoBodyJastrowRef 1.0521 1.0521 10215 0.000102998
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 3.08179e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 5.58232e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.2813e+08
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57589)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 57584)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6
To display your profiling results:
###################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_HBM/intel/miniqmc/run/oneview_runs/compilers/icx_1/oneview_results_scal/tools/lprof_npsu_run_6 #
###################################################################################################################################################################################################