options

Executable Output


   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.5-dev

LLNL-CODE-775068

Copyright (c) 2014-23, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
  Compiler Flags:         "-O3 -O2 -march=znver5 -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -cxx=clang++    "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 1 threads on rank 0
    0->  0

Input Parameters
================

  Problem Size:
    Zones:                 24 x 16 x 16  (6144 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       8
    Spatial decomp:        2 x 2 x 2 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             8           1 / 8
  (Rx,Ry,Rz) R in XYZ:   2x2x2       1x1x1 / 2x2x2
  (PQR) TOTAL:           8           16 / 128

  Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  24        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       37748736      288.000
  k_plane                       37748736      288.000
  mixelem_to_fraction               6320        0.048
  phi                          157286400     1200.000
  phi_out                      157286400     1200.000
  psi                          603979776     4608.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          603979776     4608.000
  sigt_zonal                     6291456       48.000
  volume                            6144        0.047
  --------                  ------------    ---------
  TOTAL                       1645233448    12552.135

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.289596e+09, change=1.000000e+00
  iter 1: particle count=1.938605e+09, change=3.347815e-01
  iter 2: particle count=2.261901e+09, change=1.429312e-01
  iter 3: particle count=2.422372e+09, change=6.624562e-02
  iter 4: particle count=2.501793e+09, change=3.174531e-02
  iter 5: particle count=2.540981e+09, change=1.542247e-02
  iter 6: particle count=2.560258e+09, change=7.529487e-03
  iter 7: particle count=2.569713e+09, change=3.679282e-03
  iter 8: particle count=2.574337e+09, change=1.796232e-03
  iter 9: particle count=2.576593e+09, change=8.754707e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.01939
  LPlusTimes                  10       7.89233
  LTimes                      10       8.04822
  Population                  10       1.78329
  Scattering                  10     245.76057
  Solve                        1     271.21845
  Source                      10       0.01413
  SweepSolver                 10       7.33971
  SweepSubdomain             160       4.57099

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.019387,7.892332,8.048221,1.783291,245.760570,271.218454,0.014128,7.339706,4.570985

Figures of Merit
================

  Throughput:         2.226913e+07 [unknowns/(second/iteration)]
  Grind time :        4.490522e-08 [(seconds/iteration)/unknowns]
  Sweep efficiency :  62.27749 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 603979776

END


Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_0

To display your profiling results:
#########################################################################################################################################
#    LEVEL    |     REPORT     |                                                COMMAND                                                 #
#########################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_0  #
#########################################################################################################################################


   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.5-dev

LLNL-CODE-775068

Copyright (c) 2014-23, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
  Compiler Flags:         "-O3 -O2 -march=znver5 -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -cxx=clang++    "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 2 threads on rank 0
    0->  0    1->  6

Input Parameters
================

  Problem Size:
    Zones:                 24 x 16 x 16  (6144 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       8
    Spatial decomp:        2 x 2 x 2 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             8           1 / 8
  (Rx,Ry,Rz) R in XYZ:   2x2x2       1x1x1 / 2x2x2
  (PQR) TOTAL:           8           16 / 128

  Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  24        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       37748736      288.000
  k_plane                       37748736      288.000
  mixelem_to_fraction               6320        0.048
  phi                          157286400     1200.000
  phi_out                      157286400     1200.000
  psi                          603979776     4608.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          603979776     4608.000
  sigt_zonal                     6291456       48.000
  volume                            6144        0.047
  --------                  ------------    ---------
  TOTAL                       1645233448    12552.135

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.289596e+09, change=1.000000e+00
  iter 1: particle count=1.938605e+09, change=3.347815e-01
  iter 2: particle count=2.261901e+09, change=1.429312e-01
  iter 3: particle count=2.422372e+09, change=6.624562e-02
  iter 4: particle count=2.501793e+09, change=3.174531e-02
  iter 5: particle count=2.540981e+09, change=1.542247e-02
  iter 6: particle count=2.560258e+09, change=7.529487e-03
  iter 7: particle count=2.569713e+09, change=3.679282e-03
  iter 8: particle count=2.574337e+09, change=1.796232e-03
  iter 9: particle count=2.576593e+09, change=8.754707e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.01743
  LPlusTimes                  10       4.02130
  LTimes                      10       4.12212
  Population                  10       0.87509
  Scattering                  10     123.70074
  Solve                        1     136.28755
  Source                      10       0.00760
  SweepSolver                 10       3.15774
  SweepSubdomain             160       2.33640

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.017433,4.021303,4.122120,0.875086,123.700736,136.287546,0.007601,3.157736,2.336401

Figures of Merit
================

  Throughput:         4.431658e+07 [unknowns/(second/iteration)]
  Grind time :        2.256492e-08 [(seconds/iteration)/unknowns]
  Sweep efficiency :  73.98976 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 603979776

END


Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_1

To display your profiling results:
#########################################################################################################################################
#    LEVEL    |     REPORT     |                                                COMMAND                                                 #
#########################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_1  #
#########################################################################################################################################


   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.5-dev

LLNL-CODE-775068

Copyright (c) 2014-23, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
  Compiler Flags:         "-O3 -O2 -march=znver5 -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -cxx=clang++    "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 4 threads on rank 0
    0->  0    1->  3    2->  6    3->  9

Input Parameters
================

  Problem Size:
    Zones:                 24 x 16 x 16  (6144 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       8
    Spatial decomp:        2 x 2 x 2 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             8           1 / 8
  (Rx,Ry,Rz) R in XYZ:   2x2x2       1x1x1 / 2x2x2
  (PQR) TOTAL:           8           16 / 128

  Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  24        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       37748736      288.000
  k_plane                       37748736      288.000
  mixelem_to_fraction               6320        0.048
  phi                          157286400     1200.000
  phi_out                      157286400     1200.000
  psi                          603979776     4608.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          603979776     4608.000
  sigt_zonal                     6291456       48.000
  volume                            6144        0.047
  --------                  ------------    ---------
  TOTAL                       1645233448    12552.135

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.289596e+09, change=1.000000e+00
  iter 1: particle count=1.938605e+09, change=3.347815e-01
  iter 2: particle count=2.261901e+09, change=1.429312e-01
  iter 3: particle count=2.422372e+09, change=6.624562e-02
  iter 4: particle count=2.501793e+09, change=3.174531e-02
  iter 5: particle count=2.540981e+09, change=1.542247e-02
  iter 6: particle count=2.560258e+09, change=7.529487e-03
  iter 7: particle count=2.569713e+09, change=3.679282e-03
  iter 8: particle count=2.574337e+09, change=1.796232e-03
  iter 9: particle count=2.576593e+09, change=8.754707e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.01425
  LPlusTimes                  10       2.12008
  LTimes                      10       2.28414
  Population                  10       0.46682
  Scattering                  10      62.02724
  Solve                        1      69.56996
  Source                      10       0.00377
  SweepSolver                 10       2.26306
  SweepSubdomain             160       1.20196

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.014253,2.120077,2.284141,0.466824,62.027242,69.569961,0.003773,2.263065,1.201957

Figures of Merit
================

  Throughput:         8.681617e+07 [unknowns/(second/iteration)]
  Grind time :        1.151859e-08 [(seconds/iteration)/unknowns]
  Sweep efficiency :  53.11191 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 603979776

END


Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_2

To display your profiling results:
#########################################################################################################################################
#    LEVEL    |     REPORT     |                                                COMMAND                                                 #
#########################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_2  #
#########################################################################################################################################


   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.5-dev

LLNL-CODE-775068

Copyright (c) 2014-23, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
  Compiler Flags:         "-O3 -O2 -march=znver5 -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -cxx=clang++    "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 8 threads on rank 0
    0->  0    1->193    2->  3    3->196    4->  6    5->199    6->  9    7->202

Input Parameters
================

  Problem Size:
    Zones:                 24 x 16 x 16  (6144 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       8
    Spatial decomp:        2 x 2 x 2 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             8           1 / 8
  (Rx,Ry,Rz) R in XYZ:   2x2x2       1x1x1 / 2x2x2
  (PQR) TOTAL:           8           16 / 128

  Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  24        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       37748736      288.000
  k_plane                       37748736      288.000
  mixelem_to_fraction               6320        0.048
  phi                          157286400     1200.000
  phi_out                      157286400     1200.000
  psi                          603979776     4608.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          603979776     4608.000
  sigt_zonal                     6291456       48.000
  volume                            6144        0.047
  --------                  ------------    ---------
  TOTAL                       1645233448    12552.135

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.289596e+09, change=1.000000e+00
  iter 1: particle count=1.938605e+09, change=3.347815e-01
  iter 2: particle count=2.261901e+09, change=1.429312e-01
  iter 3: particle count=2.422372e+09, change=6.624562e-02
  iter 4: particle count=2.501793e+09, change=3.174531e-02
  iter 5: particle count=2.540981e+09, change=1.542247e-02
  iter 6: particle count=2.560258e+09, change=7.529487e-03
  iter 7: particle count=2.569713e+09, change=3.679282e-03
  iter 8: particle count=2.574337e+09, change=1.796232e-03
  iter 9: particle count=2.576593e+09, change=8.754707e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.03540
  LPlusTimes                  10       1.18838
  LTimes                      10       1.23712
  Population                  10       0.39700
  Scattering                  10      32.83986
  Solve                        1      37.05075
  Source                      10       0.00234
  SweepSolver                 10       0.98217
  SweepSubdomain             160       0.61833

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.035398,1.188376,1.237118,0.396999,32.839862,37.050750,0.002339,0.982167,0.618334

Figures of Merit
================

  Throughput:         1.630142e+08 [unknowns/(second/iteration)]
  Grind time :        6.134435e-09 [(seconds/iteration)/unknowns]
  Sweep efficiency :  62.95615 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 603979776

END


Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_3

To display your profiling results:
#########################################################################################################################################
#    LEVEL    |     REPORT     |                                                COMMAND                                                 #
#########################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_3  #
#########################################################################################################################################


   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.5-dev

LLNL-CODE-775068

Copyright (c) 2014-23, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
  Compiler Flags:         "-O3 -O2 -march=znver5 -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -cxx=clang++    "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 16 threads on rank 0
    0->  0    1->192    2->193    3->  2    4->  3    5->195    6->196    7->  5
    8->  6    9->  7   10->199   11->200   12->  9   13-> 10   14->202   15->203

Input Parameters
================

  Problem Size:
    Zones:                 24 x 16 x 16  (6144 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       8
    Spatial decomp:        2 x 2 x 2 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             8           1 / 8
  (Rx,Ry,Rz) R in XYZ:   2x2x2       1x1x1 / 2x2x2
  (PQR) TOTAL:           8           16 / 128

  Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  24        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       37748736      288.000
  k_plane                       37748736      288.000
  mixelem_to_fraction               6320        0.048
  phi                          157286400     1200.000
  phi_out                      157286400     1200.000
  psi                          603979776     4608.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          603979776     4608.000
  sigt_zonal                     6291456       48.000
  volume                            6144        0.047
  --------                  ------------    ---------
  TOTAL                       1645233448    12552.135

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.289596e+09, change=1.000000e+00
  iter 1: particle count=1.938605e+09, change=3.347815e-01
  iter 2: particle count=2.261901e+09, change=1.429312e-01
  iter 3: particle count=2.422372e+09, change=6.624562e-02
  iter 4: particle count=2.501793e+09, change=3.174531e-02
  iter 5: particle count=2.540981e+09, change=1.542247e-02
  iter 6: particle count=2.560258e+09, change=7.529487e-03
  iter 7: particle count=2.569713e+09, change=3.679282e-03
  iter 8: particle count=2.574337e+09, change=1.796232e-03
  iter 9: particle count=2.576593e+09, change=8.754707e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.01828
  LPlusTimes                  10       1.08338
  LTimes                      10       1.08809
  Population                  10       0.14616
  Scattering                  10      33.49054
  Solve                        1      37.85016
  Source                      10       0.00262
  SweepSolver                 10       1.41291
  SweepSubdomain             160       0.42448

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.018277,1.083379,1.088092,0.146161,33.490543,37.850164,0.002615,1.412906,0.424476

Figures of Merit
================

  Throughput:         1.595712e+08 [unknowns/(second/iteration)]
  Grind time :        6.266793e-09 [(seconds/iteration)/unknowns]
  Sweep efficiency :  30.04279 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 603979776

END


Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_4

To display your profiling results:
#########################################################################################################################################
#    LEVEL    |     REPORT     |                                                COMMAND                                                 #
#########################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_4  #
#########################################################################################################################################


   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \ 
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.5-dev

LLNL-CODE-775068

Copyright (c) 2014-23, Lawrence Livermore National Security, LLC

Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license

This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.

Author: Adam J. Kunen 

Compilation Options:
  Architecture:           OpenMP
  Compiler:               /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
  Compiler Flags:         "-O3 -O2 -march=znver5 -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -grecord-gcc-switches -cxx=clang++    "
  Linker Flags:           " "
  CHAI Enabled:           No
  CUDA Enabled:           No
  MPI Enabled:            Yes
  OpenMP Enabled:         Yes
  Caliper Enabled:        No

OpenMP Thread->Core mapping for 24 threads on rank 0
    0->  0    1->192    2->  1    3->193    4->  2    5->194    6->  3    7->195
    8->  4    9->196   10->  5   11->197   12->  6   13->198   14->  7   15->199
   16->  8   17->200   18->  9   19->201   20-> 10   21->202   22-> 11   23->203

Input Parameters
================

  Problem Size:
    Zones:                 24 x 16 x 16  (6144 total)
    Groups:                1024
    Legendre Order:        4
    Quadrature Set:        Dummy S2 with 96 points

  Physical Properties:
    Total X-Sec:           sigt=[0.100000, 0.000100, 0.100000]
    Scattering X-Sec:      sigs=[0.050000, 0.000050, 0.050000]

  Solver Options:
    Number iterations:     10

  MPI Decomposition Options:
    Total MPI tasks:       8
    Spatial decomp:        2 x 2 x 2 MPI tasks
    Block solve method:    Sweep

  Per-Task Options:
    DirSets/Directions:    8 sets, 12 directions/set
    GroupSet/Groups:       2 sets, 512 groups/set
    Zone Sets:             1 x 1 x 1
    Architecture:          OpenMP
    Data Layout:           DGZ

Generating Problem
==================

  Decomposition Space:   Procs:      Subdomains (local/global):
  ---------------------  ----------  --------------------------
  (P) Energy:            1           2 / 2
  (Q) Direction:         1           8 / 8
  (R) Space:             8           1 / 8
  (Rx,Ry,Rz) R in XYZ:   2x2x2       1x1x1 / 2x2x2
  (PQR) TOTAL:           8           16 / 128

  Material Volumes=[9.375000e+03, 1.237500e+05, 2.746875e+06]

  Memory breakdown of Field variables:
  Field Variable            Num Elements    Megabytes
  --------------            ------------    ---------
  data/sigs                     15728640      120.000
  dx                                  24        0.000
  dy                                  16        0.000
  dz                                  16        0.000
  ell                               2400        0.018
  ell_plus                          2400        0.018
  i_plane                       25165824      192.000
  j_plane                       37748736      288.000
  k_plane                       37748736      288.000
  mixelem_to_fraction               6320        0.048
  phi                          157286400     1200.000
  phi_out                      157286400     1200.000
  psi                          603979776     4608.000
  quadrature/w                        96        0.001
  quadrature/xcos                     96        0.001
  quadrature/ycos                     96        0.001
  quadrature/zcos                     96        0.001
  rhs                          603979776     4608.000
  sigt_zonal                     6291456       48.000
  volume                            6144        0.047
  --------                  ------------    ---------
  TOTAL                       1645233448    12552.135

  Generation Complete!

Steady State Solve
==================

  iter 0: particle count=1.289596e+09, change=1.000000e+00
  iter 1: particle count=1.938605e+09, change=3.347815e-01
  iter 2: particle count=2.261901e+09, change=1.429312e-01
  iter 3: particle count=2.422372e+09, change=6.624562e-02
  iter 4: particle count=2.501793e+09, change=3.174531e-02
  iter 5: particle count=2.540981e+09, change=1.542247e-02
  iter 6: particle count=2.560258e+09, change=7.529487e-03
  iter 7: particle count=2.569713e+09, change=3.679282e-03
  iter 8: particle count=2.574337e+09, change=1.796232e-03
  iter 9: particle count=2.576593e+09, change=8.754707e-04
  Solver terminated

Timers
======

  Timer                    Count       Seconds
  ----------------  ------------  ------------
  Generate                     1       0.02535
  LPlusTimes                  10       0.87169
  LTimes                      10       0.76946
  Population                  10       0.11751
  Scattering                  10      24.31454
  Solve                        1      27.77830
  Source                      10       0.00136
  SweepSolver                 10       1.07208
  SweepSubdomain             160       0.32746

TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.025353,0.871688,0.769463,0.117507,24.314545,27.778298,0.001356,1.072082,0.327458

Figures of Merit
================

  Throughput:         2.174286e+08 [unknowns/(second/iteration)]
  Grind time :        4.599210e-09 [(seconds/iteration)/unknowns]
  Sweep efficiency :  30.54411 [100.0 * SweepSubdomain time / SweepSolver time]
  Number of unknowns: 603979776

END


Your experiment path is /beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_5

To display your profiling results:
#########################################################################################################################################
#    LEVEL    |     REPORT     |                                                COMMAND                                                 #
#########################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/beegfs/hackathon/users/eoseret/OV1_WP_WS_kripke_Zen5_rerun/tools/lprof_run_5  #
#########################################################################################################################################

×