* Info: Detected 1 Lprof instances in o404: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting
* Info: Selecting the 'perf-high-ppn' engine for node o404
* Info: Process launched (host o404, process 116715)
-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 860160000 (elements), Offset = 0 (elements)
Memory per array = 6562.5 MiB (= 6.4 GiB).
Total memory required = 19687.5 MiB (= 19.2 GiB).
Each kernel will be executed 100 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 112
Number of Threads counted = 112
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 9757 microseconds.
(= 9757 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function    Best Rate MB/s  Avg time     Min time     Max time
Copy:         1052505.4     0.013698     0.013076     0.014142
Scale:        1087949.4     0.013506     0.012650     0.013886
Add:          1152707.6     0.018827     0.017909     0.019498
Triad:        1130426.0     0.019228     0.018262     0.019386
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
* Info: Process finished (host o404, process 116715)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/icx_2/oneview_results_1714153525/tools/lprof_npsu_run_0
To display your profiling results:
#############################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#############################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/icx_2/oneview_results_1714153525/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/icx_2/oneview_results_1714153525/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/icx_2/oneview_results_1714153525/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/icx_2/oneview_results_1714153525/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/icx_2/oneview_results_1714153525/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/icx_2/oneview_results_1714153525/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/icx_2/oneview_results_1714153525/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-415-2041/intel/stream/run/oneview_runs/compilers/icx_2/oneview_results_1714153525/tools/lprof_npsu_run_0 #
#############################################################################################################################################################################################################