* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 9107)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 235.03761 +- 0.000001. Correct Result: 235.037611
Configuration
Number of Threads: 1
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 590.411
Minimum kernel time: 0.00587606
Maximum kernel time: 0.00658798
Arithm. Mean kernel time: 0.00590402
Performance results
Total GFlops/s: 2.45481
Minimum GFlops/s: 2.19999
Maximum GFlops/s: 2.46653
Arithm. Mean GFlops/s: 2.45485
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 9107)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 9107)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 9107)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 9107)
Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0
To display your profiling results:
############################################################################################################################
# LEVEL | REPORT | COMMAND #
############################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_0 #
############################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 9359)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.09150 +- 0.000001. Correct Result: 234.091499
Configuration
Number of Threads: 2
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 320.897
Minimum kernel time: 0.00317192
Maximum kernel time: 0.00428486
Arithm. Mean kernel time: 0.00320889
Performance results
Total GFlops/s: 4.51656
Minimum GFlops/s: 3.38249
Maximum GFlops/s: 4.56931
Arithm. Mean GFlops/s: 4.51667
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 9359)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 9359)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 9359)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 9359)
Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1
To display your profiling results:
############################################################################################################################
# LEVEL | REPORT | COMMAND #
############################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_1 #
############################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 9606)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.94635 +- 0.000001. Correct Result: 234.946347
Configuration
Number of Threads: 4
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 208.941
Minimum kernel time: 0.0020709
Maximum kernel time: 0.00258303
Arithm. Mean kernel time: 0.00208934
Performance results
Total GFlops/s: 6.93665
Minimum GFlops/s: 5.61105
Maximum GFlops/s: 6.99863
Arithm. Mean GFlops/s: 6.93687
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 9606)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 9606)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 9606)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 9606)
Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2
To display your profiling results:
############################################################################################################################
# LEVEL | REPORT | COMMAND #
############################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_2 #
############################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 9841)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.57306 +- 0.000001. Correct Result: 234.573063
Configuration
Number of Threads: 8
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 136.976
Minimum kernel time: 0.00135493
Maximum kernel time: 0.00198197
Arithm. Mean kernel time: 0.00136969
Performance results
Total GFlops/s: 10.5811
Minimum GFlops/s: 7.31266
Maximum GFlops/s: 10.6968
Arithm. Mean GFlops/s: 10.5816
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 9841)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 9841)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 9841)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 9841)
Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3
To display your profiling results:
############################################################################################################################
# LEVEL | REPORT | COMMAND #
############################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_3 #
############################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 10081)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.26021 +- 0.000001. Correct Result: 233.260206
Configuration
Number of Threads: 16
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 78.9088
Minimum kernel time: 0.000714779
Maximum kernel time: 0.00503993
Arithm. Mean kernel time: 0.000788997
Performance results
Total GFlops/s: 18.3674
Minimum GFlops/s: 2.87573
Maximum GFlops/s: 20.2769
Arithm. Mean GFlops/s: 18.3695
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 10081)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 10081)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 10081)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 10081)
Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4
To display your profiling results:
############################################################################################################################
# LEVEL | REPORT | COMMAND #
############################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_4 #
############################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 10322)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 235.30691 +- 0.000001. Correct Result: 235.306908
Configuration
Number of Threads: 32
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 52.0227
Minimum kernel time: 0.000365019
Maximum kernel time: 0.00592399
Arithm. Mean kernel time: 0.00052011
Performance results
Total GFlops/s: 27.8599
Minimum GFlops/s: 2.44658
Maximum GFlops/s: 39.7062
Arithm. Mean GFlops/s: 27.8662
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 10322)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 10322)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 10322)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 10322)
Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5
To display your profiling results:
############################################################################################################################
# LEVEL | REPORT | COMMAND #
############################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_5 #
############################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 10578)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.59948 +- 0.000001. Correct Result: 234.599480
Configuration
Number of Threads: 52
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 27.3909
Minimum kernel time: 0.000200033
Maximum kernel time: 0.0037992
Arithm. Mean kernel time: 0.000273877
Performance results
Total GFlops/s: 52.9135
Minimum GFlops/s: 3.81488
Maximum GFlops/s: 72.4554
Arithm. Mean GFlops/s: 52.9197
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 10578)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 10578)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 10578)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 10578)
Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6
To display your profiling results:
############################################################################################################################
# LEVEL | REPORT | COMMAND #
############################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_6 #
############################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 10853)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.40563 +- 0.000001. Correct Result: 234.405628
Configuration
Number of Threads: 104
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 5.8824
Minimum kernel time: 4.60148e-05
Maximum kernel time: 0.0122659
Arithm. Mean kernel time: 5.87934e-05
Performance results
Total GFlops/s: 246.388
Minimum GFlops/s: 1.18161
Maximum GFlops/s: 314.975
Arithm. Mean GFlops/s: 246.516
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 10853)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 10853)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 10853)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 10853)
Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7
To display your profiling results:
############################################################################################################################
# LEVEL | REPORT | COMMAND #
############################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_7 #
############################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 11179)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.27556 +- 0.000001. Correct Result: 234.275559
Configuration
Number of Threads: 208
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 5.22391
Minimum kernel time: 3.88622e-05
Maximum kernel time: 0.012706
Arithm. Mean kernel time: 5.22014e-05
Performance results
Total GFlops/s: 277.445
Minimum GFlops/s: 1.14068
Maximum GFlops/s: 372.946
Arithm. Mean GFlops/s: 277.646
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 11179)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 11179)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 11179)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 11179)
Your experiment path is /home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8
To display your profiling results:
############################################################################################################################
# LEVEL | REPORT | COMMAND #
############################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/icx_ov1_scala/tools/lprof_npsu_run_8 #
############################################################################################################################