
Executable Output

Pagesize: 4 MiB
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Using CSR format

Correctness check
Success, correct result.

Configuration              
Number of Threads:         64
Number of Repetitions:     200000
Input filename:            input-matrix/mat_dim_493039.txt
Matrix value array size:   57973976

Time measurements          
Total experiment time:     58.3348
Minimum kernel time:       9.89437e-05
Maximum kernel time:       0.00501084
Arithm. Mean kernel time:  0.000291637

Performance results        
Total GFlops/s:            49.6907
Minimum GFlops/s:          2.89243
Maximum GFlops/s:          146.482
Arithm. Mean GFlops/s:     49.697
Arithm. Median GFlops/s:   51.2132


* [MAQAO] Info: Dumping samples (host ip-172-31-18-66, process 3911)
* [MAQAO] Info: Dumping source info for callchain nodes (host ip-172-31-18-66, process 3911)
* [MAQAO] Info: Building/writing metadata (host ip-172-31-18-66)
* [MAQAO] Info: Finished collect step (host ip-172-31-18-66, process 3911)


Your experiment path is /home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-acfl-ofast.exe/tools/lprof_npsu_run_0

To display your profiling results:
#############################################################################################################################################################
#    LEVEL    |     REPORT     |                                                          COMMAND                                                           #
#############################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-acfl-ofast.exe/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-acfl-ofast.exe/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-acfl-ofast.exe/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-acfl-ofast.exe/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-acfl-ofast.exe/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-acfl-ofast.exe/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-acfl-ofast.exe/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-acfl-ofast.exe/tools/lprof_npsu_run_0  #
#############################################################################################################################################################
