Executable Output

Pagesize: 4 MiB
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Using CSR format

Correctness check
Success, correct result.

Configuration              
Number of Threads:         64
Number of Repetitions:     200000
Input filename:            input-matrix/mat_dim_493039.txt
Matrix value array size:   57973976

Time measurements          
Total experiment time:     72.2026
Minimum kernel time:       9.1135e-05
Maximum kernel time:       0.00538477
Arithm. Mean kernel time:  0.000360941

Performance results        
Total GFlops/s:            40.1468
Minimum GFlops/s:          2.69157
Maximum GFlops/s:          159.033
Arithm. Mean GFlops/s:     40.1548
Arithm. Median GFlops/s:   47.688
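
Note: the GFlops/s figures above appear to follow from the kernel times by simple arithmetic. The sketch below is an interpretation, not the benchmark's own code. It assumes the "Matrix value array size" is given in bytes, that values are 8-byte doubles, and that each stored nonzero costs 2 floating-point operations (one multiply, one add). None of these assumptions is stated in the output, and the name gflops is hypothetical.

```python
# Assumptions (not stated in the log): the value array size is in bytes,
# values are 8-byte doubles, and each stored nonzero costs 2 flops.
VALUE_ARRAY_BYTES = 57973976   # "Matrix value array size"
BYTES_PER_VALUE = 8            # assumed: double precision
FLOPS_PER_NONZERO = 2          # assumed: one multiply + one add


def gflops(seconds: float, repetitions: int = 1) -> float:
    """Rate in GFlops/s for `repetitions` SpMV kernels taking `seconds` in total."""
    nonzeros = VALUE_ARRAY_BYTES // BYTES_PER_VALUE
    flops = FLOPS_PER_NONZERO * nonzeros * repetitions
    return flops / seconds / 1e9


# The fastest kernel gives the maximum rate, the slowest the minimum.
max_rate = gflops(9.1135e-05)       # from "Minimum kernel time"
min_rate = gflops(0.00538477)       # from "Maximum kernel time"
mean_rate = gflops(0.000360941)     # from "Arithm. Mean kernel time"
total_rate = gflops(72.2026, repetitions=200000)  # from "Total experiment time"

for label, rate in [("max", max_rate), ("min", min_rate),
                    ("mean", mean_rate), ("total", total_rate)]:
    print(f"{label:>5}: {rate:.4f} GFlops/s")
```

Under these assumptions the four computed rates agree with the reported maximum, minimum, mean and total values up to rounding of the printed times. The median cannot be checked this way because it depends on the individual kernel timings, which are not listed.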


* [MAQAO] Info: Dumping samples (host ip-172-31-18-66, process 2407)
* [MAQAO] Info: Dumping source info for callchain nodes (host ip-172-31-18-66, process 2407)
* [MAQAO] Info: Building/writing metadata (host ip-172-31-18-66)
* [MAQAO] Info: Finished collect step (host ip-172-31-18-66, process 2407)


Your experiment path is /home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-gcc-ofast.exe/tools/lprof_npsu_run_0

To display your profiling results:
############################################################################################################################################################
#    LEVEL    |     REPORT     |                                                          COMMAND                                                          #
############################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-gcc-ofast.exe/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-gcc-ofast.exe/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-gcc-ofast.exe/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-gcc-ofast.exe/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-gcc-ofast.exe/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-gcc-ofast.exe/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-gcc-ofast.exe/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/fmusial/SPMXV_Benchmarks/epi-spmxv-main/results/spmxv-gcc-ofast.exe/tools/lprof_npsu_run_0  #
############################################################################################################################################################