* Info: Selecting the 'perf-low-ppn' engine for node ip-172-31-87-179.ec2.internal
* Info: "ref-cycles" not supported on ip-172-31-87-179.ec2.internal: fallback to "cpu-clock"
* Info: Process launched (host ip-172-31-87-179.ec2.internal, process 76667)Pagesize: 4 MiB
filename option set, ignoring flags -c, -n, -q, -z, -w
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Using CSR format
Correctness check
Success, correct result.
Configuration
Number of Threads: 96
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Matrix value array size: 57973976
Time measurements
Total experiment time: 7.56103
Minimum kernel time: 4.79221e-05
Maximum kernel time: 0.0112841
Arithm. Mean kernel time: 7.56959e-05
Performance results
Total GFlops/s: 191.687
Minimum GFlops/s: 1.28442
Maximum GFlops/s: 302.438
Arithm. Mean GFlops/s: 191.47
Arithm. Median GFlops/s: 195.467
* Info: Process finished (host ip-172-31-87-179.ec2.internal, process 76667)
* Info: Dumping samples (host ip-172-31-87-179.ec2.internal, process 76667)
* Info: Dumping source info for callchain nodes (host ip-172-31-87-179.ec2.internal, process 76667)
* Info: Building/writing metadata (host ip-172-31-87-179.ec2.internal)
* Info: Finished collect step (host ip-172-31-87-179.ec2.internal, process 76667)
Your experiment path is /home/kcamus/pop3/spmxv/epi-spmxv-main/armclang_o3_ov1_o96/tools/lprof_npsu_run_0
To display your profiling results:
#############################################################################################################################################
# LEVEL | REPORT | COMMAND #
#############################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/pop3/spmxv/epi-spmxv-main/armclang_o3_ov1_o96/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/pop3/spmxv/epi-spmxv-main/armclang_o3_ov1_o96/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/pop3/spmxv/epi-spmxv-main/armclang_o3_ov1_o96/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/pop3/spmxv/epi-spmxv-main/armclang_o3_ov1_o96/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/pop3/spmxv/epi-spmxv-main/armclang_o3_ov1_o96/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/pop3/spmxv/epi-spmxv-main/armclang_o3_ov1_o96/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/pop3/spmxv/epi-spmxv-main/armclang_o3_ov1_o96/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/pop3/spmxv/epi-spmxv-main/armclang_o3_ov1_o96/tools/lprof_npsu_run_0 #
#############################################################################################################################################