Help is available by hovering over any symbol or on the MAQAO website.
Filter Information
63 threads, each covering less than 1% of the profiled time (= Max (Thread Active Time)), were discarded; together they account for 29.84 seconds of CPU time. The threshold below which a thread is discarded can be adjusted with the thread-filter-threshold option.
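To make the filter rule concrete, the sketch below recomputes the cutoff from the figures quoted in this report. It assumes the filter compares each thread's active time with 1% of Max (Thread Active Time); the variable names are illustrative and are not part of MAQAO.

```python
# Figures quoted in the report.
max_thread_active_time_s = 358.62   # Max (Thread Active Time)
discarded_threads = 63
discarded_cpu_time_s = 29.84

# Assumption: a thread is discarded when its active time is below
# 1% of the maximum thread active time.
threshold_fraction = 0.01
threshold_s = threshold_fraction * max_thread_active_time_s

# Average CPU time per discarded thread, for comparison with the cutoff.
avg_discarded_s = discarded_cpu_time_s / discarded_threads

print(f"cutoff per thread: {threshold_s:.2f} s")
print(f"average discarded thread: {avg_discarded_s:.2f} s")
```

Under that assumption the cutoff is about 3.59 s per thread, and the discarded threads average well below it, which is consistent with the message above.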
Global Metrics
Total Time (s): 385.43
Max (Thread Active Time) (s): 358.62
Average Active Time (s): 358.62
Activity Ratio (%): 93.0
Average number of active threads: 0.930
Affinity Stability (%): 100.0
Time in analyzed loops (%): 4.16
Time in analyzed innermost loops (%): 3.28
Time in user code (%): 4.37
Compilation Options Score (%): 99.1
Array Access Efficiency (%): 61.8
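The Activity Ratio reported above appears to be the average active time divided by the total time. That relationship is an assumption inferred from the numbers, not a documented MAQAO formula, but the quoted figures are consistent with it:

```python
# Figures quoted in the Global Metrics section.
total_time_s = 385.43
average_active_time_s = 358.62

# Assumption: Activity Ratio = Average Active Time / Total Time.
activity_ratio = average_active_time_s / total_time_s
activity_ratio_pct = 100.0 * activity_ratio

print(f"activity ratio: {activity_ratio_pct:.1f} %")
print(f"average active threads: {activity_ratio:.3f}")
```

With a single retained thread, the same ratio also matches the reported average number of active threads (0.930).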
Potential Speedups
Perfect Flow Complexity: 1.00
Perfect OpenMP/MPI/Pthread/TBB: 1.00
Perfect OpenMP/MPI/Pthread/TBB + Perfect Load Distribution: 1.00
No Scalar Integer: Potential Speedup 1.01, Nb Loops to get 80%: 3
FP Vectorised: Potential Speedup 1.01, Nb Loops to get 80%: 3
Fully Vectorised: Potential Speedup 1.02, Nb Loops to get 80%: 4
FP Arithmetic Only: Potential Speedup 1.02, Nb Loops to get 80%: 5
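The potential speedups are all close to 1.00 because only 4.16% of the run time is spent in analyzed loops. As a rough illustration, the sketch below applies Amdahl's law to that fraction. This is a generic upper bound, not the method MAQAO uses to compute its estimates, and the function name `amdahl_bound` is hypothetical.

```python
import math

def amdahl_bound(fraction: float, local_speedup: float = math.inf) -> float:
    """Overall speedup when `fraction` of the run time is sped up by `local_speedup`."""
    return 1.0 / ((1.0 - fraction) + fraction / local_speedup)

# Fractions quoted in the Global Metrics section.
time_in_loops = 0.0416            # Time in analyzed loops
time_in_innermost_loops = 0.0328  # Time in analyzed innermost loops

# Best case: the loops take no time at all after optimization.
bound_loops = amdahl_bound(time_in_loops)
bound_innermost = amdahl_bound(time_in_innermost_loops)

# Potential speedups quoted in the report.
reported = {
    "No Scalar Integer": 1.01,
    "FP Vectorised": 1.01,
    "Fully Vectorised": 1.02,
    "FP Arithmetic Only": 1.02,
}

print(f"bound from analyzed loops: {bound_loops:.3f}")
print(f"bound from innermost loops: {bound_innermost:.3f}")
for name, speedup in reported.items():
    print(f"{name}: {speedup:.2f} (within bound: {speedup <= bound_loops})")
```

Even if the analyzed loops cost nothing, the whole run could not be more than about 1.04 times faster, so the reported values of 1.01 to 1.02 sit within that limit.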
[Charts not reproduced: CQA Potential Speedups Summary, Average Active Threads Count, Loop Based Profile, Innermost Loop Based Profile, Application Categorization]
Compilation Options
Source Object: libllama.so
Issue: -g is missing for some functions (possibly ones added by the compiler); it is needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)