Help is available by moving the cursor above any symbol or by checking MAQAO website.
▶Filter Information
There is no filter information to display
Global Metrics
Total Time (s)
9.11
Max (Thread Active Time) (s)
1.62
Average Active Time (s)
1.60
Activity Ratio (%)
19.9
Average number of active threads
11.260
Affinity Stability (%)
95.1
Time in analyzed loops (%)
6.12
Time in analyzed innermost loops (%)
5.67
Time in user code (%)
6.27
Compilation Options Score (%)
74.8
Array Access Efficiency (%)
77.0
Potential Speedups
Perfect Flow Complexity
1.00
Perfect OpenMP/MPI/Pthread/TBB
1.02
Perfect OpenMP/MPI/Pthread/TBB + Perfect Load Distribution
1.05
No Scalar Integer
Potential Speedup
1.01
Nb Loops to get 80%
3
FP Vectorised
Potential Speedup
1.01
Nb Loops to get 80%
2
Fully Vectorised
Potential Speedup
1.04
Nb Loops to get 80%
1
FP Arithmetic Only
Potential Speedup
1.04
Nb Loops to get 80%
1
CQA Potential Speedups Summary
Average Active Threads Count⏎
Loop Based Profile⏎
Innermost Loop Based Profile⏎
Application Categorization⏎
Compilation Options⏎
Source Object
Issue
▼libllama.so–
○hashtable.h
-funroll-loops is missing.
○llama-vocab.cpp
-funroll-loops is missing.
▼libggml-base.so–
▼–
○
-g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)