| Metric | Value |
|---|---|
| Total Time (s) | 18.86 |
| Max (Thread Active Time) (s) | 16.87 |
| Average Active Time (s) | 15.93 |
| Activity Ratio (%) | 92.4 |
| Average number of active threads | 9.63E3 |
| Affinity Stability (%) | 93.4 |
| Time in analyzed loops (%) | 53.0 |
| Time in analyzed innermost loops (%) | 52.6 |
| Time in user code (%) | 77.4 |
| Compilation Options Score (%) | 66.7 |
| Array Access Efficiency (%) | 90.4 |


Potential Speedups

| Scenario | Potential Speedup | Nb Loops to get 80% |
|---|---|---|
| Perfect Flow Complexity | 1.00 | |
| Perfect OpenMP/MPI/Pthread/TBB | 1.27 | |
| Perfect OpenMP/MPI/Pthread/TBB + Perfect Load Distribution | 1.33 | |
| No Scalar Integer | 1.00 | 5 |
| FP Vectorised | 1.16 | 11 |
| Fully Vectorised | 1.72 | 17 |
| FP Arithmetic Only | 1.11 | 7 |

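
To make the potential speedups easier to compare, they can be translated into projected run times. The sketch below assumes the simple model "projected time = total time / potential speedup" applied to the reported Total Time of 18.86 s; this reading of the metric is an assumption, and the helper names `projected_time` and `projected_times` are hypothetical, not part of MAQAO.

```python
# Total Time (s) as reported in the global metrics table.
TOTAL_TIME_S = 18.86

# Potential speedups copied from the report.
POTENTIAL_SPEEDUPS = {
    "Perfect Flow Complexity": 1.00,
    "Perfect OpenMP/MPI/Pthread/TBB": 1.27,
    "Perfect OpenMP/MPI/Pthread/TBB + Perfect Load Distribution": 1.33,
    "No Scalar Integer": 1.00,
    "FP Vectorised": 1.16,
    "Fully Vectorised": 1.72,
    "FP Arithmetic Only": 1.11,
}


def projected_time(total_time_s: float, speedup: float) -> float:
    """Assumed model: projected time = current time / potential speedup."""
    return total_time_s / speedup


def projected_times(total_time_s: float = TOTAL_TIME_S,
                    speedups: dict = POTENTIAL_SPEEDUPS) -> dict:
    """Return the projected run time in seconds for every scenario."""
    return {name: projected_time(total_time_s, s) for name, s in speedups.items()}


if __name__ == "__main__":
    for name, t in projected_times().items():
        print(f"{name}: {t:.2f} s (saves {TOTAL_TIME_S - t:.2f} s)")
```

Under this assumed model, the "Fully Vectorised" scenario stands out as the largest projected gain, while the two scenarios with a speedup of 1.00 leave the run time unchanged.
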
| Source Object | Issue |
|---|---|
| ▼bt-mz.E.x | |
| ○y_solve.f90 | -march=(target) is missing. |
| ○add.f90 | -march=(target) is missing. |
| ○x_solve.f90 | -march=(target) is missing. |
| ○exact_rhs.f90 | -march=(target) is missing. |
| ○exact_solution.f90 | -march=(target) is missing. |
| ○solve_subs.f90 | -march=(target) is missing. |
| ○z_solve.f90 | -march=(target) is missing. |
| ○initialize.f90 | -march=(target) is missing. |
| ○exch_qbc.f90 | -march=(target) is missing. |
| ○rhs.f90 | -march=(target) is missing. |
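
Every source file is flagged for the same reason: the compilation options recorded in this report contain no `-march=` flag. The following is a minimal sketch of that kind of check, not MAQAO's actual implementation; the function names `has_march` and `with_march` are hypothetical, and `native` is only an example target (a GCC-style value that selects the build machine's architecture), so whether it is appropriate depends on where the code is compiled.

```python
import shlex

# Compilation options as recorded in the report for bt-mz.E.x.
REPORTED_OPTIONS = (
    "-I/gpfs/apps/MN5/GPP/ONEAPI/2023.2.0/mpi/2021.10.0/include/gfortran "
    "-I/gpfs/apps/MN5/GPP/ONEAPI/2023.2.0/mpi/2021.10.0/include "
    "-c -O3 -fopenmp -g -fno-omit-frame-pointer"
)


def has_march(options: str) -> bool:
    """Return True if any option token sets a target architecture."""
    return any(tok.startswith("-march=") for tok in shlex.split(options))


def with_march(options: str, target: str) -> str:
    """Append -march=<target> unless an architecture flag is already present."""
    if has_march(options):
        return options
    return f"{options} -march={target}"


if __name__ == "__main__":
    print("-march present:", has_march(REPORTED_OPTIONS))
    # "native" is an illustrative target, not a value taken from the report.
    print(with_march(REPORTED_OPTIONS, "native"))
```

Rebuilding with an explicit target allows the compiler to use the instruction sets of the machine the code runs on, which is the concern behind this warning.
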

| Field | Value |
|---|---|
| Experiment Name | |
| Application | ./bt-mz.E.x |
| Timestamp | NA |
| Universal Timestamp | NA |
| Number of processes observed | 200 |
| Number of threads observed | 11400 |
| Experiment Type | MPI; OpenMP; |
| Machine | gs24r3b56,gs25r2b10,gs04r2b48,gs21r1b13,gs26r3b29,gs25r2b06,gs14r1b40,gs26r3b30,gs26r3b26,gs25r2b07,gs04r2b42,gs24r3b48,gs14r1b43,gs25r2b48,gs04r2b54,gs04r2b59,gs14r1b18,gs04r2b35,gs26r3b32,gs23r3b59,gs26r3b27,gs24r3b64,gs14r1b65,gs21r1b49,gs14r1b60,gs25r2b09,gs04r2b44,gs23r3b70,gs04r2b49,gs25r2b02,gs14r1b63,gs12r3b34,gs04r2b40,gs21r1b24,gs25r2b03,gs25r2b27,gs04r2b12,gs25r2b46,gs14r1b56,gs21r1b45,gs25r2b47,gs12r3b06,gs04r2b62,gs14r1b11,gs23r3b60,gs24r3b49,gs24r3b50,gs23r3b58,gs12r3b71,gs04r2b38,gs24r3b58,gs14r1b28,gs21r1b44,gs14r1b46,gs04r2b60,gs23r3b69,gs25r2b18,gs14r1b29,gs04r2b45,gs04r2b57,gs14r1b22,gs04r2b32,gs12r3b35,gs25r2b01,gs25r2b04,gs21r1b50,gs04r2b41,gs24r3b36,gs04r2b51,gs14r1b59,gs14r1b58,gs24r3b35,gs14r1b20,gs25r2b08,gs14r1b27,gs24r3b44,gs04r2b46,gs14r1b32,gs12r3b36,gs24r3b53,gs14r1b01,gs23r3b24,gs12r3b42,gs12r3b12,gs24r3b51,gs04r2b33,gs26r3b28,gs25r2b05,gs04r2b65,gs14r1b55,gs24r3b54,gs04r2b39,gs24r3b52,gs14r1b57,gs14r1b26,gs14r1b36,gs24r3b39,gs21r1b62,gs04r2b47,gs04r2b52 |
| Model Name | Intel(R) Xeon(R) Platinum 8480+ |
| Architecture | x86_64 |
| Micro Architecture | SAPPHIRE_RAPIDS |
| Cache Size | 107520 KB |
| Number of Cores | 56 |
| OS Version | Linux 5.14.0-284.30.1.el9_2.x86_64 #1 SMP PREEMPT_DYNAMIC Fri Aug 25 09:13:12 EDT 2023 |
| Architecture used during static analysis | x86_64 |
| Micro Architecture used during static analysis | SAPPHIRE_RAPIDS |
| Frequency Driver | intel_cpufreq |
| Frequency Governor | performance |
| Huge Pages | always |
| Hyperthreading | on |
| Number of sockets | 2 |
| Number of cores per socket | 56 |
| Compilation Options | bt-mz.E.x: -I/gpfs/apps/MN5/GPP/ONEAPI/2023.2.0/mpi/2021.10.0/include/gfortran -I/gpfs/apps/MN5/GPP/ONEAPI/2023.2.0/mpi/2021.10.0/include -c -O3 -fopenmp -g -fno-omit-frame-pointer |
| Comments | |

| Parameter | Value |
|---|---|
| Dataset | |
| Run Command | `<executable>` |
| MPI Command | `srun -A ehpc535 -t 5 -q gp_ehpc -N <number_nodes> --ntasks-per-node=<number_processes_per_node> -c <OMP_NUM_THREADS>` |
| Number Nodes | 100 |
| Number Processes per Node | 2 |
| Filter | Not Used |
| Profile Start | Not Used |
| Profile Stop | Not Used |
| Maximal Path Number | 4 |
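
The MPI command is a template whose placeholders are filled from the other parameters. The sketch below shows one plausible expansion, assuming the run command is simply appended to the MPI command. The helper names are hypothetical, and the value 56 for `OMP_NUM_THREADS` is an illustrative assumption: the report does not state the thread count that was requested.

```python
# Templates copied from the report.
MPI_TEMPLATE = (
    "srun -A ehpc535 -t 5 -q gp_ehpc -N <number_nodes> "
    "--ntasks-per-node=<number_processes_per_node> -c <OMP_NUM_THREADS>"
)
RUN_TEMPLATE = "<executable>"


def expand(template: str, values: dict) -> str:
    """Replace every <name> placeholder with its value."""
    for name, value in values.items():
        template = template.replace(f"<{name}>", str(value))
    return template


def total_processes(number_nodes: int, processes_per_node: int) -> int:
    """Number of MPI processes launched across all nodes."""
    return number_nodes * processes_per_node


def build_command(number_nodes: int, processes_per_node: int,
                  omp_num_threads: int, executable: str) -> str:
    """Assumed layout: expanded MPI command followed by the run command."""
    values = {
        "number_nodes": number_nodes,
        "number_processes_per_node": processes_per_node,
        "OMP_NUM_THREADS": omp_num_threads,
        "executable": executable,
    }
    return f"{expand(MPI_TEMPLATE, values)} {expand(RUN_TEMPLATE, values)}"


if __name__ == "__main__":
    # 100 nodes and 2 processes per node come from the report;
    # 56 threads per process is a hypothetical value.
    print(build_command(100, 2, 56, "./bt-mz.E.x"))
    print("MPI processes:", total_processes(100, 2))
```

With 100 nodes and 2 processes per node, the expansion yields 200 MPI processes, which matches the number of processes observed in the experiment summary.
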