Help is available by moving the cursor above any symbol or by checking MAQAO website.
- There is no filter information to display
Total Time (s) | 1.78 E3 | |
Max (Thread Active Time) (s) | 1.55 E3 | |
Average Active Time (s) | 1.37 E3 | |
Activity Ratio (%) | 77.5 | |
Average number of active threads | 74.324 | |
Affinity Stability (%) | 100.0 | |
Time in analyzed loops (%) | 83.1 | |
Time in analyzed innermost loops (%) | 73.5 | |
Time in user code (%) | 88.7 | |
Compilation Options Score (%) | 100 | |
Array Access Efficiency (%) | 74.3 | |
|
Potential Speedups |
Perfect Flow Complexity | 1.01 | |
Perfect OpenMP + MPI + Pthread | 1.14 | |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.21 | |
No Scalar Integer | Potential Speedup | 1.17 | |
Nb Loops to get 80% | 21 | |
FP Vectorised | Potential Speedup | 1.04 | |
Nb Loops to get 80% | 7 | |
Fully Vectorised | Potential Speedup | 1.20 | |
Nb Loops to get 80% | 25 | |
FP Arithmetic Only | Potential Speedup | 1.60 | |
Nb Loops to get 80% | 35 | |
Source Object | Issue |
▼engine_linuxa64_ompi– | |
○ccurv3.F | |
○forint.F | |
○inter_count_node_curv.F | |
○r2len3.F | |
○cmain3.F | |
○r4evec3.F | |
○i7buce_crit.F | |
○cdlen3.F | |
○cfint3.F | |
○inter_minmax_node.F | |
○cortdir3.F | |
○r4def3.F | |
○cupdt3.F | |
○depla.F | |
○ccoef3.F | |
○dtnoda.F | |
○resol.F | |
○spmd_cell_size_exchange.F | |
○myqsort_int.F | |
○spmd_exch2_a_pon.F | |
○shvis3.F | |
○intfop2.F | |
○i7mainf.F | |
○rbilan.F | |
○rbyvit.F | |
○rbyonf.F | |
○rgbodv.F | |
○sigeps02c.F | |
○sforc3.F | |
○i7main_crit_tri.F | |
○spmd_i7xvcom2.F | |
○chvis3.F | |
○rforc3.F | |
○mmain.F90 | |
○asspar4.F | |
○forintc.F | |
○cstra3.F | |
○spmd_i7fcom_pon.F | |
○vitesse.F | |
○mulawc.F | |
○rgbcor.F | |
○r4cum3p.F | |
○i7trivox.F | |
○rgbodfp.F | |
○mulawglc.F | |
○accele.F | |
○cbilan.F | |
○cderi3.F | |
○timer.F | |
○i7for3.F | |
○i7cdcor3.F | |
○parit.F | |
○cnvec3.F | |
○deplafakeige.F | |
○rgwall.F | |
○ccoor3.F | |
○inter_check_sort.F | |
○inttri.F | |
○inter_voxel_creation.F | |
○i7ass3.F | |
○i7optcd.F | |
○i7pen3.F | |
○c3forc3.F | |
○sderi3.F | |
○inter_cell_color.F | |
○i7cor3.F | |
○redef3.F | |
○bcs10.F | |
○i7main_opt_tri.F | |
○layini.F | |
○sigeps01g.F | |
○ecrit.F | |
○i7dst3.F | |
○scoor3.F | |
○scumu3p.F | |
○m2cplr.F | |
○hist2.F | |
○rbyfor.F | |
○r2coor3.F | |
○cforc3.F | |
○cdefo3.F | |
Experiment Name | |
Application | /home/hbollore/pop3/openradioss/OpenRadioss/exec/engine_linuxa64_ompi |
Timestamp | 2025-01-15 14:35:38 |
Universal Timestamp | 1736951738 |
Number of processes observed | 24 |
Number of threads observed | 96 |
Experiment Type | MPI; OpenMP; |
Machine | ip-172-31-47-249.ec2.internal |
Architecture | aarch64 |
Micro Architecture | ARM_NEOVERSE_V2 |
OS Version | Linux 6.1.109-118.189.amzn2023.aarch64 #1 SMP Tue Sep 10 08:58:40 UTC 2024 |
Architecture used during static analysis | aarch64 |
Micro Architecture used during static analysis | ARM_NEOVERSE_V2 |
Frequency Driver | NA |
Frequency Governor | NA |
Huge Pages | madvise |
Hyperthreading | off |
Number of sockets | 1 |
Number of cores per socket | 96 |
Compilation Options | engine_linuxa64_ompi: Arm F90 F90 Flang - 1.5 2017-05-01 flang -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/../common_source/includes -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/../common_source/modules -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/share/includes -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/share/r8 -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/share/spe_inc -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/cbuild_engine_linuxa64_ompi/CMakeFiles/includes_engine_linuxa64_ompi -g -mcpu=native -fno-omit-frame-pointer -module CMakeFiles/modules_engine_linuxa64_ompi -D WITHOUT_LINALG -mcpu=native -D COMP_ARMFLANG=1 -D ARCH_CPU=ARM -fopenmp -D MYREAL8 -ffixed-line-length-none -D MPI -I /home/hbollore/soft/openmpi-5.0.6-armsuite/include/ -D CPP_mach=CPP_p4linux964 -D CPP_rel=70 -O3 -nofma -ffp-contract=off -fno-unsafe-math-optimizations -fno-fast-math -fveclib=none -I /home/hbollore/pop3/openradioss/OpenRadioss/engine/../extlib/h3d/includes -c -o CMakeFiles/engine_linuxa64_ompi.dir/source/materials/mat_share/mulawc.F.o | | |
Comments | | | |
Dataset | |
Run Command | <executable> -i /home/hbollore/pop3/openradioss/dataset/N1M/NEON1M11_0001.rad |
MPI Command | mpirun -n <number_processes> --bind-to core --map-by node:PE=<OMP_NUM_THREADS> --report-bindings |
Number Processes | 24 |
Number Nodes | 1 |
Number Processes per Nodes | 24 |
Filter | Not Used |
Profile Start | Not Used |
Maximal Path Number | 4 |