options

engine_linux64_intel_impi - 2024-10-10 14:04:09 - MAQAO 2.20.9

Help is available by moving the cursor above any symbol or by checking MAQAO website.

Global Metrics

Total Time (s)370.25
Profiled Time (s)340.82
Time in analyzed loops (%)84.0
Time in analyzed innermost loops (%)81.1
Time in user code (%)82.2
Compilation Options Score (%)100
Array Access Efficiency (%)88.3
Potential Speedups
Perfect Flow Complexity1.09
Perfect OpenMP + MPI + Pthread1.13
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution1.19
No Scalar IntegerPotential Speedup1.10
Nb Loops to get 80%19
FP VectorisedPotential Speedup1.04
Nb Loops to get 80%8
Fully VectorisedPotential Speedup1.87
Nb Loops to get 80%17
FP Arithmetic OnlyPotential Speedup1.83
Nb Loops to get 80%35

CQA Potential Speedups Summary

Loop Based Profile

Innermost Loop Based Profile

Application Categorization

Compilation Options

Source ObjectIssue
engine_linux64_intel_impi
m2cplr.F
forint.F
layini.F
r2len3.F
parsorc.F
cmain3.F
r4evec3.F
cdlen3.F
cfint3.F
inter_minmax_node.F
r4def3.F
i7dst3.F
hist2.F
ccoef3.F
cbilan.F
i7main_opt_tri.F
spmd_cell_size_exchange.F
spmd_exch2_a_pon.F
shvis3.F
intfop2.F
i7mainf.F
rbilan.F
rbyvit.F
rbyonf.F
rgbodv.F
sigeps02c.F
i7main_crit_tri.F
spmd_i7xvcom2.F
chvis3.F
rforc3.F
mmain.F90
asspar4.F
forintc.F
cstra3.F
spmd_i7fcom_pon.F
vitesse.F
mulawc.F
rgbcor.F
inttri.F
i7trivox.F
rbyfor.F
cderi3.F
timer.F
parit.F
cnvec3.F
ccoor3.F
forints.F
i7optcd.F
ecrit.F
accele.F
scoor3.F
rgbodfp.F
i7cor3.F
ccurv3.F
inter_cell_color.F
i7for3.F
redef3.F
s8sav3.F
mulawglc.F
rgwall.F
r4cum3p.F
scumu3p.F
dtnoda.F
depla.F
cdefo3.F
r2coor3.F
cforc3.F
bcs10.F

Loop Path Count Profile

Cumulated Speedup If No Scalar Integer

Cumulated Speedup If FP Vectorized

Cumulated Speedup If Fully Vectorized

Cumulated Speedup If FP Arithmetic Only

Experiment Summary

Application./../OpenRadioss/engine/cbuild_engine_linux64_intel_impi/engine_linux64_intel_impi
Timestamp2024-10-10 14:04:09 Universal Timestamp1728561849
Number of processes observed26 Number of threads observed52
Experiment TypeMPI; OpenMP;
Machineskylake
Model NameIntel(R) Xeon(R) Platinum 8170 CPU @ 2.10GHz
Architecturex86_64 Micro ArchitectureSKYLAKE
Cache Size36608 KB Number of Cores26
OS VersionLinux 6.10.10-arch1-1 #1 SMP PREEMPT_DYNAMIC Thu, 12 Sep 2024 17:21:02 +0000
Architecture used during static analysisx86_64 Micro Architecture used during static analysisSKYLAKE
Frequency Driverintel_cpufreq Frequency Governorperformance
Huge Pagesalways Hyperthreadingoff
Number of sockets2 Number of cores per socket26
Compilation Optionsengine_linux64_intel_impi: Intel(R) Fortran Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.13.1 Build 20240703_000000 -I/home/kcamus/POP/POP3/OpenRadioss/OpenRadioss/engine/../common_source/includes -I/home/kcamus/POP/POP3/OpenRadioss/OpenRadioss/engine/../common_source/modules -I/home/kcamus/POP/POP3/OpenRadioss/OpenRadioss/engine/share/includes -I/home/kcamus/POP/POP3/OpenRadioss/OpenRadioss/engine/share/r8 -I/home/kcamus/POP/POP3/OpenRadioss/OpenRadioss/engine/share/spe_inc -I/home/kcamus/POP/POP3/OpenRadioss/OpenRadioss/engine/cbuild_engine_linux64_intel_impi/CMakeFiles/includes_engine_linux64_intel_impi -I/opt/intel/oneapi/2024.2/include/ -module CMakeFiles/modules_engine_linux64_intel_impi -axSSE3,COMMON-AVX512 -no-fma -O3 -fp-model precise -fimf-use-svml=true -qopenmp -DMYREAL8 -DWITHOUT_LINALG -ftz -extend-source -assume buffered_io -align array64byte -DMPI -DCPP_mach=CPP_p4linux964 -DCPP_rel=00 -g -fno-omit-frame-pointer -c -o CMakeFiles/engine_linux64_intel_impi.dir/source/assembly/asspar4.F.o

Configuration Summary

Dataset
Run Command<executable> -i NEON1M11_0001.rad
MPI Commandmpirun -np 26
Number Processes1
Number Nodes1
FilterNot Used
Profile StartNot Used
×