OV - Compare Summary

▼Stylizer

Neoverse V2 GCC O2	Neoverse V2 GCC O3	Neoverse V2 GCC Ofast	Neoverse V2 ACFL O2	Neoverse V2 ACFL O3	Neoverse V2 ACFL Ofast
[ 2 / 3 ] Security settings from the host restrict profiling. Some metrics will be missing or incomplete. Current value for kernel.perf_event_paranoid is 2. If possible, set it to 1 or check with your system administrator which flag can be used to achieve this.	[ 2 / 3 ] Security settings from the host restrict profiling. Some metrics will be missing or incomplete. Current value for kernel.perf_event_paranoid is 2. If possible, set it to 1 or check with your system administrator which flag can be used to achieve this.	[ 2 / 3 ] Security settings from the host restrict profiling. Some metrics will be missing or incomplete. Current value for kernel.perf_event_paranoid is 2. If possible, set it to 1 or check with your system administrator which flag can be used to achieve this.	[ 2 / 3 ] Security settings from the host restrict profiling. Some metrics will be missing or incomplete. Current value for kernel.perf_event_paranoid is 2. If possible, set it to 1 or check with your system administrator which flag can be used to achieve this.	[ 2 / 3 ] Security settings from the host restrict profiling. Some metrics will be missing or incomplete. Current value for kernel.perf_event_paranoid is 2. If possible, set it to 1 or check with your system administrator which flag can be used to achieve this.	[ 2 / 3 ] Security settings from the host restrict profiling. Some metrics will be missing or incomplete. Current value for kernel.perf_event_paranoid is 2. If possible, set it to 1 or check with your system administrator which flag can be used to achieve this.
[ 0 / 0 ] Fastmath not used Consider to add ffast-math to compilation flags (or replace -O3 with -Ofast) to unlock potential extra speedup by relaxing floating-point computation consistency. Warning: floating-point accuracy may be reduced and the compliance to IEEE/ISO rules/specifications for math functions will be relaxed, typically 'errno' will no longer be set after calling some math functions.	[ 0 / 0 ] Fastmath not used Consider to add ffast-math to compilation flags (or replace -O3 with -Ofast) to unlock potential extra speedup by relaxing floating-point computation consistency. Warning: floating-point accuracy may be reduced and the compliance to IEEE/ISO rules/specifications for math functions will be relaxed, typically 'errno' will no longer be set after calling some math functions.	Not available for this run	Not available for this run	Not available for this run	Not available for this run
[ 0 / 3 ] Compilation of some functions is not optimized for the target processor Application run on the ARM_NEOVERSE_V2 micro-architecture while the code was specialized for armv8.5-a+crypto+rng+sve2-aes+sve2-sha3+sve2-bitperm+i8mm+bf16+nossbs+nopredres. Architecture specific options are needed to produce efficient code for a specific processor ( -mcpu=native ).	[ 0 / 3 ] Compilation of some functions is not optimized for the target processor Application run on the ARM_NEOVERSE_V2 micro-architecture while the code was specialized for armv8.5-a+crypto+rng+sve2-aes+sve2-sha3+sve2-bitperm+i8mm+bf16+nossbs+nopredres. Architecture specific options are needed to produce efficient code for a specific processor ( -mcpu=native ).	[ 0 / 3 ] Compilation of some functions is not optimized for the target processor Application run on the ARM_NEOVERSE_V2 micro-architecture while the code was specialized for armv8.5-a+crypto+rng+sve2-aes+sve2-sha3+sve2-bitperm+i8mm+bf16+nossbs+nopredres. Architecture specific options are needed to produce efficient code for a specific processor ( -mcpu=native ).	[ 3.00 / 3 ] Architecture specific option -mcpu is used	[ 3.00 / 3 ] Architecture specific option -mcpu is used	[ 3.00 / 3 ] Architecture specific option -mcpu is used
[ 2.40 / 3 ] Most of time spent in analyzed modules comes from functions without compilation information Functions without compilation information (typically not compiled with -g) cumulate 0.06% of the time spent in analyzed modules. Check that -g is present. Remark: if -g is indeed used, this can also be due to some compiler built-in functions (typically math) or statically linked libraries. This warning can be ignored in that case.	[ 2.40 / 3 ] Most of time spent in analyzed modules comes from functions without compilation information Functions without compilation information (typically not compiled with -g) cumulate 0.07% of the time spent in analyzed modules. Check that -g is present. Remark: if -g is indeed used, this can also be due to some compiler built-in functions (typically math) or statically linked libraries. This warning can be ignored in that case.	[ 2.40 / 3 ] Most of time spent in analyzed modules comes from functions without compilation information Functions without compilation information (typically not compiled with -g) cumulate 0.06% of the time spent in analyzed modules. Check that -g is present. Remark: if -g is indeed used, this can also be due to some compiler built-in functions (typically math) or statically linked libraries. This warning can be ignored in that case.	[ 2.40 / 3 ] Most of time spent in analyzed modules comes from functions without compilation information Functions without compilation information (typically not compiled with -g and -grecord-gcc-switches) cumulate 0.07% of the time spent in analyzed modules. Check that -g and (-grecord-gcc-switches or -frecord-command-line) are present. Remark: if -g and (-grecord-gcc-switches / -frecord-command-line) are indeed used, this can also be due to some compiler built-in functions (typically math) or statically linked libraries. This warning can be ignored in that case.	[ 2.40 / 3 ] Most of time spent in analyzed modules comes from functions without compilation information Functions without compilation information (typically not compiled with -g and -grecord-gcc-switches) cumulate 0.08% of the time spent in analyzed modules. Check that -g and (-grecord-gcc-switches or -frecord-command-line) are present. Remark: if -g and (-grecord-gcc-switches / -frecord-command-line) are indeed used, this can also be due to some compiler built-in functions (typically math) or statically linked libraries. This warning can be ignored in that case.	[ 2.40 / 3 ] Most of time spent in analyzed modules comes from functions without compilation information Functions without compilation information (typically not compiled with -g and -grecord-gcc-switches) cumulate 0.10% of the time spent in analyzed modules. Check that -g and (-grecord-gcc-switches or -frecord-command-line) are present. Remark: if -g and (-grecord-gcc-switches / -frecord-command-line) are indeed used, this can also be due to some compiler built-in functions (typically math) or statically linked libraries. This warning can be ignored in that case.
[ 4 / 4 ] Application profile is long enough (14.27 s) To have good quality measurements, it is advised that the application profiling time is greater than 10 seconds.	[ 4 / 4 ] Application profile is long enough (13.84 s) To have good quality measurements, it is advised that the application profiling time is greater than 10 seconds.	[ 4 / 4 ] Application profile is long enough (12.82 s) To have good quality measurements, it is advised that the application profiling time is greater than 10 seconds.	[ 4 / 4 ] Application profile is long enough (11.93 s) To have good quality measurements, it is advised that the application profiling time is greater than 10 seconds.	[ 4 / 4 ] Application profile is long enough (11.90 s) To have good quality measurements, it is advised that the application profiling time is greater than 10 seconds.	[ 4 / 4 ] Application profile is long enough (11.77 s) To have good quality measurements, it is advised that the application profiling time is greater than 10 seconds.
[ 2 / 2 ] Application is correctly profiled ("Others" category represents 0.00 % of the execution time) To have a representative profiling, it is advised that the category "Others" represents less than 20% of the execution time in order to analyze as much as possible of the user code	[ 2 / 2 ] Application is correctly profiled ("Others" category represents 0.00 % of the execution time) To have a representative profiling, it is advised that the category "Others" represents less than 20% of the execution time in order to analyze as much as possible of the user code	[ 2 / 2 ] Application is correctly profiled ("Others" category represents 0.00 % of the execution time) To have a representative profiling, it is advised that the category "Others" represents less than 20% of the execution time in order to analyze as much as possible of the user code	[ 2 / 2 ] Application is correctly profiled ("Others" category represents 0.00 % of the execution time) To have a representative profiling, it is advised that the category "Others" represents less than 20% of the execution time in order to analyze as much as possible of the user code	[ 2 / 2 ] Application is correctly profiled ("Others" category represents 0.00 % of the execution time) To have a representative profiling, it is advised that the category "Others" represents less than 20% of the execution time in order to analyze as much as possible of the user code	[ 2 / 2 ] Application is correctly profiled ("Others" category represents 0.00 % of the execution time) To have a representative profiling, it is advised that the category "Others" represents less than 20% of the execution time in order to analyze as much as possible of the user code
[ 3 / 3 ] Optimization level option is correctly used	[ 3 / 3 ] Optimization level option is correctly used	[ 3 / 3 ] Optimization level option is correctly used	[ 3 / 3 ] Optimization level option is correctly used	[ 3 / 3 ] Optimization level option is correctly used	[ 3 / 3 ] Optimization level option is correctly used
[ 1 / 1 ] Lstopo present. The Topology lstopo report will be generated.	[ 1 / 1 ] Lstopo present. The Topology lstopo report will be generated.	[ 1 / 1 ] Lstopo present. The Topology lstopo report will be generated.	[ 1 / 1 ] Lstopo present. The Topology lstopo report will be generated.	[ 1 / 1 ] Lstopo present. The Topology lstopo report will be generated.	[ 1 / 1 ] Lstopo present. The Topology lstopo report will be generated.

Neoverse V2 GCC O2

Neoverse V2 GCC O3

Neoverse V2 GCC Ofast

Neoverse V2 ACFL O2

Neoverse V2 ACFL O3

Neoverse V2 ACFL Ofast

[ 2 / 3 ] Security settings from the host restrict profiling. Some metrics will be missing or incomplete.

Current value for kernel.perf_event_paranoid is 2. If possible, set it to 1 or check with your system administrator which flag can be used to achieve this.