Hi
I have a binary built with Intel C 19.0.1 that is a heavy user of MKL.
When I do
export MKL_ENABLE_INSTRUCTIONS=AVX512
the run time goes from 88 s to 55 s for one problem instance.
That is a HUGE saving.
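For anyone who wants to reproduce the comparison, here is a minimal sketch of how the two runs can be timed without leaving the variable exported in the shell. The helper name run_with_isa and the binary ./my_solver are placeholders I made up for illustration; only MKL_ENABLE_INSTRUCTIONS and the value AVX512 come from my actual setup.

```shell
# Run a command with MKL_ENABLE_INSTRUCTIONS set for that command only,
# so the setting does not leak into later runs in the same shell.
# run_with_isa is a hypothetical helper name, not part of MKL.
run_with_isa() {
    isa_value="$1"   # e.g. AVX512
    shift            # remaining arguments are the command to run
    env MKL_ENABLE_INSTRUCTIONS="$isa_value" "$@"
}

# Example comparison (./my_solver stands in for the real binary):
#   time ./my_solver input.dat                      # MKL picks the code path itself
#   time run_with_isa AVX512 ./my_solver input.dat  # AVX-512 requested explicitly
```

As far as I know, setting MKL_VERBOSE=1 for a run also makes MKL print a version line that names the instruction set it is using, which may help confirm which code path was taken.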
How come MKL does not figure that out by itself? What is the philosophy?
Also, it seems that if I instead call
mkl_enable_instructions(MKL_ENABLE_AVX512)
then it has ZERO effect. Why?
Erling