Porting implicit unstructured mesh code onto Intel Xeon Phi “Knights Corner”.
Employing aggressive optimizations to improve the CFD flux kernel on Phi hardware.
Exploring different thread affinity modes with different programming paradigms.
Achieving 3.8x speedup using the offload mode compare to the baseline.
Achieving 5x speedup using the native mode compare to the baseline.