This work shows several optimization strategies evaluated and applied to an elastic wave propagation engine, based on a Fully Staggered Grid, running on the latest Intel Xeon Phi processors, the second generation of the product (code-named Knights Landing). Our fully optimized code shows a speed-up of about 4x when compared with the same algorithm optimized for the previous generation processor.


Article metrics loading...

Loading full text...

Full text loading...


  1. Caballero, D., Farres, A., Duran, A., Hanzich, M., Fernández, S. and Martorell, X.
    [2015] Optimizing Fully Anisotropic Elastic Propagation on Intel Xeon Phi Coprocessors. In: Second EAGE Workshop on High Performance Computing for Upstream.
    [Google Scholar]
  2. Davydycheva, S., Druskin, V. and Habashy, T.
    [2003] An efficient finite-difference scheme for electromagnetic logging in 3D anisotropic inhomogeneous media. Geophysics, 68(5), 1525–1536.
    [Google Scholar]
  3. Keiiti, A. and Richards, P.G.
    [2003] Quantitative Seismology. University Science Books.
    [Google Scholar]
  4. de la Puente, J., Ferrer, M., Hanzich, M., Castillo, J.E. and Cela, J.M.
    [2014] Mimetic seismic wave modeling including topography on deformed staggered grids. Geophysics, 79(3), T125–T141.
    [Google Scholar]
  5. Raman, K.
    [2016] Optimizing Memory Bandwidth in Knights Landing on Stream Triad. https://software.intel.com/en-us/articles/optimizing-memory-bandwidth-in-knights-landing-on-stream-triad.
    [Google Scholar]
  6. Yount, C.
    [2015] Vector Folding: Improving Stencil Performance via Multi-dimensional SIMD-vector Representation. In: 2015 IEEE 17th International Conference on High Performance Computing and Communications (HPCC), 2015 IEEE 7th International Symposium on Cyberspace Safety and Security (CSS), 2015 IEEE 12th International Conferen on Embedded Software and Systems (ICESS). 865–870.
    [Google Scholar]
  7. Yount, C., Tobin, J., Breuer, A. and Duran, A.
    [2016] YASK-Yet Another Stencil Kernel: a framework for HPC stencil code-generation and tuning. In: Proceedings of the 6th International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing held as part of ACM/IEEE Supercomputing 2016 (SC16), WOLFHPC’16.
    [Google Scholar]

Data & Media loading...

This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error