
Full text loading...
This paper describes two methods to improve the performance of a FDTD solver for the first order formulation of the 3D acoustic wave equation. Based on spatial and temporal cache blocking techniques, these methods enable to maximize bandwidth of the memory subsystem, while reducing data traffic in-between the memory hierarchy. On the one hand, the spatial blocking (SB) approach increases data reuse among cores within each iteration of the time integration. On the other hand, the multicore wavefront diamond temporal blocking (MWD-TB) technique further leverages the SB performance by intrinsically reusing freshly cached data solutions across iterations of the time integration. While SB achieves sixfold performance speedup against the naive implementation (without cache blocking), MWD-TB outperforms SB by up to 50\% on a two-socket 16-core Intel Haswell system.