This paper describes the parallel simulation of the memory/computing-intensive and large-scale three-dimensional acoustic wave equation with CPU stencil optimization. Taking the 8-core shared storage platform as an example, we obtain a one-time speed-up ratio of 6.7× compared with the serial program by using a coarse-grained OpenMP parallel scheme. Our method vectorizes the data on the template buffer with Single Instruction-Multiple Data techniques to further exploit the computing potential of the CPUs. We apply an 8-channel parallel vector to simulate seismic wave fields with the 256-bit AVX instruction set. This increases the computing bandwidth, thereby eliminating a significant volume of computing instructions and ultimately obtaining a secondary speed-up ratio of 3-7×. Finally, we analyze the factors affecting the secondary speed-up effect of AVX through complicated three-dimensional forward modeling experiments using the Salt model. The results indicate that the memory, cache, and register can better cooperate with each other when vectorization is conducted along the shortest direction of the model data cube, and that the speed-up effect can be enhanced by optimizing the AVX algorithm under such a principle.


Article metrics loading...

Loading full text...

Full text loading...


  1. Etgen, J.T. and O’Brien, M.J.
    [2007] Computational methods for large-scale 3D acoustic finite-difference modeling. Geophysics, 72, SM223–SM230.
    [Google Scholar]
  2. Francés, J., Bleda, S., Márquez, A., Neipp, C., Gallego, S., Otero, B. and Beléndez, A.
    [2014] Performance analysis of SSE and AVX instructions in multi-core CPUs and GPU computing on FDTD scheme for solid and fluid vibration problems. J Supercomput, 70, 514–526.
    [Google Scholar]
  3. Souza, P., Borges, L., Andreolli, C. and Thierry, P.
    [2015] OpenVec Portable SIMD Intrinsics. Second EAGE Workshop on High Performance Computing for Upstream13-16 September, Dubai, UAE.
    [Google Scholar]
  4. Zhou, M.H. and Symes, W.W.
    [2014] Wave Equation Based Stencil Optimizations on Multi-core CPU. SEG Technical Program Expanded Abstracts, 3551–3555.
    [Google Scholar]

Data & Media loading...

This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error