Abstract

Summary

The production of reliable three-dimensional images of the subsurface remains a major challenge in the oil and gas industry and relies strongly on the efficient exploitation of supercomputers. However, as each vendor works on next-generation technologies, the range of architectures that may become available raises growing concerns about real application performance. Whatever the design of these systems (heterogeneity, high core counts, or deep memory hierarchies), it is widely accepted that co-design approaches will play a major role in ensuring that oil and gas applications are in the best position to adopt the next breakthroughs. In this paper, after reviewing recent contributions to the optimization of geophysical stencils, we discuss key features of Arm hardware that may influence standard implementations.

/content/papers/10.3997/2214-4609.201903279
2019-10-07
2024-03-28

References

  1. Abdelkhalek, R., Calandra, H., Coulaud, O., Latu, G. and Roman, J.
    [2012] Fast seismic modeling and reverse time migration on a graphics processing unit cluster. Concurrency and Computation: Practice and Experience, 24(7), 739–750.
  2. Andreolli, C., Thierry, P., Borges, L., Skinner, G. and Yount, C.
    [2015] Chapter 23 - Characterization and Optimization Methodology Applied to Stencil Computations. In: Reinders, J. and Jeffers, J. (Eds.) High Performance Parallelism Pearls, Morgan Kaufmann, Boston, 377–396.
  3. Armejach, A., Caminal, H., Cebrian, J.M., Langarita, R., González-Alberquilla, R., Adeniyi-Jones, C., Valero, M., Casas, M. and Moretó, M.
    [2019] Using Arm’s scalable vector extension on stencil codes. The Journal of Supercomputing.
  4. Boillot, L., Bosilca, G., Agullo, E. and Calandra, H.
    [2014] Task-Based Programming for Seismic Imaging: Preliminary Results. In: 2014 IEEE International Conference on High Performance Computing and Communications, HPCC/CSS/ICESS 2014, Paris, France, August 20–22, 2014. 1259–1266.
  5. Christen, M., Schenk, O. and Cui, Y.
    [2012] Patus for convenient high-performance stencils: evaluation in earthquake simulations. In: SC Conference on High Performance Computing Networking, Storage and Analysis, SC ’12, Salt Lake City, UT, USA - November 11–15, 2012. 11.
  6. Datta, K., Kamil, S., Williams, S., Oliker, L., Shalf, J. and Yelick, K.
    [2009] Optimization and Performance Modeling of Stencil Computations on Modern Microprocessors. SIAM Review, 51(1), 129–159.
  7. Dupros, F., Boulahya, F., Aochi, H. and Thierry, P.
    [2015] Communication-Avoiding Seismic Numerical Kernels on Multicore Processors. In: International Conference on High Performance Computing and Communications (HPCC). 330–335.
  8. Göddeke, D., Komatitsch, D., Geveler, M., Ribbrock, D., Rajovic, N., Puzovic, N. and Ramírez, A.
    [2013] Energy efficiency vs. performance of the numerical solution of PDEs: An application study on a low-power ARM-based cluster. J. Comput. Physics, 237, 132–150.
  9. Kukreja, N., Louboutin, M., Vieira, F., Luporini, F., Lange, M. and Gorman, G.
    [2016] Devito: Automated Fast Finite Difference Computation. In: Sixth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing, WOLFHPC@SC 2016, Salt Lake, UT, USA, November 14, 2016. 11–19.
  10. Malas, T.M., Hager, G., Ltaief, H., Stengel, H., Wellein, G. and Keyes, D.E.
    [2015] Multicore-Optimized Wavefront Diamond Blocking for Optimizing Stencil Updates. SIAM J. Scientific Computing, 37(4).
  11. Micikevicius, P.
    [2009] 3D finite-difference computation on GPUs using CUDA. In: Workshop on General Purpose Processing on Graphics Processing Units. ACM, Washington, USA, 79–84.
  12. Moustafa, S., Kirschenmann, W., Dupros, F. and Aochi, H.
    [2018] Task-Based Programming on Emerging Parallel Architectures for Finite-Differences Seismic Numerical Kernel. In: Euro-Par 2018: Parallel Processing - 24th International Conference on Parallel and Distributed Computing, Turin, Italy, August 27–31, 2018, Proceedings. 764–777.
  13. Qawasmeh, A., Hugues, M.R., Calandra, H. and Chapman, B.M.
    [2017] Performance portability in reverse time migration and seismic modelling via OpenACC. IJHPCA, 31(5), 422–440.
  14. Roten, D., Cui, Y., Olsen, K.B., Day, S.M., Withers, K., Savran, W.H., Wang, P. and Mu, D.
    [2016] High-frequency nonlinear earthquake simulations on petascale heterogeneous supercomputers. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2016, Salt Lake City, UT, USA, November 13–18, 2016. 957–968.
  15. Said, I., Fortin, P., Lamotte, J.L. and Calandra, H.
    [2018] Leveraging the accelerated processing units for seismic imaging: A performance and power efficiency comparison against CPUs and GPUs. IJHPCA, 32(6).
  16. Tsuboi, S., Ando, K., Miyoshi, T., Peter, D., Komatitsch, D. and Tromp, J.
    [2016] A 1.8 trillion degrees-of-freedom, 1.24 petaflops global seismic wave simulation on the K computer. IJHPCA, 30(4), 411–422.
  17. Yount, C., Tobin, J., Breuer, A. and Duran, A.
    [2016] YASK - Yet Another Stencil Kernel: A Framework for HPC Stencil Code-Generation and Tuning. In: Sixth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing, WOLFHPC@SC 2016, Salt Lake, UT, USA, November 14, 2016. 30–39.