1887

Abstract

Summary

Automated Distributed-memory Parallelism has been added to Devito, a rapidly evolving framework adopted by a dynamic, heterogeneous and fast-growing community. The key innovations are the abstractions provided to the user and the compiler- based implementation approach, which we consider invaluable for long-term sustainable software to replace (partly or fully) obsolete, impenetrable, hardly extendable and often inefficient legacy code. The auto-tuner, which determines, among the other things, the best block shape for each tiled loop nest in an Operator, has already been tweaked to support DMP. Single-node multi-socket (one MPI process per socket) as well as Multi-node experiments, both weak and strong scaling, are planned for the near future.

Loading

Article metrics loading...

/content/papers/10.3997/2214-4609.201903282
2019-10-07
2026-04-21
Loading full text...

Full text loading...

References

  1. Devito v3.5
    Devito v3.5 [2019] The MPI tutorial. https://github.com/opesci/devito/blob/v3.5/examples/userapi/mpi.ipynb.
    [Google Scholar]
  2. Louboutin, M., Lange, M., Luporini, F., Kukreja, N., Witte, P.A., Herrmann, F.J., Velesko, P. and Gorman, G.J.
    [2019] Devito (v3.1.0): an embedded domain-specific language for finite differences and geophysical exploration. Geoscientific Model Development, 12(3), 1165–1187.
    [Google Scholar]
  3. Luporini, F., Lange, M., Louboutin, M., Kukreja, N., Hückelheim, J., Yount, C., Witte, P., Kelly, P.H.J., Gorman, G.J. and Herrmann, F.J.
    [2018] Architecture and performance of Devito, a system for automated stencil computation. CoRR, abs/1807.03032.
    [Google Scholar]
/content/papers/10.3997/2214-4609.201903282
Loading
/content/papers/10.3997/2214-4609.201903282
Loading

Data & Media loading...

This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error