1887

Abstract

Summary

Full waveform inversion is a computationally intensive task typically performed on high-performance computing systems. Traditional aggregated workflows are rigid, susceptible to failures, and lead to underutilization of HPC infrastructure. Moreover, they are not able to exploit heterogeneous hardware simultaneously, such as systems with CPUs and GPUs.

SEIS_ORC addresses these challenges by implementing a disaggregated approach to FWI, breaking the workflow into smaller, more manageable jobs. Acting as a layer above the FWI code, the orchestra- tor pilots job submissions, handles partial gradient computations, and ensures fault tolerance through automatic resubmissions. Written in Python, the tool is able to run on frontend or backend nodes of supercomputer environments.

This approach improves fault tolerance, optimizes resource utilization, and reduces wait times. However, it increases disk input/output due to the storage and retrieval of partial gradients. Performance tests on multiple supercomputers demonstrate that granularity settings affect both overhead and computational resource usage. Small granularities can potentially reduce wait times and computational resource consumption.

Loading

Article metrics loading...

/content/papers/10.3997/2214-4609.202510915
2025-06-02
2026-02-06
Loading full text...

Full text loading...

References

  1. Cao, J., Brossier, R., Gorszczyk, A., Metivier, L. and Virieux, J. [2022] 3D multi-parameter fullwaveform inversion for ocean-bottom seismic data using an efficient fluid-solid coupled spectral-element solver. Geophysical Journal International, 229(1), 671–703.
    [Google Scholar]
  2. Herault, T. and Robert, Y. [2015] Fault-Tolerance Techniques for High-Performance Computing.
    [Google Scholar]
  3. Hoffmann, A., Brossier, R., Metivier, L. and Tarayoun, A. [2024] Local uncertainty quantification for 3-D time-domain full-waveform inversion with ensemble Kalman filters: application to a North Sea OBC data set. Geophysical Journal International, 237(3), 1353–1383.
    [Google Scholar]
  4. Modrak, R.T., Borisov, D., Lefebvre, M. and Tromp, J. [2018] SeisFlows—Flexible waveform inversion software. Computers & Geosciences, 115, 88–95.
    [Google Scholar]
  5. Reed, D.A., Da Lu, C. and Mendes, C.L. [2006] Reliability challenges in large systems. Future Generation Computer Systems, 22(3), 293–302.
    [Google Scholar]
  6. Rodrigo, G.P., Östberg, P.O., Elmroth, E., Antypas, K., Gerber, R. and Ramakrishnan, L. [2018] Towards understanding HPC users and systems: A NERSC case study. Journal of Parallel and Distributed Computing, 111, 206–221.
    [Google Scholar]
  7. Virieux, J., Asnaashari, A., Brossier, R., Metivier, L., Ribodetti, A. and Zhou, W. [2017] 6. An introduction to full waveform inversion. R1-1–R1-40.
    [Google Scholar]
  8. Yang, P., Brossier, R., Metivier, L., Virieux, J. and Zhou, W. [2018] A Time-Domain Preconditioned Truncated Newton Approach to Visco-acoustic Multiparameter Full Waveform Inversion. SIAM Journal on Scientific Computing, 40(4), B1101–B1130.
    [Google Scholar]
/content/papers/10.3997/2214-4609.202510915
Loading
/content/papers/10.3997/2214-4609.202510915
Loading

Data & Media loading...

This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error