1887

Abstract

Summary

Traditional geostatistical methods face critical limitations in sparse-data environments, including frequent over-smoothing, challenging uncertainty quantification, and limited capability to integrate heterogeneous multi-source datasets for spatial property prediction. Conversely, pure machine learning approaches lack essential spatial coherence required for subsurface applications. This study presents a hybrid framework combining machine learning’s superior data integration capabilities with geostatistical spatial modeling.

The framework supports multiple machine learning algorithms with NGBoost utilized as the primary model due to its superior uncertainty quantification and predictive performance. The methodology effectively integrates diverse geological contexts including depositional environments (GDE), well log summaries, fault networks, paleoburial depth, and structural features, enabling complex pattern recognition beyond traditional geostatistical capabilities. Spatial cross-validation with automated bayesian optimization ensures model reliability, while Gaussian Random Field simulation preserves spatial coherence. Monte Carlo uncertainty propagation enables multi-property integration for reservoir quality assessment.

In a validation case study, the framework was applied to 1,679 wells across the Norwegian and UK Continental Shelves to estimate reservoir properties in potential carbon storage sites. The methodology achieved 35–80% RMSE reductions compared to geostatistical methods, with 84% MAE reduction for net-to-gross prediction and well-calibrated uncertainty estimates (71.2% coverage). These results illustrate its operational readiness for exploration and carbon storage applications.

Loading

Article metrics loading...

/content/papers/10.3997/2214-4609.202639036
2026-03-09
2026-02-07
Loading full text...

Full text loading...

References

  1. Chen, T. and Guestrin, C.2016. XGBoost: A scalable tree boosting system. Proc., 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, California, 13–17 August, 785–794.
    [Google Scholar]
  2. Chilès, J.P. and Delfiner, P.2012. Geostatistics: Modeling Spatial Uncertainty, second edition. John Wiley & Sons, Hoboken, New Jersey.
    [Google Scholar]
  3. Daly, C.2022. An application of random forests for spatial interpolation with quantified uncertainty. Mathematical Geosciences54(6): 979–1015.
    [Google Scholar]
  4. Deutsch, C.V. and Journel, A.G.1998. GSLIB: Geostatistical Software Library and User’s Guide, second edition. Oxford University Press, New York City, New York.
    [Google Scholar]
  5. Duan, T., Anand, A., Ding, D.Y., Thai, K.K., Basu, S., Ng, A., and Schuler, A.2020. NGBoost: Natural gradient boosting for probabilistic prediction. Proc., 37th International Conference on Machine Learning, Virtual Event, 13–18 July, 2690–2700.
    [Google Scholar]
  6. Goovaerts, P.1997. Geostatistics for Natural Resources Evaluation. Oxford University Press, New York City, New York.
    [Google Scholar]
  7. Journel, A.G. and Huijbregts, C.J.1978. Mining Geostatistics. Academic Press, London, United Kingdom.
    [Google Scholar]
  8. Meinshausen, N.2006. Quantile regression forests. Journal of Machine Learning Research7: 983–999.
    [Google Scholar]
  9. Pyrcz, M.J. and Deutsch, C.V.2014. Geostatistical Reservoir Modeling, second edition. Oxford University Press, New York City, New York.
    [Google Scholar]
/content/papers/10.3997/2214-4609.202639036
Loading
/content/papers/10.3997/2214-4609.202639036
Loading

Data & Media loading...

This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error