Fifth EAGE Digitalization Conference & Exhibition
- Conference date: March 24-26, 2025
- Location: Edinburgh, Scotland, United Kingdom
- Published: 24 March 2025
1 - 20 of 81 results
-
-
Automating the Unorganized: AI-Driven Document Classification for Scalable Data Management
More LessAuthors A. Edwards, M. Szupienko and S. ThomsonSummaryThe research presents an AI-driven solution for classifying unstructured data in the energy sector, where 80% of critical information is stored in unorganized formats. Using a dataset of 3,000 PDFs, an LLM-based workflow achieved up to 89% accuracy by transforming documents into semantic vectors for classification. The workflow presents a scalable framework streamlining data management, enabling efficient decision-making.
-
-
-
Unlocking Data Insights: A Data Lake Approach with Historical Drilling Data, Apache Spark and BI Tools
More LessAuthors F. RakhmangulovSummaryThis study explores the integration of data Lakehouse platforms, Apache Spark, and BI(Business Intelligence) tools to enhance the efficiency of historical drilling data analysis for well planning. Advanced downhole equipment generates terabytes of data at high frequencies, necessitating robust storage and processing solutions. By leveraging a corporate data lake and cloud-based PySpark, the study organizes and interconnects hundreds of well datasets, integrating them into a dedicated database for seamless visualization via BI tools.
Key analyses included Dogleg Severity (DLS) computations, machine learning (ML)-based bit performance evaluation using Logging While Drilling (LWD) data. Challenges such as data preprocessing, outlier removal, and code validation were addressed through iterative development and condition enhancements. The study highlights the automation of offset wells analysis, significantly reducing time and effort compared to manual approaches.
Novel contributions enabling Drilling Engineers to adopt roles of Data Engineers and Scientists. The findings underscore the potential of ML in automating analytical workflows and extracting actionable insights from extensive datasets, driving efficiency and innovation in drilling operations.
-
-
-
Chronostratigraphic Color Decomposition: A Fast Method for Identifying Sedimentary Features in Seismic Data
More LessAuthors R. Leblond and L. SoucheSummaryThe temporal chromatic decomposition methodology offers an alternative interpretation workflows. By utilizing RGT properties, this approach operates in two modes: horizon-based and horizon stack. The horizon mode enhances interpretation accuracy by normalizing input offsets and minimizing artifacts from signal intersections, preserving the geological morphology of structures. The horizon stack mode enables rapid visualization of broader stratigraphic frameworks, aiding in the identification of key sedimentary and structural features with real-time flexibility.
This method ensures geologically consistent results, with colors representing depth relative to the observed attribute, allowing interpreters to extract meaningful insights efficiently. Its dual functionality accelerates seismic interpretation workflows, making it particularly valuable for detailed stratigraphic analyses of large, complex datasets. The temporal chromatic decomposition provides geoscientists with a powerful tool for improving accuracy, efficiency, and the overall quality of seismic data interpretation.
-
-
-
Simulating Pulsed Electromagnetic Detection of Subsurface CO2 Leaks, with Application to Carbon Capture and Storage (CCS)
More LessAuthors G. Stove, K. Van den Doel and J. SharmaSummaryIn this paper we present findings from a study to explore the possibilities of using pulsed electromagnetic (EM) waves for CO2-storage monitoring purposes, to help mitigate the release of large quantities of CO2 in the atmosphere. The study focused on the geological setting of the Mississippi Basin, targeting the detection of CO2 leaks at depths ranging from 350 to 3000 m. The primary objective was to assess the effectiveness of surface-based monitoring technologies, such as pulsed EM detection, that eliminate the need for wellbore access. A sensitivity study was performed to determine the minimum CO2 concentration and leak size detectable using this technology by utilizing finite-difference and machine learning-based computational analysis.
The simulation was run for different D (diameter of CO2 cube) values. The CO2 scan is most sensitively detected in the data using our in-house anomaly detector. A neural network is trained to reproduce the time measured time series across many windows with minimal error. The simulations suggest CO2 leaks of the order of 2–4% volumetric concentration can be reliably detected from the surface at a depth of 350m. Simulation results indicate 3000m leak detection is difficult but theoretically possible for a concentration of 20% and up.
-
-
-
Learning from Unstructured Documents: Extracting Value Using Machine Learning and Generative Augmented Intelligence
More LessAuthors R. Blake, J. Kozman and L. PelegrinSummaryLearning from Unstructured Documents: Extracting Value Using Machine Learning and Generative Augmented Intelligence
AI/ML holds transformative potential for the energy industry, but its success depends on robust data preparation—particularly for unstructured data like reports, logs, and legacy documents. These datasets often lack metadata, have inconsistent formats, and require significant processing to unlock their value. By standardizing and enriching unstructured data, AI can uncover patterns and generate actionable insights that drive efficiency and sustainability.
Using a global multi-petabyte data store, we demonstrate how integrating structured and unstructured data enhances AI workflows. The FAIR principles—making data Findable, Accessible, Interoperable, and Reusable—are vital to project success. A “human-in-the-loop” approach ensures AI delivers reliable results while continuing to improve.
Applications such as natural language Q&A and fine-tuned generative AI models enable intuitive data discovery, offering measurable ROI. These innovations underscore the need for trustworthy datasets and industry-specific prompt engineering to achieve AI/ML success, ultimately aligning with the energy sector’s sustainability goals.
-
-
-
Maximizing Efficiency and Data Quality with a New Proprietary Platform for Discrete well Data Management
More LessAuthors P. Tempone, I. Casetto, C. Piras, F. Feneri, N. Lamonaca, A. Crottini, L. Orlando and C. OcchienaSummaryThis abstract presents a new proprietary solution adopted worldwide by Eni since 2024. The Platform is designed to address critical challenges in discrete real-time data management for rig operations, such as data unavailability, lack of standardization, and time-consuming processes. Utilizing a cloud-based platform, the proprietary solution for discrete data collection enables near real-time data upload and validation, significantly improving data operational efficiency and decision-making. By centralizing data management and monitoring, the new solution delivers measurable enhancements in data quality, streamlines operational workflows, and supports informed decision-making regarding data. This study underscores the Platform’s transformative role in tackling key operational challenges related to data management and driving digital innovation in rig operations.
The findings conclude that the solution accelerates the digitalization and automation of business processes, resulting in enhanced productivity and more effective data management. By simplifying data collection, validation, and distribution, the platform ensures the availability of high-quality data for analysis. Its real-time data management capabilities enable continuous monitoring and instant feedback, fostering adaptive strategies and operational excellence. These features empower faster, data-driven decision-making and more agile responses to data management related operational challenges, positioning the new platform as a pivotal tool for advancing efficiency and innovation.
-
-
-
Using Integrated Machine Learning Property Modeling for Delineating Optimum CO2 Storage Sites
More LessAuthors A. AhmadSummaryMachine Learning
CCS Storage
Carbon Storage & Injectivity
CCS Modelling
-
-
-
Enabling New Workflows for Geoscientists: Querying Relational Databases and Structured Data using Code Writing AI Agents
More LessAuthors T.B. Grant and R. LogadottirSummaryThis study explores the use of large language models (LLMs) in coding agents to enable geoscientists to converse with subsurface databases through natural language queries. By utilizing a ReAct (reasoning and action) framework, the agents can dynamically plan and execute SQL queries based on user input and adapt to errors or intermediate results. The study tests the performance of the agents on a complex SQL database of Petrel data and metadata from over 1200 projects. Initial challenges included the agents’ difficulties in understanding table relationships and correctly formulating queries, which were mitigated by providing database descriptions and adding specific ReAct loop strategies to help solve the task. The agents demonstrated improved accuracy, particularly in complex queries requiring joining data from multiple tables, while also reducing response time and resource costs. Results indicate that users can effectively interact with their data without needing SQL expertise, revealing the potential benefits of coding agents in enabling new subsurface workflows.
-
-
-
Digitalization of Reservoir Performance Benchmarking
More LessAuthors S. HemmingsSummaryBenchmarking reservoir performance predictions against known outcomes is integral to subsurface workflows and is an essential part of assurance. With the growth of computing power it has become straightforward to build complex reservoir models, and benchmarking provides a powerful way to calibrate predicted outcomes and ensure that they are grounded on actual performance.
Bp has therefore built an advanced reservoir benchmarking toolkit that seamlessly integrates static and dynamic data from internal and external data sources to enable powerful analysis and ML predictions. It is used to assure every major capital investment decision, including partners’ field development plans. It provides evidence to justify reserve bookings; informs Exploration and M&A activity and is also used to screen our portfolio for improved field recovery opportunities.
-
-
-
Enriched Clustering Methodology for the Automated Interpretation of Electrofacies from Wireline Data: Application to Offshore Australia
More LessAuthors F.S. Patacchini, A. Christ, A. Faraj, C. Cornet, N. Khvoenkova, L. Mattioni and A. BouziatSummaryClustering well-log data into electrofacies using machine learning is a well established problem in the geophysical and data-scientific communities and has become of major relevance in the last decade. Many of the publications in the field, however, have limitations pertaining to the size of the learning dataset, the choice of the right number of clusters, and the management of noisy output (particularly stemming from the underlying spatial autocorrelation of the electrofacies). In this work, we propose an automated workflow which responds to these shortcomings and apply it on a case study from offshore wells in Australia. The approach lies in three main steps, which are the pretreatment of data via principal-component analysis, the clustering of the data via a Gaussian mixture model and the optimization of the number of clusters using silhouette analysis, and the homogenization of the clustering results using the stochastic-relaxation algorithm. The approach shows promising results on the test case by being coherent with, albeit less nuanced than, the ground truth given by an expert’s classification.
-
-
-
From Subsurface Prediction Models to MLOps: A New Era for Geoscience Data
More LessAuthors K. Dehghan and A. FelthamSummaryThe abstract discusses integrating machine learning (ML) and MLOps workflows in geoscience for subsurface prediction models. Traditional methods often rely on limited datasets and static models, creating challenges in updating predictions when new data becomes available. The proposed MLOps approach ensures continuous integration and delivery of ML models, automating data ingestion, model training, validation, and updates. Using seismic data and attributes, the workflow achieves accurate predictions, as demonstrated with North Sea data for porosity modeling. This dynamic methodology enhances decision-making, resource management, and geoscience collaboration by continuously refining models with new data and interpretations.
-
-
-
Enhancing Geomechanical Event Classification in Oil Well Drilling with Large Language Models
More LessAuthors R. Lopes, R. Da Silva, T. Conceição, V. Carneiro, R. Dantas and P. CoutoSummaryThe occurrence of geomechanical events (e.g. drag, stall, kick) might hinder production and incur in losses. In this context, manual classification of daily drilling reports is a daunting task. In this work, we propose a method -grounded in the In-Context Learning paradigm - that leverages a commercial Large Language Model to classify daily drilling reports so that we enrich the prompt fed to the LLM to boost its performance. Experimental results attest the effectiveness of our approach comapred to other three variants. This work might help the oil industry in extracting valuable information from large amount of data in order to mitigate losses and to support data driven decision making.
-
-
-
Generative AI in Geosciences: Leveraging Large Language Models to Digitize Legacy Cuttings Descriptions
More LessAuthors M. Sadah, G. Chirila, B. Hungund and T. AlabbassiSummaryGeological cutting descriptions are essential data sources for the exploration of natural resources. These descriptions often detail vital attributes such as depths, lithology type and percentages, and physical characteristics such color, grain size, grain shape, hardness, sorting, cementation, porosity type and degree, and accessory minerals. Many of the reports are handwritten, with clear and legible to poor quality, which further complicates the digitization process. Moreover, the use of abbreviations and symbols in these handwritten records introduces another layer of complexity. We have overcome these complexities using sophisticated Optical Character Recognition (OCR) vision models (transformer-based further fine-tuned), although these models require additional effort to accurately interpret the reports. To mitigate the aforementioned challenges, a comprehensive Generative AI framework is developed, commencing with the acquisition of the optimal OCR output from our framework. Next, LLMs ranging from 13B to 70B open-source models are deployed on high-computational graphic servers. Although LLMs resolve many challenges, some issues related to percentages or lithology physical characteristics still require the judgement of professional geologists. Therefore, it is highly recommended that AI-driven automation is supervised with expert oversight to ensure high-quality data transformation.
-
-
-
Where Static Meets Dynamic – A Journey to Rapid, Integrate Data Visualization and Decision Making
More LessAuthors K. St Clair, C. Hill, P. Maraj, C. Smith and A. EdirisooriyaSummaryOil and gas companies invest heavily in acquiring surveillance datasets over producing fields. Quickly integrating and understanding these datasets is crucial for maximizing asset value and visualizing them in one place is an important step in this process. However, there are still significant challenges in co-visualizing live production and surveillance data with geological maps, seismic products and well data in 2D or 3D. Field teams often spend significant time manually creating ‘moment in time’ maps or models, which are quickly outdated. Subsurface practitioners need an easy-to-use application, accessible to all disciplines, which automates the visualization and integration of critical field static and dynamic data. This paper describes bp’s journey to creating a global, multidisciplinary web-based tool designed to help field teams collaborate more effectively and rapidly to make informed reservoir management decisions and to maximize the value of their surveillance data. It explores the initial requirements, challenges in developing and deploying the solution, and highlights the opportunities this tool can unlock for field teams.
-
-
-
Enhancing Satellite Imagery for Energy Applications with Deep Learning-Based Super-Resolution
More LessAuthors A. Yaner, M. Shultz, Y. Sihetii, W. Kong and K. DickeySummarySatellite Super-Resolution (SSR) technology, powered by deep learning, provides an affordable and scalable way to obtain high-resolution satellite imagery for the energy industry. By enhancing the resolution of open-source imagery, SSR empowers companies to optimize their operations, monitor assets efficiently, and make informed, data-driven decisions.
This abstract highlights SSR’s capabilities, including its energy industry applications in asset tracking, site evaluation, and environmental monitoring. ThinkOnward’s SSR model, designed for research and validation, combines Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to produce super-resolved images with 10 times more detail than the original input. This hybrid approach leverages the strengths of both architectures, with CNNs focusing on local pixel neighborhoods and ViTs capturing global relationships and long-range dependencies, surpassing models that rely solely on one.
The thorough preprocessing of training data, including image selection, multi-sample augmentation, and color-space augmentation, ensures optimal model performance. The result is a cost-efficient solution that provides high-quality, super-resolved imagery, enabling more accurate geospatial analyses and superior interpretations for subsequent machine learning models. This innovative SSR model supports the energy industry in making more informed decisions, proactively mitigating risks, and optimizing resource allocation.
-
-
-
Accelerating Unstructured and Semi-Structured Data Insights with Generative AI
More LessAuthors S. Gunturu, A. Gadad, Y. Gubanov and D. TishechkinSummaryUnstructured and semi-structured data like reports, logs, and other related records and documents are abundant in the oil and gas industry but challenging to analyze due to lack of standardization and complexity. Traditionally, custom data parsers had to be written to aggregate and extract insights from these data sources. However, generative AI (Gen AI) can now accelerate this process. Leveraging large language models (LLMs) and Gen AI techniques, it now possible to use Gen AI to help create data parsing templates by specifying sections of interest on the sample data files. Gen AI can then scale and apply these templates to a broader set of data and automatically adjust the template selection based on the file or document type. This approach streamlines and simplifies pipelines to aggregate diverse unstructured and semi-structured data sources into centralized enterprise data repositories, unlocking faster insights and data-driven decision making. As data volumes grow, Gen AI provides a powerful way to tame unstructured data complexity in the energy industry.
-
-
-
Identify Underperforming Wells using Geospatial Data and Generative AI
More LessAuthors Y. Gubanov, D. Tishechkin, T. Bartholomew Grant, D. Jacob and S. ThomasSummaryIdentifying underperforming wells is a crucial aspect of efficient oil and gas operations. Underperforming wells can significantly impact profitability, resource allocation, and operational decision-making. By promptly recognizing and addressing underperforming wells, companies can take proactive measures to mitigate losses and optimize production. However, pinpointing underperforming wells is a formidable task due to the complexity and diversity of data involved. Geographical information systems (GIS), geospatial analysis, and generative artificial intelligence (Gen AI) technologies offer a powerful combination for tackling these challenges. By integrating production data, well logs, and GIS data into a centralized platform, users can leverage natural language queries to interrogate the data and uncover the business-critical insights. The Gen AI tools, equipped with spatial reasoning capabilities, can analyze production data, well characteristics, and geographic factors to identify underperforming wells and provide explanations for their suboptimal performance. As Gen AI technology continues to evolve, its potential to revolutionize subsurface data management and analysis will only grow, providing valuable solutions to the challenges faced by the energy industry.
-
-
-
Automatic Characterization and Quantification of Sedimentary Components in Reef Cores using Deep Learning for Image Segmentation
More LessAuthors S. Bussod, Y. Hamon, D. Guillaume, J. Peyrelon-Braud and A. BouziatSummaryThis study presents an innovative deep learning-based workflow for the semantic segmentation of sedimentary components in core images. The approach focuses on identifying and quantifying major components of reef systems (corals, coralline algae, microbialites, bioclastic sands...), from cores collected during IODP Expedition 389 – Hawaiian Drowned Reefs.
High-resolution images of the cores were obtained with a linescan camera, resulting in continuous images. A U-Net architecture, widely recognized for its efficiency in image segmentation, was employed using a weighted Dice-Sorensen coefficient to address class imbalances. The model achieved a segmentation accuracy of 81.1% after post-processing with Conditional Random Fields (CRF), improving the quality of segmentation results. Secondly, the model also quantifies the relative proportions of sedimentary components along core depths, aiming at facilitating the interpretation of paleoenvironmental variations recorded by reef systems.
This workflow provides a robust, automated solution for core analysis, reducing the time and expertise required. It holds significant potential for industrial applications like reservoir characterization and societal studies, such as paleoenvironmental reconstructions. Perspectives includes technical challenges (color normalization, benchmark the loss weight...) and methodological improvements (model refinement for unusual or complex sedimentary patterns, integration of complementary datasets to develop a multiple data source approach.
-
-
-
Enhanced Imbibition Saturation Height Function for Schiehallion Field Using Digital Rocks Capillary Pressure Simulations
More LessAuthors H. Collini, Y. Ning, A. Ronald, S. Salunke and G. GettemySummaryThe Schiehallion reservoir has historically faced challenges in accurately modelling the saturation height function (SHF), particularly in capturing the sharpness of the transition zone. Traditional SHF models, derived from core and log methods, have consistently under-predicted hydrocarbon volumes near the transition zone, or lacked geological character. This study leverages digital rock technology to address these limitations by developing a new SHF model based on imbibition capillary pressure data. The digital rocks team utilized advanced simulations to generate capillary pressure data, offering a cost-effective and rapid alternative to conventional SCAL studies. The new SHF model, validated against logged water saturation interpretations, preserves the transition zone’s sharpness and better represents rock quality variations. This approach not only enhances the accuracy of hydrocarbon volume predictions but also demonstrates the potential of digital rocks technology in reservoir characterization and modelling.
-
-
-
The Critical Role of UX in the Digital Transformation of the Energy Industry
More LessAuthors C. Nguyen, J. Burke, G. Martin, S. Namasivayam, R. Bloor, L. Bailleul and J. BrittanSummaryThe energy industry is in the process of undergoing a profound digital transformation. This transformation is using advanced technologies to enhance efficiency, safety, and sustainability. User Experience (UX) design plays a pivotal role in this evolution, ensuring that sophisticated tools and data are not only accessible but also useable for professionals in diverse operational contexts. This paper explores the strategic importance of UX in enabling the digital transformation of the energy industry. We discuss examples where UX-driven solutions have empowered decision-making, leading to measurable improvements in performance and quality. By considering the role of the UX in the digital strategy, we will show how we aim to bridge the gap between the technology and the people who rely on it.
-