1887

Abstract

Summary

Learning from Unstructured Documents: Extracting Value Using Machine Learning and Generative Augmented Intelligence

AI/ML holds transformative potential for the energy industry, but its success depends on robust data preparation—particularly for unstructured data like reports, logs, and legacy documents. These datasets often lack metadata, have inconsistent formats, and require significant processing to unlock their value. By standardizing and enriching unstructured data, AI can uncover patterns and generate actionable insights that drive efficiency and sustainability.

Using a global multi-petabyte data store, we demonstrate how integrating structured and unstructured data enhances AI workflows. The FAIR principles—making data Findable, Accessible, Interoperable, and Reusable—are vital to project success. A “human-in-the-loop” approach ensures AI delivers reliable results while continuing to improve.

Applications such as natural language Q&A and fine-tuned generative AI models enable intuitive data discovery, offering measurable ROI. These innovations underscore the need for trustworthy datasets and industry-specific prompt engineering to achieve AI/ML success, ultimately aligning with the energy sector’s sustainability goals.

Loading

Article metrics loading...

/content/papers/10.3997/2214-4609.202539010
2025-03-24
2026-02-09
Loading full text...

Full text loading...

References

  1. Gallant, S., Patel, N. and Zaheri, S. [2023]. Artificial Intelligence and Machine Learning in Sustainable Energy. 18th International Congress of the Brazilian Geophysical Society, Expanded Abstracts.
    [Google Scholar]
  2. Wilkinson, M.D., Dumontier, M., Aalbersberg, I.J., et al. [2016]. The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data, 3, 160018.
    [Google Scholar]
/content/papers/10.3997/2214-4609.202539010
Loading
/content/papers/10.3997/2214-4609.202539010
Loading

Data & Media loading...

This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error