1887

Abstract

Summary

This study presents a **Graph-based Retrieval-Augmented Generation (Graph RAG)** framework for intelligent retrieval of information from legacy **geoscience and drilling reports**, which are often unstructured or scanned. Using the **1995 Statoil geological summary report** as a case study, text was extracted via **GPT-4o OCR**, cleaned, and divided into page-wise chunks. Each chunk was represented as a **node** in a knowledge graph built using **LightRAG**, with **edges** capturing semantic and conceptual relationships between related geological entities such as formations, lithology, and drilling operations. Unlike conventional RAG systems that rely solely on vector similarity, Graph RAG performs **graph-based retrieval and traversal**, enabling the system to capture contextual and relational dependencies across document sections. Experimental results show that Graph RAG achieves **higher retrieval accuracy and completeness**, successfully extracting formation-level information that naïve RAG models missed. The approach demonstrates the value of **relationship-aware retrieval** in geoscience applications, offering a scalable framework for interpreting and querying legacy petroleum reports. Future work will focus on integrating **geological ontologies** and expanding the pipeline to multi-document collections for enhanced exploration data management.

Loading

Article metrics loading...

/content/papers/10.3997/2214-4609.202639017
2026-03-09
2026-02-11
Loading full text...

Full text loading...

References

  1. Z.Guo, L.Xia, Y.Yu, T.Ao, and C.Huang, “LightRAG: Simple and Fast Retrieval-Augmented Generation,” arXiv preprint arXiv:2410.05779, Oct. 2024. [Online]. Available: https://arxiv.org/abs/2410.05779
    [Google Scholar]
  2. J.Wang et al., “RAG vs. GraphRAG: A Systematic Evaluation and Key Insights,” arXiv preprint arXiv:2502.11371, Feb. 2025. [Online]. Available: https://arxiv.org/abs/2502.11371
    [Google Scholar]
  3. Y.Gao et al., “Retrieval-Augmented Generation for Large Language Models: A Survey,” arXiv preprint arXiv:2312.10997, Mar. 2024. [Online]. Available: https://arxiv.org/abs/2312.10997
    [Google Scholar]
/content/papers/10.3997/2214-4609.202639017
Loading
/content/papers/10.3997/2214-4609.202639017
Loading

Data & Media loading...

This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error