Abstract

Summary

Advancements in multimodal document mining are transforming geoscientific information extraction from heterogeneous sources such as text, tables, and images. This work investigates Retrieval-Augmented Generation (RAG) and GraphRAG approaches, combined with MinerU, a preprocessing tool that preserves document hierarchy and reduces noise, to enhance retrieval and synthesis of geoscientific knowledge.

The methods were evaluated on the Norway Relinquishment Licenses dataset, comprising 781 diverse documents. Five configurations of RAG and GraphRAG were benchmarked on 67 technical questions using metrics of completeness, correctness, verbosity, and response time. Results show that RAG with a 5000-token chunk size achieved the best combined score (0.498), outperforming smaller RAG chunks and GraphRAG Global Search, while GraphRAG Local Search performed better than its global variant. Limitations remain in handling complex tables and image-rich content, highlighting the importance of preprocessing quality.
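
To make the chunk-size parameter concrete, the following is a minimal sketch of fixed-size chunking, the preprocessing step that a "5000-token chunk size" refers to. It is not the authors' implementation: the function names `chunk_tokens` and `chunk_text`, the `overlap` parameter, and the use of whitespace-separated words as a stand-in for model tokens are all assumptions made for illustration.

```python
from typing import List


def chunk_tokens(tokens: List[str], chunk_size: int, overlap: int = 0) -> List[List[str]]:
    """Split a token list into consecutive chunks of at most chunk_size tokens.

    Hypothetical helper: consecutive chunks share `overlap` tokens.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")
    step = chunk_size - overlap
    chunks: List[List[str]] = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + chunk_size])
        # Stop once the current chunk reaches the end of the input.
        if start + chunk_size >= len(tokens):
            break
    return chunks


def chunk_text(text: str, chunk_size: int = 5000, overlap: int = 0) -> List[str]:
    """Chunk raw text into strings of at most chunk_size tokens each."""
    # Assumption: whitespace-separated words approximate model tokens.
    # A real pipeline would use the tokenizer of its embedding model.
    tokens = text.split()
    return [" ".join(chunk) for chunk in chunk_tokens(tokens, chunk_size, overlap)]
```

Larger chunks keep more surrounding context in each retrieved passage at the cost of fewer, coarser retrieval units, which is the trade-off the chunk-size comparison above explores.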

These findings underline the potential of RAG and GraphRAG as assistive tools for geoscientific analysis, while human supervision remains necessary for critical data. Future work will focus on enhancing GraphRAG architectures, extending multimodal reasoning capabilities, and automating workflows to enable more robust, interpretable, and autonomous document mining in geosciences.

/content/papers/10.3997/2214-4609.202639026
2026-03-09
2026-02-13
References

  1. Edge, D., Ding, J., Yang, F. et al. (2024). From Local to Global: A Graph RAG Approach to Query-Focused Summarization. arXiv:2404.16130.
  2. Lewis, P., Perez, E., Piktus, A. et al. (2020). Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. Advances in Neural Information Processing Systems, 33, 9459–9474.
  3. Mosser, L., Aursand, P., Brakstad, K. S., Lehre, C., & Myhre-Bakkevig, J. (2024). Exploration Robot Chat: Uncovering Decades of Exploration Knowledge and Data with Conversational Large Language Models. In SPE Norway Subsurface Conference. https://doi.org/10.2118/218439-MS
  4. Tibari, I., Giboreau, R., Crabie, T. et al. (2025). Enhancing Geoscience Document Mining with Large Language Models through GraphRAG Integration and Agentic Architectures. In Fifth EAGE Digitalization Conference & Exhibition. https://doi.org/10.3997/2214-4609.202539032
  5. Wang, B., Xu, C., Zhao, X. et al. (2024). MinerU: An Open-Source Solution for Precise Document Content Extraction. arXiv:2409.18839.
  6. Zaharia, M. A., Chen, A., Davidson, A. et al. (2018). Accelerating the Machine Learning Lifecycle with MLflow. IEEE Data Engineering Bulletin, 41, 39–45. https://api.semanticscholar.org/CorpusID:83459546