1887

Abstract

Summary

This abstract discusses the implementation and evaluation of Retrieval-Augmented Generation (RAG) systems in the subsurface domain, where professionals face challenges with large volumes of unstructured data and the need for effective knowledge transfer. The RAG tool, developed by a cross-functional team, leverages text-based experiences from subsurface professionals, enriched with metadata, to enable efficient information retrieval and support faster decision-making.

Three key lessons emerged from the evaluation phase:

  1. Metrics Alone Are Insufficient: Quantitative metrics like retrieval recall and faithfulness provide valuable diagnostics but do not fully capture system performance. Qualitative user feedback is essential to interpret results and guide improvements.
  2. User Feedback Is Critical: The system incorporates a feedback loop where users evaluate queries, retrieved documents, and generated answers. This feedback refines both the content and structure of responses, ensuring the tool meets real-world needs.
  3. Curated Evaluation Datasets Are Crucial: Domain-specific datasets, informed by user interactions, are necessary for robust evaluation. Building such datasets is challenging but vital for meaningful assessment and continuous improvement.

The abstract concludes that RAG systems, when combined with expert feedback and tailored evaluation data, can enhance knowledge retrieval and decision-making in safety-critical subsurface applications.

Loading

Article metrics loading...

/content/papers/10.3997/2214-4609.202639019
2026-03-09
2026-02-06
Loading full text...

Full text loading...

References

  1. Belcic, I. and Stryker, C. [n.d.] RAG vs. fine-tuning. IBM Think. Retrieved September 29, 2025 from https://www.ibm.com/think/topics/rag-vs-fine-tuning
    [Google Scholar]
  2. Douze, M., GuzhvaA., DengC., JohnsonJ., SzilvasyG., Mazaré P-E., LomeliM., HosseiniL. and JégouH. [2024] The faiss library. arXiv preprint arXiv:2401.08281.
    [Google Scholar]
  3. Es, J., James, J., Espinosa-Anke, L. and Schockhaert, S. [2025] Ragas: Automated Evaluation of Retrieval Augmented Generation. arXiv preprint arXiv: 2309.15217.
    [Google Scholar]
  4. Kalai, A. T., Nachum, O., Vempala, S. S. and Zhang, E. [2025]. Why language modelshallucinate. arXiv. https://arxiv.org/abs/2509.04664
    [Google Scholar]
  5. Lewis, P., PerezE., PiktusA., PetroniF., KarpukhinV., GoyalN., KüttlerH. [2021] Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. arXiv: 2005.11401.
    [Google Scholar]
  6. Microsoft Learn [n.d.] Search indexes in Azure AI Search. Retrieved October 10, 2025, from https://learn.microsoft.com/en-us/azure/search/search-what-is-an-index
    [Google Scholar]
/content/papers/10.3997/2214-4609.202639019
Loading
/content/papers/10.3997/2214-4609.202639019
Loading

Data & Media loading...

This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error