Abstract

Summary

The abstract presents a scalable, machine-learning-based workflow designed to classify critical information from more than 8 million unstructured technical reports. Guided by subject matter experts, the workflow classifies documents, tables, and images, enabling semantic and contextual search through a high-performance platform. The approach supports continuous improvement while preserving accuracy and traceability. It significantly reduces manual effort and speeds up decision-making in complex, data-rich environments.


/content/papers/10.3997/2214-4609.202639064
2026-03-09
2026-02-14
