The purpose of the work is the development of methodology for the operational selection of wells for hydraulic fracturing treatment based on machine learning for Field A. In order to prepare information for machine learning the data from the simulation model, monthly field reports, well test, HF reports are collected and analyzed. Principal component analysis was used in order to find out correlating between themselves parameters and exclude them from the database. Target for binary classification parameter is efficiency of HF treatment.

After the final database has been formed, machine learning process was realized. Six different models of machine learning were trained and evaluated: k-neighbors, support vector method, decision tree, random forest, gradient boosting, neural network. Comparison of models was carried out and best classification model was selected on base of several independent metrics. The parameters most influencing on the result of the success of the HF were identified. Machine learning model based on neural networks has been trained to estimate the average annual oil production rate after HF treatment. Based on the selected model, predictions are made for the remaining wells not participating in the training. Well candidates for HF treatment were proposed. Economic efficiency was estimated.


Article metrics loading...

Loading full text...

Full text loading...


  1. Andreas C.Muller, SarahGuido
    . “Introduction to machine learning with python: a guide for data scientists”, 2016, pp.393
    [Google Scholar]
  2. TrevorHastie, RobertTibshirani, JeromeFriedman
    . “The elements of statistical learning”. 2015, pp.745
    [Google Scholar]
  3. HenrikBrink, Joseph W.Richards, MarkFetherolf
    . “Reai-world machine learning”. 2017, p.330
    [Google Scholar]
  4. LiawA., WienerM.
    “Classification and regression by random forest”. R News, 2002, no. 2(3), p 18–22
    [Google Scholar]
  5. ShapireR. E., Y.Freund
    . “Boosting: Foundations and algorithms2012, The MIT Press, pp. 544
    [Google Scholar]
  6. Venables, W.N., Ripley, B.D.
    Modern applied statistics with S, 4th edition”. 2002, pp. 497.
    [Google Scholar]

Data & Media loading...

This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error