1887

Abstract

Summary

Missing and corrupted data are main issues in oil industry and they cost up to $60 billion annually (Nobkhat, 2009). The focus here is on missing and corrupted data in production and injection rates for waterfloods. There are several missing patterns: Arbitary, monotone, multivariate and modified multivariate. The most severe pattern is modified multivariate pattern which happens when there is a major problem with the measurement system. Treating missing data has two options; 1- Dropping missing values, and it is not recommended for more than 3% missing. 2- Imputing the data using the simple reservoir models; Resistance Model. A decision tree was programmed in Matlab to process these scenarios. Two outliers approaches were suggested in this work, a sample dependant and a distance based approaches which depend on the distance of each point from the neighbour points. It is not recommended to work with data has more than 6% missing. The understanding of data behaviour plays a main role in which approach will be treat missing values. Two cases were tested 15% and 30% of missing with max resulted value of R2 .95 and min .75, a software tool was developed to process these scenarios

Loading

Article metrics loading...

/content/papers/10.3997/2214-4609.201414382
2015-11-16
2024-04-20
Loading full text...

Full text loading...

References

  1. Albertoni, Alejandro, Larry WLake, et al.
    [2003] Inferring interwell connectivity only from well-rate fluctuations in waterfloods. SPE Reservoir Evaluation & Engineering, 6(1), 6–16.
    [Google Scholar]
  2. Feller, Willliam
    . [2008] An introduction to probability theory and its applications. Volume 2. John Wiley & Sons.
    [Google Scholar]
  3. Nobakht, M., Mattar, L. et al.
    [2009] Diagnostics of Data Quality for Analysis of Production Data. Canadian International Petroleum Conference, Petroleum Society of Canada.
    [Google Scholar]
  4. Schafer, J.L. and Graham, J.W.
    [2002] Missing data: our view of the state of the art. Psychological methods, 7(2), 14.
    [Google Scholar]
  5. Chawla, S. and PeiSun
    [2006] Outlier detection: Principles, techniques and applications. Proceedings of the 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), Singapore.
    [Google Scholar]
  6. Breunig, M.M., Kriegel, H.-P., Ng, R.T. and Sander, J.
    [2000] LOF: identifying density-based local outliers. ACM Sigmod Record, 29. ACM, 93–104.
    [Google Scholar]
http://instance.metastore.ingenta.com/content/papers/10.3997/2214-4609.201414382
Loading
/content/papers/10.3997/2214-4609.201414382
Loading

Data & Media loading...

This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error