- Home
- Conferences
- Conference Proceedings
- Conferences
EAGE Workshop on Data Science - From Fundamentals to Opportunities
- Conference date: October 17-18, 2023
- Location: Kuala Lumpur, Malaysia
- Published: 17 October 2023
1 - 20 of 31 results
-
-
Artificial Intelligent Solution for Microgrid Load Forecasting
More LessSummaryDistributed Generation (DG) is a technology that uses local resources such as sun and wind to address environmental and energy crises. Short-term load forecasting (STLF) is essential for MG energy management systems, but current research results on STLF for MGs are relatively low with a complex computation process. This research proposes a new hybrid model that combines deep learning such as Convolutional Neural Network (CNN) and eXtreme Gradient Boosting XG-Boost called ConvXGB to solve the multiclass classification problem. The proposed framework combines 1D- Convolutional Neural Network (CNN) and eXtreme gradient Boosting for the research paper. The hybrid 1D-CNN-XGBoost approach was effective at predicting the short-term spatial distribution of load when the sample size was limited to one month. Gradient boosting machines (XGBoost and LightGBM) were best at daily scale but tended to overfit on training data, with train RMSE values of 0.001 for train dataset and 0.006 for test dataset respectively. In contrast, hybrid 1D-CNN-XGBoost showed greater stability compared to gradient boosting and achieved higher statistical performance with a train MSE value of 0.0009 and test MSE value is 0.0022
-
-
-
Machine Learning Applications in E&P: Best Practices, Challenges, and Recommendations
By F. AnifowoseSummaryThis paper seeks to provide a template for best practices in the successful application of machine learning (ML) methodologies in the E&P business. The objective is to ensure that the benefits of these methodologies are maximized and the pitfalls are avoided. It starts by providing a clarification on the differences and relationships between the two common terminologies: ML and artificial intelligence. Focusing on the machine learning (ML) workflow, which deals with extracting hidden patterns from data and using the patterns to predict various target variables, this paper identifies various opportunities to apply ML and to maximize its value. Some of the challenges facing the implementation of ML in the petroleum industry are discussed. The paper concludes with a few recommendations, professional tips, and best practices to overcome the challenges and avoid the common pitfalls. The outcome of this paper will help to bridge the gap that currently exists between data scientists and domain experts, and provides a collaborative synergy for successful ML projects implementation and maximizing the gains.
-
-
-
Optimizing Performance in Big Data Handling for Enhanced Data Analytics
Authors M.X. Lee and A. ShariffSummaryIn this era of big data, five Vs (volume, variety, value, veracity, velocity) are important and challenging to handle in the industry for data analytics. The increasing amount of data accumulated across time from various sources are often the key concern in every data-science-related study. In our study in generating a well underperforming data analytics dashboard, we encountered a few challenges in consuming the big data, because of the request rate limit, data frequency inconsistency, and authentication limits from the database application programming interface (API) provider. These challenges impose incompleteness of retrieved data, slowness in retrieving data, and failure in automating data retrieval in regular basis. Thus, in this study, we are proposing several performance optimization techniques to enable faster data processing and analysis of large-scale datasets. We achieved a significant performance enhancement in building data pipelines starting from data consumption to publication, after implementing a combination of data partitioning and multiprocessing techniques. These approaches were validated using the real-time production data of 29 oil fields and demonstrated the potential for reducing the data retrieval and processing time. This finding would have significant implication in dealing with massive amounts of data and gives us broader implication on data analytics capabilities.
-
-
-
A Survey of Natural Language Processing in Oil and Gas: Opportunities and Challenges
More LessSummaryNatural Language Processing (NLP) is growing rapidly in this era, fuelled by the advancements in deep learning, the availability of computing resources, and large volume of data. The utilization of NLP for oil and gas has gained significant attention due to its potential in extracting valuable insights from unstructured textual data. However, applying NLP techniques to the oil and gas industry presents unique challenges and pain points that need to be addressed to fully leverage its potential. In this survey, we explore the history, applications, challenges, and future directions of NLP in the oil and gas sector.
-
-
-
Leveraging Anomaly Detection in Finance Services: A Paradigm Shift in Risk & Audit Management
Authors C. Dragomir and V. MirjeSummaryRisk management has long been important in the financial services industry. With the financial environment becoming increasingly complex and traditional risk assessment methods struggling to keep pace with the rapid pace of anomalies, financial institutions are seeking advanced methods of detecting these anomalies. Machine learning and artificial intelligence give the financial sector an opportunity to gain competitive advantage and protect its operations. Shared financial services and global business partnerships, entrusted with finance through internal and external stakeholders, are actively involved in auditing, managing cost data, and monitoring as they ensure secure invoicing. Their focus extends to fraud detection and rapid risk mitigation. By leveraging the power of machine learning and AI, these organizations can enhance their capabilities, provide effective risk solutions, and remain at the forefront of dynamic financial services Technology improved utilization is a strategic advantage to protect their operations and maintain a strong position in the market.
-
-
-
Enhancing Supply Chain Efficiency through Predictive Analytics: A Systematic Approach to Forecast Unit of Activity
More LessSummaryThe supply chain of the oil and gas industry is known for its complexity and uncertainty. Lasker (2022) conducted a comprehensive study to examine the challenges of the industry’s supply chain and the potential of data analytics to improve its performance. Our own supply chain faces similar difficulties, particularly in predicting Unit of Activity (UoA) such as transaction volume, which currently relies on inconsistent and manual methods due to decentralized data and diverse measurement approaches.
Existing approaches do not provide a systematic way of predicting UoA, and prone to inaccuracies and inefficiencies. To address these issues, this study proposed predictive analytics with a combination of statistical methods and machine learning models to forecast purchase order transaction volume.
According to Lee et al. (2022) , predictive analytics utilizes data, statistical techniques, and machine learning algorithms to predict future events and trends, ultimately improving organizational performance. The proposed methodology follows a defined process, including project definition, data collection, data analysis, statistical analysis, modelling, deployment, and model monitoring.
By embracing predictive analytics, proposed approach offers a more efficient and reliable method for UoA prediction, contributing to optimal supply chain performance.
-
-
-
New Approach to Automatic Fault Extraction by Utilizing LSTM Neural Networks
Authors A. Alnasser, S. Al-Dossary and G. AnanosSummaryThis paper presents a novel method of geological fault detection utilizing LSTM (long-short term memory) neural network. LSTM networks excel in sequences classification and are widely used in natural language applications and series forecasting. The new approach was developed to consider the 3D nature of seismic data by splitting and modifying the data to comply with the LSTM input. Furthermore, a blind test was done which predicted most faults with reasonable accuracy. Finally, based on our experiments, LSTM shows a massive potential in fault extraction and possibly other geophysical applications due to the nature of seismic traces being sequential data.
-
-
-
Using Data Science to Unlock Insights and Opportunities in Legal Function
Authors Y. Shariffuddin and V. MirjeSummaryThe increasing demand for legal contract services in today’s business environment requires efficient resource allocation. By leveraging data-driven approaches, organizations can better allocate resources, enhance efficiency, and improve overall performance. This project aims to develop a robust predictive model in forecasting work demand and provide actionable recommendations for resource allocation. Contract volume serves as the demand signal for workload forecasting. Traditional machine learning and time-series forecasting algorithms were assessed, including Random Forest, XGBoost, ARIMA, and Holt-Winters models. Statistical analysis were conducted to estimate the workload volume based on the predicted contract volume. Due to high spectrum of productivity rate, ABC classification method was employed to gain a realistic productivity rate. Integration of data science in the legal function has great potential in unlocking additional insights and opportunities for improvements.
-
-
-
Data Foundations: Unlocking the Potential of Subsurface Machine Learning Workflows
Authors J. Tomlinson and S. MahyildinSummaryThe subsurface industry has invested in exceptionally rich datasets to solve geoscience challenges. These data are typically used by experts to perform detailed technical workflows, which can be time consuming processes. There is an increased adoption of machine learning by the geoscience community (e.g., Beloborodov et al. 2021 and Martin et al. 2022 ), the objective of which is to reduce time to answers and increase the exploration of parameter space to minimize cognitive bias during an interpretive workflow. These workflows have covered a wide range of tasks including consistent petrophysical interpretation, litho-fluid facies classification and detailed prediction of reservoir properties.
The conclusion of many of the publications in this space highlights the key to enabling these digital workflows is investing in the underlying data quality and architecture. This paper presents a case study based on a recent set of projects focused on preparing rock and fluid data for use in reservoir modelling workflows with the objective of allowing interpreters to be limited only by the workflows they have developed and not the constraints of the data available to them.
-
-
-
Driving AI Democratization: A Strategic Framework for Wide-Scale Adoption Across Organization
Authors V. Mirje, Y.F. Lim, C.E. Dragomir, Y. Shariffuddin and S. SelvarajuSummaryArtificial intelligence (AI) has revolutionized the field of computer science by enabling computer systems to perform tasks that traditionally required human intelligence. As AI rapidly advances, organizations are faced with a critical choice: embrace this technology or risk being left behind in the competitive landscape. Therefore, the democratization of AI has become crucial for organizations, just as important as their day-to-day operations.
AI democratization aims to make AI accessible, inclusive, and available to a wide range of individuals and organizations. Its goal is to create an AI landscape that is inclusive, where people from diverse backgrounds can leverage AI to drive economic growth, address societal challenges, and improve quality of life. By democratizing AI, we strive to promote equity, diversity, and empowerment, ensuring that everyone has access to the transformative power of AI.
This paper presents a strategic framework for AI democratization, emphasizing the systematic efforts required to achieve fair and inclusive access to AI technologies. Key elements of the framework include setting clear objectives, assessing the current data and AI technology landscape, promoting education and skill development, fostering collaboration, establishing ethical guidelines and governance, addressing policy and regulatory considerations, and securing funding and investment.
-
-
-
Cutting Lithology Percentages Prediction using Artificial Intelligence
Authors M. Mezghani, E. Tolstaya and M. Saudi AramcoSummaryDrill cutting samples are valuable data that cover the major part of drilled well compared to the core samples that cover only a limited depth interval. Therefore, accurate and objective cutting description plays major role in decision making while drilling, in reservoir characterization studies, and in modeling workflows. We developed an Artificial Intelligence workflow to automatically predict cutting lithology percentages using cutting photos. The workflow can be applied in near-real-time as soon as photos are acquired.
-
-
-
Unleashing the Power of Text Analytics: Enhancing Service Desk Efficiency and User Satisfaction
More LessSummaryThe Global Business Service (GBS) Digital Data Science team is using Latent Semantic Analysis (LSA) to analyse the massive amount of unstructured text data generated by SLB’s Global Service Desk (GSD). The team is using Dataiku as the project’s data pipeline platform, and the results of various text analytics methods are saved in Azure Synapse for later visualisation with PowerBI. The solution has the potential to improve service desk management, decrease non-value-added activity, identify cost-cutting opportunities, and avoid downtime. The team can identify common issues and develop solutions to prevent them from occuring in the future by analysing the data. The team can use LSA to identify patterns and themes in the data, which can then be used to improve the service desk experience for users. In general, the solution has the potential to improve user experiences, operational efficiency, and decision-making.
-
-
-
An Attempt at Detecting Reflection Images from Buried Objects in Ground Penetrating Radar Data using Anomaly Detection Methods with Convolutional Auto Encoder (CAE)
By S. IsoSummaryGround Penetrating Radar (GPR) plays a pivotal role in road maintenance by efficiently detecting underground voids and buried objects. However, manual interpretation of the data collected, characterized by unique hyperbolic reflections from buried objects, is laborious. Machine learning methods, particularly Convolutional Neural Networks (CNN), have been actively proposed for detecting and interpreting these objects, yet they typically rely on supervised learning and extensive labelled data, which is burdensome.
In our approach, we apply Convolutional AutoEncoder (CAE) used in anomaly detection to learn only the background data in GPR exploration. This method identifies parts where buried objects may exist, reducing the need for creating labelled data and enhancing onsite flexibility by learning area-specific background noise.
We’ve successfully detected reflective anomalies from two-dimensional images obtained during subsurface exploration by learning parts without abnormalities. The CAE model, based on a VGG16-based SegNet, operates by dimensionally compressing the input image to match the output image. Anomalies become apparent when input images containing unlearned reflections aren’t restored fully, thereby indicating their location. The method holds promise for more efficient anomaly detection in GPR data.
-
-
-
Improving Interpretability for Basin-scale Well Datasets through the Active Learning Approach - A Case Study from the Norwegian Continental Shelf
Authors H. Nguyen, E. Larsen, D. Evans, D. Oikonomou and G. StefosSummaryWith the development of technology in recent years, the amount of data that geoscientists have to work with is increasing in size and quantity. Conventional analytics methods are gradually supplemented by big-data analytics methods and tools, such as artificial intelligence applications, to improve work efficiency and productivity. To collect a large enough set of labels with high confidence to train and predict geological features (such as lithofacies) is time and economically consuming.
In this study, we propose an active learning workflow as a solution for the challenge mentioned above by showing an example of how we improve the quality, quantity and diversity of the lithofacies interpretation. An extensive dataset was used in this study, including: i) 700 000s cuttings samples, ii) 77 000 meters of core, and iii) 70 000s kilometers of logs from 1744 exploration wells in the Norwegian Continental Shelf.
-
-
-
Case Study: Transfer Learning Application for Permeability Prediction in A Carbonate Reservoir
By N. TripathiSummaryIn this case study, we aim to develop a transfer learning-based workflow for predicting missing permeability in the reservoir. Trained data must be representative of the field under study. The overall idea here is to combine conventional PERM FACIMAGE input with the transfer learning method to leverage their complementary strength.
-
-
-
Integrated Meteorological and Geohazard System Advisory (iMGESA) for Pipeline Integrity
Authors E. Amirian, T.X. Moy, M.N.A. Ahmad, A. Abdollahzadeh, M.J. Rohani, M.S.K. Abdullah and H. HidzirSummaryThe integrity of onshore pipeline systems is a crucial element for oil and gas companies, including PETRONAS. Geohazards, ground movements, and severe metrological conditions of the environment can cause lots of risk to the onshore pipelines. Natural events including landslides, flooding, slope creep, subsidence, and seismic activities increase the potential for pipeline failure rates throughout the development and operation stages. To address these challenges, a data-driven solution called Integrated Meteorological-Geotechnical System Advisory (iMGESA) has been developed. iMGESA is an in-house risk assessment tool that uses machine learning models to predict the probability of geohazard impact on pipeline failure. It incorporates data from SSGP (Sabah Sarawak Gas Pipeline) Pipeline conditions, rainfall data, and geographical properties to generate outputs including probability of landslide, risk index for geohazard impact to pipeline (target PR), and geopig strain.
The objective of iMGESA is to provide accurate and early intervention on the effect of rainfall on terrain degradation and to prevent further deterioration of geohazard conditions at sites. Additionally, a dashboard system and an online platform have been developed to leverage automation and advanced analytics to generate timely reports on terrain health status at PETRONAS onshore pipeline assets, which staff can access at anytime from anywhere.
-
-
-
Integration Of Artificial Intelligence (AI) Into Real Time Monitoring System In Jambi Field CentralSumatera
Authors R. Alfajri, A.H. Susilo and H.Y. PriyanggaSummaryThis paper explains about digital transformation in Pertamina, Indonesia’s national oil & gas company. Th system is ccalled Pertamina Integrated Well Performance, which consists of real-time monitoring and integrate AI for recommendation. This project successfully delivered three useful recommendation that benefited compan during project monitoring.
-
-
-
CLARITY: Harnessing Advanced Analytics for Intelligent Water Management
Authors E. Amirian, S. Madalan, A. Abdollahzadeh, I.F. Ariff, M.A. Carle Pun and N. RamlySummaryThe exponential growth in population has resulted in a substantial increase in water use in domestic, agricultural, and industrial sectors. For Malacca state specifically, Syarikat Air Melaka Berhad (SAMB) becomes the main source of water supply for the state population within all sectors. The increase in demand plus a low water supply will results in water shortage issues. Some of the crucial issues faced by SAMB are delayed response and poor control of effluent lead to wastewater plant upsets and non-compliance of final discharge. The unreliable (fresh) water supply has led to disruptions in plant operations disruption which proven to be extremely costly. In addition, utility water systems experience various issues such as contamination, under-performing assets, and frequent leaks.
Water management and sustainability are vital elements in providing a clean and safe water supply for the public [ 1 ]. Thus, to practice these manifestos for Malacca, SAMB teamed up with PETRONAS, to develop a digital solution to predict the water shortage and provide automated prescriptive analytics which will help operators oversee water treatment issues solutions. This solution will help to predict when the next water shortage will occur by employing the dataset collected by SAMB.
-
-
-
Pitfalls in Fluid Substitution and Porosity Perturbation in Rock Physics Model using Data Science Approach
Authors I. Fadhil, G. Malo-Paul, T. Yuan Jiun and F.H. HasnanSummaryRock Physics Model (RPM) is an important tool for interpreting seismic data or provide quantitative interpretation of a seismic inversion. It is also used to model different fluids and various porosities in a well log to understand their seismic response. This is because partially saturated reservoirs with different porosities and pore fluids are among the major reasons of variations in seismic data.
The RPM is then deployed to generate synthetic seismograms at well locations which are then correlated with seismic data, and act as controls to gain accuracy in the subsurface map. This also can help to narrow down the non-unique interpretations of the seismic amplitudes to de-risk prospect areas.
However, the fluid substitution and porosity perturbation exercise are repetitive, time consuming and mainly manual which translate into limitations in terms of scalability and turnaround time. This work explores the Data Science (DS) approach to enhance the manual process. Nevertheless, the approach is currently having challenges to predict the elastic properties for the substitutions and perturbations scenarios, which is the main discussion in this paper.
-
-
-
Gas Transmission Optimization via Machine Learning
Authors N.A. Mohd Akil, M.N.A. Bujang, W.X. Ooi, M.H. Sa’aid and A. AbdollahzadehSummaryThis paper focuses on leveraging machine learning approach to optimize fuel consumption in compressor stations in Malaysia. The main objective is to predict the fuel consumption where the model recommends the optimal setpoint, thereby enhancing fuel efficiency and minimizing environmental effect in line with the goal of achieving net zero carbon emissions by 2050. A random forest regression model is used to train the input features that significantly affects the accuracy of the model. Extensive measures and factors were considered during model training in achieving the goal to ease or help operators in the decision-making processes. This study is divided into two processes which is developing regression model to predict fuel consumption and then proceed with optimizing the compressor stations with some identified constraint. The model produced makes a valuable contribution to the gas business sector, exhibiting a high accuracy rate with a correlation coefficient of approximately 95%. Furthermore, the findings demonstrate the potential for substantial cost savings by adopting the machine learning optimization model, with a notable 0.65% reduction in CO2 emissions attributed to fuel consumption over a six-month period.
-