Estimating oil and gas recovery factors via machine learning: Database-dependent accuracy and reliability
Alireza Roustazadeh, Behzad Ghanbarian, Mohammad B. Shadmand, Vahid, Taslimitehrani, Larry W. Lake

TL;DR
This study evaluates machine learning models for estimating hydrocarbon recovery factors in reservoirs, highlighting their database dependency and limited reliability across different datasets.
Contribution
It compares three ML models for RF estimation and demonstrates the importance of database similarity for model accuracy.
Findings
XGBoost outperformed SVM and MLR on training and testing datasets.
All models performed poorly on independent databases.
Model accuracy is highly dependent on the similarity of training and testing data distributions.
Abstract
With recent advances in artificial intelligence, machine learning (ML) approaches have become an attractive tool in petroleum engineering, particularly for reservoir characterizations. A key reservoir property is hydrocarbon recovery factor (RF) whose accurate estimation would provide decisive insights to drilling and production strategies. Therefore, this study aims to estimate the hydrocarbon RF for exploration from various reservoir characteristics, such as porosity, permeability, pressure, and water saturation via the ML. We applied three regression-based models including the extreme gradient boosting (XGBoost), support vector machine (SVM), and stepwise multiple linear regression (MLR) and various combinations of three databases to construct ML models and estimate the oil and/or gas RF. Using two databases and the cross-validation method, we evaluated the performance of the ML…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReservoir Engineering and Simulation Methods · Hydrocarbon exploration and reservoir analysis · Hydraulic Fracturing and Reservoir Analysis
MethodsTest · Support Vector Machine · Linear Regression
