A transparent Open-Box learning network provides insight to complex systems and a performance benchmark for more-opaque machine learning algorithms
Keywords:
Learning networks, transparency of variable relationships, benchmarking machine learning, performance, prediction of complex petroleum systems, soft-computing solutions, under-fitting/over-fittingAbstract
It is now commonplace to deploy neural networks and machine-learning algorithms to provide predictions derived from complex systems with multiple underlying variables. This is particularly useful where direct measurements for the key variables are limited in number and/or difficult to obtain. There are many petroleum systems that fit this description. Whereas artificial intelligence algorithms offer effective solutions to predicting these difficult-to-measure dependent variables they often fail to provide insight to the underlying systems and the relationships between the variables used to derive their predictions are obscure. To the user such systems often represent “black boxe”. The novel transparent open box (TOB) learning network algorithm described here overcomes these limitations by clearly revealing its intermediate calculations and the weightings applied to its independent variables in deriving its predictions. The steps involved in building and optimizing the TOB network are described in detail. For small to mid-sized datasets the TOB network can be deployed using spreadsheet formulas and standard optimizers; for larger systems coded algorithms and customised optimizers are easy to configure. Hybrid applications combining spreadsheet benefits (e.g. Microsoft Excel Solver) with algorithm code are also effective. The TOB learning network is applied to three petroleum datasets and demonstrates both its learning capabilities and the insight to the modelled systems that it is able to provide. TOB is not proposed as a replacement for neural networks and machine learning algorithms, but as a complementary tool; it can serve as a performance benchmark for some of the less transparent algorithms.
Cited as: Wood, D.A. A transparent Open-Box learning network provides insight to complex systems and a performance benchmark for more-opaque machine learning algorithms. Advances in Geo-Energy Research, 2018, 2(2): 148-162, doi: 10.26804/ager.2018.02.04
ReferencesAalst, W.M.P., Rubin, V., Verbeek, H.M.W., et al. Process mining: A two-step approach to balance between underfitting and overfitting. Softw. Syst. Model. 2010, 9(1): 87-111.
Behnke, S. Hierarchical neural networks for image interpreta-tion. Lect. Notes Comput. Sci. 2003, 2766(3): 1345-1346
Choubineh, A., Ghorbani, H., Wood, D.A., et al. Improved predictions of wellhead choke liquid critical-flow rates: modelling based on hybrid neural network training learning based optimization. Fuel 2017, 207: 547-560.
Cortes, C., Vapnik, V.N. Support-vector networks. Mach. Learn 1995, 20(3): 273-297.
Cybenko, G. Approximation by superpositions of a sigmoidal function. Math. Control Signal. 1989, 2(4): 303-314.
Elhaj, M.A., Anifowose, F., Abdulraheem, A. Single gas ow prediction through chokes using articial intelligence techniques. Paper SPE 177991 Presented at Society of Petroleum Engineers Saudi Arabia Section Annual Technical Symposium and Exhibition, Al-Khobar, Saudi Arabia, 21-23 April, 2015.
Elkatatny, S., Tariq, Z., Mahmoud, M. Real time prediction of drilling fluid rheological properties using Artificial Neural Networks visible mathematical model (white box). J. Pet. Sci. Eng. 2016, 146: 1202-1210.
Farley, B., Clark, W.A. Simulation of self-organizing systems by digital computer. IEEE Trans. Inf. Theory 1954, 4(4): 76-84.
Gomaa, S. New bubble point pressure correlation for middle east crude oils. Int. Adv. Res. J. Sci. Eng. Technol. 2016, 3(12): 1-9.
Hebb, D. The organization of behavior. Science 1961, 133(3466): 1749-1757.
Heinert, M. Artificial neural networks-how to open the black boxes. Application of Artificial Intelligence in Engineering Geodesy, 2008.
Ince, D.C. Collected Works of AM Turing-Mechanical Intelligence. Mechanical intelligence, 1992.
Jang, J.S. ANFIS: Adaptive-network-based fuzzy inference system. IEEE Trans. Syst. Man. Cybern. Syst. 1993, 23(3): 665-685.
Li, Y., Xie, R., Yu, L. Flow pattern identification of oil-gas-water three-phase flow based on NPSO-LSSVM algorithm. J. Theor. Appl. Inf. Technol. 2013, 48(2): 933-938.
McCulloch, W., Pitts, W. A logical calculus of ideas immanent in nervous activity. Bull. Math. Biol. 1943, 5(4): 115-133.
Meng, M., Zhao, C. Application of support vector machines to a small-sample prediction. Adv. Pet. Explor. Dev. 2015, 10(2): 72-75.
Moazzeni, A.R., Nabaei, M., Ghadami Jegarluei, S. Prediction of lost circulation using virtual intelligence in one of the Iranian oil fields. Paper SPE 136992 Presented at SPE Nigeria Annual International Conference and Exhibition, Tinapa-Calabar, Nigeria, 31 July-7 August, 2010.
Rosenblatt, F. The Perceptron: A probabilistic model for information storage and organization in the brain. Psychol. Rev. 1958, 65(6): 386-408.
Santos, R.B., Rupp, M., Bonzi, S.J., et al. Comparison between multilayer feedforward neural networks and a radial basis function network to detect and locate leaks in pipelines transporting gas. Chem. Eng. Trans. 2013, 32: 1375-1380.
Scarselli, F., Tsoil, A.C. Universal approximation using feedforward neural networks: A survey of some existing methods, and some new results. Neural Netw. 1998, 11(1): 15-37.
Schmidhuber, J. Learning complex, extended sequences using the principle of history compression. Neural Comput. 1992, 4(2): 234-242.
Schmidhuber, J. Deep learning in neural networks: An overview. Neural Netw. 2015, 61: 85-117.
Sheremetov, L., Batyrsgin, I., Filatov, D., et al. Fuzzy expert sSystem for solving lost circulation problem. Appl. Soft Comput. 2008, 8(1): 14-29.
Sugeno, M., Kang, G. Structure identification of fuzzy model. Fuzzy Sets. Syst. 1988, 28(1): 15-33.
Suykens, J.A.K., Vandewalle, J. Least squares support vector machine classifiers. Neural Process. Lett. 1999, 9(3): 293-300.
Werbos, P.J. Beyond regression: New tools for prediction and analysis in the behavioral sciences. Boston, Harvard University, 1975.
Yavari, H., Sabah, M., Khosravanian, R., et al. Application of adaptive neuro-fuzzy inference system and mathematical ROP models for prediction of drilling rate. J. Oil Gas Sci. Technol. 2018.
Zamani, H.A., Rafiee Taghanaki, S., Karimi, M., et al. Implementing ANFIS for prediction of reservoir oil solution gas-oil ratio. J. Nat. Gas Sci. Eng. 2015, 25: 325-334.