1887
25th International Conference and Exhibition – Interpreting the Past, Discovering the Future
  • ISSN: 2202-0586
  • E-ISSN:

Abstract

Machine Learning Algorithms (MLA) can be an effective means of lithological classification. The Random Forests (RF) supervised classification approach allows prediction of lithology from disparate geophysical, geochemical and remote sensing data. In this study, we examine the relationship between prediction accuracy and information entropy (H). Data were processed in accordance with industry best practice and input selection was optimised using RF. Using a training set containing 1.4% of available pixels, we produced a classified lithology map with an overall accuracy of 76% with regards to mapped geology. In addition, we produced a class membership probability for each pixel, a precursor to defining the ultimate class designation at each pixel. H was calculated at each pixel from output class membership probabilities; and in this context provides a measure of the state of disorder for each. H was normalised with 0-1 representing the minimum to maximum possible H for each pixel.

H equal to 1 at a pixel represents an equal probability of all candidate classes occurring, whereas H equal to 0 describes a 100% probability of single class occurring. In this study, we demonstrate that there is a significant difference in the distribution of H between correctly and incorrectly classified pixels. The median H of incorrectly classified samples occurs above the 75% percentile of H for correctly classified samples. Conversely, both the mean and median H for correctly classified pixels occurs below the 25% percentile level for incorrectly classified samples.

This information can be used to determine the well-defined transition range in H, above which classification is likely to be inaccurate. Using this approach, a geoscientist can produce a lithological map, a quantifiable measure of uncertainty and a quantifiable transition range above which they are likely to encounter incorrect classification, avoiding wasted expense in targeting based on an incorrect model.

Loading

Article metrics loading...

/content/journals/10.1071/ASEG2016ab196
2016-12-01
2026-01-17
Loading full text...

Full text loading...

References

  1. Breiman, L., 2001, Random Forrests. Machine Learning45, 5-32.
  2. Cracknell, M.J., Reading A.M., 2014, Geological Mapping Using Remote Sensing Data: A Comparison of Five Machine Learning Algorithms,Their Response to Variations in the Spatial Distribution of Training Data and the Use of Explicit Spatial Information. Computers & Geosciences63, 22 - 33.
  3. Cracknell, M.J., Reading A.M., McNeill A.W., 2014, Mapping Geology and Volcanic-Hosted Massive Sulfide Alteration in the Hellyer-Mt Charter Region, Tasmania, Using Random Forests™ and Self-Organising Maps. Australian Journal of Earth Sciences61, 287-304.
  4. Cracknell, M.J., Reading, A.M., 2013, The Upside of Uncertainty: Identification of Lithology Contact Zones from Airborne Geophysics and Satellite Data Using Random Forests and Support Vector Machines. Geophysics78, WB113 - WB26.
  5. Demsar, J., Curk, T., Erjavec, A., Gorup, C., Hocevar, T., Milutinovic, M., Mozina, M., Polajnar, M., Toplak, M., Staric, A., Stajdohar, M., Umek, L., Zagar, L., Zbontar, J., Zitnik, M., and Zupan, B., 2013, Orange: Data Mining Toolbox in Python. Journal of Machine Learning Research14, 2349-53.
  6. Hastie, T., Tibshirani, R., and Friedman, J.H., 2009, The Elements of Statistical Learning: Data Mining, Inference and Prediction. Springer.
  7. Shannon, 1948, A Mathematical Theory of Communication. Bell Systems Technical Journal27, 379-423.
  8. Waske, B., Benediktsson, J.A., Arnason, K., and Sveinsson, J.R., 2009, Mapping of Hyperspectral Aviris Data Using Machine-Learning Algorithms. Canadian Journal of Remote Sensing35, 106-16.
  9. Wellmann, J.F. and Regenauer-Lieb, K., 2012, Uncertainties Have a Meaning: Information Entropy as a Quality Measure for 3-D Geological Models. Tectonophysics526-529, 207-16.
/content/journals/10.1071/ASEG2016ab196
Loading
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error