Spatial machine-learning model diagnostics: a model-agnostic distance-based approach

Alexander Brenning

doi:10.1080/13658816.2022.2131789

JOURNAL ARTICLE

Year: 2022; Journal: International Journal of Geographical Information Science; Vol: 37 (3); Pages: 584-606; Publisher: Taylor & Francis

Abstract

While significant progress has been made towards explaining black-box machine-learning (ML) models, there is still a distinct lack of diagnostic tools that elucidate the spatial behaviour of ML models in terms of predictive skill and variable importance. This contribution proposes spatial prediction error profiles (SPEPs) and spatial variable importance profiles (SVIPs) as novel model-agnostic assessment and interpretation tools for spatial prediction models with a focus on prediction distance. Their suitability is demonstrated in two case studies representing a regionalization task in an environmental-science context, and a classification task from remotely-sensed land cover classification. In these case studies, the SPEPs and SVIPs of geostatistical methods, linear models, random forest, and hybrid algorithms show striking differences but also relevant similarities. Limitations of related cross-validation techniques are outlined, and the case is made that modelers should focus their model assessment and interpretation on the intended spatial prediction horizon. The range of autocorrelation, in contrast, is not a suitable criterion for defining spatial cross-validation test sets. The novel diagnostic tools enrich the toolkit of spatial data science, and may improve ML model interpretation, selection, and design.
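The idea of profiling prediction error against prediction distance can be illustrated with a minimal sketch. It is not the paper's procedure. It assumes one plausible reading: leave-one-out prediction in which all training points closer than a chosen distance to the test point are withheld, repeated over a grid of distances. The function names `nearest_neighbour_predict` and `error_profile`, the synthetic data, and the 1-nearest-neighbour stand-in model are all hypothetical choices made for illustration.

```python
import numpy as np


def nearest_neighbour_predict(train_xy, train_y, test_xy):
    """1-nearest-neighbour interpolation, a stand-in for any spatial model."""
    d = np.linalg.norm(test_xy[:, None, :] - train_xy[None, :, :], axis=2)
    return train_y[np.argmin(d, axis=1)]


def error_profile(xy, y, distances, predict=nearest_neighbour_predict):
    """RMSE of leave-one-out predictions for each prediction distance.

    For a given distance, every training point closer than that distance
    to the test point is withheld (assumed buffering scheme).
    """
    n = len(y)
    pairwise = np.linalg.norm(xy[:, None, :] - xy[None, :, :], axis=2)
    profile = []
    for dist in distances:
        sq_errors = []
        for i in range(n):
            keep = pairwise[i] >= dist
            keep[i] = False  # never train on the test point itself
            if not keep.any():
                continue  # no training data left at this distance
            pred = predict(xy[keep], y[keep], xy[i:i + 1])[0]
            sq_errors.append((pred - y[i]) ** 2)
        profile.append(np.sqrt(np.mean(sq_errors)) if sq_errors else np.nan)
    return np.array(profile)


# Synthetic, spatially smooth response on random locations (illustrative only).
rng = np.random.default_rng(0)
xy = rng.uniform(0.0, 10.0, size=(200, 2))
y = np.sin(xy[:, 0]) + np.cos(xy[:, 1])

distances = np.array([0.0, 1.0, 2.0, 3.0])
profile = error_profile(xy, y, distances)

for dist, rmse in zip(distances, profile):
    print(f"prediction distance >= {dist:.1f}: RMSE = {rmse:.3f}")
```

Because `predict` is passed in as a plain function, the same loop could in principle be run with a linear model, a random forest, or a kriging predictor, which is what makes a diagnostic of this kind model-agnostic. A variable importance profile could be sketched along the same lines by permuting one predictor in the test data and recording the change in error at each distance.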

Keywords:

Computer science; Context (archaeology); Machine learning; Random forest; Contrast (vision); Artificial intelligence; Spatial analysis; Spatial contextual awareness; Range (aeronautics); Focus (optics); Variable (mathematics); Data mining; Predictive modelling; Interpretation (philosophy); Task (project management); Geography; Mathematics; Statistics; Engineering

Metrics

FWCI (Field Weighted Citation Impact): 2.65

Citation Normalized Percentile: 0.87

Topics

Soil Geostatistics and Mapping

Physical Sciences → Environmental Science → Environmental Engineering

Hydrology and Watershed Management Studies

Physical Sciences → Environmental Science → Water Science and Technology

Hydrological Forecasting Using AI

Physical Sciences → Environmental Science → Environmental Engineering

Related Documents

Code and data related to: Spatial Machine-Learning Model Diagnostics: A Model-Agnostic Distance-Based Approach

Interpreting machine learning models using model-agnostic approach

Model-agnostic interpretable machine learning