Evaluating Catchment Models as Multiple Working Hypotheses: on the Role of Error Metrics, Parameter Sampling, Model Structure, and Data Information Content
Sina Khatami, Timothy John Peterson, Murray C Peel, Andrew Western

TL;DR
This study introduces Flux Mapping to evaluate catchment models as hypotheses, analyzing how model structure, parameter sampling, error metrics, and data influence model performance and runoff generation understanding.
Contribution
It systematically disentangles the effects of error metrics, parameter sampling, and data content on catchment model evaluation using Flux Mapping.
Findings
KGEss is more reliable than NSE and WIA for model evaluation.
Changing the error metric alters the model solution space and performance.
Unreliable metrics and insufficient sampling impair runoff generation hypotheses.
Abstract
To evaluate models as hypotheses, we developed the method of Flux Mapping to construct a hypothesis space based on dominant runoff generating mechanisms. Acceptable model runs, defined as total simulated flow with similar (and minimal) model error, are mapped to the hypothesis space given their simulated runoff components. In each modeling case, the hypothesis space is the result of an interplay of factors: model structure and parameterization, chosen error metric, and data information content. The aim of this study is to disentangle the role of each factor in model evaluation. We used two model structures (SACRAMENTO and SIMHYD), two parameter sampling approaches (Latin Hypercube Sampling of the parameter space and guided-search of the solution space), three widely used error metrics (Nash-Sutcliffe Efficiency - NSE, Kling-Gupta Efficiency skill score - KGEss, and Willmott refined…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
