RMSE is not enough: Guidelines to robust data-model comparisons for magnetospheric physics

Liemohn, Michael; Shane, Alexander; Azari, Abigail; Petersen, Alicia; Swiger, Brian; Mukhopadhyay, Agnit

dc.contributor.author	Liemohn, Michael
dc.contributor.author	Shane, Alexander
dc.contributor.author	Azari, Abigail
dc.contributor.author	Petersen, Alicia
dc.contributor.author	Swiger, Brian
dc.contributor.author	Mukhopadhyay, Agnit
dc.date.accessioned	2022-01-05T14:29:40Z
dc.date.available	2022-01-05T14:29:40Z
dc.date.issued	2021-04-01
dc.identifier.citation	Liemohn, M. W., Shane, A. D., Azari, A. R., Petersen, A. K., Swiger, B. M., & Mukhopadhyay, A. (2021). RMSE is not enough: guidelines to robust data-model comparisons for magnetospheric physics. Journal of Atmospheric and Solar-Terrestrial Physics, 218, 105624. https://doi.org/10.1016/j.jastp.2021.105624	en_US
dc.identifier.uri	https://hdl.handle.net/2027.42/171097	en
dc.description	This is a review article of recent data-model comparison methodologies used in magnetospheric physics studies, also presenting a systematic categorization of these metrics for robust usage and augmented scientific output.	en_US
dc.description.abstract	The magnetospheric physics research community uses a broad array of quantitative data-model comparison methods (metrics) when conducting their research investigations. It is often the case, though, that any particular study will only use one or two metrics, with the two most common being Pearson correlation coefficient and root mean square error (RMSE). Because metrics are designed to test a specific aspect of the data-model relationship, limiting the comparison to only one or two metrics reduces the physical insights that can be gleaned from the analysis, restricting the possible findings from modeling studies. Additional physical insights can be obtained when many types of metrics are applied. We organize metrics into two primary groups: 1) fit performance metrics, often based on the data-model value difference; and 2) event detection metrics, which use a discrete event classification of data and model values determined by a specified threshold. In addition to these groups, there are several major categories of metrics based on the aspect of the data-model relationship that the metric assesses: 1) accuracy; 2) bias; 3) precision; 4) association; and 5) extremes. Another category is skill, which is a measure of any of these metrics against the performance of a reference model. These can be applied to a subset of either the data or the model values, known as reliability and discrimination assessments. In the context of magnetospheric physics examples, we discuss best practices for choosing metrics for particular studies.	en_US
dc.description.sponsorship	The authors would like to thank the US government for sponsoring this research, in particular research grants from NASA (NNX17AB87G, NNX16AQ04G, 80NSSC17K0015) and NSF (1663770). This study received partial funding from the European Union Horizon 2020 Research and Innovation Programme under grant agreement 870452 (PAGER). A. Azari’s contributions are based on work supported by the NSF Graduate Research Fellowship Program (DGE 1256260). A. Mukhopadhyay’s contributions are based on work supported by the NASA Future Investigator fellowship 80NSSC18K1120. B. Swiger’s contributions were partially supported by the NASA Future Investigator fellowship number 80NSSC20K1504. Data for Fig. 3 is available at the University of Michigan Deep Blue Data Repository, https://doi.org/10.7302/Z25T3HQC. Figures in section 4 are reused with permission.	en_US
dc.language.iso	en_US	en_US
dc.publisher	Elsevier	en_US
dc.rights	Attribution 4.0 International	*
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	*
dc.subject	space physics, magnetosphere, data-model comparisons, metrics	en_US
dc.title	RMSE is not enough: Guidelines to robust data-model comparisons for magnetospheric physics	en_US
dc.type	Article	en_US
dc.subject.hlbsecondlevel	Atmospheric, Oceanic and Space Sciences
dc.subject.hlbtoplevel	Science
dc.subject.hlbtoplevel	Engineering
dc.description.peerreviewed	Peer Reviewed	en_US
dc.contributor.affiliationum	Climate and Space Sciences and Engineering, Department of	en_US
dc.contributor.affiliationumcampus	Ann Arbor	en_US
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/171097/1/Liemohn_JASTP_2021_RMSEisnotEnough.pdf
dc.identifier.doi	https://dx.doi.org/10.7302/3773
dc.identifier.source	Journal of Atmospheric and Solar-Terrestrial Physics	en_US
dc.identifier.orcid	0000-0002-7039-2631	en_US
dc.description.filedescription	Description of Liemohn_JASTP_2021_RMSEisnotEnough.pdf : Main article
dc.description.depositor	SELF	en_US
dc.identifier.name-orcid	Liemohn, Michael; 0000-0002-7039-2631	en_US
dc.working.doi	10.7302/3773	en_US
dc.owningcollname	Climate and Space Sciences and Engineering, Department of

Files in this item

Name:: license_rdf
Size:: 908bytes
Format:: application/rdf+xml

Name:: Liemohn_JASTP_2021_RMSEisnotEn ...
Size:: 8.761MB
Format:: PDF
Description:: Main article

Except where otherwise noted, this item's license is described as Attribution 4.0 International
