Show simple item record

RMSE is not enough: Guidelines to robust data-model comparisons for magnetospheric physics

dc.contributor.authorLiemohn, Michael
dc.contributor.authorShane, Alexander
dc.contributor.authorAzari, Abigail
dc.contributor.authorPetersen, Alicia
dc.contributor.authorSwiger, Brian
dc.contributor.authorMukhopadhyay, Agnit
dc.date.accessioned2022-01-05T14:29:40Z
dc.date.available2022-01-05T14:29:40Z
dc.date.issued2021-04-01
dc.identifier.citationLiemohn, M. W., Shane, A. D., Azari, A. R., Petersen, A. K., Swiger, B. M., & Mukhopadhyay, A. (2021). RMSE is not enough: guidelines to robust data-model comparisons for magnetospheric physics. Journal of Atmospheric and Solar-Terrestrial Physics, 218, 105624. https://doi.org/10.1016/j.jastp.2021.105624en_US
dc.identifier.urihttps://hdl.handle.net/2027.42/171097en
dc.descriptionThis is a review article of recent data-model comparison methodologies used in magnetospheric physics studies, also presenting a systematic categorization of these metrics for robust usage and augmented scientific output.en_US
dc.description.abstractThe magnetospheric physics research community uses a broad array of quantitative data-model comparison methods (metrics) when conducting their research investigations. It is often the case, though, that any particular study will only use one or two metrics, with the two most common being Pearson correlation coefficient and root mean square error (RMSE). Because metrics are designed to test a specific aspect of the data-model relationship, limiting the comparison to only one or two metrics reduces the physical insights that can be gleaned from the analysis, restricting the possible findings from modeling studies. Additional physical insights can be obtained when many types of metrics are applied. We organize metrics into two primary groups: 1) fit performance metrics, often based on the data-model value difference; and 2) event detection metrics, which use a discrete event classification of data and model values determined by a specified threshold. In addition to these groups, there are several major categories of metrics based on the aspect of the data-model relationship that the metric assesses: 1) accuracy; 2) bias; 3) precision; 4) association; 5) and extremes. Another category is skill, which is a measure of any of these metrics against the performance of a reference model. These can be applied to a subset of either the data or the model values, known as reliability and discrimination assessments. In the context of magnetospheric physics examples, we discuss best practices for choosing metrics for particular studies.en_US
dc.description.sponsorshipThe authors would like to thank the US government for sponsoring this research, in particular research grants from NASA (NNX17AB87G, NNX16AQ04G, 80NSSC17K0015) and NSF (1663770). This study received partial funding from the European Union Horizon 2020 Research and Innovation Programme under grant agreement 870452 (PAGER). A. Azari’s contributions are based on work supported by the NSF Graduate Research Fellowship Program (DGE 1256260), A. Mukhopadhyay’s contributions are based on work supported by the NASA Future Investigator fellowship 80NSSC18K1120. B. Swiger’s contributions were partially supported by the NASA Future Investigator fellowship number 80NSSC20K1504. Data for Fig. 3 is available at the University of Michigan Deep Blue Data Repository, https://doi. org/10.7302/Z25T3HQC. Figures in section 4 are reused with permission.en_US
dc.language.isoen_USen_US
dc.publisherElsevieren_US
dc.rightsAttribution 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.subjectspace physics, magnetosphere, data-model comparisons, metricsen_US
dc.titleRMSE is not enough: Guidelines to robust data-model comparisons for magnetospheric physicsen_US
dc.typeArticleen_US
dc.subject.hlbsecondlevelAtmospheric, Oceanic and Space Sciences
dc.subject.hlbtoplevelScience
dc.subject.hlbtoplevelEngineering
dc.description.peerreviewedPeer Revieweden_US
dc.contributor.affiliationumClimate and Space Sciences and Engineering, Department ofen_US
dc.contributor.affiliationumcampusAnn Arboren_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/171097/1/Liemohn_JASTP_2021_RMSEisnotEnough.pdf
dc.identifier.doihttps://dx.doi.org/10.7302/3773
dc.identifier.sourceJournal of Atmospheric and Solar-Terrestrial Physicsen_US
dc.identifier.orcid0000-0002-7039-2631en_US
dc.description.filedescriptionDescription of Liemohn_JASTP_2021_RMSEisnotEnough.pdf : Main article
dc.description.depositorSELFen_US
dc.identifier.name-orcidLiemohn, Michael; 0000-0002-7039-2631en_US
dc.working.doi10.7302/3773en_US
dc.owningcollnameClimate and Space Sciences and Engineering, Department of


Files in this item

Show simple item record

Attribution 4.0 International
Except where otherwise noted, this item's license is described as Attribution 4.0 International

Remediation of Harmful Language

The University of Michigan Library aims to describe its collections in a way that respects the people and communities who create, use, and are represented in them. We encourage you to Contact Us anonymously if you encounter harmful or problematic language in catalog records or finding aids. More information about our policies and practices is available at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.