Work Description

Title: Data for the STONE curve: A ROC-derived model performance assessment tool Open Access Deposited
Attribute Value
  • Two space environment prediction models were assessed against relevant observations using both a relative operating characteristic (ROC) curve as well as a sliding threshold of observations for numeric evaluation (STONE) curve. Similarities and differences between the two metrics are discussed in the paper.
  • Scientists often try to reproduce observations with a model, helping them explain the observations by adjusting known and controllable features within the model. They then use a large variety of metrics for assessing the ability of a model to reproduce the observations. One such metric is called the relative operating characteristic (ROC) curve, a tool that assesses a model’s ability to predict events within the data. The ROC curve is made by sliding the event-definition threshold in the model output, calculating certain metrics and making a graph of the results. Here, a new model assessment tool is introduced, called the sliding threshold of observation for numeric evaluation (STONE) curve. The STONE curve is created by sliding the event definition threshold not only for the model output but also simultaneously for the data values. This is applicable when the model output is trying to reproduce the exact values of a particular data set. While the ROC curve is still a highly valuable tool for optimizing the prediction of known and pre-classified events, it is argued here that the STONE curve is better for assessing model prediction of a continuous-valued data set.

  • Data and code were created using IDL, but can also be accessed with the open-source Gnu Data Language (GDL; see
Contact information
Funding agency
  • National Science Foundation (NSF)
  • National Aeronautics and Space Administration (NASA)
  • Other Funding Agency
Other Funding agency
  • European Union
Citations to related material
  • Liemohn, M. W., Azari, A. R., Ganushkina, N. Yu., & Rastätter, L. (2020). The STONE curve: A ROC-derived model performance assessment tool. Earth and Space Science, 7, e2020EA001106.
Resource type
Last modified
  • 02/19/2020
  • 02/19/2020
To Cite this Work:
Liemohn, M., Azari, A., Ganushkina, N., Rastätter, L. (2020). Data for the STONE curve: A ROC-derived model performance assessment tool [Data set]. University of Michigan - Deep Blue.


Files (Count: 36; Size: 66.9 MB)

Download All Files (To download individual files, select them in the “Files” panel above)

Best for data sets < 3 GB. Downloads all files plus metadata into a zip file.

Best for data sets > 3 GB. Globus is the platform Deep Blue Data uses to make large data sets available.   More about Globus