As spatial datasets grow increasingly large and unwieldy, exact inference on spatial models becomes computationally prohibitive. Various approximation methods have been proposed to reduce the computational burden. Although comprehensive reviews of these approximation methods exist, comparisons of their performance are limited to small and medium-sized datasets and to a few selected methods. To achieve a comprehensive comparison covering as many methods as possible, we organized the Competition on Spatial Statistics for Large Datasets. This competition had the following novel features: (1) we generated synthetic datasets with the ExaGeoStat software so that the number of generated realizations ranged from 100 thousand to 1 million; (2) we systematically designed the data-generating models to represent spatial processes with a wide range of statistical properties for both Gaussian and non-Gaussian cases; (3) the competition tasks included both estimation and prediction, and the results were assessed by multiple criteria; and (4) we have made all the datasets and competition results publicly available to serve as a benchmark for other approximation methods. In this paper, we disclose all the competition details and results, along with some analysis of the competition outcomes.
|Original language|English (US)|
|Journal|Journal of Agricultural, Biological and Environmental Statistics|
|State|Published - Jul 8 2021|
Bibliographical note
KAUST Repository Item: Exported on 2021-07-12
Acknowledgements: Funding was provided by King Abdullah University of Science and Technology.
ASJC Scopus subject areas
- Agricultural and Biological Sciences (all)
- Environmental Science (all)
- Applied Mathematics
- Statistics and Probability
- Statistics, Probability and Uncertainty
- Agricultural and Biological Sciences (miscellaneous)