Multiple Testing in Remote Sensing: Addressing the Elephant in the Room.

Loading...
Thumbnail Image

Files

Primary Manuscript.pdf (494.9 KB)

Description: Manuscript

Supplementary Material.zip (515 KB)

Description: Supplementary Material

Identifiers

Publication date

Reading date

Authors

Gutiérrez-Hernández, Oliver
García, Luís V.

Collaborators

Advisors

Tutors

Editors

Journal Title

Journal ISSN

Volume Title

Publisher

SSRN

Metrics

Google Scholar

Share

Research Projects

Organizational Units

Journal Issue

Department/Institute

Abstract

In environmental remote sensing, statistical tests are often conducted at the pixel level, generating thousands of p-values and substantially increasing the risk of Type I errors. Traditional multiple testing corrections aim to control the probability of any false positives, but often at the cost of drastically reduced statistical power. In contrast, false discovery rate (FDR) control limits the proportion of false positives among significant results while preserving power. It is widely used in image-based disciplines such as neuroimaging and, more broadly, medical imaging, where error control is critical. However, its adoption in remote sensing remains unclear. We conducted a Scopus-based review of twenty years of literature (2004–2023) in remote sensing journals to assess the use of FDR control in spatiotemporal trend analyses, with particular attention to studies employing the Mann-Kendall test, a non-parametric method widely used for detecting monotonic trends at the pixel level. Our results reveal that only 0.03% of remote sensing articles cited the seminal Benjamini–Hochberg (1995) method, and none of the studies using pixel-wise Mann-Kendall testing explicitly applied FDR control. This striking absence highlights a critical risk of overestimating significance in large-scale remote sensing analyses, especially as increasing data precision demands more rigorous error control. We call for routinely incorporating FDR procedures into remote sensing workflows to control the inflation of Type I error in large-scale multiple testing.

Description

Update (17/09/2025): This is an improved version of our SSRN preprint, in which we refined the conceptual framework, strengthened the treatment of multiple testing with a focus on false discovery rate (FDR) control, expanded the literature review, and included supplementary material with R code.

Bibliographic citation

Gutiérrez Hernández, Oliver and García, Luis V., Multiple Testing in Remote Sensing: Addressing the Elephant in the Room (September 17, 2025). Available at SSRN: https://ssrn.com/abstract=4891512 or http://dx.doi.org/10.2139/ssrn.4891512

Collections

Endorsement

Review

Supplemented By

Referenced by