Methods for interpolating missing data in aerobiological databases
Files
Description: Accepted article
Publisher
Elsevier
Abstract
Obtaining extensive environmental time series is usually laborious and difficult, and unexpected failures are sometimes not detected until samples are processed. Consequently, environmental databases frequently contain gaps of missing data. Applying an interpolation method before starting the data analysis can be a good way to fill in this missing information. Nevertheless, there are several different approaches, and their accuracy should be assessed and compared. In this study, data from six aerobiological sampling stations were used as an example of environmental data series to assess the accuracy of different interpolation methods. To that end, observed daily pollen/spore concentration data were randomly removed, interpolated using different methods, and then compared with the observed data to measure the errors produced. Different periods, gap sizes, interpolation methods and bioaerosols were considered in order to check their influence on interpolation accuracy. The moving mean interpolation method obtained the highest success rate on average. With this method, a success rate of 70% was obtained when the risk classes used in the alert systems of pollen information platforms were taken into account. In general, errors were greater when the concentrations of biotic particles oscillated strongly over consecutive days, which is why the pre-peak and peak periods showed the highest interpolation errors. Errors were also higher for gaps longer than 5 days, so it would be advisable to test other methodological approaches when completing long periods of missing data. A new Variation Index, based on the behaviour of the pollen/spore season (a measurement of the variability of the concentrations every 2 consecutive days), was developed; it allows the potential error to be estimated before the interpolation is applied.
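The evaluation procedure described in the abstract (remove observed values, interpolate, compare with the observations) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the window size, the centred-window definition of the moving mean, the use of mean absolute error, and the sample concentrations are all assumptions made for illustration only.

```python
import random


def moving_mean_fill(series, window=5):
    """Fill None entries with the mean of observed values within +/- `window` days.

    Assumption: a centred window that ignores other missing days; the paper's
    exact moving-mean definition is not given in the abstract.
    """
    filled = list(series)
    for i, value in enumerate(series):
        if value is None:
            lo, hi = max(0, i - window), min(len(series), i + window + 1)
            neighbours = [v for v in series[lo:hi] if v is not None]
            if neighbours:  # leave the gap untouched if nothing was observed nearby
                filled[i] = sum(neighbours) / len(neighbours)
    return filled


def remove_gap(series, gap_size, rng):
    """Blank out one run of `gap_size` consecutive days at a random position."""
    start = rng.randrange(0, len(series) - gap_size + 1)
    gapped = list(series)
    for i in range(start, start + gap_size):
        gapped[i] = None
    return gapped, start


def mean_absolute_error(observed, filled, start, gap_size):
    """Mean absolute error over the removed days that could be filled."""
    errors = [
        abs(observed[i] - filled[i])
        for i in range(start, start + gap_size)
        if filled[i] is not None
    ]
    return sum(errors) / len(errors) if errors else None


# Made-up daily concentrations (hypothetical values, not data from the study).
observed = [2, 4, 9, 20, 45, 80, 60, 30, 12, 5, 3, 1]
rng = random.Random(0)
gapped, start = remove_gap(observed, gap_size=3, rng=rng)
filled = moving_mean_fill(gapped, window=2)
print(mean_absolute_error(observed, filled, start, gap_size=3))
```

Repeating the removal step over many random positions, gap sizes and seasonal periods would give an error distribution per method, which is the kind of comparison the abstract describes.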
Bibliographic citation
Picornell, A., Oteros, J., Ruiz-Mata, R., Recio, M., Trigo, M.M., Martínez-Bracero, M., Lara, B., Serrano-García, A., Galán, C., García-Mozo, H., Alcázar, P., Pérez-Badia, R., Cabezudo, B., Romero-Morte, J., Rojo, J., 2021. Methods for interpolating missing data in aerobiological databases. Environ Res 200, 111391. https://doi.org/10.1016/j.envres.2021.111391
Creative Commons license
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 International










