Research Article |
Corresponding author: Joana Santana ( joanafsantana@cibio.up.pt ) Academic editor: Helen Sofaer
© 2024 Joana Santana, Neftalí Sillero, Joana Ribeiro, César Capinha, Ricardo Jorge Lopes, Luís Reino.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Santana J, Sillero N, Ribeiro J, Capinha C, Lopes RJ, Reino L (2024) Predicting the expansion of invasive species: how much data do we need? NeoBiota 95: 109-132. https://doi.org/10.3897/neobiota.95.122335
|
Ecological niche models (ENMs) are a powerful tool to predict the spread of invasive alien species (IAS) and support the implementation of actions aiming to reduce the impact of biological invasions. While calibrating ENMs with distribution data from species’ native ranges can underestimate the invasion potential due to possible niche shifts, using distribution data combining species’ native and invasive ranges may overestimate the invasion potential due to a reduced fitness and environmental tolerance of species in invaded ranges. An alternative may be using the increasingly available distribution data of IAS as they spread their invaded ranges, to iteratively forecast invasions as they unfold. However, while this approach accounts for possible niche shifts, it may also underestimate the species’ potential range, particularly at the early stages of the invasion when the most suitable conditions may not yet be represented in the distribution range data set. Here, we evaluate the capacity of ENMs to forecast the distribution of IAS based on distribution data on invaded ranges as these data become available. We further use dispersion models to assess the expansion process using the predicted potential distributions. Specifically, we used the common waxbill (Estrilda astrild) in the Iberian Peninsula as a model system. We built ENMs with 10×10 km grid cells distribution records cumulatively for each decade from 1960 to 2019, and yearly bioclimatic variables, to forecast the species potential range in the coming decades. Then, we assessed the performance of the models for each decade in forecasting the species’ observed range expansion in the following decades and evaluated how the number of distribution records determined the quality of the forecasts. Finally, we performed dispersal estimates (based on species traits, topography, climate and land cover) to analyse the prediction capacity of models as their uncertainty may be reduced when projecting them to the next decades. Our results show that invasion-only ENMs successfully forecasted the species’ range expansion over three decades after invasion, while dispersion models were not important in forecasting common waxbill expansion. Our study highlights the importance of constantly monitoring alien species, suggesting that iterative updating of ENMs with observed distribution data may accurately forecast the range expansion of alien species.
Alien species, common waxbill, dispersal analyses, ecological niche models, Estrilda astrild, forecasts
Biological invasions are among the most worrisome environmental problems in modern times (
Modelling and projecting the realised niche of IAS in the geographical space allows the identification of the areas at risk of invasion (
Considering the potential for realised niche shifts, previous studies have recommended calibrating ENMs using distribution data of IAS in both native and invasive ranges (
Invasion monitoring efforts are producing high-quality spatiotemporal data of spread for a large number of IAS in invaded ranges (e.g.
Here, we evaluate the capacity of iterative calibration of ENM models based on invasion-only distribution data to predict the invasion potential and analyse the expansion process of IAS. We specifically explore the relationship between the number of species distribution records since establishment and the capacity of models to inform about the species’ invasion potential. We also assessed the importance of accounting for the dispersal capacity of species to predict their expansion. To achieve these aims, we considered one of the most studied and successful avian invasive alien species, established in different environments and biogeographic regions worldwide: the afro-tropical common waxbill (Estrilda astrild). This species was first introduced to Portugal in 1964, and has spread across much of the country and into part of Spain fed by further introductions (Reino and Silva 1998;
Using a unique, high-quality, database on spatial dispersion of the common waxbill through the Iberian Peninsula over six decades (Reino and Silva 1998;
The Iberian Peninsula (southwestern Europe), covers an area of 582,860 km2 and mainly includes the continental territories of Spain and Portugal (Fig.
We gathered historical data on the common waxbill expansion in the Iberian Peninsula since its first introduction in the 1960s. For this, we obtained presence data of the species in the continental territories from
We obtained yearly climate data for the temporal period covered by the distribution data from the EuMedClim Database (http://gentree.data.inra.fr/climate/; Fréjaville and Garzón 2018), which provides yearly climate data between 1901–2014 at 1 km resolution for Europe and the Mediterranean Basin. We considered the seven bioclimatic variables available from this source: bio1 - annual mean temperature; bio2 - mean diurnal temperature range; bio5 - Maximal temperature of the warmest month; bio6 - minimal temperature of the coldest month; bio12 - annual precipitation; bio13 - precipitation of the wettest month; bio14 - precipitation of the driest month). From these variables, to minimise cross-correlation between variables, we kept four variables that had an absolute value of Pearson correlation coefficient below 0.7 (Suppl. material
We estimated the realised niche of the species (sensu
We calculated realised niche models using Maxent v.3.4.4 (
We measured model discrimination performance using the area under the curve (AUC) of the receiver operating characteristics (ROC) plots (
We used the Boyce Index (
We determined the contribution of each climatic variable in explaining the species’ distribution using a jackknife resampling based on: (1) values of the training and test gain; and (2) of AUC values. The jackknife resampling comprises two steps: (1) the generation of a model with all climatic variables except one; and (2) the generation of univariate models, each using only one climatic variable. In each step, the jackknife analysis measures the change in training and test gain, and the AUC determines the importance of each variable. Using the results from each of these procedures, Maxent calculated an average percentage contribution of each climatic variable. We also calculated the permutation importance: for each environmental variable in turn, the values of that variable on training presence and background data are randomly permuted. The model is re-evaluated on the permuted data, and the resulting drop in training AUC is calculated, and normalised to percentages (
We further validated the ENMs for each decade and their respective projections by counting the number of presences classified as presences or as absences. For this, we categorised the continuous models into two categories by applying the threshold ‘Maximum training sensitivity plus specificity’ for the Cloglog output. We used the presence records of each decade and previous decades, i.e., cumulatively. The total number of presences used to validate the projections of each model was the same (9, 114, 160, 334, 752, 1120, see Suppl. material
Accounting for dispersal barriers/capacity has been pointed out as important to reduce uncertainty in future projections of species distribution (
Parameter | Value |
---|---|
Continuous mode | 0 |
Number of environmental change steps to perform | 6 |
Number of dispersal steps to perform within each environmental change step | 10 |
Dispersal kernel: probability of colonising a directly adjacent cell | 1 |
Long-distance dispersal frequency | 0.0001 |
Minimum distance for long-distance dispersal in pixels | 2 |
Maximum distance for long-distance dispersal in pixels | 4 |
Initial maturity age of newly colonized cells | 1 |
Propagule production probability as a function of cell age | 1 |
Number of replicates | 100 |
Barriers | No barriers / Weak / Strong |
Dispersal barriers were represented by elevation, land cover, and hydrological factors (Fig.
The current geographic distribution of the common waxbill in the Iberian Peninsula spans most of the Iberian Atlantic coast, but also through large areas in Southern and Eastern Iberia extending to the Mediterranean coast, as far as Catalonia (Fig.
The expansion process was faster in the Central and Northern regions of Portugal (70’s), whereas the spread in the south seemed to have been boosted by an additional introduction event in the Algarve in the same decade that enabled the colonisation of western Andalucia during the following decade (Fig.
ENMs had test AUC values higher than 0.8 and significantly differed from random (Table
Results of AUC, TSS, and Boyce index for models considering the accumulated presences since the first introduction.
Decade | Training n | Test n | AUC | TSS | Boyce index |
---|---|---|---|---|---|
1960–1969 | 4 | 1 | 0.99 ± 0.010 | 0.83 ± 0.09 | 0.06 |
1960–1979 | 24 | 10 | 0.98 ± 0.010 | 0.91 ± 0.03 | 0.86 |
1960–1989 | 73 | 31 | 0.95 ± 0.010 | 0.80 ± 0.03 | 0.98 |
1960–1999 | 171 | 73 | 0.92 ± 0.010 | 0.71 ± 0.02 | 0.99 |
1960–2009 | 432 | 184 | 0.88 ± 0.010 | 0.63 ± 0.01 | 0.99 |
1960–2019 | 639 | 273 | 0.86 ± 0.010 | 0.60 ± 0.01 | 0.99 |
The variable with the highest contribution for the first decade (1960–1969), was the mean diurnal temperature range (bio2), whereas the minimum temperature of the coldest month (bio6) was the variable with the highest contribution in the following decades (Table
Contributions and permutation importance of the bioclimatic variables (mean diurnal temperature range (bio02), minimal temperature of the coldest month (bio06), precipitation of the wettest month (bio13) and precipitation of the driest month (bio14) of the Maxent models. Highest values of variable contribution and permutation importance for each model are highlighted in bold. Temperature, and particularly the minimal temperature, was the most important variable affecting the distribution of common waxbill for all models.
Decade | Variable contribution | Permutation importance | ||||||
---|---|---|---|---|---|---|---|---|
bio02 | bio06 | bio13 | bio14 | bio02 | bio06 | bio13 | bio14 | |
1960–1969 | 50.97 | 19.75 | 0.06 | 29.22 | 39.97 | 3.74 | 0.14 | 56.15 |
1960–1979 | 4.59 | 72.24 | 6.58 | 16.60 | 0.90 | 84.85 | 4.54 | 9.710 |
1960–1989 | 2.76 | 79.67 | 7.62 | 9.95 | 1.18 | 80.53 | 5.82 | 12.47 |
1960–1999 | 6.06 | 76.36 | 8.40 | 9.18 | 5.74 | 78.30 | 10.46 | 5.501 |
1960–2009 | 4.90 | 68.36 | 15.33 | 11.41 | 5.37 | 64.04 | 21.40 | 9.19 |
1960–2019 | 5.14 | 67.70 | 15.10 | 12.06 | 5.69 | 62.53 | 17.63 | 14.16 |
The areas identified as suitable widened over time, from the coastal areas towards the interior of the Iberian Peninsula (Fig.
Results of cumulative ecological niche models projecting habitat suitability to the next decades. The suitability maps are organised as in Suppl. material
Validation of the ENMs of each decade projected to the remaining periods (Fig.
Percentage of presences incorrectly (red) and correctly (green) classified over time, for each model (blue background) and projection (yellow background). Validation for each model was conducted using the number of cumulative presences from the previous decade(s) (for details see Suppl. material
The species’ potential range accounting for dispersal capacity increased over time, driven by the results of ENM projections. In the first decade, the range deemed susceptible to colonisation was narrow, and almost all of the Iberian Peninsula was beyond reach. On the other hand, for the last decade, these areas were much wider (Fig.
Results of dispersal models per decade and type of barriers (no barriers, weak barriers, and strong barriers). The colour sequence Blue -> Green -> Light Green -> Yellow indicates the dispersal of species over time in each decade. Yellow indicates areas where the species did not have time to arrive. Purple indicates areas where the species cannot occur because habitat suitability predicted by ENMs was low.
This study evaluates the use of increasingly available spatiotemporal data on IAS spread to iteratively forecast invasions as they unfold. The backbone of these forecasts were ENMs using detailed distribution data of the common waxbill expansion through the Iberian Peninsula over six decades. Our projections, based on invasion-range data, were successful in forecasting the species’ current distribution after three decades following its introduction. These results support the idea that ENMs can successfully forecast the species’ range expansion, although they may have limited utility in the early stages of invasion, supporting the use of an iterative approach (
Our results are in line with previous studies arguing that ENMs may underestimate the species’ potential ranges (
Models disregarding the species’ global distribution provide worse results than full distribution models (
Contrary to expectations, our results suggest that barriers to dispersion were not insuperable by the common waxbill, although they might be important for other species with lower dispersal capacity. MigClim considers long dispersal events where the species reaches new locations without human intervention. In that case, the species only needs to arrive at a pixel with enough suitability. The few differences found in the projections using the ENM-only and dispersion models (with strong and weak barriers) indicate that the species was able to disperse over time following the suitable areas predicted by ENMs. In line with the commonly observed lag period during the invasion process (generally attributed to the exponential growth process, stochastic extinction of propagules, or an evolutionary modification of species following establishment,
While the common waxbill has been a highly successful avian invader across various continents and islands, its spatial spread appears more limited in comparison to other, predominantly older, Palearctic invaders worldwide, such as the European starling (Sturnus vulgaris) or the house sparrow (Passer domesticus). However, it seems to show a rather eclectic adaptation as these last two species colonise a great diversity of open and semi-open habitats, but not limited to human-made habitats (e.g., agricultural habitats, gardens), but also to wetlands. Probably, its expansion is more comparable with a more recent invader: red-billed leiothrix (Leiothrix lutea) in Europe. Though this species is more associated with forest habitats and is likely to be more limited to tree-based habitats, it has been a very successful and established invader with populations in several European countries and regions (Pereira et al. 2019). It is worth noting that both the European starling and the house sparrow have much older introductions, with established populations in regions like North America and Australia. For instance, the European starling was successfully introduced to multiple areas during the same period, as in Australia (
Accurately anticipating the expansion of IAS is key to ensuring the successful implementation of preventive and mitigation actions. Forecasting invasions by means of different quantitative methods and modelling strategies have been used in the last decades, and new approaches are constantly emerging (
We thank Victor Encarnação from CEMPA/ICNF for providing information on the common waxbill ringing recaptures in Portugal. We are also thankful to Martin Sullivan for his help with data and initial discussions, and to the subject editor and three anonymous reviewers for their valuable comments and suggestions that helped improve the manuscript.
The authors have declared that no competing interests exist.
No ethical statement was reported.
JS was financed by the FEDER Funds through the Operational Competitiveness Factors Program-COMPETE and by National Funds through FCT-Foundation for Science and Technology within the scope of the project “PTDC/ BIA-EVL/30931/2017-POCI-01-0145-FEDER-030931” and the project EuropaBON (Grant agreement No. 101003553, EU Horizon 2020 Coordination and Support Action). NS and LR are supported by FCT - Fundação para a Ciência e a Tecnologia public institute (IP), under the Stimulus of Scientific Employment: Individual Support contract no. CEECIND/02213/2017 and CEECIND/00445/2017, respectively. CC acknowledges support from FCT through support to CEG/IGOT Research Unit (UIDB/00295/2020 and UIDP/00295/2020). RJL through an FCT - Transitory Norm contract [DL57/2016/CP1440/CT0006]. This work was supported by the European Union’s Horizon 2020 Research and Innovation Programme under the Grant Agreement Number 857251. The authors also acknowledge research support via national funds from the Portuguese Foundation for Science and Technology (FCT) in the scope of the project PTDC/BIA-ECO/0207/2020 (DOI: 10.54499/PTDC/BIA-ECO/0207/2020).
Conceptualization: Joana Santana, Luís Reino; Data curation: Joana Ribeiro, Luís Reino, César Capinha, Neftalí Sillero; Formal analysis: Neftalí Sillero; Funding acquisition: Luís Reino, Joana Ribeiro, Joana Santana; Investigation: Joana Santana, Luís Reino, César Capinha, Neftalí Sillero, Joana Ribeiro, Ricardo Jorge Lopes; Methodology: Neftalí Sillero, Joana Santana, César Capinha, Luís Reino, Joana Ribeiro, Ricardo Jorge Lopes; Project administration: Luís Reino; Validation: Joana Santana, Neftalí Sillero; Visualization: Joana Santana, Neftalí Sillero; Writing - original draft: Joana Santana, Neftalí Sillero, Luís Reino; Writing - review and editing: Joana Santana, Neftalí Sillero, César Capinha, Joana Ribeiro, Luís Reino, Ricardo Jorge Lopes.
Joana Santana https://orcid.org/0000-0002-4100-8012
Neftalí Sillero https://orcid.org/0000-0002-3490-3780
Joana Ribeiro https://orcid.org/0000-0002-3755-5785
César Capinha https://orcid.org/0000-0002-0666-9755
Ricardo Jorge Lopes https://orcid.org/0000-0003-2193-5107
Luís Reino https://orcid.org/0000-0002-9768-1097
EuMedClim Database: http://gentree.data.inra.fr/climate/. Shuttle Radar Topography Mission: https://www.earthdata.nasa.gov/sensors/srtm. Global Land Cover 2000 dataset: https://www.eea.europa.eu/data-and-maps/figures/global-land-cover-2000-250m. Free-flowing rivers in Europe: https://www.eea.europa.eu/data-and-maps/figures/free-flowing-rivers-in-europe. Estrilda astrild occurrence data: Suppl. material
Oldest records of common waxbill Estrilda astrild for each 10×10 UTM km grid cells of Portugal and Spain
Data type: csv
Supplementary figures and tables
Data type: docx
R code scripts used to run the analyses
Data type: zip