Research Article |
Corresponding author: Matthew R. Moore ( cyclocephala@gmail.com ) Academic editor: Deepa Pureswaran
© 2023 Matthew R. Moore, Elijah J. Talamas, Jonathan S. Bremer, Natalie McGathey, James C. Fulton, Zachary Lahey, Jessica Awad, Cheryl G. Roberts, Lynn A. Combee.
This is an open access article distributed under the terms of the CC0 Public Domain Dedication.
Citation:
Moore MR, Talamas EJ, Bremer JS, McGathey N, Fulton JC, Lahey Z, Awad J, Roberts CG, Combee LA (2023) Mining biodiversity databases establishes a global baseline of cosmopolitan Insecta mOTUs: a case study on Platygastroidea (Hymenoptera) with consequences for biological control programs. NeoBiota 88: 169-210. https://doi.org/10.3897/neobiota.88.106326
|
In the past decade, several species of platygastroid wasps were found to be adventive in North America and Europe while under evaluation in quarantine as biological control agents of invasive pests. The scope and relative risk of this phenomenon is not fully known, but it is clearly a trend with implications for classical biological control. As a means of assessing the issue and to provide a global baseline, we implemented a data-mining approach with DNA sequences in the Barcode of Life Database, yielding 201 platygastroid BINs with intercontinental and island distributions. At least fifty-five BINs displayed exact COI barcode matches across continents, with many more BINs scored as inconclusive due to sequence length variation. These intercontinental and island BINs include biocontrol agents known to be adventive, as well as many species identified only to genus with uncertain geographic origins. We provide 2,500 identifications for platygastroid BOLD BINs, 88% to genus, to encourage additional research on this distributional phenomenon. The intercontinental BOLD BINs were compared to literature records and GBIF occurrences of cosmopolitan species to identify gaps and discordance across data sources. Smaller COI barcode datasets from localities in Florida and Germany, including topotypical specimens, revealed more intercontinental matches. We analyzed COI sequences in BOLD for the entirety of Insecta and Araneae to assess this phenomenon more broadly and because these taxa contain many hosts for platygastroid wasps. This method revealed that the intercontinental distribution phenomenon is widespread with implications for assessing biological diversity, taxonomic methodology and regulatory frameworks.
adventive species, biological control, biodiversity databases, COI barcoding
Human-mediated movement of insect pests is a well-known phenomenon, and mitigation attempts often include classical biological control. Relatively recent invasive stink bugs (Hemiptera, Pentatomoidea) in the United States include the kudzu bug (Megacopta cribraria Fab.), the brown marmorated stink bug (Halyomorpha halys (Stål)) and bagrada bug (Bagrada hilaris Burmeister); management efforts for all of them included classical biological control using egg parasitoids in the family Scelionidae (Hymenoptera, Platygastroidea). In each case, the biological control agent in quarantine was found to be adventive prior to approved release, circumventing regulatory processes and establishing expanded geographic ranges without oversight (
The prevalence of unintended introductions remains largely unevaluated and the numerous detections of scelionid parasitoids of stink bugs is probably a reflection of the attention given to these organisms.
We suspect that unintended introductions of platygastroids will be increasingly detected as ‘dark taxa’ (
In this contribution, we employed multiple research tracks to address the question of how many species of Platygastroidea may be moving over intercontinental distances while eluding detection. (1) We used existing BOLD infrastructure to identify platygastroid BINs (Barcode Index Numbers; database calculated clusters of highly similar sequences used to approximate species) that spanned large geographic areas. (2) Intercontinental platygastroid BINS were identified to genus. To ameliorate taxonomic impediments in the database and facilitate future research, we identified one fifth of all platygastroid BINs in BOLD to genus. (3) For each platygastroid genus with an intercontinental BIN, we compared the results of our data-mining approach with distributional data from taxonomic literature and the Global Biodiversity Information Facility (GBIF). (4) We incorporated data from two geographically disparate COI barcoding projects that share the goal of reliably attaching species names to DNA sequences. The first was a year-long insect trapping effort in Jacksonville, Florida, USA, a type locality for many platygastroids described at the turn of the 20th century. Importantly, traps were operated near the Port of Jacksonville, a potential site of entry for exotic species. The second was the German Barcode of Life III: Dark Taxa, which seeks to enhance the size and quality of the German DNA barcode reference library and includes collecting localities from which platygastroids were described in the 19th century. By generating COI barcodes from topotypical specimens and comparing these specimens to primary types as part of the identification, we made the most reliable association possible between a DNA sequence and a historical name, apart from using DNA from primary types directly. Additionally, this is the most feasible means to infer directionality of movement for species that appear to have adventive distributions. (5) We implemented our data-mining protocol for Insecta and Araneae for two reasons. First, platygastroids parasitize a taxonomically diverse array of insect and spider hosts and data about their distributions might inform our results for Platygastroidea. Second, the phenomenon of intercontinental distributions of arthropods more broadly is worthy of attention, and we sought to demonstrate the scalability of our approach.
Beginning in October 2022, BOLD v.4 (
In February 2023, the dataset was expanded using an automated scripting approach (10.5281/zenodo.7930407). The Darwin Core data files for “Insecta” and “Araneae” were downloaded from BOLD. The Insecta files were downloaded on 3 January 2023. The Araneae files were downloaded on 16 March 2023. The Insecta file was examined for all the categories listed under the field “country”. The country fields were assigned to continents or island categories (Suppl. material
Each BIN in the intercontinental and island platygastroid dataset was individually analyzed to determine whether they contained exact COI barcode matches across large distances. Each BIN’s COI fasta file was aligned using the default settings of MUSCLE (
Trees were viewed in iTOL to determine individual sequence membership in haplotype clusters. Apparent haplotype matches were then examined in the p-distance calculation files for confirmation. The geographic distribution of exact matches was evaluated by examining the specimen level metadata present in the BIN’s Darwin Core text file. Putative exact intercontinental matches were then validated in the underlying DNA alignment. This was necessary due to slight variation in the length of COI barcode sequences. If longer sequences in the alignments displayed polymorphisms toward either the 5’ or 3’ ends and the putative intercontinental matches lacked these flanking data, then the matches were considered inconclusive. Ambiguous DNA base pairs were ignored for considering exact matches.
Individual BIN alignments were ultimately combined into one fasta file for tree building and visualization. A species of Periclistus Förster (Hymenoptera, Proctotrupomorpha, Cynipidae) with appropriate data coverage was selected to root subsequent analyses, based on the sister relationship of Cynipoidea to Platygastroidea (
During the above data-mining activity, it was noticed that a large proportion of platygastroid BINs were unidentified below the family level. A list of Platygastroidea BINs was pulled from BOLD as candidates for identification using the provided digital morphology framework. Identifications were made, when possible, by comparison to images of primary type specimens provided by
Genus level occurrence data (Table
GBIF occurrence datasets used for comparison with the intercontinental BINs.
Family | Genus | GBIF Occurrence Dataset Citation |
---|---|---|
Platygastridae | Allotropa | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.n3derh |
Platygastridae | Amitus | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.xjcn95 |
Platygastridae | Amblyaspis | GBIF.org (28 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.r8zptf |
Platygastridae | Aphanomerus | GBIF.org (28 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.gw43u4 |
Platygastridae | Euxestonotus | GBIF.org (28 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.gu7djg |
Platygastridae | Fidiobia | GBIF.org (14 February 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.w478bm |
Platygastridae | Leptacis | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.efh6t8 |
Platygastridae | Metaclisis | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.qjwpf5 |
Platygastridae | Platygaster | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.d7n4wt |
Platygastridae | Synopeas | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.ev6apy |
Scelionidae | Anteris | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.sjsua4 |
Scelionidae | Aradophagus | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.hbu7rt |
Scelionidae | Baeoneurella | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.dw6d4e |
Scelionidae | Baeus | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.9hxwkx |
Scelionidae | Baryconus | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.qaraf2 |
Scelionidae | Calliscelio | GBIF.org (15 February 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.bsua2e |
Scelionidae | Dicroscelio | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.67g54v |
Scelionidae | Dyscritobaeus | GBIF.org (20 February 2023) GBIF Occurrence Download https://doi.org/10.15468/dl.5mt6wa |
Scelionidae | Gryon | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.tn8kne |
Scelionidae | Hadronotus | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.9qz232 |
Scelionidae | Idris | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.h2qj4j |
Scelionidae | Platyscelio | GBIF.org (13 December 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.cprkz3 |
Scelionidae | Psix | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.cc9rkg |
Scelionidae | Telenomus | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.32xjmc |
Scelionidae | Trimorus | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.ytcg37 |
Scelionidae | Trissolcus | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.t4qbu5 |
Scelionidae | Xenomerus | GBIF.org (31 October 2022) GBIF Occurrence Download https://doi.org/10.15468/dl.5vzkr6 |
Topotypes of platygastroid species described from Duval County, Florida, were collected between July 2018 and December 2021 in and around the Timucuan Ecological and Historic Preserve near the mouth of the St. Johns River. These collections were largely from Malaise traps that were placed at three different sites in the area. Collecting heads were provisioned with propylene glycol and wrapped with aluminum foil to prevent UV damage to the specimens. 3D printed yellow cylinder traps were experimentally deployed on the ground and suspended from overhanging branches during one sampling period and additional specimens were collected in yellow pan traps and by sweep netting around the trap sites. Bulk samples were returned to the laboratory and sorted under a Zeiss Discovery V8 Stereomicroscope. All platygastroid specimens were transferred to 95% ethanol and screened for matches to species described from that area.
DNA was non-destructively extracted from specimens using the Qiagen DNeasy Blood and Tissue Kit. Molecular voucher specimens were recovered and deposited at the Florida State Collection of Arthropods (Florida Department of Agriculture and Consumer Services – Division of Plant Industry; Gainesville, Florida). PCRs were conducted as 25 µl reactions using the KAPA HiFi HotStart Readymix Kit (Roche Diagnostics) per the manufacturer’s recommended protocol. Oligonucleotide primers used for PCR and direct sequencing were the universal arthropod COI barcoding sets LCO1490/HCO2198 (
The German Barcode of Life III: Dark Taxa project (
Following non-destructive DNA extraction, voucher specimens were mounted and photographed with a Macropod imaging system consisting of a Canon EOS 6D Mark II camera body, EF 70–200 mm lens and 10× or 20× M Plan APO Mitutoyo objective lenses. Imaging software included Canon EOS Utility 3.14.30.4 and Helicon Focus Pro 7.7.5 for image stacking. Adobe Photoshop 23.2.2 was used for limited post-processing and addition of scale bars. Images were uploaded to BOLD with specimen metadata.
An R (
A randomized 5% of the recovered Insecta and Araneae BINs were validated by manual examination in the BOLD BIN database (Suppl. materials
Two hundred and one platygastrid and scelionid BINs were found to have intercontinental and island distributions (Table
Summary of platygastroid and host BINs displaying intercontinental and island distributions in BOLD. Data on host associations were derived from summaries in
Taxon (parasitoid) | Intercontinental BINS (parasitoid) | Host taxon | Intercontinental genera (host) | Intercontinental BINs (host) |
---|---|---|---|---|
Platygastridae | ||||
Amblyaspis | 5 | Cecidomyiidae | 43 | 476 |
Euxestonotus | 2 | |||
Leptacis | 7 | |||
Metaclisis | 1 | |||
Platygaster | 35 | |||
Synopeas | 13 | |||
Amitus | 1 | Aleyrodidae | 13 | 35 |
Aphanomerus | 1 | Flatidae | 2 | 4 |
Ricaniidae | 3 | 4 | ||
Allotropa | 1 | Pseudococcidae | 13 | 34 |
Fidiobia | 2 | Chrysomelidae | 45 | 94 |
Curculionidae | 92 | 210 | ||
Platygastrinae | 4 | |||
Scelionidae | ||||
Anteris | 1 | |||
Aradophagus | 2 | Theridiidae | 20 | 52 |
Baeoneurella | 1 | Lygaeidae | 7 | 20 |
Baeus | 4 | Araneae | ||
Baryconus | 1 | Tettigoniidae | 8 | 11 |
Calliscelio | 1 | Gryllidae | 6 | 8 |
Dicroscelio | 2 | |||
Dyscritobaeus | 1 | |||
Gryon | 6 | Heteroptera Lepidoptera | ||
Hadronotus | 5 | |||
Idris | 9 | Araneae | ||
Psix | 1 | Heteroptera | ||
Telenomus | 69 | Heteroptera Lepidoptera Diptera Neuroptera | ||
Trissolcus | 11 | Pentatomidae | 21 | 27 |
Trimorus | 7 | Carabidae | 48 | 105 |
Xenomerus | 1 | |||
Scelionidae | 1 | |||
Scelioninae | 2 | |||
Teleasinae | 1 | |||
Telenominae | 3 |
The minimum BIN size was two (necessary for a geographic match) and the largest, a Platygaster species, contained 677 COI sequences. Most BINs (180 of 201) contained fewer than 100 COI sequences (Fig.
Pareto chart displaying the number of COI barcodes contained per BIN in the intercontinental and island platygastroid dataset.
In the total evidence Platygastroidea dataset, North America (248 connections), Europe (211 connections) and Asia (198 connections) were the most common connections (Fig.
Circos plot displaying the relative contribution of continents and oceanic island groups to geographic connections in the combined platygastroid BOLD BIN, GBIF and literature dataset. Numbers in parentheses indicate the total number of connections to that geographic grouping.
Circos plot displaying the relative contribution of continents and oceanic island groups to geographic connections in the combined Platygaster, Synopeas, Telenomus and Trissolcus BOLD BIN, GBIF and literature datasets. Numbers in parentheses indicate the total number of connections to that geographic grouping.
The intercontinental and island BIN dataset contains 9,874 platygastroid COI barcode sequences. The MAFFT alignment was 669 base pairs wide after trimming excessive data from the 3’ end of some barcodes. The alignment contains some gap regions due to varying COI amino acid phenotypes present among Platygastroidea (
Circularized p-distance neighbor-joining tree of the intercontinental and island BOLD BIN dataset. The BOLD BINs containing Gryon aetherium, Platygaster sp. (BOLD:ACI8542), Psix striaticeps, Trissolcus basalis, Tr. japonicus and Telenomus sp. (BOLD:ACY0393) are highlighted by an enlarged terminal cluster. Lines emanating from these clusters show generalized geographic localities where the BIN was detected. Solid black lines indicate different COI haplotypes. Solid red or blue lines indicate exact COI haplotype matches across continents. Striped red and blue lines indicate multiple exact COI haplotype matches found at that generalized geographic locality.
A total of 2,565 Platygastroidea BINs were evaluated for their identification accuracy using the specimen images provided by BOLD (Suppl. material
GBIF and literature searches returned 130 intercontinental taxa (37 Platygastridae; 93 Scelionidae), with an overall discrepancy of plus 67 BINs (Table
Comparison of total platygastroid intercontinental and island distributions present in different data sources.
Taxon | GBIF/Literature | BOLD BINs | Discrepancy |
---|---|---|---|
Platygastridae Total | 37 | 72 | +35 DNA |
Allotropa | 3 | 1 | -2 DNA |
Amblyaspis | 0 | 5 | +5 DNA |
Amitus | 4 | 1 | -3 DNA |
Aphanomerus | 2 | 1 | -1 DNA |
Euxestonotus | 1 | 2 | +1 DNA |
Fidiobia | 2 | 2 | -/- |
Inostemma | 1 | 0 | -1 DNA |
Leptacis | 0 | 7 | +7 DNA |
Metaclisis | 1 | 1 | -/- |
Platygaster | 15 | 35 | +20 DNA |
Synopeas | 7 | 13 | +6 DNA |
Tetrabaeus | 1 | 0 | -1 DNA |
Scelionidae | 93 | 129 | +36 DNA |
Anteris | 1 | 1 | -/- |
Aradophagus | 4 | 2 | -2 DNA |
Baeoneurella | 0 | 1 | +1 DNA |
Baeus | 2 | 4 | +2 DNA |
Baryconus | 0 | 1 | +1 DNA |
Calliscelio | 14 | 1 | -13 DNA |
Dicroscelio | 1 | 2 | +1 DNA |
Duta | 1 | 0 | -1 DNA |
Dyscritobaeus | 3 | 1 | -2 DNA |
Gryon | 4 | 6 | +2 DNA |
Hadronotus | 4 | 5 | +1 DNA |
Idris | 3 | 9 | +6 DNA |
Platyscelio | 2 | 0 | -2 DNA |
Probaryconus | 1 | 0 | -1 DNA |
Psix | 3 | 1 | -2 DNA |
Scelio | 1 | 0 | -1 DNA |
Telenomus | 24 | 69 | +45 DNA |
Trimorus | 1 | 7 | +6 DNA |
Trissolcus | 24 | 11 | -13 DNA |
Xenomerus | 0 | 1 | +1 DNA |
BOLD identification engine identifications for COI barcoded platygastroid specimens collected at Timucuan Ecological and Historic Preserve and Buck Island (Jacksonville Port Authority).
Processid | Sampleid | Morphological Identification | Topotype | % Match | Match Lowest Taxonomy | Match BIN |
---|---|---|---|---|---|---|
SUPER027-23 | FSCA 00094159 | Baeus | No | 97.19 | Scelionidae | BOLD:ACZ5774 |
SUPER008-23 | FSCA 00094151 | Baryconus floridanus | Yes | 92.4 | Scelionidae | BOLD:ABA5967 |
SUPER034-23 | FSCA 00094155 | Calotelea | No | 100 | Scelionidae | BOLD:AAN8024 |
SUPER043-23 | FSCA 00093888 | Calotelea bicolor | No | 93.78 | Scelionidae | BOLD:ADY8276 |
SUPER047-23 | FSCA 00091193 | Hadronotus bicolor | No | 99.36 | Gryon bicolor | BOLD:AAN8046 |
SUPER048-23 | FSCA 00090993 | Hadronotus bicolor | No | 98.84 | Gryon bicolor | BOLD:AAN8046 |
SUPER049-23 | FSCA 00091003 | Hadronotus bicolor | No | 99.36 | Gryon bicolor | BOLD:AAN8046 |
SUPER002-23 | FSCA 00090995 | Hadronotus carinatifrons | No | 99.5 | Hadronotus carinatifrons | BOLD:AET1244 |
SUPER042-23 | FSCA 00093865 | Hadronotus chelinideae | No | 98.87 | Gryon chelinideae | BOLD:ACN3082 |
SUPER040-23 | FSCA 00097242 | Inostemma | No | 98.79 | Platygastridae | BOLD:ACT8287 |
SUPER045-23 | FSCA 00093946 | Leptacis longipes | Yes | 93.63 | Leptacis | BOLD:ABV2678 |
SUPER044-23 | FSCA 00093928 | Leptacis puncticeps | Yes | 97.47 | Platygastridae | BOLD:AEE0956 |
SUPER001-23 | FSCA 00091067 | Metaclisis | No | 92.26 | Platygastridae | BOLD:AEC9177 |
SUPER037-23 | FSCA 00097245 | Metanopedias brunneipes | Yes | 98.77 | Platygastridae | BOLD:ABY3815 |
SUPER028-23 | FSCA 00094161 | Phanuromyia | No | 91.2 | Scelionidae | BOLD:ACJ7306 |
SUPER033-23 | FSCA 00094146 | Phanuromyia | No | 99.84 | Telenomus autumnalis (unavailable name) | BOLD:ACM1917 |
SUPER004-23 | FSCA 00094172 | Phanuromyia | No | 89.62 | Scelionidae | BOLD:AEN8490 |
SUPER005-23 | FSCA 00094176 | Phanuromyia | No | 86.37 | Scelionidae | BOLD:ADH6867 |
SUPER006-23 | FSCA 00094152 | Phanuromyia | No | 86.9 | Scelionidae | BOLD:ADH6867 |
SUPER007-23 | FSCA 00094154 | Phanuromyia | No | 98.72 | Telenomus | BOLD:AAN8100 |
SUPER022-23 | FSCA 00095852 | Platygaster | No | 96.54 | Platygastridae | BOLD:AAN8090 |
SUPER039-23 | FSCA 00097243 | Platygaster | No | 87.64 | Platygastridae | BOLD:ADI9080 |
SUPER010-23 | FSCA 00094167 | Scelio floridanus | Yes | 98.74 | Scelionidae | BOLD:ACA7140 |
SUPER029-23 | FSCA 00094138 | Scelio floridanus | Yes | 98.89 | Scelionidae | BOLD:AEY5850 |
SUPER009-23 | FSCA 00094175 | Scelio incertus | No | 99.53 | Scelionidae | BOLD:ACA7140 |
SUPER041-23 | FSCA 00093881 | Scelio opacus | No | 95.27 | Scelionidae | BOLD:ACT6721 |
SUPER035-23 | FSCA 00093874 | Scelio pumilis | No | 99.81 | Scelio | BOLD:ACA7141 |
SUPER003-23 | FSCA 00095771 | Synopeas | No | 93.94 | Platygastridae | BOLD:ADH8704 |
SUPER038-23 | FSCA 00095854 | Synopeas | No | 99.53 | Platygastridae | BOLD:AAY6787 |
SUPER017-23 | FSCA 00094197 | Synopeas | No | 93.38 | Platygastridae | BOLD:ACM5719 |
SUPER018-23 | FSCA 00095010 | Synopeas | No | 94.16 | Platygastridae | BOLD:ADH9879 |
SUPER019-23 | FSCA 00095011 | Synopeas | No | 93.62 | Platygastridae | BOLD:ADH9879 |
SUPER020-23 | FSCA 00095851 | Synopeas | No | 99.84 | Synopeas | BOLD:AEP1939 |
SUPER021-23 | FSCA 00097239 | Synopeas | No | 99.68 | Platygastridae | BOLD:AEP1939 |
SUPER036-23 | FSCA 00097487 | Synopeas cynipsiphilum | Yes | 99.04 | Synopeas | BOLD:ADX3415 |
SUPER011-23 | FSCA 00094179 | Telenomus | No | 100 | Scelionidae | BOLD:ACY0393 |
SUPER012-23 | FSCA 00094185 | Telenomus | No | 100 | Scelionidae | BOLD:ACY0393 |
SUPER013-23 | FSCA 00094149 | Telenomus | No | 100 | Scelionidae | BOLD:ACV4748 |
SUPER014-23 | FSCA 00094165 | Telenomus | No | 100 | Scelionidae | BOLD:AEO7335 |
SUPER015-23 | FSCA 00094156 | Telenomus | No | 100 | Telenomus | BOLD:AAN8031 |
SUPER016-23 | FSCA 00094137 | Telenomus | No | 100 | Telenomus | BOLD:ABW3189 |
SUPER032-23 | FSCA 00094153 | Telenomus | No | 100 | Scelionidae | BOLD:ACI3554 |
SUPER046-23 | FSCA 00093915 | Telenomus | No | 99.35 | Telenomus sp. SL017 | BOLD:AEI6588 |
SUPER023-23 | FSCA 00094140 | Telenomus | No | 97.11 | Scelionidae | BOLD:ACX8754 |
SUPER024-23 | FSCA 00094150 | Telenomus | No | 93.81 | Scelionidae | BOLD:ADY7126 |
SUPER025-23 | FSCA 00094139 | Telenomus | No | 99.68 | Telenomus | BOLD:ABY2759 |
SUPER026-23 | FSCA 00094173 | Telenomus | No | 93.84 | Telenomus podisi | BOLD:ADK2938 |
SUPER030-23 | FSCA 00094249 | Trimorus | No | 94.94 | Scelionidae | BOLD:AEJ7657 |
SUPER031-23 | FSCA 00094136 | Trimorus | No | 99.52 | Trimorus | BOLD:ABV9390 |
Forty-nine specimens were COI barcoded from Timucuan Ecological and Historic Preserve and Buck Island, Jacksonville, Florida (Table
Comparison of BOLD data to platygastrine specimens from the GBOL III barcoding initiative yielded 14 intercontinental BINs, representing 11 species in five genera (Amblyaspis, Euxestonotus, Leptacis, Platygaster and Synopeas). BOLD identified six species (seven BINs) only to family, two species (four BINs) to subfamily and three species (three BINs) with binomials. Of the three species identifications provided by BOLD, we verified two (Pl. demades and Pl. sagana) by comparison of voucher specimens to primary types, while one (Pl. tuberosula) was unverifiable. One more species (E. error) was unidentified in BOLD, but identifiable by our own examination. The remaining seven species were unidentifiable due to the superficial description impediment (
The distributions of two species (Pl. demades and putative Pl. tuberosula) can be explained by deliberate introductions for pest control on apple/pear and wheat, respectively. One species (E. error) is probably an unintentional introduction, moving with its host, the wheat midge Sitodiplosis mosellana (Géhin) (
The R script recovered 15,391 Insecta BOLD BINs with intercontinental and island distributions (Suppl. material
Taxonomic summary of Insecta BINs displaying geographic distributions spanning continents and islands.
Order | Families | Genera | BINs |
---|---|---|---|
Archaeognatha | 1 | 1 | 1 |
Zygentoma | 1 | 3 | 3 |
Odonata | 8 | 51 | 91 |
Ephemeroptera | 7 | 18 | 37 |
Dermaptera | 4 | 7 | 11 |
Plecoptera | 4 | 8 | 10 |
Orthoptera | 6 | 44 | 62 |
Embioptera | 2 | 3 | 5 |
Phasmatodea | 2 | 4 | 4 |
Mantodea | 1 | 5 | 7 |
Blattodea | 7 | 31 | 60 |
Psocodea | 21 | 30 | 107 |
Thysanoptera | 3 | 29 | 86 |
Hemiptera | 64 | 454 | 924 |
Hymenoptera | 62 | 736 | 3,056 |
Raphidioptera | 1 | 1 | 1 |
Neuroptera | 5 | 15 | 43 |
Strepsiptera | 2 | 2 | 2 |
Coleoptera | 66 | 560 | 1,121 |
Trichoptera | 17 | 47 | 126 |
Lepidoptera | 77 | 1,984 | 5,336 |
Siphonaptera | 3 | 6 | 12 |
Diptera | 82 | 899 | 4,286 |
Total: 23 | 446 | 4,938 | 15,391 |
We randomly selected 769 BINs from the Insecta dataset for cursory validation of the scripting process (Suppl. material
Greater than 80% of recovered Insecta BINs were present on two continents or islands (Fig.
Histogram displaying the percentage of Insecta BINs with a given number of continent and island data points.
Circos plot displaying the relative contribution of continents and oceanic island groups to geographic connections in the Insecta BOLD BIN dataset. The single data point for Antarctica was eliminated from this visualization. Numbers in parentheses indicate the total number of connections to that geographic grouping.
Our BOLD data-mining approach confirmed several well-characterized cases of parasitoid range expansion in Tr. basalis, Tr. japonicus, Tr. hyalinipennis, Te. remus, G. aetherium, Ps. striaticeps and Pl. demades. Therefore, the geographic distribution patterns found in the platygastroid dataset are generally considered credible even when lacking any directionality. The effort of COI barcoding topotypical specimens can be helpful for determining directionality, particularly if the specimens match primary types described a long time ago. Antiquity of specimens does not preclude the possibility that they were adventive at the time of collection, but it gives, at minimum, a historical perspective.
Exact COI barcode matches from geographically disparate populations might be the most conclusive evidence of a new adventive population within our framework. However, even among the validated cases mentioned above, only Te. remus, G. aetherium and Ps. striaticeps were scored as exact long-distance matches. Trissolcus japonicus and Tr. basalis both have the appearance of exact intercontinental matches, but were scored inconclusive due to sequence length variation. Trissolcus hyalinipennis and Pl. demades have no evidence for exact intercontinental matches in our dataset. Specimens identified as Pl. demades are divided into five BOLD BINS, two of which contain populations introduced into Canada and the USA (
A majority of the intercontinental platygastroid BINs were unidentified below the family level at the beginning of this study, affording our group a “clean” taxonomic slate on which to analyze and interpret these results. Alternatively, the identifications for many of the validated intercontinental Insecta BINs were conflicted, which confounds interpretation of the data. Conflicting taxonomies present in BOLD BINs complicated our ability to extract and summarize higher-level taxonomic data from the Insecta dataset. Even if these taxonomic conflicts can be rectified or understood on a case-by-case basis by expert systematists, that database problem is likely to persist in future analyses. We advocate that experts use our baseline dataset to closely examine the intercontinental BINs in their group of interest and make informed judgements about their veracity.
Many of the BOLD BIN criticisms levelled by
Our case study of Platygastrinae using GBOL specimens indicated that BINs overestimated “species richness” and we reiterate that the BINs in the Insecta and Platygastroidea dataset do not necessarily equate to species. Results for poorly-known or hyperdiverse insect groups must be interpreted with caution.
Platygastroid wasps are one of the most dominant flying insect groups worldwide, with a high rate of geographically structured community turnover and high taxonomic neglect, further complicating faunistic studies (
Determining whether these distributions are natural or adventive for most of the platygastroid BINs is difficult pending taxonomic revisions and follow-up research.
Many Platygastroidea distributions in the BOLD dataset were from the Southern Hemisphere, between the Northern and Southern Tropics or island localities, precluding the need to consider a naturally Holarctic distribution as an explanation for the pattern. We think those cases are best considered introductions mediated by human activity. However, in some Platygaster and Telenomus BINs, the locality data imply enormous geographic ranges across the entirety of Canada, northern and central Europe, northeastern Asia and other spurious localities. For example, Telenomus sp. (BOLD:AAV1142; 461 public barcodes) has occurrences across Canada including Nunavut in the north, south to desert regions of eastern California and eastern Europe. One unidentified platygastrine (BOLD:ABW3192; 37 public barcodes with several exact matches) has occurrences in eastern and western Canada, Germany, the Russian Far East, South Africa and California. These geographic ranges encompass several climates and biomes, showing the apparent ability of the wasps to tolerate dramatically different environmental conditions for at least a short period, considering that species that fail to establish would have to persist long enough to be collected.
Historically, the Palearctic and Nearctic Regions have been regarded as separate by platygastroid taxonomists. Early European authors rarely made comparisons to the fauna of neighboring countries, let alone distant continents. Likewise, early American hymenopterists often treated the Nearctic fauna as unique. This approach contrasts with that of early American lepidopterists, who tended to misidentify Nearctic species as similar-looking European species (
Contrary to the assumptions of the past, the results of our study suggest that some platygastroid genera, such as Platygaster, include many naturally Holarctic species as well as human-mediated introductions.
Constraints due to a lack of data are a consistent theme throughout platygastroid taxonomy, especially when compared with better-studied groups of insects. Our method offers a path to gather and interpret the available data, albeit with limitations. For example, the Nearctic and western Palearctic were remarkably well sampled, allowing for more detailed examination of distribution patterns. On the other hand, the Pacific Islands and Caribbean yielded no Platygaster or Synopeas records. This likely reflects reality in the Pacific Islands but is a result of under sampling in the Caribbean, a distinction which cannot be made by our method alone.
Specimen images in BOLD allowed us to provide a list of genus level identifications for about one fifth of all platygastroid BOLD BINs. This has just begun the process of overcoming taxonomic impediments in the group and the database more broadly, as many thousands of BINs remain to be examined and identified. Platygastroid wasps are generally small insects (0.5 to 10 mm), making species level characters difficult to assess without proper microscopy and high-resolution images. Regardless, the image quality and habitus views in BOLD were generally sufficient to identify BINs to genus. We encourage systematists to examine BIN images in their group of expertise to see if this process can be repeated for other under-studied insect families. Given that we discovered several BINs which contained genera lacking any other DNA sequence data, is it likely that more such cases remain to be found.
Data presented here suggest that taxonomic revisions of platygastroids should be targeted at the global level as much as possible, even if there are feasibility concerns. A consideration of data from across the world will likely be necessary for the accurate and precise description of some sections of platygastrid and scelionid biodiversity. The BIN database should be preemptively searched to quickly quantify diversity present in an area and inform the taxonomic approach. Our genus level identification of BINS has already facilitated one such study.
On another research track, the molecular evolution of COI barcodes across Animalia was evaluated by
These analyses highlight the urgent need for more detailed and comprehensive approaches (
Approximately 84% (12,876/15,391) of the recovered intercontinental and island Insecta BINs were detected across two geographic categories. When these geographic categories are contiguous or adjacent, they are the most likely to be ‘false positives’ based on our data-mining approach. For example, BINs present in lowland rainforest habitats of both Costa Rica and Colombia were scored as intercontinental even though these are probably natural distributions (North America/South America; 2,248 BINs). Indeed, there are many such cases in the dataset, especially among Neotropical Lepidoptera BINs. In the Palearctic, Europe and Asia (2,324 BINs) were connected at a similar magnitude. BINs detected across contiguous landmasses warrant additional scrutiny by taxonomic experts to determine their status as adventive or natural distributions. Just over 2,500 Insecta BINs were detected from three or more continents or island chains. This set of BINs contains insect species which appear to be truly successful global invaders. Our results were intuitive for insect BOLD BINs with the most widespread geographic data; these included species long associated with human activity. For example, Ctenocephalides felis (cat flea; BOLD:AAY6332), Aedes aegypti (yellow fever mosquito; BOLD:AEI9358), Culex quinquefasciatus (southern house mosquito; BOLD:AAA4751) and Aphis gossypii (cotton aphid; BOLD:AAA3070) were recovered in the analysis with 9 or 10 intercontinental and island data points.
The highly structured global sampling of flying insect communities undertaken by
Biosecurity practices entail anticipatory risk assessment and preparedness, methods of surveillance, emergency management and policy enforcement (
Emerging infectious disease surveillance programs have implemented several automated, internet-based data-mining approaches for intelligence gathering (
Further scrutinizing the Insecta BINs with consideration for specimen spatiotemporal data might also inform pathway analyses for species moving via global trade, weather phenomena or animal migration events. Data from the Jacksonville area uncovered an intercontinental Telenomus species, with matching specimens collected in 2014 and 2015 from coastal areas of Bangladesh and California. Relevant agencies could have been alerted to the risk of this new introduction, possibly centered around international seaports-of-entry, eight years prior if the available data were scanned systematically. Thus, nearly a decade of potential research progress on this unidentified, globally-mobile Telenomus went unrealized. Cases like these might be valuable input for pathway models or agent-based models of long-distance insect dispersal. Detailed examination of the Insecta BIN dataset should reveal similar cases for groups with a widely-variable range of life history traits, dispersal potentials and introduction histories or modalities.
We extend our thanks to NPS staff Anne Lewellen, Daniel Tardona, Alberto Alvarado and Kate Henderson, who maintained traps and helped collect specimens at Timucuan National Park (Permit #TIMU-2018-SCI-0004); to Jacksonville Port Authority for access to Buck Island and Marvin Grieve for facilitating access. We are grateful to Erin Powell and Sam Bolton (FDACS-DPI) for their assistance in gathering literature for the discussion and their valuable comments on early drafts of the paper. Matthew Moore, Elijah Talamas, Jonathan Bremer, Natalie McGathey, James Fulton, Cheryl Roberts and Lynn Combee were supported by the Florida Department of Agriculture and Consumer Services, Division of Plant Industry. Jessica Awad was supported by the Bundesministerium für Bildung und Forschung, Berlin, Germany, project “German Barcode of Life III: Dark Taxa” (FKZ 16LI1901C). Zachary Lahey is a participant of the Oak Ridge Institute for Science and Education (ORISE) Agricultural Research Service (ARS) Research Participation Program, supported by the USDA-ARS, U.S. Vegetable Laboratory in Charleston, SC, USA. Mention of trade names or commercial products in this article is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the USDA. Mikko Pentinsaari, Jayme Sones, Allison Brown and other BOLD staff provided critical assistance with updating identifications in the BOLD portal.
Country to continent and island group assignments
Data type: xlsx
Explanation note: File detailing country and island continent assignments.
Randomized BOLD BINs for validation of the Insecta dataset
Data type: xlsx
Explanation note: Table detailing which Insecta BINs were validated by manual examination.
Intercontinental and island Platygastroidea COI dataset alignment
Data type: fas
Explanation note: DNA alignment of COI barcodes from intercontinental and island platygastroids.
Platygastroidea COI dataset NJ tree.tre
Data type: tre
Explanation note: P-distance NJ tree of platygastroid COI barcodes.
Platygastroidea COI dataset NJ tree annotated FIGTREE
Data type: tre
Explanation note: FigTree file of annotated p-distance NJ tree Newick tree.
Intercontinental and island records for targeted Platygastroidea in GBIF and the literature
Data type: xlsx
Explanation note: Data summary for platygastroid taxa with GBIF and literature Records.
List of intercontinental and island Insecta BINs with identification metadata
Data type: xlsx
Explanation note: List of Insecta BINs recovered by the scripting procerdure as intercontiental or island hits, BINs listed under Identification Remarks.
Number of distinct continent or island groupings recovered per Insecta BIN
Data type: csv
Explanation note: Table displaying number of continent and island hits per Insecta BIN.
Unique continent and island hit combinations in Insecta dataset
Data type: csv
Explanation note: Table showing frequencies of geographic combinations in the Insecta BIN dataset.
Pairwise geographic hit comparisons for the Insecta BIN dataset
Data type: xlsx
Explanation note: Pairwise matrix used to generate Circos plot in Figure
Summary of intercontinental and island Platygastroidea BINs
Data type: xlsx
Pairwise geographic hit comparisons for the Platygastroidea BIN, GBIF, and literature dataset
Data type: xlsx
Explanation note: Matrix used to generate Circos plot in Figure
Pairwise geographic hit comparisons for the Platygaster BIN, GBIF, and literature dataset
Data type: xlsx
Explanation note: Matrix used to generate Circos plot in Figure
Pairwise geographic hit comparisons for the Synopeas BIN, GBIF, and literature dataset
Data type: xlsx
Explanation note: Matrix used to generate Circos plot in Figure
Pairwise geographic hit comparisons for the Telenomus BIN, GBIF, and literature dataset
Data type: xlsx
Explanation note: Matrix used to generate Circos plot in Figure
Pairwise geographic hit comparisons for the Trissolcus BIN, GBIF, and literature dataset
Data type: xlsx
Explanation note: Matrix used to generate Circos plot in Figure
Platygastroidea BIN identifications using digital morphology infrastructure
Data type: xlsx
Explanation note: Table of wasp identifications provided to BOLD by using existing digital morphology infrastructure.
List of intercontinental and island Araneae BINs with identification metadata
Data type: csv
Explanation note: List of BOLD metadata for intercontinental and island Araneae identified using the scripting procedure. BINs are under Identification Remarks.
Randomized BOLD BINs for validation of the Araneae dataset
Data type: xlsx
Explanation note: Table of Araneae BINs manually examined for validation of Araneae BOLD scripting procedure.