Full text
81,385 characters
· extracted from
preprint-html
· click to expand
The phylogenetic position of the extinct Hawaiian honeyeaters: Overcoming the limitations of antique DNA | bioRxiv /* */ /* */ <!-- <!-- /*! * yepnope1.5.4 * (c) WTFPL, GPLv2 */ (function(a,b,c){function d(a){return"[object Function]"==o.call(a)}function e(a){return"string"==typeof a}function f(){}function g(a){return!a||"loaded"==a||"complete"==a||"uninitialized"==a}function h(){var a=p.shift();q=1,a?a.t?m(function(){("c"==a.t?B.injectCss:B.injectJs)(a.s,0,a.a,a.x,a.e,1)},0):(a(),h()):q=0}function i(a,c,d,e,f,i,j){function k(b){if(!o&&g(l.readyState)&&(u.r=o=1,!q&&h(),l.onload=l.onreadystatechange=null,b)){"img"!=a&&m(function(){t.removeChild(l)},50);for(var d in y[c])y[c].hasOwnProperty(d)&&y[c][d].onload()}}var j=j||B.errorTimeout,l=b.createElement(a),o=0,r=0,u={t:d,s:c,e:f,a:i,x:j};1===y[c]&&(r=1,y[c]=[]),"object"==a?l.data=c:(l.src=c,l.type=a),l.width=l.height="0",l.onerror=l.onload=l.onreadystatechange=function(){k.call(this,r)},p.splice(e,0,u),"img"!=a&&(r||2===y[c]?(t.insertBefore(l,s?null:n),m(k,j)):y[c].push(l))}function j(a,b,c,d,f){return q=0,b=b||"j",e(a)?i("c"==b?v:u,a,b,this.i++,c,d,f):(p.splice(this.i++,0,a),1==p.length&&h()),this}function k(){var a=B;return a.loader={load:j,i:0},a}var l=b.documentElement,m=a.setTimeout,n=b.getElementsByTagName("script")[0],o={}.toString,p=[],q=0,r="MozAppearance"in l.style,s=r&&!!b.createRange().compareNode,t=s?l:n.parentNode,l=a.opera&&"[object Opera]"==o.call(a.opera),l=!!b.attachEvent&&!l,u=r?"object":l?"script":"img",v=l?"script":u,w=Array.isArray||function(a){return"[object Array]"==o.call(a)},x=[],y={},z={timeout:function(a,b){return b.length&&(a.timeout=b[0]),a}},A,B;B=function(a){function b(a){var a=a.split("!"),b=x.length,c=a.pop(),d=a.length,c={url:c,origUrl:c,prefixes:a},e,f,g;for(f=0;f<d;f++)g=a[f].split("="),(e=z[g.shift()])&&(c=e(c,g));for(f=0;f<b;f++)c=x[f](c);return c}function g(a,e,f,g,h){var i=b(a),j=i.autoCallback;i.url.split(".").pop().split("?").shift(),i.bypass||(e&&(e=d(e)?e:e[a]||e[g]||e[a.split("/").pop().split("?")[0]]),i.instead?i.instead(a,e,f,g,h):(y[i.url]?i.noexec=!0:y[i.url]=1,f.load(i.url,i.forceCSS||!i.forceJS&&"css"==i.url.split(".").pop().split("?").shift()?"c":c,i.noexec,i.attrs,i.timeout),(d(e)||d(j))&&f.load(function(){k(),e&&e(i.origUrl,h,g),j&&j(i.origUrl,h,g),y[i.url]=2})))}function h(a,b){function c(a,c){if(a){if(e(a))c||(j=function(){var a=[].slice.call(arguments);k.apply(this,a),l()}),g(a,j,b,0,h);else if(Object(a)===a)for(n in m=function(){var b=0,c;for(c in a)a.hasOwnProperty(c)&&b++;return b}(),a)a.hasOwnProperty(n)&&(!c&&!--m&&(d(j)?j=function(){var a=[].slice.call(arguments);k.apply(this,a),l()}:j[n]=function(a){return function(){var b=[].slice.call(arguments);a&&a.apply(this,b),l()}}(k[n])),g(a[n],j,b,n,h))}else!c&&l()}var h=!!a.test,i=a.load||a.both,j=a.callback||f,k=j,l=a.complete||f,m,n;c(h?a.yep:a.nope,!!i),i&&c(i)}var i,j,l=this.yepnope.loader;if(e(a))g(a,0,l,0);else if(w(a))for(i=0;i (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0];var j=d.createElement(s);var dl=l!='dataLayer'?'&l='+l:'';j.src='//www.googletagmanager.com/gtm.js?id='+i+dl;j.type='text/javascript';j.async=true;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-M677548'); Skip to main content Home About Submit ALERTS / RSS Search for this keyword Advanced Search New Results The phylogenetic position of the extinct Hawaiian honeyeaters: Overcoming the limitations of antique DNA View ORCID Profile Min Zhao , View ORCID Profile Rebecca T. Kimball , View ORCID Profile Edward L. Braun doi: https://doi.org/10.1101/2025.06.02.657453 Min Zhao 1 Department of Biology, University of Florida , Gainesville, FL 32611, US 2 Department of Biological Sciences, Virginia Tech , Blacksburg, VA 24061, US Find this author on Google Scholar Find this author on PubMed Search for this author on this site ORCID record for Min Zhao Rebecca T. Kimball 1 Department of Biology, University of Florida , Gainesville, FL 32611, US Find this author on Google Scholar Find this author on PubMed Search for this author on this site ORCID record for Rebecca T. Kimball Edward L. Braun 1 Department of Biology, University of Florida , Gainesville, FL 32611, US Find this author on Google Scholar Find this author on PubMed Search for this author on this site ORCID record for Edward L. Braun For correspondence: ebraun68{at}ufl.edu Abstract Full Text Info/History Metrics Supplementary material Preview PDF Abstract Mohoidae, the Hawaiian honeyeaters, are a group of extinct birds that once inhabited the Hawaiian Islands. Their genetic data are only available from historical museum specimens, so it is based on highly fragmented and damaged DNA. Poor sequence recovery in these “antique” samples can bias phylogenetic inference which often results in exceptionally long terminal branches in the phylogeny. These long branches can distort estimates of phylogeny, an artifact known as long branch attraction. However, the best way to analyze this data is unclear because these long branches are typically artifactual, reflecting DNA quality. We investigated the position of Mohoidae within the superfamily Bombycilloidea using published alignments of ultraconserved elements (UCEs) and re-assembled UCEs from raw sequencing reads. The uncertainty involves two monotypic families (Hypocoliidae and Hylocitreidae). Re-aligning UCEs using a smaller taxon set closely related to Mohoidae, along with edge trimming and re-assembly, eliminated the long branches of Mohoidae and Hypocoliidae (also represented by an antique sample). The sister relationship of these two families in previous studies appears to be artifactual; our new analyses yielded congruent topologies that placed Mohoidae as sister to a clade comprising Hypocoliidae and Hylocitreidae. We also found that different assemblers (SPAdes vs Trinity) yielded sequences with conflicts towards the ends of the UCEs, especially for the antique samples, although this did not alter the topology. Finally, combining the UCEs with legacy data allowed us to produce a strongly supported complete phylogeny of Bombycilloidea the included all extant species as well as the extinct Mohoidae. Introduction Understanding the evolutionary history of extinct species is critical for reconstructing a complete and reliable Tree of Life. It provides a more accurate picture of how different lineages are related and how biodiversity has changed over time. However, while DNA can survive hundreds or thousands of years, ancient DNA samples typically suffer from degradation, such as reduced fragment size, depurination and cytosine deamination ( Briggs et al., 2007 ; Dabney et al., 2013 ; Pääbo et al., 2004 ). DNA extracted from more recent museum specimens and other types of natural history collections (“antique DNA”) is typically of higher quality than ancient DNA extracted from fossils ( Wandeler et al., 2007 ), although it is often associated with reduced fragment sizes and contains varying levels of hydrolytic and oxidative damage depending on the preservation methods, storage conditions, and age of the specimen ( Wandeler et al., 2003 ; Zimmermann et al., 2008 ). Poor sequence recovery can bias phylogenetic relationships. When sequences are missing or contain errors, they can result in erroneous alignments and incorrect estimates of genetic divergences among taxa, leading to distorted topology and biased branch length estimates (e.g., Hosner et al., 2016 ; Philippe et al., 2011 ; Roure et al., 2013 ). In some cases, taxa with poor sequence recovery appear to have exceptionally long terminal branches (e.g., Bernstein and Ruane, 2022 ; Braun et al., 2024 ; Derkarabetian et al., 2019 ; Kimball et al., 2021 ; McCullough et al., 2024 ; Nash et al., 2024 ; Zhao et al., 2025a ). While long branches can result from accelerated sequence evolution (e.g., Braun et al., 2019 ; Cai et al., 2019 ; James et al., 2013 ), those associated with poor data recovery are likely to be methodological artifacts rather than true evolutionary signals ( Braun et al., 2024 ; Kimball et al., 2021 ; McCullough et al., 2024 ; Zhao et al., 2025a ). The incorrect grouping of two or more long branches is an artifact called long branch attraction (reviewed by Bergsten, 2005 ) and a major goal of efforts to improve models of sequence evolution has been the reduction of artifacts like long branch attraction (e.g., Kapli et al., 2020 ; Philippe et al., 2011 ; Smith et al., 2015 ). However, the patterns in the data that lead to long branches in analyses including antique samples (where sequencing errors and missing data contribute to terminal branch length) are likely to differ from the site patterns generated by accelerated evolution. Taxa with sequencing errors and extensive missing data may have an even larger impact on multispecies coalescent methods based on combining estimated gene trees because the problematic sequences are non-randomly distributed ( Xi et al., 2016 ). Overall, artifactual long branches might lead to the incorrect placement of taxa that cannot be addressed using standard phylogenetic methods; unfortunately, this possibility has not been explored extensively (but see Xia et al., 2003 ). If sequencing errors can be a source of long branches, the choice of bioinformatic tools used to process raw sequence data might have a large impact on downstream alignment and analyses. These choices include the sequence assembler (e.g., Islam et al., 2021 ; Pightling et al., 2015 ; Ribeiro et al., 2021 ), alignment programs (e.g., Pais et al., 2014; Portik and Wiens, 2021 ; Thompson et al., 2011 ), and the choice of reference genome if one is used (e.g., Rick et al., 2024 ; Valiente-Mullor et al., 2021 ). Assemblers are less likely to be biased when they are used with good quality sequence data with high coverage ( Pightling et al., 2014 ). But the performance of specific assemblers is likely to become more important when the sequencing depth is more variable, which is expected to be the case for sequence capture data ( Faircloth et al., 2012 ), transcriptome sequencing ( Hölzer and Marz, 2019 ), and single-cell sequencing ( Daley and Smith, 2014 ). Choices among bioinformatic pipelines may become even more important when antique samples are analyzed because factors such as DNA fragmentation and damage are likely to further exacerbate these problems. The influence of sequence assembler choice on phylogenomic studies has received limited attention, since many phylogenomic studies focus on using one assembler for all data. Multiple initiatives across the globe have begun to generate genomic data for many parts of the Tree of Life, such as the Vertebrate Genomes Project (VGP), the Bird 10,000 Genomes (B10K) Project, the OpenWings Project, the 5,000 Insect Genomes Initiative (i5K) and the Ten Thousand Plant Genome Project (10KP). Efforts to build a complete Tree of Life will no doubt use data generated by these initiatives. While it might be viewed as a best practice to begin with raw sequence and perform all analyses including sequence assembly using a single pipeline, it is more typical to use existing assemblies. For well-studied groups like birds there are many ongoing efforts to generate sequence data using multiple methods, such as whole genome sequencing (WGS) ( Jarvis et al., 2014 ; Lamichhaney et al., 2015 ; Sackton et al., 2019 ; Stiller et al., 2024 ), sequence capture ( Harvey et al., 2020 ; McCormack et al., 2013 ; Moyle et al., 2016 ; Smith et al., 2023 ), and transcriptome sequencing ( Kuhl et al. 2021 ). Sequence capture is likely to be especially important for groups that can only be obtained as museum specimens ( McCormack et al., 2016 ; McCullough et al., 2019 ; Zhao et al., 2025a ). In these cases, it is not always clear how integrating these types of heterogeneous sequence data might affect estimates of phylogeny. The extinct Hawaiian honeyeaters (Mohoidae) were a group of birds that specialized on nectar feeding ( Billerman, 2020 ). Mohoidae includes at least five species from two genera, Moho and Chaetoptila , that went extinct in the past two centuries ( Lovette, 2008 ); in addition to the historical records of Mohoidae there is fossil evidence for one or two additional Chaetoptila species ( James and Olson, 1991 ). While it is clear that Mohoidae fits within the passerines or perching birds (Passeriformes, the largest avian order), its placement has been variable. Mohoidae were originally treated as Australasian honeyeaters (Meliphagidae) due to morphological similarities that reflect their adaptation for nectar-feeding, as well as plumage and biogeographical distribution pattern ( Lovette, 2008 ). However, molecular data ( Fleischer et al., 2008 ) revealed that the Hawaiian honeyeaters were most closely related with three small families: Bombycillidae (waxwings), Ptiliogonatidae (silky flycatchers), and monotypic Dulidae (Palmchat). Other studies ( Alström et al., 2014 ; Braun et al., 2024 ; Burleigh et al., 2015 ; Oliveros et al., 2019 ; Spellman et al., 2008 ; Zhao et al., 2025b ) subsequently found evidence for a clade, which Alström et al. (2014) named Bombycilloidea, comprising Mohoidae, Bombycillidae, Ptiliogonatidae, and Dulidae, as well as two other monotypic families, Hylocitreidae (Hylocitrea) and Hypocoliidae (Gray Hypocolius). Two previous phylogenomic studies, Oliveros et al., (2019) and Braun et al., (2024), sampled representatives of all six families in Bombycilloidea using thousands of ultraconserved elements (UCEs). Both were large-scale studies, with the former study focused across all passerines, whereas the latter included representatives for almost all avian families. The Mohoidae and Hypocoliidae sequences in these two studies were generated from toe pads cut of antique study skins collected in 1901 and 1937, respectively; sequences for the other four families were generated from frozen tissues. Oliveros et al. (2019) used one assembler (SPAdes) for toe pads and a different assembler (Trinity) for tissues; Braun et al. (2024) used the Oliveros et al. (2019) assemblies for these taxa, though these data were re-aligned. In Braun et al. (2024) , both Moho and Hypocolius were represented by exceptionally long terminal branches and connected by a relatively short internal branch (see also Fig. 1 ), raising the concern that the sister relationship could reflect long branch attraction. To address this concern, we asked several questions. First, we asked how choice of assemblers might affect phylogenetic inference when incorporating sequences from antique samples. Second, we asked whether adding taxa might aid in breaking up the long branches. Third, we asked whether re-aligning loci using a smaller, more focused taxon set might resolve phylogenetic noise introduced by missing data and potential alignment errors. Lastly, we provided a comprehensive, well-resolved phylogeny for the Bombycilloidea superfamily by combining UCEs with legacy loci (mitochondrial and nuclear loci) that have been commonly used in many previous phylogenetic studies. Download figure Open in new tab Fig. 1. Maximum likelihood trees estimated based on 10 UCE datasets (re-assembled datasets below black line). Except for slight differences in taxon sampling, their topologies only differ in the arrangements among Moho , Hypocolius and Hylocitrea (marked in orange and blue: orange – Hypocolius is sister to Moho ; blue – Hypocolius is sister to Hylocitrea ). Values at nodes are ultrafast bootstrap support values based on 1000 replicates. Methods Estimating trees using published alignments We downloaded the complete UCE alignments directly from Oliveros et al. (2019) and Braun et al. (2024) and extracted sequences only for the six Bombycilloidea species and three outgroup species ( Table 1 ) from the supermatrices; referred as Oliveros alignment and Braun alignment respectively. Oliveros et al. (2019) used two different assemblers: Trinity ( Hölzer and Marz, 2019 ) was used for fresh tissue, while SPAdes ( Prjibelski et al., 2020 ) was used for antique (toe pad) samples. Braun et al. (2024) used the Oliveros et al. (2019) UCE assemblies, but then added other species (distantly related taxa that are not part of this study) and re-aligned the data. In addition, internal trimming was used on the Braun alignment, but not on the Oliveros alignment. Both studies used MAFFT ( Katoh and Standley, 2013 ) as the sequence aligner. Three outgroup species were chosen to represent Muscicapoidea, Regulidae and Certhioidea. Though the exact relationships among these outgroups and Bombycilloidea have not been fully resolved (cf. Braun et al., 2024 ; Oliveros et al., 2019 ; Stiller et al., 2024 ; Zhao et al., 2025b ), it is clear that Bombycilloidea is most closely related to these three clades. Therefore, we included a total of nine species in the subset alignments. Since these alignments came from datasets with many more taxa, we then used a custom perl script to remove any all-gap (“N”, “?” and “-”) columns in these alignments (this step may not be necessary as most tree inference programs such as IQ-TREE would ignore those columns). For each dataset ( Table 2 ; data sets 1-2), we used IQ-TREE2 v.2.2.2 ( Minh et al., 2020 ) to run a concatenated analysis using the implemented ModelFinder ( Kalyaanamoorthy et al., 2017 ) to select the best-fit partitioning models (-m TESTMERGE) while allowing each partition to have its own rate (-p) and reducing the computational burden with a relaxed hierarchical clustering algorithm (--rcluster 10). Branch support was assessed using 1000 ultrafast bootstrap replicates (-B 1000) (Hoang et al., 2018). View this table: View inline View popup Table 1. Summary statistics of UCEs extracted for each sample. For the five samples with target capture raw reads, we show statistics for both the SPAdes assembly and the Trinity assembly, as well as the number of overlapping UCEs between the two assemblies. View this table: View inline View popup Table 2. UCE datasets used in this study. All alignments underwent edge trimming in PHYLUCE unless otherwise stated. Adding taxa to Oliveros alignment It has been suggested that adding in taxa can improve estimates of phylogeny by “breaking up” long branches ( Bergsten, 2005 ; Wiens and Tiu, 2012 ). Therefore, we added more Bombycilloidea species for which data were available to the Oliveros alignment (we chose this dataset because it was not internally trimmed). We downloaded the genome assemblies for Phainopepla nitens (Ptiliogonatidae) and Bombycilla garrulus (Bombycillidae), which were collected as part of Feng et al. (2020) , from NCBI GenBank and used PHYLUCE ( Faircloth, 2016 ) to extract UCE loci based on the 5K UCE probe set (available from https://github.com/faircloth-lab/uce-probe-sets ; Sun et al., 2014 ). We extracted 500 bp upstream and downstream of the UCE core for each locus. We also downloaded the WGS reads for the extinct Moho braccatus from NCBI SRA and used Trimmomatic v.0.39 ( Bolger et al., 2014 ) to trim any remaining adapters. We used Bowtie2 v.2.2.5 ( Langmead and Salzberg, 2012 ) and samtools v.1.12 ( Danecek et al., 2021 ) to map the raw reads of Moho braccatus to a reference file that contained the UCEs extracted from the Bombycilla garrulus genome assembly. We then used SPAdes v.3.15.3 ( Prjibelski et al., 2020 ) to assemble the mapped reads of Moho braccatus , with the careful flag to perform mismatch corrections, the single-cell mode ( Nurk et al., 2013 ) to accommodate the uneven coverage of reads which is typical of target enrichment sequencing data and a coverage cutoff at five. Same as described above, we then used the standard PHYLUCE pipeline to extract UCEs from the SPAdes assembly of Moho braccatus . We used MAFFT v.7.520 ( Katoh and Standley, 2013 ) to align UCEs of Phainopepla nitens and Moho braccatus to the Oliveros alignment which were directly extracted from the original, large supermatrix as described above. To preserve the original structure of alignments, we used the add feature ( Katoh and Frith, 2012 ) and --keeplength option in MAFFT. Since the UCEs of Phainopepla nitens were extracted from a genome, they are typically longer than the Oliveros alignment, therefore, we used the --addlong option of the add feature. For UCEs of Moho braccatus which were extracted from an antique sample, we added them in a second step using the --addfragments option. As described above, we estimated a phylogeny on data set 3 ( Table 2 ) using IQ-TREE2 with the same settings. Re-aligning Oliveros UCEs The Oliveros alignment was subsetted from a large supermatrix with over 200 taxa spanning Passeriformes, so it could include alignment errors that reflect the inclusion of distantly related taxa in the alignment; these errors may be especially problematic for taxa with poor data recovery (e.g., taxa represented by antique sample). To address this, we re-aligned the raw UCE assemblies for the six Bombycilloidea species and three outgroup species from Oliveros et al. (2019) using MAFFT with the default settings implemented in PHYLUCE. Similarly, we estimated a phylogeny using the re-aligned Oliveros UCEs ( Table 2 ; data set 4) in IQ-TREE2. Re-assembling target enrichment UCE data We also built datasets from available genome assemblies and new assemblies from target enrichment raw reads. We prioritized use of published genome assemblies over UCE target enrichment reads ( Table 1 ). We also identified three outgroup species for Bombycilloidea that had genome assemblies available to represent Muscicapoidea, Regulidae and Certhioidea respectively. For UCE target enrichment reads (five Bombycilloidea species), we used Trimmomatic v.0.39 ( Bolger et al., 2014 ) to trim adapters and prinseq v.0.20.4 ( Schmieder and Edwards, 2011 ) to filter duplicate reads. Since Oliveros et al. (2019) used Trinity to assemble fresh tissue samples and SPAdes to assemble antique samples from museum specimens, we assembled reads for all five species using both SPAdes v.3.15.3 ( Prjibelski et al., 2020 ) and Trinity v.2.15.1 ( Hölzer and Marz, 2019 ) to see how different assemblers may have different effects on resulting sequences. As described above, we then used PHYLUCE to extract UCEs from the SPAdes, Trinity and genome assemblies. We then performed two types of alignments: 1) using MAFFT directly without edge trimming; and 2) using MAFFT implemented in PHYLUCE where edge trimming is conducted as a default setting. We estimated phylogenies using IQ-TREE2 on six datasets ( Table 2 ; datasets 5-10): SPAdes and genome assemblies before and post edge trimming; SPAdes and genome assemblies with Moho braccatus ; Trinity and genome assemblies before and post edge trimming; three Trinity assemblies for fresh tissue samples, two SPAdes assemblies for antique samples ( Hypocolius ampelinus and Moho nobilis ) and genome assemblies. Conflicting sites between SPAdes and Trinity assemblies We evaluated base conflicts in the UCEs extracted from SPAdes and Trinity assemblies. For each of the five Bombycilloidea species, we first identified overlapping UCE loci recovered by both SPAdes and Trinity assemblies and performed pairwise alignments in MAFFT. We used snp-sites v.2.0.3 ( Page et al., 2016 ) to generate a vcf file for each alignment and output the location of variable sites (i.e., conflicts). We counted the number of conflicting sites and tallied the type of bases in each assembly. The relative location of each conflicting site was calculated based on its position in the alignment (position divided by alignment length, where small and large numbers indicate locations near the edges and numbers close to 0.5 indicate more central location in the alignment). Since PHYLUCE performed a default edge-trimming on all alignments, we repeated this process both before and post trimming sequences. To assess whether conflicting sites between SPAdes and Trinity assemblies would affect tree topology or branch length estimates, we conducted additional analyses in IQ-TREE2 using two datasets. First, we treated each SPAdes or Trinity assembly as a separate taxon, creating a dataset that included five SPAdes assemblies, five Trinity assemblies, five genome assemblies and Moho braccatus ( Table 2 ; data set 11). Second, we generated strict consensus sequences for the overlapping UCE loci that were recovered by both SPAdes and Trinity assemblies, using a custom Perl script; for any site where the two assemblies disagreed, we treated that site as ambiguous (“-”; equivalent to “N” or “?” in tree inference using IQ-TREE). This dataset included consensus sequences from five taxa, five genome assemblies and Moho braccatus ( Table 2 ; data set 12). Multispecies coalescent analysis We also estimated Bombycilloidea phylogeny by using a multispecies coalescent method, ASTRAL, to analyze the consensus UCE dataset ( Table 2 ; data set 12). We used this dataset since it included the strict consensus sequences between SPAdes and Trinity assemblies for five species, which can minimize sequence errors for gene tree estimation. We estimated a gene tree for each locus alignment using IQ-TREE2 and combined the gene trees to estimate a species tree using weighted ASTRAL Hybrid ( Zhang and Mirarab, 2022 ). Combining UCEs with legacy data To obtain a comprehensive taxon sampling for Bombycilloidea, we combined the consensus UCE dataset with legacy data, which we define as mitochondrial sequences and nuclear gene regions used as phylogenetic markers in the pre-phylogenomic era. The legacy data available from Bombycilloidea include several mitochondrial regions, two nuclear coding regions (RAG1 and RAG2), and two nuclear introns (FIB5 and FIB7). We downloaded published mitogenomes (when they were unavailable, we downloaded individual mitochondrial regions) and nuclear loci from NCBI GenBank (accessions in Suppl. Table S1). We supplemented the sampling by extracting 13 protein-coding mitochondrial genes and two ribosomal regions from target enrichment raw reads using MitoFinder ( Allio et al. 2020 ). To further enrich the data matrix, we extracted the four nuclear sequences available from Bombycilla cedrorum from the five genome assemblies ( Table 1 ) as described by Reddy et al. (2017) , using the pipeline available from https://github.com/aakanksha12/Extract_seq . We used IQ-TREE2 to estimate a phylogeny using the combined dataset (UCEs data set 12 + legacy data) as well as a phylogeny using only the legacy data. Results Phylogenetics based on different UCE data sets In total, we estimated concatenated trees from 12 UCE datasets ( Table 2 ). These trees exhibited two topologies which differed in the relationships among Mohoidae, Hypocoliidae and Hylocitreidae ( Fig. 1 ). When using the original alignments extracted from the large supermatrix of Oliveros et al. (2019) and Braun et al. (2024), the trees supported a sister relationship between Moho and Hypocolius . After adding Phainopepla nitens and Moho braccatus to the Oliveros alignment and re-aligning while preserving the original alignment structure, the tree still supported that Moho and Hypocolius together were sister to Hylocitrea . Two datasets based on our re-assemblies of the target capture data without edging trimming also supported placing Moho sister to Hypocolius . In all of these above trees, the branch for Hypocolius ampelinus (based on an antique sample) was much longer than the other taxa and the other antique sample, Moho nobilis was also somewhat longer than the remaining taxa ( Fig. 1 ). However, trees estimated using the re-aligned Oliveros UCEs and datasets based on our re-assemblies with edge trimming, all supported a sister relationship between Hypocoliidae and Hylocitreidae ( Fig. 1 ). Additionally, none of the taxa exhibited extremely long terminal branches in the trees that support a Hypocoliidae + Hylocitreidae clade. SPAdes vs. Trinity assemblies When we used both assemblers to assemble the target enrichment raw reads for five species, SPAdes in general yielded more data than Trinity (both as the number of loci and the number of sites), especially for the two antique samples from museum specimens, Moho nobilis and Hypocolius ampelinus ( Table 1 ). If the two assemblers produced similar sequences, treating each SPAdes and Trinity assembly as a separate taxon in analyses should result in very short terminal branches (little or no difference between sequences). This was true for taxa with high quality tissues but it was not true for the low-quality samples ( Fig. 2a ). Although the SPAdes and Trinity assemblies derived from the same taxon were consistently grouped together as “sister taxa,” the terminal branches were longer for the tips based on antique samples than for those based on fresh tissues. The tree based on this analysis supported a sister relationship between Hypocolius and Hylocitrea ( Fig. 2a ); i.e., it had the same topology as the trees based on methods that yielded relatively homogeneous branch lengths in Figure 1 . The phylogeny built with consensus sequences between the SPAdes and Trinity assemblies exhibited the same topology and it had 100% support for placing Hypocolius sister to Hylocitrea ( Fig. 2b ). Download figure Open in new tab Fig. 2. a. The maximum likelihood tree using five SPAdes assemblies, five Trinity assemblies, five genome assemblies and WGS reads of Moho braccatus , with each assembly treated as a taxon. b. The maximum likelihood tree estimated using the strict consensus sequences between the SPAdes and Trinity assemblies for five samples (in bold, labeled with *), five genome assemblies and WGS reads of Moho braccatus . Sites that conflicted between the SPAdes and Trinity assemblies were more common at the ends of the UCE sequences ( Fig. 3 ). For the fresh tissues, this pattern disappeared after edge trimming, but it persisted (but to a lesser degree) for the antique samples. Relative to fresh tissues, the two antique samples also had more conflicting sites in total, with proportionately more of these as sites resolved As and Ts in both assemblies (Suppl. Fig. S1). It is possible that some observed conflicts reflect heterozygous sites present in the source DNA that were resolved differently by the two assemblers, the much the larger proportion of conflicting sites in assemblies based on antique samples is more consistent with the hypothesis that most conflicts are artifactual. Transition differences (C – T and A – G) dominated the conflicts we observed for all five samples, although the proportion of transition-type conflicts was below 50% for the two antique samples, reflecting the large number of A – T conflicts. Download figure Open in new tab Fig. 3. Number of conflicting sites between SPAdes and Trinity assemblies and their relative location in the alignment. From zero to one, small and large numbers indicate locations near the edges and numbers close to 0.5 indicate more central location in the alignment. For each species, we looked at alignments before and post edge-trimming. However, we did not observe patterns that are consistent with elevated C-to-T deamination typical of damaged ancient DNA for the two antique samples. Multispecies coalescent method The ASTRAL tree based on the consensus UCE dataset placed Hypocolius and two Moho species, all antique samples, at the base of Bombycilloidea as the successive sisters of the remaining members of the superfamily (Suppl. Fig. S2). The ASTRAL tree also placed Hylocitrea sister to Bombycilla with a short internal branch, conflicting with our phylogenomic trees ( Figs. 1 and 2 ). Braun et al. (2024) explored multiple different multispecies coalescent methods and observed varying placements of the antique samples (including some outside of Bombycilloidea), but all multispecies coalescent analyses of taxa with high-quality sequences recovered the topology our concatenated analyses recover for those taxa and conflict with the topology of those taxa in our ASTRAL tree. Our ASTRAL results are consistent with both simulation and empirical studies that have suggested that gene tree summary methods tend to yield biased results for low quality samples with large amounts of missing data (e.g., Hosner et al., 2016 ; Xi et al., 2016 ; Zhao et al., 2025a ). Therefore, our discussion below focused on results from the concatenation analyses. Comprehensive phylogeny of Bombycilloidea Combining the consensus UCE dataset with legacy data allowed us to produce a phylogeny that included all currently recognized species within Bombycilloidea ( Fig. 4 ). This tree shows congruent backbone topology to the trees built with re-aligned Oliveros UCEs, all datasets based on our re-assemblies with edging trimming ( Fig. 1 ), and the consensus UCE dataset ( Fig. 2 ). For Bombycillidae, Ptiliogonatidae, and Mohoidae, all taxa were placed within their corresponding family with strong support. The internal nodes for the clade comprising Moho nobilis , M. apicalis , and M. bishopi received poor bootstrap support, likely due to large amounts of missing data for the latter two taxa (which were only sampled for four and five legacy loci, respectively; Suppl. Table S1). Consistent with this hypothesis, the Moho nobilis + M. apicalis + M. bishopi clade had the same topology but was strongly supported when the legacy data were analyzed without UCEs (Suppl. Fig. S3; however, excluding the UCEs resulted in a different topology at the base of Bombycilloidea, with no support in that part of the tree). Regardless, the node uniting this clade and Chaetoptila angustipluma was strongly supported, rendering Moho paraphyletic. Download figure Open in new tab Fig. 4. The maximum likelihood tree estimated using the consensus UCE dataset, 15 mitochondrial regions, two nuclear genes and two nuclear introns. Illustrations for Moho nobilis (by John Gerrard Keulemans 1893), Ptiliogonys caudatus (by Joseph Smit 1869) and Hylocitrea bonensis (from Adolf Bernhard Meyer and Lionel William Wiglesworth 1898, The birds of Celebes and the neighbouring islands ; artist unclear) were adapted from images in Wikimedia Commons that are in the Public Domain. Illustrations of Bombycilla garrulus and Hypocolius ampelinus were drawn by M.Z. and Dulus dominicus was drawn by Rick Stanley. Discussion Our results showed that the long branches of Hypocolius ampelinus and Moho nobilis and their sister relationship in previous work ( Braun et al., 2024 ; Oliveros et al., 2019 ) were likely caused by artifactual factors such as alignment errors and missing data that reflect the quality of the source DNA. Simply adding in more taxa, e.g., Moho braccatus , did not break up the attraction of the two long branches. However, when the UCEs were re-aligned using a smaller, more focused taxon set, the branches were shortened and the topology shifted, resulting in a tree with a Hypocolius + Hylocitrea clade sister to Moho . This topology was also supported by our re-assembled datasets that were edge-trimmed and contained more loci and sites. Choice of assemblers had a large impact on the sequences recovered, especially towards the two ends of the UCEs for the antique samples. However, for our focal group, the choice of assemblers and mixed use of assemblies did not explicitly contribute to the topological conflicts observed. Systematics of Bombycilloidea Bombycilloidea is part of the Muscicapida radiation which also includes Muscicapoidea, Regulidae and Certhioidea ( Oliveros et al., 2019 ). However, compared to the latter three clades, Bombycilloidea comprises an extremely small number of species. While previous studies have reported varying relationships among the three outgroup lineages (cf. Braun et al., 2024 ; Oliveros et al., 2019 ; Stiller et al., 2024 ; Zhao et al., 2025b ), our trees have an outgroup topology consistent with that of Stiller et al. (2024 )—which utilized whole genome data—and support Regulidae as the sister group to the clade comprising Muscicapoidea and Certhioidea. Within Bombycilloidea, our results agree with many previous studies that Dulidae is the first lineage to diverge in the superfamily (e.g., Braun et al., 2024 ; Burleigh et al., 2015 ; Oliveros et al., 2019 ; Spellman et al., 2008 ; Zhao et al., 2025b ), although the exact placements of Bombycillidae and Ptiliogonatidae, as well as our focal clade (Mohoidae, Hypocoliidae and Hylocitreidae), have been more unstable. Our trees align with other genomic-scale studies (e.g., Braun et al., 2024 ; Oliveros et al., 2019 ; Zhao et al., 2025b ) showing that Bombycillidae and Ptiliogonatidae diverged successively after Dulidae, and that the clade comprising Mohoidae, Hypocoliidae, and Hylocitreidae is sister to Ptiliogonatidae. For our focal clade, our results support a sister relationship between Hypocoliidae and Hylocitreidae and together they are sister to the extinct Mohoidae. Bombycilloidea is a species-poor but geographically widespread clade. Our phylogeny suggests an early, relatively rapid, speciation into the different families. Oliveros et al. (2019) estimated a crown age at ca. 20 million years ago (Ma) for Bombycilloidea, and the six families diverged between ca. 17–20 Ma. However, since then there has been little net diversification within any of the families. Many of the families have restricted distributions (e.g., Dulidae is endemic to the island of Hispaniola, Mohoidae were endemic to the Hawaiian Islands, and Hylocitreidae is endemic to Sulawesi), though others are much more widespread (e.g., Bombycillidae includes three waxwing species, one found throughout North America, one found in Asia, and one with a circumpolar distribution). This puzzling biogeographic pattern probably suggests a once widespread or highly vagile common ancestor (like the waxwings) that became established on various islands across the globe during the Miocene. However, it is also possible that there were pervasive extinctions in the superfamily that have obscured the biogeographic history of the group, leading to the confusing distribution that we observe today. The common ancestor of Bombycilloidea was likely to be frugivorous given the diets of the extant species. Although the Mohoidae were primarily nectivores (leading to their initial association with the Hawaiian honeycreepers, Meliphagidae; ( Lovette, 2008 ), they also consumed fruit ( Billerman, 2020 ), consistent with the primarily frugivorous diets of other Bombycilloidea. Consistent with a diet primarily composed of foods that are patchily distributed in space and time (e.g., fruiting trees), and thus not readily defensible, many of these species do not defend traditional territories, with Dulus dominicus being highly colonial ( Winkler et al., 2020a ), and others, such as Bombycilla spp. ( Winkler et al., 2020b ), Ptiliogonys caudatus , and some Phainopepla nitens populations ( Winkler et al. 2020c ). Little is known about the behavior of the Mohoidae, but flocks outside of the breeding season has been suggested for many Mohoidae species ( Billerman, 2020 ), as well as for many other Bombycilloidea. The phylogeny within Mohoidae is consistent with Hennig’s “progression rule” as articulated for the Hawaiian Islands by Wagner and Funk (1995) , where species colonize the oldest islands first and later disperse to younger islands. Moho braccatus was distributed on Kauai, the oldest of the major Hawaiian Islands ( Neall and Trewick 2008 ), whereas all other Mohoidae (including Chaetoptila angustipluma ) were on younger islands. Although there are exceptions to the progression rule (Heads 2010), there are many phylogenies of Hawaiian taxa that provide at least partial support for the progression rule, both in birds ( Lerner et al. 2011 ) and in other taxa ( Kleinkopf et al., 2019 ; Knope et al., 2020 ; Rubinoff and Schmitz, 2010 ; Rundell et al., 2004 ). Monophyly of Moho nobilis , M. apicalis , and M. bishopi is also consistent with the similarities among these taxa that were observed before their extinction ( Perkins 1903 ); indeed, these taxa are similar enough that they have sometimes been viewed as subspecies of the same species (cf. Pratt 1979 ). Our phylogeny also suggests that Chaetoptila Sclater, PL 1871 (including C. angustipluma and likely the Chaetoptila species known from fossils) should be subsumed into Moho Lesson, RP 1830 based on priority of the generic names. Effects of inaccurate alignments and alignment trimming Numerous studies have found that inaccurate alignments can result in erroneous topologies (e.g., Ashkenazy et al., 2019 ; Ogden and Rosenberg, 2006 ; Wang et al., 2011 ). Our results also reflected this. We extracted both the Oliveros and Braun alignments directly from large supermatrices that contained hundreds of taxa spanning the Avian Tree of Life and preserved their original alignment structures. These large alignments naturally included many gaps due to accumulated mutations, deletions and insertions across divergent taxa. The highly conserved UCE cores can act like anchor points during alignment to ensure that homologous sequences are easily aligned ( Faircloth et al., 2012 ). However, since DNA from antique samples are often fragmented and damaged, UCEs extracted from these samples are usually short and contain more missing data in their flanking regions. Therefore, aligning sequences from antique samples with less closely related taxa is more likely to result in spurious alignments by chance due to their short length and gappy alignments (de Filippo et al., 2018; Landan and Graur, 2009 ; Smith et al., 1985 ). It has been suggested that large sparse supermatrices are more sensitive to phylogenetic artifacts than smaller but more complete data sets ( Roure et al., 2013 ). To examine the role of alignment, we re-aligned the Oliveros UCEs, focusing on a smaller number of more closely related taxa and conducted edge trimming. Both re-alignment and edge trimming are likely to have reduced spuriously aligned sequences. Our results suggest that long branches can, at least in part, be driven by alignment issues when aligning divergent taxa. Thus, tackling persistent phylogenetic problems, especially those that involve large-scale genomic trees and low-quality samples, may still require examination within a more focused set of species. Additionally, the topological discrepancy could also stem from differences in information content in addition to alignment errors, since the dataset of re-aligned Oliveros UCEs had many more loci and sites than the original Oliveros alignment ( Table 2 ). Alignment trimming is widely used to reduce alignment errors (reviewed by Steenwyk et al., 2023 ), though it often comes with the cost of reducing informative sites. As implemented in the default settings of PHYLUCE, the seqcap_align program is used to trim the edges of resulting alignments to remove poorly aligned or sparsely sampled regions, ensuring that the remaining alignment has sufficient coverage across taxa and is not overly divergent from the consensus ( Faircloth, 2016 ). Therefore, re-aligned UCEs of the more closely related taxon set went through less aggressive edge trimming compared to the Oliveros/Braun alignments—or those that preserved the original alignment structure—which were aligned across hundreds of more distantly related taxa. The Braun alignments also underwent additional internal trimming, thus further reducing the number of sites ( Table 2 ). However, determining the optimal amount of trimming to eliminate alignment errors without compromising phylogenetic information is not straightforward (e.g., Francis and Canfield, 2020 ; Portik and Wiens, 2021 ; Tan et al., 2015 ; Zhao et al., 2025b ). Excessive trimming can reduce phylogenetic signal which might lead to biased phylogenies ( Tan et al., 2015 ). For UCEs, where informative sites tend to accumulate at the two ends, UCE data can be especially sensitive to edge trimming ( Portik and Wiens, 2021 ). Nevertheless, failure to conduct any edge trimming resulted in exceptionally long terminal branches for two antique samples in our dataset ( Moho and Hypocolius ), and this is likely to have led to incorrect topologies ( Fig. 1 ; before edge trimming). Several studies where taxa with sequences generated from antique DNA are clustered as long branches have referred to the observed phenomenon as long branch attraction (e.g., McCullough et al. 2024 ; Zhao et al. 2025a ), but there is a fundamental distinction between long branches due to poor sequence recovery and long branches due to accelerated sequence evolution. Susko and Roger (2021) have argued that the model space is larger for trees with long branches clustered together than trees with long branches separated and suggested this issue is central to long branch attraction; these differences in the size of model space should exist whether the basis for the long branches is biological (i.e., due to accelerated sequence evolution) or technical (i.e., reflecting the consequences of missing data in antique samples). However, the assumption that long branch attraction can be ameliorated by using more “biologically realistic” models that better reflect the processes underlying sequence evolution (e.g., Lartillot et al., 2007 ; Pandey and Braun, 2020 ; Wang et al., 2018 ) clearly does not apply to long branches that reflect missing data and/or sequencing and assembly errors. It may be difficult (or even impossible) to successfully model the processes that result in long branches in trees that include antique samples. Although it is premature to propose general methods for the correction of long branches in trees with antique samples, we found that improving sequence alignments combined with edge trimming had the biggest impact in this study. Conflicting sites are not randomly distributed Although choice of assemblers (SPAdes vs Trinity) did not appear to affect the topology of the Bombycilloidea tree, comparisons of the assemblies produced with the different assemblers revealed conflicting sites towards the two ends of the alignments, especially before edge trimming ( Fig. 3 ). This partly reflects the characteristics of UCEs and target capture sequencing. UCE cores are highly conserved across vertebrates, whereas the flanking regions contain more variable sites as they locate further away from the conserved cores ( Faircloth et al., 2012 ). Those variable sites provide the essential information for phylogenetic inference. However, since the target capture process uses bait sets to target the desired DNA fragments (i.e., the UCE cores), and allows for selective enrichment of these regions, the flanking regions will have lower read coverage. Low read coverage is expected to reduce the ability of any assembler to detect sequencing errors and extending contigs reliably (e.g., Ayling et al., 2020 ; Haiminen et al., 2011 ). Antique samples with DNA fragmentation and damage are likely to be prone to a higher rate of sequencing errors and the flanking regions may have even more heterogeneous read coverage. Consistent with this, we found more conflicts between the two assemblies for antique samples than the fresh tissue samples, and many conflicts persisted even after edge trimming ( Fig. 3 ). This pattern was also reflected as relatively longer terminal branches in the tree with different assemblies treated as independent taxa ( Fig. 2a ). When edge trimming was used to remove potential sequencing and alignment errors, our datasets produced congruent topologies in supporting a sister relationship between Hypocolius and Hylocitrea . Both Trinity ( Hölzer and Marz, 2019 ) and the single-cell mode of SPAdes ( Nurk et al., 2013 ) were designed to accommodate highly non-uniform read coverage such as in transcriptome and single-cell sequencing data; while Trinity utilizes a single k-mer size when building the de Bruijn graph, SPAdes uses a set of k-mer sizes to refine the assembly with multiple steps of read error correction. Though tested with a limited number of samples, we found that SPAdes in general recovered more UCE loci and sites than Trinity, especially for antique samples ( Table 1 ). As the true sequence is unknown, generating the consensus between the SPAdes and Trinity assemblies could be an effective way to resolve these conflicts ( Fig. 2b ), although this came at the cost of reducing sites since we treated conflicting sites as ambiguous. Future studies can apply these methods, e.g., treating an assembly as an independent taxon, or generating consensus sequences, to assess the performance of different assemblies and their impact on estimated phylogenies. Conclusions The phylogenomic era has made it possible to imagine generating an estimate of the Tree of Life that is both complete and accurate, but it will be necessary to include data from both ancient and antique samples to achieve this goal. Antique samples may be especially important, because they can provide access not just to data from extinct taxa (like Mohoidae) and but also from extant taxa where tissues are unavailable either because they are distributed in areas without recent expeditions (like Hypocoliidae) or because their conservation status precludes their collection. Antique samples are likely to be associated with higher levels of missing data and higher rates of sequencing errors. Therefore, the sequences from antique samples may yield long branches and, potentially, incorrect topologies when they are analyzed. We suggest that studies where long branches are recovered with antique DNA samples should use multiple assembles to examine error rates, examine the impact of sequence alignment, and explore sequence trimming. In this study, these approaches allowed us to generate a strongly supported backbone phylogeny for Bombycilloidea without long branches. Combining the clean phylogenomic data with legacy sequence data ultimately allowed us to generate a complete phylogeny for the superfamily. Data availability The data that support the findings of this study (alignments and tree files) are openly available in figshare at http://doi.org/10.6084/m9.figshare.28255466 . Acknowledgements We thank Rick Stanley for drawing the palmchat illustration. The Kimball-Braun lab provided helpful comments on earlier drafts of this manuscript. This work was supported by a National Science Foundation grant (DEB 1655683) to RTK and ELB. Funder Information Declared NSF DEB , 1655683 References ↵ Allio , R. , Schomaker-Bastos , A. , Romiguier , J. , Prosdocimi , F. , Nabholz , B. , Delsuc , F ., 2020 . MitoFinder: Efficient automated large-scale extraction of mitogenomic data in target enrichment phylogenomics . Mol. Ecol. Resour 20 , 892 – 905 . doi: 10.1111/1755-0998.13160 . OpenUrl CrossRef ↵ Alström , P. , Hooper , D.M. , Liu , Y. , Olsson , U. , Mohan , D. , Gelang , M. , Le Manh , H. , Zhao , J. , Lei , F. , Price , T.D. , 2014 . Discovery of a relict lineage and monotypic family of passerine birds . Biol. Lett . 10 , 20131067 . doi: 10.1098/rsbl.2013.1067 . OpenUrl CrossRef PubMed ↵ Ashkenazy , H. , Sela , I. , Levy Karin , E. , Landan , G. , Pupko , T ., 2019 . Multiple sequence alignment averaging improves phylogeny reconstruction . Syst. Biol . 68 , 117 – 130 . doi: 10.1093/sysbio/syy036 . OpenUrl CrossRef PubMed ↵ Ayling , M. , Clark , M.D. , Leggett , R.M ., 2020 . New approaches for metagenome assembly with short reads . Brief. Bioinformatics 21 , 584 – 594 . doi: 10.1093/bib/bbz020 . OpenUrl CrossRef ↵ Bergsten , J ., 2005 . A review of long-branch attraction . Cladistics 21 , 163 – 193 . doi: 10.1111/j.1096-0031.2005.00059.x . OpenUrl CrossRef PubMed Web of Science ↵ Bernstein , J.M. , Ruane , S ., 2022 . Maximizing Molecular Data From Low-Quality Fluid-Preserved Specimens in Natural History Collections . Front. Ecol. Evol . 10 . doi: 10.3389/fevo.2022.893088 . OpenUrl CrossRef ↵ Billerman , S.M. , 2020 . Hawaiian Honeyeaters (Mohoidae), in: Billerman, S.M., Keeney, B.K., Rodewald, P.G., Schulenberg, T.S. (Eds.), Birds of the World. Cornell Lab of Ornithology. doi: 10.2173/bow.mohoid1.01 . ↵ Bolger , A.M. , Lohse , M. , Usadel , B ., 2014 . Trimmomatic: A flexible trimmer for Illumina sequence data . Bioinformatics 30 , 2114 – 2120 . doi: 10.1093/bioinformatics/btu170 . OpenUrl CrossRef PubMed Web of Science ↵ Kraus , R Braun , E.L. , Cracraft , J. , Houde , P ., 2019 . Resolving the Avian Tree of Life from Top to Bottom: The Promise and Potential Boundaries of the Phylogenomic Era . In: Kraus , R . (eds) Avian Genomics in Ecology and Evolution . Springer , Cham . doi: 10.1007/978-3-030-16477-5_6 OpenUrl CrossRef ↵ Braun , E.L. , Oliveros , C.H. , White Carreiro , N.D. , Zhao , M. , Glenn , T.C. , Brumfield , R.T. , Braun , M.J. , Kimball , R.T. , Faircloth , B.C ., 2024 . Testing the mettle of METAL: A comparison of phylogenomic methods using a challenging but well-resolved phylogeny . BioRxiv . doi: 10.1101/2024.02.28.582627 . OpenUrl Abstract / FREE Full Text ↵ Briggs , A.W. , Stenzel , U. , Johnson , P.L.F. , Green , R.E. , Kelso , J. , Prüfer , K. , Meyer , M. , Krause , J. , Ronan , M.T. , Lachmann , M. , Pääbo , S ., 2007 . Patterns of damage in genomic DNA sequences from a Neandertal . Proc Natl Acad Sci USA 104 , 14616 – 14621 . doi: 10.1073/pnas.0704665104 . OpenUrl Abstract / FREE Full Text ↵ Burleigh , J.G. , Kimball , R.T. , Braun , E.L ., 2015 . Building the avian tree of life using a large-scale, sparse supermatrix . Mol. Phylogenet. Evol . 84 , 53 – 63 . doi: 10.1016/j.ympev.2014.12.003 . OpenUrl CrossRef PubMed ↵ Cai , T. , Cibois , A. , Alström , P. , Moyle , R.G. , Kennedy , J.D. , Shao , S. , Zhang , R. , Irestedt , M. , Ericson , P.G.P. , Gelang , M. , Qu , Y. , Lei , F. , Fjeldså , J ., 2019 . Near-complete phylogeny and taxonomic revision of the world’s babblers (Aves: Passeriformes) . Mol. Phylogenet. Evol . 130 , 346 – 356 . doi: 10.1016/j.ympev.2018.10.010 . OpenUrl CrossRef PubMed ↵ Dabney , J. , Meyer , M. , Pääbo , S ., 2013 . Ancient DNA damage . Cold Spring Harb. Perspect. Biol . 5 , a012567 . doi: 10.1101/cshperspect.a012567 . OpenUrl Abstract / FREE Full Text ↵ Daley , T. , Smith , A.D ., 2014 . Modeling genome coverage in single-cell sequencing . Bioinformatics 30 , 3159 – 3165 . doi: 10.1093/bioinformatics/btu540 . OpenUrl CrossRef PubMed ↵ Danecek , P. , Bonfield , J.K. , Liddle , J. , Marshall , J. , Ohan , V. , Pollard , M.O. , Whitwham , A. , Keane , T. , McCarthy , S.A. , Davies , R.M. , Li , H ., 2021 . Twelve years of SAMtools and BCFtools . Gigascience 10 . doi: 10.1093/gigascience/giab008 . OpenUrl CrossRef PubMed de Filippo, C., Meyer, M., Prüfer, K., 2018 . Quantifying and reducing spurious alignments for the analysis of ultra-short ancient DNA sequences . BMC Biol . 16 , 121 . doi: 10.1186/s12915-018-0581-9 . OpenUrl CrossRef PubMed ↵ Derkarabetian , S. , Benavides , L.R. , Giribet , G ., 2019 . Sequence capture phylogenomics of historical ethanol-preserved museum specimens: Unlocking the rest of the vault . Mol. Ecol. Resour 19 , 1531 – 1544 . doi: 10.1111/1755-0998.13072 . OpenUrl CrossRef PubMed ↵ Faircloth , B.C ., 2016 . PHYLUCE is a software package for the analysis of conserved genomic loci . Bioinformatics 32 , 786 – 788 . doi: 10.1093/bioinformatics/btv646 . OpenUrl CrossRef PubMed ↵ Faircloth , B.C. , McCormack , J.E. , Crawford , N.G. , Harvey , M.G. , Brumfield , R.T. , Glenn , T.C ., 2012 . Ultraconserved elements anchor thousands of genetic markers spanning multiple evolutionary timescales . Syst. Biol . 61 , 717 – 726 . doi: 10.1093/sysbio/sys004 . OpenUrl CrossRef PubMed ↵ Feng , S. , Stiller , J. , Deng , Y. , Armstrong , J. , Fang , Q.I. , Reeve , A.H. , Xie , D. , Chen , G. , Guo , C. , Faircloth , B.C. , Petersen , B. , … Zhang G ., 2020 . Dense sampling of bird diversity increases power of comparative genomics . Nature , 587 , 252 – 257 . doi: 10.1038/s41586-020-2873-9 OpenUrl CrossRef PubMed ↵ Fleischer , R.C. , James , H.F. , Olson , S.L ., 2008 . Convergent evolution of Hawaiian and Australo-Pacific honeyeaters from distant songbird ancestors . Curr. Biol . 18 , 1927 – 1931 . doi: 10.1016/j.cub.2008.10.051 . OpenUrl CrossRef PubMed Web of Science ↵ Francis , W.R. , Canfield , D.E ., 2020 . Very few sites can reshape the inferred phylogenetic tree . PeerJ 8 , e8865 . doi: 10.7717/peerj.8865 . OpenUrl CrossRef PubMed Ginolhac , A. , Rasmussen , M. , Gilbert , M.T.P. , Willerslev , E. , Orlando , L ., 2011 . mapDamage: testing for damage patterns in ancient DNA sequences . Bioinformatics 27 , 2153 – 2155 . doi: 10.1093/bioinformatics/btr347 . OpenUrl CrossRef PubMed Web of Science ↵ Haiminen , N. , Kuhn , D.N. , Parida , L. , Rigoutsos , I ., 2011 . Evaluation of methods for de novo genome assembly from high-throughput sequencing reads reveals dependencies that affect the quality of the results. PLoS ONE 6 , e24182 . doi: 10.1371/journal.pone.0024182 . OpenUrl CrossRef PubMed ↵ Harvey , M.G. , Bravo , G.A. , Claramunt , S. , Cuervo , A.M. , Derryberry , G.E. , Battilana , J. , Seeholzer , G.F. , McKay , J.S. , O’Meara , B.C. , Faircloth , B.C. , Edwards , S.V. , Pérez-Emán , J. , Moyle , R.G. , Sheldon , F.H. , Aleixo , A. , Smith , B.T. , Chesser , R.T. , Silveira , L.F. , Cracraft , J. , Brumfield , R.T. , Derryberry , E.P ., 2020 . The evolution of a tropical biodiversity hotspot . Science 370 , 1343 – 1348 . doi: 10.1126/science.aaz6970 . OpenUrl Abstract / FREE Full Text Heads , M ., 2011 . Old taxa on young islands: a critique of the use of island age to date island-endemic clades and calibrate phylogenies . Syst. Biol . 60 , 204 – 218 . doi: 10.1093/sysbio/syq075 . OpenUrl CrossRef PubMed Hoang, D.T., Chernomor, O., von Haeseler, A., Minh, B.Q., Vinh, L.S., 2018 . Ufboot2: improving the ultrafast bootstrap approximation . Mol. Biol. Evol . 35 , 518 – 522 . doi: 10.1093/molbev/msx281 . OpenUrl CrossRef PubMed ↵ Hölzer , M. , Marz , M ., 2019 . De novo transcriptome assembly: a comprehensive cross-species comparison of short-read RNA-Seq assemblers. Gigascience 8 , giz039 . doi: 10.1093/gigascience/giz039 . OpenUrl CrossRef PubMed ↵ Hosner , P.A. , Faircloth , B.C. , Glenn , T.C. , Braun , E.L. , Kimball , R.T ., 2016 . Avoiding missing data biases in phylogenomic inference: an empirical study in the landfowl (aves: galliformes) . Mol. Biol. Evol . 33 , 1110 – 1125 . doi: 10.1093/molbev/msv347 . OpenUrl CrossRef PubMed ↵ Islam , R. , Raju , R.S. , Tasnim , N. , Shihab , I.H. , Bhuiyan , M.A. , Araf , Y. , Islam , T ., 2021 . Choice of assemblers has a critical impact on de novo assembly of SARS-CoV-2 genome and characterizing variants. Brief. Bioinformatics 22 . doi: 10.1093/bib/bbab102 . OpenUrl CrossRef PubMed ↵ James , H.F. , Olson , S.L ., 1991 . Descriptions of Thirty-Two New Species of Birds from the Hawaiian Islands: Part II . Passeriformes. Ornithol. Monogr . 1 – 88 . doi: 10.2307/40166713 . OpenUrl CrossRef ↵ James , T.Y. , Pelin , A. , Bonen , L. , Ahrendt , S. , Sain , D. , Corradi , N. , Stajich , J.E ., 2013 . Shared signatures of parasitism and phylogenomics unite Cryptomycota and microsporidia . Curr. Biol . 23 , 1548 – 1553 . doi: 10.1016/j.cub.2013.06.057 . OpenUrl CrossRef PubMed ↵ Jarvis , E.D. , Mirarab , S. , Aberer , A.J. , Li , B. , Houde , P. , Li , C. , Ho , S.Y.W. , Faircloth , B.C. , Nabholz , B. , Howard , J.T. , Suh , A. , Weber , C.C. , da Fonseca , R.R. , Li , J. , Zhang , F. , Li , H. , Zhou , L. , Narula , N. , Liu , L. , Ganapathy , G. , … Zhang , G. , 2014 . Whole-genome analyses resolve early branches in the tree of life of modern birds . Science 346 , 1320 – 1331 . doi: 10.1126/science.1253451 . OpenUrl Abstract / FREE Full Text ↵ Kalyaanamoorthy , S. , Minh , B.Q. , Wong , T.K.F. , von Haeseler , A. , Jermiin , L.S. , 2017 . ModelFinder: fast model selection for accurate phylogenetic estimates . Nat. Methods 14 , 587 – 589 . doi: 10.1038/nmeth.4285 . OpenUrl CrossRef PubMed ↵ Kapli , P. , Yang , Z. , Telford , M.J ., 2020 . Phylogenetic tree building in the genomic age . Nat. Rev. Genet . 21 , 428 – 444 . doi: 10.1038/s41576-020-0233-0 . OpenUrl CrossRef ↵ Katoh , K. , Frith , M.C ., 2012 . Adding unaligned sequences into an existing alignment using MAFFT and LAST . Bioinformatics 28 , 3144 – 3146 . doi: 10.1093/bioinformatics/bts578 . OpenUrl CrossRef PubMed Web of Science ↵ Katoh , K. , Standley , D.M ., 2013 . MAFFT multiple sequence alignment software version 7: improvements in performance and usability . Mol. Biol. Evol . 30 , 772 – 780 . doi: 10.1093/molbev/mst010 . OpenUrl CrossRef PubMed Web of Science ↵ Kimball , R.T. , Hosner , P.A. , & Braun , E.L ., 2021 . A phylogenomic supermatrix of Galliformes (Landfowl) reveals biased branch lengths . Mol. Phylogenet. Evol ., 158 , 107091 . doi: 10.1016/j.ympev.2021.107091 OpenUrl CrossRef PubMed ↵ Kleinkopf , J.A. , Roberts , W.R. , Wagner , W.L. , Roalson , E.H ., 2019 . Diversification of Hawaiian Cyrtandra (Gesneriaceae) under the influence of incomplete lineage sorting and hybridization . J. Syst. Evol . 57 , 561 – 578 . doi: 10.1111/jse.12519 . OpenUrl CrossRef ↵ Knope , M.L. , Bellinger , M.R. , Datlof , E.M. , Gallaher , T.J. , Johnson , M.A ., 2020 . Insights into the Evolutionary History of the Hawaiian Bidens (Asteraceae) Adaptive Radiation Revealed Through Phylogenomics . J. Hered . 111 , 119 – 137 . doi: 10.1093/jhered/esz066 . OpenUrl CrossRef PubMed ↵ Kuhl , H. , Frankl-Vilches , C. , Bakker , A. , Mayr , G. , Nikolaus , G. , Boerno , S.T. , Klages , S. , Timmermann , B. and Gahr , M ., 2021 . An unbiased molecular approach using 3ʹ-UTRs resolves the avian family-level tree of life . Mol. Biol. Evol . 38 , 108 – 127 . doi: 10.1093/molbev/msaa191 OpenUrl CrossRef PubMed ↵ Lamichhaney , S. , Berglund , J. , Almén , M.S. , Maqbool , K. , Grabherr , M. , Martinez-Barrio , A. , Promerová , M. , Rubin , C.-J. , Wang , C. , Zamani , N. , Grant , B.R. , Grant , P.R. , Webster , M.T. , Andersson , L ., 2015 . Evolution of Darwin’s finches and their beaks revealed by genome sequencing . Nature 518 , 371 – 375 . doi: 10.1038/nature14181 . OpenUrl CrossRef PubMed ↵ Landan , G. , Graur , D ., 2009 . Characterization of pairwise and multiple sequence alignment errors . Gene 441 , 141 – 147 . doi: 10.1016/j.gene.2008.05.016 . OpenUrl CrossRef PubMed Web of Science ↵ Langmead , B. , Salzberg , S.L ., 2012 . Fast gapped-read alignment with Bowtie 2 . Nat. Methods 9 , 357 – 359 . doi: 10.1038/nmeth.1923 . OpenUrl CrossRef PubMed Web of Science ↵ Lartillot , N. , Brinkmann , H. , Philippe , H ., 2007 . Suppression of long-branch attraction artefacts in the animal phylogeny using a site-heterogeneous model . BMC Evol. Biol . 7 Suppl 1 , S4 . doi: 10.1186/1471-2148-7-S1-S4 . OpenUrl CrossRef PubMed ↵ Lerner , H.R.L. , Meyer , M. , James , H.F. , Hofreiter , M. , Fleischer , R.C ., 2011 . Multilocus resolution of phylogeny and timescale in the extant adaptive radiation of Hawaiian honeycreepers . Curr. Biol . 21 , 1838 – 1844 . doi: 10.1016/j.cub.2011.09.039 . OpenUrl CrossRef PubMed ↵ Lovette , I.J ., 2008 . Convergent evolution: raising a family from the dead . Curr. Biol . 18 , R1132 – 4 . doi: 10.1016/j.cub.2008.11.006 . OpenUrl CrossRef PubMed ↵ McCormack , J.E. , Harvey , M.G. , Faircloth , B.C. , Crawford , N.G. , Glenn , T.C. , Brumfield , R.T ., 2013 . A phylogeny of birds based on over 1,500 loci collected by target enrichment and high-throughput sequencing . PLoS ONE 8 , e54848 . doi: 10.1371/journal.pone.0054848 . OpenUrl CrossRef PubMed ↵ McCormack , J.E. , Tsai , W.L.E. , Faircloth , B.C ., 2016 . Sequence capture of ultraconserved elements from bird museum specimens . Mol. Ecol. Resour 16 , 1189 – 1203 . doi: 10.1111/1755-0998.12466 . OpenUrl CrossRef ↵ McCullough , J.M. , Eliason , C.M. , Hackett , S. , Myers , C.E. and Andersen , M.J ., 2024 . Phylogenomics of a genus of ‘Great Speciators’ reveals rampant incomplete lineage sorting, gene flow, and mitochondrial capture in island systems . bioRxiv 2024 . 08 .28.610082. doi: 10.1101/2024.08.28.610082 . OpenUrl Abstract / FREE Full Text ↵ McCullough , J.M. , Moyle , R.G. , Smith , B.T. , Andersen , M.J ., 2019 . A Laurasian origin for a pantropical bird radiation is supported by genomic and fossil data (Aves: Coraciiformes) . Proc. Biol. Sci . 286 , 20190122 . doi: 10.1098/rspb.2019.0122 . OpenUrl CrossRef PubMed ↵ Minh , B.Q. , Schmidt , H.A. , Chernomor , O. , Schrempf , D. , Woodhams , M.D. , von Haeseler , A. , Lanfear , R. , 2020 . IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era . Mol. Biol. Evol . 37 , 1530 – 1534 . doi: 10.1093/molbev/msaa015 . OpenUrl CrossRef PubMed ↵ Moyle , R.G. , Oliveros , C.H. , Andersen , M.J. , Hosner , P.A. , Benz , B.W. , Manthey , J.D. , Travers , S.L. , Brown , R.M. , Faircloth , B.C ., 2016 . Tectonic collision and uplift of Wallacea triggered the global songbird radiation . Nat. Commun . 7 , 12709 . doi: 10.1038/ncomms12709 . OpenUrl CrossRef PubMed ↵ Nash , J.A. , Harrington , R.C. , Zyskowski , K. , Near , T.J. , Prum , R.O ., 2024 . Species status and phylogenetic relationships of the enigmatic Negros Fruit Dove (Ptilinopus arcanus) . Ibis 166 , 1023 – 1040 . doi: 10.1111/ibi.13305 . OpenUrl CrossRef ↵ Neall , V.E. , Trewick , S.A ., 2008 . The age and origin of the Pacific islands: a geological overview . Philos. Trans. R. Soc. Lond. B Biol. Sci . 363 , 3293 – 3308 . doi: 10.1098/rstb.2008.0119 . OpenUrl CrossRef GeoRef PubMed ↵ Nurk , S. , Bankevich , A. , Antipov , D. , Gurevich , A. , Korobeynikov , A. , Lapidus , A. , Prjibelsky , A. , Pyshkin , A. , Sirotkin , A. , Sirotkin , Y. , Stepanauskas , R. , McLean , J. , Lasken , R. , Clingenpeel , S.R. , Woyke , T. , Tesler , G. , Alekseyev , M.A. , Pevzner , P.A. , 2013 . Assembling Genomes and Mini-metagenomes from Highly Chimeric Reads, in: Deng, M., Jiang, R., Sun, F., Zhang, X. (Eds.), Research in Computational Molecular Biology, Lecture Notes in Computer Science . Springer Berlin Heidelberg , Berlin, Heidelberg , pp. 158 – 170 . doi: 10.1007/978-3-642-37195-0_13 . OpenUrl CrossRef ↵ Ogden , T.H. , Rosenberg , M.S ., 2006 . Multiple sequence alignment accuracy and phylogenetic inference . Syst. Biol . 55 , 314 – 328 . doi: 10.1080/10635150500541730 . OpenUrl CrossRef PubMed Web of Science ↵ Oliveros , C.H. , Field , D.J. , Ksepka , D.T. , Barker , F.K. , Aleixo , A. , Andersen , M.J. , Alström , P. , Benz , B.W. , Braun , E.L. , Braun , M.J. , Bravo , G.A. , Brumfield , R.T. , Chesser , R.T. , Claramunt , S. , Cracraft , J. , Cuervo , A.M. , Derryberry , E.P. , Glenn , T.C. , Harvey , M.G. , Hosner , P.A. , Faircloth , B.C ., 2019 . Earth history and the passerine superradiation . Proc Natl Acad Sci USA 116 , 7916 – 7925 . doi: 10.1073/pnas.1813206116 . OpenUrl Abstract / FREE Full Text ↵ Pääbo , S. , Poinar , H. , Serre , D. , Jaenicke-Despres , V. , Hebler , J. , Rohland , N. , Kuch , M. , Krause , J. , Vigilant , L. , Hofreiter , M ., 2004 . Genetic analyses from ancient DNA . Annu. Rev. Genet . 38 , 645 – 679 . doi: 10.1146/annurev.genet.37.110801.143214 . OpenUrl CrossRef PubMed Web of Science ↵ Page , A.J. , Taylor , B. , Delaney , A.J. , Soares , J. , Seemann , T. , Keane , J.A. , Harris , S.R ., 2016 . SNP-sites: rapid efficient extraction of SNPs from multi-FASTA alignments . Microb. Genom . 2 , e000056 . doi: 10.1099/mgen.0.000056 . OpenUrl CrossRef PubMed Pais , F.S.-M. , Ruy, P. de C., Oliveira, G., Coimbra, R.S., 2014 . Assessing the efficiency of multiple sequence alignment programs . Algorithms Mol. Biol . 9 , 4 . doi: 10.1186/1748-7188-9-4 . OpenUrl CrossRef ↵ Pandey , A. , Braun , E.L ., 2020 . Phylogenetic analyses of sites in different protein structural environments result in distinct placements of the metazoan root . Biology (Basel ) 9 . doi: 10.3390/biology9040064 . OpenUrl CrossRef PubMed ↵ Sharp , D Perkins R.C.L ., 1903 . Vertebrate , in Sharp , D , (Eds.), Fauna Hawaiiensis or the zoology of the Sandwich (Hawaiian) isles . United Kingdom : The University Press . ↵ Philippe , H. , Brinkmann , H. , Lavrov , D.V. , Littlewood , D.T.J. , Manuel , M. , Wörheide , G. , Baurain , D ., 2011 . Resolving difficult phylogenetic questions: why more sequences are not enough . PLoS Biol . 9 , e1000602 . doi: 10.1371/journal.pbio.1000602 . OpenUrl CrossRef PubMed ↵ Pightling , A.W. , Petronella , N. , Pagotto , F ., 2014 . Choice of reference sequence and assembler for alignment of Listeria monocytogenes short-read sequence data greatly influences rates of error in SNP analyses . PLoS ONE 9 , e104579 . doi: 10.1371/journal.pone.0104579 . OpenUrl CrossRef PubMed ↵ Pightling , A.W. , Petronella , N. , Pagotto , F ., 2015 . Choice of reference-guided sequence assembler and SNP caller for analysis of Listeria monocytogenes short-read sequence data greatly influences rates of error . BMC Res. Notes 8 , 748 . doi: 10.1186/s13104-015-1689-4 . OpenUrl CrossRef PubMed ↵ Portik , D.M. , Wiens , J.J ., 2021 . Do alignment and trimming methods matter for phylogenomic (UCE) analyses? Syst. Biol . 70 , 440 – 462 . doi: 10.1093/sysbio/syaa064 . OpenUrl CrossRef PubMed ↵ Pratt , H.D ., 1979 . A systematic analysis of the endemic avifauna of the Hawaiian Islands. search . proquest.com . ↵ Prjibelski , A. , Antipov , D. , Meleshko , D. , Lapidus , A. , Korobeynikov , A ., 2020 . Using SPAdes de novo assembler. Curr. Protoc. Bioinformatics 70 , e102 . doi: 10.1002/cpbi.102 . OpenUrl CrossRef PubMed ↵ Reddy , S. , Kimball , R.T. , Pandey , A. , Hosner , P.A. , Braun , M.J. , Hackett , S.J. , Han , K.L. , Harshman , J. , Huddleston , C.J. , Kingston , S. and Marks , B.D ., 2017 . Why do phylogenomic data sets yield conflicting trees? Data type influences the avian tree of life more than taxon sampling . Systematic biology , 66 , 857 – 879 . doi: 10.1093/sysbio/syx041 OpenUrl CrossRef PubMed ↵ Ribeiro , P.G. , Torres Jiménez , M.F. , Andermann , T. , Antonelli , A. , Bacon , C.D. , Matos-Maraví , P ., 2021 . A bioinformatic platform to integrate target capture and whole genome sequences of various read depths for phylogenomics . Mol. Ecol . 30 , 6021 – 6035 . doi: 10.1111/mec.16240 . OpenUrl CrossRef ↵ Rick , J.A. , Brock , C.D. , Lewanski , A.L. , Golcher-Benavides , J. , Wagner , C.E ., 2024 . Reference genome choice and filtering thresholds jointly influence phylogenomic analyses . Syst. Biol . 73 , 76 – 101 . doi: 10.1093/sysbio/syad065 . OpenUrl CrossRef PubMed ↵ Roure , B. , Baurain , D. , Philippe , H ., 2013 . Impact of missing data on phylogenies inferred from empirical phylogenomic data sets . Mol. Biol. Evol . 30 , 197 – 214 . doi: 10.1093/molbev/mss208 . OpenUrl CrossRef PubMed Web of Science ↵ Rubinoff , D. , Schmitz , P ., 2010 . Multiple aquatic invasions by an endemic, terrestrial Hawaiian moth radiation . Proc Natl Acad Sci USA 107 , 5903 – 5906 . doi: 10.1073/pnas.0912501107 . OpenUrl Abstract / FREE Full Text ↵ Rundell , R.J. , Holland , B.S. , Cowie , R.H ., 2004 . Molecular phylogeny and biogeography of the endemic Hawaiian Succineidae (Gastropoda: Pulmonata) . Mol. Phylogenet. Evol . 31 , 246 – 255 . doi: 10.1016/j.ympev.2003.07.014 . OpenUrl CrossRef PubMed Web of Science ↵ Sackton , T.B. , Grayson , P. , Cloutier , A. , Hu , Z. , Liu , J.S. , Wheeler , N.E. , Gardner , P.P. , Clarke , J.A. , Baker , A.J. , Clamp , M. , Edwards , S.V ., 2019 . Convergent regulatory evolution and loss of flight in paleognathous birds . Science 364 , 74 – 78 . doi: 10.1126/science.aat7244 . OpenUrl Abstract / FREE Full Text ↵ Schmieder , R. , Edwards , R ., 2011 . Quality control and preprocessing of metagenomic datasets . Bioinformatics 27 , 863 – 864 . doi: 10.1093/bioinformatics/btr026 . OpenUrl CrossRef PubMed Web of Science ↵ Smith , B.T. , Merwin , J. , Provost , K.L. , Thom , G. , Brumfield , R.T. , Ferreira , M. , Mauck , W.M. , Moyle , R.G. , Wright , T.F. , Joseph , L ., 2023 . Phylogenomic Analysis of the Parrots of the World Distinguishes Artifactual from Biological Sources of Gene Tree Discordance . Syst. Biol . 72 , 228 – 241 . doi: 10.1093/sysbio/syac055 . OpenUrl CrossRef ↵ Smith , M.D. , Wertheim , J.O. , Weaver , S. , Murrell , B. , Scheffler , K. , Kosakovsky Pond , S.L ., 2015 . Less is more: an adaptive branch-site random effects model for efficient detection of episodic diversifying selection . Mol. Biol. Evol . 32 , 1342 – 1353 . doi: 10.1093/molbev/msv022 . OpenUrl CrossRef PubMed ↵ Smith , T.F. , Waterman , M.S. , Burks , C ., 1985 . The statistical distribution of nucleic acid similarities . Nucleic Acids Res . 13 , 645 – 656 . doi: 10.1093/nar/13.2.645 . OpenUrl CrossRef PubMed Web of Science ↵ Spellman , G.M. , Cibois , A. , Moyle , R.G. , Winker , K. , Keith Barker , F ., 2008 . Clarifying the systematics of an enigmatic avian lineage: what is a bombycillid? Mol. Phylogenet. Evol . 49 , 1036 – 1040 . doi: 10.1016/j.ympev.2008.09.006 . OpenUrl CrossRef PubMed Web of Science ↵ Steenwyk , J.L. , Li , Y. , Zhou , X. , Shen , X.-X. , Rokas , A ., 2023 . Incongruence in the phylogenomics era . Nat. Rev. Genet . 24 , 834 – 850 . doi: 10.1038/s41576-023-00620-x . OpenUrl CrossRef ↵ Stiller , J. , Feng , S. , Chowdhury , A.-A. , Rivas-González , I. , Duchêne , D.A. , Fang , Q. , Deng , Y. , Kozlov , A. , Stamatakis , A. , Claramunt , S. , Nguyen , J.M.T. , Ho , S.Y.W. , Faircloth , B.C. , Haag , J. , Houde , P. , Cracraft , J. , Balaban , M. , Mai , U. , Chen , G. , Gao , R. , … Zhang , G ., 2024 . Complexity of avian evolution revealed by family-level genomes . Nature 629 , 851 – 860 . doi: 10.1038/s41586-024-07323-1 . OpenUrl CrossRef ↵ Sun , K. , Meiklejohn , K. A. , Faircloth , B. C. , Glenn , T. C. , Braun , E. L. , & Kimball , R. T ., 2014 . The evolution of peafowl and other taxa with ocelli (eyespots): a phylogenomic approach . Proc. Biol. Sci , 281 , 20140823 . doi: 10.1098/rspb.2014.0823 OpenUrl CrossRef PubMed ↵ Susko , E. , Roger , A.J ., 2021 . Long branch attraction biases in phylogenetics . Syst. Biol . 70 , 838 – 843 . doi: 10.1093/sysbio/syab001 . OpenUrl CrossRef PubMed ↵ Tan , G. , Muffato , M. , Ledergerber , C. , Herrero , J. , Goldman , N. , Gil , M. , Dessimoz , C ., 2015 . Current Methods for Automated Filtering of Multiple Sequence Alignments Frequently Worsen Single-Gene Phylogenetic Inference . Syst. Biol . 64 , 778 – 791 . doi: 10.1093/sysbio/syv033 . OpenUrl CrossRef PubMed ↵ Thompson , J.D. , Linard , B. , Lecompte , O. , Poch , O ., 2011 . A comprehensive benchmark study of multiple sequence alignment methods: current challenges and future perspectives . PLoS ONE 6 , e18093 . doi: 10.1371/journal.pone.0018093 . OpenUrl CrossRef PubMed ↵ Valiente-Mullor , C. , Beamud , B. , Ansari , I. , Francés-Cuesta , C. , García-González , N. , Mejía , L. , Ruiz-Hueso , P. , González-Candelas , F ., 2021 . One is not enough: On the effects of reference genome for the mapping and subsequent analyses of short-reads . PLoS Comput. Biol . 17 , e1008678 . doi: 10.1371/journal.pcbi.1008678 . OpenUrl CrossRef ↵ Wagner , W.L. , Funk , V.A ., 1995 . Hawaiian biogeography: evolution on a hot spot archipelago . Smithsonian Institution Press , Washington . ↵ Wandeler , P. , Hoeck , P.E.A. , Keller , L.F ., 2007 . Back to the future: museum specimens in population genetics . Trends Ecol. Evol . 22 , 634 – 642 . doi: 10.1016/j.tree.2007.08.017 . OpenUrl CrossRef PubMed Web of Science ↵ Wandeler , P. , Smith , S. , Morin , P.A. , Pettifor , R.A. , Funk , S.M ., 2003 . Patterns of nuclear DNA degeneration over time-a case study in historic teeth samples . Mol. Ecol . 12 , 1087 – 1093 . doi: 10.1046/j.1365-294X.2003.01807.x . OpenUrl CrossRef PubMed Web of Science ↵ Wang , H.-C. , Minh , B.Q. , Susko , E. , Roger , A.J ., 2018 . Modeling Site Heterogeneity with Posterior Mean Site Frequency Profiles Accelerates Accurate Phylogenomic Estimation . Syst. Biol . 67 , 216 – 235 . doi: 10.1093/sysbio/syx068 . OpenUrl CrossRef PubMed ↵ Wang , L.-S. , Leebens-Mack , J. , Kerr Wall , P. , Beckmann , K. , dePamphilis, C.W., Warnow, T., 2011 . The impact of multiple protein sequence alignment on phylogenetic estimation . IEEE/ACM Trans Comput Biol Bioinform 8 , 1108 – 1119 . doi: 10.1109/TCBB.2009.68 . OpenUrl CrossRef PubMed ↵ Wiens , J.J. , Tiu , J ., 2012 . Highly incomplete taxa can rescue phylogenetic analyses from the negative impacts of limited taxon sampling . PLoS ONE 7 , e42925 . doi: 10.1371/journal.pone.0042925 . OpenUrl CrossRef PubMed ↵ Winkler , D.W. , Billerman , S.M. , Lovette I.J. , 2020a . Palmchat (Dulidae) , in: Billerman , S.M. , Keeney , B.K. , Rodewald , P.G. , Schulenberg , T.S. (Eds.), Birds of the World. Cornell Lab of Ornithology . doi: 10.2173/bow.dulida1.01 OpenUrl CrossRef ↵ Winkler , D.W. , Billerman , S.M. , Lovette I.J. , 2020b . Waxwings (Bombycillidae) , in: Billerman , S.M. , Keeney , B.K. , Rodewald , P.G. , Schulenberg , T.S. (Eds.), Birds of the World. Cornell Lab of Ornithology . doi: 10.2173/bow.bombyc1.01 OpenUrl CrossRef ↵ Winkler , D.W. , Billerman , S.M. , Lovette I.J. , 2020c . Silky-flycatchers (Ptiliogonatidae) , in: Billerman , S.M. , Keeney , B.K. , Rodewald , P.G. , Schulenberg , T.S. (Eds.), Birds of the World. Cornell Lab of Ornithology . doi: 10.2173/bow.ptilio1.01 OpenUrl CrossRef ↵ Xi , Z. , Liu , L. , Davis , C.C ., 2016 . The impact of missing data on species tree estimation . Mol. Biol. Evol . 33 , 838 – 860 . doi: 10.1093/molbev/msv266 . OpenUrl CrossRef PubMed ↵ Xia , X. , Xie , Z. , Kjer , K.M ., 2003 . 18S ribosomal RNA and tetrapod phylogeny . Syst. Biol . 52 , 283 – 295 . doi: 10.1080/10635150390196948 . OpenUrl CrossRef PubMed Web of Science ↵ Zhang , C. , Mirarab , S ., 2022 . Weighting by Gene Tree Uncertainty Improves Accuracy of Quartet-based Species Trees . Mol. Biol. Evol . 39 . doi: 10.1093/molbev/msac215 . OpenUrl CrossRef PubMed ↵ Zhao , M. , Oswald , J.A. , Allen , J.M. , Owens , H.L. , Hosner , P.A. , Guralnick , R.P. , Braun , E.L. , Kimball , R.T ., 2025a . A phylogenomic tree of wood-warblers (Aves: Parulidae): Dealing with good, bad, and ugly samples . Mol. Phylogenet. Evol . 202 , 108235 . doi: 10.1016/j.ympev.2024.108235 . OpenUrl CrossRef PubMed ↵ Zhao , M. , Thom , G. , Faircloth , B.C. , Andersen , M.J. , Barker , F.K. , Benz , B.W. , Braun , M.J. , Bravo , G.A. , Brumfield , R.T. , Chesser , R.T. , Derryberry , E.P. , Glenn , T.C. , Harvey , M.G. , Hosner , P.A. , Imfeld , T.S. , Joseph , L. , Manthey , J.D. , McCormack , J.E. , McCullough , J.M. , Moyle , R.G. , Smith , B.T ., 2025b . Efficient Inference of Macrophylogenies: Insights from the Avian Tree of Life . BioRxiv . doi: 10.1101/2025.02.17.638170 . OpenUrl Abstract / FREE Full Text ↵ Zimmermann , J. , Hajibabaei , M. , Blackburn , D.C. , Hanken , J. , Cantin , E. , Posfai , J. , Evans , T.C ., 2008 . DNA damage in preserved specimens and tissue samples: a molecular assessment . Front. Zool . 5 , 18 . doi: 10.1186/1742-9994-5-18 . OpenUrl CrossRef PubMed View the discussion thread. Back to top Previous Next Posted June 06, 2025. Download PDF Supplementary Material Email Thank you for your interest in spreading the word about bioRxiv. NOTE: Your email address is requested solely to identify you as the sender of this article. Your Email * Your Name * Send To * Enter multiple addresses on separate lines or separate them with commas. You are going to email the following The phylogenetic position of the extinct Hawaiian honeyeaters: Overcoming the limitations of antique DNA Message Subject (Your Name) has forwarded a page to you from bioRxiv Message Body (Your Name) thought you would like to see this page from the bioRxiv website. Your Personal Message CAPTCHA This question is for testing whether or not you are a human visitor and to prevent automated spam submissions. Share The phylogenetic position of the extinct Hawaiian honeyeaters: Overcoming the limitations of antique DNA Min Zhao , Rebecca T. Kimball , Edward L. Braun bioRxiv 2025.06.02.657453; doi: https://doi.org/10.1101/2025.06.02.657453 Share This Article: Copy Citation Tools The phylogenetic position of the extinct Hawaiian honeyeaters: Overcoming the limitations of antique DNA Min Zhao , Rebecca T. Kimball , Edward L. Braun bioRxiv 2025.06.02.657453; doi: https://doi.org/10.1101/2025.06.02.657453 Citation Manager Formats BibTeX Bookends EasyBib EndNote (tagged) EndNote 8 (xml) Medlars Mendeley Papers RefWorks Tagged Ref Manager RIS Zotero Tweet Widget Facebook Like Google Plus One Subject Area Evolutionary Biology Subject Areas All Articles Animal Behavior and Cognition (7618) Biochemistry (17635) Bioengineering (13859) Bioinformatics (41846) Biophysics (21401) Cancer Biology (18534) Cell Biology (25422) Clinical Trials (138) Developmental Biology (13352) Ecology (19860) Epidemiology (2067) Evolutionary Biology (24285) Genetics (15582) Genomics (22463) Immunology (17700) Microbiology (40298) Molecular Biology (17141) Neuroscience (88424) Paleontology (666) Pathology (2825) Pharmacology and Toxicology (4813) Physiology (7633) Plant Biology (15107) Scientific Communication and Education (2042) Synthetic Biology (4284) Systems Biology (9808) Zoology (2267)
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.