Full text
1,927 characters
· extracted from
oa-doi-fallback
· click to expand
Abstract
Advances in single cell whole genome sequencing enable profiling of the copy number state of thousands of cells with minimal sequencing bias across the genome. The Direct Library Preparation + technique is an whole genome amplification-free single cell whole genome sequencing method that achieves high throughput by fragmenting each cell’s genome and ligating sequencing adapters using a modified Tn5 transposase, and sequencing to less than 0.1x coverage. Despite recent advances in experimental approaches, data analysis of single cell whole genome sequencing lags behind and the existing methods are not optimized for the analysis of frozen samples with variable DNA preservation. Furthermore, existing tools predominantly rely on read depth ratio in predefined genomic bins to call copy number, making whole genome duplication unidentifiable. To address this, we introduce Songbird, a single cell whole genome sequencing copy number caller that is whole genome duplication sensitive, and outperforms existing tools both in breakpoints identification and true copy number detection. We demonstrate that Songbird is robust down to extremely low coverage, adaptable to a variety of genome versions (hg19, hg38, hs.1), and is extensible to other single cell whole genome sequencing methods that rely on Tn5 tagmentation to fragment the genome.
Competing Interest Statement
The authors have declared no competing interest.
6 DATA AVAILABILITY
Songbird is available as an R Package at https://github.com/GastroEsoLab/Songbird. Sequencing files for the hTERT/HEK ladder are available upon request. Files for the scAbsolute Ladder were retrieved from the European Nucleotide Archive project PRJEB61928. Sequencing files for Lianti and SmoothSeq were retrieved from NCBI bioproject PRJNA379710 and PRJNA633502 respectively.
Sequencing files for the esophageal adenocarcinoma organoid are available under restricted access.
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.