Maximizing the clinical utility and performance of cytology samples for comprehensive genetic profiling – A report on the impact of process optimization through the analysis of 4,871 cytology samples profiled by MSK-IMPACT

preprint OA: closed
Full text JSON View at publisher
Full text 133,094 characters · extracted from preprint-html · click to expand
Maximizing the clinical utility and performance of cytology samples for comprehensive genetic profiling – A report on the impact of process optimization through the analysis of 4,871 cytology samples profiled by MSK-IMPACT | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article Maximizing the clinical utility and performance of cytology samples for comprehensive genetic profiling – A report on the impact of process optimization through the analysis of 4,871 cytology samples profiled by MSK-IMPACT David Kim, Chad Vanderbilt, Soo-Ryum Yang, Subhiksha Nandakumar, and 11 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-4746484/v1 This work is licensed under a CC BY 4.0 License Status: Published Journal Publication published 02 Jan, 2025 Read the published version in Nature Communications → Version 1 posted You are reading this latest preprint version Abstract Comprehensive molecular profiling by next generation sequencing (NGS) has revolutionized tumor classification and biomarker evaluation. However, routine implementation is challenged by the scant nature of diagnostic material obtained through minimally invasive procedures. Here, we describe our long-term experience in profiling cytology samples with an in-depth assessment of the performance, quality metrics, biomarker identification capabilities, and potential pitfalls. We highlight the impact of several optimization strategies to maximize performance with 4,871 prospectively sequenced clinical cytology samples tested by MSK-IMPACT™. Special emphasis is given to the use of residual supernatant cell free DNA (ScfDNA) as a valuable source of tumor DNA. Overall, cytology samples were similar in performance to surgical samples in identifying clinically relevant genomic alterations, achieving success rates up to 93% with full optimization. While cell block (CB) samples had excellent performance overall, low-level cross-contamination was identified in a small proportion of cases (4.7%), a common pitfall intrinsic to the processing of paraffin blocks, suggesting that more stringent precautions and processing modifications should be considered in quality control initiatives. By contrast ScfDNA samples had negligible contamination. Finally, ScfDNA testing exclusively used as a rescue strategy delivered successful results in 71% of cases where tumor tissue from CB was depleted. Health sciences/Oncology/Cancer/Cancer genomics Health sciences/Oncology/Cancer/Cancer therapy/Targeted therapies Health sciences/Oncology/Surgical oncology Cell-free DNA Supernatant Fluid Cytology Next-Generation Sequencing Molecular Cytology Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Introduction Comprehensive tumor molecular profiling using next generation sequencing (NGS) technology is steadily increasing in routine oncologic practice, in order to guide precise disease classification and the selection of targeted therapies 1,2 . Concurrently, minimally invasive procedures have also steadily and systematically become a dominant tumor sampling modality. Despite indisputable patient benefits, the amount of tissue procured through such procedures is limited, raising concerns on their sufficiency and suitability for comprehensive downstream analysis. Cytologic specimens are among the most limited tissue samples that are obtained through minimally invasive procedures. While often the only source of tumor for both diagnostic and biomarker evaluation, judicious protocols for tissue testing have been explored to maximize the material available for genetic studies. In this context, the performance of NGS across various preparations, including cellblocks (CB), smears, and liquid-based suspensions have been studied and described in the literature primarily focusing on small gene panels (< 100 genes) 3–11 . However, with the increasing need for the assessment of a wider range of genetic alterations, the sufficiency and robust performance for comprehensive NGS assays have remained a major concern. To address this, our pathology department embarked on a comprehensive performance improvement project which involved several years of sequential process optimization to strategically improve the use of cytologic tissue samples for molecular profiling. This encompassed coordinated changes by the cytology lab, including the use of a modified HistoGel based cell-block processing to improve pellet density 12,13 , as well as changes in the diagnostic molecular lab in tissue processing such as deparaffinization with mineral oil 14–16 , improved bead-based extraction techniques, implementation of dual index sequencing and adjustments of minimum DNA input requirements (Supplementary Table 1). Throughout this time, we also implemented the use of residual cytology supernatant fluids as an additional source of tumor DNA for NGS applications. Commonly discarded in routine cytology practice, supernatant fluids contain variable amounts of DNA from fragmented cells as well as whole cells, denoted here-on as supernatant cell-free DNA (ScfDNA) 16 . The strategic use of this DNA enables the preservation of cellular tissue for other ancillary studies that rely on visual assessment of intact cells, such as immunohistochemistry and cytogenetics. This study presents our overall clinical experience using cytologic material for comprehensive NGS, integrating the use of ScfDNA as a rescue sample when other material is unavailable. To our knowledge, this is the largest cytology cohort to date including the largest cohort of residual supernatant fluid, across a wide range of tumor types. We re-analyzed all cytology sample data collected from our institution-wide prospective sequencing effort using the Memorial Sloan Kettering-Integrated Mutation Profiling of Actionable Cancer Targets (MSK-IMPACT ™) assay, an FDA cleared, paired tumor-normal hybridization-capture based NGS test, designed to comprehensively assess mutations, copy number alterations, and select rearrangements 17 . A summary of the performance characteristics across years of process optimization is presented, describing the utility and potential pitfalls of cytology samples for the identification of clinically relevant biomarkers, with comparisons to existing sequencing data from biopsies and resections from the same patients. Materials and methods Patient consent and cohort selection The prospectively maintained database of samples submitted for sequencing using our institution’s large-panel NGS assay (MSK-IMPACT™) between the years 2014 and August 2022 was queried to identify all cytology samples; this included all requests on samples deemed to be malignant by morphologic assessment prior to their assessment of suitability for sequencing. MSK-IMPACT™ testing was ordered by the treating physician to identify clinically significant genomic alterations for the clinical care of patients with cancer. Patient’s receiving testing signed a clinical consent form and was enrolled on an institutional IRB-approved research protocol (MSKCC; NCT01775072). Additionally following consent, a patient blood draw was obtained as a source for normal (germline) DNA. Basic demographic data (age and sex) and any existing pre-analytic information, including the type of preparation, tissue source, tumor type, tumor content, and DNA yield were collected. All sequencing data, encompassing QC metrics, sequencing coverage, somatic variants identified, and variant allele frequency (VAF) were gathered, as well as sequencing qualification (pass, fail, reason for failure) as established at the time of clinical signout. When available, the same sequencing metrics and information described above were also collected for corresponding biopsy and resections samples from the same patient tumor, to compare results side-by-side. All samples were collected with informed consent and testing was performed in our CLIA certified laboratory. This study was approved by the MSKCC Institutional Review Board/Privacy Board. Cytology sample processing Cytologic samples were received as formalin fixed paraffin embedded (FFPE) tissue sections from cell block (CB) preparations or as supernatants. Samples were either collected in CytoLyt fixative (Hologic, Malborough, MA, USA) or in 10% neutral buffered formalin fixative and were paraffin embedded (FFPE). CB preparation for MSKCC procured samples followed a modified HistoGel-Based Cell Block Preparation Method as previously described 12,13 . Procedural details of externally procured samples (cases submitted for review at MSKCC for diagnosis confirmation and IMPACT testing) were not available. For each case, 20 unstained sections (5 um thick) were submitted mounted on glass slides, along with a hematoxylin and eosin–stained (H&E) section to assess adequacy and tumor fraction. Macro-dissection was performed to enrich for tumor, when possible and necessary, aiming for > 50% tumor cell content. Samples were rejected / failed if the tumor proportion was < 10% and the sample was not amenable to manual enrichment. For MSKCC samples that were collected in CytoLyt fluid, residual material was saved after the ThinPrep® and CB were prepared. The corresponding ThinPrep® slide was assessed as a surrogate for tumor presence and content. If tumor cells were present at ≥ 10% based on visual inspection, the supernatant was considered suitable and DNA was extracted. Further details of the processing of supernatants for ScfDNA extraction are described in a previous publication 16 . Extraction procedures: FFPE material was deparaffinized using Citrasolv (2014 to March 2016) or mineral oil (March 2016 to 2022) and DNA was extracted using the Chemagic STAR DNATissue-10 Kit (Perkin Elmer, Waltham, MA) with the magnetic-bead method automated on a Chemagic STAR Standard Solutions Workstation (Hamilton, Bonaduz, GR, Switzerland), following manufacturer’s protocols. DNA from supernatants was extracted using the same kit and automated system, eliminating deparaffinization and overnight lysis incubation at 56 0 C and, instead, 1 hour lysis incubation was used (56 0 C). Extracted DNA was eluted and quantified using a Qubit DNA high-sensitivity assay kit (Life Technologies, Carlsbad, CA). FFPE samples with DNA concentration of < 0.9 ng/uL were deemed insufficient for further testing until 09/2021 when the threshold was lowered to 0.54ng/uL to proceed with sequencing, which translates to minimal total inputs of 50 and 30ng, respectively based on maximal volume inputs for the assay of 55ul. ScfDNA samples were sequenced below these thresholds aiming to spare the patient from a future biopsy and to further evaluate performance characteristics. Next Generation Sequencing DNA was sheared and processed (along with matched DNA from blood as normal control) to generate bar-coded libraries which were pooled and subjected to targeted capture using custom-designed probes as previously detailed 17 . All samples in this study underwent testing by MSK-IMPACT ™, targeting all coding regions of up to 505 genes, select introns and over 1,000 custom intergenic and intronic regions throughout the genome (centered on common SNPs). Updates to the panel sequentially increased the number of genes captured from 341 (year 2014), 410 (years 2015–2016), 468 (years 2017–2020), to 505 (years 2021–2022). Captured DNA fragments were sequenced on an Illumina HiSeq2500 or NovaSeq 6000 system, before being submitted to the bioinformatics analysis pipeline for calling of somatic alterations. Clinical actionability and treatment associations of the genomic alterations detected were assessed and annotated using OncoKB 18 MSK’s precision oncology knowledge base. Levels of evidence are assigned to each alteration based upon therapeutic levels of evidence specific to the tumor type profiled including alterations predictive of resistance to a therapy. Results were compared to the previously published data for corresponding tumor types in the AACR Project GENIE Pan-Cancer Cohort 19 . Tumor mutational burden (TMB) was calculated for each sample as the total number of non-synonymous mutations, including driver mutations in oncogenes, normalized to the exonic coverage of the respective MSK-IMPACT panel in megabases (Mb). Samples were deemed clinically successful if they passed all quality control metrics defined for our assay (e.g. adequate tumor quantity, coverage, base quality, etc.) and were formally reviewed by a board-certified molecular pathologist before the report was released clinically. Next generation sequencing quality metrics and contamination assessment For QC purposes, in addition to standard NGS quality metrics, assessment for potential sample contamination was a critical component of our assessment. Our established analysis pipelines compute pairwise genotype concordance across all SNP sites included in the panel. This unique genotype analysis, enabled by our paired tumor:normal sequencing approach, allows us to identify potential sample swaps and contamination, either due to the presence of DNA from another individual or contamination among different barcode adapters, which could lead to erroneous mutation calling. Contamination levels are defined by the analysis of SNP sites at which the patient is homozygous (based on normal control profile). Because a homozygous site is defined by 2 identical alleles at the particular genetic locus, any allelic discrepancy where the variant is not expected indicates contamination. A cutoff of ≥ 2% is used to denote clinically significant contamination (the threshold for mutation calling). Samples with a contamination rate higher than 2% were evaluated in the context of the tumor content and mutation profile; low level contamination in samples with very high tumor content, remained partially evaluable by filtering variants within the range of contamination. Sample type comparisons and concordance analysis To further assess the performance of cytologic samples compared to larger tissue samples, existing MSK-IMPACT sequencing data from corresponding tissue core biopsies or subsequent resection samples from the same tumor were obtained. Once matched, metrics including sequencing coverage, genomic alterations identified, mutation VAFs, and OncoKB levels were compared to the cytologic counterpart. In a subset of cytology cases, results from CB deemed adequate for testing were also compared to the corresponding ScfDNA. This data was analyzed separately to avoid duplication. Statistical Analysis Statistical analysis for group comparisons of continuous data were performed using a two-tailed Mann-Whitney test. A Pearson’s Chi-squared test was performed for comparing three or more categorical groups. A Fischer’s exact test was performed for comparing two categorical groups. Statistical significance was set at p < 0.05. Cases with missing values were removed from the analyses and only complete cases were considered. All statistics and graphical representations were performed using R project. Results Clinicopathologic characteristics of cytology sequencing cohort In total, 4,871 cytology tumor samples from 4,633 patients were received for MSK-IMPACT testing with patient demographics detailed in Table 1 . Most samples were from CB preparations, 94.2% (4,588/4,871) while 5.8% (283/4,871) were received as ScfDNA. Of note, ScfDNA testing was requested only when no other material was suitable or available. The majority, 63%, were procured at MSKCC and processed internally while 37% were submitted from outside institutions (Table 1 ). Testing was cancelled on 3% (146/4,871) prior to sample processing for logistical considerations, including the lack of a submitted normal control for matched testing or testing no longer relevant for patient management. A diverse array of tissue sites and sample types were profiled as detailed in Fig. 1 a and 1 b. The cohort encompassed 181 unique tumor types with lung and pancreatic adenocarcinomas being the most common. The number of samples and relative frequencies of the profiled tumor types are further summarized in Supplementary Table 2. Table 1: Demographic information of study cohort Characteristic Overall , N = 4,871 1 CB , N = 4,588 1 ScfDNA , N = 283 1 p-value 2 Age 66 (59, 74) 66 (58, 74) 67 (60, 74) 0.4 Sex 0.2 F 2,639 (54%) 2,497 (54%) 142 (50%) M 2,232 (46%) 2,091 (46%) 141 (50%) Outside/Internal <0.001 Internal 3,091 (63%) 2,809 (61%) 282 (100%) Outside 1,780 (37%) 1,779 (39%) 1 (0.4%) 3 Procedure <0.001 Fluid 693 (14%) 633 (14%) 60 (21%) FNA 4,178 (86%) 3,955 (86%) 223 (79%) 1 Median (IQR); n (%) 2 Wilcoxon rank sum test; Pearson's Chi-squared test 3 Received in CytoLyt and processed Internally Success rate of NGS testing on Cytology Samples Overall, 81% (3,806/4,725) of all samples were successfully tested. The success rate was higher for CB (81%; 3,616/4,457) samples compared to ScfDNA (71%; 190/268), noting that ScfDNA samples encompassed only cases for which the CB had already been deemed unsuitable for any analysis. Across the study period, the use of ScfDNA as a rescue sample boosted the overall success rate of the cytologic procedures from 77–81%. Causes of failure, in descending order of frequency, included low DNA yield below the minimum cutoffs established for sequencing (11.3%), very scant tumor tissue (< 10% tumor) seen on manual review (4.6%), low sequencing coverage below a median of 50x (1.8%), high sample level DNA contamination (1.6%) and low DNA quality (0.1%) including adequate coverage but high background noise and low base quality scores. (Fig. 1 c ) . To assess the impact of optimization efforts implemented across the study period, all requests (excluding cancellations) were stratified by year and testing success status. Sequential and statistically significant improvements were observed, even in the context of increased number of genes tested by panel updates, reaching 89% in the last year of assessment (Fig. 2 a). For samples deemed sufficient for sequencing (following qualification for tumor content and DNA yield), success rates were consistently high across all years (range 96–98%) (Fig. 2 b ) . Among CB preparations, success rates were significantly higher for internal samples compared to those from outside laboratories, with highest success rates of 92–93% (internal) and 79–82% (external) in the last 2 years, in accordance with the full optimization efforts ( p < 0.01; Fig. 2 c, Supplementary Table 3). On average, successful cytology samples had higher tumor purities compared to samples that failed testing, across both preparations (CB: p = 0.0003, ScfDNA: p = 0.63; Fig. 2 c ) . Median total DNA yields were 427.5 ng and 182.2 ng ( p = 2.2 x 10 − 16 ) for CB and ScfDNA, respectively (Fig. 2 d). The lower DNA yield in ScfDNA was expected as these constituted rescue samples when the corresponding cytology tissue was too scant or exhausted. Sequencing performance: total coverage and sample quality metrics Among the 4,725 samples sequenced, the total median coverage was 586x. Coverages were significantly higher for CB samples compared to rescue ScfDNA samples, at 595x and 263x, respectively (p = 2.2 x 10 − 16 ) ( Fig. 3 a ) . Despite the lower coverage, most rescue samples retained coverages above 200x, which is above our established requirements to maintain sensitivity for variants calling at 2%. Notably, in 2021 the minimum DNA input requirement for MSK-IMPACT was lowered from 50ng to 30ng for cell blocks. Following this change, we saw no significant differences in sequencing coverage ( Fig. 3 b ) and sequencing success rate remained steady at 98% across sequenced cases. Among all samples sequenced, contamination checks revealed clinically relevant non-patient DNA contamination ( ≥ 2%) in 5.2% of cases (246/4725) (Fig. 3 c). Excluding those that failed due to very low coverage (< 50X), the overall rate was 4.8% (227/4725), with a significantly higher rate for CB samples compared to ScfDNA, at 4.7% (226/4725) and 0.3% (1/4725) respectively. Notably, in the context of optimal coverage (> 200X), no ScfDNA samples exhibited clinically significant sample contamination ( Fig. 3 d ) . Also, no contamination was identified, even for samples with low coverage, following the implementation of dual indexing. By contrast, CB samples showed variable and significantly higher levels of contamination (range: 2–32%), which remained present despite adequate coverage and after implementation of dual indexing. Among samples with optimal coverage, 4% (189) of the CB samples exhibited contamination rates above 2%. In 34% (65) of these samples, for which sufficient material for re-extraction and STR analysis was available, contamination could be tracked to foreign tissue material embedded in the tissue blocks. Representative cases are included in Supplementary Fig. 1. Biomarker/mutation identification and therapeutic actionability A total of 30,149 somatic alterations were detected across the 3,806 successfully sequenced cases. Of these, 93.8% of cases (3,570/3,806) harbored at least 1 somatic alteration, including 3,394 (93.9%) of CB and 176 (92.6%) of the ScfDNA samples. No significant differences in sample coverage were observed between samples with and those without alterations ( CB: p = 0.19; ScfDNA: p = 0.91, Supplementary Fig. 2 ). However, tumor purity estimations were significantly lower for the subset without alterations, with a median tumor content of 10% vs 30% for those with detected alterations. When stratified, the median number of alterations was similar for both sample types, 9 for CB (range 1–170; 95% CI: 11.9–12.7) and 10 for ScfDNA (range 1–63; 95% CI: 8.5–11.9). The average TMB was 7.64 mutations/Mb and 7.43 mutations/Mb for CB and ScfDNA samples, respectively. Overall, the mutational profiles recapitulated the expected landscape and frequency of driver and common alterations for the tumor type. Stratified by level of actionability, 65% (n = 2487) had at least one targetable alteration as defined by the presence of an OncoKB level 1, 2, 3A, or 3B alteration and 2% (n = 93) had a standard care resistance mutation (OncoKB level R1). The highest frequency of level 1 OncoKB alterations was observed in thyroid, breast, non-small cell lung (NSCLC), and bladder cancer patients at 58%, 58%, 45%, and 29% respectively. For resistance mutations, 87 CB and 6 ScfDNA samples identified an OncoKB level R1 alteration. To ensure that significant alterations were being identified at similar rates to non-cytology samples, results were compared to those published in the AACR GENIE cohort. Across the different histologic tumor types, similar rates of OncoKB alterations were identified ( Fig. 4 a ) . Representative oncoplots of the most frequent, clinically actionable alterations detected in NSCLC, Bladder Cancer, and Breast Cancer (most common tumor types in our cohort) are presented in Fig. 4 b and 4 c which demonstrate the expected distributions across both CB and ScfDNA. OncoKB level 1 alterations were commonly seen in EGFR, KRAS, PIK3CA, and ERBB2 genes. ALK, BRAF, RET, and ROS1 level 1 alterations were also seen at lower frequencies. Comparison with surgical core biopsies/resections To further assess the general performance of cytology samples, we identified 526 cases (CB: n = 482; ScfDNA: n = 44) of patients who had a corresponding surgical sample of the same tumor assessed by MSK-IMPACT. While the same tumor was profiled across each surgical:cytology pair, it should be noted that there were variations in the time of collection across their treatment course as the samples were profiled clinically. Thus, many of the cytologic samples were collected at the time of disease progression or development of resistance. Overall, cytologic samples demonstrated similar sequencing metrics compared to their surgical biopsy/excision counterparts with adequate average coverages of 584x for cytology samples and 628x for corresponding surgical samples ( p = 0.00028). Comparing detected alterations, a large proportion of the cytology samples identified all the alterations detected on the corresponding surgical sample, 266 (55%) CB ( Fig. 5 a ) and 19 (43%) ScfDNA samples ( Fig. 5 e ) . The median VAF’s for shared alterations were slightly higher for cytology samples, compared to the surgical pair in both CB and ScfDNA samples (CB: p = 4.9 x 10 − 11 ; ScfDNA: p = 0.13) ( Fig. 5 b and Fig. 5 f ) . In all, a total of 5,593 mutations were identified in the CB:surgical paired set, of which 2789 events (49.8%) were shared ( Fig. 5 c ) . For the ScfDNA:surgical paired set, 692 mutations were detected with slightly lower overlap (34.8%; Fig. 5 g). Importantly, when alterations were stratified by level of actionability, the overwhelming majority of driver alterations with OncoKB Level 1 actionability were shared events, at 93% and 83% for the CB and ScfDNA sets, respectively. Non-detection of OncoKB Level 1 alterations in the surgical or the cytology sample was related to low coverage or low tumor content in all cases. Events categorized as Level R1 or No level showed the lowest overlap, with 27% and 42% shared events, respectively, likely reflecting differential passenger events, the acquisition of additional mutations in the time interval of the two samples, or the heterogeneous nature of resistance mechanisms in the samples. Further details are provided in Fig. 5 d and 5 h and Supplementary Table 4. Review of contamination check data for surgical samples revealed that < 1% of surgical samples had clinically relevant contamination (0.81%; 5/619; Supplementary Fig. 3a ). Of note, all 6 surgical samples with contamination were minute biopsy samples with low tumor purity (Supplementary Fig. 3b) with contamination below 4%. Comparison of successful CB preparations with corresponding ScfDNA Among CB cytology samples with adequate tumor and successful sequencing, 24 had the corresponding ScfDNA samples tested to allow direct comparisons. Both DNA concentration and sequencing coverages were significantly lower for the ScfDNA samples. Total DNA yields averaged 505ng (range: 76–5453) and 1200ng (range 83.4–3340) for ScfDNA and CB preparations, respectively. Accordingly, ScfDNA had resulting lower sequencing coverage averaging 387x (range 3x – 1335x) compared to 669x (74x – 1193x) for the corresponding CB. Of the 24 ScfDNA samples, 7 (27%) failed sequencing due to low sample coverage. Detection of clinically relevant alterations and the VAF were the same across both sample preparations based on comparison of successfully sequenced sets. Discussion Comprehensive NGS sequencing is becoming a common approach for upfront assessment of a broad range of genetic biomarkers that are pivotal for diagnostic, prognostic, and therapeutic decisions in cancer patients. While ideally, molecular testing is greatly facilitated when large tumor samples are available (i.e. resections or excisional biopsies), the reality of clinical practice is that a very large proportion of testing must be performed on scant material obtained through minimally invasive procedures. Historically, this has presented distinct challenges, prompting the adoption of alternate approaches, such as liquid biopsies, which attempt to circumvent tumoral cell assessment altogether. At present, while arguments can be made for the superiority or inferiority of each modality over another, cytologic samples stand as the one middle approach that unites the most desirable attributes of both worlds. Namely, they retain the key morphologic correlates required for tumor diagnosis, while still sparing the patient from the more invasive procedures. One fact remains constant, however, which is that small samples require very high optimization of the entire process to maximize the genomic yield. In this study, we have outlined our institutional approach and longitudinal experience in comprehensive profiling of cytology samples in routine clinical care. To our knowledge, this represents the largest prospective clinical cohort reported to date, demonstrating that molecular testing can be performed on routinely procured cytology samples with high success rates, similar to surgical samples. Proportions of clinically actionable genomic alterations, specifically OncoKB Level 1–3B, as well as R1 alterations, recapitulated the expected patterns across all tumor types when compared to those published in AACR GENIE cohort. For immediately actionable alterations (OncoKB Level 1), the concordance of cytology to corresponding surgical samples from the same patient were very high (93%). Notably rescue ScfDNA from supernatant CytoLyt fluid material, utilized for our internal cases, proved highly valuable and enabled the detection of a level 1 alteration in 83% of the successfully sequenced cases. Our review of data compiled across 8 years, highlighted the central roles of optimized sample handling and processing. In our hands, 2 critical early steps enabled higher DNA recovery which, consequently, promoted increased utilization of cytologic material for molecular testing. The first was the optimization of cell block preparation, which incorporated pretreatment of pelleted cells with 95% ethanol before addition of HistoGel 12,13 . This enhanced the density of cell pellets to deliver higher amount of cellular material in fewer sections of the paraffin block. The second was the transition to mineral oil deparaffinization which markedly reduced tube transfers, centrifugation, and decanting steps, all key vulnerabilities responsible for major nucleic acid losses in the processing of scant FFPE material 14–16 . It should be noted that, with the implementation of mineral oil extraction, requests for testing on cytologic material vs needle biopsies markedly increased at our institution. Details of this transition have been previously published by our group 16,20 . Notably, among lung cancer patients undergoing endobronchial ultrasound transbronchial needle aspiration (EBUS-TBNA) this change, alone, significantly improved sequencing success rates from 76.3–93%. Moreover, these success rates corresponded to NGS testing that was performed after standard rapid testing for EGFR on the same samples 21 , further supporting the high suitability and sufficiency of the DNA recovered. An important, and often underreported, consideration in molecular testing of cytology samples are the diagnostic challenges and inaccuracies that may arise from sample cross-contamination. While sample-to-sample contamination may happen across any point, highly vulnerable points lie in processes that involve batching and pooling of multiple samples in a single run. In particular, established histopathology practices of tissue processing (i.e. carry over from microtome blades, common water baths, pooled tissue processors, etc.), pose distinct risks for contamination for small tissue samples as processes are primarily optimized to enhance microscopic diagnostic analysis but not downstream molecular applications. Common holding of numerous specimens in single chambers in automated tissue processors, the use of common equipment for embedding, cutting and tissue mounting, all increase the potential for low level cross contamination. While this may remain inconsequential for morphologic assessment or the molecular analysis of large tissue samples, this can distinctly impact small samples where similar contamination levels become proportionally higher. Cytology samples may be even more vulnerable due to processing of cell blocks with paper wrapping and HistoGels, which may promote trapping of cellular impurities from other samples. Indeed, in our analysis of cytologic samples, contamination was significantly higher across CB samples compared to all other samples. This held true when analyzing samples with adequate coverage (> 200x) with 4% of CB samples exhibiting contamination. ScfDNA samples by contrast, which are processed individually in a closed system and not batched, had negligible levels with contamination patterns exclusively associated with sequencing failures or borderline coverage and more likely related to artifact rather than true contamination. Despite the presence of higher contamination levels in cell blocks, the overall rate was low (4.8%) among successfully sequenced samples, which encompassed samples procured and processed across numerous laboratories across the county. These rates are in keeping with sequencing data on surgical samples published by Sehn et al 22 but are significantly lower to what is reported by the ASC Clinical Practice Committee/Workgroup for Cross-Contamination in a recent survey for general cytopathology practice, quoting rates as high as 56% for cell-block preparations 23 . This high rate may be related to the reporting of contamination per case, affecting some but not all unstained sections and which may not be high enough to be detectable in the sequencing of DNA recovered from a set of several slides. Importantly, while contamination was detectable in several cases in our cohort, most were sufficiently low in comparison to the overall tumor content of the sample, allowing informed filtering of low–variant allele fraction events without compromising all mutation calling. In all, only 1.3% of the samples were failed due to contamination, while others could be reported with modification. Within the molecular laboratory, a notable source of cross-contamination may arise from index-hopping during multiplexing. This, however, is generally lower level (well below 2%) and more prone to affect higher sensitivity applications. Nonetheless, in the process of improvement for our MSK-IMPACT assay we have incorporated several strategies to mitigate this phenomenon, including optimization of PCR conditions and the implementation of dual indexing to facilitate the removal of misaligned reads. These finetuning steps facilitated our decision to reduce the assay input requirements which markedly reduced failures due to insufficient DNA. No significant changes in coverage or contamination rates were seen with this change. Finally, a pivotal component of our optimization process was the implementation of testing ScfDNA recovered from liquid cytology preparations. While, generally, this sample type was not submitted if the cell block was deemed suitable for testing, it became an important rescue sample to avoid re-biopsy procedures. The use of this material also relieved some of the challenges in triaging very small biopsy samples for other ancillary studies. In all, while the success rate of the ScfDNA samples was approximately 71%, which is below what is seen across tissue biopsies and CB, these samples were specifically tested after the corresponding cytologic material was deemed unsuitable, thus boosting the overall success for the individual aspirate procedures by approximately 3%. An important observation, gathered from the comparison of ScfDNA and corresponding cell blocks, is that the VAF’s of detected alterations were similar for both preparations, supporting that the assessment of the block or cytoprep represents a suitable surrogate for estimating the proportion of tumor derived DNA that may be present in the ScfDNA sample. Additionally, given the high integrity of the DNA in these non-formalinized samples, lower DNA inputs still delivered excellent results, provided that the tumor proportion was suitable. Confirmatory testing with higher sensitivity methods may also be implemented for low tumor samples, without concerns for false positivity due to artifacts imparted by formalin fixation. In conclusion, this study confirms that the routine use of cytologic samples for molecular testing constitute a robust approach that can deliver the same results as larger biopsy samples. Process optimization and the implementation of robust quality control processes, including contamination checks are pivotal to maximizing the yield and utility of these samples. A reassessment on how tissue blocks are processed and prepared would be an important aspect of cytology practice as a whole, to include specialized instrumentation for processing small samples without risk of cross-contamination. ScfDNA recovered from supernatants is an invaluable source of tumor derived DNA which circumvents the processing where most contamination is bound to happen in current practice, and while failure rates due to limited nucleic acid recovery are higher than tissue blocks, their use could rescue the majority of cases where high tumor is identified but FFPE material is insufficient for sequencing. Declarations Acknowledgements This research was funded in part through the NIH/NCI Cancer Center Support Grant P30 CA008748. Ethics declarations/Competing interests C.V. reports intellectual property rights and equity interest in Paige.AI, Inc. A.R.B. has ownership/equity interests in Johnson and Johnson. M.B has received advisory or consulting fees from AstraZeneca, Eli Lilly and Company, and PetDx, Inc. M.L. has received advisory or consulting fees from Takeda Oncology, Janssen Pharmaceuticals, AstraZeneca, ADC Therapeutics, Paige.AI, Merck, Bayer, and Lilly Oncology and has received research funding from Loxo Oncology, Helsinn Therapeutics, Merus NV, Elevation Oncology, and Rain Therapeutics. O.L. has received advisory or consulting fees from Hologic and Janssen Research & Development, LLC. M.A. has received advisory or consulting fees from Axis Medical Education, Clinical Education Alliance, LLC, Merck Sharp & Dohme, PeerView Institute for Medical Education (PVI), Physicians’ Education Resource, RMEI Medical Education, LLC, and Roche. The following Authors declare no competing interests: D.K., S.Y., S.N., K.N., R.F., N.R., I.R., J.C., and A.Y. Authors’ contributions All authors of this article declare that we qualify for authorship for this manuscript. Each author has participated sufficiently in the work and takes public responsibility for appropriate portions of the content of this article. D.K. and M.A. carried out the background research and drafted the manuscript. D.K. and M.A. conceived the idea for the manuscript, its design, and coordination. D.K., C.V., A.Y., and S.N. generated and interpreted the data. D.K. and C.V. performed the statistical analysis for this study. D.K., C.V., S.Y., S.N., K.N., R.F., N.R., I.R., J.C., A.Y., A.R.B., M.B., M.L., O.L., and M.A. reviewed the manuscript critically for important intellectual content. All authors read and approved the final manuscript. References Garraway LA (2013) Genomics-driven oncology: framework for an emerging paradigm. J Clin Oncol 31:1806–1814 Wakai T et al (2019) Next-generation sequencing-based clinical sequencing: toward precision medicine in solid tumors. Int J Clin Oncol 24:115–122 Turner SR et al (2018) Feasibility of endobronchial ultrasound transbronchial needle aspiration for massively parallel next-generation sequencing in thoracic cancer patients. Lung Cancer 119:85–90 Ramani NS et al (2021) Utilization of cytology smears improves success rates of RNA-based next-generation sequencing gene fusion assays for clinically relevant predictive biomarkers. Cancer Cytopathol 129:374–382 Baum JE et al (2017) Accuracy of next-generation sequencing for the identification of clinically relevant variants in cytology smears in lung adenocarcinoma. Cancer Cytopathol 125:398–406 Roy-Chowdhuri S et al (2018) Salvaging the supernatant: next generation cytopathology for solid tumor mutation profiling. Mod Pathol 31:1036–1045 Kanagal-Shamanna R et al (2014) Next-generation sequencing-based multi-gene mutation profiling of solid tumors using fine needle aspiration samples: promises and challenges for routine clinical diagnostics. Mod Pathol 27:314–327 Gan Q, Roy-Chowdhuri S (2020) Small but powerful: the promising role of small specimens for biomarker testing. J Am Soc Cytopathol 9:450–460 Velizheva NP et al (2017) Cytology smears as excellent starting material for next-generation sequencing-based molecular testing of patients with adenocarcinoma of the lung. Cancer Cytopathol 125:30–40 Akahane T et al (2021) Direct next-generation sequencing analysis using endometrial liquid-based cytology specimens for rapid cancer genomic profiling. Diagn Cytopathol 49:1078–1085 Roy-Chowdhuri S et al (2015) Factors affecting the success of next-generation sequencing in cytology specimens. Cancer Cytopathol 123:659–668 Rudomina D et al (2013) Improved Cell Block Preparation Method: A Protocol for Increased Cellularity which is Suitable for Immunocytochemical and Molecular Assays. J Am Soc Cytopathol 2:S81–S82 Rekhtman N et al (2018) Novel Modification of HistoGel-Based Cell Block Preparation Method: Improved Sufficiency for Molecular Studies. Arch Pathol Lab Med 142:529–535 Heikal N, Nussenzveig RH, Agarwal AM (2014) Deparaffinization with mineral oil: a simple procedure for extraction of high-quality DNA from archival formalin-fixed paraffin-embedded samples. Appl Immunohistochem Mol Morphol AIMM 22:623–626 Lin J et al (2009) High-quality genomic DNA extraction from formalin-fixed and paraffin-embedded samples deparaffinized using mineral oil. Anal Biochem 395:265–267 Tian SK et al (2016) Optimizing Workflows and Processing of Cytologic Samples for Comprehensive Analysis by Next-Generation Sequencing: Memorial Sloan Kettering Cancer Center Experience. Arch Pathol Lab Med 140:1200–1205 Cheng DT et al (2015) Memorial Sloan Kettering-Integrated Mutation Profiling of Actionable Cancer Targets (MSK-IMPACT): A Hybridization Capture-Based Next-Generation Sequencing Clinical Assay for Solid Tumor Molecular Oncology. J Mol Diagn 17:251–264 Chakravarty D et al (2017) OncoKB: A Precision Oncology Knowledge Base. JCO Precis. Oncol. (2017) Suehnholz SP et al (2024) Quantifying the Expanding Landscape of Clinical Actionability for Patients with Cancer. Cancer Discov 14:49–65 Turner SR et al (2018) Feasibility of endobronchial ultrasound transbronchial needle aspiration for massively parallel next-generation sequencing in thoracic cancer patients. Lung Cancer 119:85–90 Arcila ME et al (2020) Ultrarapid EGFR Mutation Screening Followed by Comprehensive Next-Generation Sequencing: A Feasible, Informative Approach for Lung Carcinoma Cytology Specimens With a High Success Rate. JTO Clin Res Rep 1 Sehn JK et al (2015) Occult Specimen Contamination in Routine Clinical Next-Generation Sequencing Testing. Am J Clin Pathol 144:667–674 Roberson J et al (2022) Cross-contamination in cytology processing: a review of current practice. J Am Soc Cytopathol 11:194–200 Additional Declarations Yes there is potential Competing Interest. C.V. reports intellectual property rights and equity interest in Paige.AI, Inc. A.R.B. has ownership/equity interests in Johnson and Johnson. M.B has received advisory or consulting fees from AstraZeneca, Eli Lilly and Company, and PetDx, Inc. M.L. has received advisory or consulting fees from Takeda Oncology, Janssen Pharmaceuticals, AstraZeneca, ADC Therapeutics, Paige.AI, Merck, Bayer, and Lilly Oncology and has received research funding from Loxo Oncology, Helsinn Therapeutics, Merus NV, Elevation Oncology, and Rain Therapeutics. O.L. has received advisory or consulting fees from Hologic and Janssen Research & Development, LLC. M.A. has received advisory or consulting fees from Axis Medical Education, Clinical Education Alliance, LLC, Merck Sharp & Dohme, PeerView Institute for Medical Education (PVI), Physicians’ Education Resource, RMEI Medical Education, LLC, and Roche. The following Authors declare no competing interests: D.K., S.Y., S.N., K.N., R.F., N.R., I.R., J.C., A.Y., and O.L. Supplementary Files Suppdataset1.xlsx Supplementary Dataset 1 Suppdataset2.xlsx Supplementary Dataset 2 Suppdataset3.xlsx Supplementary Dataset 3 Suppdataset4.xlsx Supplementary Dataset 4 SuppFigure1.png Supplementary Figure 1: Representative short tandem repeat (STR) analysis of CB samples with matched plasma normal samples demonstrating low- and high-level contamination by non-patient DNA as evidenced by additional peaks in the tumor sample not seen in the normal sample. SuppFigure2.png Supplementary Figure 2: The distribution of total sample coverage between cytology cases that identified no somatic alterations and those that identified at least one somatic alteration compared for CB (p = 0.19) and ScfDNA ( p = 0.91) samples. The p value was assessed by a two-tailed Mann-Whitney test. All boxplots show the median (center line with value) and 25th and 75th percentiles (bounding box) along with the 1.5 interquartile range (whiskers). SuppFigure3.png Supplementary Figure 3: a Non-patient DNA contamination rate distribution of cytology samples and any matching surgical sample from the same patient tumor. The horizontal dashed line indicates a contamination rate of 2% for which samples above this threshold has a significant concern for contamination. While the majority of samples for both cytology and paired surgical samples were below 2%, there was a higher total number of samples above this threshold for cytology samples (26 cytology samples vs 5 surgical samples). The p value was assessed by a two-tailed Mann-Whitney test. All boxplots show the median (center line with value) and 25th and 75th percentiles (bounding box) along with the 1.5 interquartile range (whiskers). b The distribution of contamination rates between cytology sample and their matching surgical sample charted by sample coverage. The dashed vertical line indicates the threshold contamination rate of 0.02 and the dashed horizontal line denotes the threshold for adequate coverage (200x). Samples in the top-right portion indicate a high contamination rate in the face of adequate coverage, whereas samples in the bottom-right had low coverage that may falsely elevate the contamination rate. Cite Share Download PDF Status: Published Journal Publication published 02 Jan, 2025 Read the published version in Nature Communications → Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-4746484","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":330248653,"identity":"b52bd0ac-340a-436c-bf46-017f85dd5b79","order_by":0,"name":"David Kim","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAs0lEQVRIiWNgGAWjYBACCWYg/lABJEjSIjnjDElagFiat40ULZLt3Ik3Z87bZi/ZfvgBw8c9tYS1SDPzbrb4uO124myeNAPGGc+OE9Yix8y7TXLmttsJchI8DMw8B44Rp0Wad85te+K1SIO1NNxmnA3RUkNYi2Qz72bLGcduJ87sSTM4OOPAAcJaJM6f3XjjQ81te4njhx8++HCgjrAWFAC04jCJWoCAVFtGwSgYBaNgJAAAvJw4uuKs89oAAAAASUVORK5CYII=","orcid":"","institution":"Memorial Sloan Kettering Cancer Center","correspondingAuthor":true,"prefix":"","firstName":"David","middleName":"","lastName":"Kim","suffix":""},{"id":330248654,"identity":"0a75793a-87bd-4883-8c36-311a1e4c560c","order_by":1,"name":"Chad Vanderbilt","email":"","orcid":"https://orcid.org/0000-0002-8114-0237","institution":"Memorial Sloan Kettering Cancer Center","correspondingAuthor":false,"prefix":"","firstName":"Chad","middleName":"","lastName":"Vanderbilt","suffix":""},{"id":330248655,"identity":"8604768d-8e81-4d45-87af-e01fd0730607","order_by":2,"name":"Soo-Ryum Yang","email":"","orcid":"","institution":"Memorial Sloan Kettering Cancer Center","correspondingAuthor":false,"prefix":"","firstName":"Soo-Ryum","middleName":"","lastName":"Yang","suffix":""},{"id":330248656,"identity":"5a888251-fd3e-4c39-8903-e3aa7ebb261b","order_by":3,"name":"Subhiksha Nandakumar","email":"","orcid":"","institution":"Memorial Sloan Kettering Cancer Center","correspondingAuthor":false,"prefix":"","firstName":"Subhiksha","middleName":"","lastName":"Nandakumar","suffix":""},{"id":330248657,"identity":"81289ad9-46c5-429f-9851-16422289e382","order_by":4,"name":"Khedoudja Nafa","email":"","orcid":"","institution":"Memorial Sloan Kettering Cancer Center","correspondingAuthor":false,"prefix":"","firstName":"Khedoudja","middleName":"","lastName":"Nafa","suffix":""},{"id":330248658,"identity":"3296e952-5806-4c5d-b0a5-14704a512ce2","order_by":5,"name":"Rusmir Feratovic","email":"","orcid":"","institution":"Memorial Sloan Kettering Cancer Center","correspondingAuthor":false,"prefix":"","firstName":"Rusmir","middleName":"","lastName":"Feratovic","suffix":""},{"id":330248659,"identity":"b1c54a47-67e6-4acf-bbc3-85ab0b07eaed","order_by":6,"name":"Natasha Rekhtman","email":"","orcid":"","institution":"Memorial Sloan Kettering Cancer Center","correspondingAuthor":false,"prefix":"","firstName":"Natasha","middleName":"","lastName":"Rekhtman","suffix":""},{"id":330248660,"identity":"e950674d-b233-4a0b-9032-6ffad7aa8568","order_by":7,"name":"Ivelise Rijo","email":"","orcid":"","institution":"Memorial Sloan Kettering Cancer Center","correspondingAuthor":false,"prefix":"","firstName":"Ivelise","middleName":"","lastName":"Rijo","suffix":""},{"id":330248661,"identity":"069d2408-52d1-41be-a4e0-03a7ee66bfea","order_by":8,"name":"Jacklyn Casanova","email":"","orcid":"https://orcid.org/0000-0002-3586-7048","institution":"Memorial Sloan Kettering Cancer Center","correspondingAuthor":false,"prefix":"","firstName":"Jacklyn","middleName":"","lastName":"Casanova","suffix":""},{"id":330248662,"identity":"1cfcf9ae-0953-4836-b617-c0aa1a534ce2","order_by":9,"name":"Anita Yun","email":"","orcid":"","institution":"Memorial Sloan Kettering Cancer Center","correspondingAuthor":false,"prefix":"","firstName":"Anita","middleName":"","lastName":"Yun","suffix":""},{"id":330248663,"identity":"367eb0aa-3301-463b-90be-be957b06f038","order_by":10,"name":"Angela Rose Brannon","email":"","orcid":"https://orcid.org/0000-0003-4610-0149","institution":"Memorial Sloan Kettering","correspondingAuthor":false,"prefix":"","firstName":"Angela","middleName":"Rose","lastName":"Brannon","suffix":""},{"id":330248664,"identity":"52c953fd-eaec-469b-b569-760d10f6af1d","order_by":11,"name":"Michael Berger","email":"","orcid":"https://orcid.org/0000-0003-3882-5000","institution":"Memorial Sloan Kettering Cancer Center","correspondingAuthor":false,"prefix":"","firstName":"Michael","middleName":"","lastName":"Berger","suffix":""},{"id":330248665,"identity":"262e60dd-c79e-4bf1-8622-99dbe0b562a7","order_by":12,"name":"Marc Ladanyi","email":"","orcid":"https://orcid.org/0000-0001-9055-7213","institution":"Memorial Sloan Kettering Cancer Center","correspondingAuthor":false,"prefix":"","firstName":"Marc","middleName":"","lastName":"Ladanyi","suffix":""},{"id":330248666,"identity":"704de7f2-fdfd-43ae-942e-59a63a6a7a42","order_by":13,"name":"Oscar Lin","email":"","orcid":"","institution":"Memorial Sloan Kettering Cancer Center","correspondingAuthor":false,"prefix":"","firstName":"Oscar","middleName":"","lastName":"Lin","suffix":""},{"id":330248667,"identity":"a121a534-461c-45a3-8ca6-5c9b97727937","order_by":14,"name":"Maria Arcila","email":"","orcid":"https://orcid.org/0000-0001-5824-6554","institution":"Memorial Sloan Kettering Cancer Center","correspondingAuthor":false,"prefix":"","firstName":"Maria","middleName":"","lastName":"Arcila","suffix":""}],"badges":[],"createdAt":"2024-07-16 02:50:09","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-4746484/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-4746484/v1","draftVersion":[],"editorialEvents":[{"content":"https://doi.org/10.1038/s41467-024-55456-8","type":"published","date":"2025-01-02T05:00:00+00:00"}],"editorialNote":"","failedWorkflow":false,"files":[{"id":60912202,"identity":"838d0b58-87d1-4e03-924d-5e3261c22c2c","added_by":"auto","created_at":"2024-07-23 13:00:56","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":1202518,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eOverview of clinical cytology samples profiled by MSK-IMPACT\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003ea\u003c/strong\u003e The composition of the cytology cohort by sample preparation type, either cellblock (CB) or supernatant cfDNA (ScfDNA), and their respective collection method of either fine needle aspiration (FNA) or cytology fluid (e.g. pleural fluids, bronchial washes, ascites fluid, etc.). \u003cstrong\u003eb \u003c/strong\u003eThe distribution of cancer types profiled by MSK-IMPACT clinically from cytology samples. The most common cancer types are at the top and are ordered in descending order for specific cancer types. Represented are all cytology samples and by sample type. \u003cstrong\u003ec \u003c/strong\u003eThe distribution of testing outcome and sample type of cytology samples. In cytology cases that failed testing for MSK-IMPACT the cause of failure is further broken down with the corresponding number of sample type for each reason.\u003c/p\u003e","description":"","filename":"Figure1.png","url":"https://assets-eu.researchsquare.com/files/rs-4746484/v1/1fc3033f4a881e9de6988392.png"},{"id":60911000,"identity":"4ed617fd-19ea-4711-a005-e7f6e0ab2bd6","added_by":"auto","created_at":"2024-07-23 12:52:56","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":1521870,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eDeterminants of successful mutational profiling by MSK-IMPACT in cytology samples.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003ea\u003c/strong\u003e Success rates of MSK-IMPACT testing on cytology samples by year and sample type. The colors denote successfully tested samples, samples failed due to scant/low tumor tissue, failed due to DNA content below sequencing thresholds, and all other failures (e.g. low DNA quality, contamination, etc.). Throughout the study period the panel genes increased in number and are denoted in the top bar by the number of panel genes included for that year. Various key optimization efforts for cytology samples were introduced in the clinical workflow at the corresponding timepoints including the use of a modified HistoGel cell-block preparation (1), improved bead-extraction procedures (2), dual-indexed libraries (3), and decreased lower DNA input for sequencing (4). \u003cstrong\u003eb\u003c/strong\u003e Success rates charted similar to that seen in (2a) but including only samples that were deemed to have adequate tumor for sequencing. The success rates therefore indicate the success rate of cytology sampled deemed to have adequate tumor on manual review as opposed to the overall testing success rate of a cytology sample received for testing. \u003cstrong\u003ec\u003c/strong\u003eOverall sequencing success rates on MSK-IMPACT for cytology samples charted by year and stratified by samples processed at an external laboratory (External), samples processed at the study institution (Inhouse), and all cytology cases (All Cases). Chi-squared test revealed a significant difference in the years 2019, 2021, and 2022 with a p value less than 0.01. \u003cstrong\u003ed\u003c/strong\u003e The distribution of tumor purity for cytology samples by outcome of MSK-IMPACT testing for CB (p = 0.00039) and ScfDNA (p = 0.63). \u003cstrong\u003ee\u003c/strong\u003e Comparisons of total extracted DNA yields for CB and ScfDNA cytology samples (p = 2.2 x 10-16). Group comparisons for continuous data (e,d) were performed with a two-tailed Mann-Whitney test set at a p \u0026lt; 0.01. All boxplots show the median (center line with value) and 25th and 75th percentiles (bounding box) along with the 1.5 interquartile range (whiskers).\u003c/p\u003e","description":"","filename":"Figure2.png","url":"https://assets-eu.researchsquare.com/files/rs-4746484/v1/24877105c9bab1803c17188a.png"},{"id":60913243,"identity":"a280d67b-7524-4885-8c5f-c1e3849de181","added_by":"auto","created_at":"2024-07-23 13:08:56","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":1249095,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eSequencing quality metrics of profiled cytology samples.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003ea \u003c/strong\u003eSample coverage distribution by sequenced CB and ScfDNA cytology samples (\u003cem\u003ep = \u003c/em\u003e2.2 x 10\u003csup\u003e-16\u003c/sup\u003e). The \u003cem\u003ep \u003c/em\u003evalue was obtained by a two-sided Mann-Whitney test. The dashed horizontal line indicates the 200x coverage mark in which all samples tested by MSK-IMPACT are deemed to have low coverage with concerns for false negative assessment. The solid red line indicates the 50x mark for which all samples with lower coverage are highly considered to be failed due to low coverage. \u003cstrong\u003eb\u003c/strong\u003e The distribution of total sample coverage compared between samples before and after the decrease in DNA threshold for sequencing by MSK-IMPACT testing outcome. The sample coverage distributions were further stratified by cytology sample type. Comparisons of sample coverage before and after lowered DNA threshold for failed CB samples (\u003cem\u003ep\u003c/em\u003e = 0.79), successful CB samples (\u003cem\u003ep\u003c/em\u003e = 2.2 x 10\u003csup\u003e-16\u003c/sup\u003e), failed ScfDNA samples (\u003cem\u003ep \u003c/em\u003e= 0.31), and successful ScfDNA samples (p = 0.69) were performed with a two-tailed Mann-Whitney test. \u003cstrong\u003ec\u003c/strong\u003e The distribution of non-patient DNA contamination rates and mean (red dot), as determined by comparing homozygous SNP sites between the sequenced tumor and matched patient normal sample, by sample type. The dashed horizontal line indicates a contamination rate of 0.02 for which samples with a higher rate are considered to have a concern for non-patient DNA contamination. The number and percentage of samples that exceed this threshold of 0.02 are shown adjacent to the representative bracket for each cytology sample type. \u003cstrong\u003ed\u003c/strong\u003e The distribution of contamination rates between cytology sample preparation method charted by sample coverage. Samples are colored based on sequencing result. The dashed vertical line indicates the threshold contamination rate of 0.02 and the dashed horizontal line denotes the threshold for adequate coverage (200x). Samples in the top-right quadrant indicate a high contamination rate in the face of adequate coverage, whereas samples in the bottom-right had low coverage that may falsely elevate the contamination rate. All boxplots show the median (center line with value) and 25th and 75th percentiles (bounding box) along with the 1.5 interquartile range (whiskers).\u003c/p\u003e","description":"","filename":"Figure3.png","url":"https://assets-eu.researchsquare.com/files/rs-4746484/v1/b5787f76b8e14bd5ab0ac18c.png"},{"id":60912205,"identity":"fb1092b7-47a5-4bbc-838e-01568e3bafd8","added_by":"auto","created_at":"2024-07-23 13:00:56","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":1561810,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eComparison of mutation calls between cytology cellblock and supernatant cfDNA samples.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003ea\u003c/strong\u003e Comparative oncoprints of significant genomic alterations in three common tumor types profiled (bladder cancer, breast cancer, and non-small cell lung cancer) for samples with reported alterations identified from CB and ScfDNA samples. The significance of genomic alterations are color coded by corresponding OncoKB level. Sample level tumor mutational burden (TMB, mutations per megabase) is provided for each corresponding sample at the top. Comparative oncoprints of significant genomic alterations in three common tumor types profiled (bladder cancer, breast cancer, and non-small cell lung cancer) for samples with reported alterations identified from \u003cstrong\u003e(b)\u003c/strong\u003e CB and \u003cstrong\u003e(c) \u003c/strong\u003eScfDNA samples. The significance of genomic alterations are color coded by corresponding OncoKB level. Sample level tumor mutational burden (TMB, mutations per megabase) is provided for each corresponding sample at the top with the horizontal dashed line indicating 10 mutations/megabase, for which sample with a higher number are TMB-High.\u003c/p\u003e","description":"","filename":"Figure4.png","url":"https://assets-eu.researchsquare.com/files/rs-4746484/v1/6636714667be137a5c79f2de.png"},{"id":60911003,"identity":"f25d583a-ea11-4112-b873-25f7d29d04c0","added_by":"auto","created_at":"2024-07-23 12:52:56","extension":"png","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":2175187,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eComparison of mutation calls by MSK-IMPACT between matched cytology and surgical samples.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eCytology samples with a corresponding surgical sample (e.g. core biopsy, resection, etc.) of the same patient tumor profiled by MSK-IMPACT (n = 526) for comparison were identified. A total of 482 cytology CB samples profiled by MSK-IMPACT had a corresponding surgical sample profiled by MSK-IMPACT. \u003cstrong\u003ea \u003c/strong\u003eThe number of cases tallied by the proportion of genomic alterations identified on the surgical sample that was also identified on the corresponding cytology CB\u003cstrong\u003e. b \u003c/strong\u003eThe distribution of variant allele frequency (VAF) of shared mutations identified on both cytology CB samples and corresponding surgical samples (\u003cem\u003ep \u003c/em\u003e= 4.9 x 10\u003csup\u003e-11\u003c/sup\u003e) of the same patient tumor. \u003cstrong\u003ec \u003c/strong\u003eVenn diagram representing the total mutation calls in cytology CB samples only, their corresponding surgical sample only, and those found in both. \u003cstrong\u003ed \u003c/strong\u003eThe proportion of significant alterations identified in the cytology CB and corresponding pairs analyzed by if the mutation was identified exclusively in the cytology CB, corresponding surgical sample, or if it was seen in both (shared). The same analysis performed for ScfDNA samples (n = 44) with the proportion of genomic alterations identified (\u003cstrong\u003ee)\u003c/strong\u003e, VAF distribution of shared alterations \u003cstrong\u003e(f)\u003c/strong\u003e, venn diagram of mutation calls \u003cstrong\u003e(g)\u003c/strong\u003e, and proportion of significant alterations identified by tissue sample and those seen in both \u003cstrong\u003e(h)\u003c/strong\u003e.The \u003cem\u003ep \u003c/em\u003evalues were assessed as group comparisons for continuous data with a two-tailed Mann-Whitney test set at a \u003cem\u003ep \u0026lt; \u003c/em\u003e0.01. All boxplots show the median (center line) and 25th and 75th percentiles (bounding box) along with the 1.5 interquartile range (whiskers).\u003c/p\u003e","description":"","filename":"Figure5.png","url":"https://assets-eu.researchsquare.com/files/rs-4746484/v1/60c7a10d246ccf696319858d.png"},{"id":72949991,"identity":"3cc52572-e31f-47e9-82ff-8214d07ad0fb","added_by":"auto","created_at":"2025-01-04 08:19:33","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":8993810,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-4746484/v1/d7bbf776-37dd-45d5-8ac7-93abc6230005.pdf"},{"id":60912203,"identity":"f57001ca-059c-4d4a-92f4-4d22125b0993","added_by":"auto","created_at":"2024-07-23 13:00:56","extension":"xlsx","order_by":2,"title":"","display":"","copyAsset":false,"role":"supplement","size":10207,"visible":true,"origin":"","legend":"\u003cp\u003eSupplementary Dataset 1\u003c/p\u003e","description":"","filename":"Suppdataset1.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-4746484/v1/fc1232431f8fdbcbf3e310e9.xlsx"},{"id":60911004,"identity":"7b0a7aa9-f085-4180-b5b4-e8ca3c2684ce","added_by":"auto","created_at":"2024-07-23 12:52:56","extension":"xlsx","order_by":3,"title":"","display":"","copyAsset":false,"role":"supplement","size":14677,"visible":true,"origin":"","legend":"\u003cp\u003eSupplementary Dataset 2\u003c/p\u003e","description":"","filename":"Suppdataset2.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-4746484/v1/283ca17ec9edb94b36940d2f.xlsx"},{"id":60911006,"identity":"280bdced-cfe4-49c1-8140-eb6282813810","added_by":"auto","created_at":"2024-07-23 12:52:56","extension":"xlsx","order_by":4,"title":"","display":"","copyAsset":false,"role":"supplement","size":10603,"visible":true,"origin":"","legend":"\u003cp\u003eSupplementary Dataset 3\u003c/p\u003e","description":"","filename":"Suppdataset3.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-4746484/v1/52dcc9eb4caf81d57f6f01bc.xlsx"},{"id":60910999,"identity":"bd400f87-46e7-470d-bfdd-3d6aea42b051","added_by":"auto","created_at":"2024-07-23 12:52:56","extension":"xlsx","order_by":5,"title":"","display":"","copyAsset":false,"role":"supplement","size":10848,"visible":true,"origin":"","legend":"Supplementary Dataset 4","description":"","filename":"Suppdataset4.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-4746484/v1/f3f8a7e9ada14c65fb9abc7a.xlsx"},{"id":60912204,"identity":"2cca686c-46cb-4baf-bb6f-bbe3fc9f758c","added_by":"auto","created_at":"2024-07-23 13:00:56","extension":"png","order_by":6,"title":"","display":"","copyAsset":false,"role":"supplement","size":725587,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eSupplementary Figure 1: \u003c/strong\u003eRepresentative short tandem repeat (STR) analysis of CB samples with matched plasma normal samples demonstrating low- and high-level contamination by non-patient DNA as evidenced by additional peaks in the tumor sample not seen in the normal sample.\u003c/p\u003e","description":"","filename":"SuppFigure1.png","url":"https://assets-eu.researchsquare.com/files/rs-4746484/v1/d95a454d6257c31928787d52.png"},{"id":60911010,"identity":"21c4c11a-61c0-4edb-9201-a50e001a6faf","added_by":"auto","created_at":"2024-07-23 12:52:56","extension":"png","order_by":7,"title":"","display":"","copyAsset":false,"role":"supplement","size":270522,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eSupplementary Figure 2: \u003c/strong\u003eThe distribution of total sample coverage between cytology cases that identified no somatic alterations and those that identified at least one somatic alteration compared for CB (p = 0.19) and ScfDNA (\u003cem\u003ep\u003c/em\u003e = 0.91) samples. The \u003cem\u003ep value\u003c/em\u003ewas assessed by a two-tailed Mann-Whitney test. All boxplots show the median (center line with value) and 25th and 75th percentiles (bounding box) along with the 1.5 interquartile range (whiskers).\u003c/p\u003e","description":"","filename":"SuppFigure2.png","url":"https://assets-eu.researchsquare.com/files/rs-4746484/v1/18da9d1c05d50ca20d8dfffe.png"},{"id":60911011,"identity":"d9e927b5-f665-4b6d-813d-dae8421771d8","added_by":"auto","created_at":"2024-07-23 12:52:56","extension":"png","order_by":8,"title":"","display":"","copyAsset":false,"role":"supplement","size":952168,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eSupplementary Figure 3: a\u003c/strong\u003e Non-patient DNA contamination rate distribution of cytology samples and any matching surgical sample from the same patient tumor. The horizontal dashed line indicates a contamination rate of 2% for which samples above this threshold has a significant concern for contamination. While the majority of samples for both cytology and paired surgical samples were below 2%, there was a higher total number of samples above this threshold for cytology samples (26 cytology samples vs 5 surgical samples). The \u003cem\u003ep value\u003c/em\u003e was assessed by a two-tailed Mann-Whitney test. All boxplots show the median (center line with value) and 25th and 75th percentiles (bounding box) along with the 1.5 interquartile range (whiskers). \u003cstrong\u003eb\u003c/strong\u003e The distribution of contamination rates between cytology sample and their matching surgical sample charted by sample coverage. The dashed vertical line indicates the threshold contamination rate of 0.02 and the dashed horizontal line denotes the threshold for adequate coverage (200x). Samples in the top-right portion indicate a high contamination rate in the face of adequate coverage, whereas samples in the bottom-right had low coverage that may falsely elevate the contamination rate.\u003c/p\u003e","description":"","filename":"SuppFigure3.png","url":"https://assets-eu.researchsquare.com/files/rs-4746484/v1/8400841f5039d052233ce558.png"}],"financialInterests":"\u003cb\u003eYes\u003c/b\u003e there is potential Competing Interest.\nC.V. reports intellectual property rights and equity interest in Paige.AI, Inc.\r\nA.R.B. has ownership/equity interests in Johnson and Johnson. \r\nM.B has received advisory or consulting fees from AstraZeneca, Eli Lilly and Company, and PetDx, Inc. \r\nM.L. has received advisory or consulting fees from Takeda Oncology, Janssen Pharmaceuticals, AstraZeneca, ADC Therapeutics, Paige.AI, Merck, Bayer, and Lilly Oncology and has received research funding from Loxo Oncology, Helsinn Therapeutics, Merus NV, Elevation Oncology, and Rain Therapeutics.\r\nO.L. has received advisory or consulting fees from Hologic and Janssen Research \u0026 Development, LLC.\r\nM.A. has received advisory or consulting fees from Axis Medical Education, Clinical Education Alliance, LLC, Merck Sharp \u0026 Dohme, PeerView Institute for Medical Education (PVI), Physicians’ Education Resource, RMEI Medical Education, LLC, and Roche. \r\nThe following Authors declare no competing interests: D.K., S.Y., S.N., K.N., R.F., N.R., I.R., J.C., A.Y., and O.L.","formattedTitle":"Maximizing the clinical utility and performance of cytology samples for comprehensive genetic profiling – A report on the impact of process optimization through the analysis of 4,871 cytology samples profiled by MSK-IMPACT","fulltext":[{"header":"Introduction","content":"\u003cp\u003eComprehensive tumor molecular profiling using next generation sequencing (NGS) technology is steadily increasing in routine oncologic practice, in order to guide precise disease classification and the selection of targeted therapies\u003csup\u003e1,2\u003c/sup\u003e. Concurrently, minimally invasive procedures have also steadily and systematically become a dominant tumor sampling modality. Despite indisputable patient benefits, the amount of tissue procured through such procedures is limited, raising concerns on their sufficiency and suitability for comprehensive downstream analysis.\u003c/p\u003e \u003cp\u003eCytologic specimens are among the most limited tissue samples that are obtained through minimally invasive procedures. While often the only source of tumor for both diagnostic and biomarker evaluation, judicious protocols for tissue testing have been explored to maximize the material available for genetic studies. In this context, the performance of NGS across various preparations, including cellblocks (CB), smears, and liquid-based suspensions have been studied and described in the literature primarily focusing on small gene panels (\u0026lt;\u0026thinsp;100 genes)\u003csup\u003e3\u0026ndash;11\u003c/sup\u003e. However, with the increasing need for the assessment of a wider range of genetic alterations, the sufficiency and robust performance for comprehensive NGS assays have remained a major concern.\u003c/p\u003e \u003cp\u003eTo address this, our pathology department embarked on a comprehensive performance improvement project which involved several years of sequential process optimization to strategically improve the use of cytologic tissue samples for molecular profiling. This encompassed coordinated changes by the cytology lab, including the use of a modified HistoGel based cell-block processing to improve pellet density\u003csup\u003e12,13\u003c/sup\u003e, as well as changes in the diagnostic molecular lab in tissue processing such as deparaffinization with mineral oil\u003csup\u003e14\u0026ndash;16\u003c/sup\u003e, improved bead-based extraction techniques, implementation of dual index sequencing and adjustments of minimum DNA input requirements \u003cb\u003e(Supplementary Table\u0026nbsp;1).\u003c/b\u003e Throughout this time, we also implemented the use of residual cytology supernatant fluids as an additional source of tumor DNA for NGS applications. Commonly discarded in routine cytology practice, supernatant fluids contain variable amounts of DNA from fragmented cells as well as whole cells, denoted here-on as supernatant cell-free DNA (ScfDNA)\u003csup\u003e16\u003c/sup\u003e. The strategic use of this DNA enables the preservation of cellular tissue for other ancillary studies that rely on visual assessment of intact cells, such as immunohistochemistry and cytogenetics.\u003c/p\u003e \u003cp\u003eThis study presents our overall clinical experience using cytologic material for comprehensive NGS, integrating the use of ScfDNA as a rescue sample when other material is unavailable. To our knowledge, this is the largest cytology cohort to date including the largest cohort of residual supernatant fluid, across a wide range of tumor types. We re-analyzed all cytology sample data collected from our institution-wide prospective sequencing effort using the Memorial Sloan Kettering-Integrated Mutation Profiling of Actionable Cancer Targets (MSK-IMPACT \u0026trade;) assay, an FDA cleared, paired tumor-normal hybridization-capture based NGS test, designed to comprehensively assess mutations, copy number alterations, and select rearrangements\u003csup\u003e17\u003c/sup\u003e. A summary of the performance characteristics across years of process optimization is presented, describing the utility and potential pitfalls of cytology samples for the identification of clinically relevant biomarkers, with comparisons to existing sequencing data from biopsies and resections from the same patients.\u003c/p\u003e"},{"header":"Materials and methods","content":"\u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003ePatient consent and cohort selection\u003c/h2\u003e \u003cp\u003eThe prospectively maintained database of samples submitted for sequencing using our institution\u0026rsquo;s large-panel NGS assay (MSK-IMPACT\u0026trade;) between the years 2014 and August 2022 was queried to identify all cytology samples; this included all requests on samples deemed to be malignant by morphologic assessment prior to their assessment of suitability for sequencing. MSK-IMPACT\u0026trade; testing was ordered by the treating physician to identify clinically significant genomic alterations for the clinical care of patients with cancer. Patient\u0026rsquo;s receiving testing signed a clinical consent form and was enrolled on an institutional IRB-approved research protocol (MSKCC; NCT01775072). Additionally following consent, a patient blood draw was obtained as a source for normal (germline) DNA. Basic demographic data (age and sex) and any existing pre-analytic information, including the type of preparation, tissue source, tumor type, tumor content, and DNA yield were collected. All sequencing data, encompassing QC metrics, sequencing coverage, somatic variants identified, and variant allele frequency (VAF) were gathered, as well as sequencing qualification (pass, fail, reason for failure) as established at the time of clinical signout. When available, the same sequencing metrics and information described above were also collected for corresponding biopsy and resections samples from the same patient tumor, to compare results side-by-side. All samples were collected with informed consent and testing was performed in our CLIA certified laboratory. This study was approved by the MSKCC Institutional Review Board/Privacy Board.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec4\" class=\"Section2\"\u003e \u003ch2\u003eCytology sample processing\u003c/h2\u003e \u003cp\u003eCytologic samples were received as formalin fixed paraffin embedded (FFPE) tissue sections from cell block (CB) preparations or as supernatants. Samples were either collected in CytoLyt fixative (Hologic, Malborough, MA, USA) or in 10% neutral buffered formalin fixative and were paraffin embedded (FFPE). CB preparation for MSKCC procured samples followed a modified HistoGel-Based Cell Block Preparation Method as previously described\u003csup\u003e12,13\u003c/sup\u003e. Procedural details of externally procured samples (cases submitted for review at MSKCC for diagnosis confirmation and IMPACT testing) were not available. For each case, 20 unstained sections (5 um thick) were submitted mounted on glass slides, along with a hematoxylin and eosin\u0026ndash;stained (H\u0026amp;E) section to assess adequacy and tumor fraction. Macro-dissection was performed to enrich for tumor, when possible and necessary, aiming for \u0026gt;\u0026thinsp;50% tumor cell content. Samples were rejected / failed if the tumor proportion was \u0026lt;\u0026thinsp;10% and the sample was not amenable to manual enrichment.\u003c/p\u003e \u003cp\u003eFor MSKCC samples that were collected in CytoLyt fluid, residual material was saved after the ThinPrep\u0026reg; and CB were prepared. The corresponding ThinPrep\u0026reg; slide was assessed as a surrogate for tumor presence and content. If tumor cells were present at \u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e\u0026ge;\u003c/span\u003e\u0026thinsp;10% based on visual inspection, the supernatant was considered suitable and DNA was extracted. Further details of the processing of supernatants for ScfDNA extraction are described in a previous publication\u003csup\u003e16\u003c/sup\u003e.\u003c/p\u003e \u003cp\u003eExtraction procedures:\u003c/p\u003e \u003cp\u003eFFPE material was deparaffinized using Citrasolv (2014 to March 2016) or mineral oil (March 2016 to 2022) and DNA was extracted using the Chemagic STAR DNATissue-10 Kit (Perkin Elmer, Waltham, MA) with the magnetic-bead method automated on a Chemagic STAR Standard Solutions Workstation (Hamilton, Bonaduz, GR, Switzerland), following manufacturer\u0026rsquo;s protocols. DNA from supernatants was extracted using the same kit and automated system, eliminating deparaffinization and overnight lysis incubation at 56\u003csup\u003e0\u003c/sup\u003eC and, instead, 1 hour lysis incubation was used (56\u003csup\u003e0\u003c/sup\u003eC).\u003c/p\u003e \u003cp\u003eExtracted DNA was eluted and quantified using a Qubit DNA high-sensitivity assay kit (Life Technologies, Carlsbad, CA). FFPE samples with DNA concentration of \u0026lt;\u0026thinsp;0.9 ng/uL were deemed insufficient for further testing until 09/2021 when the threshold was lowered to 0.54ng/uL to proceed with sequencing, which translates to minimal total inputs of 50 and 30ng, respectively based on maximal volume inputs for the assay of 55ul. ScfDNA samples were sequenced below these thresholds aiming to spare the patient from a future biopsy and to further evaluate performance characteristics.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec5\" class=\"Section2\"\u003e \u003ch2\u003eNext Generation Sequencing\u003c/h2\u003e \u003cp\u003eDNA was sheared and processed (along with matched DNA from blood as normal control) to generate bar-coded libraries which were pooled and subjected to targeted capture using custom-designed probes as previously detailed\u003csup\u003e17\u003c/sup\u003e. All samples in this study underwent testing by MSK-IMPACT \u0026trade;, targeting all coding regions of up to 505 genes, select introns and over 1,000 custom intergenic and intronic regions throughout the genome (centered on common SNPs). Updates to the panel sequentially increased the number of genes captured from 341 (year 2014), 410 (years 2015\u0026ndash;2016), 468 (years 2017\u0026ndash;2020), to 505 (years 2021\u0026ndash;2022). Captured DNA fragments were sequenced on an Illumina HiSeq2500 or NovaSeq 6000 system, before being submitted to the bioinformatics analysis pipeline for calling of somatic alterations. Clinical actionability and treatment associations of the genomic alterations detected were assessed and annotated using OncoKB\u003csup\u003e18\u003c/sup\u003e MSK\u0026rsquo;s precision oncology knowledge base. Levels of evidence are assigned to each alteration based upon therapeutic levels of evidence specific to the tumor type profiled including alterations predictive of resistance to a therapy. Results were compared to the previously published data for corresponding tumor types in the AACR Project GENIE Pan-Cancer Cohort\u003csup\u003e19\u003c/sup\u003e.\u003c/p\u003e \u003cp\u003eTumor mutational burden (TMB) was calculated for each sample as the total number of non-synonymous mutations, including driver mutations in oncogenes, normalized to the exonic coverage of the respective MSK-IMPACT panel in megabases (Mb).\u003c/p\u003e \u003cp\u003eSamples were deemed clinically successful if they passed all quality control metrics defined for our assay (e.g. adequate tumor quantity, coverage, base quality, etc.) and were formally reviewed by a board-certified molecular pathologist before the report was released clinically.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec6\" class=\"Section2\"\u003e \u003ch2\u003eNext generation sequencing quality metrics and contamination assessment\u003c/h2\u003e \u003cp\u003eFor QC purposes, in addition to standard NGS quality metrics, assessment for potential sample contamination was a critical component of our assessment. Our established analysis pipelines compute pairwise genotype concordance across all SNP sites included in the panel. This unique genotype analysis, enabled by our paired tumor:normal sequencing approach, allows us to identify potential sample swaps and contamination, either due to the presence of DNA from another individual or contamination among different barcode adapters, which could lead to erroneous mutation calling. Contamination levels are defined by the analysis of SNP sites at which the patient is homozygous (based on normal control profile). Because a homozygous site is defined by 2 identical alleles at the particular genetic locus, any allelic discrepancy where the variant is not expected indicates contamination. A cutoff of \u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e\u0026ge;\u003c/span\u003e\u0026thinsp;2% is used to denote clinically significant contamination (the threshold for mutation calling). Samples with a contamination rate higher than 2% were evaluated in the context of the tumor content and mutation profile; low level contamination in samples with very high tumor content, remained partially evaluable by filtering variants within the range of contamination.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec7\" class=\"Section2\"\u003e \u003ch2\u003eSample type comparisons and concordance analysis\u003c/h2\u003e \u003cp\u003eTo further assess the performance of cytologic samples compared to larger tissue samples, existing MSK-IMPACT sequencing data from corresponding tissue core biopsies or subsequent resection samples from the same tumor were obtained. Once matched, metrics including sequencing coverage, genomic alterations identified, mutation VAFs, and OncoKB levels were compared to the cytologic counterpart. In a subset of cytology cases, results from CB deemed adequate for testing were also compared to the corresponding ScfDNA. This data was analyzed separately to avoid duplication.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec8\" class=\"Section2\"\u003e \u003ch2\u003eStatistical Analysis\u003c/h2\u003e \u003cp\u003eStatistical analysis for group comparisons of continuous data were performed using a two-tailed Mann-Whitney test. A Pearson\u0026rsquo;s Chi-squared test was performed for comparing three or more categorical groups. A Fischer\u0026rsquo;s exact test was performed for comparing two categorical groups. Statistical significance was set at p\u0026thinsp;\u0026lt;\u0026thinsp;0.05. Cases with missing values were removed from the analyses and only complete cases were considered. All statistics and graphical representations were performed using R project.\u003c/p\u003e \u003c/div\u003e"},{"header":"Results","content":"\u003cdiv id=\"Sec10\" class=\"Section2\"\u003e \u003ch2\u003eClinicopathologic characteristics of cytology sequencing cohort\u003c/h2\u003e \u003cp\u003eIn total, 4,871 cytology tumor samples from 4,633 patients were received for MSK-IMPACT testing with patient demographics detailed in Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e. Most samples were from CB preparations, 94.2% (4,588/4,871) while 5.8% (283/4,871) were received as ScfDNA. Of note, ScfDNA testing was requested only when no other material was suitable or available. The majority, 63%, were procured at MSKCC and processed internally while 37% were submitted from outside institutions (Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e). Testing was cancelled on 3% (146/4,871) prior to sample processing for logistical considerations, including the lack of a submitted normal control for matched testing or testing no longer relevant for patient management. A diverse array of tissue sites and sample types were profiled as detailed in Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003ea and \u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003eb. The cohort encompassed 181 unique tumor types with lung and pancreatic adenocarcinomas being the most common. The number of samples and relative frequencies of the profiled tumor types are further summarized in \u003cb\u003eSupplementary Table\u0026nbsp;2.\u003c/b\u003e\u003c/p\u003e \n\u003cp\u003e\u003cstrong\u003eTable 1: Demographic information of study cohort\u003c/strong\u003e\u003c/p\u003e\n\u003cdiv\u003e\n \u003ctable border=\"0\" cellspacing=\"0\" cellpadding=\"0\"\u003e\n \u003cthead\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u003cstrong\u003eCharacteristic\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u003cstrong\u003eOverall\u003c/strong\u003e, N = 4,871\u003cem\u003e\u003csup\u003e1\u003c/sup\u003e\u003c/em\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u003cstrong\u003eCB\u003c/strong\u003e, N = 4,588\u003cem\u003e\u003csup\u003e1\u003c/sup\u003e\u003c/em\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u003cstrong\u003eScfDNA\u003c/strong\u003e, N = 283\u003cem\u003e\u003csup\u003e1\u003c/sup\u003e\u003c/em\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u003cstrong\u003ep-value\u003c/strong\u003e\u003cem\u003e\u003csup\u003e2\u003c/sup\u003e\u003c/em\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003c/thead\u003e\n \u003ctbody\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u003cstrong\u003eAge\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e66 (59, 74)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e66 (58, 74)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e67 (60, 74)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e0.4\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u003cstrong\u003eSex\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e0.2\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp; \u0026nbsp; F\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e2,639 (54%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e2,497 (54%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e142 (50%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp; \u0026nbsp; M\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e2,232 (46%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e2,091 (46%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e141 (50%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u003cstrong\u003eOutside/Internal\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026lt;0.001\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp; \u0026nbsp; Internal\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e3,091 (63%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e2,809 (61%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e282 (100%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp; \u0026nbsp; Outside\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e1,780 (37%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e1,779 (39%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e1 (0.4%)\u003cem\u003e\u003csup\u003e3\u003c/sup\u003e\u003c/em\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u003cstrong\u003eProcedure\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026lt;0.001\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp; \u0026nbsp; Fluid\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e693 (14%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e633 (14%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e60 (21%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp; \u0026nbsp; FNA\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e4,178 (86%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e3,955 (86%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e223 (79%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd colspan=\"5\" valign=\"top\"\u003e\n \u003cp\u003e\u003cem\u003e\u003csup\u003e1\u003c/sup\u003e\u003c/em\u003eMedian (IQR); n (%)\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd colspan=\"5\" valign=\"top\"\u003e\n \u003cp\u003e\u003cem\u003e\u003csup\u003e2\u003c/sup\u003e\u003c/em\u003eWilcoxon rank sum test; Pearson\u0026apos;s Chi-squared test\u003c/p\u003e\n \u003cp\u003e\u003csup\u003e3\u003c/sup\u003eReceived in CytoLyt and processed Internally\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003c/tbody\u003e\n \u003c/table\u003e\n\u003c/div\u003e\u003c/br\u003e\n\u003cp\u003eSuccess rate of NGS testing on Cytology Samples\u003c/p\u003e \u003cp\u003eOverall, 81% (3,806/4,725) of all samples were successfully tested. The success rate was higher for CB (81%; 3,616/4,457) samples compared to ScfDNA (71%; 190/268), noting that ScfDNA samples encompassed only cases for which the CB had already been deemed unsuitable for any analysis. Across the study period, the use of ScfDNA as a rescue sample boosted the overall success rate of the cytologic procedures from 77\u0026ndash;81%. Causes of failure, in descending order of frequency, included low DNA yield below the minimum cutoffs established for sequencing (11.3%), very scant tumor tissue (\u0026lt;\u0026thinsp;10% tumor) seen on manual review (4.6%), low sequencing coverage below a median of 50x (1.8%), high sample level DNA contamination (1.6%) and low DNA quality (0.1%) including adequate coverage but high background noise and low base quality scores. (Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003ec\u003cb\u003e)\u003c/b\u003e.\u003c/p\u003e \u003cp\u003eTo assess the impact of optimization efforts implemented across the study period, all requests (excluding cancellations) were stratified by year and testing success status. Sequential and statistically significant improvements were observed, even in the context of increased number of genes tested by panel updates, reaching 89% in the last year of assessment (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003ea). For samples deemed sufficient for sequencing (following qualification for tumor content and DNA yield), success rates were consistently high across all years (range 96\u0026ndash;98%) (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eb\u003cb\u003e)\u003c/b\u003e.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eAmong CB preparations, success rates were significantly higher for internal samples compared to those from outside laboratories, with highest success rates of 92\u0026ndash;93% (internal) and 79\u0026ndash;82% (external) in the last 2 years, in accordance with the full optimization efforts (\u003cem\u003ep\u003c/em\u003e\u0026thinsp;\u0026lt;\u0026thinsp;0.01; Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003ec, \u003cb\u003eSupplementary Table\u0026nbsp;3).\u003c/b\u003e\u003c/p\u003e \u003cp\u003eOn average, successful cytology samples had higher tumor purities compared to samples that failed testing, across both preparations (CB: \u003cem\u003ep\u003c/em\u003e\u0026thinsp;=\u0026thinsp;0.0003, ScfDNA: \u003cem\u003ep\u003c/em\u003e\u0026thinsp;=\u0026thinsp;0.63; Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003ec\u003cb\u003e)\u003c/b\u003e. Median total DNA yields were 427.5 ng and 182.2 ng (\u003cem\u003ep\u003c/em\u003e\u0026thinsp;=\u0026thinsp;2.2 x 10\u003csup\u003e\u0026minus;\u0026thinsp;16\u003c/sup\u003e) for CB and ScfDNA, respectively (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003ed). The lower DNA yield in ScfDNA was expected as these constituted rescue samples when the corresponding cytology tissue was too scant or exhausted.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec11\" class=\"Section2\"\u003e \u003ch2\u003eSequencing performance: total coverage and sample quality metrics\u003c/h2\u003e \u003cp\u003eAmong the 4,725 samples sequenced, the total median coverage was 586x. Coverages were significantly higher for CB samples compared to rescue ScfDNA samples, at 595x and 263x, respectively \u003cem\u003e(p\u003c/em\u003e\u0026thinsp;=\u0026thinsp;2.2 x 10\u003csup\u003e\u0026minus;\u0026thinsp;16\u003c/sup\u003e) \u003cb\u003e(\u003c/b\u003eFig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003ea\u003cb\u003e)\u003c/b\u003e. Despite the lower coverage, most rescue samples retained coverages above 200x, which is above our established requirements to maintain sensitivity for variants calling at 2%.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eNotably, in 2021 the minimum DNA input requirement for MSK-IMPACT was lowered from 50ng to 30ng for cell blocks. Following this change, we saw no significant differences in sequencing coverage \u003cb\u003e(\u003c/b\u003eFig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eb\u003cb\u003e)\u003c/b\u003e and sequencing success rate remained steady at 98% across sequenced cases.\u003c/p\u003e \u003cp\u003eAmong all samples sequenced, contamination checks revealed clinically relevant non-patient DNA contamination (\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e\u0026ge;\u003c/span\u003e\u0026thinsp;2%) in 5.2% of cases (246/4725) (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003ec). Excluding those that failed due to very low coverage (\u0026lt;\u0026thinsp;50X), the overall rate was 4.8% (227/4725), with a significantly higher rate for CB samples compared to ScfDNA, at 4.7% (226/4725) and 0.3% (1/4725) respectively. Notably, in the context of optimal coverage (\u0026gt;\u0026thinsp;200X), no ScfDNA samples exhibited clinically significant sample contamination \u003cb\u003e(\u003c/b\u003eFig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003ed\u003cb\u003e)\u003c/b\u003e. Also, no contamination was identified, even for samples with low coverage, following the implementation of dual indexing. By contrast, CB samples showed variable and significantly higher levels of contamination (range: 2\u0026ndash;32%), which remained present despite adequate coverage and after implementation of dual indexing. Among samples with optimal coverage, 4% (189) of the CB samples exhibited contamination rates above 2%. In 34% (65) of these samples, for which sufficient material for re-extraction and STR analysis was available, contamination could be tracked to foreign tissue material embedded in the tissue blocks. Representative cases are included in \u003cb\u003eSupplementary Fig.\u0026nbsp;1.\u003c/b\u003e\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec12\" class=\"Section2\"\u003e \u003ch2\u003eBiomarker/mutation identification and therapeutic actionability\u003c/h2\u003e \u003cp\u003eA total of 30,149 somatic alterations were detected across the 3,806 successfully sequenced cases. Of these, 93.8% of cases (3,570/3,806) harbored at least 1 somatic alteration, including 3,394 (93.9%) of CB and 176 (92.6%) of the ScfDNA samples. No significant differences in sample coverage were observed between samples with and those without alterations \u003cb\u003e(\u003c/b\u003eCB: \u003cem\u003ep\u003c/em\u003e\u0026thinsp;=\u0026thinsp;0.19; ScfDNA: \u003cem\u003ep\u003c/em\u003e\u0026thinsp;=\u0026thinsp;0.91, \u003cb\u003eSupplementary Fig.\u0026nbsp;2\u003c/b\u003e). However, tumor purity estimations were significantly lower for the subset without alterations, with a median tumor content of 10% vs 30% for those with detected alterations.\u003c/p\u003e \u003cp\u003eWhen stratified, the median number of alterations was similar for both sample types, 9 for CB (range 1\u0026ndash;170; 95% CI: 11.9\u0026ndash;12.7) and 10 for ScfDNA (range 1\u0026ndash;63; 95% CI: 8.5\u0026ndash;11.9). The average TMB was 7.64 mutations/Mb and 7.43 mutations/Mb for CB and ScfDNA samples, respectively. Overall, the mutational profiles recapitulated the expected landscape and frequency of driver and common alterations for the tumor type. Stratified by level of actionability, 65% (n\u0026thinsp;=\u0026thinsp;2487) had at least one targetable alteration as defined by the presence of an OncoKB level 1, 2, 3A, or 3B alteration and 2% (n\u0026thinsp;=\u0026thinsp;93) had a standard care resistance mutation (OncoKB level R1). The highest frequency of level 1 OncoKB alterations was observed in thyroid, breast, non-small cell lung (NSCLC), and bladder cancer patients at 58%, 58%, 45%, and 29% respectively. For resistance mutations, 87 CB and 6 ScfDNA samples identified an OncoKB level R1 alteration. To ensure that significant alterations were being identified at similar rates to non-cytology samples, results were compared to those published in the AACR GENIE cohort. Across the different histologic tumor types, similar rates of OncoKB alterations were identified \u003cb\u003e(\u003c/b\u003eFig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003ea\u003cb\u003e)\u003c/b\u003e. Representative oncoplots of the most frequent, clinically actionable alterations detected in NSCLC, Bladder Cancer, and Breast Cancer (most common tumor types in our cohort) are presented in Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003eb and \u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003ec which demonstrate the expected distributions across both CB and ScfDNA. OncoKB level 1 alterations were commonly seen in EGFR, KRAS, PIK3CA, and ERBB2 genes. ALK, BRAF, RET, and ROS1 level 1 alterations were also seen at lower frequencies.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec13\" class=\"Section2\"\u003e \u003ch2\u003eComparison with surgical core biopsies/resections\u003c/h2\u003e \u003cp\u003eTo further assess the general performance of cytology samples, we identified 526 cases (CB: n\u0026thinsp;=\u0026thinsp;482; ScfDNA: n\u0026thinsp;=\u0026thinsp;44) of patients who had a corresponding surgical sample of the same tumor assessed by MSK-IMPACT. While the same tumor was profiled across each surgical:cytology pair, it should be noted that there were variations in the time of collection across their treatment course as the samples were profiled clinically. Thus, many of the cytologic samples were collected at the time of disease progression or development of resistance. Overall, cytologic samples demonstrated similar sequencing metrics compared to their surgical biopsy/excision counterparts with adequate average coverages of 584x for cytology samples and 628x for corresponding surgical samples (\u003cem\u003ep\u003c/em\u003e\u0026thinsp;=\u0026thinsp;0.00028).\u003c/p\u003e \u003cp\u003eComparing detected alterations, a large proportion of the cytology samples identified all the alterations detected on the corresponding surgical sample, 266 (55%) CB \u003cb\u003e(\u003c/b\u003eFig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003ea\u003cb\u003e)\u003c/b\u003e and 19 (43%) ScfDNA samples \u003cb\u003e(\u003c/b\u003eFig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003ee\u003cb\u003e)\u003c/b\u003e. The median VAF\u0026rsquo;s for shared alterations were slightly higher for cytology samples, compared to the surgical pair in both CB and ScfDNA samples (CB: \u003cem\u003ep\u003c/em\u003e\u0026thinsp;=\u0026thinsp;4.9 x 10\u003csup\u003e\u0026minus;\u0026thinsp;11\u003c/sup\u003e; ScfDNA: \u003cem\u003ep\u003c/em\u003e\u0026thinsp;=\u0026thinsp;0.13) \u003cb\u003e(\u003c/b\u003eFig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003eb \u003cb\u003eand\u003c/b\u003e Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003ef\u003cb\u003e)\u003c/b\u003e.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eIn all, a total of 5,593 mutations were identified in the CB:surgical paired set, of which 2789 events (49.8%) were shared \u003cb\u003e(\u003c/b\u003eFig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003ec\u003cb\u003e)\u003c/b\u003e. For the ScfDNA:surgical paired set, 692 mutations were detected with slightly lower overlap (34.8%; Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003eg). Importantly, when alterations were stratified by level of actionability, the overwhelming majority of driver alterations with OncoKB Level 1 actionability were shared events, at 93% and 83% for the CB and ScfDNA sets, respectively. Non-detection of OncoKB Level 1 alterations in the surgical or the cytology sample was related to low coverage or low tumor content in all cases. Events categorized as Level R1 or No level showed the lowest overlap, with 27% and 42% shared events, respectively, likely reflecting differential passenger events, the acquisition of additional mutations in the time interval of the two samples, or the heterogeneous nature of resistance mechanisms in the samples. Further details are provided in Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003ed and \u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003eh \u003cb\u003eand Supplementary Table\u0026nbsp;4.\u003c/b\u003e\u003c/p\u003e \u003cp\u003eReview of contamination check data for surgical samples revealed that \u0026lt;\u0026thinsp;1% of surgical samples had clinically relevant contamination (0.81%; 5/619; \u003cb\u003eSupplementary Fig.\u0026nbsp;3a\u003c/b\u003e). Of note, all 6 surgical samples with contamination were minute biopsy samples with low tumor purity \u003cb\u003e(Supplementary Fig.\u0026nbsp;3b)\u003c/b\u003e with contamination below 4%.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec14\" class=\"Section2\"\u003e \u003ch2\u003eComparison of successful CB preparations with corresponding ScfDNA\u003c/h2\u003e \u003cp\u003eAmong CB cytology samples with adequate tumor and successful sequencing, 24 had the corresponding ScfDNA samples tested to allow direct comparisons. Both DNA concentration and sequencing coverages were significantly lower for the ScfDNA samples. Total DNA yields averaged 505ng (range: 76\u0026ndash;5453) and 1200ng (range 83.4\u0026ndash;3340) for ScfDNA and CB preparations, respectively. Accordingly, ScfDNA had resulting lower sequencing coverage averaging 387x (range 3x \u0026ndash; 1335x) compared to 669x (74x \u0026ndash; 1193x) for the corresponding CB. Of the 24 ScfDNA samples, 7 (27%) failed sequencing due to low sample coverage. Detection of clinically relevant alterations and the VAF were the same across both sample preparations based on comparison of successfully sequenced sets.\u003c/p\u003e \u003c/div\u003e"},{"header":"Discussion","content":"\u003cp\u003eComprehensive NGS sequencing is becoming a common approach for upfront assessment of a broad range of genetic biomarkers that are pivotal for diagnostic, prognostic, and therapeutic decisions in cancer patients. While ideally, molecular testing is greatly facilitated when large tumor samples are available (i.e. resections or excisional biopsies), the reality of clinical practice is that a very large proportion of testing must be performed on scant material obtained through minimally invasive procedures. Historically, this has presented distinct challenges, prompting the adoption of alternate approaches, such as liquid biopsies, which attempt to circumvent tumoral cell assessment altogether. At present, while arguments can be made for the superiority or inferiority of each modality over another, cytologic samples stand as the one middle approach that unites the most desirable attributes of both worlds. Namely, they retain the key morphologic correlates required for tumor diagnosis, while still sparing the patient from the more invasive procedures. One fact remains constant, however, which is that small samples require very high optimization of the entire process to maximize the genomic yield.\u003c/p\u003e \u003cp\u003eIn this study, we have outlined our institutional approach and longitudinal experience in comprehensive profiling of cytology samples in routine clinical care. To our knowledge, this represents the largest prospective clinical cohort reported to date, demonstrating that molecular testing can be performed on routinely procured cytology samples with high success rates, similar to surgical samples. Proportions of clinically actionable genomic alterations, specifically OncoKB Level 1\u0026ndash;3B, as well as R1 alterations, recapitulated the expected patterns across all tumor types when compared to those published in AACR GENIE cohort. For immediately actionable alterations (OncoKB Level 1), the concordance of cytology to corresponding surgical samples from the same patient were very high (93%). Notably rescue ScfDNA from supernatant CytoLyt fluid material, utilized for our internal cases, proved highly valuable and enabled the detection of a level 1 alteration in 83% of the successfully sequenced cases.\u003c/p\u003e \u003cp\u003e Our review of data compiled across 8 years, highlighted the central roles of optimized sample handling and processing. In our hands, 2 critical early steps enabled higher DNA recovery which, consequently, promoted increased utilization of cytologic material for molecular testing. The first was the optimization of cell block preparation, which incorporated pretreatment of pelleted cells with 95% ethanol before addition of HistoGel\u003csup\u003e12,13\u003c/sup\u003e. This enhanced the density of cell pellets to deliver higher amount of cellular material in fewer sections of the paraffin block. The second was the transition to mineral oil deparaffinization which markedly reduced tube transfers, centrifugation, and decanting steps, all key vulnerabilities responsible for major nucleic acid losses in the processing of scant FFPE material\u003csup\u003e14\u0026ndash;16\u003c/sup\u003e. It should be noted that, with the implementation of mineral oil extraction, requests for testing on cytologic material vs needle biopsies markedly increased at our institution. Details of this transition have been previously published by our group\u003csup\u003e16,20\u003c/sup\u003e. Notably, among lung cancer patients undergoing endobronchial ultrasound transbronchial needle aspiration (EBUS-TBNA) this change, alone, significantly improved sequencing success rates from 76.3\u0026ndash;93%. Moreover, these success rates corresponded to NGS testing that was performed after standard rapid testing for EGFR on the same samples\u003csup\u003e21\u003c/sup\u003e, further supporting the high suitability and sufficiency of the DNA recovered.\u003c/p\u003e \u003cp\u003eAn important, and often underreported, consideration in molecular testing of cytology samples are the diagnostic challenges and inaccuracies that may arise from sample cross-contamination. While sample-to-sample contamination may happen across any point, highly vulnerable points lie in processes that involve batching and pooling of multiple samples in a single run. In particular, established histopathology practices of tissue processing (i.e. carry over from microtome blades, common water baths, pooled tissue processors, etc.), pose distinct risks for contamination for small tissue samples as processes are primarily optimized to enhance microscopic diagnostic analysis but not downstream molecular applications. Common holding of numerous specimens in single chambers in automated tissue processors, the use of common equipment for embedding, cutting and tissue mounting, all increase the potential for low level cross contamination. While this may remain inconsequential for morphologic assessment or the molecular analysis of large tissue samples, this can distinctly impact small samples where similar contamination levels become proportionally higher. Cytology samples may be even more vulnerable due to processing of cell blocks with paper wrapping and HistoGels, which may promote trapping of cellular impurities from other samples. Indeed, in our analysis of cytologic samples, contamination was significantly higher across CB samples compared to all other samples. This held true when analyzing samples with adequate coverage (\u0026gt;\u0026thinsp;200x) with 4% of CB samples exhibiting contamination. ScfDNA samples by contrast, which are processed individually in a closed system and not batched, had negligible levels with contamination patterns exclusively associated with sequencing failures or borderline coverage and more likely related to artifact rather than true contamination. Despite the presence of higher contamination levels in cell blocks, the overall rate was low (4.8%) among successfully sequenced samples, which encompassed samples procured and processed across numerous laboratories across the county. These rates are in keeping with sequencing data on surgical samples published by Sehn et al\u003csup\u003e22\u003c/sup\u003e but are significantly lower to what is reported by the ASC Clinical Practice Committee/Workgroup for Cross-Contamination in a recent survey for general cytopathology practice, quoting rates as high as 56% for cell-block preparations\u003csup\u003e23\u003c/sup\u003e. This high rate may be related to the reporting of contamination per case, affecting some but not all unstained sections and which may not be high enough to be detectable in the sequencing of DNA recovered from a set of several slides. Importantly, while contamination was detectable in several cases in our cohort, most were sufficiently low in comparison to the overall tumor content of the sample, allowing informed filtering of low\u0026ndash;variant allele fraction events without compromising all mutation calling. In all, only 1.3% of the samples were failed due to contamination, while others could be reported with modification. Within the molecular laboratory, a notable source of cross-contamination may arise from index-hopping during multiplexing. This, however, is generally lower level (well below 2%) and more prone to affect higher sensitivity applications. Nonetheless, in the process of improvement for our MSK-IMPACT assay we have incorporated several strategies to mitigate this phenomenon, including optimization of PCR conditions and the implementation of dual indexing to facilitate the removal of misaligned reads. These finetuning steps facilitated our decision to reduce the assay input requirements which markedly reduced failures due to insufficient DNA. No significant changes in coverage or contamination rates were seen with this change.\u003c/p\u003e \u003cp\u003eFinally, a pivotal component of our optimization process was the implementation of testing ScfDNA recovered from liquid cytology preparations. While, generally, this sample type was not submitted if the cell block was deemed suitable for testing, it became an important rescue sample to avoid re-biopsy procedures. The use of this material also relieved some of the challenges in triaging very small biopsy samples for other ancillary studies. In all, while the success rate of the ScfDNA samples was approximately 71%, which is below what is seen across tissue biopsies and CB, these samples were specifically tested after the corresponding cytologic material was deemed unsuitable, thus boosting the overall success for the individual aspirate procedures by approximately 3%. An important observation, gathered from the comparison of ScfDNA and corresponding cell blocks, is that the VAF\u0026rsquo;s of detected alterations were similar for both preparations, supporting that the assessment of the block or cytoprep represents a suitable surrogate for estimating the proportion of tumor derived DNA that may be present in the ScfDNA sample. Additionally, given the high integrity of the DNA in these non-formalinized samples, lower DNA inputs still delivered excellent results, provided that the tumor proportion was suitable. Confirmatory testing with higher sensitivity methods may also be implemented for low tumor samples, without concerns for false positivity due to artifacts imparted by formalin fixation.\u003c/p\u003e \u003cp\u003eIn conclusion, this study confirms that the routine use of cytologic samples for molecular testing constitute a robust approach that can deliver the same results as larger biopsy samples. Process optimization and the implementation of robust quality control processes, including contamination checks are pivotal to maximizing the yield and utility of these samples. A reassessment on how tissue blocks are processed and prepared would be an important aspect of cytology practice as a whole, to include specialized instrumentation for processing small samples without risk of cross-contamination. ScfDNA recovered from supernatants is an invaluable source of tumor derived DNA which circumvents the processing where most contamination is bound to happen in current practice, and while failure rates due to limited nucleic acid recovery are higher than tissue blocks, their use could rescue the majority of cases where high tumor is identified but FFPE material is insufficient for sequencing.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eAcknowledgements\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThis research was funded in part through the NIH/NCI Cancer Center Support Grant P30 CA008748.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eEthics declarations/Competing interests\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eC.V. reports intellectual property rights and equity interest in Paige.AI, Inc.\u003c/p\u003e\n\u003cp\u003eA.R.B. has ownership/equity interests in Johnson and Johnson.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eM.B has received advisory or consulting fees from AstraZeneca, Eli Lilly and Company, and PetDx, Inc.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eM.L. has received advisory or consulting fees from Takeda Oncology, Janssen Pharmaceuticals, AstraZeneca, ADC Therapeutics, Paige.AI, Merck, Bayer, and Lilly Oncology and has received research funding from Loxo Oncology, Helsinn Therapeutics, Merus NV, Elevation Oncology, and Rain Therapeutics.\u003c/p\u003e\n\u003cp\u003eO.L. has received advisory or consulting fees from Hologic and Janssen Research \u0026amp; Development, LLC.\u003c/p\u003e\n\u003cp\u003eM.A. has received advisory or consulting fees from Axis Medical Education, Clinical Education Alliance, LLC, Merck Sharp \u0026amp; Dohme, PeerView Institute for Medical Education (PVI), Physicians\u0026rsquo; Education Resource, RMEI Medical Education, LLC, and Roche.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eThe following Authors declare no competing interests: D.K., S.Y., S.N., K.N., R.F., N.R., I.R., J.C., and A.Y.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAuthors\u0026rsquo; contributions\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eAll authors of this article declare that we qualify for authorship for this manuscript. \u0026nbsp;Each author has participated sufficiently in the work and takes public responsibility for appropriate portions of the content of this article. D.K. and M.A. carried out the background research and drafted the manuscript. D.K. and M.A. conceived the idea for the manuscript, its design, and coordination. D.K., C.V., A.Y., and S.N. generated and interpreted the data. D.K. and C.V. performed the statistical analysis for this study. D.K., C.V., S.Y., S.N., K.N., R.F., N.R., I.R., J.C., A.Y., A.R.B., M.B., M.L., O.L., and M.A. reviewed the manuscript critically for important intellectual content. All authors read and approved the final manuscript.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003eGarraway LA (2013) Genomics-driven oncology: framework for an emerging paradigm. J Clin Oncol 31:1806\u0026ndash;1814\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWakai T et al (2019) Next-generation sequencing-based clinical sequencing: toward precision medicine in solid tumors. Int J Clin Oncol 24:115\u0026ndash;122\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eTurner SR et al (2018) Feasibility of endobronchial ultrasound transbronchial needle aspiration for massively parallel next-generation sequencing in thoracic cancer patients. Lung Cancer 119:85\u0026ndash;90\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eRamani NS et al (2021) Utilization of cytology smears improves success rates of RNA-based next-generation sequencing gene fusion assays for clinically relevant predictive biomarkers. Cancer Cytopathol 129:374\u0026ndash;382\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBaum JE et al (2017) Accuracy of next-generation sequencing for the identification of clinically relevant variants in cytology smears in lung adenocarcinoma. Cancer Cytopathol 125:398\u0026ndash;406\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eRoy-Chowdhuri S et al (2018) Salvaging the supernatant: next generation cytopathology for solid tumor mutation profiling. Mod Pathol 31:1036\u0026ndash;1045\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKanagal-Shamanna R et al (2014) Next-generation sequencing-based multi-gene mutation profiling of solid tumors using fine needle aspiration samples: promises and challenges for routine clinical diagnostics. Mod Pathol 27:314\u0026ndash;327\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGan Q, Roy-Chowdhuri S (2020) Small but powerful: the promising role of small specimens for biomarker testing. J Am Soc Cytopathol 9:450\u0026ndash;460\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eVelizheva NP et al (2017) Cytology smears as excellent starting material for next-generation sequencing-based molecular testing of patients with adenocarcinoma of the lung. Cancer Cytopathol 125:30\u0026ndash;40\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAkahane T et al (2021) Direct next-generation sequencing analysis using endometrial liquid-based cytology specimens for rapid cancer genomic profiling. Diagn Cytopathol 49:1078\u0026ndash;1085\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eRoy-Chowdhuri S et al (2015) Factors affecting the success of next-generation sequencing in cytology specimens. Cancer Cytopathol 123:659\u0026ndash;668\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eRudomina D et al (2013) Improved Cell Block Preparation Method: A Protocol for Increased Cellularity which is Suitable for Immunocytochemical and Molecular Assays. J Am Soc Cytopathol 2:S81\u0026ndash;S82\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eRekhtman N et al (2018) Novel Modification of HistoGel-Based Cell Block Preparation Method: Improved Sufficiency for Molecular Studies. Arch Pathol Lab Med 142:529\u0026ndash;535\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHeikal N, Nussenzveig RH, Agarwal AM (2014) Deparaffinization with mineral oil: a simple procedure for extraction of high-quality DNA from archival formalin-fixed paraffin-embedded samples. Appl Immunohistochem Mol Morphol AIMM 22:623\u0026ndash;626\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLin J et al (2009) High-quality genomic DNA extraction from formalin-fixed and paraffin-embedded samples deparaffinized using mineral oil. Anal Biochem 395:265\u0026ndash;267\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eTian SK et al (2016) Optimizing Workflows and Processing of Cytologic Samples for Comprehensive Analysis by Next-Generation Sequencing: Memorial Sloan Kettering Cancer Center Experience. Arch Pathol Lab Med 140:1200\u0026ndash;1205\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCheng DT et al (2015) Memorial Sloan Kettering-Integrated Mutation Profiling of Actionable Cancer Targets (MSK-IMPACT): A Hybridization Capture-Based Next-Generation Sequencing Clinical Assay for Solid Tumor Molecular Oncology. J Mol Diagn 17:251\u0026ndash;264\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eChakravarty D et al (2017) OncoKB: A Precision Oncology Knowledge Base. \u003cem\u003eJCO Precis. Oncol.\u003c/em\u003e (2017)\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSuehnholz SP et al (2024) Quantifying the Expanding Landscape of Clinical Actionability for Patients with Cancer. Cancer Discov 14:49\u0026ndash;65\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eTurner SR et al (2018) Feasibility of endobronchial ultrasound transbronchial needle aspiration for massively parallel next-generation sequencing in thoracic cancer patients. Lung Cancer 119:85\u0026ndash;90\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eArcila ME et al (2020) Ultrarapid EGFR Mutation Screening Followed by Comprehensive Next-Generation Sequencing: A Feasible, Informative Approach for Lung Carcinoma Cytology Specimens With a High Success Rate. JTO Clin Res Rep 1\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSehn JK et al (2015) Occult Specimen Contamination in Routine Clinical Next-Generation Sequencing Testing. Am J Clin Pathol 144:667\u0026ndash;674\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eRoberson J et al (2022) Cross-contamination in cytology processing: a review of current practice. J Am Soc Cytopathol 11:194\u0026ndash;200\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":true,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"nature-portfolio","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"","title":"Nature Portfolio","twitterHandle":"","acdcEnabled":false,"dfaEnabled":false,"editorialSystem":"ejp","reportingPortfolio":"","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"Cell-free DNA, Supernatant Fluid, Cytology, Next-Generation Sequencing, Molecular Cytology","lastPublishedDoi":"10.21203/rs.3.rs-4746484/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-4746484/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eComprehensive molecular profiling by next generation sequencing (NGS) has revolutionized tumor classification and biomarker evaluation. However, routine implementation is challenged by the scant nature of diagnostic material obtained through minimally invasive procedures. Here, we describe our long-term experience in profiling cytology samples with an in-depth assessment of the performance, quality metrics, biomarker identification capabilities, and potential pitfalls. We highlight the impact of several optimization strategies to maximize performance with 4,871 prospectively sequenced clinical cytology samples tested by MSK-IMPACT\u0026trade;. Special emphasis is given to the use of residual supernatant cell free DNA (ScfDNA) as a valuable source of tumor DNA. Overall, cytology samples were similar in performance to surgical samples in identifying clinically relevant genomic alterations, achieving success rates up to 93% with full optimization. While cell block (CB) samples had excellent performance overall, low-level cross-contamination was identified in a small proportion of cases (4.7%), a common pitfall intrinsic to the processing of paraffin blocks, suggesting that more stringent precautions and processing modifications should be considered in quality control initiatives. By contrast ScfDNA samples had negligible contamination. Finally, ScfDNA testing exclusively used as a rescue strategy delivered successful results in 71% of cases where tumor tissue from CB was depleted.\u003c/p\u003e","manuscriptTitle":"Maximizing the clinical utility and performance of cytology samples for comprehensive genetic profiling – A report on the impact of process optimization through the analysis of 4,871 cytology samples profiled by MSK-IMPACT","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2024-07-23 12:52:51","doi":"10.21203/rs.3.rs-4746484/v1","editorialEvents":[],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"nature-communications","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"NCOMMS","sideBox":"Learn more about [Nature Communications](http://www.nature.com/ncomms/)","snPcode":"","submissionUrl":"https://mts-ncomms.nature.com/","title":"Nature Communications","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"ejp","reportingPortfolio":"Nature Communications","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"2eb2d469-710a-467d-b0d2-182354f005e4","owner":[],"postedDate":"July 23rd, 2024","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"published-in-journal","subjectAreas":[{"id":34957581,"name":"Health sciences/Oncology/Cancer/Cancer genomics"},{"id":34957582,"name":"Health sciences/Oncology/Cancer/Cancer therapy/Targeted therapies"},{"id":34957583,"name":"Health sciences/Oncology/Surgical oncology"}],"tags":[],"updatedAt":"2025-01-04T08:19:22+00:00","versionOfRecord":{"articleIdentity":"rs-4746484","link":"https://doi.org/10.1038/s41467-024-55456-8","journal":{"identity":"nature-communications","isVorOnly":false,"title":"Nature Communications"},"publishedOn":"2025-01-02 05:00:00","publishedOnDateReadable":"January 2nd, 2025"},"versionCreatedAt":"2024-07-23 12:52:51","video":"","vorDoi":"10.1038/s41467-024-55456-8","vorDoiUrl":"https://doi.org/10.1038/s41467-024-55456-8","workflowStages":[]},"version":"v1","identity":"rs-4746484","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-4746484","identity":"rs-4746484","version":["v1"]},"buildId":"qtupq5eGEP_6zYnWcrvyt","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2024) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00