Higher expression of HPV16 derived E7_LI Transcript Observed in Men with HIV and Recurrent Anal Cancer

preprint OA: closed
Full text JSON View at publisher
Full text 49,189 characters · extracted from preprint-html · click to expand
Higher expression of HPV16 derived E7_LI Transcript Observed in Men with HIV and Recurrent Anal Cancer | Authorea try { document.documentElement.classList.add('js'); } catch (e) { } var _gaq = _gaq || []; _gaq.push(['_setAccount', 'G-8VDV14Y67G']); _gaq.push(['_trackPageview']); (function() { var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true; ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + '.google-analytics.com/ga.js'; var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s); })(); Skip to main content Preprints Collections Wiley Open Research IET Open Research Ecological Society of Japan All Collections About About Authorea FAQs Contact Us Quick Search anywhere Search for preprint articles, keywords, etc. Search Search ADVANCED SEARCH SCROLL Journal of Medical Virology This is a preprint and has not been peer reviewed. Data may be preliminary. 21 January 2025 V1 Latest version Share on Higher expression of HPV16 derived E7_LI Transcript Observed in Men with HIV and Recurrent Anal Cancer Authors : Kevin J. Maroney 0000-0002-6758-5256 , Yuanfan Ye , Staci Sudenga L , Sameer Al Diffalha , N. Sanjib Banerjee , Sadeep Shrestha , and Anju Bansal [email protected] Authors Info & Affiliations https://doi.org/10.22541/au.173744203.33460262/v1 Published Journal of Medical Virology Version of record Peer review timeline 310 views 199 downloads Contents Abstract Supplementary Material Information & Authors Metrics & Citations View Options References Figures Tables Media Share Abstract Squamous cell carcinoma of the anus (SCCA) or anal cancer (AC) is an understudied cancer with a high occurrence rate in people with HIV (PWH), especially men having sex with men (MSM). Furthermore, AC recurs in approximately one fourth of patients who undergo standard care with chemoradiation therapy (CRT). Using bulk RNA sequencing data of AC obtained from 12 patients with non-recurrent (NR, N=9) or recurrent (R, N=3) cancer, we previously showed upregulated expression of key immune genes in the NR compared to the R group. Although the main causative agent of anal cancer is high-risk human papillomavirus (HPV), association of host and viral RNA transcript expression contributing to AC recurrence has not been extensively studied. The objective of the current study was to determine whether enrichment of specific HPV genotypes and/or HPV gene expression patterns differentiate the two groups and if any specific viral (HPV) and host (human) immune mediators correlate with each other. Using bulk RNA sequencing data and VIRTUS 2, we detected viral RNA reads mapping to 7 high-risk and 6 low-risk HPV types of which the high-risk HPV16 observed in 83% (10/12) AC tumors (7/9 non-recurrent and 3/3 recurrent). Rate of all HPV genomes trended towards a decrease in non-recurrent anal cancer isolates and correlation between HPV types was more commonly observed in low-risk ones. Analysis of HPV 16 gene expression profile showed a significantly lower positivity rate for a polycistronic transcript encoding for E7^L1 in the non-recurrent group (1/9, NR versus 3/3, R, p value <0.05). An unbiased correlation analysis of HPV-human transcript expression showed a direct correlation between HPV transcripts and human genes involved in cell growth. The data also identified human transcripts showing an inverse correlation with HPV gene expression. These included genes involved in negative regulation of growth, proliferation and immune response. Taken together, these data indicate that concurrent analyses of viral and host factors in the same tumor can identify potential new therapeutic targets to ameliorate cancer recurrence post treatment. INTRODUCTION: Due to antiretroviral treatment (ART), although people with HIV (PWH) are living longer, they also have an increased risk of developing multi-comorbidities including cancer as chronic immune activation and inflammation persists in PWH despite suppressive ART 1 . Squamous cell carcinoma of the anus (SCCA) or anal cancer (AC) is a relatively rare cancer in the general population 2 . However, it has a significantly higher occurrence rate in PWH, especially among men who have sex with men (MSM) 2,3 and is one of the most common non-AIDS defining malignancy among PWH in the US 4 . Although chemoradiation therapy (CRT) is the standard of care for AC, five-year survival rate is 76% and recurrence of the cancer in the same anatomical site, within five years, can be seen in ~36% of CRT treated patients 5 . Human Papilloma Virus (HPV) is the leading causative agent of anal cancer, with approximately 90% of all cases occurring in those with detectable HPV, and with HPV16 being the most prevalent of type associated with this cancer, according to the National Cancer Institute (NCI) 6 . Human papillomaviruses (HPV) are small non-enveloped DNA viruses, and its infection includes two general outcomes: (a) rapid completion of its productive life cycle, manifested as painful but benign papilloma’s, or (b) an unproductive phase as a subclinical infection for many years. In most patients the infected cells, containing the unproductive viral DNA, are eventually cleared by the host immune system. But in a subset of patients infected with high-risk HPV (such as HPV16), especially under immunocompromised conditions, expression of viral oncogenes (E6 and E7), induce dysplasia which often develop into cancers 7 . Specifically, in terms of life cycle, HPVs naturally infect actively dividing or transit amplifying (TA) cells of the basal or parabasal layers of the squamous or columnar epithelium through endocytic vesicle internalization, and eventual passage into the nucleus 8–11 . The transcription of viral genes, their translation and replication of viral DNA are tightly regulated by epithelial differentiation of the infected cells 12–14 . HPV encoded E6 and E7 proteins which bind to and inactivate or degrade master regulators of cell cycle, p53 and pRb pocket proteins, which induce S-phase protein expression and initiate host DNA replication in postmitotic host cells 16 . E7 also induces a prolonged G2 phase 17 , which facilitates rapid multiplications of HPV DNA assisted by viral E1 and E2 proteins, in the upper differentiated strata of the epithelium 18 , expression of capsid proteins L1 and L2 and progeny virus packaging in the nuclei of cornified cells. Eventual liberation of the virus particles from the exfoliated infected cells initiates new round of viral lifecycle. However, in some individuals, under immunocompromised conditions, a part of the viral DNA of high-risk HPV types, containing E6 and E7 coding sequence but interrupted in the E1 region, integrate into host DNA as tandem copies or singly which abolishes virus lifecycle and induce neoplastic progression, 19–21 , 23–30 . Our prior work using bulk RNA sequencing data of AC obtained from 12 patients with either non-recurrent (NR, N=9) or recurrent (R, N=3) cancer, showed upregulation of key immune gene expression related signatures in the NR compared to the R group 31 . It is well known that about 90% of anal cancer is caused by high-risk HPV, in most cases HPV16, 18 or occasionally HPV33 32 , but the role of other HPV types in AC occurrence and recurrence is unknown, specifically among PWH; thus, evaluating this could be important for diagnostic and/or prognostic AC biomarker discovery. The clinical detection of HPV commonly uses hybrid capture or PCR based assays 33–35 . However, determining viral genotypes and their gene expression patterns from bulk RNA sequencing has not been feasible until recently due to paucity of appropriate computational tools as viral reads, typically comprise less than 0.01% of total mRNA in any given human sample. Such analyses are now feasible with the newer next generation sequencing based methods which provide high sequencing depth and availability of viral analysis enabling computational tools such as Viral Transcript Usage Sensor version2 (VIRTUS 2) as used in this study 36 . In summary, although previous work on many HPV-associated cancers including anal cancer have examined the host gene expression or HPV types by using bulk RNA sequencing and PCR based assays, respectively, prior studies do not typically examine in tandem a) both the host transcriptome and HPV virome of AC in the same sample by using the same bulk RNA sequencing data; and b) determine associations between specific host and viral genes. In the current study, we used VIRTUS2 to determine the HPV virome in AC samples with an overall study objective to determine whether enrichment of specific HPV genotypes and/or HPV gene expression patterns differentiate the recurrent and non-recurrent groups to identify potentially novel biomarker(s) of AC recurrence. Additionally, we sought to characterize the correlation landscape of interactions between HPV16 genotype and human transcripts gene expression patterns to identify gene signatures associated with AC treatment outcome. RESULTS: Formalin fixed paraffin embedded (FFPE) tissues from those with non-recurrent (NR) and recurrent (R) anal cancer were utilized for this work and subjected to paired-end bulk RNA-sequencing, as previously described 31 . VIRTUS2 was used to map these reads to a human reference first before mapping those reads (which did not match any human source) to a reference containing the complete genome sequences of all HPV genotypes to identify the different HPV types present in these samples. We also examined if we could detect specific spliced transcripts of HPV16 that differed between the R and NR groups. A graphical overview of the viral analysis approach used is shown in Supplementary Figure 1. Higher number of HPV genotypes (both low and high-risk) are observed in non-recurrent anal cancer isolates. All HPV complete genome references were originally sourced and are publicly available through the Papillomavirus Episteme (PaVE) or NCBI (Supplementary Table 1) . The rate of viral reads to human reads was low even at the highest rate in these samples (max rate v/h (rate viral/human) reads of 3X 10 -4 ). Despite this, in our cohort, the majority of samples demonstrated coverage of viral reads mapping to high-risk HPV16 in addition to a diverse range of other high and low-risk HPV types ( Figure 1) . Due to the small sample size, there was no single HPV type which was found to change significantly between the 2 groups, however, as expected, HPV16 was observed in most samples (10/12, 83%). While not statistically significant, the number of HPV types observed in NR isolates appeared to be higher (Median # positive HPV types 3 in NR, 2 in R) than in R isolates (Supplementary Figure 2A) and average rate mapped reads (v/h) for all samples appeared to be higher in R isolates (Supplementary Figure 2B) . The latter increase was driven most notably by HPV16, which increased in samples from the NR group (median average rate was 5.52x10 -5 ) to those which eventually progressed to recurrent AC (median average rate 8.26x10 -5 ); Figure 2A, p-value=0.07. Lastly, we determined whether the presence of certain HPV types correlate with the presence of others (Figure 2B) . This correlation was more commonly observed in low-risk types (HPV 81 and 42, 81 and 114, 81 and 74), while in the high-risk group this was only seen for HPV 35 and 58. It should also be noted that each of the non-HPV16 and -HPV18 types (the high-risk types most commonly observed in carcinomas anal cancer) were only observed in a single sample. HPV16 E7-L1 polycistronic transcript is significantly enriched in recurrent cancer isolates. We next examined polycistronic spliced transcripts identified using an HPV16 transcriptome map reported in a prior publication 37 . We searched for sequences covering the spliced junctions to infer prevalence of spliced transcripts in this cohort using a pipeline adapted from VIRTUS2 36 . Transcript positions were initially adapted from the paper by Yu et al 37 , then reference sequences were constructed from these positional coordinates as spliced polycistronic full-length transcripts. We detected the presence of most of the HPV16 spliced transcripts in at least one sample, though their magnitude (v/h reads) was low. Only the splice signature of transcript number 20 (nucleotides 562-858, 5639-7156 ) as identified in HPV16 transcription map was detected in all samples from the recurrent group. Moreover, the transcription rate of transcript 20, encoding for “HPV16 E7^L1” ORFs, was significantly higher (by Mann-Whitney U-test) in recurrent group (3/3) but detected at only 1/9 in non-recurrent group (p value (nucleotides 104-226, 409-2814), a spliced polycistronic transcript “HPV16 E6*I^E7^L1”, encoding, in order, for E6*, E7, and E1 was detected in both recurrent and non-recurrent samples, with a trend for higher expression (p-value 0.09) in the former (Figure 3C) . HPV16 (v/h rate) inversely correlated with host HLA-A and directly with DEAD box protein DDX24 expression in AC. Previously 31 , we showed upregulation of key immune markers (encoding for HLA and dead box proteins) in the NR group. Because HPV16 was most ubiquitously identified in 83% of all samples analyzed, we wanted to determine whether HPV16 could also be interacting with the human transcriptome in a way that would be associated with non-recurrence. We therefore examined whether the rate of HPV16 reads (using the whole HPV genome as reference, referred to as “HPV16 genome” later in this manuscript) correlates with the expression of these human transcripts. Among differentially expressed HLA transcripts, HLA-A was found to inversely correlate with HPV16 rate, though this correlation was not significant (p = 0.2, r = -0.4), ( Figure 4A ). However, DDX24 , a DEAD Box protein family subunit, directly and significantly correlated with the rate of HPV16 (p < 0.05, r = 0.6) ( Figure 4B ). These data suggest markers of HPV-associated AC, irrespective of treatment outcome. This targeted analysis, although informative, was biased and did not provide the breadth of coverage to capture all genes directly or indirectly correlated with HPV16 genome or transcript rate. Therefore, we next performed an unbiased bioinformatic approach to capture the correlation landscape of human genes whose expression increased with or decreased with HPV16 infection. Unbiased high-throughput analysis determined that HPV16 and HPV16 transcripts correlates with human gene ontology (GO) signatures of cell growth. To capture the full unbiased correlation landscape of all human genes which may potentially correlate directly or inversely with either a) HPV16 rate (Figure 5A,B) or b) individual polycistronic ORF-encoding transcripts (Figure 5C-F) , we developed a bioinformatic pipeline to determine the correlation between the rate of detection of HPV16 genome (full length HPV16 sequence used as reference) or individual HPV-16 polycistronic transcripts (v/h rate) relative to expression (Reads Per Kilobase per Million mapped or RPKM) of human genes across all AC samples. Examining at the HPV16 genome level, we found several genes, both studied (official HUGO Gene Nomenclature Committee (HGNC) symbol) and unstudied (only Ensembl ID such as ENSG00000259694, usually encoding for lncRNA or pseudogenes) that were differentially regulated. Interestingly, a higher number of genes showed a direct correlation (N=1502) as compared to those showing an inverse correlation (N=158) with HPV16 genome rate. Of the genes whose HGNC symbol was recognized and able to be enriched for gene ontology signatures, directly correlated genes were found to be primarily those associated with cell growth with signatures of “Metabolism of RNA,” “DNA repair,” “Cell division,” Translation, and “Protein catabolic process” ( Figure 5A) and this human transcript expression pattern is consistent with HPV infection induced upregulation of cell cycle and proliferation pathways in cancer. Examining at the HPV16 transcript level, we found that the majority of human genes directly correlating with the various HPV16 transcripts also enriched to GO terms also associated with cell growth such as “Signaling by Rho GTPases”, “Metabolism of RNA”, “DNA metabolic process”, or “Cell cycle” ( Figure 5C ). The GO term with the highest number of directly (Figure 5C) or indirectly ( Figure 5D) correlated human genes for the numbered HPV transcript shown (T1 is HPV16 Transcript 1, T2 is Transcript 2, and so on). The most directly (Figure 5E) or indirectly (Figure 5F) correlated human gene with each HPV16 transcript within the highest human gene number GO term are also shown as examples. Interestingly, not only is PRIMPOL the most directly correlated gene within the highest gene number GO term for HPV16 T1 (correlation = 0.82, p = 1.05E-03). It is also the most directly correlated human gene with both HPV16 T1 and HPV16 genome rate overall (correlation = 0.89, p = 1.17E-04). The most directly correlated human gene with HPV16 T20 which was found to be significantly enriched in recurrent AC was also MIR212 as indicated (correlation = 0.99, p = 4.82E-10). Lastly, although transcripts from the HIV-1 genome itself were below the limit of detection, we investigated the “HIV-1 Infection” GO term which was enriched from positively correlated genes for every HPV16 transcript and the HPV16 genome (full length HPV16 sequence used as reference) itself. Almost every HPV16 polycistronic transcript as well as the HPV16 genome itself had several genes which positively correlated with it enriching to this GO term. Those human genes enriching to the term “HIV-1 Infection” positively correlating with HPV16 whole genome transcription rates are shown in Supplemental Figure 3 . DISCUSSION In this study, using VIRTUS 2, we showed that HPV genotypes and their transcripts can be readily detected using bulk RNA sequencing data allowing for directly assessing, in the same cancer isolate, any associations between host and viral factors. The latter analyses show that it is possible to identify host factors whose expression is altered (upregulated/downregulated) by viral transcripts within the same anal cancer isolate. To date, it has been difficult to detect viral reads (which typically comprise less than 0.01% of total mRNA in any given human sample), in data obtained from bulk RNA sequencing. In recent years, several open-source computational packages have been released allowing for examining the virome in data obtained from bulk RNA sequencing. The most common strategy has been to align reads from an entire bulk RNA-sequencing sample to a given whole genome reference for a single virus, then count them. However, the problems with this approach were twofold. One is that many viral open reading frames (ORFs) may have the potential to overlap with regions of the human transcriptome/genome and so disregarding this fact will result in overestimation of viral sequences. Second, most bulk RNA-sequencing datasets are not sequenced to enough depth to accommodate the sensitivity required to detect viral transcript reads which are in comparison extremely low compared to any host transcripts. However, the tool we used, VIRTUS, alleviates the first issue by first aligning reads to the human genome, then taking any unmapped reads and aligning them to any viral reference it is given 36,38 . The references used in VIRTUS2 contain more than 700 unique complete virus genome sequences including different HPV types to provide a complete picture of the “virome” in any given sample. Additionally, a novel feature of this package is the determination of a “rate” constant, which is used to normalize any mapped viral reads to the total number of mapped human reads in each sample (rate v/h or rate viral/human). Both strategies ensure that an accurate measure of viral reads relative to human reads is ascertained. Detection of HPV16 as the predominant genotype in our study correlates well with what is reported in literature for other HPV+ squamous cellular carcinomas and determined by PCR based assays, thus indicating the feasibility of using bulk RNA sequencing for determining viral features 32,39–41 . Furthermore, our approach is also sensitive to detect low frequency polycistronic transcripts such as HPV16 Transcript 20 encoding for E7^L1, which can perhaps be used as predictive biomarker of anal cancer recurrence although our study is limited by small sample size and future studies will address whether this HPV transcript is unique to anal cancer recurrence or other HPV associated human cancers observed in PWH and/or non-PWH. Also, based on this study design, while these results seem predictive of recurrence, no isolates from the actual recurrent cancer were procured. It will be important in future studies to compare and determine whether the HPV types observed and their transcript expression patterns between a primary and its recurrent AC isolate are the same or differ. It should be noted that in HPV+ tumors HPV DNA may exist in three forms: episomal, integrated into host chromatin, often near an actively transcribing gene, characterized with distinct virus-host junction or episomes of hybrid HPV-host DNA or their combinations 42–44 . In all HPV integration events, HPV-DNA is truncated in E1 and/or E2 sequence and typically lack other downstream sequences such as L1, and L2 40,46,47 . In this context the detection of transcript 20 in AC is unexpected. Presumably, the transcript 20 (E7^L1) counts observed were indicative of new productive infection in some non-tumor cells present in the same tissue along with the tumors. Transcript 1 (14-226^409-4237), the main source of E7 protein is typically most abundant in all high-risk HPV infected cells (including HPV+ cancers). In contrast, the minor transcript 20 is considered to express L1 protein only in productive infection. From this data we infer that concurrent productive infection most likely occurred in AC isolates from patients who showed cancer recurrence, suggesting re-infection perhaps reinvigorates a tumor promoting environment in these specimens. More extensive testing would however be required to confirm this hypothesis. Our findings that HPV 16 showed a trend for an inverse correlation with HLA-A expression suggests that HPV16 plays at least some role in the downregulation of HLA-class I expression in AC to facilitate immune evasion from CD8+ T cell-mediated cytotoxicity 49,50 . DEAD box proteins represent a family of RNA helicases with a pro or anti-viral roles 51 . A direct correlation was observed between DDX24 and HPV16 genome rate in AC. Elevated DDX24 is significant for its role in negatively regulating RIG-1 mediated innate immune response against cytosolic VSV RNA 52 , DNA repair and cell cycle progression in vascular smooth muscle cells 53 . and HIV replication 51,54,55 .and its expression is found elevated in other cancers such as breast and gastric cancer 54,56 where it was shown to control p53 activity and increased expression was associated with worse survival 54,54,56 . Thus, further study on DDX24 expression is needed to unravel its potential role in HPV+ AC. Our unbiased correlation analysis between HPV and host transcripts showed that a majority of human genes positively correlated with HPV16 genome rate enriched to GO terms associated with cell growth, proliferation, or catabolic processes as expected for an oncogenic virus 57,58 . While the HIV genome itself through VIRTUS2 was below the limit of detection for all samples (based on transcripts) within the isolates, 22 genes mapping to the “HIV Infection” term and therefore previously observed to be upregulated in active HIV infected samples were directly correlated with HPV16 genome rate, suggesting that perhaps HPV16, present in the cancer, is promoting an HIV-permissive environment. For example, one of the genes identified in this study, BANF1, also known as BAF1, the host factor most positively correlated with HPV16 Genome rate in the “HIV Infection” GO term in this analysis, has previously been shown to be exploited by HIV to restore the activity of pre-integration complexes (PIC’s) and prevent their auto-integration into themselves, thereby improving efficiency of integration into the host chromosome 59–61 . One of our study limitations is a small sample size for the recurrent cancer group and thus future studies involving larger sample sets would be required to validate our study findings. However, this does not affect the correlation analyses as these were not based on group comparisons. Another caveat is that with bulk RNA-sequencing data, a direct assessment between host and viral factors cannot be performed at a single cell level. Nevertheless, single cell-based assays, although more informative, are expensive and require more material to perform and thus bulk sequencing is a common choice due to it lower costs and represents the vast repertoire of data available in many database repositories. Thus, the ability to examine both host and viral features in the same bulk RNA sequencing based data set will still yield more relevant information. In summary, we have developed a unique approach to simultaneously analyze correlation between host and HPV gene transcription using the same bulk RNA sequencing dataset from AC specimens. Future studies of spatial transcriptomics on FFPE cancer tissue will allow us to determine whether HLA-A downregulation occurs in cells that are HPV infected and what impact this has on the proximity and function of tumor infiltrating lymphocytes. According to studies published on other SCC tumors using spatial transcriptomics, HPV diversity and heterogeneity exists between different cell type compartments and so this may be a more accurate representation of the AC landscape 62 . Additionally, follow-up studies will involve a larger, well characterized cohort with longitudinal timepoints post initial CRT will confirm whether the candidate transcripts (host and viral) identified in this study are continually expressed and associated with recurrence of AC. Cohorts and Study Design The study cohort is described in detail in our prior work 31 . In brief, 12 AC patients (3 recurrent and 9 non-recurrent) from the UAB O’Neal Comprehensive Cancer Center (UAB-CCC) with electronic health records (EHR) from the University at Alabama Birmingham (UAB) HIV Clinic were reviewed by a licensed oncologist and confirmed by a pathologist. This study was approved by the UAB IRB Review Board. These archived formalin-fixed and paraffin-embedded (FFPE) primary tissues from individuals with confirmed AC diagnoses in the EHR were then requested from the UAB Tissue Biorepository and associated with the requisite metadata. For both recurrent and non-recurrent, we used the primary tumor sample prior to the patient receiving chemoradiation therapy. Non-recurrence is defined as the absence of disease at the site of the primary tumor and regional lymph nodes within 6 months from the end of chemoradiation therapy. Local recurrence (LR) is defined as persistent disease or recurrence at the site of the primary tumor. Each patient’s EHR was retrospectively reviewed up to 5 years after the last session of chemoradiation therapy to determine if LR occurred. RNA extraction, library preparation and RNA-sequencing RNA was purified from FFPE tissues with the Quick-DNA/RNA FFPE kit (R1009, Zymo Research). The concentration of RNA was assessed with a Nanodrop spectrophotometer. Libraries were prepared as previously described 31 . A 20 uL volume of at least 400 ng RNA were used for library creation. Libraries were generated with the SMARTer Stranded Total RNA-Seq Kit v2 – Pico Input Mammalian (Takara Bio USA). The pooled libraries were sequenced on a NovaSeq 6000 (Illumina) to a depth of > 40 million paired-end 150 bp reads for each sample. Phred quality score for all samples was good, with a mean of > Q30. Bioinformatics Analysis VIRTUS2: Initial pre-processing and analysis of FASTQ sequences for human mRNA gene expression was performed as previously described 31 . However, VIRTUS2 was used for whole genome splicing-aware HPV type alignment and counting 36,38 . The most recent human genome from the human genome project, GRCh38 was downloaded alongside the concatenated list of all HPV type genome sequences included within the VIRTUS2 repository. These viral sequences were acquired from the Virtect repository but originate from NCBI and PaVE. These references were first indexed with the createindex.cwl script and STAR to create an indexed human genome reference as well as “virome” genome reference containing whole genome sequences from >700 viruses 63 . Donor library fastq files were trimmed to minimum sequence length of 75 bp and a Phred score cutoff of 30 using Trim Galore as previously described 31,64 . A targets file defined sequences as belonging to either recurrent or non-recurrent groups. Singularity was used in place of Docker because of the lack of root privileges on the UAB Cheaha high performance computing (HPC) cluster. This was used alongside the Spliced Transcripts Alignment to a Reference (STAR) human and viral genome reference as well as the VIRTUS_wrapper.py python wrapper script to count viral genome reads as described in the VIRTUS2 repository. VIRTUS2 first filtered polyA/T sequences and mapped fastq sequences to human genome reference through STAR. All unmapped reads were then mapped to the indexed “virome” genome reference. Both coverage and rate viral/human (v/h) % reads were returned for all viral genome sequences contained in the reference fasta file. HPV type results above 0 rate v/h % and Coverage were then represented as a bubbleplot generated through R so as not to miss low frequency HPV types. “Rate v/h %” returns the viral reads for a given genome or transcript divided by the reads mapping to the human reference. For specific HPV16 ORF’s or transcripts, additional fasta references were generated separately and then indexed with the same workflow. Virtus2 no longer supports gene-level quantification and recommends using the original Virtus tool for this purpose. However, the original Virtus does not have a wrapper which performs analysis in a high-throughput manner, so the same wrapper script used for genome quantification in VIRTUS2 was also used for ORF- and transcript-level quantification, simply replacing the “virome” reference full of individual HPV type whole genome sequences with individual PaVE ORF gene or transcript sequences. Additional analyses and graphical representations were also performed using a combination of PRISM or R 65,66 . HPV-type specific ORF sequences obtained through the Papillomavirus Episteme (PaVE) database for the HPV types previously identified as having detectable genome rates were used to create a targeted reference in place of the whole genome “virome” reference used previously. To ensure specificity and reproducibility, the whole genome sequences for these types were also included. All samples positive for an indicated HPV type ORF were positive for only the whole genome reference sequences of only those HPV types to which the ORF mapped (data not shown). The final analysis included all currently identified specific HPV16 polycistronic transcripts as described through PaVE 37 . Correlation Analysis: Both human gene RPKM (HGNC symbols or Ensembl ID if HGNC symbol did not exist for a given reference transcript) and viral rate (v/h) for all HPV full genome or HPV16 individual transcripts were included in one Excel file. R was then used to compare every human gene RPKM value to every viral rate through a parallel processing script across all analysis pairs. For every pair, a corresponding correlation and p value of correlation was output, and those which were significant (p < 0.05) were also output into a separate sheet. GO analysis was performed through Metascape either as single analysis for direct or inversely correlated with HPV16 genome only, or using the “Multi-Gene List” feature for all directly or inversely correlated transcript genes. Further generation of the Pyramid plot, radar charts, individual correlation plots, or panel correlation plots was performed through Python. Individual plots available on request. Acknowledgements We thank all study participants from the UAB 1917 Clinic. We also thank the UAB Research and Informatics Service Center (RISC) for data access. Funding Information This research was supported by the Tissue Procurement Shared Resource of the O’Neal Comprehensive Cancer Center (P30CA013148)/UAB-Tissue Biorepository (UAB-TBR). This work was also supported by the Quetelet Endowed Professorship Research Fund (SS). The UAB Centers for AIDS Research sup­ported YY for the anal cancer research (the 2018 World Aids Day Poster Award). SLS is supported by the National Cancer Institute (K07 CA225404). Data Availability Statement: Upon acceptance, all raw fasta files will be made available through a data repository service such as Gene Expression Omnibus (GEO) or Zenodo according to editor preference. REFERENCES 1. Castilho, J. L. et al. CD4/CD8 Ratio and Cancer Risk Among Adults With HIV. J. Natl. Cancer Inst. 114 , 854–862 (2022). 2. Clifford, G. M. et al. A meta-analysis of anal cancer incidence by risk group: Toward a unified anal cancer risk scale. Int. J. Cancer 148 , 38–47 (2021). 3. Deshmukh, A. A. et al. Recent and projected incidence trends and risk of anal cancer among people with HIV in North america. J. Natl. Cancer Inst. djae096 (2024) doi:10.1093/jnci/djae096. 4. Rihana, N. et al. Malignancy Trends in HIV-Infected Patients Over the Past 10 Years in a Single-Center Retrospective Observational Study in the United States. Cancer Control J. Moffitt Cancer Cent. 25 , 1073274818797955 (2018). 5. Faynsod, M. et al. Patterns of recurrence in anal canal carcinoma. Arch. Surg. Chic. Ill 1960 135 , 1090–1093; discussion 1094-1095 (2000). 6. Anal Cancer Prevention (PDQ®) - NCI. https://www.cancer.gov/types/anal/hp/anal-prevention-pdq (2014). 7. Xue, J., Vesper, B. J. & Radosevich, J. A. The Life Cycle of Human Papillomavirus. in HPV and Cancer (ed. Radosevich, J. A.) 49–74 (Springer Netherlands, Dordrecht, 2012). doi:10.1007/978-94-007-5437-9_3. 8. Sapp, M. & Bienkowska-Haba, M. Viral entry mechanisms: human papillomavirus and a long journey from extracellular matrix to the nucleus. FEBS J. 276 , 7206–7216 (2009). 9. Smith, J. L., Campos, S. K. & Ozbun, M. A. Human papillomavirus type 31 uses a caveolin 1- and dynamin 2-mediated entry pathway for infection of human keratinocytes. J. Virol. 81 , 9922–9931 (2007). 10. Selinka, H.-C., Giroglou, T. & Sapp, M. Analysis of the infectious entry pathway of human papillomavirus type 33 pseudovirions. Virology 299 , 279–287 (2002). 11. Day, P. M., Lowy, D. R. & Schiller, J. T. Papillomaviruses infect cells via a clathrin-dependent pathway. Virology 307 , 1–11 (2003). 12. Stoler, M. H., Wolinsky, S. M., Whitbeck, A., Broker, T. R. & Chow, L. T. Differentiation-linked human papillomavirus types 6 and 11 transcription in genital condylomata revealed by in situ hybridization with message-specific RNA probes. Virology 172 , 331–340 (1989). 13. Stoler, M. H. & Broker, T. R. In situ hybridization detection of human papillomavirus DNAs and messenger RNAs in genital condylomas and a cervical carcinoma. Hum. Pathol. 17 , 1250–1258 (1986). 14. Doorbar, J. et al. The biology and life-cycle of human papillomaviruses. Vaccine 30 Suppl 5 , F55-70 (2012). 15. Wilson, V. G., West, M., Woytek, K. & Rangasamy, D. Papillomavirus E1 proteins: form, function, and features. Virus Genes 24 , 275–290 (2002). 16. Banerjee, N. S. et al. Conditionally activated E7 proteins of high-risk and low-risk human papillomaviruses induce S phase in postmitotic, differentiated human keratinocytes. J. Virol. 80 , 6517–6524 (2006). 17. Banerjee, N. S., Wang, H.-K., Broker, T. R. & Chow, L. T. Human papillomavirus (HPV) E7 induces prolonged G2 following S phase reentry in differentiated human keratinocytes. J. Biol. Chem. 286 , 15473–15482 (2011). 18. Wang, H.-K., Duffy, A. A., Broker, T. R. & Chow, L. T. Robust production and passaging of infectious HPV in squamous epithelium of primary human keratinocytes. Genes Dev. 23 , 181–194 (2009). 19. Choo, K. B., Pan, C. C. & Han, S. H. Integration of human papillomavirus type 16 into cellular DNA of cervical carcinoma: preferential deletion of the E2 gene and invariable retention of the long control region and the E6/E7 open reading frames. Virology 161 , 259–261 (1987). 20. Cricca, M. et al. Disruption of HPV 16 E1 and E2 genes in precancerous cervical lesions. J. Virol. Methods 158 , 180–183 (2009). 21. Kalantari, M. et al. Disruption of the E1 and E2 reading frames of HPV 16 in cervical carcinoma is associated with poor prognosis. Int. J. Gynecol. Pathol. Off. J. Int. Soc. Gynecol. Pathol. 17 , 146–153 (1998). 22. Pett, M. & Coleman, N. Integration of high-risk human papillomavirus: a key event in cervical carcinogenesis? J. Pathol. 212 , 356–367 (2007). 23. Schwarz, E. et al. Structure and transcription of human papillomavirus sequences in cervical carcinoma cells. Nature 314 , 111–114 (1985). 24. Luft, F. et al. Detection of integrated papillomavirus sequences by ligation-mediated PCR (DIPS-PCR) and molecular characterization in cervical cancer cells. Int. J. Cancer 92 , 9–17 (2001). 25. Rösl, F. et al. Extinction of the HPV18 upstream regulatory region in cervical carcinoma cells after fusion with non-tumorigenic human keratinocytes under non-selective conditions. EMBO J. 10 , 1337–1345 (1991). 26. Stoler, M. H. et al. Human papillomavirus type 16 and 18 gene expression in cervical neoplasias. Hum. Pathol. 23 , 117–128 (1992). 27. Bernard, B. A. et al. The human papillomavirus type 18 (HPV18) E2 gene product is a repressor of the HPV18 regulatory region in human keratinocytes. J. Virol. 63 , 4317–4324 (1989). 28. Romanczuk, H., Thierry, F. & Howley, P. M. Mutational analysis of cis elements involved in E2 modulation of human papillomavirus type 16 P97 and type 18 P105 promoters. J. Virol. 64 , 2849–2859 (1990). 29. Dowhanick, J. J., McBride, A. A. & Howley, P. M. Suppression of cellular proliferation by the papillomavirus E2 protein. J. Virol. 69 , 7791–7799 (1995). 30. Francis, D. A., Schmid, S. I. & Howley, P. M. Repression of the integrated papillomavirus E6/E7 promoter is required for growth suppression of cervical cancer cells. J. Virol. 74 , 2679–2686 (2000). 31. Ye, Y. et al. RNA-seq analysis identifies transcriptomic profiles associated with anal cancer recurrence among people living with HIV. Ann. Med. 55 , 2199366 (2023). 32. de Martel, C., Plummer, M., Vignat, J. & Franceschi, S. Worldwide burden of cancer attributable to HPV by site, country and HPV type. Int. J. Cancer 141 , 664–670 (2017). 33. Katanga, J. et al. Agreement between careHPV and hybrid capture 2 in detecting high-risk HPV in women in Tanzania. Acta Obstet. Gynecol. Scand. 100 , 786–793 (2021). 34. Eleutério, J., Barros, I. C., Cavalcante, D. I. M., Eleutério, R. M. N. & Giraldo, P. C. HPV-DNA hybrid capture test: influence of cellularity in penile samples. Acta Cytol. 54 , 546–550 (2010). 35. Kuroki, H., Sakamoto, J., Shibata, T., Takakura, M. & Sasagawa, T. Comparison of Aptima and hybrid capture-2 HPV tests and Pap test in the referral population in Japan. J. Med. Virol. 93 , 5076–5083 (2021). 36. Yasumizu, Y., Hara, A., Sakaguchi, S. & Ohkura, N. VIRTUS: a pipeline for comprehensive virus analysis from conventional RNA-seq data. Bioinforma. Oxf. Engl. 37 , 1465–1467 (2021). 37. Yu, L., Majerciak, V. & Zheng, Z.-M. HPV16 and HPV18 Genome Structure, Expression, and Post-Transcriptional Regulation. Int. J. Mol. Sci. 23 , 4943 (2022). 38. Yasumizu, Y. VIRTUS : VIRal Transcript Usage Sensor v1.2.1. (2023). 39. Le, T. M. et al. Association of human papillomavirus 16 and 18 with ovarian cancer risk: Insights from a meta‑analysis. Oncol. Lett. 28 , 556 (2024). 40. Zhang, R. et al. Rapid detection of HPV16 utilizing recombinase polymerase amplification with the employment of an extremely low concentration of the probe. Anal. Methods Adv. Methods Appl. (2024) doi:10.1039/d4ay01625d. 41. Tung, H.-J. et al. Human papillomavirus prevalence, genotype distribution, and prognostic factors of vaginal cancer. Int. J. Cancer 155 , 1996–2008 (2024). 42. Holmes, A. et al. Mechanistic signatures of HPV insertions in cervical carcinomas. NPJ Genomic Med. 1 , 16004 (2016). 43. Morel, A. et al. Mechanistic Signatures of Human Papillomavirus Insertions in Anal Squamous Cell Carcinomas. Cancers 11 , 1846 (2019). 44. Mainguené, J. et al. Human papilloma virus integration sites and genomic signatures in head and neck squamous cell carcinoma. Mol. Oncol. 16 , 3001–3016 (2022). 45. Cancer Genome Atlas Research Network et al. Integrated genomic and molecular characterization of cervical cancer. Nature 543 , 378–384 (2017). 46. Yu, L. et al. HPV oncogenes expressed from only one of multiple integrated HPV DNA copies drive clonal cell expansion in cervical cancer. mBio 15 , e0072924 (2024). 47. Boada, E. A., Cuschieri, K., Graham, C., Moncur, S. & Bhatia, R. Agreement between L1 and E6/E7-based assays for detection of high-risk HPV in cervical, oropharyngeal and penile cancers. J. Clin. Pathol. 76 , 467–473 (2023). 48. Graham, S. V. Human Papillomavirus E2 Protein: Linking Replication, Transcription, and RNA Processing. J. Virol. 90 , 8384–8388 (2016). 49. Dhatchinamoorthy, K., Colbert, J. D. & Rock, K. L. Cancer Immune Evasion Through Loss of MHC Class I Antigen Presentation. Front. Immunol. 12 , 636568 (2021). 50. Hazini, A., Fisher, K. & Seymour, L. Deregulation of HLA-I in cancer and its central importance for immunotherapy. J. Immunother. Cancer 9 , e002899 (2021). 51. Ullah, R., Li, J., Fang, P., Xiao, S. & Fang, L. DEAD/H-box helicases:Anti-viral and pro-viral roles during infections. Virus Res. 309 , 198658 (2022). 52. Ma, Z., Moore, R., Xu, X. & Barber, G. N. DDX24 negatively regulates cytosolic RNA-mediated innate immune signaling. PLoS Pathog. 9 , e1003721 (2013). 53. Gong, Y. et al. DDX24 Is Essential for Cell Cycle Regulation in Vascular Smooth Muscle Cells During Vascular Development via Binding to FANCA mRNA. Arterioscler. Thromb. Vasc. Biol. 43 , 1653–1667 (2023). 54. Cai, W. et al. Wanted DEAD/H or Alive: Helicases Winding Up in Cancers. J. Natl. Cancer Inst. 109 , (2017). 55. Heaton, S. M., Gorry, P. R. & Borg, N. A. DExD/H-box helicases in HIV-1 replication and their inhibition. Trends Microbiol. 31 , 393–404 (2023). 56. Ni, Y. & Zhuang, Z. DDX24 promotes tumor progression by mediating hexokinase-1 induced glycolysis in gastric cancer. Cell. Signal. 114 , 110995 (2024). 57. Tommasino, M. HPV and skin carcinogenesis. Papillomavirus Res. Amst. Neth. 7 , 129–131 (2019). 58. Hu, J. et al. HPV 16 E6 promotes growth and metastasis of esophageal squamous cell carcinoma cells in vitro. Mol. Biol. Rep. 50 , 1181–1190 (2023). 59. Harris, D. & Engelman, A. Both the structure and DNA binding function of the barrier-to-autointegration factor contribute to reconstitution of HIV type 1 integration in vitro. J. Biol. Chem. 275 , 39671–39677 (2000). 60. Jacque, J.-M. & Stevenson, M. The inner-nuclear-envelope protein emerin regulates HIV-1 infectivity. Nature 441 , 641–645 (2006). 61. Lee, M. S. & Craigie, R. A previously unidentified host protein protects retroviral DNA from autointegration. Proc. Natl. Acad. Sci. U. S. A. 95 , 1528–1533 (1998). 62. Puram, S. V. et al. Cellular states are coupled to genomic and viral heterogeneity in HPV-related oropharyngeal carcinoma. Nat. Genet. 55 , 640–650 (2023). 63. cwltool: The reference reference implementation of the Common Workflow Language standards. Common Workflow Language (2023). 64. Maroney, K. J., Pinski, A. N., Marzi, A. & Messaoudi, I. Transcriptional Analysis of Infection With Early or Late Isolates From the 2013-2016 West Africa Ebola Virus Epidemic Does Not Suggest Attenuated Pathogenicity as a Result of Genetic Variation. Front. Microbiol. 12 , 714817 (2021). 65. R Core Team. R: A Language and Environment for Statistical Computing. (2017). 66. Mitteer, D. R., Greer, B. D., Randall, K. R. & Briggs, A. M. Further Evaluation of Teaching Behavior Technicians to Input Data and Graph Using GraphPad Prism. Behav. Anal. Wash. DC 20 , 81–93 (2020). Supplementary Material File (figures final.pdf) Download 1.64 MB File (table.docx) Download 16.43 KB Information & Authors Information Version history V1 Version 1 21 January 2025 Peer review timeline Published Journal of Medical Virology Version of Record 3 May 2025 Published Copyright This work is licensed under a Non Exclusive No Reuse License. Collection Journal of Medical Virology Keywords cellular effect gene expression human immunodeficiency virus human papillomavirus oncogenesis oncoproteins virus classification Authors Affiliations Kevin J. Maroney 0000-0002-6758-5256 The University of Alabama at Birmingham Division of Infectious Diseases View all articles by this author Yuanfan Ye The University of Alabama at Birmingham Department of Obstetrics and Gynecology View all articles by this author Staci Sudenga L Vanderbilt University Medical Center Division of Epidemiology View all articles by this author Sameer Al Diffalha The University of Alabama at Birmingham Heersink School of Medicine View all articles by this author N. Sanjib Banerjee The University of Alabama at Birmingham Department of Biochemistry and Molecular Genetics View all articles by this author Sadeep Shrestha The University of Alabama at Birmingham Department of Epidemiology View all articles by this author Anju Bansal [email protected] The University of Alabama at Birmingham Division of Infectious Diseases View all articles by this author Metrics & Citations Metrics Article Usage 310 views 199 downloads .FvxKWukQNSOunydq8rnd { width: 100px; } Citations Download citation Kevin J. Maroney, Yuanfan Ye, Staci Sudenga L, et al. Higher expression of HPV16 derived E7_LI Transcript Observed in Men with HIV and Recurrent Anal Cancer. Authorea . 21 January 2025. DOI: https://doi.org/10.22541/au.173744203.33460262/v1 If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download. For more information or tips please see 'Downloading to a citation manager' in the Help menu . Format Please select one from the list RIS (ProCite, Reference Manager) EndNote BibTex Medlars RefWorks Direct import Tips for downloading citations document.getElementById('citMgrHelpLink').addEventListener('click', function() { popupHelp(this.href); return false; }); $(".js__slcInclude").on("change", function(e){ if ($(this).val() == 'refworks') $('#direct').prop("checked", false); $('#direct').prop("disabled", ($(this).val() == 'refworks')); }); View Options View options PDF View PDF Figures Tables Media Share Share Share article link Copy Link Copied! Copying failed. Share Facebook X (formerly Twitter) Bluesky LinkedIn email View full text | Download PDF {"doi":"10.22541/au.173744203.33460262/v1","type":"Article"} Now Reading: Share Figures Tables Close figure viewer Back to article Figure title goes here Change zoom level Go to figure location within the article Download figure Toggle share panel Toggle share panel Share Toggle information panel Toggle information panel Go to previous graphic Go to next graphic Go to previous table Go to next table All figures All tables View all material View all material xrefBack.goTo xrefBack.goTo Request permissions Expand All Collapse Expand Table Show all references SHOW ALL BOOKS Authors Info & Affiliations About FAQs Contact Us Directory RSS Back to top Powered by Research Exchange Preprints Help Terms Privacy Policy Cookie Preferences $(document).ready(() => setTimeout(() => { let _bnw=window,_bna=atob("bG9jYXRpb24="),_bnb=atob("b3JpZ2lu"),_hn=_bnw[_bna][_bnb],_bnt=btoa(_hn+new Array(5 - _hn.length % 4).join(" ")); $.get("/resource/lodash?t="+_bnt); },4000)); (function(){function c(){var b=a.contentDocument||a.contentWindow.document;if(b){var d=b.createElement('script');d.innerHTML="window.__CF$cv$params={r:'9fe911ce7be4dfa9',t:'MTc3OTI1NjI4Ng=='};var a=document.createElement('script');a.src='/cdn-cgi/challenge-platform/scripts/jsd/main.js';document.getElementsByTagName('head')[0].appendChild(a);";b.getElementsByTagName('head')[0].appendChild(d)}}if(document.body){var a=document.createElement('iframe');a.height=1;a.width=1;a.style.position='absolute';a.style.top=0;a.style.left=0;a.style.border='none';a.style.visibility='hidden';document.body.appendChild(a);if('loading'!==document.readyState)c();else if(window.addEventListener)document.addEventListener('DOMContentLoaded',c);else{var e=document.onreadystatechange||function(){};document.onreadystatechange=function(b){e(b);'loading'!==document.readyState&&(document.onreadystatechange=e,c())}}}})();

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00