{"paper_id":"ab0459dc-de0c-411b-823b-96d2aa58951b","body_text":"Actinomyces turicensis  was reclassified ( 1 ) as  Schaalia turicensis . These Gram-positive facultative anaerobes inhabit the human oral ( 2, 3 ), gut ( 4 ), and urogenital tracts ( 5 – 7 ).  S. turicensis  is an opportunistic pathogen, with reported cases involving urethritis ( 5 ), bacteremia ( 8 ), meningitis ( 9 ), soft tissue infections ( 10 ), and wound infections ( 11 ).\nIn this announcement, we describe the draft genome of  S. turicensis  R31, which was isolated from the endometrium, thereby expanding our understanding of  S. turicensis  in gynecologic health.\nThis isolate was obtained from an endometrial swab of a patient diagnosed with adenomyosis, in which endometrial-like tissue grows into the myometrium. Sampling was conducted as part of a multi-omic study investigating patients undergoing hysterectomy ( 12 – 14 ). The Institutional Review Board of the University of Arizona approved this study (reference no. 1708726047), and written informed consent was obtained from the study participant. Swabs were frozen in Amies transport media with 10% glycerol. Serial dilutions were performed to isolate the bacteria under anaerobic conditions at 37°C on Tryptic Soy Agar supplemented with 5% sheep’s blood for 48 h. Bacterial DNA was extracted using the Qiagen DNeasy PowerSoil Pro Kit (MO BIO Laboratories, Carlsbad) and sequenced at the University of Arizona PANDA Core for Genomics and Microbiome Research. Paired-end sequencing was performed using Illumina’s PCR-Free Library Prep Kit and the NextSeq 1000 Platform (300-cycle) with read lengths of 35–151 bp. Reads were quality-trimmed with Trimmomatic (v0.39, ILLUMINACLIP:TruSeq3-PE-2.fa:2:30:10, SLIDINGWINDOW:4:20, MINLEN:100, HEADCROP:15) ( 15 ), and read quality was assessed using FastQC (v0.11.9) ( 16 ). Kraken2 (v2.1.3) ( 17 ) and Bracken (v2.8) ( 17 ) were used for species-level read classification based on the k2_pluspf database (downloaded on 2023-06-05) ( 18 ). 
KrakenTools (v1.2) ( 17 ) (extract_kraken_reads.py, --taxid 9606, --include-children) was used to separate human from microbial reads. Assembly was performed using Unicycler (v16.0) ( 19 ), followed by quality checks with CheckM2 (v1.0.1, -m 500) ( 20 ) and QUAST (v5.2.0) ( 21 ). Annotation was performed using PGAP (v6.1) ( 22 ). Default parameters were used for all tools unless otherwise specified. All code is available on GitHub ( https://github.com/hurwitzlab/vaginal_genome_assembly ). Genomic analyses were performed on the Bacterial and Viral Bioinformatics Resource Center (BV-BRC) website ( 23 ).\nThe draft genome of  S. turicensis  R31 comprises 38 contigs, totaling 1,973,689 base pairs, with a GC content of 56.83%. The assembly exhibits moderate contiguity with an N50 of 102,621 bp. Annotation revealed 1,750 coding sequences, 47 tRNA genes, and three rRNA genes. Of the coding sequences, 229 were annotated as hypothetical proteins, while 1,521 had functional assignments.\nSubsystem classification identified metabolism (196 genes) as the largest functional group ( Table 1 ). Amino acid metabolism made up 34.7% of the genome’s capability. Protein comparisons with public  S. turicensis  strains indicated 438 shared protein families and 24 protein families unique to R31 ( Table 1 ). Two of these were related to cell wall/invasion-associated proteins and histone acetyltransferase, while the remaining 22 families were annotated as hypothetical proteins ( Table 1 ). Twenty-four antibiotic resistance genes were identified, which may be crucial to consider when treating actinomycosis ( Table 1 ).\nTable 1. Genome characteristics of  S. turicensis  R31\nThe table lists isolate information, including the source and the health status of the patient from whom the isolate was obtained. Further strain identity information, including taxonomy classification based on two databases (Kraken2) and average nucleotide identity compared to all  S. 
turicensis  genomes, genomes within Clade 1, and genomes within Clade 2, is highlighted in the phylogenetic tree. In addition, genomic characteristics were obtained from the genome as annotated and evaluated with BV-BRC, including genome size, number of contigs, contig N50, contig L50, GC content, genome coverage, number of 5S rRNA, 16S rRNA, 23S rRNA, tRNAs, CDS, and CDS with functional assignments. The data also include the number of unique protein families specific to the R31 genome and their corresponding PATRIC cross-genus family IDs, as well as the number of antibiotic resistance genes identified by PATRIC, along with their gene IDs. Methods and version numbers for genome assembly can be found at  https://github.com/hurwitzlab/vaginal_genome_assembly .\nKraken2 classification confirmed the identity of R31 within the  Schaalia  genus. Additional strain-specific phylogenetic analysis revealed a placement that was not specific to isolation source ( Fig. 1 ), with an average nucleotide identity of 96.8% to other  S. turicensis  genomes in Clade 1 and 68% to those in Clade 2 ( Table 1 ). Together, these findings highlight the fundamental genomic capabilities of the endometrial  S. turicensis  R31 genome, laying a foundation for investigating its role in gynecologic health.\nFig. 1. Single-copy orthologous phylogenetic tree of 16 publicly available  S. turicensis  strains and R31. Single-copy orthologous genes were used to create the bacterial genome tree ( n  = 278) and were aligned by codon using RAxML as part of the BV-BRC bacterial genome tree pipeline with default parameters. The tree comprises 11 whole-genome assemblies and five metagenome-assembled genomes of  S. turicensis .  S. turicensis  R31 is the strain discussed in this microbial resource announcement and is indicated by a red diamond. The colors of  S. turicensis  strain names are based on the environmental source from which the genomes originated.","source_license":"CC-BY-4.0","license_restricted":false}