A Predictive Framework Integrating Network Toxicology and Machine Learning Elucidates 6PPD-Quinone-Induced Glioblastoma Pathogenesis

preprint OA: closed
Full text JSON View at publisher
Full text 134,787 characters · extracted from preprint-html · click to expand
A Predictive Framework Integrating Network Toxicology and Machine Learning Elucidates 6PPD-Quinone-Induced Glioblastoma Pathogenesis | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article A Predictive Framework Integrating Network Toxicology and Machine Learning Elucidates 6PPD-Quinone-Induced Glioblastoma Pathogenesis Yuan Xu, Yu Yao, Junlei Huang, Haojun Zhang, Guiming Zhang This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-8679784/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract Through machine learning analysis, we identified five core genes with special diagnostic performance, namely SCN2B, VIPR1, PAK1, MAP2K1 and SYNJ1. The AUC value of this integrated model in the validation queue reached 0.957, and its prediction accuracy is relatively high. From the analysis of SHAP interpretability, it can be found that MAP2K1 is the most influential predictor, and there are complex nonlinear relationships among core genes. Molecular docking simulations have provided structural evidence that can support the direct interaction between 6 PPD-Q and the target protein. The vina score ranges from − 5.9 to -8.2, and they have a strong specific binding affinity. 6PPD-Quinone bioinformatics glioblastoma machine learning molecular docking environmental carcinogenesis neural signaling pathways Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Figure 6 Introduction Glioblastoma is the most common primary malignant brain tumor in adults, accounting for approximately 45% to 50% of all primary malignant brain tumors. Its incidence rate begins to increase after the age of 40 and peaks between the ages of 75 and 84 (Schaff & Mellinghoff, 2023 ). According to the classification of brain tumors by the World Health Organization (WHO), GBM is classified as grade 4, which is the most malignant type of astrocytoma. It is highly invasive, grows at an extremely fast rate, and often invents the surrounding brain tissue, making it one of the most lethal brain tumors (Pace et al., 2017 ). Currently, the standard treatment methods for GBM include surgical tumor removal, radiotherapy combined with temozolomide for chemotherapy (Lan et al., 2024 ). Although in the past ten years, there have been certain advancements in surgery, radiotherapy and chemotherapy, but the overall survival probability of GBM patients is still very low, with an average survival time of only 12 to 15 months (Schaff & Mellinghoff, 2023 ). The causes of GBM involve complex interactions among genetic alterations, epigenetic modifications, and environmental exposure. These factors jointly promote the generation and development of tumors (Esemen et al., 2022 ). Due to its heterogeneity, highly invasive disease progression, and resistance to existing treatment methods, GBM has always been a major clinical challenge. Although risk factors such as ionizing radiation, certain genetic syndromes and family history have now been identified, the impact of widespread environmental pollutants on the causes of GBM has not been fully understood. This is currently an area where active research is being conducted (Pouyan et al., 2025 ). Tire wear particles enter the aquatic environment via stormwater runoff and can release highly toxic pollutants with teratogenic and mutagenic effects. One chemical released from tire wear particles, N-(1,3-dimethylbutyl)-N′-phenyl-p-phenylenediamine-quinone (6PPD-quinone, 6PPD-Q), has been reported as the second most acutely toxic chemical to aquatic species ever identified (Chen et al., 2024 ). Its precursor, N-(1,3-dimethylbutyl)-N′-phenyl-p-phenylenediamine (6PPD), is an antioxidant widely used in rubber tires and readily transforms into 6PPD-Q in the presence of ozone (Li et al., 2025 ). 6PPD-Q has recently gained significant attention as an emerging environmental contaminant. It was initially identified as the causative agent behind acute mortality in coho salmon exposed to urban stormwater runoff and has since been detected in various environmental media, including airborne particulate matter, soil, and water systems (Fang et al., 2024 ; Zhang et al., 2023 ). The widespread use of tires and the consequent leaching of 6PPD and its transformation products into the environment suggest potential broad human exposure pathways, particularly in urban settings. The chemical stability and lipophilicity of 6PPD-Q raise concerns about its potential for bioaccumulation and tissue-specific toxicity. 6PPD-Q can rapidly penetrate the blood-brain barrier within such a short period of 0.5 hours. Its structure is quite similar to that of other known neurotoxic quinones. Its molecular weight is relatively low, and it has moderate lipophilicity. All these circumstances indicate that 6 PPD-Q can indeed enter neural tissue (Huang et al., 2024 ; Li et al., 2024 ). Previous studies have shown that various quinone compounds can induce oxidative stress, cause mitochondrial dysfunction, and damage the DNA of nerve cells. These mechanisms are also related to carcinogenesis. However, although 6PPD-Q has these worrying characteristics, However, the possible role that 6ppD-Q may play in the development of brain tumors and in glioblastoma has not been studied at all up to now. Previous studies have shown that various quinone compounds can induce oxidative stress, cause mitochondrial dysfunction, and damage the DNA of nerve cells. These mechanisms are actually related to carcinogenesis (Greer et al., 2023 ; Jana et al., 2011 ). However, although 6PPD-Q has these worrying characteristics, it may play a role in the development of brain tumors. Its role in the development of glioblastoma has not been studied at all up to now. The emergence of network toxicology and computational systems biology methods has provided a very powerful framework for uncovering the complex interactions between toxins and hosts. These methods, by integrating multi-omics datasets, dynamic network modeling, and computational structure-activity analysis, can systematically identify molecular targets and pathways that are disturbed by environmental exposure (Cecarini et al., 2012 ). The integration of machine learning algorithms also enables us to prioritize key hub molecules and develop predictive models for toxicity assessment. To fill the key knowledge gap regarding the possible glioma effects of 6PPD-Q, our study this time adopted a comprehensive triangular system toxicology approach, which consists of three main parts. The first part is to integrate multi-omics data and identify the possible targets of 6PPD-Q. To reconstruct the network of the pathogenesis of glioblastoma. The second part is to determine the priority of key hub molecules by conducting topological and functional enrichment analysis and using machine learning. The third part is to combine the molecules, verify the target binding at the atomic level, and describe the thermodynamic characteristics of the interaction by clarifying the possible multi-scale carcinogenic network of 6PPD-Q. This article aims to identify the nodes where drugs can be used for chemophobic and precise treatment of glioblastoma, thereby addressing a key unsolved need in environmental neuro-oncology. Materials and Methods Acquisition and Processing of GBM Transcriptomic Data This article systematically sorted out five independent GBM transcriptome datasets from the NCBI Comprehensive Gene expression database (GSE12657, GSE42656, GSE50161, GSE67089 and GSE86574). The selection of these datasets was based on certain specific criteria, which included sample size, requiring each dataset to have at least 20 samples, and platform consistency. It should be based on the Affymetrix or Illumina platform. Additionally, the availability of the raw data and whether the dataset contains tumor and normal control samples must also be taken into consideration. These five datasets contain a total of 342 GBM samples and 44 normal brain tissue controls. Among them, GSE12657 and GSE42656 were designated as discovery cohorts with 98 samples, while the remaining three datasets (GSE50161, GSE67089, GSE86574) were independent validation cohorts with 288 samples. Considering that the gene expression data in public databases usually contain some non-biological confounding factors, such as unknown batch effects or heterogeneity among individuals, which may blur the true disease-related signals, we first adopt the method of alternative variable analysis to reduce such interference situations. We wanted to visually assess the corrective impact of SVA on the data and explore the natural clustering of the samples based on the overall gene expression pattern. Therefore, we conducted principal component analysis before and after SVA adjustment. With the combined strategy of SVA and PCA, we effectively controlled the data quality It was also confirmed that the adjusted data could robustly reflect the transcriptomic characteristics related to the disease status. Differential Gene Expression Analysis Based on this purified dataset, this article uses limma software to conduct differential expression analysis on GBM tissues and normal brain controls, that is, to see what differences there are in gene expression between them. When identifying differentially expressed genes, a relatively strict threshold was adopted, requiring the FDR correction p value to be less than 0.05. Moreover, the log2FC should be greater than 0.585, where 0.585 corresponds to a 1.5 times change. Subsequently, this paper uses the ggplot2 software package to visualize the analysis results. For instance, a volcano plot is used to display the distribution and statistical magnitude of expression changes, allowing us to more intuitively observe the distribution and magnitude of gene expression changes. Heat maps are also employed to illustrate the expression patterns of different groups, thus enabling a clear understanding of the characteristics of gene expression patterns in different groups. Weighted Gene Co-expression Network Analysis (WGCNA) A scale-free co-expression network was constructed using the WGCNA package in R to identify functionally related gene modules associated with GBM pathogenesis. When using the WGCNA package to construct gene co-expression networks, quality control is first performed through sample hierarchical clustering to remove outlier samples. Subsequently, this paper relies on the hierarchical clustering of topological overlap matrices to identify co-expressed modules. Here, the parameters are set to a minimum module size of 60, and the threshold for module merging is set at 0.25. Then, this paper will analyze the correlation between modules and traits. The screening criterion is that the Pearson correlation coefficient between the characteristic genes and phenotypes of the modules should be greater than 0.5. The threshold for greater magnitude is that the p value should be less than 0.05. In this article, hub genes will be screened based on in-mode connectivity, and genes with a kME greater than 0.8 will be retained. Acquisition of Chemical Components and Targets of 6PPD-Q 6PPD-Q was systematically characterized through multisource database integration to ensure comprehensive target prediction. The canonical SMILES notation [CC(C)CC(C)NC1 = CC(= O)C(= CC1 = O)NC2 = CC = CC=C2] and 2D molecular structure were extracted from the PubChem database. Target prediction employed a tripartite strategy to maximize coverage and reliability: ChEMBL Database: We queried the ChEMBL database ( https://www.ebi.ac.uk/chembl/ ) for compounds structurally similar to 6PPD-Q. Targets associated with these similar compounds were extracted and considered as potential targets for 6PPD-Q. SwissTargetPrediction: The canonical SMILES of 6PPD-Q was submitted to the SwissTargetPrediction web server ( https://swisstargetprediction.ch/ ), which employs a combination of 2D and 3D similarity measures to predict protein targets based on known ligand-protein interactions. PharmMapper Server: The 3D structure of 6PPD-Q was energy-minimized using the MMFF94 force field and submitted to the PharmMapper server ( https://www.lilab-ecust.cn/pharmmapper/index.html ) for reverse pharmacophore mapping against the human protein database. The selection criterion for potential targets is Norm Fit > 0.5. All predicted targets from the three approaches were restricted to the Homo sapiens proteome, and redundant entries were removed to generate a final set of unique potential targets for 6PPD-Q. Identification of 6PPD-Q-Associated Disease Targets Intersection analysis between the union set of 6PPD-Q-predicted targets and the GBM-related genes (from DEGs and WGCNA hub genes) was performed to identify core targets potentially involved in 6PPD-Q-induced gliomagenesis. Venn diagrams were constructed to visualize the overlap using the VennDiagram package in R. The resulting overlapping genes were considered high-priority candidates for subsequent functional characterization and machine learning analysis. Functional Enrichment Analysis To elucidate the biological functions and pathways associated with the overlapping genes, we performed comprehensive Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses using the clusterProfiler package in R. Results were visualized using bar plots, dot plots, and circos plots to illustrate the relationships between genes and enriched terms. Machine Learning-Based Core Gene Screening This paper aims to systematically screen the diagnostic biomarkers related to 6ppd-q in GBM. Therefore, a stratified sampling method is adopted to divide the dataset into training subsets and internal validation subsets. This is done to keep the category distribution stable. This paper constructs a comprehensive multi-algorithm ensemble machine learning prediction framework. This framework integrates ten classic machine learning algorithms, including Lasso regression, Radial Basis Function kernel Support vector machine (SVM), Random Forest (RF), Gradient boosting machine (GBM), XGBoost, linear discriminant analysis (LDA), Naive Bayes, and ensemble methods. For each algorithm, this paper has developed several models with different hyperparameter Settings. Eventually, a total of 113 prediction models were obtained. This paper relies on 5x cross-validation to optimize the hyperparameters and uses stratified sampling to separate training and validation. The performance of the models is evaluated using the area under the curve, or AUC, as a metric. Subsequently, the stacked ensemble learning method was adopted to integrate the predictions from the best-performing individual models. Those high-confidence models, that is, those achieving an AUC greater than 0.85 during both training and validation, were selected for this ensemble. Then, the features were ranked and analyzed based on their frequency of occurrence in these high-performance models. The genes with relatively high rankings have been identified as the core genes that may be involved in the formation of glioma induced by 6ppd-q. Model Interpretation Using SHAP Analysis This study aims to clarify the decision-making mechanism of the optimal prediction model and identify key features. Therefore, a SHAP framework based on R language was used to conduct an interpretability analysis of the model. We utilized the SHAP package to calculate the SHAP value of each feature in every sample from the trained optimal classification model. This value can quantitatively describe the magnitude and direction of the contribution of features to the model output. Molecular Docking Analysis To validate the interaction between the key target and 6PPD-Q at the atomic level, we performed molecular docking simulations. To validate the interaction between the key target and 6PPD-Q at the atomic level, we performed molecular docking simulations. The three-dimensional crystal structure of the core target protein was obtained from the Protein Data Bank (PDB) and downloaded in PDB format. Molecular docking was conducted using the online platform CB-Dock2, which can automatically identify potential binding pockets of the protein and perform rapid docking. The docking results were evaluated based on the binding free energy predicted by the Vina scoring function, and the optimal conformation with the highest score was selected for subsequent analysis. Results Identification of GBM-Related Target Genes Multiple transcriptome datasets were successfully integrated, laying a foundation for the accurate identification of GBM-related genes. The comprehensive batch effect correction strategy we adopted effectively addressed the technical differences among different datasets, as evidenced by the significant improvement in data distribution consistency (Fig. 1 A-B). Principal component analysis revealed that before correction, The two datasets, GSE12657 and GSE42656, are clearly separated from each other, and there are a large number of batch effects (Fig. 1 C). After the coordinated pipeline was achieved, the samples of these two datasets presented an overlapping distribution in the dimensionality reduction space, which confirmed the success of the integration (Fig. 1 D). Such strict preprocessing of the data laid a reliable foundation for the subsequent differential expression and co-expression analysis. Differential expression analysis was conducted on GBM tissues and the normal brain control group. A total of 1,660 significantly differentially expressed genes that met our strict criteria were identified, with fdr correction P 0.585. We visualized the analysis results using a volcano map (Fig. 2 A). The heat map shows the different expression patterns of these DEGs. It can be seen that there is a significant separation between GBM and the control group, and it also highlights the molecular heterogeneity of glioblastoma (Fig. 2 B). Based on the results of WGCNA, we identified the key gene modules significantly associated with 6PPD-Q exposure. The module-trait relationship analysis showed that the positive correlation between the turquoise module and the black module and the treated trait was the strongest. Their correlation coefficients were 0.73 and 0.71 respectively, and the p values were both less than 1e-16. Under the exposure of 6PPD-Q, the co-expression changes of genes in these two modules were the most prominent (Fig. 2 C-D). An internal analysis of the modules revealed that among these key modules, there was a high degree of consistency between the module membership degree of the treated traits and the gene magnitude. The cor of the turquoise module was 0.73, with a p value of 2.5e-165, and the cor of the black module was 0.76, with a p value of 4e-22. This means that in the turquoise and black modules, those genes located at the center of the network, that is, with stronger connectivity, have a closer association between their expression levels and 6PPD-Q exposure. These findings verify that these modules are biologically reliable and can be given priority when exploring the toxicological mechanisms of the network in the future. Integration of the 1,660 DEGs from conventional differential expression analysis and the 989 turquoise module genes from WGCNA yielded a final, high-confidence set of 1,885 unique GBM-related genes after removing duplicates (Fig. 2 E). This integrated approach enhanced the reliability of GBM-associated gene identification by combining both differential expression and functional co-expression criteria. Identification of 6PPD-Q Target Proteins and Associated Disease Targets The canonical chemical structure of 6PPD-Q was obtained from the PubChem database (Fig. 3 A). Subsequently, target prediction was performed using the ChEMBL, PharmMapper, and SwissTargetPrediction platforms. After data integration and removal of redundant entries, a total of 2,623 unique potential targets for 6PPD-Q were identified (Fig. 3 B). The distribution of these targets can be visualized using a Venn diagram. This relatively comprehensive target prediction result shows that 6PPD-Q has the potential to interact with multiple proteins, which is consistent with its structural complexity. After conducting a key cross-analysis of the 2,623 predicted targets of 6PPD-Q and 1,885 GBM-related genes, 225 overlapping genes were found (Fig. 3 C). These 225 overlapping genes form a set of core potential molecular targets. 6PPD-Q May participate in the formation process of glioma through these targets, so they were selected for subsequent functional characterization and machine learning analysis. Functional Enrichment Analysis of 6PPD-Q-Associated Targets in GBM To systematically elucidate the biological functions of the potential core targets of 6PPD-Q, we performed Gene Ontology (GO) enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. The integrated results clearly demonstrate that the toxic mechanisms of 6PPD-Q are highly concentrated in nervous system functions and cancer-related pathways. At the biological function level, GO analysis revealed that the core targets were significantly enriched in key neurological processes such as "regulation of membrane potential", "learning or memory", "gliogenesis", "cognition", "regulation of synaptic plasticity" and "potassium ion transport". In terms of cellular components, the targets were mainly located at key neuronal structures, such as the "synaptic membrane", "ion channel complexes", and "postsynaptic specialization" (Fig. 4 A). These findings together suggest that 6PPD-Q May impair the electrophysiological signals and synaptic functions of neurons, thereby affecting higher-order neural activities. At the level of signaling pathways, KEGG analysis provided more detailed insights into the mechanism. In typical neural signaling pathways such as "glutamatergic synapses", "cholinergic synapses", and "retrograde endogenous cannabinoid signaling", there was a very prominent enrichment of the targets, which strongly proved the synaptic functional terms discovered in the graphene oxide analysis. A coherent chain of evidence has been formed, ranging from molecular functions to signaling pathways. The analysis also revealed that multiple pathways closely related to tumorigenesis were highly enriched, including the "PI3K-Akt signaling pathway", "focal adhesion", "Ras signaling pathway", and various solid cancer pathways (Fig. 4 B). This finding links the neurotoxic effects of 6PPD-Q with potential carcinogenic risks. Its toxicity in the nervous system may be related to the activation or disruption of key oncogenic signaling networks. Based on the comprehensive GO and KEGG enrichment analysis, the neurotoxicity of 6PPD-Q is derived from the interaction among multiple processes and pathways, mainly involving the disruption of neuronal synaptic transmission and plasticity, as well as the possible abnormal activation of cancer signaling pathways related to the pathogenesis of glioblastoma. This provides a clear direction and a crucial theoretical basis for studying the specific toxicological mechanism of 6PPD-Q. Machine Learning-Based Identification of Core Genes in 6PPD-Q-Induced Gliomagenesis This paper aims to establish a reliable diagnostic model for glioblastoma induced by 6ppd-q and identify the most critical molecular targets. Therefore, a comprehensive machine learning framework was implemented, which integrates multiple algorithms and integration strategies. Based on the expression profiles of 225 overlapping genes, this paper employed 10 classic machine learning algorithms. A total of 113 prediction models were established. Model Performance Evaluation demonstrated exceptional predictive capability across both training and validation cohorts (Fig. 5 A). The top-performing models included: RF+NaiveBayes (AUC = 0.997 training, 0.995 validation, combined score = 0.957), RF+Ridge (AUC = 0.999 training, 0.995 validation, combined score = 0.953), XGBoost (AUC = 1.000 training, 0.986 validation, combined score = 0.946), Ridge (AUC = 0.998 training, 0.997 validation, combined score = 0.942) and RF+Enet[alpha = 0.4] (AUC = 0.999 training, 0.993 validation, combined score = 0.942). The integrated model performed particularly well in the independent validation cohort (GSE50161, GSE67089 and GSE86574), with an average AUC of 0.934, an accuracy rate of 0.892, and an f1 score of 0.885. This confirms that the diagnostic framework is relatively robust. Through core gene identification and feature criticality analysis, it can be known that there are five key genes with the highest diagnostic value. They are SCN2B, VIPR1, PAK1, MAP2K1 and SYNJ1 respectively (Fig. 5 B). The volcano plot clearly shows that there are significant differences in the expression of these genes between the treatment group and the control group. To assess the diagnostic potential of these five key genes, we conducted a receiver operating characteristic curve analysis. The analysis results indicated that all genes demonstrated strong discriminative power, with the area under the curve values all exceeding 0.94. It should be mentioned here that the prediction accuracy demonstrated by MAP2K1 and PAK1 is particularly outstanding (Fig. 5 C). This indicates that the screened key genes can very reliably distinguish 6PPD-Q exposed samples from control samples, demonstrating strong potential as biomarkers. Model Interpretation Using SHAP Analysis Based on SHAP model interpretability analysis, we further revealed the contribution levels and functional patterns of five key genes in predicting 6PPD-Q exposure. SHAP value analysis clearly quantified the importance of each feature, with MAP2K1 contributing most significantly to the model output (mean |SHAP value| = 0.075), followed by PAK1 (0.0575) and SYNJ1 (0.0442), while SCN2B and VIPR1 also showed substantial influence (Fig. 6 A). A bee swarm plot detailed the distribution of SHAP values for each gene and its relationship with gene expression levels. It should be noted that if MAP2K1 and PAK1 have relatively high expression, it is easier to increase the prediction score of the model. This indicates that they are upregulated in 6PPD-Q exposed samples and play a crucial positive role in promoting the classification results. In other words, their high expression has a very crucial positive effect on the classification results. In contrast to this situation, SYNJ1 and SCN2B exhibit more complex patterns. Their low expression states and high expression states respectively have positive and negative impacts on model predictions, which implies that there may be more complex regulatory mechanisms, as shown in Fig. 6 B. In addition, the SHAP-dependent plot reveals possible biological interactions among key genes. The expression levels of MAP2K1 and PAK1 tend to change in a coordinated way. When both genes show high expression, their positive combined effect on model prediction is the strongest. Similarly, there is also a very prominent covariant relationship between MAP2K1 and SYNJ1. These genes may be jointly involved in and regulate the same signaling pathways or cellular processes in the neurotoxic mechanism of 6PPD-Q, as can be seen from Fig. 6 C. Overall, SHAP analysis confirmed from the perspective of machine learning that MAP2K1, PAK1, SYNJ1, SCN2B and VIPR1 are the core genes that respond to 6PPD-Q exposure. It also explained their respective directional effects and potential functional synergies. Such analysis results provide clear priority goals for the subsequent experimental verification and mechanism research, and also offer theoretical basis. Molecular Docking Validation of 6PPD-Q-Core Gene Interactions In this paper, to verify the possible direct interaction between 6PPD-Q and the identified core genes, molecular docking simulation was carried out. The results showed that 6PPD-Q had relatively strong binding affinity with the five target proteins, and the binding energy was always lower than − 5.0 kcal/mol (Table 1). The molecular interaction was stable and the specificity was also very strong. According to the standards already established in molecular docking research, if the binding energy is less than 0 kilocalories per mole, it indicates that the substance has the ability to bind spontaneously. If the binding energy is less than − 5.0 kilocalories per mole, it indicates that the binding affinity is relatively good. If the binding energy is less than − 7.0 kilocalories per mole, it indicates that the binding affinity is very strong. The binding energies of these five complexes all exceeded the threshold that could achieve strong binding affinity. Among them, MAP2K1 demonstrated the most favorable interaction. Discussion This research is pioneering. It systematically interprets the possible molecular mechanisms that link the emerging environmental pollutant 6PPD-Q with the pathogenesis of glioblastoma (Li et al., 2025 ). By adopting an integrated computational strategy, we have coordinated multi-omics data, advanced machine learning, and structural biology simulations. A very notable interaction network has been discovered. Our research results show that 6PPD-Q may promote glioma formation by targeting a specific group of genes, which are particularly crucial for neural function and oncogenic signal transduction. The most important aspect of our research is identifying 225 highly reliable targets that can link 6PPD-Q exposure to GBM. Then, relying on a powerful machine learning framework, a detailed analysis was conducted on the five core diagnostic genes. SHAP interpretability analysis was used to explain their functional roles and mutual interactions. Additionally, molecular docking was employed to structionally verify the binding ability of 6PPD-Q to these targets (Gao et al., 2025 ; Liao et al., 2024 ). Our work has a key and novel discovery: at the unique intersection of neuro-specific processes and mature oncogenic pathways, there are clearly many potential 6PPD-Q targets (Wang et al., 2022 ; Yu et al., 2025 ). After a comprehensive analysis of GO and KEGG, it is unanimously indicated that there is a dual attack mechanism: 6PPD-Q seems to disrupt fundamental neurophysiological processes, including regulating membrane potential, synaptic plasticity, glial formation and ion channel activity, while also interfering with key cancer driver pathways such as PI3K-Akt, Ras and Focal Adhesion (Liao et al., 2024 ). This situation is crucial. It speculates that the neurotoxicity of 6PPD-Q is not only a parallel effect but may also be intrinsically linked to its carcinogenic potential. After the core neuron signals and homeostasis are disrupted, it may create a loose cellular environment or cause initial damage. When combined with the activation of proliferation and survival pathways, this will prompt the transformation of glial cells (Cecarini et al., 2012 ). This provides a computationally derived mechanism hypothesis for the sequence "environmental exposure - neurodysfunction - glioma formation", which goes beyond the known links between quinine drugs and general oxidative stress as well as DNA damage by accurately identifying specific downstream pathways (Guo et al., 2025 ). By comprehensively applying the machine learning framework, which includes 113 different models, it has been proven that this approach can eliminate the complexity of transcriptomic data, identify powerful biomarkers, and the five core genes verified in independent cohorts have excellent diagnostic performance, with all AUCs greater than 0.94 (Guo et al., 2025 ). This highlights their possible role as biomarkers for 6PPD-Q exposure or early detection of related pathological changes. What is even more worth mentioning is that the SHAP interpretability analysis has transformed these biomarkers from mere statistical associations into functional suspects with contextual implications. After quantifying the criticality of the features, it was determined that MAP2K1 and PAK1 are the main driving factors for the model's prediction (Crampon et al., 2022 ; Kuang et al., 2025 ). This discovery is quite astonishing from a biological perspective. MAP2K1 is the central kinase of the RAS/RAF/MEK/ERK pathway, which is a key basis for cell proliferation and survival that are often dysregulated in cancer, while PAK1 is a key node that integrates Rho GTPases signaling to regulate cytoskeletal dynamics, cell movement and transcription (Wang et al., 2024 ). Just as the SHAP dependency diagram shows, the synergistic relationship among them indicates that 6PPD-Q exposure may affect these interrelated growth-promoting and invasion-promoting signaling modules (Zhang et al., 2025 ). After adding SYNJ1, SCN2B and VIPR1, a more comprehensive picture is depicted. This suggests that there are dysfunctions in synaptic communication, ionic homeostasis and glial signaling in the toxic mechanism (Choudhry et al., 2021 ; Song et al., 2023 ). Molecular docking simulation provides a crucial structural basis for our hypothesis. Predictions show that 6PPD-Q has always had a strong binding affinity with the protein products of all five core genes, which provides a reasonable physical basis for the observed computational association (Song et al., 2024 ). The binding with MAP2K1 is particularly favorable, and it may be the main target. It may cause abnormal activation of the entire MAPK signal cascade. Although the results obtained from these computer simulations are highly enlightening, they cannot serve as the ultimate evidence of interactions or functional regulation in biological systems (Dankner et al., 2025 ; Lampron et al., 2023 ). They are the key starting point for future experimental work. This study also has some shortcomings. It is mainly based on calculation, and the causal relationship between 6PPD-Q exposure and glioma formation needs to be verified through rigorous experiments using both in vitro and in vivo models (Nair et al., 2025 ). It remains to be determined what exact functional outcomes will occur when 6PPD-Q binds to targets such as MAP2K1 or pak1, whether it activates their functions or inhibits them. Additionally, although the model organisms used in source transcriptomics studies have certain correlations, they may not fully reflect the complexity of human glioma formation (Luo et al., 2022 ; Zhou, 2024 ). Overall, our comprehensive analysis has constructed a very convincing new theory regarding the potential gliogenicity risk of 6PPD-Q. We have mapped out a coherent molecular network, prioritizing those highly credible core genes with specific diagnostic capabilities and pointing out the synergistic relationships among them. It also provides structural evidence for direct targeting (Cao et al., 2024 ; Zhou, 2024 ). This work has transformed 6PPD-Q from a common poison into a molecule that may cause specific and potential damage to key carcinogenic and neurodevelopmental pathways. The core genes identified, along with the synergistic genes for MAP2K1 and PAK1, are promising research objects for future mechanotoxicological studies. It may also be a biomarker for environmental monitoring and risk assessment of the population exposed to this ubiquitous environmental pollutant. Although this study has provided new insights into the possible molecular mechanisms of glioblastoma induced by 6PPD-Q, there are some limitations that need to be noted. Firstly, most of these research results are obtained through computational analysis and publicly available transcriptome data, lacking direct experiments to prove them. Although the results of molecular docking can indicate the possibility of binding between 6PPD-Q and the core target, these predictions based on calculation still need to be confirmed through in vitro and in vivo experiments to determine the causal relationship between them. The transcriptome datasets used in this study have significant heterogeneity and were obtained based on different platforms. Although strict methods were adopted to correct batch effects, there might still be some unprocessed confounding factors. Although the machine learning models demonstrated relatively high prediction accuracy, they were trained on existing glioblastoma data. Perhaps it is impossible to fully capture the dynamic changes and heterogeneity of glioblastoma induced by 6PPD-Q. Because the data was mainly obtained from human tissue samples, this study did not take into account the possible differences in toxicological responses among different species, which limited the generalization of the research results in the in vivo model system. In future research, it is necessary to integrate multi-omics analyses in cell and animal models exposed to 6PPD-Q This is crucial for verifying these computational predictions and clarifying the functional pathways in the context of biology. Conclusion Based on comprehensive computational analysis, this article determined that 6PPD-quinone might be involved in the pathogenesis of glioblastoma by disrupting key molecular networks. We identified 225 potential targets related to 6PPD-Q exposure and GBM, and extracted five core diagnostic genes using machine learning methods. The five core diagnostic genes are MAP2K1, PAK1, SYNJ1, SCN2B and VIPR1 respectively. These genes exhibit rather unique diagnostic properties and are involved in neural functions and carcinogenic pathways. The identified genes can be regarded as promising candidates for future experimental verification and potential environmental risk assessment biomarkers. This research work highlights the value of computational methods in the field of environmental toxicology. It also emphasized the necessity of conducting experimental verification on these prediction results. Declarations Ethics approval and consent to participate Not applicable. This study does not involve human participants. Consent for publication Not applicable. This study does not involve animal and human participants. Availability of data and materials All data analyzed in this study are from publicly available sources. The specific identifiers and direct links are as follows: Gene Expression Omnibus (GEO) datasets : GSE12657, GSE42656, GSE50161, GSE67089, GSE86574 (Access: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=ACCESSION_NUMBER) PubChem Compound : CID 154926030 (Access: https://pubchem.ncbi.nlm.nih.gov/compound/154926030) Protein Data Bank (PDB) structures : MAP2K1 (8YP5), PAK1 (4ZJI), SCN2B (6VRR), SYNJ1 (7A17), VIPR1 (3DTX) (Access: https://www.rcsb.org/structure/PDB_ID) Competing interests None declared. Funding No Funding。 Authors' contributions G.Z.: Conceptualization, Data curation, Formal analysis, Investigation, Validation, Supervision; Y.X.: Investigation, Data Curation, Writing—Original Draft; Y.Y., J.L. and H.J: Writing—Original Draft, Writing—Review and Editing, Supervision. References Cao Z, Zhao S, Hu S, Wu T, Sun F, Shi LI. Screening COPD-Related Biomarkers and Traditional Chinese Medicine Prediction Based on Bioinformatics and Machine Learning. Int J Chronic Obstr Pulm Dis. 2024;19:2073–95. https://doi.org/10.2147/copd.S476808 . Cecarini V, Bonfili L, Cuccioloni M, Mozzicafreddo M, Rossi G, Buizza L, Uberti D, Angeletti M, Eleuteri AM. Crosstalk between the ubiquitin–proteasome system and autophagy in a human cellular model of Alzheimer's disease. Biochim et Biophys Acta (BBA) - Mol Basis Disease. 2012;1822(11):1741–51. https://doi.org/10.1016/j.bbadis.2012.07.015 . Chen L, Hu J, Borthwick AGL, Sun W, Zhang H, Jia D, Liu W. Solar-light-activated periodate for degradation and detoxification of highly toxic 6PPD-quinone at environmental levels. Nat Water. 2024;2(5):453–63. https://doi.org/10.1038/s44221-024-00236-3 . Choudhry H, Aggarwal M, Pan P-Y. (2021). Mini-review: Synaptojanin 1 and its implications in membrane trafficking. Neuroscience Letters , 765 . https://doi.org/10.1016/j.neulet.2021.136288 Crampon K, Giorkallos A, Deldossi M, Baud S, Steffenel LA. Machine-learning methods for ligand–protein molecular docking. Drug Discovery Today. 2022;27(1):151–64. https://doi.org/10.1016/j.drudis.2021.09.007 . Dankner M, Rousselle E, Petrecca S, Fabi F, Nowakowski A, Lazaratos A-M, Rajadurai CV, Stein AJB, Bian D, Tai P, Belaiche A, Li M, Quaiattini A, Normanno N, Arcila M, Elkrief A, Johnson DB, Ladanyi M, Rose AAN. Clinical Activity of Mitogen-Activated Protein Kinase Inhibitors in Patients With MAP2K1 (MEK1)-Mutated Metastatic Cancers. JCO Precision Oncol. 2025;9 https://doi.org/10.1200/po.24.00199 . Esemen Y, Awan M, Parwez R, Baig A, Rahman S, Masala I, Franchini S, Giakoumettis D. Molecular Pathogenesis of Glioblastoma in Adults and Future Perspectives: A Systematic Review. Int J Mol Sci. 2022;23(5). https://doi.org/10.3390/ijms23052607 . Fang J, Wang X, Cao G, Wang F, Ru Y, Wang B, Zhang Y, Zhang D, Yan J, Xu J, Ji J, Ji F, Zhou Y, Guo L, Li M, Liu W, Cai X, Cai Z. 6PPD-quinone exposure induces neuronal mitochondrial dysfunction to exacerbate Lewy neurites formation induced by α-synuclein preformed fibrils seeding. J Hazard Mater. 2024;465. https://doi.org/10.1016/j.jhazmat.2023.133312 . Gao J, Zhang M, Chen Q, Ye K, Wu J, Wang T, Zhang P, Feng G. Integrating machine learning and molecular docking to decipher the molecular network of aflatoxin B1-induced hepatocellular carcinoma. Int J Surg. 2025;111(7):4539–49. https://doi.org/10.1097/js9.0000000000002455 . Greer JB, Dalsky EM, Lane RF, Hansen JD. Tire-Derived Transformation Product 6PPD-Quinone Induces Mortality and Transcriptionally Disrupts Vascular Permeability Pathways in Developing Coho Salmon. Environ Sci Technol. 2023;57(30):10940–50. https://doi.org/10.1021/acs.est.3c01040 . Guo B, Jiang X, Zhu L, He X. Exploring the Diagnostic Potential of Core Targets of 6PPD and Its Metabolite 6PPD-Q in Cardiovascular Diseases: An Integrated Analysis Based on Network Toxicology, Molecular Docking, and In Vitro Validation. J Appl Toxicol. 2025. https://doi.org/10.1002/jat.4892 . Huang Z, Chen C, Guan K, Xu S, Chen X, Lin Y, Li X, Shan Y. Protective role of ghrelin against 6PPD-quinone-induced neurotoxicity in zebrafish larvae (Danio rerio) via the GHSR pathway. Ecotoxicol Environ Saf. 2024;285. https://doi.org/10.1016/j.ecoenv.2024.117031 . Jana S, Sinha M, Chanda D, Roy T, Banerjee K, Munshi S, Patro BS, Chakrabarti S. Mitochondrial dysfunction mediated by quinone oxidation products of dopamine: Implications in dopamine cytotoxicity and pathogenesis of Parkinson's disease. Biochim et Biophys Acta (BBA) - Mol Basis Disease. 2011;1812(6):663–73. https://doi.org/10.1016/j.bbadis.2011.02.013 . Kuang L, Wang X, He Z, Zhang Y, Luo J, Zhang W, Fu Z, Tuo X. Explore the toxicological mechanism of 6PPD-Q on human health through a novel perspective: The interaction between lactate dehydrogenase and 6PPD-Q. Int J Biol Macromol. 2025;293. https://doi.org/10.1016/j.ijbiomac.2024.139266 . Lampron M-C, Paré I, Al-Zharani M, Semlali A, Loubaki L. Cannabinoid Mixture Affects the Fate and Functions of B Cells through the Modulation of the Caspase and MAP Kinase Pathways. Cells. 2023;12(4). https://doi.org/10.3390/cells12040588 . Lan Z, Li X, Zhang X. Glioblastoma: An Update in Pathology, Molecular Mechanisms and Biomarkers. Int J Mol Sci. 2024;25(5). https://doi.org/10.3390/ijms25053040 . Li G, Dan L, Zhai W, Liu C, Chen M, Xu Q, Huang Y. Respiratory toxicity mechanism of 6PPD and 6PPD-quinone: An integrated study based on network toxicology and molecular docking. Ecotoxicol Environ Saf. 2025;301. https://doi.org/10.1016/j.ecoenv.2025.118494 . Li Y, Zeng J, Liang Y, Zhao Y, Zhang S, Chen Z, Zhang J, Shen X, Wang J, Zhang Y, Sun Y. A Review of N-(1,3-Dimethylbutyl)-N′-phenyl-p-Phenylenediamine (6PPD) and Its Derivative 6PPD-Quinone in the Environment. Toxics. 2024;12(6). https://doi.org/10.3390/toxics12060394 . Liao X-L, Zhou J-M, Wang Y, Chen Z-F, Cai Z. Network pharmacology and transcriptomics reveal androgen receptor as a potential protein target for 6PPD-quinone. Sci Total Environ. 2024;957. https://doi.org/10.1016/j.scitotenv.2024.177678 . Luo J, Junaid M, Hamid N, Duan J-J, Yang X, Pei D-S. Current understanding of gliomagenesis: from model to mechanism. Int J Med Sci. 2022;19(14):2071–9. https://doi.org/10.7150/ijms.77287 . Nair P, Barrett H, Tanoto K, Xie L, Sun J, Yang D, Yao H, Song D, Peng H. Structure and Toxicity Characterization of Alkyl Hydroxylated Metabolites of 6PPD-Q. Environ Sci Technol. 2025;59(15):7474–84. https://doi.org/10.1021/acs.est.4c11823 . Pace A, Dirven L, Koekkoek JAF, Golla H, Fleming J, Rudà R, Marosi C, Rhun EL, Grant R, Oliver K, Oberg I, Bulbeck HJ, Rooney AG, Henriksson R, Pasman HRW, Oberndorfer S, Weller M, Taphoorn MJB. European Association for Neuro-Oncology (EANO) guidelines for palliative care in adults with glioma. Lancet Oncol. 2017;18(6):e330–40. https://doi.org/10.1016/s1470-2045(17)30345-5 . Pouyan A, Ghorbanlo M, Eslami M, Jahanshahi M, Ziaei E, Salami A, Mokhtari K, Shahpasand K, Farahani N, Meybodi TE, Entezari M, Taheriazam A, Hushmandi K, Hashemi M. Glioblastoma multiforme: insights into pathogenesis, key signaling pathways, and therapeutic strategies. Mol Cancer. 2025;24(1). https://doi.org/10.1186/s12943-025-02267-0 . Schaff LR, Mellinghoff IK. Glioblastoma and Other Primary Brain Malignancies in Adults. JAMA. 2023;329(7). https://doi.org/10.1001/jama.2023.0023 . Song DY, Yuan L, Cui N, Feng C, Meng L, Wang XH, Xiang M, Liu D, Wang C, Zhang Z, Li JY, Li W. α-Synuclein induces deficiency in clathrin‐mediated endocytosis through inhibiting synaptojanin1 expression. J Neurochem. 2023;167(3):461–84. https://doi.org/10.1111/jnc.15974 . Song Y, Weng W, Wu S. Investigating the Potential Effects of 6PPDQ on Prostate Cancer Through Network Toxicology and Molecular Docking. Toxics. 2024;12(12). https://doi.org/10.3390/toxics12120891 . Wang X, Li H, Xu A, Peng J, Wu Y, Liu Y, Zhang J, Cai C, Ma S, Zhang K. Inhibition of the long non-coding RNA MALAT1 downregulates MAP2K1, suppressing the progression of hypopharyngeal squamous cell carcinoma. Biomolecules Biomed. 2024;25(5):1023–37. https://doi.org/10.17305/bb.2024.11151 . Wang Y, Yuan Y, Wang W, He Y, Zhong H, Zhou X, Chen Y, Cai X-J, Liu L-q. (2022). Mechanisms underlying the therapeutic effects of Qingfeiyin in treating acute lung injury based on GEO datasets, network pharmacology and molecular docking. Computers in Biology and Medicine , 145 . https://doi.org/10.1016/j.compbiomed.2022.105454 Yu H, Zhang W, Wang D, Shi B, Zhu Y, Hu W, He J, Hong J, Xu X, Zheng X, Chen W, Wang F, Qu F. Exposure to 6PPD-Q induces dysfunctions of ovarian granulosa cells: Its potential role in PCOS. J Hazard Mater. 2025;486. https://doi.org/10.1016/j.jhazmat.2024.137037 . Zhang S-Y, Gan X, Shen B, Jiang J, Shen H, Lei Y, Liang Q, Bai C, Huang C, Wu W, Guo Y, Song Y, Chen J. 6PPD and its metabolite 6PPDQ induce different developmental toxicities and phenotypes in embryonic zebrafish. J Hazard Mater. 2023;455. https://doi.org/10.1016/j.jhazmat.2023.131601 . Zhang W, Wu B, Hu C, Rong W, Huang Y, Hu S, Qin Y. EGFR and CYP signaling disruption underlies 6PPD-quinone hepatotoxicity: Insights from a network and machine learning approach. Toxicology. 2025;517. https://doi.org/10.1016/j.tox.2025.154235 . Zhou L. Generation and banking of patient-derived glioblastoma organoid and its application in cancer neuroscience. Am J cancer Res. 2024;14(10):5000–10. https://doi.org/10.62347/nsva5836 . Additional Declarations No competing interests reported. Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-8679784","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":592942064,"identity":"721f25a9-8447-4816-b1d1-1e75f58a1c30","order_by":0,"name":"Yuan Xu","email":"","orcid":"","institution":"The Affiliated Hospital of Qingdao University","correspondingAuthor":false,"prefix":"","firstName":"Yuan","middleName":"","lastName":"Xu","suffix":""},{"id":592942066,"identity":"d9106922-569f-4977-a520-b2f662efc443","order_by":1,"name":"Yu Yao","email":"","orcid":"","institution":"The Affiliated Hospital of Qingdao University","correspondingAuthor":false,"prefix":"","firstName":"Yu","middleName":"","lastName":"Yao","suffix":""},{"id":592942068,"identity":"7a5f0107-b393-4f9c-86cf-9db86ab8a90f","order_by":2,"name":"Junlei Huang","email":"","orcid":"","institution":"Peking University People's Hospital","correspondingAuthor":false,"prefix":"","firstName":"Junlei","middleName":"","lastName":"Huang","suffix":""},{"id":592942071,"identity":"b9de29af-a9f0-466c-bf94-65f519254b91","order_by":3,"name":"Haojun Zhang","email":"","orcid":"","institution":"The Affiliated Hospital of Qingdao University","correspondingAuthor":false,"prefix":"","firstName":"Haojun","middleName":"","lastName":"Zhang","suffix":""},{"id":592942074,"identity":"81f9c9b4-cc45-477c-8bf3-634c2c52a108","order_by":4,"name":"Guiming Zhang","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA7ElEQVRIiWNgGAWjYDACCQjFw8AMJD9USMjxk6SFccYZC2PJBiK1gAEzb1tF4gZCWuRnNz97+OXXYRnddt7Dr3nnSTBuYGB++OgGHi2Mc46ZG8v2HeYxO8yXZjl3mwSzOQObsXEOHi3MEglm0pI9IC08ZgZvt0mwWTbwsEnj08Imkf4NoYV3jgSPwQECWngkcswkP/wAazF+yNsgIUFQi4RETpk0Y0M62BbGGcckDCSbCfhFfkb6Nskff6ztzc6fMf7woaauvp+9+eFjfFpAABgdUH9BuASUgwDjjz8QtR+IUDwKRsEoGAUjEAAApj5HbVI4PeQAAAAASUVORK5CYII=","orcid":"","institution":"The Affiliated Hospital of Qingdao University","correspondingAuthor":true,"prefix":"","firstName":"Guiming","middleName":"","lastName":"Zhang","suffix":""}],"badges":[],"createdAt":"2026-01-23 13:39:42","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-8679784/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-8679784/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":103050084,"identity":"5b0a3398-71e7-40d6-9d24-8e70f1c62ec5","added_by":"auto","created_at":"2026-02-20 07:48:06","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":211436,"visible":true,"origin":"","legend":"\u003cp\u003eA-B: PCA plots before (A) and after (B) SVA correction. Dataset-specific clustering (GSE12657 vs GSE42656) is eliminated post-correction; C-D: 2D visualizations confirm technical artifacts (C) are resolved (D) after processing. Data were normalized using SVA and visualized via PCA.\u003c/p\u003e","description":"","filename":"1.png","url":"https://assets-eu.researchsquare.com/files/rs-8679784/v1/e847792a6f928946ba39057f.png"},{"id":102993540,"identity":"d2f0aed8-1621-4020-9513-53c75a805ddb","added_by":"auto","created_at":"2026-02-19 11:48:26","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":327898,"visible":true,"origin":"","legend":"\u003cp\u003e\u0026nbsp;Volcano plot of DEGs between treatment and control groups (FDR\u0026lt;0.05, |logFC|\u0026gt;0.585); B: Heatmap displaying normalized expression patterns of significant DEGs across samples; C: Module-trait relationships from WGCNA. Turquoise and black modules show strongest correlation with treatment (r=0.73, 0.71; p\u0026lt;1e-16); D: Gene dendrogram with module assignments from WGCNA analysis; E: Venn diagram identifying 225 overlapping genes between DEGs and WGCNA modules for subsequent analysis.\u003c/p\u003e","description":"","filename":"2.png","url":"https://assets-eu.researchsquare.com/files/rs-8679784/v1/db42b9208a850f3651db25fc.png"},{"id":103050193,"identity":"282dffca-faa8-43a8-b1d2-a181de6d6352","added_by":"auto","created_at":"2026-02-20 07:48:43","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":89967,"visible":true,"origin":"","legend":"\u003cp\u003eA: Chemical structure of AFB1; B: Overlap of 6PPD-Q targets predicted by three databases, totaling 2,623 unique targets; C: Integration identifies 225 overlapping genes between 6PPD-Q targets and glioblastoma genes as core candidates.\u003c/p\u003e","description":"","filename":"3.png","url":"https://assets-eu.researchsquare.com/files/rs-8679784/v1/719d73f49288abc48fcd095f.png"},{"id":102993545,"identity":"c105fbee-af84-4466-83a1-f74ca01ab5eb","added_by":"auto","created_at":"2026-02-19 11:48:26","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":368987,"visible":true,"origin":"","legend":"\u003cp\u003eA: GO enrichment analysis reveals significant terms in biological process, cellular component, and molecular function, highlighting neuronal and synaptic regulation; B: KEGG pathway analysis identifies key cancer-related pathways and neuroactive ligand-receptor interactions.\u003c/p\u003e","description":"","filename":"4.png","url":"https://assets-eu.researchsquare.com/files/rs-8679784/v1/3df014054045beee99933a8f.png"},{"id":102993541,"identity":"f0e8c6b8-871f-47c3-9c8c-093f60c28871","added_by":"auto","created_at":"2026-02-19 11:48:26","extension":"png","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":403140,"visible":true,"origin":"","legend":"\u003cp\u003eA: Performance ranking of 113 machine learning models. Top models achieved AUC \u0026gt;0.99 in training and \u0026gt;0.95 in validation; B: Volcano plot identifying five key diagnostic genes (MAP2K1, PAK1, SYNJ1, SCN2B, VIPR1) from DEG analysis; C: ROC curves validating diagnostic performance of five key genes, with all AUCs \u0026gt;0.94 (MAP2K1: 0.997; PAK1: 0.993).\u003c/p\u003e","description":"","filename":"5.png","url":"https://assets-eu.researchsquare.com/files/rs-8679784/v1/24bd5bf871268954b67e2f03.png"},{"id":103050120,"identity":"48130315-d914-457e-aed9-06abc7cc9aec","added_by":"auto","created_at":"2026-02-20 07:48:20","extension":"png","order_by":6,"title":"Figure 6","display":"","copyAsset":false,"role":"figure","size":176310,"visible":true,"origin":"","legend":"\u003cp\u003eA: Feature importance ranking. MAP2K1 contributes most (mean |SHAP value|=0.075), followed by PAK1 (0.0575); B: Bee swarm plot showing directionality of feature impacts. High expression of MAP2K1/PAK1 increases prediction scores (positive SHAP values); C: Dependence plots revealing synergistic interactions between key genes.\u003c/p\u003e","description":"","filename":"6.png","url":"https://assets-eu.researchsquare.com/files/rs-8679784/v1/6b80cf0d8e26b4a9b703ab17.png"},{"id":108805562,"identity":"fe36fb73-de6b-492a-a31a-63e86206193d","added_by":"auto","created_at":"2026-05-08 15:26:16","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":1620671,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-8679784/v1/77285b69-8964-4312-b8af-5d8ec0a6aa0c.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"A Predictive Framework Integrating Network Toxicology and Machine Learning Elucidates 6PPD-Quinone-Induced Glioblastoma Pathogenesis","fulltext":[{"header":"Introduction","content":"\u003cp\u003eGlioblastoma is the most common primary malignant brain tumor in adults, accounting for approximately 45% to 50% of all primary malignant brain tumors. Its incidence rate begins to increase after the age of 40 and peaks between the ages of 75 and 84 (Schaff \u0026amp; Mellinghoff, \u003cspan citationid=\"CR24\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). According to the classification of brain tumors by the World Health Organization (WHO), GBM is classified as grade 4, which is the most malignant type of astrocytoma. It is highly invasive, grows at an extremely fast rate, and often invents the surrounding brain tissue, making it one of the most lethal brain tumors (Pace et al., \u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e2017\u003c/span\u003e). Currently, the standard treatment methods for GBM include surgical tumor removal, radiotherapy combined with temozolomide for chemotherapy (Lan et al., \u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e2024\u003c/span\u003e). Although in the past ten years, there have been certain advancements in surgery, radiotherapy and chemotherapy, but the overall survival probability of GBM patients is still very low, with an average survival time of only 12 to 15 months (Schaff \u0026amp; Mellinghoff, \u003cspan citationid=\"CR24\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). The causes of GBM involve complex interactions among genetic alterations, epigenetic modifications, and environmental exposure. These factors jointly promote the generation and development of tumors (Esemen et al., \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e2022\u003c/span\u003e). Due to its heterogeneity, highly invasive disease progression, and resistance to existing treatment methods, GBM has always been a major clinical challenge. Although risk factors such as ionizing radiation, certain genetic syndromes and family history have now been identified, the impact of widespread environmental pollutants on the causes of GBM has not been fully understood. This is currently an area where active research is being conducted (Pouyan et al., \u003cspan citationid=\"CR23\" class=\"CitationRef\"\u003e2025\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eTire wear particles enter the aquatic environment via stormwater runoff and can release highly toxic pollutants with teratogenic and mutagenic effects. One chemical released from tire wear particles, N-(1,3-dimethylbutyl)-N\u0026prime;-phenyl-p-phenylenediamine-quinone (6PPD-quinone, 6PPD-Q), has been reported as the second most acutely toxic chemical to aquatic species ever identified (Chen et al., \u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e2024\u003c/span\u003e). Its precursor, N-(1,3-dimethylbutyl)-N\u0026prime;-phenyl-p-phenylenediamine (6PPD), is an antioxidant widely used in rubber tires and readily transforms into 6PPD-Q in the presence of ozone (Li et al., \u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e2025\u003c/span\u003e). 6PPD-Q has recently gained significant attention as an emerging environmental contaminant. It was initially identified as the causative agent behind acute mortality in coho salmon exposed to urban stormwater runoff and has since been detected in various environmental media, including airborne particulate matter, soil, and water systems (Fang et al., \u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e2024\u003c/span\u003e; Zhang et al., \u003cspan citationid=\"CR30\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). The widespread use of tires and the consequent leaching of 6PPD and its transformation products into the environment suggest potential broad human exposure pathways, particularly in urban settings. The chemical stability and lipophilicity of 6PPD-Q raise concerns about its potential for bioaccumulation and tissue-specific toxicity.\u003c/p\u003e \u003cp\u003e6PPD-Q can rapidly penetrate the blood-brain barrier within such a short period of 0.5 hours. Its structure is quite similar to that of other known neurotoxic quinones. Its molecular weight is relatively low, and it has moderate lipophilicity. All these circumstances indicate that 6 PPD-Q can indeed enter neural tissue (Huang et al., \u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e2024\u003c/span\u003e; Li et al., \u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e2024\u003c/span\u003e). Previous studies have shown that various quinone compounds can induce oxidative stress, cause mitochondrial dysfunction, and damage the DNA of nerve cells. These mechanisms are also related to carcinogenesis. However, although 6PPD-Q has these worrying characteristics, However, the possible role that 6ppD-Q may play in the development of brain tumors and in glioblastoma has not been studied at all up to now.\u003c/p\u003e \u003cp\u003ePrevious studies have shown that various quinone compounds can induce oxidative stress, cause mitochondrial dysfunction, and damage the DNA of nerve cells. These mechanisms are actually related to carcinogenesis (Greer et al., \u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e2023\u003c/span\u003e; Jana et al., \u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e2011\u003c/span\u003e). However, although 6PPD-Q has these worrying characteristics, it may play a role in the development of brain tumors. Its role in the development of glioblastoma has not been studied at all up to now.\u003c/p\u003e \u003cp\u003eThe emergence of network toxicology and computational systems biology methods has provided a very powerful framework for uncovering the complex interactions between toxins and hosts. These methods, by integrating multi-omics datasets, dynamic network modeling, and computational structure-activity analysis, can systematically identify molecular targets and pathways that are disturbed by environmental exposure (Cecarini et al., \u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2012\u003c/span\u003e). The integration of machine learning algorithms also enables us to prioritize key hub molecules and develop predictive models for toxicity assessment.\u003c/p\u003e \u003cp\u003eTo fill the key knowledge gap regarding the possible glioma effects of 6PPD-Q, our study this time adopted a comprehensive triangular system toxicology approach, which consists of three main parts. The first part is to integrate multi-omics data and identify the possible targets of 6PPD-Q. To reconstruct the network of the pathogenesis of glioblastoma. The second part is to determine the priority of key hub molecules by conducting topological and functional enrichment analysis and using machine learning. The third part is to combine the molecules, verify the target binding at the atomic level, and describe the thermodynamic characteristics of the interaction by clarifying the possible multi-scale carcinogenic network of 6PPD-Q. This article aims to identify the nodes where drugs can be used for chemophobic and precise treatment of glioblastoma, thereby addressing a key unsolved need in environmental neuro-oncology.\u003c/p\u003e"},{"header":"Materials and Methods","content":"\u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003eAcquisition and Processing of GBM Transcriptomic Data\u003c/h2\u003e \u003cp\u003eThis article systematically sorted out five independent GBM transcriptome datasets from the NCBI Comprehensive Gene expression database (GSE12657, GSE42656, GSE50161, GSE67089 and GSE86574). The selection of these datasets was based on certain specific criteria, which included sample size, requiring each dataset to have at least 20 samples, and platform consistency. It should be based on the Affymetrix or Illumina platform. Additionally, the availability of the raw data and whether the dataset contains tumor and normal control samples must also be taken into consideration. These five datasets contain a total of 342 GBM samples and 44 normal brain tissue controls. Among them, GSE12657 and GSE42656 were designated as discovery cohorts with 98 samples, while the remaining three datasets (GSE50161, GSE67089, GSE86574) were independent validation cohorts with 288 samples.\u003c/p\u003e \u003cp\u003eConsidering that the gene expression data in public databases usually contain some non-biological confounding factors, such as unknown batch effects or heterogeneity among individuals, which may blur the true disease-related signals, we first adopt the method of alternative variable analysis to reduce such interference situations. We wanted to visually assess the corrective impact of SVA on the data and explore the natural clustering of the samples based on the overall gene expression pattern. Therefore, we conducted principal component analysis before and after SVA adjustment. With the combined strategy of SVA and PCA, we effectively controlled the data quality It was also confirmed that the adjusted data could robustly reflect the transcriptomic characteristics related to the disease status.\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eDifferential Gene Expression Analysis\u003c/h3\u003e\n\u003cp\u003eBased on this purified dataset, this article uses limma software to conduct differential expression analysis on GBM tissues and normal brain controls, that is, to see what differences there are in gene expression between them. When identifying differentially expressed genes, a relatively strict threshold was adopted, requiring the FDR correction p value to be less than 0.05. Moreover, the log2FC should be greater than 0.585, where 0.585 corresponds to a 1.5 times change. Subsequently, this paper uses the ggplot2 software package to visualize the analysis results. For instance, a volcano plot is used to display the distribution and statistical magnitude of expression changes, allowing us to more intuitively observe the distribution and magnitude of gene expression changes. Heat maps are also employed to illustrate the expression patterns of different groups, thus enabling a clear understanding of the characteristics of gene expression patterns in different groups.\u003c/p\u003e\n\u003ch3\u003eWeighted Gene Co-expression Network Analysis (WGCNA)\u003c/h3\u003e\n\u003cp\u003eA scale-free co-expression network was constructed using the WGCNA package in R to identify functionally related gene modules associated with GBM pathogenesis. When using the WGCNA package to construct gene co-expression networks, quality control is first performed through sample hierarchical clustering to remove outlier samples. Subsequently, this paper relies on the hierarchical clustering of topological overlap matrices to identify co-expressed modules. Here, the parameters are set to a minimum module size of 60, and the threshold for module merging is set at 0.25. Then, this paper will analyze the correlation between modules and traits. The screening criterion is that the Pearson correlation coefficient between the characteristic genes and phenotypes of the modules should be greater than 0.5. The threshold for greater magnitude is that the p value should be less than 0.05. In this article, hub genes will be screened based on in-mode connectivity, and genes with a kME greater than 0.8 will be retained.\u003c/p\u003e\n\u003ch3\u003eAcquisition of Chemical Components and Targets of 6PPD-Q\u003c/h3\u003e\n\u003cp\u003e6PPD-Q was systematically characterized through multisource database integration to ensure comprehensive target prediction. The canonical SMILES notation [CC(C)CC(C)NC1\u0026thinsp;=\u0026thinsp;CC(=\u0026thinsp;O)C(=\u0026thinsp;CC1\u0026thinsp;=\u0026thinsp;O)NC2\u0026thinsp;=\u0026thinsp;CC\u0026thinsp;=\u0026thinsp;CC=C2] and 2D molecular structure were extracted from the PubChem database.\u003c/p\u003e \u003cp\u003eTarget prediction employed a tripartite strategy to maximize coverage and reliability:\u003c/p\u003e \u003cp\u003eChEMBL Database: We queried the ChEMBL database (\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://www.ebi.ac.uk/chembl/\u003c/span\u003e\u003cspan address=\"https://www.ebi.ac.uk/chembl/\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e) for compounds structurally similar to 6PPD-Q. Targets associated with these similar compounds were extracted and considered as potential targets for 6PPD-Q.\u003c/p\u003e \u003cp\u003eSwissTargetPrediction: The canonical SMILES of 6PPD-Q was submitted to the SwissTargetPrediction web server (\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://swisstargetprediction.ch/\u003c/span\u003e\u003cspan address=\"https://swisstargetprediction.ch/\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e), which employs a combination of 2D and 3D similarity measures to predict protein targets based on known ligand-protein interactions.\u003c/p\u003e \u003cp\u003ePharmMapper Server: The 3D structure of 6PPD-Q was energy-minimized using the MMFF94 force field and submitted to the PharmMapper server (\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://www.lilab-ecust.cn/pharmmapper/index.html\u003c/span\u003e\u003cspan address=\"https://www.lilab-ecust.cn/pharmmapper/index.html\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e) for reverse pharmacophore mapping against the human protein database. The selection criterion for potential targets is Norm Fit\u0026thinsp;\u0026gt;\u0026thinsp;0.5.\u003c/p\u003e \u003cp\u003eAll predicted targets from the three approaches were restricted to the Homo sapiens proteome, and redundant entries were removed to generate a final set of unique potential targets for 6PPD-Q.\u003c/p\u003e\n\u003ch3\u003eIdentification of 6PPD-Q-Associated Disease Targets\u003c/h3\u003e\n\u003cp\u003eIntersection analysis between the union set of 6PPD-Q-predicted targets and the GBM-related genes (from DEGs and WGCNA hub genes) was performed to identify core targets potentially involved in 6PPD-Q-induced gliomagenesis. Venn diagrams were constructed to visualize the overlap using the VennDiagram package in R. The resulting overlapping genes were considered high-priority candidates for subsequent functional characterization and machine learning analysis.\u003c/p\u003e \u003cdiv id=\"Sec8\" class=\"Section2\"\u003e \u003ch2\u003eFunctional Enrichment Analysis\u003c/h2\u003e \u003cp\u003eTo elucidate the biological functions and pathways associated with the overlapping genes, we performed comprehensive Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses using the clusterProfiler package in R. Results were visualized using bar plots, dot plots, and circos plots to illustrate the relationships between genes and enriched terms.\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eMachine Learning-Based Core Gene Screening\u003c/h3\u003e\n\u003cp\u003eThis paper aims to systematically screen the diagnostic biomarkers related to 6ppd-q in GBM. Therefore, a stratified sampling method is adopted to divide the dataset into training subsets and internal validation subsets. This is done to keep the category distribution stable. This paper constructs a comprehensive multi-algorithm ensemble machine learning prediction framework. This framework integrates ten classic machine learning algorithms, including Lasso regression, Radial Basis Function kernel Support vector machine (SVM), Random Forest (RF), Gradient boosting machine (GBM), XGBoost, linear discriminant analysis (LDA), Naive Bayes, and ensemble methods. For each algorithm, this paper has developed several models with different hyperparameter Settings. Eventually, a total of 113 prediction models were obtained. This paper relies on 5x cross-validation to optimize the hyperparameters and uses stratified sampling to separate training and validation. The performance of the models is evaluated using the area under the curve, or AUC, as a metric. Subsequently, the stacked ensemble learning method was adopted to integrate the predictions from the best-performing individual models. Those high-confidence models, that is, those achieving an AUC greater than 0.85 during both training and validation, were selected for this ensemble. Then, the features were ranked and analyzed based on their frequency of occurrence in these high-performance models. The genes with relatively high rankings have been identified as the core genes that may be involved in the formation of glioma induced by 6ppd-q.\u003c/p\u003e\n\u003ch3\u003eModel Interpretation Using SHAP Analysis\u003c/h3\u003e\n\u003cp\u003eThis study aims to clarify the decision-making mechanism of the optimal prediction model and identify key features. Therefore, a SHAP framework based on R language was used to conduct an interpretability analysis of the model. We utilized the SHAP package to calculate the SHAP value of each feature in every sample from the trained optimal classification model. This value can quantitatively describe the magnitude and direction of the contribution of features to the model output.\u003c/p\u003e \u003cdiv id=\"Sec11\" class=\"Section2\"\u003e \u003ch2\u003eMolecular Docking Analysis\u003c/h2\u003e \u003cp\u003eTo validate the interaction between the key target and 6PPD-Q at the atomic level, we performed molecular docking simulations. To validate the interaction between the key target and 6PPD-Q at the atomic level, we performed molecular docking simulations. The three-dimensional crystal structure of the core target protein was obtained from the Protein Data Bank (PDB) and downloaded in PDB format. Molecular docking was conducted using the online platform CB-Dock2, which can automatically identify potential binding pockets of the protein and perform rapid docking. The docking results were evaluated based on the binding free energy predicted by the Vina scoring function, and the optimal conformation with the highest score was selected for subsequent analysis.\u003c/p\u003e \u003c/div\u003e"},{"header":"Results","content":"\u003cdiv id=\"Sec13\" class=\"Section2\"\u003e \u003ch2\u003eIdentification of GBM-Related Target Genes\u003c/h2\u003e \u003cp\u003eMultiple transcriptome datasets were successfully integrated, laying a foundation for the accurate identification of GBM-related genes. The comprehensive batch effect correction strategy we adopted effectively addressed the technical differences among different datasets, as evidenced by the significant improvement in data distribution consistency (Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003eA-B). Principal component analysis revealed that before correction, The two datasets, GSE12657 and GSE42656, are clearly separated from each other, and there are a large number of batch effects (Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003eC). After the coordinated pipeline was achieved, the samples of these two datasets presented an overlapping distribution in the dimensionality reduction space, which confirmed the success of the integration (Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003eD). Such strict preprocessing of the data laid a reliable foundation for the subsequent differential expression and co-expression analysis.\u003c/p\u003e \u003cp\u003eDifferential expression analysis was conducted on GBM tissues and the normal brain control group. A total of 1,660 significantly differentially expressed genes that met our strict criteria were identified, with fdr correction P\u0026thinsp;\u0026lt;\u0026thinsp;0.05 and log2FC\u0026thinsp;\u0026gt;\u0026thinsp;0.585. We visualized the analysis results using a volcano map (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eA). The heat map shows the different expression patterns of these DEGs. It can be seen that there is a significant separation between GBM and the control group, and it also highlights the molecular heterogeneity of glioblastoma (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eB).\u003c/p\u003e \u003cp\u003eBased on the results of WGCNA, we identified the key gene modules significantly associated with 6PPD-Q exposure. The module-trait relationship analysis showed that the positive correlation between the turquoise module and the black module and the treated trait was the strongest. Their correlation coefficients were 0.73 and 0.71 respectively, and the p values were both less than 1e-16. Under the exposure of 6PPD-Q, the co-expression changes of genes in these two modules were the most prominent (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eC-D). An internal analysis of the modules revealed that among these key modules, there was a high degree of consistency between the module membership degree of the treated traits and the gene magnitude. The cor of the turquoise module was 0.73, with a p value of 2.5e-165, and the cor of the black module was 0.76, with a p value of 4e-22. This means that in the turquoise and black modules, those genes located at the center of the network, that is, with stronger connectivity, have a closer association between their expression levels and 6PPD-Q exposure. These findings verify that these modules are biologically reliable and can be given priority when exploring the toxicological mechanisms of the network in the future.\u003c/p\u003e \u003cp\u003eIntegration of the 1,660 DEGs from conventional differential expression analysis and the 989 turquoise module genes from WGCNA yielded a final, high-confidence set of 1,885 unique GBM-related genes after removing duplicates (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eE). This integrated approach enhanced the reliability of GBM-associated gene identification by combining both differential expression and functional co-expression criteria.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec14\" class=\"Section2\"\u003e \u003ch2\u003eIdentification of 6PPD-Q Target Proteins and Associated Disease Targets\u003c/h2\u003e \u003cp\u003eThe canonical chemical structure of 6PPD-Q was obtained from the PubChem database (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eA). Subsequently, target prediction was performed using the ChEMBL, PharmMapper, and SwissTargetPrediction platforms. After data integration and removal of redundant entries, a total of 2,623 unique potential targets for 6PPD-Q were identified (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eB). The distribution of these targets can be visualized using a Venn diagram. This relatively comprehensive target prediction result shows that 6PPD-Q has the potential to interact with multiple proteins, which is consistent with its structural complexity. After conducting a key cross-analysis of the 2,623 predicted targets of 6PPD-Q and 1,885 GBM-related genes, 225 overlapping genes were found (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eC). These 225 overlapping genes form a set of core potential molecular targets. 6PPD-Q May participate in the formation process of glioma through these targets, so they were selected for subsequent functional characterization and machine learning analysis.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec15\" class=\"Section2\"\u003e \u003ch2\u003eFunctional Enrichment Analysis of 6PPD-Q-Associated Targets in GBM\u003c/h2\u003e \u003cp\u003eTo systematically elucidate the biological functions of the potential core targets of 6PPD-Q, we performed Gene Ontology (GO) enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. The integrated results clearly demonstrate that the toxic mechanisms of 6PPD-Q are highly concentrated in nervous system functions and cancer-related pathways.\u003c/p\u003e \u003cp\u003eAt the biological function level, GO analysis revealed that the core targets were significantly enriched in key neurological processes such as \"regulation of membrane potential\", \"learning or memory\", \"gliogenesis\", \"cognition\", \"regulation of synaptic plasticity\" and \"potassium ion transport\". In terms of cellular components, the targets were mainly located at key neuronal structures, such as the \"synaptic membrane\", \"ion channel complexes\", and \"postsynaptic specialization\" (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003eA). These findings together suggest that 6PPD-Q May impair the electrophysiological signals and synaptic functions of neurons, thereby affecting higher-order neural activities.\u003c/p\u003e \u003cp\u003eAt the level of signaling pathways, KEGG analysis provided more detailed insights into the mechanism. In typical neural signaling pathways such as \"glutamatergic synapses\", \"cholinergic synapses\", and \"retrograde endogenous cannabinoid signaling\", there was a very prominent enrichment of the targets, which strongly proved the synaptic functional terms discovered in the graphene oxide analysis. A coherent chain of evidence has been formed, ranging from molecular functions to signaling pathways. The analysis also revealed that multiple pathways closely related to tumorigenesis were highly enriched, including the \"PI3K-Akt signaling pathway\", \"focal adhesion\", \"Ras signaling pathway\", and various solid cancer pathways (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003eB). This finding links the neurotoxic effects of 6PPD-Q with potential carcinogenic risks. Its toxicity in the nervous system may be related to the activation or disruption of key oncogenic signaling networks.\u003c/p\u003e \u003cp\u003eBased on the comprehensive GO and KEGG enrichment analysis, the neurotoxicity of 6PPD-Q is derived from the interaction among multiple processes and pathways, mainly involving the disruption of neuronal synaptic transmission and plasticity, as well as the possible abnormal activation of cancer signaling pathways related to the pathogenesis of glioblastoma. This provides a clear direction and a crucial theoretical basis for studying the specific toxicological mechanism of 6PPD-Q.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec16\" class=\"Section2\"\u003e \u003ch2\u003eMachine Learning-Based Identification of Core Genes in 6PPD-Q-Induced Gliomagenesis\u003c/h2\u003e \u003cp\u003eThis paper aims to establish a reliable diagnostic model for glioblastoma induced by 6ppd-q and identify the most critical molecular targets. Therefore, a comprehensive machine learning framework was implemented, which integrates multiple algorithms and integration strategies. Based on the expression profiles of 225 overlapping genes, this paper employed 10 classic machine learning algorithms. A total of 113 prediction models were established.\u003c/p\u003e \u003cp\u003eModel Performance Evaluation demonstrated exceptional predictive capability across both training and validation cohorts (Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003eA). The top-performing models included: RF+NaiveBayes (AUC\u0026thinsp;=\u0026thinsp;0.997 training, 0.995 validation, combined score\u0026thinsp;=\u0026thinsp;0.957), RF+Ridge (AUC\u0026thinsp;=\u0026thinsp;0.999 training, 0.995 validation, combined score\u0026thinsp;=\u0026thinsp;0.953), XGBoost (AUC\u0026thinsp;=\u0026thinsp;1.000 training, 0.986 validation, combined score\u0026thinsp;=\u0026thinsp;0.946), Ridge (AUC\u0026thinsp;=\u0026thinsp;0.998 training, 0.997 validation, combined score\u0026thinsp;=\u0026thinsp;0.942) and RF+Enet[alpha\u0026thinsp;=\u0026thinsp;0.4] (AUC\u0026thinsp;=\u0026thinsp;0.999 training, 0.993 validation, combined score\u0026thinsp;=\u0026thinsp;0.942). The integrated model performed particularly well in the independent validation cohort (GSE50161, GSE67089 and GSE86574), with an average AUC of 0.934, an accuracy rate of 0.892, and an f1 score of 0.885. This confirms that the diagnostic framework is relatively robust.\u003c/p\u003e \u003cp\u003eThrough core gene identification and feature criticality analysis, it can be known that there are five key genes with the highest diagnostic value. They are SCN2B, VIPR1, PAK1, MAP2K1 and SYNJ1 respectively (Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003eB). The volcano plot clearly shows that there are significant differences in the expression of these genes between the treatment group and the control group.\u003c/p\u003e \u003cp\u003eTo assess the diagnostic potential of these five key genes, we conducted a receiver operating characteristic curve analysis. The analysis results indicated that all genes demonstrated strong discriminative power, with the area under the curve values all exceeding 0.94. It should be mentioned here that the prediction accuracy demonstrated by MAP2K1 and PAK1 is particularly outstanding (Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003eC). This indicates that the screened key genes can very reliably distinguish 6PPD-Q exposed samples from control samples, demonstrating strong potential as biomarkers.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec17\" class=\"Section2\"\u003e \u003ch2\u003eModel Interpretation Using SHAP Analysis\u003c/h2\u003e \u003cp\u003eBased on SHAP model interpretability analysis, we further revealed the contribution levels and functional patterns of five key genes in predicting 6PPD-Q exposure. SHAP value analysis clearly quantified the importance of each feature, with MAP2K1 contributing most significantly to the model output (mean |SHAP value| = 0.075), followed by PAK1 (0.0575) and SYNJ1 (0.0442), while SCN2B and VIPR1 also showed substantial influence (Fig.\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003eA).\u003c/p\u003e \u003cp\u003eA bee swarm plot detailed the distribution of SHAP values for each gene and its relationship with gene expression levels. It should be noted that if MAP2K1 and PAK1 have relatively high expression, it is easier to increase the prediction score of the model. This indicates that they are upregulated in 6PPD-Q exposed samples and play a crucial positive role in promoting the classification results. In other words, their high expression has a very crucial positive effect on the classification results. In contrast to this situation, SYNJ1 and SCN2B exhibit more complex patterns. Their low expression states and high expression states respectively have positive and negative impacts on model predictions, which implies that there may be more complex regulatory mechanisms, as shown in Fig.\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003eB.\u003c/p\u003e \u003cp\u003eIn addition, the SHAP-dependent plot reveals possible biological interactions among key genes. The expression levels of MAP2K1 and PAK1 tend to change in a coordinated way. When both genes show high expression, their positive combined effect on model prediction is the strongest. Similarly, there is also a very prominent covariant relationship between MAP2K1 and SYNJ1. These genes may be jointly involved in and regulate the same signaling pathways or cellular processes in the neurotoxic mechanism of 6PPD-Q, as can be seen from Fig.\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003eC.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eOverall, SHAP analysis confirmed from the perspective of machine learning that MAP2K1, PAK1, SYNJ1, SCN2B and VIPR1 are the core genes that respond to 6PPD-Q exposure. It also explained their respective directional effects and potential functional synergies. Such analysis results provide clear priority goals for the subsequent experimental verification and mechanism research, and also offer theoretical basis.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec18\" class=\"Section2\"\u003e \u003ch2\u003eMolecular Docking Validation of 6PPD-Q-Core Gene Interactions\u003c/h2\u003e \u003cp\u003eIn this paper, to verify the possible direct interaction between 6PPD-Q and the identified core genes, molecular docking simulation was carried out. The results showed that 6PPD-Q had relatively strong binding affinity with the five target proteins, and the binding energy was always lower than \u0026minus;\u0026thinsp;5.0 kcal/mol (Table\u0026nbsp;1). The molecular interaction was stable and the specificity was also very strong.\u003c/p\u003e \u003cp\u003e \u003cp\u003e\u003cimg width=\"272\" height=\"144\" src=\"data:image/emf;base64,R0lGODlhmAHYAHcAMSH+GlNvZnR3YXJlOiBNaWNyb3NvZnQgT2ZmaWNlACH5BAEAAAAALAAABwCYAdAAhgAAAAAAAB0AABwcHAAAHQAdHR0dAB0AHR4AHh0dHR4eAAAdMgAAMx0dNAAcSB0dSB00NAAyMh0zWh4zRwAzWgAyWh0yWh1IWx1JSRxIWh1GbDMAADQdHTIdADMdADQdNDIyADQ0HTQ0NDMzSDMzWzVIWzNGRjNdXTNbWzNGbjNZfzVZbDVbbjNGbDNbgEgdAEgcAEgdHUczHlszHUg0HlszAFozHVozAEg0AEgzM0g0NEYzRltIHVtINUhIM0ZGM0ZdXVtbW11dRkhbSEZGRkZZRltISEhIW0ZGWUZbW0ZZbEhbbl1dblluf1lubkhuf1Vqe0Rqe113d11/f0RublVqakRqamxGHW5IHW5bNW5GM25INW5bSG5dXW5Zbn9uSH9/XX9uWW5uWW5/WW5uRGpqamZ3d2ZmZmZmf3d3Zm5ugG6AbmaIiIBbM4BuboiIZgECAwECAwECAwECAwECAwECAwECAwECAwECAwECAwECAwECAwECAwECAwECAwECAwf/gACCg4SFhoeIiYqLjI2Oj5CRkpOUlZaXmJmam5ydIgCfoaAAKYtfAi6LAplfkzmjn4anqZcbtACrnbq7s7uMWQIBi1wAAwMlhT2wAE7Jy82GoiOZPsYAEL6aMsbGAQHcA425iuOVPAEEkhsLIiIDwQ6FHJkbhPPZ+Jj3+YcCF4szCEwAIENAvEEvCoDiYCDAAkIJFzZ8SGgDAW4NHVZCF6qDRn6YYEhCpepSBwBCDjyJVI8QFwEUdNkCSbPmJgaL2qQjJEAFQkMbcAp6AVSooA0aCokBIEzSzkIvCPi0KQlGTEjlEGWNJFLQAXVJCV0xuqkl1bNoG41d9CIsohdX/3mGhWtIgFuzhWo0ffTU0AuyaRd1fdSLHCuVLA/11TQzsOPHggAbQsD2ENGhliu6NXSwEd5+kAXHJeTR2+JVwLztHUSSp+qBjr6y3AxgLWtCsoN5oy1IdwAJAPCW+6rFtyEbqgmU+qz1Ne7eqnnjcj5ItjdCyE0f5ZnMOCEPycWp3ntl/OpCkqGOHnqVrvpBzAktVjQfaufQhgZXLNRXwCcqg3RAVlZ9bXCfIohBwtwGo7UGwFcNDILcLYIsFkFjuOD2QyHnLYYBhog8RYaB1QEQoSAT8jfIiJ1BWMgNBLAwSAzB8XQLjEvA9xMh6RFyXlY9DtJZLFAdYtZlheCFVP8iWyUSpCBYPAmZVYxQ2Ztk5WRZV2zqwAbADwIAxx1rgOmXYSEwrJZlmXFtlWZJhYwjAJtj1nnAiXUO8uZtfI4ZH5OgCWIbIjXURgA2Hm3mno8IrcfUfoksyUgNByqGHyJmInLDIKc4CkBn42yaXyNfJPjIBuZRxKmDTXZ2xSFtBHDLOKfQJioAN0gWK4UvHjJYq4PcSsivh6S3V2GfHhLTn4cMWkhnUipECBYBTAUAkoL0ANOOg2g7mqRGPlJpITpFcWl+nnLWJ3p9WknIDeMaIpuChnSg6pm9Faunp1quO4hQ7uaJ7iEHbTVowELmi6ZWdSKMazww1GdZukY9WcP/fIu+YN5iGo83H7PBSedXvPKdOyoi5tmor8IApGmeN1J6tRK9J/c268otI4JhlrzalulRvA77Msws86jn0AEI1aSnwt2CtFHe4PmWk/8ukl5n2Ga2CLhbCrJFMWBLnawiUZqMpqO/vQsqIkaN4/Aj854qXdsC12bU28LVDQDATCvytr9272vY2eG26wgHAZC8qFhCOWvIFZWWzZ4iixd+CBZ9iUIklBITQnJoP693K7JGs/yzI6XOfGqzQiHb5N2HuL5qzwDH7mDNDBsC++CF8Dv7IOkGqjXOUl7N7dRbizy2OFubTTikw4LqKLys3RJzInE7wnXVRd8euCC5HqKm/94Vi28174M4++RW4RdSXp7XT872xNwzaa0gNcRV+XuKMHveIjUI0sWcNyygrEdNWLqZIGq1sEaYSnvCw4UC2bVA76VpghJ0X9to45GgyaWB+Cpdb2hDrK5xSGVFG0RSHlAkJtEmVtaSUqHiZK2s+YUR2xPE/0whgHsNglrKww/CYESIBoCgRSbiyX3KcYMA/EMQAuKSIXYIHy/pcIl1cxaMZATFGq3rdcFy4iDmATJciRGKdNPgu84IgCimEHyrIaPAVuPGMA2CB52rTdqyZUdBxcwgrLnP/iASvKNYsSBUZIQAFgOeIArxENnRyKsWiBjfiMlfxVFNTpA2iF1Fyv9jYpOd7rqjneBMkHTf46M3pAWiQ2TyOuQrxCt/xKRV5sgsqPSNl1qRHC4mghjJWQEFFwEeb1jxWpXZWsqQYQnfJJKA0IymLprkPGpK85rDxKY2t1mJNsTPJjvAGTfH2ZNxmvOcdbOL2balN3RGc4DujOc2gxGOhlzyXG0IRjEaIk9udqxa/QwoNDcHzVcI9KAbQZpCF8rQhjr0oRCNqEQnStGKWvSiGM2oRjfK0Y52lCkeDalIR0rSkpr0pChNqUpHitCWuvSlMI2pTGdK05ra9KY4zalOd8rTnvr0p0ANqlCHStSiGvWoSE2qUvupDEBBpoxokcZSIUGCRTAzhFP/5Ufw9uEYqFKlHqEYADpKkdVElMupAGiFB8vaidOdy6s22R5yLnVPboIsj2z1RSFDA9eaxMetZ9lrNIMk2Lxy4m+X6itNcri30AjLnFS0pmH1ilYUhceUnALpWDlVyej0QzXAUSxI4uMsbWnyOOMhJSxLlBpv3CtlhGitNy6ZOpCudbSsI8QprEWZWRaCl6ZpATEvO4gZpDa2zrRAIShTnh9Rh5uAdRARZ/ShisTFBn1xkWV51RcYXMhkOaQe+Ap0n+nqMFgF0pDU3CiIt7ERGEgUW1rSs55yIOAHeDKudYubxxfcxyjwPO8gApCBQcCXEAgogtgCHJzPETB4tLKg/6wG8QEQnqJMcTLfWzWcVu+Nw5Ol8zCZTCgIR6kzTmG5E37KKbDdjrh3FG6hzu7XSYCGmDcnBgBlSNwbGksTseUw1iLWpi5BjeuZaNmAl4zgkWcNTBDxEu+wrsJOtelJnHoq2KWyFkAS5xh/nyMZs0iWHv+yZj2UuuE4IXxlWXjvXb9jHCF899aL4IIAGAAhnNubiJ/dqknOWo9bccJA/JBley5+Y2MJtYj62PB4w1wawaDLO7zJwzxEHubpRMsPcLnpae1Nl8teprT5hfp5o8TqY1gMgPr0S85jxLQiaBA1Qr6FYqyh8T/N8038sJnPRnJarypIuzlbDj+fWQxgc/+GKTiRiyyCploG8YOkGf6W1WAU8CCsrQghPAozyMOZpOUJZMGZ0GGZFuHbJJvkzcDgPohtmagLSc3dTTm3qn6MUBj76vQJZX8OHtMgr+Upyan60ef8Na4kUx5abG8wqFQfh5FNrgn7URGPhXXuhg08NSqrw7dNcirqk+h875s2CE9ETLhdCIPDWNG9JmB0MQjFvTAncYBb9Bv3lFhbDSg/bjmPeCU7JxI7rEl7YfdZCiXlDAPONjcnGQshMuC8aHmKPPHxl5E5zngEAQBf/3oKz3MCDD0lCwAAQZz9/a4YBciLiT32t+tXoTsGIOiDWArd8ASM9Qg5T29+DAEMIJ3/kme71QZOOzkuiceSCeIceBdYk/RSYEF4IOZTGhrgdMNKDrkWyjkHjGnRsZ23Sse5xhSfacjKmueeyZL18saBSkP6dnY1SP2++IA/v7xDFLP2PgpuIX4/H2puwTs4ReVksbL85u8inKl2fiSULv3qM6JTTrf+I6iv/e5/1hj89L4juC/+8gviFUAwfyMMqv6YrvT98I+//OdP//rbn6Htz7/+98///vv//wAYgAI4gARYgAZ4gAiYgAq4gALIAdZQCKEgAldFCMrQDgRFCBYIANOACdyADSZzgdEwgRCYCBEogoSwDeFgCKxHCBtYCC3IUxFYgolAAw+oU9g1EAXR/xkwUACfwBA7pBCfMBFJYmcDkBE+5AgyQHqf4BFHGBiosleJ9IS+woMA4IOgcQRVeHdtdkJThFczRQNgEw7hkEirgARZ6EgudRrmBh9kgTBBoRmxxxVuhyZeuFgDsD4DEC8M8APp4YaHNhpX8BQIsxX28iBGxVgMMi11iFC04QVbKDDrUWhwh3WOEIiqE31dNYkapHaIsCy+IjyWmCSPqImC8A+1RVTjgjnmAj01lT2otoY/ox/M8m7aQ29o2GkP02xNpyO5+Iqn1mdX4S4w8CSu+FPkR3A25U1+84nGZghxwSyB6Ah4tWyLlVbFiHgZ9z06gTujCBRbaF6xc408xf9yOIRTQ/JkcPiLrMhYimYI7aMImCcTSeEw4lUp7sIcDsOO2kYl77MI4rhTxzh3NLUpgYgoWghsXNhxhABipJiOQ5YTBHCJSUZ3QIMrEdR7zJaQiKCDABArjnSKQRVwTmZTNyAtPkILgzgamVJlDcmLghFwi4gPZrE9QrGLzrKN98YThdSGBRAMMfOPOSVa8ehO4IiQo1ZKnpccK9iSR3GLDoaTmQg+IJRxD4cmG9NoVpkU7FYqQqWMfKFTxiNvopE8lQUOA4Anu4hl7TYI2eU5HNIAmmMAbVhYLTMf7oJkQBlQZilfCskIUaFTk+Rx1AgAewWNETkImqMiGMdXL7f/PI/VRGJYDEVocRnJCH2YTb/1QC+VmDrzCEPZT2EZb5XZmZgikrgQRProVxSEF033J8JYWBITMCxZCHlpUzvGCDHZUqh0K4NZmImAZO/yJLmZDZ9RD14plY5nKQhJH33mPgeZmUJ1PcPZUrvoNnslmowFnHHShALZVbThADiwHjpIMicmmpI1RBJTmzSVZs7WU+PwEhzJCL4ZdgShTyMhEIPABU1GcQsJfMgJconAm4sAIycAdgYqdqNZeoPwdT1wAMiAoDzlKXtxMShwoAaaU7RnRaJJjagyHsdQCTbgTHzphNzYizoXIuroK0hzEO61ey9Dly81SDa2a+Nhmgb4/44MmKNLFZA62qM6NZ0+GqQ0NWpCWqRBBaFGmqRKuqRM2qRO+qRQGqVSOqVUWqVWeqVYmqVauqVc2qVeCqUOmILd8gyHIAomCACvIApIKgk/YA0eeE4VOArQQIKJsJSC8IKDgKdIFaecmQjGcEw3JRBfB0jtBYQ+6EM72A6HOoQYwXuSwBGgwIToxIMisKg6EwCe8jM75J9JRYNmKVapGAAD8XXa2VJquJxHARjr8YYuSRrPKY2+QgAh5zyZwqrPxoedmH2E8BDqeVTxIYl1d1OAdXRucXQOuUaQ0DnDOE7E6o6eiI4NhiZP1KtFhQV6CBS3GFAgWaKj6SnP2JwQhP8+2BSLljMucbGs2LN8JTctkpFyL/UkDiOMzKigyombLwldjoIwtgGV34hX1IqK0nasMhWYG2ICetaqhImtCKtbgYeJ+KZNPxMf3+qMejSr/ypUkrU9QOpOMXEoIrCfKaptYqmR+uhVaWkIG5sWDvM/THQgVPKqiHCxQGWtqgAbMpCy51SSU1d1qOotjZlCIJOaCaMIOiGR0WQmPqtxHXmYhNCTScMIMvtTQqstg1eqB1WUI3uUnNoyVymKkdIItIiV3LQxdvpwKqmFShe1PiUlZrZtn2lOOCoo3fhx/ZMIyBKZYgp6iuA4P9YIwhCXqsqz6SpUeJsI5Nhy7WqOK7P/oYWpPIsRgS0YjQ+5TaIJmWYZfuqIrjFLVJCrM0E0SDYqT8eZPnNLsXXbbOPXPJRGltwYF9i3uXl1m+HGrjfFPsF4neWIMrMaLEGCtT+2V50jiFMkMmrLU2zrKOxpUwfiNo3gm4ZwDjB6Ck1YHtl6KcvmVpIoOulpWA6GOd6IU4SKCzp4nYV5TMFQV+KwGIliTvEmWeKVj5XSoGeKVIcrkC9wL5eXU8B1Hr2pTB56Cc7kTssmJU8hoS5qHjDqUwZMCLR2Wkr6tl8awZrQsBJcwfnguxacwdkwarurwR78wSAcwiI8wiRcwiZ8wiicwiq8wizcwi78wjCcVPOQtwDA/6d0+glnmqajsKaRsCHh8KbXRAzhAKiDEAJgkyOEwH6IYKdgA8RougxIlZh6WgjhML80tRM5qCeUaqla3INCWBEXAX6OCglJSAAWKKnS1EQDwWT/w4Q96J+oAq48QRGIUxHs4A7wUFT3G4OJQC0R0gMKAMHndKoJaqslliSHdgjrm6yx2sFpwZDBmqpFoSOd8y3r8RSf8RIJXFO3CKwJK6x+Y2LFGkF/0kSOjHiYIsj4UCl4ITEntgEXELav+DFhsT1861Puipk6dlNcKccF1IyNOYurI67nko2s+LWnSbdyHB8461Ib4GPDQ3U3Ba/5eq7VDB+OC6t+E7pU8STauf8kkvvL/VPLtHHLPQVX+zNwLkWwAGCw4nzMjgIuzKJ8aulxZvO69fy9CfrJv7mwiThUzwwAXcAMifBouYxQN9CxEPCxq7Gy7yy4TBnRwQKT0bS1C+cZbpEV8SbLSjYIYIK+P0UZ6BAMEpO88MHNRFkAO7uP3bEe9eW1ntsIT9nMILFKI6m6FBk8ONqhqsGdPXUKB1IfMKSRV/xkWqscVukxCkua9yq214Q4BxG3MQ0f5xqgi7E9hahU/5wXfcEAR0SSDFe6Pyt3DHsLheuWe6vKNbEX1/MZDJmpTLuw+yxTZw1AiTB6A1G/MDUufCMYuYuigkAMGfiCeHWy0RQP/Br/KT4Xsi2THoyl1tgk2O2ggY0QP+rcUqMrt6iaq6c7r4qE09r0Z55hOdprtA1pzkKFBcNsU+PGuKzrnJDQuzRNnLwhXlLdlLiNdY8Vzkutyz1VVaZrr4rbMM0L2nYXvT3ULDBbzJKBS7MZGXINeq67dcfcjjt1VnrjcocMvqAyvn69NaMqCIgE0qqAnwFitYHRQ0jcRoDxEQCgnwUiMlI4Xv3jJd6Gc0P1l4EE0euAEJA9Tg682cp8qf/bTMeFTbSXSOMhNtCIqUejeUcBSkfVwKslsgDwez6to7cdwyZMzxxewhj84STMwSJe4iZ+4iie4iq+4ize4i7+4jAe4zI+/+M0XuM2fuPjFKbRAAoSeAgVONmIMNiZ0IHrrU06LglKfAhL2Q5TXFae2ghPvlM3WJ86SIVWaAhACABfDB9EaISTUMajgMbYJAzpt8jdidZxjLqDoAV39rwWTVTC8HVJqDxleIY5Rch+mJNsKLBthN51Gatj7j5dzWEbMACx6Snqy6tKtRUSs9UA4L041YhzmxWRmGP+8wihaGrQlI3OTRtK8g/j8jNFu4XFO1Pa3ZCqCNOtGMroSK7R3TKm6eh1Ub1oYdhslz7plcy+GMm0qVTtSpeXjVCZfbCo6q2vztvMuYzRhNqtyjXgbJewsr1INex9xWnxdI67Di75yOc55/+OMfPfu/DcKAs+gGEWsszYqCwvWj3ujuDn8USQh9Lno0yJfSkIDBm0TgmRpn0u6EDeHbcVnkbsS2vapU5TqyDnUiEuN1WSU4SS9aWSdfEtnb3Nwi1NSRgAJomyBsDK2fQ20r7oVatcjgDuaQzt7dW1Sbl6qt7bijDT3ATVh2DKLL/Vov7xSHUg0gmWkzaywPjaJLaXE22Z45QFy32iC7sXHg+7PQX0ItQolDPb4xSYsNa/Ey/oqtOnvB7zOfuwAts+NU/wP4X1XNdy57NToYm7xh0944eatI4fJ+vYhZP0h1DwMTVIQQL15rSbYg3MTE3vjLDhWW828ZGNt5ydFvn/OD1C9zD1aHqN9zm7vGuIKY07ceLAne6OFs09N2QNPvGQr/Gi+DD1MXTmU+8ZvlT/tfSJSJOwSF6in5d/FtKbn0ave0ki33uBoOdBn1/xdXNKVD4ENdtGABVKnzw8Uxmq52r+SR5qxYwQogvOTQlOxNCtz+ITjEhp75yEVOOx0ntBo6qB0gII+DhupTw6/k3q+Oafo0Sa/lpa/Oz//vAf//I///Rf//Z///if//q///yPCGEqgoAgAiBYAmB4CNAjIiiIiMgoMuI4SUk5cAlRqbnJ2el5SMJZaJhy2DO4qMkIIPnp+gobKyu7uEpIWTuoOzrb6/trQzABICPgcAhT/yDIYRBAWaDbvOC4QXA50BwwHSsTQKDbof07/tpG8KQpgOwILShNrSsy4F2KyONNnq+/D0BzCTAAIABnk/wFPDiQn0J+5hwJcGEIBgVqDBDBmLSh4qENlMLBgvFtEsiFJDlWuhISgERkExFldKnBkQ2ChjpMO0Ayp85XG2J66rkzKCyfh7ysm6QOUctDXwQQNUmJZieUmzQKHWd10tKLSpE+pcRV5SGcV8sKPfYJrdm1jshWWukIrsqlLDcSjauW0wa6XtnOkoooacStYGFSymrIrd/F4wR3csy4bBvEcQt31bpRE0p0nVK+jQwLKqIrWcMa4ns68yTSkxSDfv2phv8r2bDNqm1kETXQiJREi0bq6QblScNrExdJV65c1YZ2I7qRN7Hx6ZsgV6ce9AYAlJnCETV9CDDfNgEgNt/kfFP0SQ2xU2pq/hBk8OPLM0cEWLr7/QDWa/LPXz4FPDAaTcsBMB9dPQhAV3rwdAIDgIh4FuBczyGmnCMLNjiMIUZ4RIlrFb72G3ojKnTDOZVFFECLLVJoiIsv1nMfRnf9x4k5nJ2YlYPgyUgPNdYgSAAGmoh44mKTfbJkkvoIRwlaB4rUSYmBHaLFNZc0cAh0nRS3n2MwKocaNURZhwiSTuaTpZZcUlJDmXDKuWYsV1DCmlid0HneSZ7Z4giMjmi3Zlj/hK54CJ8A/CaofnUqBGglVkr6aD7DSanolGa+JaFDN266pkYOWsibXndFWImalQaFgCutruoLfIMexYmikyb0CZSaNBrgBhAJSp9ejpAXX5qwsgXmccf24mVg5mlqUZWVBKCoQ9tMkt+JN1DQbFxb2Xqjrm0tW1YNnSJiLrm9CMaFMbRuYisAQchLjADZPibMIVzgWikBBnyqZ6LCYnTIvPPiNK8T6upUZrbVLlxJOC12OBgn0G4E5AC8xGKDvS6+uepLYBFmalQxAtkixCS9MO0kLKsMcyXixkxzzTbffCXOOu/MM6y89gx00EIvBoOLQx+NdNJBzat0004/DXXUI1Lrg3LVVl+NddZab811115/DXbYYo9Ndtlmn4122mp7DUAgADs=\" alt=\"image\"\u003e\u003c/p\u003e\n\u003cp\u003e\u003cbr\u003e\u003c/p\u003e \u003c/p\u003e \u003cp\u003eAccording to the standards already established in molecular docking research, if the binding energy is less than 0 kilocalories per mole, it indicates that the substance has the ability to bind spontaneously. If the binding energy is less than \u0026minus;\u0026thinsp;5.0 kilocalories per mole, it indicates that the binding affinity is relatively good. If the binding energy is less than \u0026minus;\u0026thinsp;7.0 kilocalories per mole, it indicates that the binding affinity is very strong. The binding energies of these five complexes all exceeded the threshold that could achieve strong binding affinity. Among them, MAP2K1 demonstrated the most favorable interaction.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e"},{"header":"Discussion","content":"\u003cp\u003eThis research is pioneering. It systematically interprets the possible molecular mechanisms that link the emerging environmental pollutant 6PPD-Q with the pathogenesis of glioblastoma (Li et al., \u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e2025\u003c/span\u003e). By adopting an integrated computational strategy, we have coordinated multi-omics data, advanced machine learning, and structural biology simulations. A very notable interaction network has been discovered. Our research results show that 6PPD-Q may promote glioma formation by targeting a specific group of genes, which are particularly crucial for neural function and oncogenic signal transduction. The most important aspect of our research is identifying 225 highly reliable targets that can link 6PPD-Q exposure to GBM. Then, relying on a powerful machine learning framework, a detailed analysis was conducted on the five core diagnostic genes. SHAP interpretability analysis was used to explain their functional roles and mutual interactions. Additionally, molecular docking was employed to structionally verify the binding ability of 6PPD-Q to these targets (Gao et al., \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e2025\u003c/span\u003e; Liao et al., \u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e2024\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eOur work has a key and novel discovery: at the unique intersection of neuro-specific processes and mature oncogenic pathways, there are clearly many potential 6PPD-Q targets (Wang et al., \u003cspan citationid=\"CR28\" class=\"CitationRef\"\u003e2022\u003c/span\u003e; Yu et al., \u003cspan citationid=\"CR29\" class=\"CitationRef\"\u003e2025\u003c/span\u003e). After a comprehensive analysis of GO and KEGG, it is unanimously indicated that there is a dual attack mechanism: 6PPD-Q seems to disrupt fundamental neurophysiological processes, including regulating membrane potential, synaptic plasticity, glial formation and ion channel activity, while also interfering with key cancer driver pathways such as PI3K-Akt, Ras and Focal Adhesion (Liao et al., \u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e2024\u003c/span\u003e). This situation is crucial. It speculates that the neurotoxicity of 6PPD-Q is not only a parallel effect but may also be intrinsically linked to its carcinogenic potential. After the core neuron signals and homeostasis are disrupted, it may create a loose cellular environment or cause initial damage. When combined with the activation of proliferation and survival pathways, this will prompt the transformation of glial cells (Cecarini et al., \u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2012\u003c/span\u003e). This provides a computationally derived mechanism hypothesis for the sequence \"environmental exposure - neurodysfunction - glioma formation\", which goes beyond the known links between quinine drugs and general oxidative stress as well as DNA damage by accurately identifying specific downstream pathways (Guo et al., \u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e2025\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eBy comprehensively applying the machine learning framework, which includes 113 different models, it has been proven that this approach can eliminate the complexity of transcriptomic data, identify powerful biomarkers, and the five core genes verified in independent cohorts have excellent diagnostic performance, with all AUCs greater than 0.94 (Guo et al., \u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e2025\u003c/span\u003e). This highlights their possible role as biomarkers for 6PPD-Q exposure or early detection of related pathological changes. What is even more worth mentioning is that the SHAP interpretability analysis has transformed these biomarkers from mere statistical associations into functional suspects with contextual implications. After quantifying the criticality of the features, it was determined that MAP2K1 and PAK1 are the main driving factors for the model's prediction (Crampon et al., \u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e2022\u003c/span\u003e; Kuang et al., \u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e2025\u003c/span\u003e). This discovery is quite astonishing from a biological perspective. MAP2K1 is the central kinase of the RAS/RAF/MEK/ERK pathway, which is a key basis for cell proliferation and survival that are often dysregulated in cancer, while PAK1 is a key node that integrates Rho GTPases signaling to regulate cytoskeletal dynamics, cell movement and transcription (Wang et al., \u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e2024\u003c/span\u003e). Just as the SHAP dependency diagram shows, the synergistic relationship among them indicates that 6PPD-Q exposure may affect these interrelated growth-promoting and invasion-promoting signaling modules (Zhang et al., \u003cspan citationid=\"CR31\" class=\"CitationRef\"\u003e2025\u003c/span\u003e). After adding SYNJ1, SCN2B and VIPR1, a more comprehensive picture is depicted. This suggests that there are dysfunctions in synaptic communication, ionic homeostasis and glial signaling in the toxic mechanism (Choudhry et al., \u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e2021\u003c/span\u003e; Song et al., \u003cspan citationid=\"CR25\" class=\"CitationRef\"\u003e2023\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eMolecular docking simulation provides a crucial structural basis for our hypothesis. Predictions show that 6PPD-Q has always had a strong binding affinity with the protein products of all five core genes, which provides a reasonable physical basis for the observed computational association (Song et al., \u003cspan citationid=\"CR26\" class=\"CitationRef\"\u003e2024\u003c/span\u003e). The binding with MAP2K1 is particularly favorable, and it may be the main target. It may cause abnormal activation of the entire MAPK signal cascade. Although the results obtained from these computer simulations are highly enlightening, they cannot serve as the ultimate evidence of interactions or functional regulation in biological systems (Dankner et al., \u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e2025\u003c/span\u003e; Lampron et al., \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). They are the key starting point for future experimental work.\u003c/p\u003e \u003cp\u003eThis study also has some shortcomings. It is mainly based on calculation, and the causal relationship between 6PPD-Q exposure and glioma formation needs to be verified through rigorous experiments using both in vitro and in vivo models (Nair et al., \u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e2025\u003c/span\u003e). It remains to be determined what exact functional outcomes will occur when 6PPD-Q binds to targets such as MAP2K1 or pak1, whether it activates their functions or inhibits them. Additionally, although the model organisms used in source transcriptomics studies have certain correlations, they may not fully reflect the complexity of human glioma formation (Luo et al., \u003cspan citationid=\"CR20\" class=\"CitationRef\"\u003e2022\u003c/span\u003e; Zhou, \u003cspan citationid=\"CR32\" class=\"CitationRef\"\u003e2024\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eOverall, our comprehensive analysis has constructed a very convincing new theory regarding the potential gliogenicity risk of 6PPD-Q. We have mapped out a coherent molecular network, prioritizing those highly credible core genes with specific diagnostic capabilities and pointing out the synergistic relationships among them. It also provides structural evidence for direct targeting (Cao et al., \u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e2024\u003c/span\u003e; Zhou, \u003cspan citationid=\"CR32\" class=\"CitationRef\"\u003e2024\u003c/span\u003e). This work has transformed 6PPD-Q from a common poison into a molecule that may cause specific and potential damage to key carcinogenic and neurodevelopmental pathways. The core genes identified, along with the synergistic genes for MAP2K1 and PAK1, are promising research objects for future mechanotoxicological studies. It may also be a biomarker for environmental monitoring and risk assessment of the population exposed to this ubiquitous environmental pollutant.\u003c/p\u003e \u003cp\u003eAlthough this study has provided new insights into the possible molecular mechanisms of glioblastoma induced by 6PPD-Q, there are some limitations that need to be noted. Firstly, most of these research results are obtained through computational analysis and publicly available transcriptome data, lacking direct experiments to prove them. Although the results of molecular docking can indicate the possibility of binding between 6PPD-Q and the core target, these predictions based on calculation still need to be confirmed through in vitro and in vivo experiments to determine the causal relationship between them. The transcriptome datasets used in this study have significant heterogeneity and were obtained based on different platforms. Although strict methods were adopted to correct batch effects, there might still be some unprocessed confounding factors. Although the machine learning models demonstrated relatively high prediction accuracy, they were trained on existing glioblastoma data. Perhaps it is impossible to fully capture the dynamic changes and heterogeneity of glioblastoma induced by 6PPD-Q. Because the data was mainly obtained from human tissue samples, this study did not take into account the possible differences in toxicological responses among different species, which limited the generalization of the research results in the in vivo model system. In future research, it is necessary to integrate multi-omics analyses in cell and animal models exposed to 6PPD-Q This is crucial for verifying these computational predictions and clarifying the functional pathways in the context of biology.\u003c/p\u003e"},{"header":"Conclusion","content":"\u003cp\u003eBased on comprehensive computational analysis, this article determined that 6PPD-quinone might be involved in the pathogenesis of glioblastoma by disrupting key molecular networks. We identified 225 potential targets related to 6PPD-Q exposure and GBM, and extracted five core diagnostic genes using machine learning methods. The five core diagnostic genes are MAP2K1, PAK1, SYNJ1, SCN2B and VIPR1 respectively. These genes exhibit rather unique diagnostic properties and are involved in neural functions and carcinogenic pathways. The identified genes can be regarded as promising candidates for future experimental verification and potential environmental risk assessment biomarkers. This research work highlights the value of computational methods in the field of environmental toxicology. It also emphasized the necessity of conducting experimental verification on these prediction results.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eEthics approval and consent to participate\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eNot applicable. This study does not involve human participants.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConsent for publication\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eNot applicable. This study does not involve animal and human participants.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAvailability of data and materials\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eAll data analyzed in this study are from publicly available sources. The specific identifiers and direct links are as follows:\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eGene Expression Omnibus (GEO) datasets\u003c/strong\u003e:\u003cbr\u003e\u0026nbsp;GSE12657, GSE42656, GSE50161, GSE67089, GSE86574\u003cbr\u003e\u0026nbsp;(Access:\u0026nbsp;https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=ACCESSION_NUMBER)\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003ePubChem Compound\u003c/strong\u003e:\u003cbr\u003e\u0026nbsp;CID 154926030\u003cbr\u003e\u0026nbsp;(Access:\u0026nbsp;https://pubchem.ncbi.nlm.nih.gov/compound/154926030)\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eProtein Data Bank (PDB) structures\u003c/strong\u003e:\u003cbr\u003e\u0026nbsp;MAP2K1 (8YP5), PAK1 (4ZJI), SCN2B (6VRR), SYNJ1 (7A17), VIPR1 (3DTX)\u003cbr\u003e\u0026nbsp;(Access:\u0026nbsp;https://www.rcsb.org/structure/PDB_ID)\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eCompeting interests\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eNone declared.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eFunding\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eNo Funding。\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAuthors' contributions\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eG.Z.: Conceptualization, Data curation, Formal analysis, Investigation, Validation, Supervision; Y.X.: Investigation, Data Curation, Writing—Original Draft; Y.Y., J.L. and H.J: Writing—Original Draft, Writing—Review and Editing, Supervision.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003eCao Z, Zhao S, Hu S, Wu T, Sun F, Shi LI. Screening COPD-Related Biomarkers and Traditional Chinese Medicine Prediction Based on Bioinformatics and Machine Learning. Int J Chronic Obstr Pulm Dis. 2024;19:2073\u0026ndash;95. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.2147/copd.S476808\u003c/span\u003e\u003cspan address=\"10.2147/copd.S476808\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCecarini V, Bonfili L, Cuccioloni M, Mozzicafreddo M, Rossi G, Buizza L, Uberti D, Angeletti M, Eleuteri AM. Crosstalk between the ubiquitin\u0026ndash;proteasome system and autophagy in a human cellular model of Alzheimer's disease. Biochim et Biophys Acta (BBA) - Mol Basis Disease. 2012;1822(11):1741\u0026ndash;51. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.bbadis.2012.07.015\u003c/span\u003e\u003cspan address=\"10.1016/j.bbadis.2012.07.015\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eChen L, Hu J, Borthwick AGL, Sun W, Zhang H, Jia D, Liu W. Solar-light-activated periodate for degradation and detoxification of highly toxic 6PPD-quinone at environmental levels. Nat Water. 2024;2(5):453\u0026ndash;63. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1038/s44221-024-00236-3\u003c/span\u003e\u003cspan address=\"10.1038/s44221-024-00236-3\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eChoudhry H, Aggarwal M, Pan P-Y. (2021). Mini-review: Synaptojanin 1 and its implications in membrane trafficking. \u003cem\u003eNeuroscience Letters\u003c/em\u003e, \u003cem\u003e765\u003c/em\u003e. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.neulet.2021.136288\u003c/span\u003e\u003cspan address=\"10.1016/j.neulet.2021.136288\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCrampon K, Giorkallos A, Deldossi M, Baud S, Steffenel LA. Machine-learning methods for ligand\u0026ndash;protein molecular docking. Drug Discovery Today. 2022;27(1):151\u0026ndash;64. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.drudis.2021.09.007\u003c/span\u003e\u003cspan address=\"10.1016/j.drudis.2021.09.007\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDankner M, Rousselle E, Petrecca S, Fabi F, Nowakowski A, Lazaratos A-M, Rajadurai CV, Stein AJB, Bian D, Tai P, Belaiche A, Li M, Quaiattini A, Normanno N, Arcila M, Elkrief A, Johnson DB, Ladanyi M, Rose AAN. Clinical Activity of Mitogen-Activated Protein Kinase Inhibitors in Patients With MAP2K1 (MEK1)-Mutated Metastatic Cancers. JCO Precision Oncol. 2025;9\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1200/po.24.00199\u003c/span\u003e\u003cspan address=\"10.1200/po.24.00199\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eEsemen Y, Awan M, Parwez R, Baig A, Rahman S, Masala I, Franchini S, Giakoumettis D. Molecular Pathogenesis of Glioblastoma in Adults and Future Perspectives: A Systematic Review. Int J Mol Sci. 2022;23(5). \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.3390/ijms23052607\u003c/span\u003e\u003cspan address=\"10.3390/ijms23052607\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFang J, Wang X, Cao G, Wang F, Ru Y, Wang B, Zhang Y, Zhang D, Yan J, Xu J, Ji J, Ji F, Zhou Y, Guo L, Li M, Liu W, Cai X, Cai Z. 6PPD-quinone exposure induces neuronal mitochondrial dysfunction to exacerbate Lewy neurites formation induced by α-synuclein preformed fibrils seeding. J Hazard Mater. 2024;465. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.jhazmat.2023.133312\u003c/span\u003e\u003cspan address=\"10.1016/j.jhazmat.2023.133312\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGao J, Zhang M, Chen Q, Ye K, Wu J, Wang T, Zhang P, Feng G. Integrating machine learning and molecular docking to decipher the molecular network of aflatoxin B1-induced hepatocellular carcinoma. Int J Surg. 2025;111(7):4539\u0026ndash;49. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1097/js9.0000000000002455\u003c/span\u003e\u003cspan address=\"10.1097/js9.0000000000002455\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGreer JB, Dalsky EM, Lane RF, Hansen JD. Tire-Derived Transformation Product 6PPD-Quinone Induces Mortality and Transcriptionally Disrupts Vascular Permeability Pathways in Developing Coho Salmon. Environ Sci Technol. 2023;57(30):10940\u0026ndash;50. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1021/acs.est.3c01040\u003c/span\u003e\u003cspan address=\"10.1021/acs.est.3c01040\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGuo B, Jiang X, Zhu L, He X. Exploring the Diagnostic Potential of Core Targets of 6PPD and Its Metabolite 6PPD-Q in Cardiovascular Diseases: An Integrated Analysis Based on Network Toxicology, Molecular Docking, and In Vitro Validation. J Appl Toxicol. 2025. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1002/jat.4892\u003c/span\u003e\u003cspan address=\"10.1002/jat.4892\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHuang Z, Chen C, Guan K, Xu S, Chen X, Lin Y, Li X, Shan Y. Protective role of ghrelin against 6PPD-quinone-induced neurotoxicity in zebrafish larvae (Danio rerio) via the GHSR pathway. Ecotoxicol Environ Saf. 2024;285. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.ecoenv.2024.117031\u003c/span\u003e\u003cspan address=\"10.1016/j.ecoenv.2024.117031\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eJana S, Sinha M, Chanda D, Roy T, Banerjee K, Munshi S, Patro BS, Chakrabarti S. Mitochondrial dysfunction mediated by quinone oxidation products of dopamine: Implications in dopamine cytotoxicity and pathogenesis of Parkinson's disease. Biochim et Biophys Acta (BBA) - Mol Basis Disease. 2011;1812(6):663\u0026ndash;73. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.bbadis.2011.02.013\u003c/span\u003e\u003cspan address=\"10.1016/j.bbadis.2011.02.013\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKuang L, Wang X, He Z, Zhang Y, Luo J, Zhang W, Fu Z, Tuo X. Explore the toxicological mechanism of 6PPD-Q on human health through a novel perspective: The interaction between lactate dehydrogenase and 6PPD-Q. Int J Biol Macromol. 2025;293. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.ijbiomac.2024.139266\u003c/span\u003e\u003cspan address=\"10.1016/j.ijbiomac.2024.139266\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLampron M-C, Par\u0026eacute; I, Al-Zharani M, Semlali A, Loubaki L. Cannabinoid Mixture Affects the Fate and Functions of B Cells through the Modulation of the Caspase and MAP Kinase Pathways. Cells. 2023;12(4). \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.3390/cells12040588\u003c/span\u003e\u003cspan address=\"10.3390/cells12040588\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLan Z, Li X, Zhang X. Glioblastoma: An Update in Pathology, Molecular Mechanisms and Biomarkers. Int J Mol Sci. 2024;25(5). \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.3390/ijms25053040\u003c/span\u003e\u003cspan address=\"10.3390/ijms25053040\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLi G, Dan L, Zhai W, Liu C, Chen M, Xu Q, Huang Y. Respiratory toxicity mechanism of 6PPD and 6PPD-quinone: An integrated study based on network toxicology and molecular docking. Ecotoxicol Environ Saf. 2025;301. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.ecoenv.2025.118494\u003c/span\u003e\u003cspan address=\"10.1016/j.ecoenv.2025.118494\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLi Y, Zeng J, Liang Y, Zhao Y, Zhang S, Chen Z, Zhang J, Shen X, Wang J, Zhang Y, Sun Y. A Review of N-(1,3-Dimethylbutyl)-N\u0026prime;-phenyl-p-Phenylenediamine (6PPD) and Its Derivative 6PPD-Quinone in the Environment. Toxics. 2024;12(6). \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.3390/toxics12060394\u003c/span\u003e\u003cspan address=\"10.3390/toxics12060394\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLiao X-L, Zhou J-M, Wang Y, Chen Z-F, Cai Z. Network pharmacology and transcriptomics reveal androgen receptor as a potential protein target for 6PPD-quinone. Sci Total Environ. 2024;957. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.scitotenv.2024.177678\u003c/span\u003e\u003cspan address=\"10.1016/j.scitotenv.2024.177678\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLuo J, Junaid M, Hamid N, Duan J-J, Yang X, Pei D-S. Current understanding of gliomagenesis: from model to mechanism. Int J Med Sci. 2022;19(14):2071\u0026ndash;9. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.7150/ijms.77287\u003c/span\u003e\u003cspan address=\"10.7150/ijms.77287\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eNair P, Barrett H, Tanoto K, Xie L, Sun J, Yang D, Yao H, Song D, Peng H. Structure and Toxicity Characterization of Alkyl Hydroxylated Metabolites of 6PPD-Q. Environ Sci Technol. 2025;59(15):7474\u0026ndash;84. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1021/acs.est.4c11823\u003c/span\u003e\u003cspan address=\"10.1021/acs.est.4c11823\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePace A, Dirven L, Koekkoek JAF, Golla H, Fleming J, Rud\u0026agrave; R, Marosi C, Rhun EL, Grant R, Oliver K, Oberg I, Bulbeck HJ, Rooney AG, Henriksson R, Pasman HRW, Oberndorfer S, Weller M, Taphoorn MJB. European Association for Neuro-Oncology (EANO) guidelines for palliative care in adults with glioma. Lancet Oncol. 2017;18(6):e330\u0026ndash;40. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/s1470-2045(17)30345-5\u003c/span\u003e\u003cspan address=\"10.1016/s1470-2045(17)30345-5\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePouyan A, Ghorbanlo M, Eslami M, Jahanshahi M, Ziaei E, Salami A, Mokhtari K, Shahpasand K, Farahani N, Meybodi TE, Entezari M, Taheriazam A, Hushmandi K, Hashemi M. Glioblastoma multiforme: insights into pathogenesis, key signaling pathways, and therapeutic strategies. Mol Cancer. 2025;24(1). \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1186/s12943-025-02267-0\u003c/span\u003e\u003cspan address=\"10.1186/s12943-025-02267-0\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSchaff LR, Mellinghoff IK. Glioblastoma and Other Primary Brain Malignancies in Adults. JAMA. 2023;329(7). \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1001/jama.2023.0023\u003c/span\u003e\u003cspan address=\"10.1001/jama.2023.0023\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSong DY, Yuan L, Cui N, Feng C, Meng L, Wang XH, Xiang M, Liu D, Wang C, Zhang Z, Li JY, Li W. α-Synuclein induces deficiency in clathrin‐mediated endocytosis through inhibiting synaptojanin1 expression. J Neurochem. 2023;167(3):461\u0026ndash;84. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1111/jnc.15974\u003c/span\u003e\u003cspan address=\"10.1111/jnc.15974\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSong Y, Weng W, Wu S. Investigating the Potential Effects of 6PPDQ on Prostate Cancer Through Network Toxicology and Molecular Docking. Toxics. 2024;12(12). \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.3390/toxics12120891\u003c/span\u003e\u003cspan address=\"10.3390/toxics12120891\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWang X, Li H, Xu A, Peng J, Wu Y, Liu Y, Zhang J, Cai C, Ma S, Zhang K. Inhibition of the long non-coding RNA MALAT1 downregulates MAP2K1, suppressing the progression of hypopharyngeal squamous cell carcinoma. Biomolecules Biomed. 2024;25(5):1023\u0026ndash;37. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.17305/bb.2024.11151\u003c/span\u003e\u003cspan address=\"10.17305/bb.2024.11151\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWang Y, Yuan Y, Wang W, He Y, Zhong H, Zhou X, Chen Y, Cai X-J, Liu L-q. (2022). Mechanisms underlying the therapeutic effects of Qingfeiyin in treating acute lung injury based on GEO datasets, network pharmacology and molecular docking. \u003cem\u003eComputers in Biology and Medicine\u003c/em\u003e, \u003cem\u003e145\u003c/em\u003e. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.compbiomed.2022.105454\u003c/span\u003e\u003cspan address=\"10.1016/j.compbiomed.2022.105454\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eYu H, Zhang W, Wang D, Shi B, Zhu Y, Hu W, He J, Hong J, Xu X, Zheng X, Chen W, Wang F, Qu F. Exposure to 6PPD-Q induces dysfunctions of ovarian granulosa cells: Its potential role in PCOS. J Hazard Mater. 2025;486. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.jhazmat.2024.137037\u003c/span\u003e\u003cspan address=\"10.1016/j.jhazmat.2024.137037\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eZhang S-Y, Gan X, Shen B, Jiang J, Shen H, Lei Y, Liang Q, Bai C, Huang C, Wu W, Guo Y, Song Y, Chen J. 6PPD and its metabolite 6PPDQ induce different developmental toxicities and phenotypes in embryonic zebrafish. J Hazard Mater. 2023;455. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.jhazmat.2023.131601\u003c/span\u003e\u003cspan address=\"10.1016/j.jhazmat.2023.131601\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eZhang W, Wu B, Hu C, Rong W, Huang Y, Hu S, Qin Y. EGFR and CYP signaling disruption underlies 6PPD-quinone hepatotoxicity: Insights from a network and machine learning approach. Toxicology. 2025;517. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.tox.2025.154235\u003c/span\u003e\u003cspan address=\"10.1016/j.tox.2025.154235\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eZhou L. Generation and banking of patient-derived glioblastoma organoid and its application in cancer neuroscience. Am J cancer Res. 2024;14(10):5000\u0026ndash;10. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.62347/nsva5836\u003c/span\u003e\u003cspan address=\"10.62347/nsva5836\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":true,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"6PPD-Quinone, bioinformatics, glioblastoma, machine learning, molecular docking, environmental carcinogenesis, neural signaling pathways","lastPublishedDoi":"10.21203/rs.3.rs-8679784/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-8679784/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eThrough machine learning analysis, we identified five core genes with special diagnostic performance, namely SCN2B, VIPR1, PAK1, MAP2K1 and SYNJ1. The AUC value of this integrated model in the validation queue reached 0.957, and its prediction accuracy is relatively high. From the analysis of SHAP interpretability, it can be found that MAP2K1 is the most influential predictor, and there are complex nonlinear relationships among core genes. Molecular docking simulations have provided structural evidence that can support the direct interaction between 6 PPD-Q and the target protein. The vina score ranges from \u0026minus;\u0026thinsp;5.9 to -8.2, and they have a strong specific binding affinity.\u003c/p\u003e","manuscriptTitle":"A Predictive Framework Integrating Network Toxicology and Machine Learning Elucidates 6PPD-Quinone-Induced Glioblastoma Pathogenesis","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-02-19 11:48:18","doi":"10.21203/rs.3.rs-8679784/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"5fa5de55-d18b-4205-b0f2-66d81ff3c7d3","owner":[],"postedDate":"February 19th, 2026","published":true,"recentEditorialEvents":[{"type":"decision","content":"Withdrawn","date":"2026-05-06T11:40:51+00:00","index":"","fulltext":""}],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[],"tags":[],"updatedAt":"2026-05-06T11:57:16+00:00","versionOfRecord":[],"versionCreatedAt":"2026-02-19 11:48:18","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-8679784","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-8679784","identity":"rs-8679784","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00