Machine learning and data-driven inverse modeling of metabolomics unveil key process of active aging

doi:10.21203/rs.3.rs-5377652/v1

2024 · doi:10.21203/rs.3.rs-5377652/v1

preprint OA: closed

Article

Jiahang Li, Martin Brenner, Iro Pierides, Barbara Wessner, Bernhard Franzke, and 4 more

This is a preprint; it has not been peer reviewed by a journal. https://doi.org/10.21203/rs.3.rs-5377652/v1. This work is licensed under a CC BY 4.0 License. Status: Published. Journal publication published 24 Sep, 2025; the published version appears in npj Systems Biology and Applications. Version 1 posted 19

Abstract

Physical inactivity and weak fitness status have become a global health concern. Metabolomics, as an integrative, systematic approach, might be linked to an individual's fitness at the molecular level. In this study, we performed metabolomics analysis of blood samples from a cohort of elderly people receiving different treatments.
By defining two fitness groups and their corresponding metabolite profiles, we tested several machine learning classifiers to identify key metabolite biomarkers; these robustly identified aspartate as a dominant negative marker of fitness. Subsequently, the metabolomics data of the two groups were analyzed with COVRECON, a novel approach for inferring metabolic network interactions, which identified the enzyme AST as the most important point of metabolic regulation distinguishing the fit and the less fit groups. Routine blood tests in two cohorts validated significant differences in AST and ALT. In summary, we combine machine-learning classification and COVRECON to identify metabolomics biomarkers and causal processes for fitness of elderly people.

Subject terms: Health sciences/Biomarkers; Health sciences/Health care; Biological sciences/Systems biology/Biochemical networks; Biological sciences/Systems biology/Dynamic networks

Keywords: Active aging; metabolic network inference; automatic machine learning; data-driven modeling

Introduction

Physical inactivity is a worldwide health problem and is ranked as the fourth leading behavioral risk factor for global mortality [ 1 ]. The imperative to maintain body activity, physically and metabolically, is on the rise. The concept of active aging, inspired by Robert Havighurst's activity theory [ 2 ], suggests that maintaining an active lifestyle is crucial for the well-being of older individuals. The idea of active aging emerged and began to develop in the 1990s, placing strong emphasis on the link between activity and health [ 3 ]. This focus became particularly pertinent due to the worldwide aging population, leading to concerns about inactivity permeating various social domains [ 4 ]. With the transition into the 2020s, there has been an escalating emphasis on harnessing technology to foster healthy aging [ 5 – 7 ].
Beyond longevity, active aging encompasses regular physical activity, better management of chronic diseases, and improved quality of life [ 8 ]. Conventionally, several studies have examined various physical aspects of active aging, such as sleep, sedentary time, muscle strengthening activities, and movement behaviors [ 9 , 10 ]. As the new technology to diagnose diseases, metabolomics involves the comprehensive analysis of small-molecule metabolites (< 10 kDa) present in a biological sample, including metabolic intermediates, hormones, signaling molecules, and secondary metabolites [ 11 , 12 ]. Functioning as the culmination of all biological processes in the body, metabolites play a pivotal role in energy generation, signal transmission, and carrying essential information about the body's status and ongoing functions. Consequently, important metabolites possess the potential to serve as aging biomarkers or as integral components of the metabolic signature. This signature mirrors the active state of the organism as it traverses the aging process [ 13 ]. The development of metabolomics empowers us to scrutinize health issues at the molecular level [ 14 ]. Notably, amid the COVID-19 pandemic, metabolomics has demonstrated its potency as a diagnostic, prognostic, and drug intervention tool [ 15 ]. As expected, COVID-19 has been extensively investigated using metabolomics methodologies, contributing to biomarker studies [ 16 , 17 ] and evaluations of drug impacts [ 18 , 19 ]. Beyond specific disease diagnosis, metabolomics can also illuminate our comprehension of bodily activities (active aging). Recent endeavors have delved into metabolic profiling within aging studies [ 13 , 20 ], providing us with overarching insights into metabolic changes during the aging process. Here we specifically focus on the metabolomics profiling of older adults and aim to detect key biomarkers and important metabolic interactions related to active aging. 
The emerging large-scale datasets from OMICS (metabolomics, proteomics, transcriptomics and genomics) measurements empower us to scrutinize any question in biology from a systemic perspective [ 21 , 22 ]. In the field of systems biology, a central goal is to identify biomarkers and infer biochemical regulations from large-scale metabolomics data [ 23 ]. Statistical methods, especially when combined with machine learning techniques, have proven powerful for constructing accurate classifiers capable of distinguishing between diverse sample groups and revealing underlying biomarkers [ 24 – 27 ]. However, statistical methods offer limited insights into how information is transferred within a biochemical network, the critical regulatory steps involved, and how regulatory mechanisms change under different conditions [ 11 , 21 , 28 , 29 ]. Several studies have emphasized the necessity of analyzing the dynamic behavior of metabolism to understand the evolution and maintenance of stable metabolic homeostasis under varying environmental conditions [ 30 – 34 ]. Mathematically, kinetic models can provide systemic insights into metabolic networks, but constructing these models and estimating parameters poses challenges, particularly for large-scale models [ 35 ]. In recent years, several studies have focused on investigating the steady-state Jacobian from metabolomics data [ 36 – 40 ] integrated with fluxomics or time-series measurements. In addition, using only metabolomics measurements from large numbers of samples, recent studies have developed inverse differential Jacobian algorithms, which provide a convenient way to infer differences in the dynamics of metabolic networks between different conditions [ 21 , 31 – 34 , 41 – 45 ]. Among them, the most recent study developed a novel method and workflow termed COVRECON for analyzing key biochemical regulations through the solution of a differential Jacobian problem [ 21 , 33 , 34 ].
The COVRECON approach integrates the covariance matrix of metabolomics data with automatic metabolic network modeling based on genome-scale metabolic reconstructions and biochemical reaction databases. Figure 1 illustrates the workflow of this study, which consists of three main steps. In step 1, we cluster the original samples into different groups based on physical and functional measurements, where each group represents a different body activity condition. Step 2 involves building machine learning-based classifiers to identify these groups from metabolomics data, thus enabling the identification of key metabolites as biomarkers. Finally, in step 3, we employ the inverse Jacobian analysis and the COVRECON workflow to uncover the most important biochemical regulations associated with the identified body activity conditions. With this approach, we intend to contribute to the understanding of active aging from a metabolomics perspective and shed light on the key biochemical regulations underlying different fitness conditions.

Results

This study was performed in 5 retirement homes in Vienna managed by the Curatorship of Viennese Retirement Homes, with the main aim of assessing the impact of resistance training and protein-vitamin supplementation or cognitive training on physical performance (Oesen et al. 2015). In this secondary analysis, we focus on the plasma metabolomics changes and the identification of potential biomarkers and biochemical processes for fitness. The cohort of older adults, with an average age above life expectancy, consisted of 117 participants at baseline, and altogether we measured 263 plasma metabolomic samples. The subjects were randomly assigned to three groups (see Supplementary Figure S1 ): resistance training (T), resistance training and supplements (E), and cognitive training, acting as a control group (K). Blood samples were collected at baseline (T1), after three months (T2) and after six months (T3).
To establish the relationship between body activity, fitness and metabolomics profiles, we initially investigated the physical data measurements, which consisted of two types: body functionality and body shape. Moreover, the body strength measurements can be further divided into resistance exercise and endurance exercise types. Table 1 shows the differences in the physical measurements across the three groups. As expected, compared to the control group (K), the resistance training groups (T and E) exhibited better resistance measurements. Nevertheless, there was no influence on endurance measurements (e.g. walking distance). Notably, endurance exercise has been reported to be more closely related to aging conditions than resistance measurements [ 46 , 47 ]. This is also consistent with the experimental design, where the older adults were randomly assigned to the three groups regardless of their fitness.

Canonical Correlation Analysis (CCA) based clustering to assess physical fitness in a cohort of older institutionalized adults

Since our question concerns the relationship between metabolomics and body activity, we employed Canonical Correlation Analysis (CCA) to generate a body activity index based on the body functionality dataset. Subsequently, we clustered the older adults and samples into two, four, and six groups based on this body activity index. As demonstrated in Fig. 2 A, the generated body activity index is highly correlated with the metabolomics index (Pearson coefficient = 0.8471); the CCA loadings of the body activity index are listed alongside. Among all physical indexes, walking distance showed the most dominant effect within the body activity index. This observation is biologically reasonable since walking distance directly reflects an individual's endurance condition, which is directly related to the aging process [ 46 , 47 ].
Considering the potential non-linear relationship between the generated body activity index and the metabolomics index, we constructed an automated machine-learning classifier using the XGBoost algorithm as described in the Methods. The classifier was trained with a maximum of 100 models. Over 25 random train-test splits, the average AUCs calculated on the 25% hold-out test sets were 93.76%, 83.47%, and 61.75% for the two-, four-, and six-group clusters, respectively (Fig. 2 C). This indicates that the CCA-generated body activity index and metabolomics index are strongly correlated. For the inverse Jacobian analysis, we divided all the older adults into two groups using the mean body activity index, as shown in Fig. 2 B. For comparison, we also performed CCA between the metabolomics data and body shape features, such as gender, height, and age. The biplots of the CCA from the metabolomics and body shape analysis are presented in Supplementary Figure S2 . The highest Spearman's correlation coefficient obtained was only 0.4963, for the age index. Additionally, we conducted a further CCA considering the metabolomics data along with both body functionality and body shape data. However, the Pearson correlation coefficient increased only marginally, from 0.8773 to 0.8903. This indicates that the metabolomics data are primarily influenced by the body strength/functionality aspects. Consequently, this supports both the body activity index and the metabolomics index that we developed. The results of the CCA-based cluster analysis highlight the strong relationship between the derived body activity index and the metabolomics index. The dominance of walking distance as a key factor indicates its significance as a reflection of an individual's health condition and metabolic activity.
In the following analysis, we focus on the two groups of older adults clustered based on the body activity index, labelled as the active group and the less-active group.

Machine learning-based classifiers and variable importance reveal a strong association between metabolites and fitness

In this section, we built several machine learning-based classifiers to predict the active/less-active groups from the metabolomics dataset. This approach can provide valuable insights into the non-linear relationship between the metabolomics index and the body activity index. As mentioned in the Methods, we evaluated the predictive performance of five machine learning algorithms: XGBoost, DRF, GLM, GBM, and DeepLearning. We applied the automatic machine learning framework and selected the best model for each algorithm based on cross-validation AUC values. The chosen models were then assessed on the 25% hold-out test dataset. Fig. 3 compares the classifier performance of the five machine learning algorithms. Figure 3 A illustrates the average AUC values calculated on the 25% hold-out test sets for each algorithm across the 25 random train-test splits. The detailed AUC results are listed in Supplementary material S2. XGBoost achieved the highest performance, with an average AUC value of 0.9150. This result was statistically significant (Wilcoxon signed-rank test, P < 0.01) compared to the generalized linear models (GLM), which had an average AUC value of 0.8695. The superior performance of XGBoost suggests the presence of non-linear effects originating from the metabolic network systems. To assess the effect of sample size on the classifiers' performance, we randomly removed a quarter of the training sets and re-evaluated the five algorithms. The AUC of each algorithm hardly changed. Surprisingly, DRF, GLM, and Deep Learning showed improved AUC with fewer samples.
This effect may be attributed to the presence of outlier samples in the original dataset, which introduced noise during the training process, resulting in poorer performance. To assess the importance of metabolites directly related to the two body activity groups, we ranked the metabolites extracted from the five algorithms based on the testing dataset. We identified the top 10 metabolites for each algorithm by calculating the average variable importance across the 25 repeats. The algorithm-metabolite bipartite graph is shown in Fig. 3 B, where Aspartate, Proline, Fructose, Pyruvate and Malic Acid were consistently identified as the top metabolites across almost all classifiers. The detailed metabolite importance values of each algorithm are presented in Supplementary material S2. For a better understanding of the variable importance results, we applied a multi-algorithm auto-machine learning approach, including all five algorithms with a maximum of 100 models, using the 'automl' function in the H2O Python package. XGBoost demonstrated the best performance, as shown in Supplementary Table S1 . The Pareto front plot in Supplementary Figure S3 determined the optimal subset of classifiers, which included XGBoost and GBM classifiers, highlighting the superiority of boosting methods for this task. Figure 3 C and 3 D present the variable importance and SHAP summary plot for the leading XGBoost classifier on the test set. The analysis revealed that Aspartate was the most important metabolite, accounting for over 90% of the importance. This highlights the direct influence of the metabolomics aspect on the body activity index. The Spearman's correlation heatmap shown in Fig. 4 further supports this observation, with Aspartate exhibiting the most significant correlation with body strength data.
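The repeated hold-out evaluation described above (25 random 75/25 splits, average test AUC per algorithm, and a Wilcoxon signed-rank comparison of a boosted model against a linear one) can be sketched as follows. This is an illustration under stated assumptions rather than the authors' H2O workflow: scikit-learn's `GradientBoostingClassifier` and `LogisticRegression` are swapped in as stand-ins for XGBoost and GLM, the data come from `make_classification`, stratified splitting is an assumption, and `repeated_holdout_auc` is a hypothetical helper name.

```python
import numpy as np
from scipy.stats import wilcoxon
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the metabolomics matrix and the two activity groups.
X, y = make_classification(n_samples=260, n_features=30, n_informative=5,
                           random_state=0)


def repeated_holdout_auc(make_model, X, y, n_repeats=25, test_size=0.25):
    """Return the test-set AUC for each of n_repeats random splits."""
    aucs = []
    for seed in range(n_repeats):
        # The seed fixes the split, so every algorithm sees identical splits
        # and the per-split AUCs are paired across algorithms.
        X_tr, X_te, y_tr, y_te = train_test_split(
            X, y, test_size=test_size, stratify=y, random_state=seed)
        model = make_model().fit(X_tr, y_tr)
        aucs.append(roc_auc_score(y_te, model.predict_proba(X_te)[:, 1]))
    return np.array(aucs)


auc_boost = repeated_holdout_auc(
    lambda: GradientBoostingClassifier(random_state=0), X, y)
auc_linear = repeated_holdout_auc(
    lambda: LogisticRegression(max_iter=1000), X, y)

# Paired, non-parametric comparison of the two AUC series.
statistic, p_value = wilcoxon(auc_boost, auc_linear)
print(f"boosting {auc_boost.mean():.3f}, linear {auc_linear.mean():.3f}, "
      f"p = {p_value:.3g}")
```

Reusing the same seeds for both algorithms is what justifies a paired test; with independent splits an unpaired test would be the appropriate choice.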
Although other metabolites, such as Proline, Malic Acid, and Pyruvate, had lower importance values, they consistently appeared among the top 10 metabolites across different classifiers. In Fig. 4 , we also performed a t-test for all metabolites between the two groups and plotted the significant differences. Interestingly, these did not fully match the classifier results; for example, Pyruvate was identified as a key metabolite by all classifiers but did not show a significant difference. This may suggest that the effect of Pyruvate is non-linear between the two groups. In addition, as shown in Fig. 3 D, the SHAP plot of the top classifier metabolites still shows good separation between the two groups, albeit with less pronounced distinctions compared to Aspartate. This further indicates that they play a role in reflecting non-linear metabolic effects on the body activity index. We chose the eight most important metabolites (Aspartate, Proline, Fructose, Malic Acid, Pyruvate, Valine, Citrate and Ornithine) and mapped them to the KEGG pathways, as shown in Supplementary Figure S4. Aside from a few large comprehensive pathways, the top metabolites identified in the classifier results are most related to Central carbon metabolism in cancer and 2-Oxocarboxylic acid metabolism. However, this reveals only a surface-level connection between active aging and these pathways, and falls short of providing a comprehensive understanding of the biochemical regulations underlying the active aging dynamics.

Predictive inverse metabolic interaction modelling using the COVRECON platform

While the machine learning and classifier results provide insights into the variable importance between the measured metabolites and the body activity index, they do not explain the mechanistic change between the two groups.
Since metabolomics analysis was performed three times for each older adult (at the first time point, after 3 months and after 6 months), we plotted the correlation heatmap of the changes in all body features and metabolomics measurements within the two time intervals in Fig. 5 . It is evident that the correlation patterns within the metabolomics measurement changes show high similarity. This reflects the internal dynamics of the metabolic networks. Nevertheless, when we check highly correlated metabolites, we may find no biochemical reaction between the two metabolites in any database. This situation occurs frequently; for example, in Fig. 5 , Threonine, Tyrosine and Valine show a high correlation, yet no direct biochemical reactions occur among them. This is because the high correlations originate from the network dynamics. Thus, finding the causal interactions among the metabolites is crucial. In recent years, inverse differential Jacobian algorithms have been developed, providing a convenient way to infer causal dynamics of metabolic networks from metabolomics data (Nägele, et al., 2014; Sun and Weckwerth, 2012; Weckwerth, 2019; Wilson, et al., 2020; Li, et al., 2023). Besides the metabolomics measurements, metabolic reconstruction is used as complementary information to build a topological model of the metabolic interaction network. Based on this, we have developed the COVRECON toolbox (available at: https://bitbucket.org/mosys-univie/covrecon/src/main/ ) (Li, et al., 2023). As described in the Methods, we applied the COVRECON workflow to the datasets of the two groups. The COVRECON workflow consists of two steps: building the metabolic interaction network and the inverse Jacobian calculation. As described in COVRECON (Li, et al., 2023), we used the default setting in the Sim-Network part to generate a metabolic super-pathway network of the measured metabolites.
Each edge in the network represents a feasible pathway between two nodes (metabolites) and reflects a non-zero component in the system Jacobian matrix. The default setting assigns a fixed weight of one to each reaction, and the reverse reaction weight is based on the log value of its delta Gibbs free energy. Additionally, a pathway-steps limitation of 4 is set. Detailed information about reactions, enzymes and genes of the resulting metabolic interaction network can be found in the Supplemental Material S3. By integrating the covariance of the metabolomics data from both groups and the Jacobian structure matrix, we can perform the inverse Jacobian analysis in the second part of COVRECON toolbox. The COVRECON workflow and toolbox address the ill-conditioned matrix problem associated with the inverse Jacobian approach through a regression loss-based algorithm, significantly improving its stability and feasibility [ 33 , 34 , 43 ]. However, given that the inverse Jacobian approach is based on the Jacobian structure and is more reliable in smaller-sized models, we selected a tailored core part of the whole model containing 10–20 metabolites based on the classifier variable importance results as described in method part. The same network reduction strategy as in Sim-Network was employed, with additional indirect connections added to the reduced model. For example, an additional connection from Proline to Aspartate was added to account for the indirect effects through the connections from Proline to Asparagine and from Asparagine to Aspartate (Fig. 6 ). Figure 5 presents 12 typical results in the repeated calculation. All the repeated results are available in Supplementary material S6. It is evident that even though the local results are different due to the influence from the Jacobian structure information, the Inverse Jacobian approach shows stability on several highlighted metabolic interactions. 
For example, the interactions Proline->Aspartate, Ornithine->Aspartate, Citrate->Aspartate and Glutamate->2-oxo glutaric acid are high valued in the resulted differential metabolic interaction network of many repeats. To present the overall metabolic interaction importance, we integrated all the 200 local results into the full differential Jacobian (DJ) by calculating the average value of each metabolic interaction within the repeats. The final R* matrix and the differential interaction network are presented in Fig. 6 A & 6 B respectively. In Fig. 6 B, we plot only the highlighted metabolic interactions with calculated value (scaled to 0–1) above 0.5. Here we note, the result showed robustness, with similar overall R* using 100, 200 and 500 repeats. Further results are using 200 repeats. Through this COVRECON approach, we are able to find several important perturbed metabolic interactions between the two body activity index clustered groups. The highlighted interactions and the detailed reactions, enzymes and gene information are presented in Supplementary material S4. These findings provide valuable insights into the regulatory interactions and dynamics of the metabolic network related to Aspartate, further supporting its importance as the dominant biomarker in the classifiers results. As shown in Fig. 7 C, several reactions are consistently identified in several highlighted metabolic interactions. Among these, enzyme aspartate transaminase (AST, EC number 2.6.1.1) is identified in 11 out of the 15 highlighted interactions and shown in all the largest valued interactions: Proline->Aspartate, Valine->Aspartate, Citrate->Aspartate and Glutamate->2-oxo glutaric acid. The enzyme Glutamic-Pyruvic Transaminase (ALT, EC number: 2.6.1.2) is also highlighted. Notably, both AST and ALT are important enzymes in amino acid metabolism, and recently there is indication of their involvement in health-related issues of older adults [ 48 – 50 ]. 
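To make the inverse Jacobian idea concrete, the sketch below uses the Lyapunov-type relation J C + C Jᵀ = −2D that earlier inverse Jacobian work builds on, where C is the metabolite covariance matrix, J the Jacobian and D a fluctuation matrix. The text above does not spell this equation out, so treating it as the starting point is an assumption, as are the known D, the three-metabolite chain, the plain least-squares solver (COVRECON itself uses a regression loss-based algorithm) and the element-wise difference used to compare the two conditions. All function names are hypothetical.

```python
import numpy as np
from scipy.linalg import solve_continuous_lyapunov


def covariance_from_jacobian(J, D):
    """Forward problem: solve J C + C J^T = -2 D for the covariance C."""
    return solve_continuous_lyapunov(J, -2.0 * D)


def jacobian_from_covariance(C, D, mask):
    """Inverse problem: least-squares estimate of the entries of J allowed
    by the structure mask, given the covariance C and an assumed D."""
    n = C.shape[0]
    unknowns = [(a, b) for a in range(n) for b in range(n) if mask[a, b]]
    rows, rhs = [], []
    for i in range(n):
        for k in range(i, n):  # the relation is symmetric: upper triangle only
            row = np.zeros(len(unknowns))
            for col, (a, b) in enumerate(unknowns):
                if a == i:
                    row[col] += C[b, k]  # contribution of (J C)[i, k]
                if a == k:
                    row[col] += C[i, b]  # contribution of (C J^T)[i, k]
            rows.append(row)
            rhs.append(-2.0 * D[i, k])
    solution, *_ = np.linalg.lstsq(np.array(rows), np.array(rhs), rcond=None)
    J = np.zeros((n, n))
    for value, (a, b) in zip(solution, unknowns):
        J[a, b] = value
    return J


# Hypothetical chain 0 -> 1 -> 2; the two conditions differ only in J[1, 0].
J_less = np.array([[-1.0, 0.0, 0.0],
                   [0.5, -1.0, 0.0],
                   [0.0, 0.8, -1.5]])
J_active = J_less.copy()
J_active[1, 0] = 1.5
D = 0.1 * np.eye(3)      # assumed fluctuation matrix; unknown in practice
mask = J_less != 0       # Jacobian structure, e.g. from a network reconstruction

C_less = covariance_from_jacobian(J_less, D)
C_active = covariance_from_jacobian(J_active, D)

J_less_est = jacobian_from_covariance(C_less, D, mask)
J_active_est = jacobian_from_covariance(C_active, D, mask)

# One simple way to compare conditions (an illustrative choice only).
differential = J_active_est - J_less_est
top_interaction = np.unravel_index(np.abs(differential).argmax(),
                                   differential.shape)
```

In this toy system metabolites 0 and 2 have no direct interaction (`J[0, 2]` and `J[2, 0]` are zero), yet `C_less[0, 2]` is non-zero. This mirrors the earlier observation that strongly correlated metabolites need not share a reaction, and it is why the structure mask matters: without it the symmetric relation provides fewer equations than unknown Jacobian entries.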
Furthermore, the enzyme asparagine synthetase B (EC number: 6.3.5.4) was identified in 8 out of the 15 highlighted interactions. This enzyme is less studied in relation to health issues of elderly people. However, asparagine synthetase (ASNS) deficiency was recently discovered as a metabolic disorder of non-essential amino acids [ 51 ]. Moreover, it is evident that most identified enzymes in Fig. 6 c belong to the enzyme class of transaminases (EC:2.6.1.-). Transaminases are important in the production of various amino acids, and measuring the concentrations of various transaminases in the blood is important in diagnosing and tracking many diseases [ 52 ]. For a further analysis of the enzymes, we conducted routine blood test measurements of the older adults across the three time points. Four metabolic enzymes were measured: AST, ALT, Gamma-glutamyltransferase (GGT) and Creatine Kinase (CK). The measurements are presented in Supplementary material S2. As shown in Fig. 8 and Supplementary Figure S6, we compared the enzyme measurements between the two groups (active/less active). The results suggested significant differences in AST and ALT, while GGT and CK did not exhibit such significant variations. This observation validates the inverse Jacobian results in Fig. 7 . Furthermore, we compared the AST and ALT changes within the two 3-month time intervals. As demonstrated in Fig. 7 , both AST and ALT showed significant changes in the “active group”, while the changes were not significant in the “less active group” during both 3-month intervals. Notably, the changes also differed significantly between the two groups. Specifically, in the “active group”, AST and ALT showed a significantly larger decrease during the first 3 months, followed by a significantly larger increase in the subsequent 3-month interval. This suggests a larger plasticity of the enzymatic liver and muscle systems in individuals with a high level of body activity.
Interestingly, a few studies have revealed similar observations while investigating enzyme variations. In a long-term study of 29 routine laboratory measurements of 30 athletes, AST and ALT exhibited significantly larger variations over an 11-month period compared to those reported for the general population [ 53 , 54 ]. Moreover, various studies have provided evidence of enzyme fluctuations in blood samples of healthy individuals resulting from physical activity and exercise [ 10 , 55 – 59 ].

Discussion

In this article, we measured 263 plasma metabolomics samples to study active aging and fitness in a cohort of very old adults close to or above the average life expectancy. Using a CCA approach, we clustered all older adults and samples into two groups based on a body activity index. We then identified several key biomarkers distinguishing these two groups through machine learning and deep learning analysis. The identified metabolites are Aspartate, Proline, Fructose, Malic Acid, Pyruvate, Valine, Citrate and Ornithine, of which Aspartate showed dominant effects. XGBoost showed the best performance. In a further analysis, we applied the COVRECON (Li, et al., 2023) approach to the metabolomics datasets of the two groups. Through this method, we identified several key metabolic interaction changes between the active and less active groups. Many of these interactions are related to aspartate, which is consistent with the machine learning results. By checking the detailed enzyme information of the highlighted metabolic interactions, we identified several important enzyme regulations. The enzyme AST was related to most of the highlighted interactions. The blood measurements of all individuals across the three time points validate the results. Existing studies have also shown that AST and ALT are highly related to health issues of older adults [ 60 ].

Metabolomics changes for resistance training

As shown in Supplementary Table S2 , we conducted a group difference t-test for the metabolomics measurements.
Alpha Tocopherol shows a significant difference between the nutritional supplement intake group (E) and the other two groups, as it is a component of the supplement FortiFit. The metabolites Linoleic acid, Methionine, Palmitic acid, Succinate and Tyrosine show a significant difference between the control group (K) and the resistance training groups (T & E). Interestingly, this divergence contrasts with the results obtained from the body activity classifiers, suggesting distinct metabolic mechanisms for resistance exercise and endurance exercise. This mechanistic difference between endurance and resistance exercise has been explored previously [ 61 ], where the metabolite changes induced by endurance or resistance exercise were identified as two different modes. Moreover, several studies have reported that endurance exercise, but not resistance exercise, is highly relevant to aging-related questions. Cao Dinh, et al., 2019 reported that, among 100 older women (aged over 65 years), strength endurance training significantly reduced senescence-prone T cells, which are widely recognized as age-related [ 62 ], while intensive training showed no significant influence. In another study of a total of 124 healthy, previously inactive individuals, Weiner, et al., 2019 concluded that endurance but not resistance training has anti-aging effects [ 46 ]. These studies provide additional support for our body activity index and metabolic network analysis.

Aspartate as a blood biomarker for body activity

Aspartic acid is one of the 22 proteinogenic amino acids. It is involved in the malate-aspartate shuttle, which facilitates the transfer of electrons and energy between the cytoplasm and mitochondria, ultimately contributing to the production of ATP and the efficient functioning of cellular energy metabolism [ 63 ]. Thus, it is particularly important in tissues with high energy demands, such as muscle, liver and the heart.
This may account for the larger aspartate metabolism in the “active group”. In line with this, several groups have provided evidence for aspartate as an important supplement for attenuating exercise-induced hyperammonemia and increasing exercise endurance [ 64 , 65 ]. On the other hand, aspartate is involved in the removal of ammonia from the body through the urea cycle [ 66 ]. Exercise can lead to ammonia production as a byproduct of energy metabolism. Aspartate may be used to help detoxify ammonia, potentially altering its levels.

Older adults with better body activity have larger plasticity of the enzymatic liver and muscle system

AST and ALT are two routine blood test enzymes highly related to an individual's liver, muscle and heart health [ 67 ]; AST and ALT levels elevated beyond a specified threshold may indicate medical conditions such as hepatitis, liver disease or myonecrosis. The AST/ALT ratio is a significant sign of liver disease. We plotted the AST/ALT ratio changes over the three time points for the two groups in Supplementary Figure S7. The results showed no significant changes across the time points and groups. This suggests that the AST and ALT variations originate from non-disease-related factors. Furthermore, scientific investigations have provided evidence that physical exercise and improved fitness levels can also lead to a transient elevation of these enzyme levels within a healthy range for individuals without underlying liver issues [ 55 – 57 ]. This exercise-induced transaminase elevation is a well-documented phenomenon, commonly observed in response to vigorous physical activity. It is essential to recognize that these exercise-related increases in AST and ALT levels are typically temporary and return to baseline levels shortly after physical exertion. This indicates larger AST and ALT variations for individuals with better body functionality/activity, as observed in Fig. 8 .
This view is also supported by a long-term study of 29 routine laboratory measurements in 30 athletes, where AST and ALT exhibited significantly larger variations over an 11-month period than those reported for the general population [53, 54].

In conclusion, this is the first study in which we integrate machine learning based statistical analysis with COVRECON inverse Jacobian analysis. In metabolomics analysis, machine learning based statistical methods help to find the key metabolites. For the dynamical analysis, as an alternative to kinetic modeling, which requires fitting many parameters, we demonstrated predictive metabolic interaction modelling using the inverse differential Jacobian approach. This novel approach might be highly relevant for finding the important dynamic regulations that differ between two conditions. By integrating the machine learning results, we obtained a robust approach for the inverse differential Jacobian calculation.

Materials and Methods

Experimental design

This study was performed in 5 retirement homes in Vienna managed by the Curatorship of Viennese Retirement Homes. The aim of this study was to assess the impact of strength training, strength training combined with a protein-vitamin supplement, or cognitive training on very old, institutionalized adults. The study was conducted in a randomized, controlled, observer-blind design. The subjects were randomly assigned to three groups: resistance training (T), resistance training and supplements (E), and cognitive training, acting as a control group (K). The details are presented in Supplementary material S1. Blood samples were collected at baseline (T1), after three months (T2) and after six months (T3). One hundred and seventeen subjects were recruited from five senior residences (Supplementary Figure S1). Inclusion criteria were a minimum level of physical fitness (Short Physical Performance Battery > 4) and mental performance (Mini Mental State Examination ≥ 23).
Moreover, they were free of severe diseases such as diabetic retinopathy and CVDs, and did not regularly use cortisone-containing drugs. Before the start of the intervention, health and nutritional status were assessed by specialists in internal medicine and gerontology [68]. All subjects signed informed consent before inclusion, in accordance with the Declaration of Helsinki. The study was approved by the ethics committee of the City of Vienna (EK-11-151-0811) and registered at ClinicalTrials.gov, NCT01775111 [68].

Subject characteristics

The sex distribution (87.6% women; 12.4% men) among participants was representative of the population living in nursing homes. The mean age of the study population was 82.9 ± 6.0 years for women and 84.9 ± 6.7 years for men. The participants had a BMI of 29.27 ± 5.00 kg/m² [68].

Sample Preparation

Blood Plasma Metabolite Extraction

Several studies have addressed the choice of blood sample type and found that heparin plasma causes less interference in the chromatogram [69, 70]. In accordance with these findings, heparin was used as an anticoagulant, and blood plasma was separated from fresh blood samples and stored at -80 °C for further clinical analysis. Metabolite profiles of the human plasma samples were measured using a gas chromatograph coupled to a mass spectrometer [71]. The samples were thawed on ice for 45 min and vigorously vortexed for 10 s. The extraction consisted of two steps. First, 100 µl of plasma was transferred into 1.5 ml Eppendorf tubes, followed by the addition of 600 µl of ice-cold MeOH. The samples were immediately vortexed for 10 s and left on ice for 15 min for incubation. To remove proteins, the samples were centrifuged at 14000 g for 4 min at 4 °C. The supernatant was transferred into new tubes and dried down in a SpeedVac. Afterwards, the dried pellets were stored at -20 °C. The second step consisted of extraction with CHCl3: 300 µl of CHCl3 was added to the pellets.
The remaining procedure was a repetition of the first step. The supernatant was transferred into new Eppendorf tubes and dried down in a SpeedVac. Metabolite extractions were performed in batches of 30 samples from randomly selected subjects.

Quality control-Mix

The quality control (QC) mix consisted of specific metabolites, including organic acids, amino acids, mono- and disaccharides and substrates of the TCA cycle. The table of metabolites in the QC-Mix is provided in Supplementary material S2. A calibration curve was prepared with volumes of 2 µl, 5 µl, 10 µl, 20 µl, 40 µl, 80 µl and 100 µl. Internal standard (10 µl pinitol and 10 µl sorbitol) was added to each sample and to each QC on the day before GC-MS analysis. Afterwards, the samples were dried in a SpeedVac.

Derivatization

First, 20 µl of a 40 mg mL⁻¹ solution of methoxyamine hydrochloride (MeOX) in pyridine was added to each sample. To dissolve MeOX in pyridine completely, the solution was vigorously vortexed several times and the tube was placed in hot water. After that, samples were vortexed until the pellets were completely dissolved, followed by agitation at 30 °C for 90 min at 750 rpm in a thermoshaker. Each 1 ml vial of N-Methyl-N-(trimethylsilyl) trifluoroacetamide (MSTFA) was spiked with 30 µl of a retention index marker solution of alkanes from C10 to C40 in hexane. After addition of 80 µl of the prepared MSTFA, samples were incubated at 37 °C for 30 min at 750 rpm, followed by centrifugation at 14000 g for 2 min at room temperature (24 °C). Immediately after this step, 70 µl of the supernatant was transferred to GC vials with micro inserts and closed with crimp caps.

GC-MS Analysis

Finally, samples were analysed using GC-MS (LECO Pegasus® 4D GCxGC-TOF-MS, Mönchengladbach, Germany) according to Weckwerth, Wenzel, and Fiehn (2004) [71]. Immediately after derivatization, 1 µl of sample was injected using a split ratio of 1:5.
The split/splitless injector was equipped with a single-tapered liner with deactivated wool and kept at a constant temperature of 230 °C. The GC-MS system comprised an Agilent 6890 gas chromatograph (Agilent Technologies, Glostrup, Denmark) using helium as carrier gas at a flow rate of 1 mL min⁻¹. Chromatographic separation was performed on an HP-5MS column (30 m × 0.25 mm × 0.25 µm, Agilent Technologies). The initial temperature of the GC oven was set to 70 °C isothermal for 1 min, followed by a heating ramp of 9 °C min⁻¹ to 330 °C, which was held for 7 min. The transfer line temperature was 250 °C, and the ion source temperature was set to 200 °C. The MS detector was switched off during the first 260 s. Mass spectra were acquired at a rate of 20 spectra s⁻¹ and recorded in the range of 40 to 600 m/z, using a detector voltage of 1,550 V and electron impact ionization at 70 eV. The liner had to be exchanged every 70 injections, that is, after every 2 consecutive batches.

The whole data acquisition was performed in 14 batches. Each batch was measured in the same chronological order. First, alkanes were injected, followed by the QC calibration curve. Blank samples that contained only dried extraction reagents and derivatization solvents were injected every 5 or 7 samples. Each batch consisted of a single plasma sample from each of 20–30 subjects and was analyzed within 24–32 hours. One pooled sample was measured in each batch in order to assess instrument stability. At the end of every batch, the same QC was measured again to monitor instrumental performance over time. To minimize systematic bias induced by preparation order, samples were randomly distributed across the 14 batches. At the same time, each batch consisted of a representative cross section of the samples and was comparable to the total experimental population.
Metabolite identification

After the mass spectra were obtained from GC-MS, the next step was to transform them into biologically relevant information. The raw data consisted of ion peaks and were preprocessed using LECO Chroma-TOF. The ion fragmentation spectra were matched to fragmentation spectra in the NIST library and scored with a match probability, considering only metabolites with a similarity score of at least 700. Afterwards, analytes were identified by comparison of ion fragments to a reference library of chemical standards. In detail, the metabolites were confirmed based on ion features particular to each metabolite, e.g. 1) retention index and retention time, 2) m/z, and 3) in-source fragmentation. With the latter, the identification of the analytes became definitive. Alkanes measured at the beginning of each batch provided retention indices that were assigned to all ion peaks. The original data are presented in Supplementary material S2.

Data Processing

The data processing involved several steps. Initially, missing values in the metabolomics measurements were imputed using the K-Nearest Neighbors (KNN) method. Subsequently, normalization was performed to reduce heteroscedasticity and adjust for the offset between high and low intensity features as in Eq. (0), in which each metabolite is log2-transformed, centered on the mean of its log-transformed values (x̅), and scaled by its standard deviation (s):

$$\hat{x}_{ij}=\frac{\log_{2}\left(x_{ij}\right)-\overline{\log_{2}\left(x_{i}\right)}}{s} \tag{0}$$

Data Clustering

To identify biomarkers and perform the inverse Jacobian analysis, the samples were first clustered into distinct groups. The clustering process comprised the following steps.
First, based on the information provided in Supplementary Material S2, the physical measurements could be categorized into two types: “body-shape” data (e.g., gender and height) and “body-functional” data (e.g., walking distance and left standing time). In order to generate a body activity index that reflects body functionality while minimizing the influence of body-shape differences, Canonical Correlation Analysis (CCA) [72] was applied. The loadings of this body activity index are presented in Fig. 2A, where walking distance exhibits the strongest effect. The metabolomics-related body activity index generated through CCA was then used to cluster the samples with the k-means method.

Machine Learning based Classifiers

The CCA-based clustering approach analyzes the relationship between the body activity index and the metabolic index with a linear method. It may therefore not fully capture the dynamic nature of the metabolic mechanism, which inherently exhibits predominantly non-linear behavior. To capture this non-linear influence and identify important variables with higher accuracy, several machine learning based classifiers were employed within an automated machine learning framework, implemented using the H2O package in Python. The methods used are as follows:

1, Generalized Linear Models (GLM): GLM implements regularized linear models with stochastic gradient descent (SGD) learning. The model is updated iteratively using a decreasing strength schedule, estimating the loss gradient for one sample at a time. This method offers a baseline for the linear effects.

2, Random Forest Classifier: A random forest is an ensemble meta-estimator that fits multiple decision tree classifiers on different sub-samples of the dataset and uses averaging to improve predictive accuracy and mitigate overfitting.
3–4, Boosting Methods: Boosting is an ensemble meta-algorithm that reduces bias and variance in supervised learning. It comprises a family of machine learning algorithms that convert weak learners into strong ones [73]. The main difference between boosting algorithms is their method of weighting training data points and hypotheses. We employed two common boosting methods, LGBMClassifier and XGBClassifier. LGBMClassifier (GBM) is the classifier of LightGBM, a distributed gradient-boosting framework based on decision tree algorithms, originally developed by Microsoft [74], while XGBClassifier is the classifier of XGBoost (eXtreme Gradient Boosting), an open-source library for regularized gradient boosting [75].

5, Autoencoder + deep learning: Deep learning, also known as deep neural networks, is a powerful machine learning method extensively used in pattern recognition, image processing, and bioinformatics [76]. Prior to training the model, we pre-trained it with an autoencoder on the entire unlabeled data, which improves model performance and avoids random weight initialization.

In our approach, each of these machine learning methods was integrated into an automated framework that includes hyper-parameter optimization. Hyper-parameter optimization is the selection of parameter values that govern the learning process, with the aim of enhancing model performance [77]. Supplementary Figure S5 provides an overview of the hyper-parameter ranges associated with each machine learning method. For evaluation, we randomly generated 25 training/test splits with a training/test ratio of 75/25%.

Feature Importance

Feature importance was estimated using a model-based approach, in which a feature is considered important if it contributes significantly to the model's performance. Here, the ‘varimp’ function of the H2O Python package was used to rank the important metabolites of each classifier.
The importance value was averaged over the 25 training/test splits, and we chose the top 10 metabolites for each machine learning method.

Predictive metabolic modelling using an inverse Jacobian approach

Statistical and machine learning methods have limitations when it comes to understanding the dynamics of a biochemical network, identifying critical regulatory steps, and capturing changes in regulatory mechanisms under different conditions [24]. In recent years, inverse differential Jacobian algorithms have been developed as a convenient approach to infer the dynamic regulation of metabolic networks from metabolomics data [21, 31–34, 41–43, 78]. In previous studies, we introduced the COVRECON workflow and MATLAB toolbox as the standard inverse Jacobian workflow (Weckwerth, 2019; Li, et al., 2023). This method combines the covariance matrix of metabolomics data with automatic metabolic network modeling based on genome-scale metabolic reconstructions and biochemical reaction databases.

Consider a metabolic network that consists of n metabolites denoted by $\{X_i\}_{i=1\dots n}$. The system dynamics can be modeled with the set of ordinary differential equations (ODEs):

$$\frac{d\boldsymbol{M}}{dt}=\boldsymbol{F}\left(\boldsymbol{M}\right)\;\to\;\left\{\begin{array}{l}\frac{dM_{1}}{dt}=f_{1}\left(M_{1},M_{2},\dots,M_{n}\right)\\ \frac{dM_{2}}{dt}=f_{2}\left(M_{1},M_{2},\dots,M_{n}\right)\\ \quad\vdots\\ \frac{dM_{n}}{dt}=f_{n}\left(M_{1},M_{2},\dots,M_{n}\right),\end{array}\right. \tag{1}$$

where $\boldsymbol{M}=\{M_i\}=\{|X_i|\}$ are the concentrations of the n metabolites, and $\boldsymbol{F}=\{f_i(\boldsymbol{M})\}$ are composed of the reaction rates for these metabolites (e.g., Michaelis-Menten kinetics, or mass action).
The steady-state Jacobian matrix $\boldsymbol{J}\in R^{n\times n}$ of the model is defined as the matrix in which $J_{ij}$ is the first-order derivative of the rate $f_i$ with respect to the concentration $M_j$ at steady state, $J_{ij}=\left.\frac{\partial f_i}{\partial M_j}\right|_{steady}$:

$$\boldsymbol{J}=\left.\frac{\partial\boldsymbol{F}}{\partial\boldsymbol{M}}\right|_{steady}=\left[\begin{array}{cccc}\frac{\partial f_{1}}{\partial M_{1}} & \frac{\partial f_{1}}{\partial M_{2}} & \cdots & \frac{\partial f_{1}}{\partial M_{n}}\\ \frac{\partial f_{2}}{\partial M_{1}} & \frac{\partial f_{2}}{\partial M_{2}} & \cdots & \frac{\partial f_{2}}{\partial M_{n}}\\ \vdots & \vdots & \ddots & \vdots\\ \frac{\partial f_{n}}{\partial M_{1}} & \frac{\partial f_{n}}{\partial M_{2}} & \cdots & \frac{\partial f_{n}}{\partial M_{n}}\end{array}\right]_{steady} \tag{2}$$

The steady-state Jacobian matrix contains valuable information about the system's dynamics, including regulatory interactions among the metabolites. As derived in a previous study (Steuer et al., 2003), the following relation, Eq. (3), holds between the covariance matrix of the metabolic data and the steady-state Jacobian matrix:

$$\boldsymbol{J}\boldsymbol{C}+\boldsymbol{C}\boldsymbol{J}^{T}=-2\boldsymbol{D}. \tag{3}$$

Here, $\boldsymbol{C}\in R^{n\times n}$ represents the covariance matrix of the metabolite concentrations $M_j$ around their steady-state values $M_j^{steady}$, while the fluctuation matrix $\boldsymbol{D}$ represents the covariance of the noise sources acting on the system.
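To make Eq. (3) concrete, the following Python sketch solves the forward problem on a toy example: given a known Jacobian J and fluctuation matrix D, it computes the covariance matrix C that satisfies the Lyapunov equation. The three-metabolite linear chain, the rate constants and the diagonal D are hypothetical values chosen for illustration, and the use of SciPy's `solve_continuous_lyapunov` is an assumption of this sketch. It is not part of the COVRECON toolbox, which is implemented in MATLAB.

```python
import numpy as np
from scipy.linalg import solve_continuous_lyapunov


def covariance_from_jacobian(J, D):
    """Solve J C + C J^T = -2 D (Eq. 3) for the covariance matrix C.

    SciPy solves A X + X A^H = Q, so we pass A = J and Q = -2 D.
    """
    return solve_continuous_lyapunov(J, -2.0 * D)


# Hypothetical toy network: -> M1 -> M2 -> M3 -> with mass-action kinetics.
# k1, k2, k3 are made-up rate constants, not values from the study.
k1, k2, k3 = 1.0, 0.5, 2.0
J = np.array([
    [-k1, 0.0, 0.0],   # M1 is consumed at rate k1 * M1
    [k1, -k2, 0.0],    # M2 is produced from M1 and consumed at rate k2 * M2
    [0.0, k2, -k3],    # M3 is produced from M2 and consumed at rate k3 * M3
])

# Hypothetical fluctuation matrix: independent noise on each metabolite.
D = np.diag([0.1, 0.05, 0.02])

C = covariance_from_jacobian(J, D)

# The residual of Eq. (3) should vanish up to numerical precision.
residual = J @ C + C @ J.T + 2.0 * D
print("covariance matrix C:\n", C)
print("max |residual|:", np.abs(residual).max())
```

The inverse problem treated in this work runs in the opposite direction: C is estimated from the data and J is sought. Because C is symmetric, Eq. (3) alone provides fewer independent equations than J has entries, which is one reason why structural information about J is needed.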
The differences between two conditions can be quantified by the differential Jacobian $D\boldsymbol{J}$, which is calculated from the Jacobians of the two groups, here indexed d and h:

$$D\boldsymbol{J}_{ij}=\left\{\begin{array}{ll}\max\left(\left|\frac{\left(\boldsymbol{J}_{d}\right)_{ij}}{\left(\boldsymbol{J}_{h}\right)_{ij}}\right|,\left|\frac{\left(\boldsymbol{J}_{h}\right)_{ij}}{\left(\boldsymbol{J}_{d}\right)_{ij}}\right|\right), & \text{if } \left(\boldsymbol{J}_{h}\right)_{ij}\neq 0,\\ 1, & \text{if } \left(\boldsymbol{J}_{h}\right)_{ij}=0.\end{array}\right. \tag{4}$$

The differential Jacobian $D\boldsymbol{J}$ carries crucial information about the differences in dynamic regulation between two conditions. The inverse problem is to infer the differential Jacobian $D\boldsymbol{J}$ from the measured metabolomics data. This task involves two key aspects: establishing the structural information of the Jacobian matrix and solving the optimization problem associated with the differential Jacobian. In a recent study, we introduced the COVRECON approach and the related MATLAB toolbox (Li, et al., 2023). This approach automatically constructs a metabolic interaction network, which contains the Jacobian structure information, and then calculates a regression loss matrix R* to estimate the differential Jacobian matrix. The result R* is presented as a MATLAB figure in which the details of the interaction pathways can be checked interactively. The details of the algorithm can be found in Supplementary material S1 and the original publication (Li, et al., 2023).
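As a small worked illustration of Eq. (4), the following Python sketch computes the element-wise differential Jacobian from two given group Jacobians. It assumes both Jacobians are already known, which is not the case in the inverse setting, where COVRECON estimates a regression loss matrix instead. The function name `differential_jacobian` and the example matrices are hypothetical. Eq. (4) does not state what happens when only the d-group entry is zero; this sketch returns infinity in that case, which is an assumption.

```python
import numpy as np


def differential_jacobian(J_d, J_h):
    """Element-wise differential Jacobian following Eq. (4).

    Entries where (J_h)_ij == 0 are set to 1. Elsewhere the larger of
    |J_d / J_h| and |J_h / J_d| is returned, so values are always >= 1.
    Assumption: if only (J_d)_ij is zero, the result is inf.
    """
    J_d = np.asarray(J_d, dtype=float)
    J_h = np.asarray(J_h, dtype=float)
    DJ = np.ones_like(J_h)

    mask = J_h != 0  # entries covered by the first case of Eq. (4)
    with np.errstate(divide="ignore"):
        ratio = np.abs(J_d[mask] / J_h[mask])
        DJ[mask] = np.maximum(ratio, 1.0 / ratio)
    return DJ


# Hypothetical 2x2 Jacobians for the two groups (made-up numbers).
J_h = np.array([[-1.0, 0.0],
                [2.0, -4.0]])
J_d = np.array([[-2.0, 0.0],
                [1.0, -4.0]])

DJ = differential_jacobian(J_d, J_h)
print(DJ)
```

In this toy example, the entries (1,1) and (2,1) change by a factor of two in opposite directions, and both map to the same differential value because Eq. (4) takes the maximum of the ratio and its inverse. Large entries therefore flag a change in regulation regardless of its direction.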
By employing the COVRECON approach, we aim to uncover the key components and regulatory interactions within the differential Jacobian, thereby gaining insights into the dynamics of the metabolic network.

Integrate Classifier Biomarkers and Group Differential Jacobian Analysis

Since the samples were clustered into two groups in the data clustering step, we can perform the inverse Jacobian analysis for these two groups. As discussed in Supplementary material S1, and similar to the general approach of most kinetic models, we assume that the dynamics within each group can be described by a single group model, so that the steady-state dynamics can be represented by a group Jacobian. Consequently, the inverse Jacobian algorithm can offer valuable information about the differences in regulated dynamics between the two groups. The results of the inverse Jacobian analysis are closely linked to the structural information of the Jacobian obtained from the automatically generated super-pathway metabolic interaction networks. Importantly, we incorporate the variable importance from the classifiers into the inverse Jacobian analysis: we retain the key biomarkers and add a controlled set of randomly chosen additional metabolites. The augmented networks, comprising 10–20 metabolites, are subsequently subjected to the COVRECON workflow. In the COVRECON results, large values indicate differences in dynamics between the two groups. By checking the detailed information behind these large values, we are able to identify the important reactions or enzymes involved in active aging (Li, et al., 2023).

Declarations

Data availability

The data underlying this article are available in the online supplementary material. The original data of the breast cancer case study can be accessed in reference [54].

Code availability

The MATLAB code is available at https://bitbucket.org/mosys-univie/covrecon/ .
Competing Interests The authors declare no competing interests. Author Contribution W.W., K.H.W., and J.L. conceived the study. J.L. and W.W. developed the method. M.B., B.W., B.F., and E.M.S. implemented and performed the experiments, and J.L., S.W., and I.P. interpreted the results. J.L., W.W., and S.W. wrote the first version of the manuscript. W.W., K.H.W., J.L., and S.W. revised the manuscript. All authors reviewed and approved the final version of the manuscript. Acknowledgments This work was supported by a Ph.D. scholarship provided by the China Scholarship Council (CSC) [grant number: 201806010428 to J.L.]. Open access funding provided by University of References Kohl, H.W., et al., The pandemic of physical inactivity: global action for public health . The lancet, 2012. 380(9838): p. 294–305. Havighurst, R.J., Successful aging. Processes of aging: Social and psychological perspectives, 1963. 1: p. 299–320. WHO, Active ageing: A policy framework . 2002, World Health Organization. Boudiny, K. and D. Mortelmans, A critical perspective: towards a broader understanding of'active ageing' . E-journal of Applied Psychology, 2011. 7(1): p. 8–14. Offerman, J., et al., Attitudes related to technology for active and healthy aging in a national multigenerational survey . Nature Aging, 2023. 3(5): p. 617–625. Wongsala, M., E.-M. Anbäcken, and S. Rosendahl, Active ageing–perspectives on health, participation, and security among older adults in northeastern Thailand–a qualitative study . BMC geriatrics, 2021. 21: p. 1–10. Malkowski, O.S., R. Kanabar, and M.J. Western, Socio-economic status and trajectories of a novel multidimensional metric of Active and Healthy Ageing: the English Longitudinal Study of Ageing . Scientific Reports, 2023. 13(1): p. 6107. Fernández-Ballesteros, R., et al., Active aging: a global goal . 2013, Hindawi. Caprara, M., et al., Active aging promotion: results from the Vital Aging Program. Current Gerontology and Geriatrics Research, 2013. 2013. 
Taylor, A.W., Physiology of exercise and healthy aging . 2022: Human Kinetics. Weckwerth, W., Metabolomics: an integral technique in systems biology . Bioanalysis, 2010. 2(4): p. 829–836. Patti, G.J., O. Yanes, and G. Siuzdak, Metabolomics: the apogee of the omics trilogy . Nature reviews Molecular cell biology, 2012. 13(4): p. 263–269. Balashova, E.E., et al., Metabolome Profiling in Aging Studies . Biology, 2022. 11(11): p. 1570. Gonzalez-Covarrubias, V., E. Martínez-Martínez, and L. del Bosque-Plata, The potential of metabolomics in biomedical applications . Metabolites, 2022. 12(2): p. 194. Bruzzone, C., et al., Metabolomics as a powerful tool for diagnostic, pronostic and drug intervention analysis in COVID-19. Frontiers in Molecular Biosciences, 2023. 10: p. 1111482. Su, Y., et al., Multi-omics resolves a sharp disease-state shift between mild and moderate COVID-19 . Cell, 2020. 183(6): p. 1479–1495. e20. Sindelar, M., et al., Longitudinal metabolomics of human plasma reveals prognostic markers of COVID-19 disease severity . Cell Reports Medicine, 2021. 2(8). Meoni, G., et al., Metabolomic/lipidomic profiling of COVID-19 and individual response to tocilizumab . PLoS Pathogens, 2021. 17(2): p. e1009243. Ghini, V., et al., Serum NMR profiling reveals differential alterations in the lipoproteome induced by pfizer-BioNTech vaccine in COVID-19 recovered subjects and naïve subjects . Frontiers in molecular biosciences, 2022. 9: p. 839809. Panyard, D.J., B. Yu, and M.P. Snyder, The metabolomics of human aging: Advances, challenges, and opportunities . Science Advances, 2022. 8(42): p. eadd6155. Weckwerth, W., Toward a unification of system-theoretical principles in biology and ecology—the stochastic lyapunov matrix equation and its inverse application . Frontiers in Applied Mathematics and Statistics, 2019. 5: p. 29. Weckwerth, W., Green systems biology—from single genomes, proteomes and metabolomes to ecosystems research and biotechnology . 
Journal of proteomics, 2011. 75(1): p. 284–305. Weckwerth, W., Unpredictability of metabolism–the key role of metabolomics science in combination with next-generation genome sequencing . Anal Bioanal Chem, 2011. 400(7): p. 1967–78. Sidak, D., et al., Interpretable machine learning methods for predictions in systems biology from omics data . Frontiers in Molecular Biosciences, 2022. 9: p. 926623. Liebal, U.W., et al., Machine learning applications for mass spectrometry-based metabolomics . Metabolites, 2020. 10(6): p. 243. Pomyen, Y., et al., Deep metabolome: Applications of deep learning in metabolomics . Computational and Structural Biotechnology Journal, 2020. 18: p. 2818–2825. Alakwaa, F.M., K. Chaudhary, and L.X. Garmire, Deep learning accurately predicts estrogen receptor status in breast cancer metabolomics data . Journal of proteome research, 2018. 17(1): p. 337–347. Weckwerth, W., Unpredictability of metabolism—the key role of metabolomics science in combination with next-generation genome sequencing . Analytical and Bioanalytical Chemistry, 2011. 400(7): p. 1967–1978. Weckwerth, W., Metabolomics in systems biology . Annual review of plant biology, 2003. 54(1): p. 669–689. Wienkoop, S., et al., Integration of metabolomic and proteomic phenotypes: analysis of data covariance dissects starch and RFO metabolism from low and high temperature compensation response in Arabidopsis thaliana . Molecular & Cellular Proteomics, 2008. 7(9): p. 1725–1736. Nägele, T., et al., Solving the differential biochemical Jacobian from metabolomics covariance data . PloS one, 2014. 9(4): p. e92299. Wilson, J.L., et al., Inverse data-driven modeling and multiomics analysis reveals phgdh as a metabolic checkpoint of macrophage polarization and proliferation . Cell Reports, 2020. 30(5): p. 1542–1552. e7. Li, J., S. Waldherr, and W. 
Weckwerth, COVRECON: automated integration of genome- and metabolome-scale network reconstruction and data-driven inverse modeling of metabolic interaction networks . Bioinformatics, 2023. 39(7): p. btad397. Li, J., W. Weckwerth, and S. Waldherr, Enzyme fluctuations data improve inference of metabolic interaction networks with an inverse differential Jacobian approach. bioRxiv, 2023: p. 2023.12. 11.570118. King, Z.A., et al., BiGG Models: A platform for integrating, standardizing and sharing genome-scale models . Nucleic acids research, 2016. 44(D1): p. D515-D522. Steuer, R., et al., Structural kinetic modeling of metabolic networks. Proceedings of the National Academy of Sciences, 2006. 103(32): p. 11868–11873. Jamshidi, N. and B.Ø. Palsson, Mass action stoichiometric simulation models: incorporating kinetics and regulation into stoichiometric models . Biophysical journal, 2010. 98(2): p. 175–185. Haiman, Z.B., et al., MASSpy: Building, simulating, and visualizing dynamic biological models in Python using mass action kinetics . PLoS computational biology, 2021. 17(1): p. e1008208. Akbari, A., Z.B. Haiman, and B.O. Palsson, A data-driven approach for timescale decomposition of biochemical reaction networks . Msystems, 2024. 9(2): p. e01001-23. Nägele, T., Metabolic regulation of subcellular sucrose cleavage inferred from quantitative analysis of metabolic functions . Quantitative Plant Biology, 2022. 3: p. e10. Sun, X. and W. Weckwerth, COVAIN: a toolbox for uni-and multivariate statistics, time-series and correlation network analysis and inverse estimation of the differential Jacobian from metabolomics covariance data . Metabolomics, 2012. 8(1): p. 81–93. Kügler, P. and W. Yang, Identification of alterations in the Jacobian of biochemical reaction networks from steady state covariance data at two conditions . Journal of Mathematical Biology, 2014. 68(7): p. 1757–1783. Sun, X., B. Länger, and W. 
Weckwerth, Challenges of inversely estimating Jacobian from metabolomics data. Frontiers in Bioengineering and Biotechnology, 2015. 3: p. 188. Weiszmann, J., et al., Metabolome plasticity in 241 Arabidopsis thaliana accessions reveals evolutionary cold adaptation processes. Plant Physiology, 2023: p. kiad298. Chaturvedi, P., et al., Natural variation in the chickpea metabolome under drought stress. Plant Biotechnology Journal, 2024. Werner, C.M., et al., Differential effects of endurance, interval, and resistance training on telomerase activity and telomere length in a randomized, controlled study. European Heart Journal, 2019. 40(1): p. 34–46. Cao Dinh, H., et al., Strength endurance training but not intensive strength training reduces senescence-prone T cells in peripheral blood in community-dwelling elderly women. The Journals of Gerontology: Series A, 2019. 74(12): p. 1870–1878. Le Couteur, D.G., et al., The association of alanine transaminase with aging, frailty, and mortality. Journals of Gerontology Series A: Biomedical Sciences and Medical Sciences, 2010. 65(7): p. 712–717. Goh, G.B.-B., et al., Age impacts ability of aspartate–alanine aminotransferase ratio to predict advanced fibrosis in nonalcoholic fatty liver disease. Digestive Diseases and Sciences, 2015. 60: p. 1825–1831. Nakajima, K., et al., High aspartate aminotransferase/alanine aminotransferase ratio may be associated with all-cause mortality in the elderly: a retrospective cohort study using artificial intelligence and conventional analysis. in Healthcare. 2022. MDPI. Yamamoto, T., et al., The first report of Japanese patients with asparagine synthetase deficiency. Brain and Development, 2017. 39(3): p. 236–242. Oh, R.C., et al., Mildly elevated liver transaminase levels: causes and evaluation. American Family Physician, 2017. 96(11): p. 709–715. Diaz-Garzon, J., et al., Long-term within- and between-subject biological variation of 29 routine laboratory measurands in athletes. 
Clinical Chemistry and Laboratory Medicine (CCLM), 2022. 60(4): p. 618–628. Diaz-Garzon, J., et al., Long-term within- and between-subject biological variation data of hematological parameters in recreational endurance athletes. Clinical Chemistry, 2023. 69(5): p. 500–509. Pavletic, A.J. and M.E. Wright, Exercise-induced elevation of liver enzymes in a healthy female research volunteer. Psychosomatics, 2015. 56(5): p. 604. Pettersson, J., et al., Muscular exercise can cause highly pathological liver function tests in healthy men. British Journal of Clinical Pharmacology, 2008. 65(2): p. 253–259. Tiller, N.B. and W.W. Stringer, Exercise-induced increases in “liver function tests” in a healthy adult male: Is there a knowledge gap in primary care? Journal of Family Medicine and Primary Care, 2023. 12(1): p. 177. Nunez, D.J., et al., Factors influencing longitudinal changes of circulating liver enzyme concentrations in subjects randomized to placebo in four clinical trials. American Journal of Physiology-Gastrointestinal and Liver Physiology, 2019. 316(3): p. G372-G386. Ruiz, J.R., et al., Physical activity, sedentary time, and liver enzymes in adolescents: the HELENA study. Pediatric Research, 2014. 75(6): p. 798–802. Andy, S.Y. and E.B. Keeffe, Elevated AST or ALT to nonalcoholic fatty liver disease: accurate predictor of disease prevalence? 2003, LWW. p. 955–956. Morville, T., et al., Plasma metabolome profiling of resistance exercise and endurance exercise in humans. Cell Reports, 2020. 33(13). Childs, B.G., et al., Cellular senescence in aging and age-related disease: from mechanisms to therapy. Nature Medicine, 2015. 21(12): p. 1424–1435. Borst, P., The malate–aspartate shuttle (Borst cycle): How it started and developed into a major metabolic pathway. IUBMB Life, 2020. 72(11): p. 2241–2259. Marquezi, M.L., et al., Effect of aspartate and asparagine supplementation on fatigue determinants in intense exercise. 
International Journal of Sport Nutrition and Exercise Metabolism, 2003. 13(1): p. 65–75. Trudeau, F., Aspartate as an ergogenic supplement. Sports Medicine, 2008. 38: p. 9–16. Fibriansah, G., et al., Structural basis for the catalytic mechanism of aspartate ammonia lyase. Biochemistry, 2011. 50(27): p. 6053–6062. Lala, V., M. Zubair, and D.A. Minter, Liver function tests, in StatPearls [Internet]. 2022, StatPearls Publishing. Oesen, S., et al., Effects of elastic band resistance training and nutritional supplementation on physical performance of institutionalised elderly: A randomized controlled trial. Experimental Gerontology, 2015. 72: p. 99–108. Teahan, O., et al., Impact of analytical bias in metabonomic studies of human blood serum and plasma. Analytical Chemistry, 2006. 78(13): p. 4307–4318. Dunn, W.B., et al., Procedures for large-scale metabolic profiling of serum and plasma using gas chromatography and liquid chromatography coupled to mass spectrometry. Nature Protocols, 2011. 6(7): p. 1060–1083. Weckwerth, W., K. Wenzel, and O. Fiehn, Process for the integrated extraction, identification and quantification of metabolites, proteins and RNA to reveal their co-regulation in biochemical networks. Proteomics, 2004. 4(1): p. 78–83. Hardoon, D.R., S. Szedmak, and J. Shawe-Taylor, Canonical correlation analysis: An overview with application to learning methods. Neural Computation, 2004. 16(12): p. 2639–2664. Zhou, Z.-H., Ensemble methods: foundations and algorithms. 2012: CRC Press. Ke, G., et al., LightGBM: A highly efficient gradient boosting decision tree. Advances in Neural Information Processing Systems, 2017. 30. Chen, T. and C. Guestrin. XGBoost: A scalable tree boosting system. in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016. LeCun, Y., Y. Bengio, and G. Hinton, Deep learning. Nature, 2015. 521(7553): p. 436–444. Feurer, M. and F. 
Hutter, Hyperparameter optimization, in Automated machine learning. 2019, Springer, Cham. p. 3–33. Steuer, R., et al., Observing and interpreting correlations in metabolomic networks. Bioinformatics, 2003. 19(8): p. 1019–1026. Tables Table 1 is available in the Supplementary Files section. Additional Declarations No competing interests reported. Supplementary Files Table1.png Table 1, physical measurement difference between three treatment groups SupplementalmaterialS1.pdf SupplementalmaterialS2.xlsx SupplementarymaterialS3S5.zip Editorial decision: Revision requested 30 Mar, 2025 Reviews received at journal 30 Mar, 2025 Reviews received at journal 28 Mar, 2025 Reviewers agreed at journal 11 Mar, 2025 Reviewers agreed at journal 11 Mar, 2025 Reviewers agreed at journal 11 Mar, 2025 Reviewers agreed at journal 10 Mar, 2025 Reviewers agreed at journal 09 Mar, 2025 Reviewers agreed at journal 29 Jan, 2025 Reviewers agreed at journal 26 Jan, 2025 Reviewers agreed at journal 24 Jan, 2025 Reviewers agreed at journal 13 Dec, 2024 Reviews received at journal 13 Nov, 2024 Reviewers agreed at journal 07 Nov, 2024 Reviewers agreed at journal 06 Nov, 2024 Reviewers invited by journal 06 Nov, 2024 Editor assigned by journal 05 Nov, 2024 Submission checks completed at journal 05 Nov, 2024 First submitted to journal 02 Nov, 2024 
{"props":{"pageProps":{"initialData":{"identity":"rs-5377652","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":377816592,"identity":"6130146b-5a98-45c0-aeee-59e4daa791b6","order_by":0,"name":"Jiahang Li","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAABA0lEQVRIiWNgGAWjYBACxmYGNhAtw8/ffICBwYCBh2gtPJIzjiUgazHApwuixeBADooq3FqY25mfPfi5oxao5cw36YKCwzL8DMwPPzDU/MHjMDZzw94zx3kkD/duk55hcJhHsoHNWILhGG5bgH4xk+BtO8bDd+DsNmkegzSgdQxmQNfi08L+TfIvUAvDgZxnYC32B9i/MTD8w6eFx0yat62GR+BADhtQiw0PMNDMGBjb8Gopk5ZtOwAKZGNrkBaJwzzFEol9xji1GPYf3yb5tq1ODhiVD2/z/JGw529v3/jhwzc53FoawNRhJCFmIE7AqYGBQR5C1eFRMgpGwSgYBSMeAADAB0i5sOXNbgAAAABJRU5ErkJggg==","orcid":"","institution":"University of Vienna","correspondingAuthor":true,"prefix":"","firstName":"Jiahang","middleName":"","lastName":"Li","suffix":""},{"id":377816593,"identity":"7ed77a6c-728d-422e-adf6-d485aed2cf80","order_by":1,"name":"Martin Brenner","email":"","orcid":"","institution":"University of Vienna","correspondingAuthor":false,"prefix":"","firstName":"Martin","middleName":"","lastName":"Brenner","suffix":""},{"id":377816594,"identity":"8ac9a597-f255-4b68-abee-99262bfb4da3","order_by":2,"name":"Iro Pierides","email":"","orcid":"","institution":"University of 
Vienna","correspondingAuthor":false,"prefix":"","firstName":"Iro","middleName":"","lastName":"Pierides","suffix":""},{"id":377816595,"identity":"a9ecab8b-bcbd-4db1-ba29-7497c62f0417","order_by":3,"name":"Barbara Wessner","email":"","orcid":"","institution":"University of Vienna","correspondingAuthor":false,"prefix":"","firstName":"Barbara","middleName":"","lastName":"Wessner","suffix":""},{"id":377816596,"identity":"3cfbba46-0c02-4baa-83d8-598d8096a9b4","order_by":4,"name":"Bernhard Franzke","email":"","orcid":"","institution":"University of Vienna","correspondingAuthor":false,"prefix":"","firstName":"Bernhard","middleName":"","lastName":"Franzke","suffix":""},{"id":377816597,"identity":"b65abc48-5b4e-4518-9dfb-4b47bcb4b89a","order_by":5,"name":"Eva-Maria Strasser","email":"","orcid":"","institution":"University of Vienna","correspondingAuthor":false,"prefix":"","firstName":"Eva-Maria","middleName":"","lastName":"Strasser","suffix":""},{"id":377816598,"identity":"03008056-5929-4e4d-a8cb-6549666e0429","order_by":6,"name":"Steffen Waldherr","email":"","orcid":"","institution":"University of Vienna","correspondingAuthor":false,"prefix":"","firstName":"Steffen","middleName":"","lastName":"Waldherr","suffix":""},{"id":377816599,"identity":"29b14477-a8ea-4000-b321-0a867ca101f8","order_by":7,"name":"Karl-Heinz Wagner","email":"","orcid":"","institution":"University of Vienna","correspondingAuthor":false,"prefix":"","firstName":"Karl-Heinz","middleName":"","lastName":"Wagner","suffix":""},{"id":377816600,"identity":"48847d08-5c0f-4e82-b002-90b777557645","order_by":8,"name":"Wolfram Weckwerth","email":"","orcid":"","institution":"University of Vienna","correspondingAuthor":false,"prefix":"","firstName":"Wolfram","middleName":"","lastName":"Weckwerth","suffix":""}],"badges":[],"createdAt":"2024-11-02 
10:08:25","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-5377652/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-5377652/v1","draftVersion":[],"editorialEvents":[{"content":"https://doi.org/10.1038/s41540-025-00580-4","type":"published","date":"2025-09-24T15:57:05+00:00"}],"editorialNote":"","failedWorkflow":false,"files":[{"id":69336249,"identity":"82a94d48-9e9f-4b25-b811-f5726b3d530b","added_by":"auto","created_at":"2024-11-19 10:03:37","extension":"jpg","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":2348208,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eWorkflow of the proposed approach\u003c/strong\u003e. This figure illustrates the key steps of our proposed methodology. In the first stage, we employ Canonical Correlation Analysis (CCA) to derive a body activity index that is strongly associated with the metabolomics data. Subsequently, we perform sample clustering, wherein samples are grouped into 2, 4, and 6 clusters based on this index. To address the non-linear effects of metabolites, we develop a set of automated machine-learning classifiers specifically tailored to metabolomics data, allowing us to identify crucial variables. 
Finally, we employ the COVRECON workflow (Li, 2023) to construct a topological metabolic interaction model for the measured metabolites and to solve the differential Jacobian problem of the system.\u003c/p\u003e","description":"","filename":"Figure1.jpg","url":"https://assets-eu.researchsquare.com/files/rs-5377652/v1/45092b270d5007bb2e08a759.jpg"},{"id":69337103,"identity":"b888b76f-c0cd-4425-9824-b8faa13cd815","added_by":"auto","created_at":"2024-11-19 10:11:37","extension":"jpg","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":2786724,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eBody Activity Index and Metabolomics Index from canonical correlation analysis (CCA) and cluster analysis.\u003c/strong\u003e (a), The scatter plot of the generated Body Activity Index and Metabolomics Index and the loadings of the Body Activity Index. (b), The scatter plots of physical (left) and metabolomics (right) profiles, where all older adults are clustered into two groups based on the Body Activity Index. 
(c), The scatter plots of physical (left) and metabolomics (right) profiles with all samples clustered into 2, 4 and 6 groups.\u003c/p\u003e","description":"","filename":"Figure2.jpg","url":"https://assets-eu.researchsquare.com/files/rs-5377652/v1/4b4d78ae7f81b29245a971f6.jpg"},{"id":69337275,"identity":"0201eab9-1aa7-48ab-aae3-30dc265a6827","added_by":"auto","created_at":"2024-11-19 10:19:37","extension":"jpg","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":2537991,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eResults of the different machine learning classifiers.\u003c/strong\u003e (A) Average AUC on 20 hold-out test sets of the XGBoosting algorithm (0.9144) against four other machine learning algorithms for the prediction of the two body activity index groups from metabolomics data: Distributed Random Forest and Extremely Randomized Trees (DRF) (0.8804), Generalized Linear Model with regularization (GLM) (0.8719), H2O GBM (GBM) (0.8982) and DeepLearning models (0.8921). For each algorithm, we assess the effect of sample size by building a separate classifier with one quarter of the training set removed. (B) Bipartite graph of the top metabolites extracted from the five machine-learning algorithms. For each algorithm, we keep a metabolite if it is identified more than 10 times among the top ten metabolites over the 20 hold-out test sets. (C) The variable importance for the XGBoosting algorithm in one hold-out test set. (D) SHAP summary plot of the XGBoosting algorithm in one hold-out test set. 
It shows the contribution of the features for each instance (row of data).\u003c/p\u003e","description":"","filename":"Figure3.jpg","url":"https://assets-eu.researchsquare.com/files/rs-5377652/v1/5b46a7024dda4baa3e147356.jpg"},{"id":69336252,"identity":"f0a77f8e-aba5-4912-845f-d78cca016f0e","added_by":"auto","created_at":"2024-11-19 10:03:37","extension":"jpg","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":4203453,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003e(A) Heatmap of the Pearson correlation matrix between the measured physical data and metabolomics data and (B) comparison of metabolite concentrations between the two groups.\u003c/strong\u003e\u003c/p\u003e","description":"","filename":"Figure4.jpg","url":"https://assets-eu.researchsquare.com/files/rs-5377652/v1/3170faad96a238e5c1a3f71a.jpg"},{"id":69337104,"identity":"e9e4aa70-f1b7-4b63-8e4b-2e133cb69016","added_by":"auto","created_at":"2024-11-19 10:11:37","extension":"jpg","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":386480,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eHeatmaps of the Pearson correlation matrix of the changes in all body features and metabolomics measurements within the two time intervals (left: first 3 months; right: following 3 months).\u003c/strong\u003e\u003c/p\u003e","description":"","filename":"Figure5.jpg","url":"https://assets-eu.researchsquare.com/files/rs-5377652/v1/263a06467faad8a2cec71443.jpg"},{"id":69337106,"identity":"2b58912e-b0e4-4f5b-80db-bc233b5e3c03","added_by":"auto","created_at":"2024-11-19 10:11:37","extension":"jpg","order_by":6,"title":"Figure 6","display":"","copyAsset":false,"role":"figure","size":7853174,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eBiochemical pathway network reconstruction and inverse Jacobian calculation repeats with 12 different local Jacobian structures using the COVRECON tool.\u003c/strong\u003e The detailed 
information on each metabolic interaction can be inspected interactively in the MATLAB figure file provided in the Supplementary Material. An Excel file listing all interaction information is also available in the Supplementary Material.\u003c/p\u003e","description":"","filename":"Figure6.jpg","url":"https://assets-eu.researchsquare.com/files/rs-5377652/v1/89025ddbd890b7b02dfc06d9.jpg"},{"id":69337277,"identity":"54ef73d4-3499-434b-b834-289ab3bcb51c","added_by":"auto","created_at":"2024-11-19 10:19:37","extension":"jpg","order_by":7,"title":"Figure 7","display":"","copyAsset":false,"role":"figure","size":2883540,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eThe overall inverse Jacobian results integrating all 200 local calculation repeats.\u003c/strong\u003e Figure 7A shows the average inverse Jacobian matrix; Figure 7B plots only the highlighted metabolic interactions with an inverse Jacobian value above 0.5 (scaled to 0-1); Figure 7C lists the most relevant enzymes, where Aspartate Transaminase appears in 11 of the 15 highlighted interactions, with an accumulated value of 6.2681.\u003c/p\u003e","description":"","filename":"Figure7.jpg","url":"https://assets-eu.researchsquare.com/files/rs-5377652/v1/e82d8956733d882a979f968a.jpg"},{"id":69337276,"identity":"35ffac78-a68b-4afb-9614-20fa628fed7a","added_by":"auto","created_at":"2024-11-19 10:19:37","extension":"jpg","order_by":8,"title":"Figure 8","display":"","copyAsset":false,"role":"figure","size":1579173,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eThe enzyme measurements for the two enzymes identified in the inverse Jacobian results: (A) Aspartate Transaminase (AST) and (B) Glutamic-Pyruvic Transaminase (ALT). \u003c/strong\u003eThe enzymes were measured at three time points: first measurement, 3 months later and 6 months later. 
Enzyme concentrations are compared between the two groups (physically active and less active) and across time points; significant differences are highlighted.\u003c/p\u003e","description":"","filename":"Figure8.jpg","url":"https://assets-eu.researchsquare.com/files/rs-5377652/v1/78cbafb9ed574951c879d9dd.jpg"},{"id":92430464,"identity":"4930ad1f-54ae-49bd-a411-bb228a40a94c","added_by":"auto","created_at":"2025-09-29 16:05:04","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":26028734,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-5377652/v1/8f861655-a04c-435e-a80f-e4982d28445a.pdf"},{"id":69336248,"identity":"c08876d6-ea41-4283-a462-ac75bb875039","added_by":"auto","created_at":"2024-11-19 10:03:36","extension":"png","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":6845,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eTable 1, physical measurement difference between three treatment groups\u003c/strong\u003e\u003c/p\u003e","description":"","filename":"Table1.png","url":"https://assets-eu.researchsquare.com/files/rs-5377652/v1/266ddd6b95cc5c8ade9abd55.png"},{"id":69336255,"identity":"32bf57a3-011e-4111-86e6-d9970a96148f","added_by":"auto","created_at":"2024-11-19 10:03:37","extension":"pdf","order_by":2,"title":"","display":"","copyAsset":false,"role":"supplement","size":2022619,"visible":true,"origin":"","legend":"","description":"","filename":"SupplementalmaterialS1.pdf","url":"https://assets-eu.researchsquare.com/files/rs-5377652/v1/60a00282dae31261a8674693.pdf"},{"id":69336251,"identity":"a070e10a-05e5-46d9-a35e-44a0012748b0","added_by":"auto","created_at":"2024-11-19 
10:03:37","extension":"xlsx","order_by":3,"title":"","display":"","copyAsset":false,"role":"supplement","size":406345,"visible":true,"origin":"","legend":"","description":"","filename":"SupplementalmaterialS2.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-5377652/v1/10e9bd77ec01d9db7e6dddcc.xlsx"},{"id":69337107,"identity":"c3a602cd-1880-49b8-96e0-55d3971aa855","added_by":"auto","created_at":"2024-11-19 10:11:37","extension":"zip","order_by":4,"title":"","display":"","copyAsset":false,"role":"supplement","size":2442290,"visible":true,"origin":"","legend":"","description":"","filename":"SupplementarymaterialS3S5.zip","url":"https://assets-eu.researchsquare.com/files/rs-5377652/v1/fb3ed64edd2d5e37e6346643.zip"}],"financialInterests":"No competing interests reported.","formattedTitle":"Machine learning and data-driven inverse modeling of metabolomics unveil key process of active aging","fulltext":[{"header":"Introduction","content":"\u003cp\u003ePhysical inactivity is a worldwide health problem and is ranked as the fourth leading behavioral risk factor for global mortality [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e]. The imperative to maintain body activity, physically and metabolically, is on the rise. The concept of active aging, inspired by Robert Havighurst's activity theory [\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e], suggests that maintaining an active lifestyle is crucial for the well-being of older individuals. The idea of active aging emerged and began to develop in the 1990s, placing strong emphasis on the link between activity and health [\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e]. This focus became particularly pertinent due to the worldwide aging population, leading to concerns about inactivity permeating various social domains [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e]. 
With the transition into the 2020s, there has been an escalating emphasis on harnessing technology to foster healthy aging [\u003cspan additionalcitationids=\"CR6\" citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e]. Beyond longevity, active aging encompasses regular physical activity, better management of chronic diseases, and improved quality of life [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eConventionally, several studies have examined various physical aspects of active aging, such as sleep, sedentary time, muscle strengthening activities, and movement behaviors [\u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e, \u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e]. As a newer diagnostic technology, metabolomics involves the comprehensive analysis of small-molecule metabolites (\u0026lt;\u0026thinsp;10 kDa) present in a biological sample, including metabolic intermediates, hormones, signaling molecules, and secondary metabolites [\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e, \u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e]. As the end products of the biological processes in the body, metabolites play a pivotal role in energy generation and signal transmission, and carry essential information about the body's status and ongoing functions. Consequently, important metabolites have the potential to serve as aging biomarkers or as integral components of a metabolic signature. This signature mirrors the active state of the organism as it traverses the aging process [\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e]. 
The development of metabolomics empowers us to scrutinize health issues at the molecular level [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e]. Notably, amid the COVID-19 pandemic, metabolomics has demonstrated its potency as a diagnostic, prognostic, and drug intervention tool [\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e]. As expected, COVID-19 has been extensively investigated using metabolomics methodologies, contributing to biomarker studies [\u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e, \u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e] and evaluations of drug impacts [\u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e, \u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e19\u003c/span\u003e]. Beyond the diagnosis of specific diseases, metabolomics can also improve our understanding of bodily activity (active aging). Recent work has applied metabolic profiling in aging studies [\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e, \u003cspan citationid=\"CR20\" class=\"CitationRef\"\u003e20\u003c/span\u003e], providing overarching insights into metabolic changes during the aging process. Here we focus specifically on the metabolomics profiling of older adults and aim to detect key biomarkers and important metabolic interactions related to active aging.\u003c/p\u003e \u003cp\u003eThe emerging large-scale datasets from OMICS (metabolomics, proteomics, transcriptomics and genomics) measurements empower us to scrutinize any question in biology from a systemic perspective [\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e, \u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e22\u003c/span\u003e]. 
In the field of systems biology, a central goal is to identify biomarkers and infer biochemical regulations from large-scale metabolomics data [\u003cspan citationid=\"CR23\" class=\"CitationRef\"\u003e23\u003c/span\u003e]. Statistical methods, especially when combined with machine learning techniques, have proven effective at constructing accurate classifiers capable of distinguishing between diverse sample groups and revealing underlying biomarkers [\u003cspan additionalcitationids=\"CR25 CR26\" citationid=\"CR24\" class=\"CitationRef\"\u003e24\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e27\u003c/span\u003e]. However, statistical methods offer limited insights into how information is transferred within a biochemical network, the critical regulatory steps involved, and how regulatory mechanisms change under different conditions [\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e, \u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e, \u003cspan citationid=\"CR28\" class=\"CitationRef\"\u003e28\u003c/span\u003e, \u003cspan citationid=\"CR29\" class=\"CitationRef\"\u003e29\u003c/span\u003e]. Several studies have emphasized the necessity of analyzing the dynamic behavior of metabolism to understand the evolution and maintenance of stable metabolic homeostasis under varying environmental conditions [\u003cspan additionalcitationids=\"CR31 CR32 CR33\" citationid=\"CR30\" class=\"CitationRef\"\u003e30\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR34\" class=\"CitationRef\"\u003e34\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eMathematically, kinetic models can provide systemic insights into metabolic networks, but constructing these models and estimating their parameters is challenging, particularly for large-scale models [\u003cspan citationid=\"CR35\" class=\"CitationRef\"\u003e35\u003c/span\u003e]. 
In recent years, several studies have investigated the steady-state Jacobian of metabolomics data [\u003cspan additionalcitationids=\"CR37 CR38 CR39\" citationid=\"CR36\" class=\"CitationRef\"\u003e36\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR40\" class=\"CitationRef\"\u003e40\u003c/span\u003e] by integrating fluxomics or time-series measurements. In addition, recent studies have developed inverse differential Jacobian algorithms that require only metabolomics measurements from a large number of samples and provide a convenient way to infer differences in the dynamics of metabolic networks between conditions [\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e, \u003cspan additionalcitationids=\"CR32 CR33\" citationid=\"CR31\" class=\"CitationRef\"\u003e31\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR34\" class=\"CitationRef\"\u003e34\u003c/span\u003e, \u003cspan additionalcitationids=\"CR42 CR43 CR44\" citationid=\"CR41\" class=\"CitationRef\"\u003e41\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR45\" class=\"CitationRef\"\u003e45\u003c/span\u003e]. Among them, the most recent work introduced a method and workflow termed COVRECON for analyzing key biochemical regulations by solving a differential Jacobian problem [\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e, \u003cspan citationid=\"CR33\" class=\"CitationRef\"\u003e33\u003c/span\u003e, \u003cspan citationid=\"CR34\" class=\"CitationRef\"\u003e34\u003c/span\u003e]. The COVRECON approach integrates the covariance matrix of metabolomics data with automatic metabolic network modeling based on genome-scale metabolic reconstructions and biochemical reaction databases.\u003c/p\u003e \u003cp\u003eFigure \u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003e illustrates the workflow of this study, which consists of three main steps. 
In step 1, we cluster the original samples into groups based on physical and functional measurements, where each group represents a different body activity condition. Step 2 involves building machine learning-based classifiers to identify these groups from metabolomics data, thus enabling the identification of key metabolites as biomarkers. Finally, in step 3, we employ inverse Jacobian analysis and the COVRECON workflow to uncover the most important biochemical regulations associated with the identified body activity conditions. With this approach, we intend to contribute to the understanding of active aging from a metabolomics perspective and shed light on the key biochemical regulations underlying different fitness conditions.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e"},{"header":"Results","content":"\u003cp\u003eThis study was performed in 5 retirement homes in Vienna managed by the Curatorship of Viennese Retirement Homes, with the main aim of assessing the impact of resistance training and protein-vitamin supplementation or cognitive training on physical performance (Oesen et al. 2015). In this secondary analysis, we focus on the plasma metabolomics changes and the identification of potential biomarkers and biochemical processes for fitness. The cohort of older adults, with an average age above life expectancy, consisted of 117 participants at baseline; altogether we measured 263 plasma metabolomic samples.\u003c/p\u003e \u003cp\u003eThe subjects were randomly assigned to three groups (see supplementary figure \u003cspan refid=\"MOESM1\" class=\"InternalRef\"\u003eS1\u003c/span\u003e): resistance training (T), resistance training and supplements (E) and cognitive training, acting as a control group (K). 
Blood samples were collected at baseline (T1), after three months (T2) and after six months (T3).\u003c/p\u003e \u003cp\u003eTo establish the relationship between body activity, fitness and metabolomics profiles, we initially investigated the physical data measurements, which consisted of two types: body functionality and body shape. Moreover, the body strength measurements can be further divided into resistance exercise and endurance exercise types. Table\u0026nbsp;1 shows the differences in the physical measurements across the three groups. As expected, compared to the control group (K), the resistance training groups (T and E) exhibited better resistance measurements. However, there was no effect on endurance measurements (e.g. walking distance). Notably, endurance exercise has been reported to be more closely related to body aging conditions than resistance measurements [\u003cspan citationid=\"CR46\" class=\"CitationRef\"\u003e46\u003c/span\u003e, \u003cspan citationid=\"CR47\" class=\"CitationRef\"\u003e47\u003c/span\u003e]. This is also consistent with the experimental design, in which the older adults were randomly assigned to the three groups regardless of their fitness.\u003c/p\u003e \u003cp\u003e \u003cb\u003eCanonical Correlation Analysis (CCA)-based clustering to assess physical fitness in a cohort of older institutionalized adults\u003c/b\u003e \u003c/p\u003e \u003cp\u003eSince our question concerns the relationship between metabolomics and body activity, we employed Canonical Correlation Analysis (CCA) to generate a body activity index based on the body functionality dataset. 
Subsequently, we clustered the older adults and samples into two, four, and six groups based on this body activity index.\u003c/p\u003e \u003cp\u003eAs demonstrated in Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eA, the generated body activity index is highly correlated with the metabolomics index (Pearson Coeff\u0026thinsp;=\u0026thinsp;0.8471); the CCA loadings of the body activity index are listed alongside. Among all physical indexes, walking distance had the most dominant effect within the body activity index. This observation is biologically reasonable since walking distance directly reflects an individual's endurance condition, which is directly related to the aging process [\u003cspan citationid=\"CR46\" class=\"CitationRef\"\u003e46\u003c/span\u003e, \u003cspan citationid=\"CR47\" class=\"CitationRef\"\u003e47\u003c/span\u003e].\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eConsidering the potential non-linear relationship between the generated body activity index and the metabolomics index, we constructed an automated machine-learning classifier using the XGBoosting algorithm as described in the Methods section. The classifier was trained with a maximum of 100 models over 25 random training-test splits; the average AUCs on the 25% hold-out test sets were 93.76%, 83.47%, and 61.75% for the two-, four-, and six-group clusterings, respectively (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eC). This indicates that the CCA-generated body activity index and metabolomics index are strongly related, although the separation weakens as the number of groups increases. 
We also grouped all the older adults into two groups for the inverse Jacobian analysis using the mean body activity index, as shown in Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eB.\u003c/p\u003e \u003cp\u003eFor comparison, we also performed CCA between the metabolomics data and body shape features, such as gender, height, and age. The biplots of the CCA from the metabolomics and body shape analysis are presented in Supplementary Figure \u003cspan refid=\"MOESM2\" class=\"InternalRef\"\u003eS2\u003c/span\u003e. The highest Spearman's correlation coefficient obtained was only 0.4963, for the age index. Additionally, we conducted a further CCA considering the metabolomics data along with both body functionality and body shape data. However, the Pearson correlation coefficient increased only marginally, from 0.8773 to 0.8903. This indicates that the metabolomics data are primarily influenced by the body strength/functionality aspects, which supports both the body activity index and the metabolomics index that we developed.\u003c/p\u003e \u003cp\u003eThe results of the CCA-based cluster analysis highlight the strong relationship between the derived body activity index and the metabolomics index. The dominance of walking distance as a key factor indicates its significance as a reflection of an individual's health condition and metabolic activity. In the following analysis, we focus on the two groups of older adults clustered based on the body activity index, labelled as the active group and the less-active group.\u003c/p\u003e \u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003eMachine learning-based classifiers and variable importance reveal a strong association between metabolites and fitness\u003c/h2\u003e \u003cp\u003eIn this section, we built several machine learning-based classifiers to predict the active/less-active groups from the metabolomics dataset. 
This approach can provide us valuable insights on the nonlinear influence between the metabolomics index and body activity index. As mentioned in the methods part, we evaluated the predicting performance of five machine learning algorithms: XGBoosting, DRF, GLM, GBM, and DeepLearning algorithms. We applied the automatic machine learning framework and selected the best model for each algorithm based on cross-validation AUC values. The chosen models were then assessed on the 25% hold-out test dataset. As shown in Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003e, the classifiers performances were compared among the five Machine Learning algorithms.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eFigure \u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eA illustrates the averaged AUC values calculated on the 25% hold-out test sets for each algorithm across the 25 random train-test separations. The detailed AUC results are listed in Supplementary material S2. XGBoosting achieved the highest performance with an average AUC value of 0.9150. This result was statistically significant (Wilcoxon signed-rank test P\u0026thinsp;\u0026lt;\u0026thinsp;0.01) compared to the generalized linear models (GLM) that had an average AUC value of 0.8695. The superior performance of XGBoosting suggests the presence of non-linear effects originating from the metabolic network systems. To assess the effect of sample size on various classifiers\u0026rsquo; performance, we randomly removed a quarter of the training sets and evaluated the five algorithms. The AUC accuracy of each algorithm hardly changed. Surprisingly, DRF, GLM, and Deep Learning showed improved AUC accuracy with fewer samples. 
This effect may be attributed to the presence of outlier samples in the original dataset, which introduced noise during the training process, resulting in poorer performance.\u003c/p\u003e \u003cp\u003eIn order to assess the importance of metabolites directly related to the two body activity groups, we ranked the metabolites extracted from the five algorithms based on the testing dataset. We identified the top 10 metabolites for each algorithm by calculating the average variable importance across the 25 repeats. The algorithm-metabolite bipartite graph is shown in Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eB, where Aspartate, Proline, Fructose, Pyruvate and Malic Acid were consistently identified as the top metabolites across almost all classifiers. The detailed metabolite importance values of each algorithm are presented in Supplementary material S2.\u003c/p\u003e \u003cp\u003eFor a better understanding of the variable importance results, we applied a multi-algorithm auto-machine learning approach, including all five algorithms with a maximum of 100 models, using the 'automl' function in the H2o.py package. XGBoosting demonstrated the best performance, as shown in Supplementary Table \u003cspan refid=\"MOESM1\" class=\"InternalRef\"\u003eS1\u003c/span\u003e. The Pareto front plot in Supplementary Figure \u003cspan refid=\"MOESM3\" class=\"InternalRef\"\u003eS3\u003c/span\u003e determined the optimal subset classifier, which included XGBoosting and GBM classifiers, highlighting the superiority of boosting methods for this task. Figure\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eC and \u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eD present the variable importance and SHAP summary plot for the leading XGBoosting classifier on the test set. The analysis revealed that Aspartate was the most important metabolite, accounting for over 90% of the importance. 
This highlights the direct influence of the metabolomics aspect on the body activity index. The Spearman's correlation heatmap shown in Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003e further supports this observation, with Aspartate exhibiting the most significant correlation with body strength data. Although other metabolites, such as Proline, Malic Acid, and Pyruvate, had lower importance values, they consistently appeared among the top 10 metabolites across different classifiers. In Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003e, we also performed t-tests for all metabolites between the two groups and plotted the significant differences. Interestingly, these did not fully match the classifier results; for example, Pyruvate was identified as a key metabolite by all classifiers but did not show a significant group difference. This may suggest that the effect of Pyruvate is non-linear between the two groups. In addition, as shown in Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eD, the SHAP plot of the classifier top metabolites still shows good separation between the two groups, albeit with less pronounced distinctions compared to Aspartate. This further indicates that they play a role in reflecting non-linear metabolic effects on the body activity index.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eWe chose the eight most important metabolites (Aspartate, Proline, Fructose, Malic Acid, Pyruvate, Valine, Citrate and Ornithine) and mapped them to the KEGG pathways, as shown in Supplementary Figure S4. Aside from a few large comprehensive pathways, the top metabolites identified in the classifier results are most closely related to Central carbon metabolism in cancer and 2-Oxocarboxylic acid metabolism. 
However, this mapping reveals only a surface-level connection between active aging and these pathways, which falls short of providing a comprehensive understanding of the underlying biochemical regulations of the active aging dynamics.\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003ePredictive inverse metabolic interaction modelling using the COVRECON platform\u003c/h3\u003e\n\u003cp\u003eWhile the machine learning and classifier results provide insights into the variable importance between the measured metabolites and the body activity index, this does not explain the mechanistic change between the two groups. Since metabolomics analysis was performed three times for each old adult (at the first time point, after 3 months and after 6 months), we plotted the correlation heatmap of the changes in all body features and metabolomics measurements within the two time intervals in Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003e. It is evident that the correlation patterns within the metabolomics measurement changes show high similarity. This reflects the internal dynamics of the metabolic networks. Nevertheless, when we check highly correlated metabolites, we often find no biochemical reaction between the two metabolites in any database. For example, in Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003e, Threonine, Tyrosine and Valine show a high correlation, yet no direct biochemical reactions occur among them. This is because the high correlations originate from the network dynamics. 
Thus, finding the causal interactions among the metabolites is crucial.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eIn recent years, inverse differential Jacobian algorithms have been developed, providing a convenient way to infer causal dynamics of metabolic networks from metabolomics data (N\u0026auml;gele, et al., 2014; Sun and Weckwerth, 2012; Weckwerth, 2019; Wilson, et al., 2020; Li, et al., 2023). Besides the metabolomics measurements, metabolic reconstruction is used as complementary information to build a topological model for metabolic interaction network. Based on this, we have developed the COVRECON toolbox (available at: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://bitbucket.org/mosys-univie/covrecon/src/main/\u003c/span\u003e\u003cspan address=\"https://bitbucket.org/mosys-univie/covrecon/src/main/\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e) (Li, et al., 2023). As shown in the method part, we applied the COVRECON workflow to the two group datasets. The COVRECON workflow consists of two steps: building the metabolic interaction network and the inverse Jacobian calculation.\u003c/p\u003e \u003cp\u003eAs described in COVRECON (Li, et al., 2023), we used a default setting in the Sim-Network part to generate a metabolic super-pathway network of the measured metabolites. Each edge in the network represents a feasible pathway between two nodes (metabolites) and reflects a non-zero component in the system Jacobian matrix. The default setting assigns a fixed weight of one to each reaction, and the reverse reaction weight is based on the log value of its delta Gibbs free energy. Additionally, a pathway-steps limitation of 4 is set. Detailed information about reactions, enzymes and genes of the resulting metabolic interaction network can be found in the Supplemental Material S3. 
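\u003c/p\u003e \u003cp\u003eThe structural idea behind the Sim-Network step (an edge between two measured metabolites whenever a pathway of at most four reaction steps connects them) can be illustrated with a breadth-first search. The sketch below is not the COVRECON implementation: the function name and the toy reaction list are hypothetical, reaction weights and Gibbs free energies are ignored, and the rule that a path stops at the first measured metabolite it reaches is an assumption made for this example.\u003c/p\u003e

```python
from collections import deque


def structure_edges(reactions, measured, max_steps=4):
    """Directed edges (a, b) between measured metabolites such that b is reachable
    from a in at most max_steps reaction steps via unmeasured intermediates."""
    graph = {}
    for substrate, product in reactions:
        graph.setdefault(substrate, set()).add(product)
    measured = set(measured)
    edges = set()
    for start in measured:
        queue = deque([(start, 0)])
        seen = {start}
        while queue:
            node, steps = queue.popleft()
            if steps == max_steps:
                continue  # step budget used up along this path
            for nxt in graph.get(node, ()):
                if nxt in seen:
                    continue
                seen.add(nxt)
                if nxt in measured:
                    edges.add((start, nxt))  # path ends at a measured metabolite
                else:
                    queue.append((nxt, steps + 1))
    return edges


# Hypothetical toy network: A reaches B in 2 steps, B reaches C only in 5 steps.
toy_reactions = [('A', 'x1'), ('x1', 'B'), ('B', 'x2'), ('x2', 'x3'),
                 ('x3', 'x4'), ('x4', 'x5'), ('x5', 'C')]
toy_edges = structure_edges(toy_reactions, ['A', 'B', 'C'])
```

\u003cp\u003eEach returned edge corresponds to an entry of the Jacobian structure matrix that is allowed to be non-zero; with the step limit of 4 the toy network keeps only the edge from A to B.\u003c/p\u003e \u003cp\u003e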
By integrating the covariance of the metabolomics data from both groups and the Jacobian structure matrix, we can perform the inverse Jacobian analysis in the second part of COVRECON toolbox. The COVRECON workflow and toolbox address the ill-conditioned matrix problem associated with the inverse Jacobian approach through a regression loss-based algorithm, significantly improving its stability and feasibility [\u003cspan citationid=\"CR33\" class=\"CitationRef\"\u003e33\u003c/span\u003e, \u003cspan citationid=\"CR34\" class=\"CitationRef\"\u003e34\u003c/span\u003e, \u003cspan citationid=\"CR43\" class=\"CitationRef\"\u003e43\u003c/span\u003e]. However, given that the inverse Jacobian approach is based on the Jacobian structure and is more reliable in smaller-sized models, we selected a tailored core part of the whole model containing 10\u0026ndash;20 metabolites based on the classifier variable importance results as described in method part. The same network reduction strategy as in Sim-Network was employed, with additional indirect connections added to the reduced model. For example, an additional connection from Proline to Aspartate was added to account for the indirect effects through the connections from Proline to Asparagine and from Asparagine to Aspartate (Fig.\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003e). Figure\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003e presents 12 typical results in the repeated calculation. All the repeated results are available in Supplementary material S6. It is evident that even though the local results are different due to the influence from the Jacobian structure information, the Inverse Jacobian approach shows stability on several highlighted metabolic interactions. 
For example, the interactions Proline-\u0026gt;Aspartate, Ornithine-\u0026gt;Aspartate, Citrate-\u0026gt;Aspartate and Glutamate-\u0026gt;2-oxo glutaric acid are high valued in the resulted differential metabolic interaction network of many repeats. To present the overall metabolic interaction importance, we integrated all the 200 local results into the full differential Jacobian (DJ) by calculating the average value of each metabolic interaction within the repeats. The final R* matrix and the differential interaction network are presented in Fig.\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003eA \u0026amp; \u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003eB respectively. In Fig.\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003eB, we plot only the highlighted metabolic interactions with calculated value (scaled to 0\u0026ndash;1) above 0.5. Here we note, the result showed robustness, with similar overall R* using 100, 200 and 500 repeats. Further results are using 200 repeats.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eThrough this COVRECON approach, we are able to find several important perturbed metabolic interactions between the two body activity index clustered groups. The highlighted interactions and the detailed reactions, enzymes and gene information are presented in Supplementary material S4. These findings provide valuable insights into the regulatory interactions and dynamics of the metabolic network related to Aspartate, further supporting its importance as the dominant biomarker in the classifiers results. As shown in Fig.\u0026nbsp;\u003cspan refid=\"Fig7\" class=\"InternalRef\"\u003e7\u003c/span\u003eC, several reactions are consistently identified in several highlighted metabolic interactions. 
Among these, the enzyme aspartate transaminase (AST, EC number 2.6.1.1) is identified in 11 out of the 15 highlighted interactions and appears in all of the largest-valued interactions: Proline-\u0026gt;Aspartate, Valine-\u0026gt;Aspartate, Citrate-\u0026gt;Aspartate and Glutamate-\u0026gt;2-oxo glutaric acid. The enzyme Glutamic-Pyruvic Transaminase (ALT, EC number: 2.6.1.2) is also highlighted. Notably, both AST and ALT are important enzymes in amino acid metabolism, and there are recent indications of their involvement in health-related issues of older adults [\u003cspan additionalcitationids=\"CR49\" citationid=\"CR48\" class=\"CitationRef\"\u003e48\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR50\" class=\"CitationRef\"\u003e50\u003c/span\u003e]. Furthermore, the enzyme asparagine synthetase B (EC number: 6.3.5.4) was identified in 8 out of the 15 highlighted interactions. This enzyme is less studied with regard to health issues of elderly people. However, asparagine synthetase (ASNS) deficiency was recently discovered as a metabolic disorder of non-essential amino acids [\u003cspan citationid=\"CR51\" class=\"CitationRef\"\u003e51\u003c/span\u003e]. Moreover, it is evident that most identified enzymes in Fig.\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003ec belong to the enzyme class of transaminases (EC:2.6.1.-). The transaminase enzymes are important in the production of various amino acids, and measuring the concentrations of various transaminases in the blood is important in diagnosing and tracking many diseases [\u003cspan citationid=\"CR52\" class=\"CitationRef\"\u003e52\u003c/span\u003e].\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eFor a further analysis of the enzymes, we conducted routine blood test measurements of the old adults across the three time points. Four metabolic enzymes were measured: AST, ALT, Gamma-glutamyltransferase (GGT) and Creatine Kinase (CK). 
The data measurements are presented in Supplementary material S2. As shown in Fig.\u0026nbsp;\u003cspan refid=\"Fig8\" class=\"InternalRef\"\u003e8\u003c/span\u003e and Supplementary Figure S6, we compared the enzyme measurements between the two groups (active/less active). The results suggested significant differences in AST and ALT, while GGT and CK did not exhibit such significant variations. This observation validates the inverse Jacobian results in Fig.\u0026nbsp;\u003cspan refid=\"Fig7\" class=\"InternalRef\"\u003e7\u003c/span\u003e. Furthermore, we compared the AST and ALT changes within the two 3-month time intervals. As demonstrated in Fig.\u0026nbsp;\u003cspan refid=\"Fig7\" class=\"InternalRef\"\u003e7\u003c/span\u003e, both AST and ALT showed significant changes in the \u0026ldquo;active group\u0026rdquo;, while the changes were not significant in the \u0026ldquo;less active group\u0026rdquo; during both 3-month intervals. Notably, the changes also exhibited significant differences between the two groups. Specifically, in the \u0026ldquo;active group\u0026rdquo;, AST and ALT demonstrated a significantly larger decrease during the first 3 months, followed by a significantly larger increase in the subsequent 3-month interval. This suggests a larger plasticity of enzymatic liver and muscle systems in individuals with a high level of body activity. Interestingly, a few studies have revealed similar observations while investigating enzyme variations. In a long-term study of 29 routine laboratory measurements of 30 athletes, AST and ALT exhibited significantly larger variations over an 11-month period compared to those reported for the general population [\u003cspan citationid=\"CR53\" class=\"CitationRef\"\u003e53\u003c/span\u003e, \u003cspan citationid=\"CR54\" class=\"CitationRef\"\u003e54\u003c/span\u003e]. 
Moreover, various studies have documented enzyme fluctuations in blood samples of healthy individuals in response to physical activity and exercise [\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e, \u003cspan additionalcitationids=\"CR56 CR57 CR58\" citationid=\"CR55\" class=\"CitationRef\"\u003e55\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR59\" class=\"CitationRef\"\u003e59\u003c/span\u003e].\u003c/p\u003e \u003cp\u003e \u003c/p\u003e"},{"header":"Discussion","content":"\u003cp\u003eIn this article, we measured 263 plasma metabolomics samples to study active aging and fitness in a cohort of very old adults close to or above the average life expectancy. Using a CCA approach, we clustered all old adults and samples into two groups based on a body activity index. Then we identified several key biomarkers between these two groups through machine learning and deep learning analysis. The identified metabolites are Aspartate, Proline, Fructose, Malic Acid, Pyruvate, Valine, Citrate and Ornithine, where Aspartate showed dominant effects. XGBoosting showed the best performance. In a further analysis, we applied the COVRECON (Li, et al., 2023) approach to the metabolomics datasets of the two groups. Through this method, we identified several key metabolic interaction changes between the active and less-active groups. Many of these interactions are related to aspartate, which is consistent with the machine learning results. By checking the detailed enzyme information of the highlighted metabolic interactions, we identified several important enzyme regulations. The enzyme AST was related to most of the highlighted interactions. The blood measurements of all individuals across the three time points validate the results. 
Existing studies also showed that AST and ALT are highly related to health issues of older adults [\u003cspan citationid=\"CR60\" class=\"CitationRef\"\u003e60\u003c/span\u003e].\u003c/p\u003e\n\u003ch3\u003eMetabolomics changes for resistance training\u003c/h3\u003e\n\u003cp\u003eAs shown in Supplementary table \u003cspan refid=\"MOESM2\" class=\"InternalRef\"\u003eS2\u003c/span\u003e, we conducted a group difference t-test for the metabolomics measurements. Alpha Tocopherol shows a significant difference between the nutritional supplement intake group (E) and the other two groups, as it is a part of the supplement FortiFit. The metabolites Linoleic acid, Methionine, Palmitic acid, Succinate and Tyrosine show a significant difference between the control group (K) and the resistance training groups (T \u0026amp; E). Interestingly, this divergence contrasts with the results obtained from the body activity classifiers, suggesting distinct metabolic mechanisms for resistance exercise and endurance exercise. This mechanistic difference between endurance and resistance exercise has been previously explored [\u003cspan citationid=\"CR61\" class=\"CitationRef\"\u003e61\u003c/span\u003e], where the metabolite changes induced by endurance or resistance exercise were identified as two different modes.\u003c/p\u003e \u003cp\u003eMoreover, several studies have reported that endurance exercise, but not resistance exercise, is highly relevant to aging-related questions. In a study of 100 old women (aged over 65 years), Cao Dinh, et al., 2019 reported that strength endurance training significantly reduces senescence-prone T cells, which are widely recognized as age-related [\u003cspan citationid=\"CR62\" class=\"CitationRef\"\u003e62\u003c/span\u003e], while intensive training showed no significant influence. 
In another study, Weiner, et al., 2019 concluded that endurance but not resistance training has anti-aging effects while examining a total of 124 healthy previously inactive individuals [\u003cspan citationid=\"CR46\" class=\"CitationRef\"\u003e46\u003c/span\u003e]. These studies provide additional support for our body activity index and metabolic network analysis.\u003c/p\u003e\n\u003ch3\u003eAspartate as a blood biomarker for body activity\u003c/h3\u003e\n\u003cp\u003eAspartic acid is one of the 22 proteinogenic amino acids. It is involved in the malate-aspartate shuttle, which facilitates the transfer of electrons and energy between the cytoplasm and mitochondria, ultimately contributing to the production of ATP and the efficient functioning of cellular energy metabolism [\u003cspan citationid=\"CR63\" class=\"CitationRef\"\u003e63\u003c/span\u003e]. Thus, it is particularly important in tissues with high-energy demands, such as muscle, liver and the heart. This may account for the larger aspartate metabolism in the \u0026ldquo;active group\u0026rdquo;. Accordingly, several groups have reported the effect of aspartate as an important supplement for attenuation of exercise-induced hyperammonemia and an increase in exercise endurance [\u003cspan citationid=\"CR64\" class=\"CitationRef\"\u003e64\u003c/span\u003e, \u003cspan citationid=\"CR65\" class=\"CitationRef\"\u003e65\u003c/span\u003e]. On the other hand, aspartate is involved in the removal of ammonia from the body through the urea cycle [\u003cspan citationid=\"CR66\" class=\"CitationRef\"\u003e66\u003c/span\u003e]. Performing exercise can lead to ammonia production as a byproduct of energy metabolism. 
Aspartate may be used to help detoxify ammonia, potentially altering its levels.\u003c/p\u003e \u003cdiv id=\"Sec8\" class=\"Section2\"\u003e \u003ch2\u003eOld adults with better body activity have larger plasticity of enzymatic liver and muscle system\u003c/h2\u003e \u003cp\u003eAST and ALT are two of the routine blood test enzymes highly related to individual\u0026rsquo;s liver but also muscle and heart health [\u003cspan citationid=\"CR67\" class=\"CitationRef\"\u003e67\u003c/span\u003e], where elevated levels of AST and ALT enzymes beyond a specified threshold may indicate medical condition like hepatitis, liver disease or myonecrosis. The ratio AST/ALT is a significant sign of liver disease. We plotted the AST/ALT ratio changes over the three time points for the two groups in Supplementary Figure S7. The results showed no significant changes across the time points and groups. This suggests that AST and ALT variations originate from non-disease related factors. Furthermore, scientific investigations have furnished evidence supporting the notion that physical exercise and improved fitness levels can also lead to a transient elevation of these enzyme levels within a healthy range for individuals without underlying liver issues [\u003cspan additionalcitationids=\"CR56\" citationid=\"CR55\" class=\"CitationRef\"\u003e55\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR57\" class=\"CitationRef\"\u003e57\u003c/span\u003e]. This exercise-induced transaminase elevation is a well-documented phenomenon, commonly observed in response to vigorous physical activity. It is essential to recognize that these exercise-related increases in AST and ALT levels are typically temporary and return to baseline levels shortly after physical exertion. This indicates larger AST and ALT variations for individuals with better body functionality/activity, as observed in Fig.\u0026nbsp;\u003cspan refid=\"Fig8\" class=\"InternalRef\"\u003e8\u003c/span\u003e. 
This viewpoint is also suggested in a long-term study of 29 routine laboratory measurements of 30 athletes, where AST and ALT exhibited significantly larger variations over an 11-month period compared to those reported for the general population [\u003cspan citationid=\"CR53\" class=\"CitationRef\"\u003e53\u003c/span\u003e, \u003cspan citationid=\"CR54\" class=\"CitationRef\"\u003e54\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eIn conclusion, this study is the first in which we integrate machine learning-based statistical analysis with COVRECON inverse Jacobian analysis. In metabolomics analysis, machine learning-based statistical methods help us find the key metabolites. For the dynamical analysis, as an alternative to kinetic modeling, which requires fitting many parameters, we demonstrated predictive metabolic interaction modelling using the inverse differential Jacobian approach. This novel approach might be highly relevant for finding the important dynamic regulations between two conditions. By integrating the machine learning results, we showed a robust approach for the inverse differential Jacobian calculation.\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eMaterials and Methods\u003c/h3\u003e\n\u003cdiv id=\"Sec10\" class=\"Section2\"\u003e \u003ch2\u003eExperimental design\u003c/h2\u003e \u003cp\u003eThis study was performed in 5 retirement homes in Vienna managed by the Curatorship of Viennese Retirement Homes. The aim of this study was to assess the impact of strength training, strength training combined with a protein-vitamin supplement, or cognitive training on very old, institutionalized adults. This study was conducted in a randomized, controlled, observer-blind design. The subjects were randomly assigned to three groups: resistance training (T), resistance training and supplements (E) and cognitive training, acting as a control group (K). The details are presented in Supplementary material S1. 
Blood samples were collected at baseline (T1), after three months (T2) and after six months (T3).\u003c/p\u003e \u003cp\u003eOne hundred and seventeen subjects were recruited from five senior residences (Supplementary Figure \u003cspan refid=\"MOESM1\" class=\"InternalRef\"\u003eS1\u003c/span\u003e). The inclusion criteria consisted of physical fitness (Short Physical Performance Battery\u0026thinsp;\u0026gt;\u0026thinsp;4) and mental performance (Mini Mental State Examination\u0026thinsp;\u0026ge;\u0026thinsp;23). Moreover, participants were free of severe diseases such as diabetic retinopathy and CVDs and did not regularly use cortisone-containing drugs. Before starting the intervention, the health and nutritional status was assessed by specialists in internal medicine and gerontology [\u003cspan citationid=\"CR68\" class=\"CitationRef\"\u003e68\u003c/span\u003e]. All subjects signed informed consent before inclusion in accordance with the Declaration of Helsinki. The study was approved by the ethics committee of the City of Vienna (EK-11-151-0811) and registered at ClinicalTrials.gov, NCT01775111 [\u003cspan citationid=\"CR68\" class=\"CitationRef\"\u003e68\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec11\" class=\"Section2\"\u003e \u003ch2\u003eSubject characteristics\u003c/h2\u003e \u003cp\u003eThe sex distribution (87.6% women; 12.4% men) among participants was representative of the population living in nursing homes. The mean age of the study population was 82.9\u0026thinsp;\u0026plusmn;\u0026thinsp;6.0 years for women and 84.9\u0026thinsp;\u0026plusmn;\u0026thinsp;6.7 years for men. 
The participants had a BMI of 29.27 kg/m\u003csup\u003e2\u003c/sup\u003e\u0026thinsp;\u0026plusmn;\u0026thinsp;5.00 kg/m\u003csup\u003e2\u003c/sup\u003e [\u003cspan citationid=\"CR68\" class=\"CitationRef\"\u003e68\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec12\" class=\"Section2\"\u003e \u003ch2\u003eSample Preparation\u003c/h2\u003e \u003cdiv id=\"Sec13\" class=\"Section3\"\u003e \u003ch2\u003eBlood Plasma Metabolite Extraction\u003c/h2\u003e \u003cp\u003eSeveral studies addressed the choice of blood sample, revealing that Heparin plasma produces a smaller side effect in the chromatogram spectrum [\u003cspan citationid=\"CR69\" class=\"CitationRef\"\u003e69\u003c/span\u003e, \u003cspan citationid=\"CR70\" class=\"CitationRef\"\u003e70\u003c/span\u003e]. In line with these findings, Heparin was used as an anticoagulant, and blood plasma was separated from fresh blood samples and kept at -80\u0026deg;C for further clinical analysis. Metabolite profiles of the obtained human plasma samples were measured using a gas chromatograph coupled to a mass spectrometer [\u003cspan citationid=\"CR71\" class=\"CitationRef\"\u003e71\u003c/span\u003e]. The samples were thawed on ice for 45 min and were vigorously vortexed for 10 s. The extraction consisted of two steps. First, 100 \u0026micro;l of plasma was transferred into 1.5 ml Eppendorf tubes, followed by the addition of 600 \u0026micro;l ice-cooled MeOH; the mixture was immediately vortexed for 10 s and left on ice for 15 min for incubation. In order to remove proteins, the samples were centrifuged at 14000 g for 4 min at 4\u0026deg;C. The supernatant was transferred into new tubes and dried down in a SpeedVac. Afterwards the dried pellets were stored at -20\u0026deg;C.\u003c/p\u003e \u003cp\u003eThe second step consisted of extraction with CHCl3. 300 \u0026micro;l of CHCl3 were added to the pellets. The further procedure was a repetition of the first step. 
The supernatant was transferred into new Eppendorf tubes and dried down in a SpeedVac. Metabolite extractions were performed in batches of 30 samples of randomly selected subjects.\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv id=\"Sec14\" class=\"Section2\"\u003e \u003ch2\u003eQuality control-Mix\u003c/h2\u003e \u003cp\u003eQuality control (QC) consisted of specific metabolites, including organic acids, amino acids, mono- and disaccharides and substrates of the TCA cycle. The table of metabolites for QC-Mix is attached to supplemental material S2. A calibration curve was prepared with volumes of 2 \u0026micro;l, 5 \u0026micro;l, 10 \u0026micro;l, 20 \u0026micro;l, 40 \u0026micro;l, 80 \u0026micro;l and 100 \u0026micro;l.\u003c/p\u003e \u003cp\u003eInternal Standard (10 \u0026micro;l Pinitol and 10 \u0026micro;l Sorbitol) was added to each sample and to each QC the day before GC-MS analysis. Afterwards, the samples were dried in a SpeedVac.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec15\" class=\"Section2\"\u003e \u003ch2\u003eDerivatization\u003c/h2\u003e \u003cp\u003eFirst, 20 \u0026micro;l of a 40 mg mL\u003csup\u003e\u0026minus;\u0026thinsp;1\u003c/sup\u003e solution of methoxyamine hydrochloride (MeOX) in pyridine was added to each sample. To dissolve MeOX in pyridine appropriately, the solution was vigorously vortexed several times and the tube was put into hot water. After that, samples were vortexed until pellets were completely dissolved, followed by agitation at 30\u0026deg;C for 90 min at 750 rpm with a thermoshaker.\u003c/p\u003e \u003cp\u003eN-Methyl-N-(trimethylsilyl) trifluoroacetamide (MSTFA) flasks of 1 ml content were spiked with 30 \u0026micro;l retention index marker solution of alkanes from C10 to C40 in hexane. 
After addition of 80 \u0026micro;l of prepared MSTFA, samples were incubated at 37\u0026deg;C for 30 min at 750 rpm, followed by centrifugation at 14000 g for 2 min at room temperature (24\u0026deg;C). Immediately after this step, 70 \u0026micro;l of the supernatant were transferred to GC-vials with micro inserts and closed with crimp caps.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec16\" class=\"Section2\"\u003e \u003ch2\u003eGC-MS Analysis\u003c/h2\u003e \u003cp\u003eFinally, samples were analysed using GC-MS (LECO Pegasus\u0026reg; 4D GCxGC-TOF-MS, M\u0026ouml;nchengladbach, Germany) according to Weckwerth, Wenzel, and Fiehn 2004. Immediately after derivatization, 1 \u0026micro;l of sample was injected utilizing a split ratio of 1:5. The split/splitless injector was equipped with a single-tapered liner with deactivated wool and kept at a constant temperature of 230\u0026deg;C. The GC-MS consisted of an Agilent 6890 (Agilent Technologies, Glostrup, Denmark) using helium as carrier gas at a flow rate of 1 mL min\u003csup\u003e\u0026minus;1\u003c/sup\u003e. Gas separation was performed on the HP-5MS column (30 m \u0026times; 0.25 mm \u0026times; 0.25 \u0026micro;m, Agilent Technologies).\u003c/p\u003e \u003cp\u003eThe initial temperature of the GC oven was set to 70\u0026deg;C isothermal for 1 min, followed by a heating ramp of 9\u0026deg;C \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\text{min}}^{-1}\$\u003c/span\u003e\u003c/span\u003e to reach 330\u0026deg;C, which was held for 7 min.\u003c/p\u003e \u003cp\u003eTransfer line temperature was 250\u0026deg;C, and ion source temperature was set to 200\u0026deg;C. The MS detector was switched off during the first 260 sec. 
Mass spectra were acquired at a rate of 20 spectra \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\${\\text{s}}^{-1}\$\u003c/span\u003e\u003c/span\u003e over an m/z range of 40 to 600, with a detector voltage of 1,550 V and electron impact ionization at 70 eV. The liner had to be exchanged every 70 injections, that is, after every two consecutive batches.\u003c/p\u003e \u003cp\u003eData acquisition was performed in 14 batches. Each batch was measured in the same chronological order. First, alkanes were injected, followed by the QC calibration curve. Blank samples containing only dried extraction reagents and derivatization solvents were injected every 5 or 7 samples. Each batch consisted of one plasma sample from each of 20\u0026ndash;30 subjects and was analyzed within 24\u0026ndash;32 hours. One pooled sample was measured in each batch to assess instrument stability. At the end of every batch, the same QC was measured again to monitor instrumental performance over time. To minimize systematic bias induced by preparation order, samples were randomly distributed across the 14 batches. Each batch therefore comprised a representative cross section of all samples and was comparable to the total experimental population.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec17\" class=\"Section2\"\u003e \u003ch2\u003eMetabolite identification\u003c/h2\u003e \u003cp\u003eAfter the mass spectral data were obtained from GC-MS, the next step was to transform them into biologically relevant information.\u003c/p\u003e \u003cp\u003eThe raw GC-MS data consisted of ion peaks and were preprocessed using LECO Chroma-TOF. The ion fragmentation spectra were matched to fragmentation spectra in the NIST library and scored with a match probability; only metabolites with a similarity score of at least 700 were considered. 
Afterwards, analytes were identified by comparison of ion fragments to a reference library of chemical standards. Specifically, the metabolites were confirmed based on ion features characteristic of each metabolite, e.g., 1) retention index and retention time, 2) m/z, and 3) in-source fragmentation. With the latter, the identification of the analytes became definitive.\u003c/p\u003e \u003cp\u003eAlkanes measured at the beginning of each batch provided retention indices that were assigned to all ion peaks.\u003c/p\u003e \u003cp\u003eThe original data are presented in Supplementary material S2.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec18\" class=\"Section2\"\u003e \u003ch2\u003eData Processing\u003c/h2\u003e \u003cp\u003eData processing involved several steps. First, missing values in the metabolomics measurements were imputed using the K-Nearest Neighbors (KNN) method. Next, normalization was performed to reduce heteroscedasticity and adjust for the offset between high- and low-intensity features as in Eq.\u0026nbsp;(0), where each log-transformed metabolite is centered around its mean and scaled by its standard deviation (s):\u003cdiv id=\"Equa\" class=\"Equation\"\u003e\u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Equa\" name=\"EquationSource\"\u003e\n$$\\widehat{x}_{ij}=\\frac{\\log_{2}\\left({x}_{ij}\\right)-\\overline{\\log_{2}\\left({x}_{i}\\right)}}{s}$$\u003c/div\u003e\u003c/div\u003e\u003cdiv id=\"Equb\" class=\"Equation\"\u003e\u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Equb\" name=\"EquationSource\"\u003e\n$$\\:\\left(0\\right)$$\u003c/div\u003e\u003c/div\u003e\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec19\" class=\"Section2\"\u003e \u003ch2\u003eData Clustering\u003c/h2\u003e \u003cp\u003eTo identify biomarkers and perform the inverse Jacobian analysis, the samples were first clustered into distinct groups. 
The clustering process comprised the following steps. First, based on the information provided in Supplementary Material S2, the physical measurements were categorized into two types: \u0026ldquo;body-shape\u0026rdquo; data (e.g., gender and height) and \u0026ldquo;body-functional\u0026rdquo; data (e.g., walking distance and left standing time). To generate a body activity index that reflects body functionality while minimizing the influence of body-shape differences, Canonical Correlation Analysis (CCA) [\u003cspan citationid=\"CR72\" class=\"CitationRef\"\u003e72\u003c/span\u003e] was applied. The loadings of this body activity index are presented in Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eA, where walking distance shows the strongest effect. The metabolomics-related body activity index generated through CCA was then used to cluster the samples with the k-means method.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec20\" class=\"Section2\"\u003e \u003ch2\u003eMachine Learning based Classifiers\u003c/h2\u003e \u003cp\u003eBecause CCA is a linear method, the CCA-based clustering approach, which relates the body activity index to the metabolic index, may not fully capture the dynamic nature of metabolic mechanisms, which are predominantly non-linear. To capture this non-linear influence and identify important variables with higher accuracy, several machine learning based classifiers were employed within an automated machine learning framework, implemented using the H2O package in Python. The methods utilized are as follows:\u003c/p\u003e \u003cp\u003e1, Generalized Linear Models (GLM): GLM implements regularized linear models with stochastic gradient descent (SGD) learning. 
The loss gradient is estimated one sample at a time, and the model is updated iteratively with a decreasing strength schedule. This method offers a baseline for the linear effects.\u003c/p\u003e \u003cp\u003e2, Random Forest Classifier: A random forest is an ensemble meta-estimator that fits multiple decision tree classifiers on different sub-samples of the dataset, utilizing averaging to improve predictive accuracy and mitigate overfitting.\u003c/p\u003e \u003cp\u003e3\u0026ndash;4, Boosting Methods: Boosting is an ensemble meta-algorithm that reduces bias and variance in supervised learning. It comprises a family of machine learning algorithms that convert weak learners into strong ones [\u003cspan citationid=\"CR73\" class=\"CitationRef\"\u003e73\u003c/span\u003e]. Boosting algorithms differ mainly in how they weight training data points and hypotheses. We employed two common boosting methods, LGBMClassifier and XGBClassifier. LGBMClassifier (GBM) is a distributed gradient-boosting framework based on decision tree algorithms, originally developed by Microsoft [\u003cspan citationid=\"CR74\" class=\"CitationRef\"\u003e74\u003c/span\u003e], while XGBClassifier (eXtreme Gradient Boosting) is an open-source library that provides regularized gradient boosting [\u003cspan citationid=\"CR75\" class=\"CitationRef\"\u003e75\u003c/span\u003e].\u003c/p\u003e \u003cp\u003e5, Autoencoder\u0026thinsp;+\u0026thinsp;deep learning: Deep learning, also known as deep neural networks, is a powerful machine learning method extensively used in pattern recognition, image processing, and bioinformatics [\u003cspan citationid=\"CR76\" class=\"CitationRef\"\u003e76\u003c/span\u003e]. 
Before training the model, we pre-trained it with an autoencoder on the entire unlabeled dataset, which improves model performance and avoids random weight initialization.\u003c/p\u003e \u003cp\u003eIn our approach, each of these machine learning methods was integrated into an automated framework that encompasses hyper-parameter optimization. Hyper-parameter optimization entails the selection of ideal parameter values that govern the learning process, aiming to enhance model performance [\u003cspan citationid=\"CR77\" class=\"CitationRef\"\u003e77\u003c/span\u003e]. Supplementary Figure S5 provides an overview of the hyper-parameter ranges associated with each machine learning method. For evaluation, we randomly generated 25 training-test splits with a training-test ratio of 75%/25%.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec21\" class=\"Section2\"\u003e \u003ch2\u003eFeature Importance\u003c/h2\u003e \u003cp\u003eFeature importance was estimated using a model-based approach, considering a feature to be important if it contributed significantly to the model's performance. Here, the \u0026lsquo;varimp\u0026rsquo; function of the H2O Python package was used to rank the important metabolites of each classifier. The importance value was averaged over the 25 training-test splits, and the top 10 metabolites were selected for each machine learning method.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec22\" class=\"Section2\"\u003e \u003ch2\u003ePredictive metabolic modelling using an inverse Jacobian approach\u003c/h2\u003e \u003cp\u003eStatistical and machine learning methods have limitations when it comes to understanding the dynamics of a biochemical network, identifying critical regulatory steps, and capturing changes in regulatory mechanisms under different conditions [\u003cspan citationid=\"CR24\" class=\"CitationRef\"\u003e24\u003c/span\u003e]. 
In recent years, the inverse differential Jacobian algorithms have been developed as a convenient approach to infer the dynamic regulation of metabolic networks from metabolomics data [\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e, \u003cspan additionalcitationids=\"CR32 CR33\" citationid=\"CR31\" class=\"CitationRef\"\u003e31\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR34\" class=\"CitationRef\"\u003e34\u003c/span\u003e, \u003cspan additionalcitationids=\"CR42\" citationid=\"CR41\" class=\"CitationRef\"\u003e41\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR43\" class=\"CitationRef\"\u003e43\u003c/span\u003e, \u003cspan citationid=\"CR78\" class=\"CitationRef\"\u003e78\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eIn previous studies, we introduced the COVRECON workflow and Matlab toolbox as the standard inverse Jacobian workflow (Weckwerth, 2019; Li, et al., 2023). This method combines the covariance matrix of metabolomics data with automatic metabolic network modeling based on genome-scale metabolic reconstructions and biochemical reaction databases.\u003c/p\u003e \u003cp\u003eConsider a metabolic network that consists of n metabolites denoted by \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\{{X}_{i}{\\}}_{i=1\\dots\\:n}\$\u003c/span\u003e\u003c/span\u003e. 
The system dynamics can be modeled with a set of ordinary differential equations (ODEs):\u003cdiv id=\"Equc\" class=\"Equation\"\u003e\u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Equc\" name=\"EquationSource\"\u003e\n$$\\frac{d\\varvec{M}}{dt}=\\varvec{F}\\left(\\varvec{M}\\right)\\to\\left\\{\\begin{array}{c}\\frac{d{M}_{1}}{dt}={f}_{1}\\left({M}_{1},{M}_{2},\\dots,{M}_{n}\\right)\\\\ \\frac{d{M}_{2}}{dt}={f}_{2}\\left({M}_{1},{M}_{2},\\dots,{M}_{n}\\right)\\\\ \\vdots\\\\ \\frac{d{M}_{n}}{dt}={f}_{n}\\left({M}_{1},{M}_{2},\\dots,{M}_{n}\\right),\\end{array}\\right.$$\u003c/div\u003e\u003c/div\u003e\u003cdiv id=\"Equd\" class=\"Equation\"\u003e\u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Equd\" name=\"EquationSource\"\u003e\n$$\\:\\left(1\\right)$$\u003c/div\u003e\u003c/div\u003e\u003c/p\u003e \u003cp\u003ewhere \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$M=\\left\\{{M}_{i}\\right\\}=\\left\\{\\left|{X}_{i}\\right|\\right\\}\$\u003c/span\u003e\u003c/span\u003e are the concentrations of the n metabolites, and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$F=\\left\\{{f}_{i}\\left(M\\right)\\right\\}\$\u003c/span\u003e\u003c/span\u003e is composed of the reaction rates for these metabolites (e.g., Michaelis-Menten kinetics, or mass action).\u003c/p\u003e \u003cp\u003eThe steady-state Jacobian matrix \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$J\$\u003c/span\u003e\u003c/span\u003e of the model is defined as a matrix in \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\${R}^{n\\times n}\$\u003c/span\u003e\u003c/span\u003e in which \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\${J}_{ij}\$\u003c/span\u003e\u003c/span\u003e is the first-order derivative of the rate \u003cspan 
class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\${f}_{i}\$\u003c/span\u003e\u003c/span\u003e with respect to the concentration \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\${M}_{j}\$\u003c/span\u003e\u003c/span\u003e at steady state, noted as \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\${J}_{ij}={\\left.\\frac{\\partial {f}_{i}}{\\partial {M}_{j}}\\right|}_{steady}\$\u003c/span\u003e\u003c/span\u003e:\u003cdiv id=\"Eque\" class=\"Equation\"\u003e\u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Eque\" name=\"EquationSource\"\u003e\n$$\\varvec{J}={\\left.\\frac{\\partial \\varvec{F}}{\\partial \\varvec{M}}\\right|}_{steady}={\\left[\\begin{array}{cccc}\\frac{\\partial {f}_{1}}{\\partial {M}_{1}}\u0026amp;\\frac{\\partial {f}_{1}}{\\partial {M}_{2}}\u0026amp;\\cdots\u0026amp;\\frac{\\partial {f}_{1}}{\\partial {M}_{n}}\\\\ \\frac{\\partial {f}_{2}}{\\partial {M}_{1}}\u0026amp;\\frac{\\partial {f}_{2}}{\\partial {M}_{2}}\u0026amp;\\cdots\u0026amp;\\frac{\\partial {f}_{2}}{\\partial {M}_{n}}\\\\ \\vdots\u0026amp;\\vdots\u0026amp;\\ddots\u0026amp;\\vdots\\\\ \\frac{\\partial {f}_{n}}{\\partial {M}_{1}}\u0026amp;\\frac{\\partial {f}_{n}}{\\partial {M}_{2}}\u0026amp;\\cdots\u0026amp;\\frac{\\partial {f}_{n}}{\\partial {M}_{n}}\\end{array}\\right]}_{steady}$$\u003c/div\u003e\u003c/div\u003e\u003cdiv id=\"Equf\" class=\"Equation\"\u003e\u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Equf\" name=\"EquationSource\"\u003e\n$$\\:\\left(2\\right)$$\u003c/div\u003e\u003c/div\u003e\u003c/p\u003e \u003cp\u003eThe steady-state Jacobian matrix represents the first-order derivatives of the rate equations with respect to the concentrations of the metabolites at steady state. 
It contains valuable information about the system's dynamics, including regulatory interactions among the metabolites. As derived in a previous study by Steuer et al. (Steuer et al., 2003), Eq.\u0026nbsp;(3) relates the covariance matrix of the metabolomics data to the steady-state Jacobian matrix:\u003cdiv id=\"Equg\" class=\"Equation\"\u003e\u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Equg\" name=\"EquationSource\"\u003e\n$$JC+C{J}^{T}=-2D.$$\u003c/div\u003e\u003c/div\u003e\u003cdiv id=\"Equh\" class=\"Equation\"\u003e\u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Equh\" name=\"EquationSource\"\u003e\n$$\\:\\left(3\\right)$$\u003c/div\u003e\u003c/div\u003e\u003c/p\u003e \u003cp\u003eHere, \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$C\\in {R}^{n\\times n}\$\u003c/span\u003e\u003c/span\u003e represents the covariance matrix of the metabolite concentrations \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\${M}_{j}\$\u003c/span\u003e\u003c/span\u003e near their steady-state values \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\${M}_{j}^{steady}\$\u003c/span\u003e\u003c/span\u003e, while the fluctuation matrix \u003cb\u003eD\u003c/b\u003e represents the covariance of noise sources acting on the system.\u003c/p\u003e \u003cp\u003eThe differences between two conditions can be quantified by the differential Jacobian \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$D\\varvec{J}\$\u003c/span\u003e\u003c/span\u003e, which is calculated from the Jacobians of the two groups:\u003cdiv id=\"Equi\" class=\"Equation\"\u003e\u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Equi\" 
name=\"EquationSource\"\u003e\n$${D\\varvec{J}}_{ij}=\\left\\{\\begin{array}{ll}\\max\\left(\\left|\\frac{{\\left({\\varvec{J}}_{d}\\right)}_{ij}}{{\\left({\\varvec{J}}_{h}\\right)}_{ij}}\\right|,\\left|\\frac{{\\left({\\varvec{J}}_{h}\\right)}_{ij}}{{\\left({\\varvec{J}}_{d}\\right)}_{ij}}\\right|\\right),\u0026amp;\\text{if }{\\left({\\varvec{J}}_{h}\\right)}_{ij}\\ne 0,\\\\ 1,\u0026amp;\\text{if }{\\left({\\varvec{J}}_{h}\\right)}_{ij}=0.\\end{array}\\right.$$\u003c/div\u003e\u003c/div\u003e\u003c/p\u003e \u003cdiv id=\"Sec23\" class=\"Section3\"\u003e \u003ch2\u003e(4)\u003c/h2\u003e \u003cp\u003eThe differential Jacobian \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$D\\varvec{J}\$\u003c/span\u003e\u003c/span\u003e encompasses crucial insights into the dynamic regulatory mechanisms between two conditions. The inverse problem is to estimate the differential Jacobian \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$D\\varvec{J}\$\u003c/span\u003e\u003c/span\u003e from the measured metabolomics data. This task involves two key aspects: establishing the structural information of the Jacobian matrix and solving the optimization problem associated with the differential Jacobian.\u003c/p\u003e \u003cp\u003eIn a recent study, we introduced the COVRECON approach and the related MATLAB toolbox (Li, et al., 2023). This approach combines the automatic assembly of a metabolic interaction network with the inverse differential Jacobian calculation through a regression-loss-based algorithm. It automatically constructs a metabolic interaction network that contains the Jacobian structure information and then calculates a regression loss matrix R* to estimate the differential Jacobian matrix. 
The result R* is presented as a MATLAB figure in which the interaction pathway details can be checked interactively. The details of the algorithm can be found in Supplementary material S1 and the original publication (Li, et al., 2023).\u003c/p\u003e \u003cp\u003eBy employing the COVRECON approach, we aim to uncover the key components and regulatory interactions within the differential Jacobian, thereby gaining insights into the dynamics of the metabolic network.\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv id=\"Sec24\" class=\"Section2\"\u003e \u003ch2\u003eIntegrate Classifier Biomarkers and Group Differential Jacobian Analysis\u003c/h2\u003e \u003cp\u003eSince the samples were clustered into two groups in the data clustering step, the inverse Jacobian analysis can be performed for these two groups. As discussed in Supplementary material S1, similar to the general approach of most kinetic models, we assume that the dynamics within each group can be described by a group model, so that the steady-state dynamics can be represented by a group Jacobian. Consequently, the inverse Jacobian algorithm can offer valuable information about the regulated dynamics that differ between the two groups.\u003c/p\u003e \u003cp\u003eThe results from the inverse Jacobian analysis are closely linked to the structural information of the Jacobian obtained from the automatically generated super-pathway metabolic interaction networks. Importantly, we incorporate the variable importance from the classifiers into the inverse Jacobian analysis: we retain the pivotal biomarkers and add a controlled mix of randomly chosen additional metabolites. The augmented networks, encompassing 10\u0026ndash;20 metabolites, are subsequently subjected to the COVRECON workflow. Notably, in the COVRECON results, large values indicate differences in dynamics between the two groups. 
We are able to identify the important reactions or enzymes involved in the active aging context by checking the detailed information behind these large values (Li, et al., 2023).\u003c/p\u003e "},{"header":"Declarations","content":"\u003cdiv id=\"Sec25\" class=\"Section3\"\u003e \u003ch2\u003eData availability\u003c/h2\u003e \u003cp\u003eThe data underlying this article are available in the online supplementary material. The original data of the breast cancer case study can be accessed in the reference [\u003cspan citationid=\"CR54\" class=\"CitationRef\"\u003e54\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec26\" class=\"Section3\"\u003e \u003ch2\u003eCode availability\u003c/h2\u003e \u003cp\u003eThe Matlab code is available in \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://bitbucket.org/mosys-univie/covrecon/\u003c/span\u003e\u003cspan address=\"https://bitbucket.org/mosys-univie/covrecon/\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e\u003cp\u003e \u003ch2\u003eCompeting Interests\u003c/h2\u003e \u003cp\u003eThe authors declare no competing interests.\u003c/p\u003e \u003c/p\u003e\u003ch2\u003eAuthor Contribution\u003c/h2\u003e\u003cp\u003eW.W., K.H.W., and J.L. conceived the study. J.L. and W.W. developed the method. M.B., B.W., B.F., and E.M.S. implemented and performed the experiments, and J.L., S.W., and I.P. interpreted the results. J.L., W.W., and S.W. wrote the first version of the manuscript. W.W., K.H.W., J.L., and S.W. revised the manuscript. All authors reviewed and approved the final version of the manuscript.\u003c/p\u003e\u003ch2\u003eAcknowledgments\u003c/h2\u003e \u003cp\u003e This work was supported by a Ph.D. scholarship provided by the China Scholarship Council (CSC) [grant number: 201806010428 to J.L.]. 
Open access funding provided by University of\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003eKohl, H.W., et al., \u003cem\u003eThe pandemic of physical inactivity: global action for public health\u003c/em\u003e. The lancet, 2012. 380(9838): p. 294\u0026ndash;305.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHavighurst, R.J., \u003cem\u003eSuccessful aging.\u003c/em\u003e Processes of aging: Social and psychological perspectives, 1963. 1: p. 299\u0026ndash;320.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWHO, \u003cem\u003eActive ageing: A policy framework\u003c/em\u003e. 2002, World Health Organization.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBoudiny, K. and D. Mortelmans, \u003cem\u003eA critical perspective: towards a broader understanding of'active ageing'\u003c/em\u003e. E-journal of Applied Psychology, 2011. 7(1): p. 8\u0026ndash;14.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eOfferman, J., et al., \u003cem\u003eAttitudes related to technology for active and healthy aging in a national multigenerational survey\u003c/em\u003e. Nature Aging, 2023. 3(5): p. 617\u0026ndash;625.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWongsala, M., E.-M. Anb\u0026auml;cken, and S. Rosendahl, \u003cem\u003eActive ageing\u0026ndash;perspectives on health, participation, and security among older adults in northeastern Thailand\u0026ndash;a qualitative study\u003c/em\u003e. BMC geriatrics, 2021. 21: p. 1\u0026ndash;10.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMalkowski, O.S., R. Kanabar, and M.J. Western, \u003cem\u003eSocio-economic status and trajectories of a novel multidimensional metric of Active and Healthy Ageing: the English Longitudinal Study of Ageing\u003c/em\u003e. Scientific Reports, 2023. 13(1): p. 
6107.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFern\u0026aacute;ndez-Ballesteros, R., et al., \u003cem\u003eActive aging: a global goal\u003c/em\u003e. 2013, Hindawi.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCaprara, M., et al., \u003cem\u003eActive aging promotion: results from the Vital Aging Program.\u003c/em\u003e Current Gerontology and Geriatrics Research, 2013. 2013.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eTaylor, A.W., \u003cem\u003ePhysiology of exercise and healthy aging\u003c/em\u003e. 2022: Human Kinetics.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWeckwerth, W., \u003cem\u003eMetabolomics: an integral technique in systems biology\u003c/em\u003e. Bioanalysis, 2010. 2(4): p. 829\u0026ndash;836.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePatti, G.J., O. Yanes, and G. Siuzdak, \u003cem\u003eMetabolomics: the apogee of the omics trilogy\u003c/em\u003e. Nature reviews Molecular cell biology, 2012. 13(4): p. 263\u0026ndash;269.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBalashova, E.E., et al., \u003cem\u003eMetabolome Profiling in Aging Studies\u003c/em\u003e. Biology, 2022. 11(11): p. 1570.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGonzalez-Covarrubias, V., E. Mart\u0026iacute;nez-Mart\u0026iacute;nez, and L. del Bosque-Plata, \u003cem\u003eThe potential of metabolomics in biomedical applications\u003c/em\u003e. Metabolites, 2022. 12(2): p. 194.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBruzzone, C., et al., \u003cem\u003eMetabolomics as a powerful tool for diagnostic, pronostic and drug intervention analysis in COVID-19.\u003c/em\u003e Frontiers in Molecular Biosciences, 2023. 10: p. 1111482.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSu, Y., et al., \u003cem\u003eMulti-omics resolves a sharp disease-state shift between mild and moderate COVID-19\u003c/em\u003e. Cell, 2020. 
183(6): p. 1479\u0026ndash;1495. e20.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSindelar, M., et al., \u003cem\u003eLongitudinal metabolomics of human plasma reveals prognostic markers of COVID-19 disease severity\u003c/em\u003e. Cell Reports Medicine, 2021. 2(8).\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMeoni, G., et al., \u003cem\u003eMetabolomic/lipidomic profiling of COVID-19 and individual response to tocilizumab\u003c/em\u003e. PLoS Pathogens, 2021. 17(2): p. e1009243.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGhini, V., et al., \u003cem\u003eSerum NMR profiling reveals differential alterations in the lipoproteome induced by pfizer-BioNTech vaccine in COVID-19 recovered subjects and na\u0026iuml;ve subjects\u003c/em\u003e. Frontiers in molecular biosciences, 2022. 9: p. 839809.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePanyard, D.J., B. Yu, and M.P. Snyder, \u003cem\u003eThe metabolomics of human aging: Advances, challenges, and opportunities\u003c/em\u003e. Science Advances, 2022. 8(42): p. eadd6155.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWeckwerth, W., \u003cem\u003eToward a unification of system-theoretical principles in biology and ecology\u0026mdash;the stochastic lyapunov matrix equation and its inverse application\u003c/em\u003e. Frontiers in Applied Mathematics and Statistics, 2019. 5: p. 29.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWeckwerth, W., \u003cem\u003eGreen systems biology\u0026mdash;from single genomes, proteomes and metabolomes to ecosystems research and biotechnology\u003c/em\u003e. Journal of proteomics, 2011. 75(1): p. 284\u0026ndash;305.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWeckwerth, W., \u003cem\u003eUnpredictability of metabolism\u0026ndash;the key role of metabolomics science in combination with next-generation genome sequencing\u003c/em\u003e. Anal Bioanal Chem, 2011. 400(7): p. 
1967&ndash;78.</span></li>
<li><span>Sidak, D., et al., <em>Interpretable machine learning methods for predictions in systems biology from omics data</em>. Frontiers in Molecular Biosciences, 2022. 9: p. 926623.</span></li>
<li><span>Liebal, U.W., et al., <em>Machine learning applications for mass spectrometry-based metabolomics</em>. Metabolites, 2020. 10(6): p. 243.</span></li>
<li><span>Pomyen, Y., et al., <em>Deep metabolome: Applications of deep learning in metabolomics</em>. Computational and Structural Biotechnology Journal, 2020. 18: p. 2818&ndash;2825.</span></li>
<li><span>Alakwaa, F.M., K. Chaudhary, and L.X. Garmire, <em>Deep learning accurately predicts estrogen receptor status in breast cancer metabolomics data</em>. Journal of Proteome Research, 2018. 17(1): p. 337&ndash;347.</span></li>
<li><span>Weckwerth, W., <em>Unpredictability of metabolism&mdash;the key role of metabolomics science in combination with next-generation genome sequencing</em>. Analytical and Bioanalytical Chemistry, 2011. 400(7): p. 1967&ndash;1978.</span></li>
<li><span>Weckwerth, W., <em>Metabolomics in systems biology</em>. Annual Review of Plant Biology, 2003. 54(1): p. 669&ndash;689.</span></li>
<li><span>Wienkoop, S., et al., <em>Integration of metabolomic and proteomic phenotypes: analysis of data covariance dissects starch and RFO metabolism from low and high temperature compensation response in Arabidopsis thaliana</em>. Molecular &amp; Cellular Proteomics, 2008. 7(9): p. 1725&ndash;1736.</span></li>
<li><span>N&auml;gele, T., et al., <em>Solving the differential biochemical Jacobian from metabolomics covariance data</em>. PLoS ONE, 2014. 9(4): p. e92299.</span></li>
<li><span>Wilson, J.L., et al., <em>Inverse data-driven modeling and multiomics analysis reveals phgdh as a metabolic checkpoint of macrophage polarization and proliferation</em>. Cell Reports, 2020. 30(5): p. 1542&ndash;1552.e7.</span></li>
<li><span>Li, J., S. Waldherr, and W. Weckwerth, <em>COVRECON: automated integration of genome- and metabolome-scale network reconstruction and data-driven inverse modeling of metabolic interaction networks</em>. Bioinformatics, 2023. 39(7): p. btad397.</span></li>
<li><span>Li, J., W. Weckwerth, and S. Waldherr, <em>Enzyme fluctuations data improve inference of metabolic interaction networks with an inverse differential Jacobian approach.</em> bioRxiv, 2023: p. 2023.12.11.570118.</span></li>
<li><span>King, Z.A., et al., <em>BiGG Models: A platform for integrating, standardizing and sharing genome-scale models</em>. Nucleic Acids Research, 2016. 44(D1): p. D515&ndash;D522.</span></li>
<li><span>Steuer, R., et al., <em>Structural kinetic modeling of metabolic networks.</em> Proceedings of the National Academy of Sciences, 2006. 103(32): p. 11868&ndash;11873.</span></li>
<li><span>Jamshidi, N. and B.&Oslash;. Palsson, <em>Mass action stoichiometric simulation models: incorporating kinetics and regulation into stoichiometric models</em>. Biophysical Journal, 2010. 98(2): p. 175&ndash;185.</span></li>
<li><span>Haiman, Z.B., et al., <em>MASSpy: Building, simulating, and visualizing dynamic biological models in Python using mass action kinetics</em>. PLoS Computational Biology, 2021. 17(1): p. e1008208.</span></li>
<li><span>Akbari, A., Z.B. Haiman, and B.O. Palsson, <em>A data-driven approach for timescale decomposition of biochemical reaction networks</em>. mSystems, 2024. 9(2): p. e01001-23.</span></li>
<li><span>N&auml;gele, T., <em>Metabolic regulation of subcellular sucrose cleavage inferred from quantitative analysis of metabolic functions</em>. Quantitative Plant Biology, 2022. 3: p. e10.</span></li>
<li><span>Sun, X. and W. Weckwerth, <em>COVAIN: a toolbox for uni- and multivariate statistics, time-series and correlation network analysis and inverse estimation of the differential Jacobian from metabolomics covariance data</em>. Metabolomics, 2012. 8(1): p. 81&ndash;93.</span></li>
<li><span>K&uuml;gler, P. and W. Yang, <em>Identification of alterations in the Jacobian of biochemical reaction networks from steady state covariance data at two conditions</em>. Journal of Mathematical Biology, 2014. 68(7): p. 1757&ndash;1783.</span></li>
<li><span>Sun, X., B. L&auml;nger, and W. Weckwerth, <em>Challenges of inversely estimating Jacobian from metabolomics data</em>. Frontiers in Bioengineering and Biotechnology, 2015. 3: p. 188.</span></li>
<li><span>Weiszmann, J., et al., <em>Metabolome plasticity in 241 Arabidopsis thaliana accessions reveals evolutionary cold adaptation processes</em>. Plant Physiology, 2023: p. kiad298.</span></li>
<li><span>Chaturvedi, P., et al., <em>Natural variation in the chickpea metabolome under drought stress</em>. Plant Biotechnology Journal, 2024.</span></li>
<li><span>Werner, C.M., et al., <em>Differential effects of endurance, interval, and resistance training on telomerase activity and telomere length in a randomized, controlled study</em>. European Heart Journal, 2019. 40(1): p. 34&ndash;46.</span></li>
<li><span>Cao Dinh, H., et al., <em>Strength endurance training but not intensive strength training reduces senescence-prone T cells in peripheral blood in community-dwelling elderly women</em>. The Journals of Gerontology: Series A, 2019. 74(12): p. 1870&ndash;1878.</span></li>
<li><span>Le Couteur, D.G., et al., <em>The association of alanine transaminase with aging, frailty, and mortality</em>. Journals of Gerontology Series A: Biomedical Sciences and Medical Sciences, 2010. 65(7): p. 712&ndash;717.</span></li>
<li><span>Goh, G.B.-B., et al., <em>Age impacts ability of aspartate&ndash;alanine aminotransferase ratio to predict advanced fibrosis in nonalcoholic fatty liver disease</em>. Digestive Diseases and Sciences, 2015. 60: p. 1825&ndash;1831.</span></li>
<li><span>Nakajima, K., et al. <em>High aspartate aminotransferase/alanine aminotransferase ratio may be associated with all-cause mortality in the elderly: a retrospective cohort study using artificial intelligence and conventional analysis</em>. in <em>Healthcare</em>. 2022. MDPI.</span></li>
<li><span>Yamamoto, T., et al., <em>The first report of Japanese patients with asparagine synthetase deficiency</em>. Brain and Development, 2017. 39(3): p. 236&ndash;242.</span></li>
<li><span>Oh, R.C., et al., <em>Mildly elevated liver transaminase levels: causes and evaluation</em>. American Family Physician, 2017. 96(11): p. 709&ndash;715.</span></li>
<li><span>Diaz-Garzon, J., et al., <em>Long-term within- and between-subject biological variation of 29 routine laboratory measurands in athletes</em>. Clinical Chemistry and Laboratory Medicine (CCLM), 2022. 60(4): p. 618&ndash;628.</span></li>
<li><span>Diaz-Garzon, J., et al., <em>Long-term within- and between-subject biological variation data of hematological parameters in recreational endurance athletes</em>. Clinical Chemistry, 2023. 69(5): p. 500&ndash;509.</span></li>
<li><span>Pavletic, A.J. and M.E. Wright, <em>Exercise-induced elevation of liver enzymes in a healthy female research volunteer</em>. Psychosomatics, 2015. 56(5): p. 604.</span></li>
<li><span>Pettersson, J., et al., <em>Muscular exercise can cause highly pathological liver function tests in healthy men</em>. British Journal of Clinical Pharmacology, 2008. 65(2): p. 253&ndash;259.</span></li>
<li><span>Tiller, N.B. and W.W. Stringer, <em>Exercise-induced increases in &ldquo;liver function tests&rdquo; in a healthy adult male: Is there a knowledge gap in primary care?</em> Journal of Family Medicine and Primary Care, 2023. 12(1): p. 177.</span></li>
<li><span>Nunez, D.J., et al., <em>Factors influencing longitudinal changes of circulating liver enzyme concentrations in subjects randomized to placebo in four clinical trials</em>. American Journal of Physiology-Gastrointestinal and Liver Physiology, 2019. 316(3): p. G372&ndash;G386.</span></li>
<li><span>Ruiz, J.R., et al., <em>Physical activity, sedentary time, and liver enzymes in adolescents: the HELENA study</em>. Pediatric Research, 2014. 75(6): p. 798&ndash;802.</span></li>
<li><span>Andy, S.Y. and E.B. Keeffe, <em>Elevated AST or ALT to nonalcoholic fatty liver disease: accurate predictor of disease prevalence?</em> 2003, LWW. p. 955&ndash;956.</span></li>
<li><span>Morville, T., et al., <em>Plasma metabolome profiling of resistance exercise and endurance exercise in humans</em>. Cell Reports, 2020. 33(13).</span></li>
<li><span>Childs, B.G., et al., <em>Cellular senescence in aging and age-related disease: from mechanisms to therapy</em>. Nature Medicine, 2015. 21(12): p. 1424&ndash;1435.</span></li>
<li><span>Borst, P., <em>The malate&ndash;aspartate shuttle (Borst cycle): How it started and developed into a major metabolic pathway</em>. IUBMB Life, 2020. 72(11): p. 2241&ndash;2259.</span></li>
<li><span>Marquezi, M.L., et al., <em>Effect of aspartate and asparagine supplementation on fatigue determinants in intense exercise</em>. International Journal of Sport Nutrition and Exercise Metabolism, 2003. 13(1): p. 65&ndash;75.</span></li>
<li><span>Trudeau, F., <em>Aspartate as an ergogenic supplement</em>. Sports Medicine, 2008. 38: p. 9&ndash;16.</span></li>
<li><span>Fibriansah, G., et al., <em>Structural basis for the catalytic mechanism of aspartate ammonia lyase</em>. Biochemistry, 2011. 50(27): p. 6053&ndash;6062.</span></li>
<li><span>Lala, V., M. Zubair, and D.A. Minter, <em>Liver function tests</em>, in <em>StatPearls [internet]</em>. 2022, StatPearls Publishing.</span></li>
<li><span>Oesen, S., et al., <em>Effects of elastic band resistance training and nutritional supplementation on physical performance of institutionalised elderly&mdash;A randomized controlled trial</em>. Experimental Gerontology, 2015. 72: p. 99&ndash;108.</span></li>
<li><span>Teahan, O., et al., <em>Impact of analytical bias in metabonomic studies of human blood serum and plasma</em>. Analytical Chemistry, 2006. 78(13): p. 4307&ndash;4318.</span></li>
<li><span>Dunn, W.B., et al., <em>Procedures for large-scale metabolic profiling of serum and plasma using gas chromatography and liquid chromatography coupled to mass spectrometry</em>. Nature Protocols, 2011. 6(7): p. 1060&ndash;1083.</span></li>
<li><span>Weckwerth, W., K. Wenzel, and O. Fiehn, <em>Process for the integrated extraction, identification and quantification of metabolites, proteins and RNA to reveal their co-regulation in biochemical networks.</em> Proteomics, 2004. 4(1): p. 78&ndash;83.</span></li>
<li><span>Hardoon, D.R., S. Szedmak, and J. Shawe-Taylor, <em>Canonical correlation analysis: An overview with application to learning methods</em>. Neural Computation, 2004. 16(12): p. 2639&ndash;2664.</span></li>
<li><span>Zhou, Z.-H., <em>Ensemble methods: foundations and algorithms</em>. 2012: CRC Press.</span></li>
<li><span>Ke, G., et al., <em>LightGBM: A highly efficient gradient boosting decision tree</em>. Advances in Neural Information Processing Systems, 2017. 30.</span></li>
<li><span>Chen, T. and C. Guestrin. <em>XGBoost: A scalable tree boosting system</em>. in <em>Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</em>. 2016.</span></li>
<li><span>LeCun, Y., Y. Bengio, and G. Hinton, <em>Deep learning</em>. Nature, 2015. 521(7553): p. 436&ndash;444.</span></li>
<li><span>Feurer, M. and F. Hutter, <em>Hyperparameter optimization</em>, in <em>Automated machine learning</em>. 2019, Springer, Cham. p. 3&ndash;33.</span></li>
<li><span>Steuer, R., et al., <em>Observing and interpreting correlations in metabolomic networks</em>. Bioinformatics, 2003. 19(8): p. 
1019\u0026ndash;1026.\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"},{"header":"Tables","content":"\u003cp\u003eTable 1 is available in the Supplementary Files section.\u003c/p\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":true,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"npj-systems-biology-and-applications","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"npjsba","sideBox":"Learn more about [npj Systems Biology and Applications](http://www.nature.com/npjsba/)","snPcode":"41540","submissionUrl":"https://submission.springernature.com/new-submission/41540/3","title":"npj Systems Biology and Applications","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"NPJ","inReviewEnabled":true,"inReviewRevisionsEnabled":true},"keywords":"Active aging, metabolic network inference, automatic machine learning, data-driven modeling","lastPublishedDoi":"10.21203/rs.3.rs-5377652/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-5377652/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003ePhysical inactivity and weak fitness status have become a global health concern. Metabolomics, as an integrative systematic approach, might link to individual\u0026rsquo;s fitness at the molecular level. In this study, we performed blood samples metabolomics analysis of a cohort of elderly people with different treatments. 
By defining two fitness groups and their corresponding metabolite profiles, we tested several machine-learning classification methods to identify key metabolite biomarkers; these robustly identified aspartate as a dominant negative marker of fitness. Next, the metabolomics data of the two groups were analyzed with COVRECON, a novel approach for inferring metabolic interaction networks, which identified the enzyme AST as the most important difference in metabolic regulation between the fit and the less fit groups. Routine blood tests in two cohorts confirmed significant differences in AST and ALT. In summary, we combine machine-learning classification and COVRECON to identify metabolomics biomarkers and causal processes for fitness of elderly people.\u003c/p\u003e","manuscriptTitle":"Machine learning and data-driven inverse modeling of metabolomics unveil key process of active aging","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2024-11-19 10:03:32","doi":"10.21203/rs.3.rs-5377652/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"decision","content":"Revision 
requested","date":"2025-03-30T08:01:17+00:00","index":"","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-03-30T06:20:45+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-03-28T10:55:55+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"57807401047206321731528265817639531561","date":"2025-03-11T07:36:41+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"270739844824078076573473403621623703470","date":"2025-03-11T05:49:21+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"94001379801260722167650880798964012911","date":"2025-03-11T05:12:02+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"330892250731476100212212292538240978509","date":"2025-03-11T02:57:16+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"101220980223999901451853110858578748963","date":"2025-03-09T07:53:15+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"294879331508772953934823616971204919296","date":"2025-01-29T11:49:20+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"80534360109830384266414757537412927536","date":"2025-01-26T14:40:37+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"153422745210352549254442604258640460059","date":"2025-01-24T15:22:01+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"121859932260634382384245686908913684780","date":"2024-12-13T19:37:05+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2024-11-13T18:49:43+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"200406094338313943935024466749336326713","date":"2024-11-07T13:21:19+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"78292735512860522992751604450230702442","date":"2024-11-06T11:02:21+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date"
:"2024-11-06T07:49:04+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2024-11-06T00:30:55+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2024-11-05T13:41:25+00:00","index":"","fulltext":""},{"type":"submitted","content":"npj Systems Biology and Applications","date":"2024-11-02T10:04:32+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"npj-systems-biology-and-applications","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"npjsba","sideBox":"Learn more about [npj Systems Biology and Applications](http://www.nature.com/npjsba/)","snPcode":"41540","submissionUrl":"https://submission.springernature.com/new-submission/41540/3","title":"npj Systems Biology and Applications","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"NPJ","inReviewEnabled":true,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"71b976c5-2110-434d-b880-119f2b18a17c","owner":[],"postedDate":"November 19th, 2024","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"published-in-journal","subjectAreas":[{"id":40230122,"name":"Health sciences/Biomarkers"},{"id":40230123,"name":"Health sciences/Health care"},{"id":40230124,"name":"Biological sciences/Systems biology/Biochemical networks"},{"id":40230125,"name":"Biological sciences/Systems biology/Dynamic networks"}],"tags":[],"updatedAt":"2025-09-29T16:00:18+00:00","versionOfRecord":{"articleIdentity":"rs-5377652","link":"https://doi.org/10.1038/s41540-025-00580-4","journal":{"identity":"npj-systems-biology-and-applications","isVorOnly":false,"title":"npj Systems Biology and Applications"},"publishedOn":"2025-09-24 15:57:05","publishedOnDateReadable":"September 24th, 2025"},"versionCreatedAt":"2024-11-19 
10:03:32","video":"","vorDoi":"10.1038/s41540-025-00580-4","vorDoiUrl":"https://doi.org/10.1038/s41540-025-00580-4","workflowStages":[]},"version":"v1","identity":"rs-5377652","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-5377652","identity":"rs-5377652","version":["v1"]},"buildId":"qtupq5eGEP_6zYnWcrvyt","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
