Driving analysis and prediction of COD based on frequency division

preprint OA: closed
Full text JSON View at publisher
Full text 131,014 characters · extracted from preprint-html · click to expand
Driving analysis and prediction of COD based on frequency division | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Driving analysis and prediction of COD based on frequency division Mei Li, Kexing Chen, Deke Wang, Rui Xu This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-4716541/v1 This work is licensed under a CC BY 4.0 License Status: Published Journal Publication published 20 Feb, 2025 Read the published version in Stochastic Environmental Research and Risk Assessment → Version 1 posted 4 You are reading this latest preprint version Abstract COD (Chemical Oxygen Demand) is an important indicator to measure organic pollution of water body. To strengthen in-depth analysis and prediction of COD, a new method was proposed in this paper. A frequency division method, Variational Mode Decomposition (VMD) was used to complete time domain decomposition of COD data before model simulation. The original data was separated into five signals with different frequency bands, IMF1, IMF2, IMF3, IMF4 and IMF5, with which the influence of meteorological factors and water quality factors on COD were explored. The long-term COD content is mainly driven by nutrient factors phosphorus and nitrogen, while the immediate fluctuation characteristics exhibit relatively stability. Random Forest, Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) were used to predict COD with the original data and the signal data processed by VMD. It is found that frequency division can improve simulation stability and accuracy of GRU and LSTM more significantly than Random Forest. VMD-GRU and VMD-LSTM models can be used reliably for COD analyzation and prediction in Chengdu area. frequency division driving analysis prediction model COD Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Figure 6 Figure 7 Figure 8 1 Introduction COD is considered an important indicator for characterizing the presence of organic pollutants in the water environment. The higher COD value indicates more severe pollution by organic pollutants in the water. Organic pollutants are often transferred to water bodies through atmospheric dry and wet deposition, agricultural runoff and wastewater discharge (Arfaeinia et al. 2017 ). Bioaccumulation may be caused by these pollutants when they are deposited into the atmosphere and enter water bodies, posing potential toxicity to both biota and humans (Agrell et al. 2002 ). A large amount of organic substances have been found to exist in the aqueous phase or particle extract during rainstorm sampling, expanding the scope of pollution as they migrate with the runoff (Glaser et al. 2023 ). A study examining organic compounds in 38 rivers of the United States were examined, which showed the 10 most widespread anthropogenic organisms are strongly associated with agricultural runoff (Bradley et al. 2017 ). By time-of-flight mass spectrometry (GC × GC-TOF-MS) combined with two-dimensional gas chromatography, organic pollutants in Yongding River basin were analyzed. It was found that the types of organic pollutants in downstream rivers were relatively similar to those in the effluent of sewage treatment plants (Jiang et al. 2023 ), which showed that wastewater treatment plant effluents are a non-negligible source of organic pollutants in surface runoff. A survey of 46 wastewater treatment plant effluents in China were found to contain 302 trace organic pollutants, accounting for 59 percent of the total organic matter (Liu et al. 2024 ). Organic pollutants pose a serious threat to the environment due to their toxic characteristics over prolonged period (Kumar et al. 2022 ). Organic pollutants can suffocate aquatic organisms by disrupting photosynthesis of aquatic plants and leading to significant reduction in water quality which can indirectly affect human health (Muralikrishna and Manickam 2017 ). Mussels farmed in the coast of the Gulf of Naples and Domitio had been found to pose a high food safety risk in terms of heavy metals and persistent organic pollutants (POPs) (Esposito et al. 2020 ). POPs pose a greater risk in these environments because they are difficult to be broken down and can easy to be accumulated in the air, water, soil, and fat cells of animals (Shetty et al. 2023 ; Zaynab et al. 2021 ; Lan et al. 2024 ; Mallah et al. 2022 ; Yua et al. 2018). Due to the toxicity of some organic pollutants to human health, its characteristics and toxicity have become the focus of global attention (Li et al. 2017 ). Researches by scholars not only focus on individual organic pollutants but also emphasize the measurement of organic content. Measuring and analyzing COD content has also been a hot topic of research. In China, a lot of work has been taken out to measure, analyze and control COD. But there are still many challenges (Li et al. 2012 ). Table 1 shows the status of COD emission and control in China. Table 1 Status of COD emission and control in China Items China Formulation Mainly national standards, A small quantity local standards Implementation 1) National standard specified uniform limits for similar dischargers 2) The control items are mainly centralized control. The total quantity control is comprehensively implemented. 3) Local standards outperform national standards in execution Control level 1) Compared to the lower treatment rate of ammonia nitrogen, COD treatment rate reaches over 90%. 2) The monitoring and supervision system is largely established, but the development of the shared system is uneven. Limits of major pollutants 1)National standard: Standard A: 50mg/L, Standard B: 60mg/L. 2)Sichuan Province Standards: Urban sewage treatment plant༚30mg/L The continuous determination of COD with long time series can be helpful to obtain a lot of necessary data for the analysis and prediction of organic matter in watershed. In order to achieve this goal, many methods for COD measurement were developed continuously and the regional, conventional and continuous monitoring methods were gradually realized. Traditional methods for COD measurement, such as potassium dichromate, potassium permanganate, and UV-vis spectroscopy, Ultraviolet-visible (UV-vis) spectroscopy combined with stoichiometric tools was used to determine COD content. This method is particularly suitable for real-time and rapid determinations while a large number of sample data is required for modeling (Li et al. 2018 ). Carbon point fluorescent capillary sensors are also used to determine COD. This method has the advantages of low detection limit, good linear range and reliability, portability and low cost. But it is easy to be limited by interference protection and environmental impact (Zhang et al. 2023 ). A nano-lead dioxide-composite electrochemical sensor has been used for COD determination offering fast response times, simple instrumentation, low cost, high detection sensitivity and wide linear range. However, sampling limitations and sensor stability can pose challenges (Wang et al. 2022 ). Currently, COD monitoring has been further developed to automatic monitoring. The CL system with flow injection was applied for COD determination, which has the obvious advantages of much shorter analysis time, simple operation except for a lower detection rate (Li et al. 2003 ). A new reagent-free method for measuring chemical oxygen demand (COD) was proposed based on ultraviolet absorption spectroscopy (UV-AS) which enables high-precision and long-range COD measurement by automatically selecting the wavelength and analyzing the full spectrum of the data. This method can therefore be used for in situ and online environmental monitoring (Wang et al. 2019 ). The development of automated monitoring has significantly improved the efficiency and frequency of COD monitoring. It can realize the simultaneous determination of COD over a wide area and provide technical and methodological support for regional COD monitoring research. To better understand the changing patterns and drivers of organic matter in water bodies, a variety of models have been established by scientists to analyze and predict COD. These models input actual monitoring data and predicted values of COD. The dynamics of COD in water bodies are studied to analyze and manage the flow patterns and long-term series of organic matter. The removal of COD from polluted solutions has been simulated and predicted using artificial neural networks (ANN) (Elmolla et al. 2010 ; Masouleh et al. 2022 ). One main advantage of ANN is its ability to handle complex relationships between input and output variables (Ataei et al. 2021). However, ANN techniques require large datasets to train the model and are computationally expensive (Khanmohammadi et al. 2024 ). Faster convergence, higher convergence accuracy, and better pattern recognition are achieved by recurrent neural networks (RNN) compared to ANN (Al-Qaili et al. 2024 ; Gholami et al. 2023 ). RNNs are more suitable for time series prediction. However, RNNs suffer from the problem of long-term dependence, i.e., gradient vanishing and gradient explosion will be met when RNNs learns long sequences, which causes its difficulty to understand nonlinear relationships for long periods of time (Hochreiter 1991 ). Therefore, the use of convolutional neural networks (CNNs) was investigated by some researchers for efficient feature extraction to extract data features. A correlation between COD concentration and spectral reflectance in urban rivers was found and COD concentration was accurately predicted by using a one-dimensional convolutional neural network (1D-CNN) (Cai et al. 2022 ). The Long Short-Term Memory (LSTM) network, introduced by Hochreiter and Schmidhuber ( 1997 ), is an extension of the RNN. Long-term dependencies can be learned by LSTMs, avoiding the exploding or vanishing gradient problem that affects traditional RNNs (Xu et al. 2020 ). For monitoring and predicting the performance of COD in wastewater treatment, a novel LSTM-based soft sensor was developed (Xu et al. 2023 ). The LSTM method can mine the potential information between different water quality indicators at different time scales improving prediction accuracy. Major pollutants such as Biological Oxygen Demand(BOD), Chemical Oxygen Demand(COD), Total Nitrogen(TN), Total Phosphorus(TP), and Ammonia Nitrogen(AN)were predicted and demonstrated to have a high degree of correlation with each other using an integrated multivariate LSTM network(Wang, Xue et al. 2024 ). Although LSTMs can effectively process and predict events with long intervals and delays (Yousfi et al. 2017). A large number of parameters was needed and low convergence rate is existed. For this reason, GRU based LSTM was proposed (Cho et al. 2014 )to simplify the internal cell architecture of LSTM and reduce the network training time in order to guarantee the prediction accuracy. Combining machine learning with sensor networks, multiple machine learning algorithms are employed to predict COD emissions. Comparative results showed that GRU is better than LSTM (Miao et al. 2021 ). Prediction results can be influenced by the model characteristics themselves. How to improve their performance has become a necessary research topic nowadays. Time-frequency domain transformation technique was tried to be combined with some models to enhance the simulation effects. A complex signal can be decomposed into a series of intrinsic modal functions (IMF) with different frequencies and amplitudes by frequency division. the complexity and strong nonlinearities in a time series can be effectively reduce while obtaining a relatively stable subsequence that contains several different frequency scales. This approach extracts the main data components, and representing them as subsequences with different frequency features, thus optimizing the simulation effect of some intelligent models. The Variational Mode Decomposition (VMD), an adaptive and fully non-recursive signal processing method, was proposed in 2014(Dragomiretskiy and Zosso 2014 ) with the advantage of determining the number of mode decompositions. The VMD was used to decompose river flow into IMF, and the mixed models RVFL_VMD, GRNN_VMD and RBFNN_VMD were established to predict river turbidity. The best performance was achieved by RVFL_VMD on the hourly time scale, while GRNN_VMD provided the best prediction on the daily time scale (Heddam et al. 2022). VMD can improve the accuracy of power consumption forecasts. The combination of Bi-directional Gated Recurrent Unit (BiGRU) and LSTM models with VMD showed excellent prediction accuracy on various assessment metrics, especially for short to medium term predictions (Ahmed et al. 2023 ). The VMD method can be used to study the periodicity of vegetation and its relationship with climate (Wang et al. 2023 ). A new wind power forecasting method called IVMD combining VMD and HFCM was developed to achieve more accurate wind power generation prediction and reduce the prediction error by extracting time series features and learning the weights using Bayesian ridge regression method (Qiao et al. 2022 ). The spectral characteristics of VMD can be used to sort iron ore in hyperspectral images (Nie et al. 2023 ). In this paper, taking the rivers in Chengdu area of China were selected as the research focus, with COD as the primary research index. The frequency division technique of VMD was applied to process the raw COD data. Moreover, three models — Random Forest, LSTM and GRU were integrated with VMD to develop an optimized model framework. The research aimed to find a new method was hoped to be found for the prediction and analysis of organic matter in the Chengdu watershed. 2 Methodology 2.1 Used data parameters Data parameters include water quality data and meteorological data. Water quality data includes WT (Water Temperature), PH (pH Value), DO (Dissolved Oxygen), COD Mn (Potassium Permanganate Index), NH 3 -N (Ammonia Nitrogen), TP (Total Phosphorus), TN (Total Nitrogen), EC (Electric Conductivity) and NTU (Turbidity). Data from 2020 to 2022 were obtained from daily monitoring of state-controlled cross sections. Monitoring was conducted six times a day at 0:00, 4:00, 8:00, 12:00, 16:00 and 20:00. Meteorological data includes T (Mean Air Temperature), MT (Maximum Air Temperature), LT (Minimum Air Temperature), BP (Mean Air Pressure), MBP (Maximum Air Pressure), LBP (Minimum Air Pressure), RH (Relative Humidity), LH (Minimum Relative Humidity) and P (Precipitation). The daily meteorological data from 2020 to 2022 were obtained from two meteorological stations of Chengdu area, monitoring once a day. 2.2 Data processing (1) Missing values: A small amount of missing data was found in COD Mn , NH 3 -N, TP and TN. Polynomial interpolation was used to make up the missing values. (2) Anomalous values: A small amount of anomalous data was found in EC, NTU, P, LH, RH and LT. The anomalous values were filled in by polynomial interpolation. 2.3Variational mode decomposition If the COD (Chemical Oxygen Demand) in the river water corresponds to an approximate function f (t) and it is the sum of the set of k (i.e., IMF) components, the constraint problem is expressed ass (Ling et al. 2021 ; Li et al. 2021 b; Xu et al. 2020 ): $$\:\begin{array}{rr}&\:\underset{{u}_{k},{\omega\:}_{k}}{\text{m}\text{i}\text{n}}\left\{\sum\:_{k=1}^{K}{∥{\partial\:}_{t}\left[\left(\delta\:\left(t\right)+\frac{j}{\pi\:t}\right)\text{*}{\mu\:}_{k}\left(t\right)\right]{e}^{-j{\omega\:}_{k}\left(t\right)}∥}_{2}^{2}\right.\\\:&\:such.that.\sum\:_{k=1}^{K}{\mu\:}_{k}=f\end{array}$$ 1 where (t) is the time, δ (t) corresponds to the impulse function, i.e., the Dirac distribution and { u k ​ (t)¼ { u 1 (t), u 2 (t),…, u k ​ (t)} is the series of the decomposed K Intrinsic Mode Functions (IMF) modes, and the set of the center frequencies of each IMF is given as {ω k (t)¼{ω 1 (t), ω 2 (t),…,ω k (t)} (Xu et al., 2020 ). f is a given constant. 3 Results and Discussion 3.1 VMD decomposition results Figure 1 shows the original data and the decomposed IMF1 to IMF5 signals from low to high frequencies. IMF1 usually corresponds to the low-frequency component of the data and represents the long-term trend. IMF2 to IMF5 representing the short-term abrupt changes in the data, are usually correspond to rapid changes, fluctuations or transient features in the data. Figure 2 shows the Fourier spectrum of the original data and the decomposed signals. The distribution of the signal in the frequency domain in Fig. 2 indicates the intensity or amplitude of the different frequency components contained in the signal. Each wave peak corresponds to the intensity or energy of a particular frequency component. The higher the crest, the stronger the energy of the corresponding frequency component, indicating the greater the contribution of that frequency component in the signal. As can be seen from Fig. 2 , there is basically no frequency entanglement between the decomposed signals. 3.2Driving analysis of COD change Pearson's correlation analysis was used to analyze the correlation between variables driving the COD change and COD content. Positive value of the correlation coefficient indicates positive correlation, negative value indicates negative correlation. The larger the absolute value of the value, the stronger the correlation. The correlation judgment criteria are shown in Table 2 (Jia 2018). The correlation values of COD Mn with water quality data and hydrological data in the original data and the decomposed signals IMF1-IMF5 are shown in Table 3 . Table 2 Correlation criterion (Jia et al. 2018) Absolute value of the correlation value Relevance 0.0-0.3 No relevant 0.3–0.8 relatively strong correlated 0.8-1.0 Strong correlation Table 3 Correlation between the original data, decomposed signals and COD Mn . Factor Original Data IMF1 IMF2 IMF3 IMF4 IMF5 WT 0.28 0.38 0.11 0.00 0.00 0.01 PH -0.44 -0.41 -0.58 -0.01 -0.00 -0.02 DO -0.57 -0.60 -0.33 -0.01 -0.00 -0.01 NH 3 -N 0.57 0.55 0.42 0.01 0.01 0.04 TP 0.69 0.70 0.71 0.02 0.10 0.45 TN 0.55 0.54 0.34 0.02 0.05 0.34 EC 0.20 0.26 0.04 -0.00 0.00 0.01 NTU 0.30 0.32 0.17 0.01 0.02 0.09 T 0.22 0.36 -0.19 -0.14 0.00 0.00 MT 0.20 0.37 -0.33 -0.20 0.01 0.00 BP -0.25 -0.41 0.11 0.01 0.00 0.01 MBP -0.24 -0.41 0.12 0.01 0.00 0.01 LBP -0.26 -0.44 0.12 0.01 0.00 0.01 RH 0.03 -0.10 0.15 -0.00 -0.00 -0.00 LH 0.03 -0.11 0.19 -0.02 -0.01 -0.00 P 0.15 0.13 0.28 0.08 -0.01 0.00 In the long term, the absolute correlation value between TP, DO, NH 3 -N, TN, LBP, PH, BP, MBP, WT, T, NTU and COD is between 0.3–0.6, indicating a relatively strong correlation. Among these, the correlation values for range from 0.54 to 0.70, showing that COD content is more influenced by water quality factors than by climate factors. Nitrogen and phosphorus are the main indicators of eutrophication in water bodies, suggesting that long-term changes in COD are strongly positively correlated with eutrophication. That is, the more serious the pollution of organic matter in water bodies, the more serious the eutrophication of water bodies. The change in DO also positively correlates with eutrophication, further verifying the strong correlations between eutrophication and COD. When there is an excess input of nutrients (e.g., nitrogen, phosphorus, etc.) to a water body, eutrophic substances increase the concentration of COD and cause other changes. Eutrophic waters usually contain more organic waste and organic material, resulting in an increase in COD Mn content in the water bodies as well. This relation between eutrophic substances with COD provides a simplified idea for analyzing the long-term pollution characteristics of water bodies in Chengdu. When data on nitrogen, phosphorus, or COD are missing or abnormal, the strong correlations among these indicators can be used to infer the missing values. Characteristics changes of one indicator can also be used to infer the changes in other indicators. For the relatively longer short-term change characteristics of COD, the eutrophication index nitrogen and phosphorus and PH, DO, MT still have some influence. But except the nitrogen and phosphorus, the influence of other factors basically disappeared with the gradual elimination of the signals of the influencing factors in the short time series. It is reasonable to speculate that short-term changes in COD are largely unaffected by water and climatic parameters, demonstrating the relative stability of the instantaneous characteristics of COD. 3.3 Simulation results with or without frequency division Firstly, Random Forest, LSTM, and GRU models were used to predict COD with the original data. Then, the original data was decomposed, and the signals with a correlation of less than 0.3 with the predicted target COD were eliminated from the variable modal decomposition signals. Thus, Random Forest, LSTM, and GRU models were used again to predict COD with the filtered signal data. The predicted results with and without frequency division were compared by two methods. One is Scatter plots of the predicted value and the true values, the other is three kinds of error values, namely mean absolute error (MAE), root mean square error (RMSE) and symmetric mean absolute percentage error (SMAPE). 3.3.1 Simulation results with Random Forest and VMD-Random Forest The prediction results with the Random Forest and VMD-Random Forest model were shown in Fig. 3. The scatter plot showing the fitting degree between the true values and the predicted values was shown in Fig. 4. The simulation effect before and after frequency division was evaluated by mean absolute error (MAE), root mean square error (RMSE), and symmetric mean absolute percentage error (SMAPE) were shown in Table 4 . Table 4 Quantitative metrics for Random Forest and VMD- Random Forest MODEL RMSE MAE SMAPE Random Forest 0.397 0.285 0.14 VMD-Random Forest 0.293 0.224 0.104 By Fig. 4 and Table 4 , it can be concluded that the simulation effect is improved because of VMD processing. The improvement rates are 26.20%, 21.40%, and 25.71%, respectively. It can be found that VMD can improve the simulation effect of Random Forest to some extent. 3.3.2 Simulation results with GRU and VMD-GRU The simulation results with GRU and VMD-GRU models are shown in Fig. 5. Scatter plot was shown in Fig. 6. The values of MAE, RMSE and SMAPE were shown in Table 5 . Table 5 Quantitative metrics for GRU and VMD-GRU MODEL RMSE MAE SMAPE GRU 0.502 0.395 0.175 VMD-GRU 0.14 0.098 0.046 After the original data was processed by VMD, the values of RMSE, MAE and SMAPE decreased, and the degree of dispersion shown in Fig. 6 was reduced. This indicates that VMD can significantly improve the prediction accuracy of the GRU model. The simulation performance of GRU was improved by 72.11%, 75.19%, and 73.14% respectively. This demonstrates that VMD frequency division can more effectively enhance the GRU model's ability to understand and capture data by eliminating interference factors, thereby improving the simulation effect of the GRU. 3.3.3 Simulation results with LSTM and VMD-LSTM The simulation results with LSTM and VMD-LSTM were shown in Fig. 7. Scatter plot was shown in Fig. 8. The error values of MAE, RMSE and SMAPE were shown in Table 6 . Table 6 Quantitative metrics for the LSTM and VMD-LSTM MODEL RMSE MAE SMAPE LSTM 0.772 0.578 0.24 VMD-LSTM 0.205 0.156 0.073 The values of LSTM RMSE, MAE and SMAPE reduced after VMD processing, indicating that VMD-LSTM model achieves lower error and higher accuracy compared to the original LSTM model. VMD frequency division significantly improves the prediction performance. The improvement rate of VMD to LSTM model is 73.45%, 73.01%, and 69.58% respectively. 4 Conclusions COD is often used as an important indicator to evaluate the organic pollution in water environments. Although there have been many researches on water quality prediction models, the prediction accuracy of these models using raw data often falls short of ideal results. This is because water quality is affected by many factors from weather, human activities, and the water environment. There are complex interaction relations among these factors, which is difficult to be captured by traditional water quality prediction models so as to reduce the prediction accuracy of the model. In this paper, a new COD prediction method was proposed. Taking water system in Chengdu area of China as the research object, a kind of frequency division method, VMD method was tried to complete the time domain decomposition of COD data before model simulation. the frequency division method was used to separate the COD original data into signals of different frequency bands, namely IMF1, IMF2, IMF3, IMF4 and IMF5. Using these different signal data, the influence of COD on meteorological factors and water quality factors was explored. Based on the correlation between signals of different frequency bands and COD, it is found that the long-term COD content can be affected by eutrophication factors such as phosphorus and nitrogen relatively easily. The short-term immediate change characteristics of COD are relatively stable and are less affected by any water quality factors and meteorological factors. Using the original data and the signal data without weak correlated factors, the simulation effects of GRU, LSTM and Random Forest models on COD were compared. By the frequency division, the original data was decomposed to signals with different frequency, and the signals with a correlation of less than 0.3 were eliminated further from the variable modal decomposition signals. Removing the interference factors of COD is conducive to improving the simulation effect. The simulation results verified this. When combining Random Forest, GRU or LSTM with variational mode decomposition in COD prediction, the frequency division is helpful to improve the stability and accuracy of the simulation of these models. It has better promotion effects on GRU and LSTM with higher improvement rates. The improvement rate can even amount to more than 70% after VMD processing, especially for LSTM and GRU simulation. The VMD-GRU model and the VMD-LSTM model can be used as more reliable tools for water quality analyzation and management in Chengdu area. This simulation method constructed in this paper can also be considered for exploring other water environment indicators in different regions, potentially providing a more reliable tool for water quality analysis and management. Declarations Funding This work was supported by [the National Natural Science Foundation] (Grant numbers [52050410328] and [62266014]) and [the Guangxi Natural Science Foundation] (Grant number 2021GXNSFAA220056). Author Contributions [Mei Li] was involved in the initial draft and editing; [Kexing Chen] contributed to the investigation and methodology; [Deke Wang] was responsible for data curation; [Rui Xu] was involved in the review and overall supervision. Data Availability Statement Some or all data, models, or code that support the findings of this study are available from the corresponding author upon reasonable request: (1) original data of COD, (2) models, and (3) codes. Conflict of interest The authors declare they have no conflict of interest. References Ahmed AAM, Bailek N, Abualigah L, Bouchouicha K, Kuriqi A, Sharifi A, Sareh P (2023) Global control of electrical supply: A variational mode decomposition- aided deep learning model forenergy consumption prediction. Energy Rep 10:2152–2165. https://doi.org/10.1016/j.egyr.2023.08.076 Agrell C, Larsson P, Okla L, Agrell J (2002) PCB congeners in precipitation, wash out ratios and depositional fluxes within the Baltic Sea region, Europe. Atmos Environ 36(2):371–383. https://doi.org/10.1016/S1352-2310(01)00228-X Al-Qaili AH, Al-Mansour AI, Al-Solieman H, AlSharabi K (2024) RNN-based pavement moduli prediction for flexible pavement design enhancement. Case Stud Constr Mater 20. https://doi.org/10.1016/j.cscm.2023.e02811 Arfaeinia H, Asadgol Z, Ahmadi E, Seifi M, Moradi M, Dobaradaran S (2017) Characteristics, distribution and sources of polychlorinated biphenyls (PCBs) in coastal sediments from the heavily industrialized area of Asalouyeh, Iran. Water Sci Technol 76(11–12):3340–3350. https://doi.org/10.2166/wst.2017.500 Bradley PM, Journey CA, Romanok KM, Barber LB, Buxton HT, Foreman WT, Furlong ET, Glassmeyer ST, Hladik ML (2017) Expanded Target-Chemical Analysis Reveals Extensive Mixed-Organic-Contaminant Exposure in U.S. Streams. Environ Sci Technol 51(9):4792–4802. https://doi.org/10.1021/acs.est.7b00012 Cai J, Meng L, Liu H, Chen J, Xing Q (2022) Estimating Chemical Oxygen Demand in estuarine urban rivers using unmanned aerial vehicle hyperspectral images. Ecol Indic 139:108936. https://doi.org/10.1016/j.ecolind.2022.108936 Cho K, van Merrienboer B, Bahdanau D, Bengio Y (2014) On the Properties of Neural Machine Translation: Encoder-Decoder Approaches. In Proceedings of SSST@ EMNLP (pp. 103–111). https://doi.org/10.1016/j.jconhyd.2023.104287 Dragomiretskiy K, Zosso D (2014) Variational Mode Decomposition. IEEE Trans Signal Process 62(3):531–544. https://doi.org/10.1109/TSP.2013.2288675 Elmolla ES, Chaudhuri M, Eltoukhy MM (2010) The use of artificial neural network (ANN) for modeling of COD removal from antibiotic aqueous solution by the Fenton process. J Hazard Mater 179(1):127–134. https://doi.org/10.1016/j.jhazmat.2010.02.068 Esposito M, Canzanella S, Lambiase S, Scaramuzzo A (2020) Organic pollutants (PCBs, PCDD/Fs, PAHs) and toxic metals in farmed mussels from the Gulf of Naples (Italy): Monitoring and human exposure. Reg Stud Mar Sci 40:101497. https://doi.org/10.1016/j.rsma.2020.101497 Gholami H, Mohammadifar A, Golzari S, Song Y (2023) Interpretability of simple RNN and GRU deep learning models used to map land susceptibility to gully erosion. Sci Total Environ 904:166960. https://doi.org/10.1016/j.scitotenv.2023.166960 Glaser C, Escher BI, Engelhardt M, Liu Y, Krauss M (2023) Particle-associated organic contaminant and cytotoxicity transport in a river during storm events. J Hydrol 621:129600. https://doi.org/10.1016/j.jhydrol.2023.129600 Hochreiter S (1991) Investigations on dynamic neural networks. Dissertation, Johannes Kepler University, Linz Hochreiter S, Schmidhuber J (1997) Long Short-Term Memory. Neural Comput 9(8):1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735 Jiang J, Zhao J, Zhao G, Liu L (2023) Recognition, possible source, and risk assessment of organic pollutants in surface water from the Yongding River Basin by non-target and target screening. Environ Pollut 331:121895. https://doi.org/10.1016/j.envpol.2023.121895 Khanmohammadi M, Rahmani F, Shahrouzi JR, Sene RA (2024) Insightful properties-performance study of Ti–Cu–O heterojunction sonochemically embedded in mesoporous silica matrix for efficient tetracycline adsorption and photodegradation: RSM and ANN-based modeling and optimization. Chemosphere 141223. https://doi.org/10.1016/j.chemosphere.2024.141223 Kolb M, Bahadir M, Teichgräber B (2017) Determination of chemical oxygen demand (COD) using an alternative wet chemical method free of mercury and dichromate. Water Res 122:645–654. https://doi.org/10.1016/j.watres.2017.06.034 Kumar R, Gupta K, Bordoloi N (2022) The potential of engineered endophytic bacteria to improve phytoremediation of organic pollutants. In: Advances in Microbe-assisted Phytoremediation of Polluted Sites, 477–496. Elsevier. https://doi.org/10.1016/B978-0-12-823443-3.00014-4 Lan Y, Gao X, Xu H, Li M (2024) 20 years of polybrominated diphenyl ethers on toxicity assessments. Water Res 249:121007. https://doi.org/10.1016/j.watres.2023.121007 Li B, Zhang Z, Wang J, Chunli (2003) Chemiluminescence system for automatic determination of chemical oxygen demand using flow injection analysis. Talanta 61(5):651–658. https://doi.org/10.1016/S0039-9140(03)00326-6 Li J, Li F, Liu Q (2017) PAHs behavior in surface water and groundwater of the Yellow River estuary: Evidence from isotopes and hydrochemistry. Chemosphere 178:143–153. https://doi.org/10.1007/s11356-011-0572-7 Li J, Tong Y, Guan L, Wu S, Li D (2018) Optimization of COD determination by UV–vis spectroscopy using PLS chemometrics algorithms. Optik 174:591–599. https://doi.org/10.1016/j.ijleo.2018.08.111 Li R, Chen X, Balezentis T, Streimikiene D, Niu Z (2021) Multi-step least squares support vector machine modeling approach for forecasting short-term electricity demand with application. Neural Comput Appl 33:301–320. https://doi.org/10.1007/s00521-020-04996-3 Li WW, Sheng GP, Zeng RJ, Liu XW, Yu HQ (2012) China’s wastewater discharge standards in urbanization. Environ Sci Pollut Res 19(5):1422–1431. https://doi.org/10.1016/j.chemosphere.2017.03.052 Ling Q, Zhang Q, Zhang J, Kong L, Zhang W, Zhu L (2021) Prediction of landslide displacement using multikernel extreme learning machine and maximum information coefficient based on variational mode decomposition: a case study in Shaanxi, China. Nat Hazards 108(3):1–22. https://doi.org/10.1007/s11069-021-04713-w Liu J, Yang F, Cai Y, Lu G, Li Y, Li M, Fan L, Gao L (2024) Unveiling the existence and ecological hazards of trace organic pollutants in wastewater treatment plant effluents across China. Eco-Environment Health 3(1):21–29. https://doi.org/10.1016/j.eehl.2023.09.006 Mallah MA, Changxing L, Mallah MA, Noreen S, Liu Y (2022) Polycyclic aromatic hydrocarbon and its effects on human health: An overview. Chemosphere 296:133948. https://doi.org/10.1016/j.chemosphere.2022.133948 Masouleh SY, Mozaffarian M, Dabir B, Ramezani SF (2022) COD and ammonia removal from landfill leachate by UV/PMS/Fe2 + process: ANN/RSM modeling and optimization. Process Saf Environ Prot 159:716–726. https://doi.org/10.1016/j.psep.2022.01.031 Miao S, Zhou C, AlQahtani SA, Alrashoud M (2021) Applying machine learning in intelligent sewage treatment: A case study of chemical plant in sustainable cities. Sustain Cities Soc 72:103009. https://doi.org/10.1016/j.scs.2021.103009 Muralikrishna IV, Manickam V (2017) Chapter One - Introduction. In: Environmental Management, Muralikrishna IV, Manickam V (eds) Butterworth-Heinemann, pp 1–4. https://doi.org/10.1016/B978-0-12-811989-1.00001-4 Nie C, Jiang J, Deng J, Li K, Jia L, Sun T (2023) Predicting TFe content and sorting iron ores from hyperspectral image by variational mode decomposition-based spectral feature. J Clean Prod 429:139629. https://doi.org/10.1016/j.jclepro.2023.139629 Qiao B, Liu J, Wu P, Teng Y (2022) Wind power forecasting based on variational mode decomposition and high-order fuzzy cognitive maps. Appl Soft Comput 129:109586. https://doi.org/10.1016/j.asoc.2022.109586 Shetty SS, Deepthi D, Harshitha S, Sonkusare S, Naik PB, Kumari SN, Madhyastha H (2023) Environmental pollutants and their effects on human health. Heliyon 9(9). https://doi.org/10.1016/j.heliyon.2023.e19496 Wang C, Li W, Huang M (2019) High precision wide range online chemical oxygen demand measurement method based on ultraviolet absorption spectroscopy and full-spectrum data analysis. Sens Actuators B Chem 300:126943. https://doi.org/10.1016/j.snb.2019.126943 Wang H, Kang C, Tian Z, Zhang A, Cao Y (2023) Vegetation periodic changes and relationships with climate in Inner Mongolia Based on the VMD method. Ecol Indic 146:109764. https://doi.org/10.1016/j.ecolind.2022.109764 Wang J, Xue B, Wang Y, Wang AY G (2024) Identification of pollution source and prediction of water quality based on deep learning techniques. J Contam Hydrol 261:104287. https://doi.org/10.1016/j.jconhyd.2023.104287 Wang X, Wu D, Yuan D, Wu X (2022) A nano-lead dioxide-composite electrochemical sensor for the determination of chemical oxygen demand. J Environ Chem Eng 10(3):107464. https://doi.org/10.1016/j.jece.2022.107464 Xu B, Pooi CK, Tan KM, Huang S, Shi X, Ng HY (2023) A novel long short-term memory artificial neural network (LSTM)-based soft-sensor to monitor and forecast wastewater treatment performance. J Water Process Eng 54:104041. https://doi.org/10.1016/j.jwpe.2023.104041 Xu W, Jiang Y, Zhang X, Li Y, Zhang R, Fu G (2020) Using long short-term memory networks for river flow prediction. Hydrol Res 51(6):1358–1376. https://doi.org/10.2166/nh.2020.026 Yuan J, Liu Y, Wang J, Zhao Y, Li K, Jing Y, Zhang X (2018) Long-term Persistent Organic Pollutants Exposure Induced Telomere Dysfunction and Senescence- Associated Secretary Phenotype. J Gerontol Biol Sci Med Sci 73(8):1027–1035. https://doi.org/10.1093/gerona/gly002 Yousefi S, Berrani S-A, Garcia C (2017) Contribution of recurrent connectionist language models in improving LSTM-based Arabic text recognition in videos. Pattern Recognit 64:245–254. https://doi.org/10.1016/j.patcog.2016.11.011 Zhang R, Li YS, Luo YX, Zhang XY, Wen R (2023) A carbon-dot fluorescence capillary sensor for the determination of chemical oxygen demand. Microchem J 187:108445. https://doi.org/10.1016/j.microc.2023.108445 Zhang R, Li YS, Luo YX, Zhang XY, Wen R (2023) A carbon-dot fluorescence capillary sensor for the determination of chemical oxygen demand. Microchem J 187:108445. https://doi.org/10.1016/j.microc.2023.108445 Zaynab M, Fatima M, Sharif Y, Sughra K, Sajid M, Khan KA, Sneharani AH, Li S (2021) Health and environmental effects of silent killers Organochlorine pesticides and polychlorinated biphenyl. J King Saud Univ Sci 33(6):101511. https://doi.org/10.1016/j.jksus.2021.101511 Zhang R, Li YS, Luo YX, Zhang XY, Wen R (2023) A carbon-dot fluorescence capillary sensor for the determination of chemical oxygen demand. Microchem J 187:108445. https://doi.org/10.1016/j.microc.2023.108445 Additional Declarations No competing interests reported. Cite Share Download PDF Status: Published Journal Publication published 20 Feb, 2025 Read the published version in Stochastic Environmental Research and Risk Assessment → Version 1 posted Editorial decision: Revision requested 10 Jul, 2024 Editor assigned by journal 10 Jul, 2024 Submission checks completed at journal 10 Jul, 2024 First submitted to journal 10 Jul, 2024 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-4716541","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":325392704,"identity":"7579ced4-8a5c-42bd-892b-ac8ba7feabe7","order_by":0,"name":"Mei Li","email":"","orcid":"","institution":"Chengdu University","correspondingAuthor":false,"prefix":"","firstName":"Mei","middleName":"","lastName":"Li","suffix":""},{"id":325392705,"identity":"259432bd-93a6-480d-a13d-cb47bb920301","order_by":1,"name":"Kexing Chen","email":"","orcid":"","institution":"Chengdu University","correspondingAuthor":false,"prefix":"","firstName":"Kexing","middleName":"","lastName":"Chen","suffix":""},{"id":325392706,"identity":"7d0aabd3-7a0a-4898-81d2-80f8e0f9b201","order_by":2,"name":"Deke Wang","email":"","orcid":"","institution":"Guilin University of Electronic Technology","correspondingAuthor":false,"prefix":"","firstName":"Deke","middleName":"","lastName":"Wang","suffix":""},{"id":325392707,"identity":"cf42fc8d-febe-4701-8fad-7c24f244db2c","order_by":3,"name":"Rui Xu","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAtklEQVRIiWNgGAWjYHACxgdQhgHRWphBSiVI0sImQZoWgxu5zyq/ttnUMbA3b5NgqLlDWIvkjHSz27JtaRIMPMfKJBiOPSOshV8ije22ZNthCQaJHDMJxobDhLWwAbUUS7b9l2CQf0OkFpAtjB/bDgBt4SFSi2TPM2ZphnPJkm08acUWCceI0GJwPI3x448yO35+9sMbb3yoIUILCDDzAAk2ECuBOA3AFPODWJWjYBSMglEwMgEAW/Yvcd2tvt0AAAAASUVORK5CYII=","orcid":"","institution":"Guilin University of Electronic Technology","correspondingAuthor":true,"prefix":"","firstName":"Rui","middleName":"","lastName":"Xu","suffix":""}],"badges":[],"createdAt":"2024-07-10 08:11:24","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-4716541/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-4716541/v1","draftVersion":[],"editorialEvents":[{"content":"https://doi.org/10.1007/s00477-025-02921-5","type":"published","date":"2025-02-20T15:56:53+00:00"}],"editorialNote":"","failedWorkflow":false,"files":[{"id":61757229,"identity":"fe6d24f5-d11d-40b0-8637-c11703672c0e","added_by":"auto","created_at":"2024-08-05 08:48:12","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":117056,"visible":true,"origin":"","legend":"\u003cp\u003ePlot of original data and decomposed signals\u003c/p\u003e","description":"","filename":"1.png","url":"https://assets-eu.researchsquare.com/files/rs-4716541/v1/e97f521b8a255738f4602bf6.png"},{"id":61758779,"identity":"0a3d5976-b3cb-483c-a983-74c8259300ad","added_by":"auto","created_at":"2024-08-05 09:04:12","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":51766,"visible":true,"origin":"","legend":"\u003cp\u003eFourier spectra of original data and decomposed signals\u003c/p\u003e","description":"","filename":"2.png","url":"https://assets-eu.researchsquare.com/files/rs-4716541/v1/65e5bd98b85d8f2dbe747731.png"},{"id":61758170,"identity":"6a6b3d58-a211-4638-ba83-853652611cae","added_by":"auto","created_at":"2024-08-05 08:56:12","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":201542,"visible":true,"origin":"","legend":"\u003cp\u003eSimulation results with Random Forest and VMD-Random Forest\u003c/p\u003e","description":"","filename":"3.png","url":"https://assets-eu.researchsquare.com/files/rs-4716541/v1/3e63d900c659e22139ef5bb1.png"},{"id":61758172,"identity":"ae4d749a-6ce8-4951-b48f-80b11ddaa1f4","added_by":"auto","created_at":"2024-08-05 08:56:12","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":149813,"visible":true,"origin":"","legend":"\u003cp\u003eScatter plot of predicted and true values for Random Forest and VMD- Random Forest\u003c/p\u003e","description":"","filename":"4.png","url":"https://assets-eu.researchsquare.com/files/rs-4716541/v1/806e97e3b495ceaee389e06c.png"},{"id":61757230,"identity":"4e94ae06-3208-42f7-aa50-ab6d41e786d9","added_by":"auto","created_at":"2024-08-05 08:48:12","extension":"png","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":200355,"visible":true,"origin":"","legend":"\u003cp\u003eSimulation results with GRU and VMD-GRU\u003c/p\u003e","description":"","filename":"5.png","url":"https://assets-eu.researchsquare.com/files/rs-4716541/v1/43264b6e5b2570f9dbca406b.png"},{"id":61757233,"identity":"e98daf75-114c-48f0-9668-16c210447b54","added_by":"auto","created_at":"2024-08-05 08:48:12","extension":"png","order_by":6,"title":"Figure 6","display":"","copyAsset":false,"role":"figure","size":143755,"visible":true,"origin":"","legend":"\u003cp\u003eScatter plot of predicted and true values for GRU and VMD- GRU\u003c/p\u003e","description":"","filename":"6.png","url":"https://assets-eu.researchsquare.com/files/rs-4716541/v1/1774ff1d53edc78f94e3612c.png"},{"id":61757236,"identity":"0af4e581-ea67-4da1-92eb-b6c62e4a97ad","added_by":"auto","created_at":"2024-08-05 08:48:12","extension":"png","order_by":7,"title":"Figure 7","display":"","copyAsset":false,"role":"figure","size":184462,"visible":true,"origin":"","legend":"\u003cp\u003eSimulation results with LSTM and VMD-LSTM\u003c/p\u003e","description":"","filename":"7.png","url":"https://assets-eu.researchsquare.com/files/rs-4716541/v1/c317e54439c7d9af58965c61.png"},{"id":61757235,"identity":"f5deb2be-4538-4629-863e-0f22e970c3f4","added_by":"auto","created_at":"2024-08-05 08:48:12","extension":"png","order_by":8,"title":"Figure 8","display":"","copyAsset":false,"role":"figure","size":144117,"visible":true,"origin":"","legend":"\u003cp\u003eScatter plot of predicted and true values for LSTM and VMD- LSTM\u003c/p\u003e","description":"","filename":"8.png","url":"https://assets-eu.researchsquare.com/files/rs-4716541/v1/9f1f5cf8c240bbc007c6284e.png"},{"id":77052462,"identity":"c42df26d-ca5a-4c72-8480-b7205e810318","added_by":"auto","created_at":"2025-02-24 16:05:37","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":1718466,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-4716541/v1/3ae31ecd-1bd8-4a0d-a50f-fd8db9a96796.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"Driving analysis and prediction of COD based on frequency division","fulltext":[{"header":"1 Introduction","content":"\u003cp\u003eCOD is considered an important indicator for characterizing the presence of organic pollutants in the water environment. The higher COD value indicates more severe pollution by organic pollutants in the water. Organic pollutants are often transferred to water bodies through atmospheric dry and wet deposition, agricultural runoff and wastewater discharge (Arfaeinia et al. \u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e2017\u003c/span\u003e). Bioaccumulation may be caused by these pollutants when they are deposited into the atmosphere and enter water bodies, posing potential toxicity to both biota and humans (Agrell et al. \u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2002\u003c/span\u003e). A large amount of organic substances have been found to exist in the aqueous phase or particle extract during rainstorm sampling, expanding the scope of pollution as they migrate with the runoff (Glaser et al. \u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). A study examining organic compounds in 38 rivers of the United States were examined, which showed the 10 most widespread anthropogenic organisms are strongly associated with agricultural runoff (Bradley et al. \u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e2017\u003c/span\u003e). By time-of-flight mass spectrometry (GC \u0026times; GC-TOF-MS) combined with two-dimensional gas chromatography, organic pollutants in Yongding River basin were analyzed. It was found that the types of organic pollutants in downstream rivers were relatively similar to those in the effluent of sewage treatment plants (Jiang et al. \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e2023\u003c/span\u003e), which showed that wastewater treatment plant effluents are a non-negligible source of organic pollutants in surface runoff. A survey of 46 wastewater treatment plant effluents in China were found to contain 302 trace organic pollutants, accounting for 59 percent of the total organic matter (Liu et al. \u003cspan citationid=\"CR26\" class=\"CitationRef\"\u003e2024\u003c/span\u003e). Organic pollutants pose a serious threat to the environment due to their toxic characteristics over prolonged period (Kumar et al. \u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e2022\u003c/span\u003e). Organic pollutants can suffocate aquatic organisms by disrupting photosynthesis of aquatic plants and leading to significant reduction in water quality which can indirectly affect human health (Muralikrishna and Manickam \u003cspan citationid=\"CR30\" class=\"CitationRef\"\u003e2017\u003c/span\u003e). Mussels farmed in the coast of the Gulf of Naples and Domitio had been found to pose a high food safety risk in terms of heavy metals and persistent organic pollutants (POPs) (Esposito et al. \u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e2020\u003c/span\u003e). POPs pose a greater risk in these environments because they are difficult to be broken down and can easy to be accumulated in the air, water, soil, and fat cells of animals (Shetty et al. \u003cspan citationid=\"CR33\" class=\"CitationRef\"\u003e2023\u003c/span\u003e; Zaynab et al. \u003cspan citationid=\"CR44\" class=\"CitationRef\"\u003e2021\u003c/span\u003e; Lan et al. \u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e2024\u003c/span\u003e; Mallah et al. \u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e2022\u003c/span\u003e; Yua et al. 2018). Due to the toxicity of some organic pollutants to human health, its characteristics and toxicity have become the focus of global attention (Li et al. \u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e2017\u003c/span\u003e). Researches by scholars not only focus on individual organic pollutants but also emphasize the measurement of organic content. Measuring and analyzing COD content has also been a hot topic of research. In China, a lot of work has been taken out to measure, analyze and control COD. But there are still many challenges (Li et al. \u003cspan citationid=\"CR24\" class=\"CitationRef\"\u003e2012\u003c/span\u003e). Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e shows the status of COD emission and control in China.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab1\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 1\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eStatus of COD emission and control in China\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"2\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eItems\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eChina\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eFormulation\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eMainly national standards, A small quantity local standards\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eImplementation\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e1) National standard specified uniform limits for similar dischargers\u003c/p\u003e \u003cp\u003e2) The control items are mainly centralized control. The total quantity control is comprehensively implemented.\u003c/p\u003e \u003cp\u003e3) Local standards outperform national standards in execution\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eControl level\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e1) Compared to the lower treatment rate of ammonia nitrogen, COD treatment rate reaches over 90%.\u003c/p\u003e \u003cp\u003e2) The monitoring and supervision system is largely established, but the development of the shared system is uneven.\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\" morerows=\"1\" rowspan=\"2\"\u003e \u003cp\u003eLimits of major pollutants\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e1)National standard: Standard A: 50mg/L, Standard B: 60mg/L.\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e2)Sichuan Province Standards: Urban sewage treatment plant༚30mg/L\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003eThe continuous determination of COD with long time series can be helpful to obtain a lot of necessary data for the analysis and prediction of organic matter in watershed. In order to achieve this goal, many methods for COD measurement were developed continuously and the regional, conventional and continuous monitoring methods were gradually realized. Traditional methods for COD measurement, such as potassium dichromate, potassium permanganate, and UV-vis spectroscopy, Ultraviolet-visible (UV-vis) spectroscopy combined with stoichiometric tools was used to determine COD content. This method is particularly suitable for real-time and rapid determinations while a large number of sample data is required for modeling (Li et al. \u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e2018\u003c/span\u003e). Carbon point fluorescent capillary sensors are also used to determine COD. This method has the advantages of low detection limit, good linear range and reliability, portability and low cost. But it is easy to be limited by interference protection and environmental impact (Zhang et al. \u003cspan citationid=\"CR42\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). A nano-lead dioxide-composite electrochemical sensor has been used for COD determination offering fast response times, simple instrumentation, low cost, high detection sensitivity and wide linear range. However, sampling limitations and sensor stability can pose challenges (Wang et al. \u003cspan citationid=\"CR37\" class=\"CitationRef\"\u003e2022\u003c/span\u003e). Currently, COD monitoring has been further developed to automatic monitoring. The CL system with flow injection was applied for COD determination, which has the obvious advantages of much shorter analysis time, simple operation except for a lower detection rate (Li et al. \u003cspan citationid=\"CR20\" class=\"CitationRef\"\u003e2003\u003c/span\u003e). A new reagent-free method for measuring chemical oxygen demand (COD) was proposed based on ultraviolet absorption spectroscopy (UV-AS) which enables high-precision and long-range COD measurement by automatically selecting the wavelength and analyzing the full spectrum of the data. This method can therefore be used for in situ and online environmental monitoring (Wang et al. \u003cspan citationid=\"CR34\" class=\"CitationRef\"\u003e2019\u003c/span\u003e). The development of automated monitoring has significantly improved the efficiency and frequency of COD monitoring. It can realize the simultaneous determination of COD over a wide area and provide technical and methodological support for regional COD monitoring research.\u003c/p\u003e \u003cp\u003eTo better understand the changing patterns and drivers of organic matter in water bodies, a variety of models have been established by scientists to analyze and predict COD. These models input actual monitoring data and predicted values of COD. The dynamics of COD in water bodies are studied to analyze and manage the flow patterns and long-term series of organic matter. The removal of COD from polluted solutions has been simulated and predicted using artificial neural networks (ANN) (Elmolla et al. \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e2010\u003c/span\u003e; Masouleh et al. \u003cspan citationid=\"CR28\" class=\"CitationRef\"\u003e2022\u003c/span\u003e). One main advantage of ANN is its ability to handle complex relationships between input and output variables (Ataei et al. 2021). However, ANN techniques require large datasets to train the model and are computationally expensive (Khanmohammadi et al. \u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e2024\u003c/span\u003e). Faster convergence, higher convergence accuracy, and better pattern recognition are achieved by recurrent neural networks (RNN) compared to ANN (Al-Qaili et al. \u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e2024\u003c/span\u003e; Gholami et al. \u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). RNNs are more suitable for time series prediction. However, RNNs suffer from the problem of long-term dependence, i.e., gradient vanishing and gradient explosion will be met when RNNs learns long sequences, which causes its difficulty to understand nonlinear relationships for long periods of time (Hochreiter \u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e1991\u003c/span\u003e). Therefore, the use of convolutional neural networks (CNNs) was investigated by some researchers for efficient feature extraction to extract data features. A correlation between COD concentration and spectral reflectance in urban rivers was found and COD concentration was accurately predicted by using a one-dimensional convolutional neural network (1D-CNN) (Cai et al. \u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e2022\u003c/span\u003e). The Long Short-Term Memory (LSTM) network, introduced by Hochreiter and Schmidhuber (\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e1997\u003c/span\u003e), is an extension of the RNN. Long-term dependencies can be learned by LSTMs, avoiding the exploding or vanishing gradient problem that affects traditional RNNs (Xu et al. \u003cspan citationid=\"CR39\" class=\"CitationRef\"\u003e2020\u003c/span\u003e). For monitoring and predicting the performance of COD in wastewater treatment, a novel LSTM-based soft sensor was developed (Xu et al. \u003cspan citationid=\"CR38\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). The LSTM method can mine the potential information between different water quality indicators at different time scales improving prediction accuracy. Major pollutants such as Biological Oxygen Demand(BOD), Chemical Oxygen Demand(COD), Total Nitrogen(TN), Total Phosphorus(TP), and Ammonia Nitrogen(AN)were predicted and demonstrated to have a high degree of correlation with each other using an integrated multivariate LSTM network(Wang, Xue et al. \u003cspan citationid=\"CR36\" class=\"CitationRef\"\u003e2024\u003c/span\u003e). Although LSTMs can effectively process and predict events with long intervals and delays (Yousfi et al. 2017). A large number of parameters was needed and low convergence rate is existed. For this reason, GRU based LSTM was proposed (Cho et al. \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e2014\u003c/span\u003e)to simplify the internal cell architecture of LSTM and reduce the network training time in order to guarantee the prediction accuracy. Combining machine learning with sensor networks, multiple machine learning algorithms are employed to predict COD emissions. Comparative results showed that GRU is better than LSTM (Miao et al. \u003cspan citationid=\"CR29\" class=\"CitationRef\"\u003e2021\u003c/span\u003e).\u003c/p\u003e \u003cp\u003ePrediction results can be influenced by the model characteristics themselves. How to improve their performance has become a necessary research topic nowadays. Time-frequency domain transformation technique was tried to be combined with some models to enhance the simulation effects. A complex signal can be decomposed into a series of intrinsic modal functions (IMF) with different frequencies and amplitudes by frequency division. the complexity and strong nonlinearities in a time series can be effectively reduce while obtaining a relatively stable subsequence that contains several different frequency scales. This approach extracts the main data components, and representing them as subsequences with different frequency features, thus optimizing the simulation effect of some intelligent models. The Variational Mode Decomposition (VMD), an adaptive and fully non-recursive signal processing method, was proposed in 2014(Dragomiretskiy and Zosso \u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e2014\u003c/span\u003e) with the advantage of determining the number of mode decompositions. The VMD was used to decompose river flow into IMF, and the mixed models RVFL_VMD, GRNN_VMD and RBFNN_VMD were established to predict river turbidity. The best performance was achieved by RVFL_VMD on the hourly time scale, while GRNN_VMD provided the best prediction on the daily time scale (Heddam et al. 2022). VMD can improve the accuracy of power consumption forecasts. The combination of Bi-directional Gated Recurrent Unit (BiGRU) and LSTM models with VMD showed excellent prediction accuracy on various assessment metrics, especially for short to medium term predictions (Ahmed et al. \u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). The VMD method can be used to study the periodicity of vegetation and its relationship with climate (Wang et al. \u003cspan citationid=\"CR35\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). A new wind power forecasting method called IVMD combining VMD and HFCM was developed to achieve more accurate wind power generation prediction and reduce the prediction error by extracting time series features and learning the weights using Bayesian ridge regression method (Qiao et al. \u003cspan citationid=\"CR32\" class=\"CitationRef\"\u003e2022\u003c/span\u003e). The spectral characteristics of VMD can be used to sort iron ore in hyperspectral images (Nie et al. \u003cspan citationid=\"CR31\" class=\"CitationRef\"\u003e2023\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eIn this paper, taking the rivers in Chengdu area of China were selected as the research focus, with COD as the primary research index. The frequency division technique of VMD was applied to process the raw COD data. Moreover, three models \u0026mdash; Random Forest, LSTM and GRU were integrated with VMD to develop an optimized model framework. The research aimed to find a new method was hoped to be found for the prediction and analysis of organic matter in the Chengdu watershed.\u003c/p\u003e"},{"header":"2 Methodology","content":"\u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003e2.1 Used data parameters\u003c/h2\u003e \u003cp\u003eData parameters include water quality data and meteorological data.\u003c/p\u003e \u003cp\u003eWater quality data includes WT (Water Temperature), PH (pH Value), DO (Dissolved Oxygen), COD\u003csub\u003eMn\u003c/sub\u003e (Potassium Permanganate Index), NH\u003csub\u003e3\u003c/sub\u003e-N (Ammonia Nitrogen), TP (Total Phosphorus), TN (Total Nitrogen), EC (Electric Conductivity) and NTU (Turbidity). Data from 2020 to 2022 were obtained from daily monitoring of state-controlled cross sections. Monitoring was conducted six times a day at 0:00, 4:00, 8:00, 12:00, 16:00 and 20:00.\u003c/p\u003e \u003cp\u003eMeteorological data includes T (Mean Air Temperature), MT (Maximum Air Temperature), LT (Minimum Air Temperature), BP (Mean Air Pressure), MBP (Maximum Air Pressure), LBP (Minimum Air Pressure), RH (Relative Humidity), LH (Minimum Relative Humidity) and P (Precipitation). The daily meteorological data from 2020 to 2022 were obtained from two meteorological stations of Chengdu area, monitoring once a day.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec4\" class=\"Section2\"\u003e \u003ch2\u003e2.2 Data processing\u003c/h2\u003e \u003cp\u003e(1) Missing values: A small amount of missing data was found in COD\u003csub\u003eMn\u003c/sub\u003e, NH\u003csub\u003e3\u003c/sub\u003e-N, TP and TN. Polynomial interpolation was used to make up the missing values.\u003c/p\u003e \u003cp\u003e(2) Anomalous values: A small amount of anomalous data was found in EC, NTU, P, LH, RH and LT. The anomalous values were filled in by polynomial interpolation.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec5\" class=\"Section2\"\u003e \u003ch2\u003e2.3Variational mode decomposition\u003c/h2\u003e \u003cp\u003eIf the COD (Chemical Oxygen Demand) in the river water corresponds to an approximate function f (t) and it is the sum of the set of k (i.e., IMF) components, the constraint problem is expressed ass (Ling et al. \u003cspan citationid=\"CR25\" class=\"CitationRef\"\u003e2021\u003c/span\u003e; Li et al. \u003cspan citationid=\"CR23\" class=\"CitationRef\"\u003e2021\u003c/span\u003eb; Xu et al. \u003cspan citationid=\"CR39\" class=\"CitationRef\"\u003e2020\u003c/span\u003e):\u003cdiv id=\"Equ1\" class=\"Equation\"\u003e\u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Equ1\" name=\"EquationSource\"\u003e\n$$\\:\\begin{array}{rr}\u0026amp;\\:\\underset{{u}_{k},{\\omega\\:}_{k}}{\\text{m}\\text{i}\\text{n}}\\left\\{\\sum\\:_{k=1}^{K}{∥{\\partial\\:}_{t}\\left[\\left(\\delta\\:\\left(t\\right)+\\frac{j}{\\pi\\:t}\\right)\\text{*}{\\mu\\:}_{k}\\left(t\\right)\\right]{e}^{-j{\\omega\\:}_{k}\\left(t\\right)}∥}_{2}^{2}\\right.\\\\\\:\u0026amp;\\:such.that.\\sum\\:_{k=1}^{K}{\\mu\\:}_{k}=f\\end{array}$$\u003c/div\u003e\u003cdiv class=\"EquationNumber\"\u003e1\u003c/div\u003e\u003c/div\u003e\u003c/p\u003e \u003cp\u003ewhere (t) is the time, δ (t) corresponds to the impulse function, i.e., the Dirac distribution and {\u003cem\u003eu\u003c/em\u003e\u003csub\u003e\u003cem\u003ek\u003c/em\u003e​\u003c/sub\u003e (t)\u0026frac14; {\u003cem\u003eu\u003c/em\u003e\u003csub\u003e1\u003c/sub\u003e (t), \u003cem\u003eu\u003c/em\u003e\u003csub\u003e2\u003c/sub\u003e (t),\u0026hellip;, \u003cem\u003eu\u003c/em\u003e\u003csub\u003e\u003cem\u003ek\u003c/em\u003e​\u003c/sub\u003e (t)} is the series of the decomposed K Intrinsic Mode Functions (IMF) modes, and the set of the center frequencies of each IMF is given as {ω\u003csub\u003ek\u003c/sub\u003e (t)\u0026frac14;{ω\u003csub\u003e1\u003c/sub\u003e (t), ω\u003csub\u003e2\u003c/sub\u003e (t),\u0026hellip;,ω\u003csub\u003ek\u003c/sub\u003e (t)} (Xu et al., \u003cspan citationid=\"CR39\" class=\"CitationRef\"\u003e2020\u003c/span\u003e). \u003cem\u003ef\u003c/em\u003e is a given constant.\u003c/p\u003e \u003c/div\u003e"},{"header":"3 Results and Discussion","content":"\u003cdiv id=\"Sec7\" class=\"Section2\"\u003e \u003ch2\u003e3.1 VMD decomposition results\u003c/h2\u003e \u003cp\u003eFigure \u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003e shows the original data and the decomposed IMF1 to IMF5 signals from low to high frequencies. IMF1 usually corresponds to the low-frequency component of the data and represents the long-term trend. IMF2 to IMF5 representing the short-term abrupt changes in the data, are usually correspond to rapid changes, fluctuations or transient features in the data.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eFigure \u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003e shows the Fourier spectrum of the original data and the decomposed signals. The distribution of the signal in the frequency domain in Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003e indicates the intensity or amplitude of the different frequency components contained in the signal. Each wave peak corresponds to the intensity or energy of a particular frequency component. The higher the crest, the stronger the energy of the corresponding frequency component, indicating the greater the contribution of that frequency component in the signal. As can be seen from Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003e, there is basically no frequency entanglement between the decomposed signals.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec8\" class=\"Section2\"\u003e \u003ch2\u003e3.2Driving analysis of COD change\u003c/h2\u003e \u003cp\u003ePearson's correlation analysis was used to analyze the correlation between variables driving the COD change and COD content. Positive value of the correlation coefficient indicates positive correlation, negative value indicates negative correlation. The larger the absolute value of the value, the stronger the correlation. The correlation judgment criteria are shown in Table\u0026nbsp;\u003cspan refid=\"Tab2\" class=\"InternalRef\"\u003e2\u003c/span\u003e(Jia 2018). The correlation values of COD\u003csub\u003eMn\u003c/sub\u003e with water quality data and hydrological data in the original data and the decomposed signals IMF1-IMF5 are shown in Table\u0026nbsp;\u003cspan refid=\"Tab3\" class=\"InternalRef\"\u003e3\u003c/span\u003e.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab2\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 2\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eCorrelation criterion (Jia et al. 2018)\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"2\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eAbsolute value of the correlation value\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eRelevance\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e0.0-0.3\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eNo relevant\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e0.3\u0026ndash;0.8\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003erelatively strong correlated\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e0.8-1.0\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eStrong correlation\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab3\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 3\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eCorrelation between the original data, decomposed signals and COD\u003csub\u003eMn\u003c/sub\u003e.\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"7\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c5\" colnum=\"5\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c6\" colnum=\"6\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c7\" colnum=\"7\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eFactor\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eOriginal Data\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eIMF1\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eIMF2\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c5\"\u003e \u003cp\u003eIMF3\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c6\"\u003e \u003cp\u003eIMF4\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c7\"\u003e \u003cp\u003eIMF5\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eWT\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.28\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.38\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.11\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.00\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e0.00\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e0.01\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003ePH\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e-0.44\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e-0.41\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e-0.58\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e-0.01\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e-0.00\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e-0.02\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eDO\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e-0.57\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e-0.60\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e-0.33\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e-0.01\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e-0.00\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e-0.01\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eNH\u003csub\u003e3\u003c/sub\u003e -N\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.57\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.55\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.42\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.01\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e0.01\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e0.04\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eTP\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.69\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.70\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.71\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.02\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e0.10\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e0.45\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eTN\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.55\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.54\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.34\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.02\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e0.05\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e0.34\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eEC\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.20\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.26\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.04\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e-0.00\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e0.00\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e0.01\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eNTU\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.30\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.32\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.17\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.01\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e0.02\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e0.09\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eT\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.22\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.36\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e-0.19\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e-0.14\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e0.00\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e0.00\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eMT\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.20\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.37\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e-0.33\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e-0.20\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e0.01\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e0.00\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eBP\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e-0.25\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e-0.41\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.11\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.01\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e0.00\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e0.01\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eMBP\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e-0.24\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e-0.41\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.12\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.01\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e0.00\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e0.01\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eLBP\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e-0.26\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e-0.44\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.12\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.01\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e0.00\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e0.01\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eRH\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.03\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e-0.10\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.15\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e-0.00\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e-0.00\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e-0.00\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eLH\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.03\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e-0.11\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.19\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e-0.02\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e-0.01\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e-0.00\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eP\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.15\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.13\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.28\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.08\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e-0.01\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e0.00\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003eIn the long term, the absolute correlation value between TP, DO, NH\u003csub\u003e3\u003c/sub\u003e -N, TN, LBP, PH, BP, MBP, WT, T, NTU and COD is between 0.3\u0026ndash;0.6, indicating a relatively strong correlation. Among these, the correlation values for range from 0.54 to 0.70, showing that COD content is more influenced by water quality factors than by climate factors. Nitrogen and phosphorus are the main indicators of eutrophication in water bodies, suggesting that long-term changes in COD are strongly positively correlated with eutrophication. That is, the more serious the pollution of organic matter in water bodies, the more serious the eutrophication of water bodies. The change in DO also positively correlates with eutrophication, further verifying the strong correlations between eutrophication and COD. When there is an excess input of nutrients (e.g., nitrogen, phosphorus, etc.) to a water body, eutrophic substances increase the concentration of COD and cause other changes. Eutrophic waters usually contain more organic waste and organic material, resulting in an increase in COD\u003csub\u003eMn\u003c/sub\u003e content in the water bodies as well. This relation between eutrophic substances with COD provides a simplified idea for analyzing the long-term pollution characteristics of water bodies in Chengdu. When data on nitrogen, phosphorus, or COD are missing or abnormal, the strong correlations among these indicators can be used to infer the missing values. Characteristics changes of one indicator can also be used to infer the changes in other indicators.\u003c/p\u003e \u003cp\u003eFor the relatively longer short-term change characteristics of COD, the eutrophication index nitrogen and phosphorus and PH, DO, MT still have some influence. But except the nitrogen and phosphorus, the influence of other factors basically disappeared with the gradual elimination of the signals of the influencing factors in the short time series. It is reasonable to speculate that short-term changes in COD are largely unaffected by water and climatic parameters, demonstrating the relative stability of the instantaneous characteristics of COD.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec9\" class=\"Section2\"\u003e \u003ch2\u003e3.3 Simulation results with or without frequency division\u003c/h2\u003e \u003cp\u003eFirstly, Random Forest, LSTM, and GRU models were used to predict COD with the original data. Then, the original data was decomposed, and the signals with a correlation of less than 0.3 with the predicted target COD were eliminated from the variable modal decomposition signals. Thus, Random Forest, LSTM, and GRU models were used again to predict COD with the filtered signal data. The predicted results with and without frequency division were compared by two methods. One is Scatter plots of the predicted value and the true values, the other is three kinds of error values, namely mean absolute error (MAE), root mean square error (RMSE) and symmetric mean absolute percentage error (SMAPE).\u003c/p\u003e \u003cdiv id=\"Sec10\" class=\"Section3\"\u003e \u003ch2\u003e3.3.1 Simulation results with Random Forest and VMD-Random Forest\u003c/h2\u003e \u003cp\u003eThe prediction results with the Random Forest and VMD-Random Forest model were shown in Fig.\u0026nbsp;3.\u003c/p\u003e\u003cp\u003eThe scatter plot showing the fitting degree between the true values and the predicted values was shown in Fig.\u0026nbsp;4. The simulation effect before and after frequency division was evaluated by mean absolute error (MAE), root mean square error (RMSE), and symmetric mean absolute percentage error (SMAPE) were shown in Table\u0026nbsp;\u003cspan refid=\"Tab4\" class=\"InternalRef\"\u003e4\u003c/span\u003e.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab4\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 4\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eQuantitative metrics for Random Forest and VMD- Random Forest\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"4\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eMODEL\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eRMSE\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eMAE\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eSMAPE\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eRandom Forest\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.397\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.285\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.14\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eVMD-Random Forest\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.293\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.224\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.104\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003eBy Fig.\u0026nbsp;4 and Table\u0026nbsp;\u003cspan refid=\"Tab4\" class=\"InternalRef\"\u003e4\u003c/span\u003e, it can be concluded that the simulation effect is improved because of VMD processing. The improvement rates are 26.20%, 21.40%, and 25.71%, respectively. It can be found that VMD can improve the simulation effect of Random Forest to some extent.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec11\" class=\"Section3\"\u003e \u003ch2\u003e3.3.2 Simulation results with GRU and VMD-GRU\u003c/h2\u003e \u003cp\u003eThe simulation results with GRU and VMD-GRU models are shown in Fig.\u0026nbsp;5.\u003c/p\u003e \u003cp\u003eScatter plot was shown in Fig.\u0026nbsp;6. The values of MAE, RMSE and SMAPE were shown in Table\u0026nbsp;\u003cspan refid=\"Tab5\" class=\"InternalRef\"\u003e5\u003c/span\u003e.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab5\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 5\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eQuantitative metrics for GRU and VMD-GRU\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"4\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eMODEL\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eRMSE\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eMAE\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eSMAPE\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eGRU\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.502\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.395\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.175\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eVMD-GRU\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.14\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.098\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.046\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003eAfter the original data was processed by VMD, the values of RMSE, MAE and SMAPE decreased, and the degree of dispersion shown in Fig.\u0026nbsp;6 was reduced. This indicates that VMD can significantly improve the prediction accuracy of the GRU model. The simulation performance of GRU was improved by 72.11%, 75.19%, and 73.14% respectively. This demonstrates that VMD frequency division can more effectively enhance the GRU model's ability to understand and capture data by eliminating interference factors, thereby improving the simulation effect of the GRU.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec12\" class=\"Section3\"\u003e \u003ch2\u003e3.3.3 Simulation results with LSTM and VMD-LSTM\u003c/h2\u003e \u003cp\u003eThe simulation results with LSTM and VMD-LSTM were shown in Fig.\u0026nbsp;7.\u003c/p\u003e \u003cp\u003eScatter plot was shown in Fig.\u0026nbsp;8. The error values of MAE, RMSE and SMAPE were shown in Table\u0026nbsp;\u003cspan refid=\"Tab6\" class=\"InternalRef\"\u003e6\u003c/span\u003e.\u003c/p\u003e\u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab6\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 6\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eQuantitative metrics for the LSTM and VMD-LSTM\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"5\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c5\" colnum=\"5\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eMODEL\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eRMSE\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e\u0026nbsp;\u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eMAE\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c5\"\u003e \u003cp\u003eSMAPE\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eLSTM\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.772\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e\u0026nbsp;\u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.578\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.24\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eVMD-LSTM\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.205\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e\u0026nbsp;\u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.156\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.073\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003eThe values of LSTM RMSE, MAE and SMAPE reduced after VMD processing, indicating that VMD-LSTM model achieves lower error and higher accuracy compared to the original LSTM model. VMD frequency division significantly improves the prediction performance. The improvement rate of VMD to LSTM model is 73.45%, 73.01%, and 69.58% respectively.\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e"},{"header":"4 Conclusions","content":"\u003cp\u003eCOD is often used as an important indicator to evaluate the organic pollution in water environments. Although there have been many researches on water quality prediction models, the prediction accuracy of these models using raw data often falls short of ideal results. This is because water quality is affected by many factors from weather, human activities, and the water environment. There are complex interaction relations among these factors, which is difficult to be captured by traditional water quality prediction models so as to reduce the prediction accuracy of the model.\u003c/p\u003e \u003cp\u003eIn this paper, a new COD prediction method was proposed. Taking water system in Chengdu area of China as the research object, a kind of frequency division method, VMD method was tried to complete the time domain decomposition of COD data before model simulation. the frequency division method was used to separate the COD original data into signals of different frequency bands, namely IMF1, IMF2, IMF3, IMF4 and IMF5. Using these different signal data, the influence of COD on meteorological factors and water quality factors was explored. Based on the correlation between signals of different frequency bands and COD, it is found that the long-term COD content can be affected by eutrophication factors such as phosphorus and nitrogen relatively easily. The short-term immediate change characteristics of COD are relatively stable and are less affected by any water quality factors and meteorological factors.\u003c/p\u003e \u003cp\u003eUsing the original data and the signal data without weak correlated factors, the simulation effects of GRU, LSTM and Random Forest models on COD were compared. By the frequency division, the original data was decomposed to signals with different frequency, and the signals with a correlation of less than 0.3 were eliminated further from the variable modal decomposition signals. Removing the interference factors of COD is conducive to improving the simulation effect. The simulation results verified this. When combining Random Forest, GRU or LSTM with variational mode decomposition in COD prediction, the frequency division is helpful to improve the stability and accuracy of the simulation of these models. It has better promotion effects on GRU and LSTM with higher improvement rates. The improvement rate can even amount to more than 70% after VMD processing, especially for LSTM and GRU simulation. The VMD-GRU model and the VMD-LSTM model can be used as more reliable tools for water quality analyzation and management in Chengdu area.\u003c/p\u003e \u003cp\u003eThis simulation method constructed in this paper can also be considered for exploring other water environment indicators in different regions, potentially providing a more reliable tool for water quality analysis and management.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eFunding\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThis work was supported by [the National Natural Science Foundation] (Grant numbers [52050410328] and [62266014]) and [the Guangxi Natural Science Foundation] (Grant number 2021GXNSFAA220056).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAuthor Contributions\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003e[Mei Li] was involved in the initial draft and editing; [Kexing Chen] contributed to the investigation and methodology; [Deke Wang] was responsible for data curation; [Rui Xu] was involved in the review and overall supervision.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eData Availability Statement\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eSome or all data, models, or code that support the findings of this study are available from the corresponding author upon reasonable request: (1) original data of COD, (2) models, and (3) codes.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConflict of interest\u003c/strong\u003e\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eThe authors declare they have no conflict of interest.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003eAhmed AAM, Bailek N, Abualigah L, Bouchouicha K, Kuriqi A, Sharifi A, Sareh P (2023) Global control of electrical supply: A variational mode decomposition- aided deep learning model forenergy consumption prediction. Energy Rep 10:2152\u0026ndash;2165. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.egyr.2023.08.076\u003c/span\u003e\u003cspan address=\"10.1016/j.egyr.2023.08.076\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAgrell C, Larsson P, Okla L, Agrell J (2002) PCB congeners in precipitation, wash out ratios and depositional fluxes within the Baltic Sea region, Europe. Atmos Environ 36(2):371\u0026ndash;383. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/S1352-2310(01)00228-X\u003c/span\u003e\u003cspan address=\"10.1016/S1352-2310(01)00228-X\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAl-Qaili AH, Al-Mansour AI, Al-Solieman H, AlSharabi K (2024) RNN-based pavement moduli prediction for flexible pavement design enhancement. Case Stud Constr Mater 20. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.cscm.2023.e02811\u003c/span\u003e\u003cspan address=\"10.1016/j.cscm.2023.e02811\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eArfaeinia H, Asadgol Z, Ahmadi E, Seifi M, Moradi M, Dobaradaran S (2017) Characteristics, distribution and sources of polychlorinated biphenyls (PCBs) in coastal sediments from the heavily industrialized area of Asalouyeh, Iran. Water Sci Technol 76(11\u0026ndash;12):3340\u0026ndash;3350. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.2166/wst.2017.500\u003c/span\u003e\u003cspan address=\"10.2166/wst.2017.500\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBradley PM, Journey CA, Romanok KM, Barber LB, Buxton HT, Foreman WT, Furlong ET, Glassmeyer ST, Hladik ML (2017) Expanded Target-Chemical Analysis Reveals Extensive Mixed-Organic-Contaminant Exposure in U.S. Streams. Environ Sci Technol 51(9):4792\u0026ndash;4802. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1021/acs.est.7b00012\u003c/span\u003e\u003cspan address=\"10.1021/acs.est.7b00012\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCai J, Meng L, Liu H, Chen J, Xing Q (2022) Estimating Chemical Oxygen Demand in estuarine urban rivers using unmanned aerial vehicle hyperspectral images. Ecol Indic 139:108936. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.ecolind.2022.108936\u003c/span\u003e\u003cspan address=\"10.1016/j.ecolind.2022.108936\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCho K, van Merrienboer B, Bahdanau D, Bengio Y (2014) On the Properties of Neural Machine Translation: Encoder-Decoder Approaches. In Proceedings of SSST@ EMNLP (pp. 103\u0026ndash;111). \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.jconhyd.2023.104287\u003c/span\u003e\u003cspan address=\"10.1016/j.jconhyd.2023.104287\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDragomiretskiy K, Zosso D (2014) Variational Mode Decomposition. IEEE Trans Signal Process 62(3):531\u0026ndash;544. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1109/TSP.2013.2288675\u003c/span\u003e\u003cspan address=\"10.1109/TSP.2013.2288675\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eElmolla ES, Chaudhuri M, Eltoukhy MM (2010) The use of artificial neural network (ANN) for modeling of COD removal from antibiotic aqueous solution by the Fenton process. J Hazard Mater 179(1):127\u0026ndash;134. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.jhazmat.2010.02.068\u003c/span\u003e\u003cspan address=\"10.1016/j.jhazmat.2010.02.068\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eEsposito M, Canzanella S, Lambiase S, Scaramuzzo A (2020) Organic pollutants (PCBs, PCDD/Fs, PAHs) and toxic metals in farmed mussels from the Gulf of Naples (Italy): Monitoring and human exposure. Reg Stud Mar Sci 40:101497. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.rsma.2020.101497\u003c/span\u003e\u003cspan address=\"10.1016/j.rsma.2020.101497\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGholami H, Mohammadifar A, Golzari S, Song Y (2023) Interpretability of simple RNN and GRU deep learning models used to map land susceptibility to gully erosion. Sci Total Environ 904:166960. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.scitotenv.2023.166960\u003c/span\u003e\u003cspan address=\"10.1016/j.scitotenv.2023.166960\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGlaser C, Escher BI, Engelhardt M, Liu Y, Krauss M (2023) Particle-associated organic contaminant and cytotoxicity transport in a river during storm events. J Hydrol 621:129600. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.jhydrol.2023.129600\u003c/span\u003e\u003cspan address=\"10.1016/j.jhydrol.2023.129600\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHochreiter S (1991) Investigations on dynamic neural networks. Dissertation, Johannes Kepler University, Linz\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHochreiter S, Schmidhuber J (1997) Long Short-Term Memory. Neural Comput 9(8):1735\u0026ndash;1780. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1162/neco.1997.9.8.1735\u003c/span\u003e\u003cspan address=\"10.1162/neco.1997.9.8.1735\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eJiang J, Zhao J, Zhao G, Liu L (2023) Recognition, possible source, and risk assessment of organic pollutants in surface water from the Yongding River Basin by non-target and target screening. Environ Pollut 331:121895. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.envpol.2023.121895\u003c/span\u003e\u003cspan address=\"10.1016/j.envpol.2023.121895\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKhanmohammadi M, Rahmani F, Shahrouzi JR, Sene RA (2024) Insightful properties-performance study of Ti\u0026ndash;Cu\u0026ndash;O heterojunction sonochemically embedded in mesoporous silica matrix for efficient tetracycline adsorption and photodegradation: RSM and ANN-based modeling and optimization. Chemosphere 141223. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.chemosphere.2024.141223\u003c/span\u003e\u003cspan address=\"10.1016/j.chemosphere.2024.141223\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKolb M, Bahadir M, Teichgr\u0026auml;ber B (2017) Determination of chemical oxygen demand (COD) using an alternative wet chemical method free of mercury and dichromate. Water Res 122:645\u0026ndash;654. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.watres.2017.06.034\u003c/span\u003e\u003cspan address=\"10.1016/j.watres.2017.06.034\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKumar R, Gupta K, Bordoloi N (2022) The potential of engineered endophytic bacteria to improve phytoremediation of organic pollutants. In: Advances in Microbe-assisted Phytoremediation of Polluted Sites, 477\u0026ndash;496. Elsevier. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/B978-0-12-823443-3.00014-4\u003c/span\u003e\u003cspan address=\"10.1016/B978-0-12-823443-3.00014-4\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLan Y, Gao X, Xu H, Li M (2024) 20 years of polybrominated diphenyl ethers on toxicity assessments. Water Res 249:121007. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.watres.2023.121007\u003c/span\u003e\u003cspan address=\"10.1016/j.watres.2023.121007\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLi B, Zhang Z, Wang J, Chunli (2003) Chemiluminescence system for automatic determination of chemical oxygen demand using flow injection analysis. Talanta 61(5):651\u0026ndash;658. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/S0039-9140(03)00326-6\u003c/span\u003e\u003cspan address=\"10.1016/S0039-9140(03)00326-6\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLi J, Li F, Liu Q (2017) PAHs behavior in surface water and groundwater of the Yellow River estuary: Evidence from isotopes and hydrochemistry. Chemosphere 178:143\u0026ndash;153. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/s11356-011-0572-7\u003c/span\u003e\u003cspan address=\"10.1007/s11356-011-0572-7\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLi J, Tong Y, Guan L, Wu S, Li D (2018) Optimization of COD determination by UV\u0026ndash;vis spectroscopy using PLS chemometrics algorithms. Optik 174:591\u0026ndash;599. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.ijleo.2018.08.111\u003c/span\u003e\u003cspan address=\"10.1016/j.ijleo.2018.08.111\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLi R, Chen X, Balezentis T, Streimikiene D, Niu Z (2021) Multi-step least squares support vector machine modeling approach for forecasting short-term electricity demand with application. Neural Comput Appl 33:301\u0026ndash;320. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/s00521-020-04996-3\u003c/span\u003e\u003cspan address=\"10.1007/s00521-020-04996-3\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLi WW, Sheng GP, Zeng RJ, Liu XW, Yu HQ (2012) China\u0026rsquo;s wastewater discharge standards in urbanization. Environ Sci Pollut Res 19(5):1422\u0026ndash;1431. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.chemosphere.2017.03.052\u003c/span\u003e\u003cspan address=\"10.1016/j.chemosphere.2017.03.052\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLing Q, Zhang Q, Zhang J, Kong L, Zhang W, Zhu L (2021) Prediction of landslide displacement using multikernel extreme learning machine and maximum information coefficient based on variational mode decomposition: a case study in Shaanxi, China. Nat Hazards 108(3):1\u0026ndash;22. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/s11069-021-04713-w\u003c/span\u003e\u003cspan address=\"10.1007/s11069-021-04713-w\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLiu J, Yang F, Cai Y, Lu G, Li Y, Li M, Fan L, Gao L (2024) Unveiling the existence and ecological hazards of trace organic pollutants in wastewater treatment plant effluents across China. Eco-Environment Health 3(1):21\u0026ndash;29. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.eehl.2023.09.006\u003c/span\u003e\u003cspan address=\"10.1016/j.eehl.2023.09.006\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMallah MA, Changxing L, Mallah MA, Noreen S, Liu Y (2022) Polycyclic aromatic hydrocarbon and its effects on human health: An overview. Chemosphere 296:133948. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.chemosphere.2022.133948\u003c/span\u003e\u003cspan address=\"10.1016/j.chemosphere.2022.133948\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMasouleh SY, Mozaffarian M, Dabir B, Ramezani SF (2022) COD and ammonia removal from landfill leachate by UV/PMS/Fe2\u0026thinsp;+\u0026thinsp;process: ANN/RSM modeling and optimization. Process Saf Environ Prot 159:716\u0026ndash;726. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.psep.2022.01.031\u003c/span\u003e\u003cspan address=\"10.1016/j.psep.2022.01.031\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMiao S, Zhou C, AlQahtani SA, Alrashoud M (2021) Applying machine learning in intelligent sewage treatment: A case study of chemical plant in sustainable cities. Sustain Cities Soc 72:103009. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.scs.2021.103009\u003c/span\u003e\u003cspan address=\"10.1016/j.scs.2021.103009\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMuralikrishna IV, Manickam V (2017) Chapter One - Introduction. In: Environmental Management, Muralikrishna IV, Manickam V (eds) Butterworth-Heinemann, pp 1\u0026ndash;4. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/B978-0-12-811989-1.00001-4\u003c/span\u003e\u003cspan address=\"10.1016/B978-0-12-811989-1.00001-4\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eNie C, Jiang J, Deng J, Li K, Jia L, Sun T (2023) Predicting TFe content and sorting iron ores from hyperspectral image by variational mode decomposition-based spectral feature. J Clean Prod 429:139629. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.jclepro.2023.139629\u003c/span\u003e\u003cspan address=\"10.1016/j.jclepro.2023.139629\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eQiao B, Liu J, Wu P, Teng Y (2022) Wind power forecasting based on variational mode decomposition and high-order fuzzy cognitive maps. Appl Soft Comput 129:109586. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.asoc.2022.109586\u003c/span\u003e\u003cspan address=\"10.1016/j.asoc.2022.109586\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eShetty SS, Deepthi D, Harshitha S, Sonkusare S, Naik PB, Kumari SN, Madhyastha H (2023) Environmental pollutants and their effects on human health. Heliyon 9(9). \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.heliyon.2023.e19496\u003c/span\u003e\u003cspan address=\"10.1016/j.heliyon.2023.e19496\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWang C, Li W, Huang M (2019) High precision wide range online chemical oxygen demand measurement method based on ultraviolet absorption spectroscopy and full-spectrum data analysis. Sens Actuators B Chem 300:126943. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.snb.2019.126943\u003c/span\u003e\u003cspan address=\"10.1016/j.snb.2019.126943\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWang H, Kang C, Tian Z, Zhang A, Cao Y (2023) Vegetation periodic changes and relationships with climate in Inner Mongolia Based on the VMD method. Ecol Indic 146:109764. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.ecolind.2022.109764\u003c/span\u003e\u003cspan address=\"10.1016/j.ecolind.2022.109764\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWang J, Xue B, Wang Y, Wang AY G (2024) Identification of pollution source and prediction of water quality based on deep learning techniques. J Contam Hydrol 261:104287. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.jconhyd.2023.104287\u003c/span\u003e\u003cspan address=\"10.1016/j.jconhyd.2023.104287\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWang X, Wu D, Yuan D, Wu X (2022) A nano-lead dioxide-composite electrochemical sensor for the determination of chemical oxygen demand. J Environ Chem Eng 10(3):107464. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.jece.2022.107464\u003c/span\u003e\u003cspan address=\"10.1016/j.jece.2022.107464\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eXu B, Pooi CK, Tan KM, Huang S, Shi X, Ng HY (2023) A novel long short-term memory artificial neural network (LSTM)-based soft-sensor to monitor and forecast wastewater treatment performance. J Water Process Eng 54:104041. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.jwpe.2023.104041\u003c/span\u003e\u003cspan address=\"10.1016/j.jwpe.2023.104041\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eXu W, Jiang Y, Zhang X, Li Y, Zhang R, Fu G (2020) Using long short-term memory networks for river flow prediction. Hydrol Res 51(6):1358\u0026ndash;1376. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.2166/nh.2020.026\u003c/span\u003e\u003cspan address=\"10.2166/nh.2020.026\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eYuan J, Liu Y, Wang J, Zhao Y, Li K, Jing Y, Zhang X (2018) Long-term Persistent Organic Pollutants Exposure Induced Telomere Dysfunction and Senescence- Associated Secretary Phenotype. J Gerontol Biol Sci Med Sci 73(8):1027\u0026ndash;1035. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1093/gerona/gly002\u003c/span\u003e\u003cspan address=\"10.1093/gerona/gly002\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eYousefi S, Berrani S-A, Garcia C (2017) Contribution of recurrent connectionist language models in improving LSTM-based Arabic text recognition in videos. Pattern Recognit 64:245\u0026ndash;254. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.patcog.2016.11.011\u003c/span\u003e\u003cspan address=\"10.1016/j.patcog.2016.11.011\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eZhang R, Li YS, Luo YX, Zhang XY, Wen R (2023) A carbon-dot fluorescence capillary sensor for the determination of chemical oxygen demand. Microchem J 187:108445. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.microc.2023.108445\u003c/span\u003e\u003cspan address=\"10.1016/j.microc.2023.108445\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eZhang R, Li YS, Luo YX, Zhang XY, Wen R (2023) A carbon-dot fluorescence capillary sensor for the determination of chemical oxygen demand. Microchem J 187:108445. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.microc.2023.108445\u003c/span\u003e\u003cspan address=\"10.1016/j.microc.2023.108445\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eZaynab M, Fatima M, Sharif Y, Sughra K, Sajid M, Khan KA, Sneharani AH, Li S (2021) Health and environmental effects of silent killers Organochlorine pesticides and polychlorinated biphenyl. J King Saud Univ Sci 33(6):101511. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.jksus.2021.101511\u003c/span\u003e\u003cspan address=\"10.1016/j.jksus.2021.101511\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eZhang R, Li YS, Luo YX, Zhang XY, Wen R (2023) A carbon-dot fluorescence capillary sensor for the determination of chemical oxygen demand. Microchem J 187:108445. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.microc.2023.108445\u003c/span\u003e\u003cspan address=\"10.1016/j.microc.2023.108445\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":true,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"stochastic-environmental-research-and-risk-assessment","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"serr","sideBox":"Learn more about [Stochastic Environmental Research and Risk Assessment](https://www.springer.com/journal/477)","snPcode":"477","submissionUrl":"https://submission.nature.com/new-submission/477/3","title":"Stochastic Environmental Research and Risk Assessment","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"Springer Hybrid","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"frequency division, driving analysis, prediction model, COD","lastPublishedDoi":"10.21203/rs.3.rs-4716541/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-4716541/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eCOD (Chemical Oxygen Demand) is an important indicator to measure organic pollution of water body. To strengthen in-depth analysis and prediction of COD, a new method was proposed in this paper. A frequency division method, Variational Mode Decomposition (VMD) was used to complete time domain decomposition of COD data before model simulation. The original data was separated into five signals with different frequency bands, IMF1, IMF2, IMF3, IMF4 and IMF5, with which the influence of meteorological factors and water quality factors on COD were explored. The long-term COD content is mainly driven by nutrient factors phosphorus and nitrogen, while the immediate fluctuation characteristics exhibit relatively stability. Random Forest, Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) were used to predict COD with the original data and the signal data processed by VMD. It is found that frequency division can improve simulation stability and accuracy of GRU and LSTM more significantly than Random Forest. VMD-GRU and VMD-LSTM models can be used reliably for COD analyzation and prediction in Chengdu area.\u003c/p\u003e","manuscriptTitle":"Driving analysis and prediction of COD based on frequency division","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2024-08-05 08:48:08","doi":"10.21203/rs.3.rs-4716541/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"decision","content":"Revision requested","date":"2024-07-10T21:46:35+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2024-07-10T21:37:10+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2024-07-10T12:18:51+00:00","index":"","fulltext":""},{"type":"submitted","content":"Stochastic Environmental Research and Risk Assessment","date":"2024-07-10T08:10:07+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"stochastic-environmental-research-and-risk-assessment","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"serr","sideBox":"Learn more about [Stochastic Environmental Research and Risk Assessment](https://www.springer.com/journal/477)","snPcode":"477","submissionUrl":"https://submission.nature.com/new-submission/477/3","title":"Stochastic Environmental Research and Risk Assessment","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"Springer Hybrid","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"2da63390-51c1-4b4e-8011-2e197c650003","owner":[],"postedDate":"August 5th, 2024","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"published-in-journal","subjectAreas":[],"tags":[],"updatedAt":"2025-02-24T15:58:48+00:00","versionOfRecord":{"articleIdentity":"rs-4716541","link":"https://doi.org/10.1007/s00477-025-02921-5","journal":{"identity":"stochastic-environmental-research-and-risk-assessment","isVorOnly":false,"title":"Stochastic Environmental Research and Risk Assessment"},"publishedOn":"2025-02-20 15:56:53","publishedOnDateReadable":"February 20th, 2025"},"versionCreatedAt":"2024-08-05 08:48:08","video":"","vorDoi":"10.1007/s00477-025-02921-5","vorDoiUrl":"https://doi.org/10.1007/s00477-025-02921-5","workflowStages":[]},"version":"v1","identity":"rs-4716541","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-4716541","identity":"rs-4716541","version":["v1"]},"buildId":"qtupq5eGEP_6zYnWcrvyt","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2024) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00