Comparison of Sarima and Lstm Model to Forecast the Comprehensive Income (Loss) of Life Insurances in Indonesia | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Comparison of Sarima and Lstm Model to Forecast the Comprehensive Income (Loss) of Life Insurances in Indonesia Hiteash Mahajan This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-9180930/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract This study compares the performance of the Seasonal Autoregressive Integrated Moving Average (SARIMA) and Long Short-Term Memory (LSTM) models in predicting time series data with seasonal patterns. Using historical data, namely total comprehensive income (loss) data for all insurance companies in Indonesia that contain strong seasonal components, the SARIMA model proved superior in capturing and representing seasonal patterns compared to LSTM. However, the evaluation results showed that the overall prediction accuracy of both models is still at a level that needs improvement. These findings suggest that although SARIMA is effective for seasonal data, the integration of statistical and deep learning-based models has the potential to improve prediction performance. Future research is recommended to explore hybrid approaches, hyperparameter optimization, and the use of external variables to improve the accuracy and generalization of predictive models. ARIMA Forecast Insurance Indonesia LSTM Total Comprehensive Income Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Figure 6 Figure 7 1. INTRODUCTION The comprehensive income statement is a report that measures a company's performance over a specific period. It is used to assess and predict the amount and timing of future cash flows, subject to uncertainties. The comprehensive income statement describes the sources of income earned by the company in conducting its business, as well as the types of costs incurred by the company in carrying out its activities. By observing or noting the difference between revenues and expenses, it is possible to determine the company's profit or loss during a specific period. The comprehensive income statement is useful for helping financial statement users predict future cash flows, in order to determine profitability, investment value, and creditworthiness. The comprehensive income statement is often used by investors use information about a company's past earnings as an important input in predicting future earnings and cash flows, which then serve as the basis for predicting the company's future stock price and dividends. Creditors also use past earnings information, creditors can understand a potential borrower's ability to generate future cash flows to cover interest and repay loan principal. Creditors do not primarily desire the liquidation of collateral, but rather the company's success in generating income and cash flow from operations. Beside that, the comprehensive income statement is considered important to investors and creditors, so it is only natural that management should also be interested in the comprehensive income statement. Furthermore, in many companies, bonuses given to managers are determined based on their success in achieving profit targets. Previous research has shown that traditional statistical models such as ARIMA and GARCH have advantages in capturing linear patterns and volatility in financial data. ARIMA has been shown to be effective in forecasting short-term trends and seasonal patterns in time series data (Box & Jenkins, 2015). Meanwhile, GARCH excels in modeling heteroscedasticity, thus representing the volatility fluctuations common in financial and insurance market data (Bollerslev, 1986; Engle, 2001). With their relatively simple structures, these two models provide fairly accurate results for data with linear patterns, clear interpretations, and lower computational requirements than modern machine learning approaches. Insurance financial data is volatile due to the influence of premiums, claims, investment returns, and operational costs. Traditional statistical models such as ARIMA or GARCH are capable of capturing simple linear patterns, but are often less effective when dealing with complex and non- linear interactions. Therefore, modern approaches such as machine learning, specifically LSTM (Long Short-Term Memory), are more suitable because they can learn both short-term and long- term patterns in insurance time series data, thus providing more accurate predictions of comprehensive profit. The LSTM model significantly improved prediction accuracy of mutual fund performance over traditional statistical approaches. It effectively captured temporal patterns in the data, helping investors anticipate fund behavior and make informed decisions. The study from (Wang, 2024) LSTM can serve as a powerful tool for fund managers and retail investors navigating complex financial markets. It confirms that LSTM models offer a robust, accurate, and practical method for predicting mutual fund performance, making them valuable for data-driven investment strategies (Wang, 2024). The study from Nelson et. al (2017) explores the application of Long Short-Term Memory (LSTM) networks in predicting future stock price trends, utilizing historical price data in conjunction with technical analysis indicators. A predictive model was developed and a series of experiments were conducted. The resulting performance was evaluated using several metrics to determine whether this type of algorithm offers improvements over traditional machine learning methods and existing investment strategies. The results are promising, with the model achieving an average accuracy of up to 55.9% in predicting whether the price of a given stock would rise in the near future. Most forecasting research in the insurance sector still focuses on premiums, claims, or claim reserves, while the Total Comprehensive Income (Loss) variable, which encompasses both profit and loss and Other Comprehensive Income, is relatively rarely studied. This indicator, however, is a more comprehensive indicator for assessing the financial health of insurance companies. Most ARIMA/GARCH studies have been conducted on the stock market or macroeconomics (e.g., GDP, inflation, oil prices), but their application to financial data of insurance companies in Indonesia is still very limited. Previous research tends to use only one type of model (e.g., ARIMA alone), without comparing it with variations such as seasonal SARIMA or simple benchmarks (naïve/ETS). This results in less comprehensive evaluations of model accuracy. Some previous research has focused solely on short-term forecasting, while medium- and long- term forecasting is more relevant for risk management and financial planning, but remains rarely studied. This study fills this gap by using Total Comprehensive Income (Loss) as the primary variable, thus providing a more representative forecasting method for assessing the overall performance of insurance companies than simply looking at premiums or claims. This study provides a contextual contribution by testing a forecasting model on insurance financial data in Indonesia, an area where there is still limited literature. In addition to using ARIMA/SARIMA, this study tests model performance using various evaluation metrics (RMSE, MAE, MAPE), providing a more comprehensive picture of prediction quality. The results offer an analytical basis for planning risk management and investment strategies, as they are able to anticipate future comprehensive profit trends. 2. LITERATURE REVIEW The Autoregressive Integrated Moving Average (ARIMA) is a family of Box–Jenkins models for time series forecasting that combines autoregressive (AR), differencing (I), and moving average (MA) components. The ARIMA model consists of three parts: • Autoregressive (AR): The linear relationship between its current value and its lagged observations. • Differencing (I): The number of times the series is differenced to achieve stationarity to remove trends and seasonality. • Moving Average (MA): Relates the current value to past forecast errors using a number of lagged errors. This model is very effective for data that has a linear pattern and requires the assumption of stationarity before modeling. The following is the general formula for ARIMA. ∆𝑌𝑡 = 𝜑1∆𝑌𝑡−1 + 𝜑2∆𝑌𝑡−2 + 𝜑𝑝∆𝑦𝑡−𝑝 + 𝑒𝑡 − 𝜃1𝑒𝑡−1 − 𝜃2𝑒𝑡−2 − 𝜃𝑞𝑒𝑡−𝑞 Where: • ∆𝑌𝑡 is the differenced value of the original time series at time t • 𝜑1, 𝜑2, … 𝜑𝑝 are the autoregressive (AR) coefficients • 𝜃1, 𝜃2, … 𝜃𝑞 are the moving average (MA) coefficients • 𝑒𝑡 is the white noise (random error) at time t • 𝑝 is the number of autoregressive terms • 𝑞 is the number of moving average terms This equation combines both autoregressive (AR) and moving average (MA) components, applied to a differenced version of the series (indicated by ∆), to model time-dependent patterns in the data. Standard practice includes identification (stationarity tests, ACF/PACF), parameter estimation, and residual diagnostics; common extensions include SARIMA (seasonal) and ARIMAX/SARIMAX (with exogenous variables). Classical and modern introductory references place ARIMA alongside exponential smoothing as the two most widely used approaches in applied forecasting. Long Short-Term Memory (LSTM) networks have been shown to be one of the best choices when it comes to handling sequential data. LSTM (Long Short-Term Memory) is a development of the Recurrent Neural Network (RNN) architecture designed to overcome the weakness of RNNs in remembering long-term information (vanishing gradient problem). LSTM has a cell state structure with three main gates: input gate, forget gate, and output gate, which function to regulate the flow of information adaptively. The advantage of LSTM compared to conventional RNNs is its ability to remember patterns over long time spans, manage long-term and short-term dependencies, and capture non-linear relationships in the data. This makes LSTM very suitable for forecasting complex time series, high fluctuations, or patterns that are difficult to capture by linear models such as ARIMA. Their unique architecture comprises memory cells along with three gates: • Input Gates: Controls which parts of the previous memory are discarded. • Forget Gates: Determines what new information should be added to the memory. • Output Gates: Regulates what information is output for the current step. These components work together to selectively retain, update, and output information to capture both the short-term fluctuations and long-term trends in the dataset. Zou and Qu (2020) implemented LSTM networks to predict stock prices and analyze this strategy against others. Their study showed that the LSTM model outperformed the traditional RNNs and machine learning methods by effectively learning patterns in historical stock price data. Sonkavde et al. (2023) conducted a review comparing machine learning and deep learning methods which included LSTM, Convolutional Neural Networks (CNNs), and traditional RNNs. Their result showed the effectiveness of the LSTM model compared to RNNs and CNNs by handling temporal and spatial features better. Study from Sak et al (2014), LSTM recurrent neural networks (RNNs) outperform both deep neural networks (DNNs) and traditional RNNs in the context of acoustic modeling. The following are the general equations and architecture of LSTM Forget gate : Controls what portion of the previous cell state should be retained: 𝑓𝑡 = 𝜎(𝑊𝑓 × 𝑦𝑡 + 𝑈𝑓 × ℎ𝑡−1 + 𝑏𝑓) ( 1 ) Input gate : Determines which new information should be added to the cell state: 𝐼𝑡 = 𝜎(𝑊𝑖 × 𝑦𝑡 + 𝑈𝑖 × ℎ𝑡−1 + 𝑏𝑖 ) ( 2 ) Ć𝑡 = tanh(𝑊𝑐 × 𝑦𝑡 + 𝑈𝑐 × ℎ𝑡−1 + 𝑏𝑐) ( 3 ) C𝑡 = tanh ((𝑓𝑡 × 𝐶𝑡−1 + 𝑖𝑡) × Ć𝑡) ( 4 ) Output gate : Controls the output of the LSTM cell: 𝑂𝑡 = 𝜎(𝑊𝑜 × 𝑦𝑡 + 𝑈𝑜 × ℎ𝑡−1 + 𝑏𝑜) ( 5 ) ℎ𝑡 = 𝑂𝑡 × tanh(C𝑡) ( 6 ) Where: 𝜎 denotes the sigmoid activation function tanh denotes the hyperbolic tangent activation 𝑂𝑡 indicates element-wise multiplication W, U, and b represent weight matrices and bias vectors These equations enable the LSTM to retain, update, and output relevant features across time steps, making it highly effective for sequence modeling tasks such as language modeling, time- series forecasting, and speech recognition. 3. RESEARCH METHOD This study uses a quantitative approach with time series forecasting. The Autoregressive Integrated Moving Average (ARIMA/SARIMA) model chosen because it has proven effective in predicting both linear and fluctuating financial data. 3.1 Research Data Data Type that we used in this study is secondary data in the form of insurance company financial reports. With predicted variable is Total Comprehensive Income (Loss) of Life Insurance in Indonesia. Data Period is from 2017 to 2025. 3.2 Analysis Stages The analysis process follows the Box–Jenkins framework (Box & Jenkins, 2015; Hyndman & Athanasopoulos, 2021): Data Stationarity Test: we utilized the Augmented Dickey-Fuller (ADF) Test and/or the KPSS Test. If non-stationary, we perform a transformation (log/Box-Cox) or differencing. Model Identification: The Autocorrelation Function (ACF) and Partial Autocorrelation Function (PACF) plots are used to determine the order of p, d, and q. If a seasonal pattern is present, the SARIMA (Seasonal ARIMA) model is used. Model Parameter Estimation: ARIMA/SARIMA parameters are estimated using the Maximum Likelihood Estimation (MLE) method. Model Diagnostics: The Ljung Box test is used to ensure the residuals are white noise. The best model is selected based on the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC). Forecasting: The best model is used to predict Total Comprehensive Income (Loss) in the future period. Accuracy is evaluated using Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and Mean Absolute Percentage Error (MAPE). 4. RESULTS AND DISCUSSION Based on Table 1 , the company's monthly Total Comprehensive Income (Loss) during the period January 2017 to May 2025, which consists of 101 observation data, has a maximum value of 14,759,160.71 which occurred in December 2017, while the minimum value of -12,191,989.15 occurred in October 2018. Overall, the average Total Comprehensive Income (Loss) during the period was 1,481,665.00, which shows that in general the company recorded positive comprehensive income even though there were significant fluctuations in certain periods. Table 1 First ten observations of total monthly Comprehensive Income (Loss) Months Total Comprehensive Income (Loss) 2017-01-01 1,474,870.8 2017-02-01 2,638,107.4 2017-03-01 5,308,832.8 2017-04-01 6,259,147.3 2017-05-01 8,074,418.8 2017-06-01 9,828,258.4 2017-07-01 10,828,261.0 2017-08-01 6,566,310.7 2017-09-01 7,458,188.1 2017-10-01 11,460,697.5 According to Fig. 2 , the company's monthly Total Comprehensive Income (Loss) trend from January 2017 to May 2025 shows a fluctuating pattern with several significant changes. At the beginning of the period (2017), there was a sharp increase that peaked in December 2017, followed by a drastic decline until it reached its lowest point in October 2018. Afterward, the Total Comprehensive Income (Loss) fluctuated with several extreme spikes and declines, such as in early 2022. Entering the period from 2023 to 2025, the movement tended to be more stable with an upward trend in early 2025, indicating potential positive growth in the coming period. 4.1 Time Series Decomposition This figure shows the results of decomposing the Total Comprehensive Income (Loss) time series into four main components. The Observed component shows the original data observed from January 2017 to May 2025, while the Trend component reveals a long-term pattern with a sharp decline until 2019, then a plateau with a slight increase towards 2025. The Seasonal component displays a seasonal pattern that repeats annually with significant fluctuations, indicating a periodic cycle in the data. Meanwhile, the Residual component contains random variation that cannot be explained by either trends or seasonal patterns, likely influenced by external factors or unforeseen events. This decomposition is useful for separating the influence of trends and seasonality from random variation, thus facilitating the analysis and forecasting process. 4.2 Split Data Figure 3 shows the division of the Total Comprehensive Income (Loss) data into two parts: training data and testing data in an 80:20 ratio. Training data is used to build predictive models, while test data is used to evaluate the model's performance on data it has never seen before. From this visualization, we can see that the training data (blue line) covers the period from January 2017 to approximately mid-2023, while the test data (red line) covers the remaining period until May 2025. This separation is important so the model can learn from historical patterns and test its ability to predict future trends without data leakage. 4.3 Data Stationarity The following figure presents plots of the Autocorrelation Function (ACF) and Partial Autocorrelation Function (PACF) on the training data set. These plots are used to observe the interrelationships between observations in a time series and help identify potential autoregressive (AR) and moving average (MA) components before building an ARIMA or SARIMA model. The ACF and PACF plots in the figure are used to identify the possible orders of the autoregressive (AR) and moving average (MA) components in the time series model. Based on the results of the Augmented Dickey-Fuller (ADF) test, the ADF statistic is -3.2049 with a p-value of 0.0197, which is less than the 0.05 significance threshold. Furthermore, this value is also lower than the critical value at the 5% significance level (-2.8989), thus concluding that the data is stationary. In the next stage, seasonal differencing will be performed to eliminate any remaining seasonal patterns and ensure the data meets the stationary assumption more optimally. This stationary condition is important because it is one of the main prerequisites before building an ARIMA or SARIMA model. SARIMA Model Identification (p,d,q) (P,D,Q)^s After seasonal differencing to eliminate annual recurring patterns, the SARIMA model identification process begins with an analysis of the ACF and PACF plots on the differencing data to estimate the initial values of the parameters p, q, P, Qp, q, P, Q. The auto-ARIMA method is also used to automatically suggest parameter combinations based on the information criterion (AIC/BIC). The results of these two approaches are then compared and tested through a model overfitting process, which involves trying various parameter combinations around candidate values to ensure the selected model strikes a balance between goodness of fit and simplicity. This process yields three best candidate SARIMA models, which will be further evaluated in depth during the final model selection stage. Table 2 ARIMA Model Evaluation Based on AIC, BIC, and HQIC Model AIC BIC HQIC ARIMA (1,0,0) (0,1,1) [12] 2265.71 2272.38 2268,36 ARIMA (1,0,1) (0,1,1) [12] 2268.10 2276.98 2271,62 ARIMA (2,0,0) (0,1,1) [12] 2268.73 2277.60 2272,24 Based on the model selection results, three best SARIMA candidates were obtained after going through the initial identification process, auto-ARIMA, and overfitting. The three models are ARIMA (1,0,0) (0,1,1) [12], ARIMA (1,0,1) (0,1,1) [12], and ARIMA (2,0,0) (0,1,1) [12]. The AIC, BIC and HQIC values for each model are relatively close, indicating that they have a similar level of model fit. The ARIMA (1,0,0)(0,1,1) [12] model has the lowest AIC and HQIC, making it slightly superior to the other candidates in terms of information criteria 4.4 Evaluation of the SARIMA model on the testing data The SARIMA (1,0,0) (0,1,1) [12] model was used to forecast Total Comprehensive Income (Loss) data. In the graph, the training data is shown by the blue line, the test data by the green line, and the model's prediction results by the dashed red line. The model can be seen to follow the general direction of data movement, but tends to produce smoother predictions and is less able to capture sharp fluctuations in the test data. Based on the performance evaluation, this model produced an RMSE of 3,858,420.47, an MSE of 1.4887×10¹³, and a MAPE of 85.00%. The relatively high MAPE value indicates that the relative error of the predictions to the actual data is still large, so the model's accuracy is not optimal even though seasonal patterns can be represented. 4.5 LSTM Model Hyperparameters The prediction process using the LSTM model begins with designing an architecture tailored to the characteristics of the time series data. The architecture used consists of one LSTM layer connected to a dense layer as the final output. Model training was conducted by testing 144 hyperparameter variations, including different numbers of neurons, epochs, learning rates, and the use of two optimization algorithms: Adam and RMSProp. All combinations were trained on the training data and validated using the validation data to achieve consistent performance. The batch size was selected by considering the proportion of training and test data, ensuring efficient and stable training. The best combination for each walk-forward validation scenario was selected based on the lowest average RMSE value obtained from all tested configurations. Table 3 LSTM Model Hyperparameters Hyperparameter Value Optimizer Adam, RMSProp, SGD Neuron 4, 8, 12, 16 Learning Rate 0.001, 0.0001, 0.00001 Epoch 10, 25, 40, 60 Layer 1 Dropout 0.1 The best hyperparameters were selected using a manual grid search: all candidate combinations (number of neurons, number of epochs, learning rate, and optimizer type Adam, RMSprop, or SGD) were arranged as a grid. Each combination was then trained (with dropout and early stopping), evaluated on validation data, and its value recorded. The primary selection criterion was the average RMSE (in a walk-forward scenario, if applicable); the combination with the lowest RMSE was selected as the final configuration. This manual grid search approach ensured a transparent and thorough process, consistently testing every point in the hyperparameter space, allowing the best results (16 neurons, 60 epochs, lr = 0.001, Adam optimizer) to be objectively identified. 4.6 LSTM Model Testing with Best Hyperparameters The Total Comprehensive Income (Loss) time series was divided into training (blue) and test (green) data. The data was prepared for the LSTM through normalization and sequential windowing (input–target) to allow the model to learn temporal patterns. Experiments were performed with various combinations of hyperparameters (number of neurons, learning rate, Adam/RMSProp optimizer, epochs, and batch size) and evaluated on validation to select the most stable configuration. The graph shows a comparison of the training data (blue), test data (green), and LSTM predictions (dotted orange). The model is able to follow the general level of the series during the test period, but tends to be unable to capture large spikes in the test data. Overall, the main pattern is captured, but the amplitude of the fluctuations remains muted; improvements could be directed towards adjusting the window length, adding more layers/neurons, tuning the learning rate, or adding exogenous features if available. Based on the performance evaluation results, the LSTM model produced an RMSE of 3067615×106, an MSE of 9410260×10¹², and a MAPE of 165.29% on the test data. The relatively large RMSE and MSE values indicate a significant deviation from the actual predictions on an absolute scale. Meanwhile, a high MAPE indicates a large relative error percentage, so the model is unable to provide predictions with sufficient accuracy. This is consistent with the visualization results, where the model tends to produce predictions that are too smooth (underfitting) and is less able to capture spikes in values during certain periods. 5. CONCLUSION SARIMA is capable of capturing annual seasonal patterns and general trends, but its predictions tend to be smooth, making them less responsive to sharp spikes or drops. The relatively high MAPE value indicates that the prediction accuracy is relatively suboptimal. LSTM is more flexible in capturing non-linear patterns, but on this data it tends to underfit, producing predictions that are too smooth and do not capture extreme fluctuations. The high MAPE value reinforces this indication. In general, SARIMA provides better results than LSTM on this dataset, especially in terms of representing seasonal patterns, although overall accuracy still needs improvement. Given that SARIMA outperformed LSTM in capturing seasonal patterns in this dataset, future studies should consider the following directions to enhance overall forecasting accuracy, Combining the strengths of both models through hybrid frameworks (e.g., SARIMA-LSTM or ARIMA-GRU) could leverage SARIMA's ability to handle seasonality and LSTM’s strength in modeling nonlinear patterns and long-term dependencies. Future research could apply more advanced hyperparameter tuning methods (e.g., grid search, Bayesian optimization) to optimize LSTM architecture (e.g., number of layers, neurons, learning rate), which may result in performance improvements. Declarations ACKNOWLEDGEMENTS The author would like to express sincere gratitude to all parties who have provided support in the preparation of this article. Special thanks to my parents, colleagues and others who have offered constructive feedback, suggestions and encouragement. References Bollerslev T (1986) Generalized autoregressive conditional heteroskedasticity. J Econ 31(3):307–327. https://doi.org/10.1016/0304-4076(86)90063-1 Box GEP, Jenkins GM, Reinsel GC, Ljung GM (2015) Time series analysis: Forecasting and control (5th ed.). Wiley Engle RF (2001) GARCH 101: The use of ARCH/GARCH models in applied econometrics. J Economic Perspect 15(4):157–168. https://doi.org/10.1257/jep.15.4.157 Hyndman RJ, Athanasopoulos G (2021) Forecasting: Principles and practice (3rd ed.). OTexts . https://otexts.com/fpp3/ Nelson DMQ, Pereira ACM, de Oliveira RA (2017) Stock market's price movement prediction with LSTM neural networks, 2017 International Joint Conference on Neural Networks (IJCNN) , Anchorage, AK, USA, pp. 1419–1426. 10.1109/IJCNN.2017.7966019 Sak H, Senior AW, Beaufays F (2014) Long short-term memory recurrent neural network architectures for large scale acoustic modeling. Interspeech Sonkavde G, Dharrao DS, Bongale AM, Deokate ST, Doreswamy D, Bhat SK (2023) Forecasting Stock Market Prices Using Machine Learning and Deep Learning Models: A Systematic Review, Performance Analysis and Discussion of Implications. Int J Financial Stud 11(3):94. https://doi.org/10.3390/ijfs11030094 Wang Y (2024) Predicting Mutual Fund Performance based on LSTM Models. Highlights Sci Eng Technol. https://doi.org/10.54097/qezdm154 Zou Z, Qu Z (2020) Using LSTM in Stock prediction and Quantitative Trading. CS230: Deep Learning, Winter 2020 . https://cs230.stanford.edu/projects_winter_2020/reports/32066186.pdf accessed on 10 July 2025 Additional Declarations The authors declare no competing interests. Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-9180930","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":609598455,"identity":"753e5096-a5ba-41d3-ab75-ca055ac26803","order_by":0,"name":"Hiteash Mahajan","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA+UlEQVRIiWNgGAWjYBACA2YwdQDCS6gAEszMDQS0MMO0ABkPzoBoRgJaGJC0MD5sA3EIaDFn5z/44MefO3Ybbvcf/JA4rzaavx2o5UfFNpxaLJuZmQ17254lb7hzmFkicdvx3BmHGRsYe87cxu2ww8xsErwNh5MNbiQzALUcy20AamFmbMOrhf3nnz9gLcw/Euccy51PhBY2Zh62w3ZALWwSiQ01uRuI0GIsLdt2OEHyzmEzi4RjB3I3ArUcxOuX8wcffnzz57A93+3Gxzd/1NTlzjt/GBiGFbi1wEBigwSYPgwmDxBUDwT2DBAtdcQoHgWjYBSMghEGAAzSY5DrMZQbAAAAAElFTkSuQmCC","orcid":"https://orcid.org/0009-0002-3682-4610","institution":"Department of Computer Science Engineering, UIET Kathua","correspondingAuthor":true,"prefix":"","firstName":"Hiteash","middleName":"","lastName":"Mahajan","suffix":""}],"badges":[],"createdAt":"2026-03-20 16:56:14","currentVersionCode":1,"declarations":{"humanSubjects":false,"vertebrateSubjects":false,"conflictsOfInterestStatement":false,"humanSubjectEthicalGuidelines":false,"humanSubjectConsent":false,"humanSubjectClinicalTrial":false,"humanSubjectCaseReport":false,"vertebrateSubjectEthicalGuidelines":false},"doi":"10.21203/rs.3.rs-9180930/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-9180930/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":105178841,"identity":"52236104-d841-4db7-9339-e726fefc8888","added_by":"auto","created_at":"2026-03-23 06:47:58","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":353644,"visible":true,"origin":"","legend":"\u003cp\u003eGambar 1 Arsitektur LSTM\u003c/p\u003e","description":"","filename":"floatimage1.png","url":"https://assets-eu.researchsquare.com/files/rs-9180930/v1/1c15ff4380f570c160ee258b.png"},{"id":105178836,"identity":"99254324-a060-4dbd-b9fe-398b1b0abc32","added_by":"auto","created_at":"2026-03-23 06:47:57","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":99687,"visible":true,"origin":"","legend":"\u003cp\u003eMonthly Comprehensive Income (Loss) Trend\u003c/p\u003e","description":"","filename":"floatimage2.png","url":"https://assets-eu.researchsquare.com/files/rs-9180930/v1/ba31a4a261f9e46c467cf45a.png"},{"id":105178851,"identity":"e0db2b2d-66a3-425c-a7d5-d656eef543ca","added_by":"auto","created_at":"2026-03-23 06:47:59","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":502318,"visible":true,"origin":"","legend":"\u003cp\u003eTime Series Decomposition Plot\u003c/p\u003e","description":"","filename":"floatimage3.png","url":"https://assets-eu.researchsquare.com/files/rs-9180930/v1/9d7726ec24b10c91cbcc472d.png"},{"id":105178838,"identity":"697a8195-db3e-4cc0-b2bc-b60736dca518","added_by":"auto","created_at":"2026-03-23 06:47:57","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":401113,"visible":true,"origin":"","legend":"\u003cp\u003eTraining data and testing data of the Total Comprehensive Income (Loss) of Life Insurance in Indonesia\u003c/p\u003e","description":"","filename":"floatimage4.png","url":"https://assets-eu.researchsquare.com/files/rs-9180930/v1/30923dfd7df4f04e337a0bf5.png"},{"id":105564155,"identity":"ee548187-f7f4-42ad-8083-4e777f1841cc","added_by":"auto","created_at":"2026-03-27 12:48:52","extension":"png","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":45459,"visible":true,"origin":"","legend":"\u003cp\u003eAutocorrelation Function (ACF) and Partial Autocorrelation Function (PACF) on the training data set\u003c/p\u003e","description":"","filename":"floatimage5.png","url":"https://assets-eu.researchsquare.com/files/rs-9180930/v1/64085cdc3307085459f7c94a.png"},{"id":105178840,"identity":"292aa849-2e47-48e1-b58a-2b73c4e65cc8","added_by":"auto","created_at":"2026-03-23 06:47:57","extension":"png","order_by":6,"title":"Figure 6","display":"","copyAsset":false,"role":"figure","size":439664,"visible":true,"origin":"","legend":"\u003cp\u003eForecasting of SARIMA model\u003c/p\u003e","description":"","filename":"floatimage6.png","url":"https://assets-eu.researchsquare.com/files/rs-9180930/v1/b6a5651742451e83df7f4709.png"},{"id":105178839,"identity":"69ad7a43-25de-444c-970d-20c2f3578dfa","added_by":"auto","created_at":"2026-03-23 06:47:57","extension":"png","order_by":7,"title":"Figure 7","display":"","copyAsset":false,"role":"figure","size":383942,"visible":true,"origin":"","legend":"\u003cp\u003eFigure 6. Comparison of Training Data, Testing Data and LSTM Prediction\u003c/p\u003e","description":"","filename":"floatimage7.png","url":"https://assets-eu.researchsquare.com/files/rs-9180930/v1/78988501b1098a74a72fb288.png"},{"id":105903891,"identity":"f327b4b3-9506-4fdb-8ced-62bd372bfa8f","added_by":"auto","created_at":"2026-04-01 09:56:56","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":2726046,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-9180930/v1/49111a14-d28d-412f-b817-459e959578b4.pdf"}],"financialInterests":"The authors declare no competing interests.","formattedTitle":"\u003cp\u003eComparison of Sarima and Lstm Model to Forecast the Comprehensive Income (Loss) of Life Insurances in Indonesia\u003c/p\u003e","fulltext":[{"header":"1. INTRODUCTION","content":"\u003cp\u003eThe comprehensive income statement is a report that measures a company's performance over a specific period. It is used to assess and predict the amount and timing of future cash flows, subject to uncertainties. The comprehensive income statement describes the sources of income earned by the company in conducting its business, as well as the types of costs incurred by the company in carrying out its activities. By observing or noting the difference between revenues and expenses, it is possible to determine the company's profit or loss during a specific period.\u003c/p\u003e \u003cp\u003eThe comprehensive income statement is useful for helping financial statement users predict future cash flows, in order to determine profitability, investment value, and creditworthiness. The comprehensive income statement is often used by investors use information about a company's past earnings as an important input in predicting future earnings and cash flows, which then serve as the basis for predicting the company's future stock price and dividends. Creditors also use past earnings information, creditors can understand a potential borrower's ability to generate future cash flows to cover interest and repay loan principal. Creditors do not primarily desire the liquidation of collateral, but rather the company's success in generating income and cash flow from operations. Beside that, the comprehensive income statement is considered important to investors and creditors, so it is only natural that management should also be interested in the comprehensive income statement. Furthermore, in many companies, bonuses given to managers are determined based on their success in achieving profit targets.\u003c/p\u003e \u003cp\u003ePrevious research has shown that traditional statistical models such as ARIMA and GARCH have advantages in capturing linear patterns and volatility in financial data. ARIMA has been shown to be effective in forecasting short-term trends and seasonal patterns in time series data (Box \u0026amp; Jenkins, 2015). Meanwhile, GARCH excels in modeling heteroscedasticity, thus representing the volatility fluctuations common in financial and insurance market data (Bollerslev, 1986; Engle, 2001). With their relatively simple structures, these two models provide fairly accurate results for data with linear patterns, clear interpretations, and lower computational requirements than modern machine learning approaches.\u003c/p\u003e \u003cp\u003eInsurance financial data is volatile due to the influence of premiums, claims, investment returns, and operational costs. Traditional statistical models such as ARIMA or GARCH are capable of capturing simple linear patterns, but are often less effective when dealing with complex and non- linear interactions. Therefore, modern approaches such as machine learning, specifically LSTM (Long Short-Term Memory), are more suitable because they can learn both short-term and long- term patterns in insurance time series data, thus providing more accurate predictions of comprehensive profit.\u003c/p\u003e \u003cp\u003eThe LSTM model significantly improved prediction accuracy of mutual fund performance over traditional statistical approaches. It effectively captured temporal patterns in the data, helping investors anticipate fund behavior and make informed decisions. The study from (Wang, 2024) LSTM can serve as a powerful tool for fund managers and retail investors navigating complex financial markets. It confirms that LSTM models offer a robust, accurate, and practical method for predicting mutual fund performance, making them valuable for data-driven investment strategies (Wang, 2024). The study from Nelson et. al (2017) explores the application of Long Short-Term Memory (LSTM) networks in predicting future stock price trends, utilizing historical price data in conjunction with technical analysis indicators. A predictive model was developed and a series of experiments were conducted. The resulting performance was evaluated using several metrics to determine whether this type of algorithm offers improvements over traditional machine learning methods and existing investment strategies. The results are promising, with the model achieving an average accuracy of up to 55.9% in predicting whether the price of a given stock would rise in the near future.\u003c/p\u003e \u003cp\u003eMost forecasting research in the insurance sector still focuses on premiums, claims, or claim reserves, while the Total Comprehensive Income (Loss) variable, which encompasses both profit and loss and Other Comprehensive Income, is relatively rarely studied. This indicator, however, is a more comprehensive indicator for assessing the financial health of insurance companies. Most ARIMA/GARCH studies have been conducted on the stock market or macroeconomics (e.g., GDP, inflation, oil prices), but their application to financial data of insurance companies in Indonesia is still very limited. Previous research tends to use only one type of model (e.g., ARIMA alone), without comparing it with variations such as seasonal SARIMA or simple benchmarks (na\u0026iuml;ve/ETS). This results in less comprehensive evaluations of model accuracy.\u003c/p\u003e \u003cp\u003eSome previous research has focused solely on short-term forecasting, while medium- and long- term forecasting is more relevant for risk management and financial planning, but remains rarely studied. This study fills this gap by using Total Comprehensive Income (Loss) as the primary variable, thus providing a more representative forecasting method for assessing the overall performance of insurance companies than simply looking at premiums or claims. This study provides a contextual contribution by testing a forecasting model on insurance financial data in Indonesia, an area where there is still limited literature.\u003c/p\u003e \u003cp\u003eIn addition to using ARIMA/SARIMA, this study tests model performance using various evaluation metrics (RMSE, MAE, MAPE), providing a more comprehensive picture of prediction quality. The results offer an analytical basis for planning risk management and investment strategies, as they are able to anticipate future comprehensive profit trends.\u003c/p\u003e"},{"header":"2. LITERATURE REVIEW","content":"\u003cp\u003eThe Autoregressive Integrated Moving Average (ARIMA) is a family of Box–Jenkins models for time series forecasting that combines autoregressive (AR), differencing (I), and moving average (MA) components.\u003c/p\u003e\n\u003cp\u003eThe ARIMA model consists of three parts:\u003c/p\u003e\n\u003cp\u003e• Autoregressive (AR): The linear relationship between its current value and its lagged observations.\u003c/p\u003e\n\u003cp\u003e• Differencing (I): The number of times the series is differenced to achieve stationarity to remove trends and seasonality.\u003c/p\u003e\n\u003cp\u003e• Moving Average (MA): Relates the current value to past forecast errors using a number of lagged errors.\u003c/p\u003e\n\u003cp\u003eThis model is very effective for data that has a linear pattern and requires the assumption of stationarity before modeling. The following is the general formula for ARIMA.\u003c/p\u003e\n\u003cp\u003e∆𝑌𝑡 = 𝜑1∆𝑌𝑡−1 + 𝜑2∆𝑌𝑡−2 + 𝜑𝑝∆𝑦𝑡−𝑝 + 𝑒𝑡 − 𝜃1𝑒𝑡−1 − 𝜃2𝑒𝑡−2 − 𝜃𝑞𝑒𝑡−𝑞\u003c/p\u003e\n\u003cp\u003eWhere:\u003c/p\u003e\n\u003cp\u003e• ∆𝑌𝑡 is the differenced value of the original time series at time t\u003c/p\u003e\n\u003cp\u003e• 𝜑1, 𝜑2, … 𝜑𝑝 are the autoregressive (AR) coefficients\u003c/p\u003e\n\u003cp\u003e• 𝜃1, 𝜃2, … 𝜃𝑞 are the moving average (MA) coefficients\u003c/p\u003e\n\u003cp\u003e• 𝑒𝑡 is the white noise (random error) at time t\u003c/p\u003e\n\u003cp\u003e• 𝑝 is the number of autoregressive terms\u003c/p\u003e\n\u003cp\u003e• 𝑞 is the number of moving average terms\u003c/p\u003e\n\u003cp\u003eThis equation combines both autoregressive (AR) and moving average (MA) components, applied to a differenced version of the series (indicated by ∆), to model time-dependent patterns in the data.\u003c/p\u003e\n\u003cp\u003eStandard practice includes identification (stationarity tests, ACF/PACF), parameter estimation, and residual diagnostics; common extensions include SARIMA (seasonal) and ARIMAX/SARIMAX (with exogenous variables). Classical and modern introductory references place ARIMA alongside exponential smoothing as the two most widely used approaches in applied forecasting.\u003c/p\u003e\n\u003cp\u003eLong Short-Term Memory (LSTM) networks have been shown to be one of the best choices when it comes to handling sequential data. LSTM (Long Short-Term Memory) is a development of the Recurrent Neural Network (RNN) architecture designed to overcome the weakness of RNNs in remembering long-term information (vanishing gradient problem). LSTM has a cell state structure with three main gates: input gate, forget gate, and output gate, which function to regulate the flow of information adaptively. The advantage of LSTM compared to conventional RNNs is its ability to remember patterns over long time spans, manage long-term and short-term dependencies, and capture non-linear relationships in the data. This makes LSTM very suitable for forecasting complex time series, high fluctuations, or patterns that are difficult to capture by linear models such as ARIMA. Their unique architecture comprises memory cells along with three gates:\u003c/p\u003e\n\u003cp\u003e• Input Gates: Controls which parts of the previous memory are discarded.\u003c/p\u003e\n\u003cp\u003e• Forget Gates: Determines what new information should be added to the memory.\u003c/p\u003e\n\u003cp\u003e• Output Gates: Regulates what information is output for the current step.\u003c/p\u003e \u003cp\u003eThese components work together to selectively retain, update, and output information to capture both the short-term fluctuations and long-term trends in the dataset. Zou and Qu (2020) implemented LSTM networks to predict stock prices and analyze this strategy against others. Their study showed that the LSTM model outperformed the traditional RNNs and machine learning methods by effectively learning patterns in historical stock price data. Sonkavde et al. (2023) conducted a review comparing machine learning and deep learning methods which included LSTM, Convolutional Neural Networks (CNNs), and traditional RNNs. Their result showed the effectiveness of the LSTM model compared to RNNs and CNNs by handling temporal and spatial features better. Study from Sak et al (2014), LSTM recurrent neural networks (RNNs) outperform both deep neural networks (DNNs) and traditional RNNs in the context of acoustic modeling.\u003c/p\u003e \u003cp\u003eThe following are the general equations and architecture of LSTM\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eForget gate :\u003cdiv class=\"BlockQuote\"\u003e\u003cp\u003eControls what portion of the previous cell state should be retained:\u003c/p\u003e\u003cp\u003e\u0026#119891;\u0026#119905; = \u0026#120590;(\u0026#119882;\u0026#119891; \u0026times; \u0026#119910;\u0026#119905; + \u0026#119880;\u0026#119891; \u0026times; ℎ\u0026#119905;\u0026minus;1 + \u0026#119887;\u0026#119891;) (\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e)\u003c/p\u003e\u003cp\u003eInput gate :\u003c/p\u003e\u003cp\u003eDetermines which new information should be added to the cell state:\u003c/p\u003e\u003cp\u003e\u0026#119868;\u0026#119905; = \u0026#120590;(\u0026#119882;\u0026#119894; \u0026times; \u0026#119910;\u0026#119905; + \u0026#119880;\u0026#119894; \u0026times; ℎ\u0026#119905;\u0026minus;1 + \u0026#119887;\u0026#119894; ) (\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e)\u003c/p\u003e\u003cp\u003eĆ\u0026#119905; = tanh(\u0026#119882;\u0026#119888; \u0026times; \u0026#119910;\u0026#119905; + \u0026#119880;\u0026#119888; \u0026times; ℎ\u0026#119905;\u0026minus;1 + \u0026#119887;\u0026#119888;) (\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e)\u003c/p\u003e\u003cp\u003eC\u0026#119905; = tanh ((\u0026#119891;\u0026#119905; \u0026times; \u0026#119862;\u0026#119905;\u0026minus;1 + \u0026#119894;\u0026#119905;) \u0026times; Ć\u0026#119905;) (\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e)\u003c/p\u003e\u003cp\u003eOutput gate :\u003c/p\u003e\u003cp\u003eControls the output of the LSTM cell:\u003c/p\u003e\u003c/div\u003e\u003c/p\u003e \u003cp\u003e\u0026#119874;\u0026#119905; = \u0026#120590;(\u0026#119882;\u0026#119900; \u0026times; \u0026#119910;\u0026#119905; + \u0026#119880;\u0026#119900; \u0026times; ℎ\u0026#119905;\u0026minus;1 + \u0026#119887;\u0026#119900;) (\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e)\u003c/p\u003e \u003cp\u003eℎ\u0026#119905; = \u0026#119874;\u0026#119905; \u0026times; tanh(C\u0026#119905;) (\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e)\u003cdiv class=\"BlockQuote\"\u003e\u003cp\u003eWhere:\u003c/p\u003e\u003c/div\u003e\u003c/p\u003e \u003cp\u003e \u003cul\u003e \u003cli\u003e \u003cp\u003e\u0026#120590; denotes the sigmoid activation function\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003etanh denotes the hyperbolic tangent activation\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003e\u0026#119874;\u0026#119905; indicates element-wise multiplication\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eW, U, and b represent weight matrices and bias vectors\u003c/p\u003e \u003c/li\u003e \u003c/ul\u003e \u003c/p\u003e \u003cp\u003eThese equations enable the LSTM to retain, update, and output relevant features across time steps, making it highly effective for sequence modeling tasks such as language modeling, time- series forecasting, and speech recognition.\u003c/p\u003e"},{"header":"3. RESEARCH METHOD","content":"\u003cp\u003eThis study uses a quantitative approach with time series forecasting. The Autoregressive Integrated Moving Average (ARIMA/SARIMA) model chosen because it has proven effective in predicting both linear and fluctuating financial data.\u003c/p\u003e \u003cdiv id=\"Sec4\" class=\"Section2\"\u003e \u003ch2\u003e3.1 Research Data\u003c/h2\u003e \u003cp\u003e \u003cdiv class=\"BlockQuote\"\u003e \u003cp\u003eData Type that we used in this study is secondary data in the form of insurance company financial reports. With predicted variable is Total Comprehensive Income (Loss) of Life Insurance in Indonesia. Data Period is from 2017 to 2025.\u003c/p\u003e \u003c/div\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec5\" class=\"Section2\"\u003e \u003ch2\u003e3.2 Analysis Stages\u003c/h2\u003e \u003cp\u003eThe analysis process follows the Box\u0026ndash;Jenkins framework (Box \u0026amp; Jenkins, 2015; Hyndman \u0026amp; Athanasopoulos, 2021):\u003c/p\u003e \u003cp\u003e \u003cul\u003e \u003cli\u003e \u003cp\u003eData Stationarity Test: we utilized the Augmented Dickey-Fuller (ADF) Test and/or the KPSS Test. If non-stationary, we perform a transformation (log/Box-Cox) or differencing.\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eModel Identification: The Autocorrelation Function (ACF) and Partial Autocorrelation Function (PACF) plots are used to determine the order of p, d, and q. If a seasonal pattern is present, the SARIMA (Seasonal ARIMA) model is used.\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eModel Parameter Estimation: ARIMA/SARIMA parameters are estimated using the Maximum Likelihood Estimation (MLE) method.\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eModel Diagnostics: The Ljung Box test is used to ensure the residuals are white noise. The best model is selected based on the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC).\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eForecasting: The best model is used to predict Total Comprehensive Income (Loss) in the future period. Accuracy is evaluated using Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and Mean Absolute Percentage Error (MAPE).\u003c/p\u003e \u003c/li\u003e \u003c/ul\u003e \u003c/p\u003e \u003c/div\u003e"},{"header":"4. RESULTS AND DISCUSSION","content":"\u003cp\u003eBased on Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e, the company's monthly Total Comprehensive Income (Loss) during the period January 2017 to May 2025, which consists of 101 observation data, has a maximum value of 14,759,160.71 which occurred in December 2017, while the minimum value of -12,191,989.15 occurred in October 2018. Overall, the average Total Comprehensive Income (Loss) during the period was 1,481,665.00, which shows that in general the company recorded positive comprehensive income even though there were significant fluctuations in certain periods.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab1\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 1\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eFirst ten observations of total monthly Comprehensive Income (Loss)\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"2\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003eMonths\u003c/span\u003e\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003eTotal Comprehensive Income (Loss)\u003c/span\u003e\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e2017-01-01\u003c/span\u003e\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e1,474,870.8\u003c/span\u003e\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e2017-02-01\u003c/span\u003e\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e2,638,107.4\u003c/span\u003e\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e2017-03-01\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e5,308,832.8\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e2017-04-01\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e6,259,147.3\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e2017-05-01\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e8,074,418.8\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e2017-06-01\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e9,828,258.4\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e2017-07-01\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e10,828,261.0\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e2017-08-01\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e6,566,310.7\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e2017-09-01\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e7,458,188.1\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e2017-10-01\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e11,460,697.5\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eAccording to Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003e, the company's monthly Total Comprehensive Income (Loss) trend from January 2017 to May 2025 shows a fluctuating pattern with several significant changes. At the beginning of the period (2017), there was a sharp increase that peaked in December 2017, followed by a drastic decline until it reached its lowest point in October 2018. Afterward, the Total Comprehensive Income (Loss) fluctuated with several extreme spikes and declines, such as in early 2022. Entering the period from 2023 to 2025, the movement tended to be more stable with an upward trend in early 2025, indicating potential positive growth in the coming period.\u003c/p\u003e \u003cdiv id=\"Sec7\" class=\"Section2\"\u003e \u003ch2\u003e4.1 Time Series Decomposition\u003c/h2\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eThis figure shows the results of decomposing the Total Comprehensive Income (Loss) time series into four main components. The Observed component shows the original data observed from January 2017 to May 2025, while the Trend component reveals a long-term pattern with a sharp decline until 2019, then a plateau with a slight increase towards 2025. The Seasonal component displays a seasonal pattern that repeats annually with significant fluctuations, indicating a periodic cycle in the data. Meanwhile, the Residual component contains random variation that cannot be explained by either trends or seasonal patterns, likely influenced by external factors or unforeseen events. This decomposition is useful for separating the influence of trends and seasonality from random variation, thus facilitating the analysis and forecasting process.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec8\" class=\"Section2\"\u003e \u003ch2\u003e4.2 Split Data\u003c/h2\u003e \u003cp\u003eFigure \u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003e shows the division of the Total Comprehensive Income (Loss) data into two parts: training data and testing data in an 80:20 ratio. Training data is used to build predictive models, while test data is used to evaluate the model's performance on data it has never seen before.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eFrom this visualization, we can see that the training data (blue line) covers the period from January 2017 to approximately mid-2023, while the test data (red line) covers the remaining period until May 2025. This separation is important so the model can learn from historical patterns and test its ability to predict future trends without data leakage.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec9\" class=\"Section2\"\u003e \u003ch2\u003e4.3 Data Stationarity\u003c/h2\u003e \u003cp\u003eThe following figure presents plots of the Autocorrelation Function (ACF) and Partial Autocorrelation Function (PACF) on the training data set. These plots are used to observe the interrelationships between observations in a time series and help identify potential autoregressive (AR) and moving average (MA) components before building an ARIMA or SARIMA model.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eThe ACF and PACF plots in the figure are used to identify the possible orders of the autoregressive (AR) and moving average (MA) components in the time series model. Based on the results of the Augmented Dickey-Fuller (ADF) test, the ADF statistic is -3.2049 with a p-value of 0.0197, which is less than the 0.05 significance threshold. Furthermore, this value is also lower than the critical value at the 5% significance level (-2.8989), thus concluding that the data is stationary. In the next stage, seasonal differencing will be performed to eliminate any remaining seasonal patterns and ensure the data meets the stationary assumption more optimally. This stationary condition is important because it is one of the main prerequisites before building an ARIMA or SARIMA model. SARIMA Model Identification (p,d,q) (P,D,Q)^s\u003c/p\u003e \u003cp\u003eAfter seasonal differencing to eliminate annual recurring patterns, the SARIMA model identification process begins with an analysis of the ACF and PACF plots on the differencing data to estimate the initial values of the parameters p, q, P, Qp, q, P, Q. The auto-ARIMA method is also used to automatically suggest parameter combinations based on the information criterion (AIC/BIC). The results of these two approaches are then compared and tested through a model overfitting process, which involves trying various parameter combinations around candidate values to ensure the selected model strikes a balance between goodness of fit and simplicity. This process yields three best candidate SARIMA models, which will be further evaluated in depth during the final model selection stage.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab2\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 2\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eARIMA Model Evaluation Based on AIC, BIC, and HQIC\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"4\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eModel\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eAIC\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eBIC\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eHQIC\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eARIMA (1,0,0) (0,1,1) [12]\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e2265.71\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e2272.38\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e2268,36\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eARIMA (1,0,1) (0,1,1) [12]\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e2268.10\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e2276.98\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e2271,62\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eARIMA (2,0,0) (0,1,1) [12]\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e2268.73\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e2277.60\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e2272,24\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003eBased on the model selection results, three best SARIMA candidates were obtained after going through the initial identification process, auto-ARIMA, and overfitting. The three models are ARIMA (1,0,0) (0,1,1) [12], ARIMA (1,0,1) (0,1,1) [12], and ARIMA (2,0,0) (0,1,1) [12].\u003c/p\u003e \u003cp\u003eThe AIC, BIC and HQIC values for each model are relatively close, indicating that they have a similar level of model fit. The ARIMA (1,0,0)(0,1,1) [12] model has the lowest AIC and HQIC, making it slightly superior to the other candidates in terms of information criteria\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec10\" class=\"Section2\"\u003e \u003ch2\u003e4.4 Evaluation of the SARIMA model on the testing data\u003c/h2\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eThe SARIMA (1,0,0) (0,1,1) [12] model was used to forecast Total Comprehensive Income (Loss) data. In the graph, the training data is shown by the blue line, the test data by the green line, and the model's prediction results by the dashed red line. The model can be seen to follow the general direction of data movement, but tends to produce smoother predictions and is less able to capture sharp fluctuations in the test data.\u003c/p\u003e \u003cp\u003eBased on the performance evaluation, this model produced an RMSE of 3,858,420.47, an MSE of 1.4887\u0026times;10\u0026sup1;\u0026sup3;, and a MAPE of 85.00%. The relatively high MAPE value indicates that the relative error of the predictions to the actual data is still large, so the model's accuracy is not optimal even though seasonal patterns can be represented.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec11\" class=\"Section2\"\u003e \u003ch2\u003e4.5 LSTM Model Hyperparameters\u003c/h2\u003e \u003cp\u003eThe prediction process using the LSTM model begins with designing an architecture tailored to the characteristics of the time series data. The architecture used consists of one LSTM layer connected to a dense layer as the final output. Model training was conducted by testing 144 hyperparameter variations, including different numbers of neurons, epochs, learning rates, and the use of two optimization algorithms: Adam and RMSProp. All combinations were trained on the training data and validated using the validation data to achieve consistent performance.\u003c/p\u003e \u003cp\u003eThe batch size was selected by considering the proportion of training and test data, ensuring efficient and stable training. The best combination for each walk-forward validation scenario was selected based on the lowest average RMSE value obtained from all tested configurations.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab3\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 3\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eLSTM Model Hyperparameters\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"2\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"ItalicUnderline\" class=\"ItalicUnderline\" name=\"Emphasis\"\u003eHyperparameter\u003c/span\u003e\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003eValue\u003c/span\u003e\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003eOptimizer\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003eAdam, RMSProp, SGD\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003eNeuron\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e4, 8, 12, 16\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003eLearning Rate\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e0.001, 0.0001, 0.00001\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003eEpoch\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e10, 25, 40, 60\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003eLayer\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e1\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003eDropout\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e0.1\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003eThe best hyperparameters were selected using a manual grid search: all candidate combinations (number of neurons, number of epochs, learning rate, and optimizer type Adam, RMSprop, or SGD) were arranged as a grid. Each combination was then trained (with dropout and early stopping), evaluated on validation data, and its value recorded. The primary selection criterion was the average RMSE (in a walk-forward scenario, if applicable); the combination with the lowest RMSE was selected as the final configuration. This manual grid search approach ensured a transparent and thorough process, consistently testing every point in the hyperparameter space, allowing the best results (16 neurons, 60 epochs, lr\u0026thinsp;=\u0026thinsp;0.001, Adam optimizer) to be objectively identified.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec12\" class=\"Section2\"\u003e \u003ch2\u003e4.6 LSTM Model Testing with Best Hyperparameters\u003c/h2\u003e \u003cp\u003eThe Total Comprehensive Income (Loss) time series was divided into training (blue) and test (green) data. The data was prepared for the LSTM through normalization and sequential windowing (input\u0026ndash;target) to allow the model to learn temporal patterns. Experiments were performed with various combinations of hyperparameters (number of neurons, learning rate, Adam/RMSProp optimizer, epochs, and batch size) and evaluated on validation to select the most stable configuration.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eThe graph shows a comparison of the training data (blue), test data (green), and LSTM predictions (dotted orange). The model is able to follow the general level of the series during the test period, but tends to be unable to capture large spikes in the test data. Overall, the main pattern is captured, but the amplitude of the fluctuations remains muted; improvements could be directed towards adjusting the window length, adding more layers/neurons, tuning the learning rate, or adding exogenous features if available.\u003c/p\u003e \u003cp\u003eBased on the performance evaluation results, the LSTM model produced an RMSE of 3067615\u0026times;106, an MSE of 9410260\u0026times;10\u0026sup1;\u0026sup2;, and a MAPE of 165.29% on the test data. The relatively large RMSE and MSE values indicate a significant deviation from the actual predictions on an absolute scale. Meanwhile, a high MAPE indicates a large relative error percentage, so the model is unable to provide predictions with sufficient accuracy. This is consistent with the visualization results, where the model tends to produce predictions that are too smooth (underfitting) and is less able to capture spikes in values during certain periods.\u003c/p\u003e \u003c/div\u003e"},{"header":"5. CONCLUSION","content":"\u003cp\u003eSARIMA is capable of capturing annual seasonal patterns and general trends, but its predictions tend to be smooth, making them less responsive to sharp spikes or drops. The relatively high MAPE value indicates that the prediction accuracy is relatively suboptimal. LSTM is more flexible in capturing non-linear patterns, but on this data it tends to underfit, producing predictions that are too smooth and do not capture extreme fluctuations. The high MAPE value reinforces this indication. In general, SARIMA provides better results than LSTM on this dataset, especially in terms of representing seasonal patterns, although overall accuracy still needs improvement. Given that SARIMA outperformed LSTM in capturing seasonal patterns in this dataset, future studies should consider the following directions to enhance overall forecasting accuracy, Combining the strengths of both models through hybrid frameworks (e.g., SARIMA-LSTM or ARIMA-GRU) could leverage SARIMA's ability to handle seasonality and LSTM\u0026rsquo;s strength in modeling nonlinear patterns and long-term dependencies. Future research could apply more advanced hyperparameter tuning methods (e.g., grid search, Bayesian optimization) to optimize LSTM architecture (e.g., number of layers, neurons, learning rate), which may result in performance improvements.\u003c/p\u003e"},{"header":"Declarations","content":"\u003ch2\u003eACKNOWLEDGEMENTS\u003c/h2\u003e \u003cp\u003eThe author would like to express sincere gratitude to all parties who have provided support in the preparation of this article. Special thanks to my parents, colleagues and others who have offered constructive feedback, suggestions and encouragement.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003eBollerslev T (1986) Generalized autoregressive conditional heteroskedasticity. J Econ 31(3):307\u0026ndash;327. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/0304-4076(86)90063-1\u003c/span\u003e\u003cspan address=\"10.1016/0304-4076(86)90063-1\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBox GEP, Jenkins GM, Reinsel GC, Ljung GM (2015) Time series analysis: Forecasting and control (5th ed.). \u003cem\u003eWiley\u003c/em\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eEngle RF (2001) GARCH 101: The use of ARCH/GARCH models in applied econometrics. J Economic Perspect 15(4):157\u0026ndash;168. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1257/jep.15.4.157\u003c/span\u003e\u003cspan address=\"10.1257/jep.15.4.157\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHyndman RJ, Athanasopoulos G (2021) Forecasting: Principles and practice (3rd ed.). \u003cem\u003eOTexts\u003c/em\u003e. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://otexts.com/fpp3/\u003c/span\u003e\u003cspan address=\"https://otexts.com/fpp3/\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eNelson DMQ, Pereira ACM, de Oliveira RA (2017) Stock market's price movement prediction with LSTM neural networks, 2017 \u003cem\u003eInternational Joint Conference on Neural Networks (IJCNN)\u003c/em\u003e, Anchorage, AK, USA, pp. 1419\u0026ndash;1426. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1109/IJCNN.2017.7966019\u003c/span\u003e\u003cspan address=\"10.1109/IJCNN.2017.7966019\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSak H, Senior AW, Beaufays F (2014) Long short-term memory recurrent neural network architectures for large scale acoustic modeling. Interspeech\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSonkavde G, Dharrao DS, Bongale AM, Deokate ST, Doreswamy D, Bhat SK (2023) Forecasting Stock Market Prices Using Machine Learning and Deep Learning Models: A Systematic Review, Performance Analysis and Discussion of Implications. Int J Financial Stud 11(3):94. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.3390/ijfs11030094\u003c/span\u003e\u003cspan address=\"10.3390/ijfs11030094\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWang Y (2024) Predicting Mutual Fund Performance based on LSTM Models. Highlights Sci Eng Technol. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.54097/qezdm154\u003c/span\u003e\u003cspan address=\"10.54097/qezdm154\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eZou Z, Qu Z (2020) Using LSTM in Stock prediction and Quantitative Trading. CS230: Deep Learning, \u003cem\u003eWinter 2020\u003c/em\u003e. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://cs230.stanford.edu/projects_winter_2020/reports/32066186.pdf\u003c/span\u003e\u003cspan address=\"https://cs230.stanford.edu/projects_winter_2020/reports/32066186.pdf\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e accessed on 10 July 2025\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":true,"highlight":"","institution":"Department of Computer Science \u0026 Engineering, UIET Kathua","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":true,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"ARIMA, Forecast, Insurance, Indonesia, LSTM, Total Comprehensive Income","lastPublishedDoi":"10.21203/rs.3.rs-9180930/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-9180930/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eThis study compares the performance of the Seasonal Autoregressive Integrated Moving Average (SARIMA) and Long Short-Term Memory (LSTM) models in predicting time series data with seasonal patterns. Using historical data, namely total comprehensive income (loss) data for all insurance companies in Indonesia that contain strong seasonal components, the SARIMA model proved superior in capturing and representing seasonal patterns compared to LSTM. However, the evaluation results showed that the overall prediction accuracy of both models is still at a level that needs improvement. These findings suggest that although SARIMA is effective for seasonal data, the integration of statistical and deep learning-based models has the potential to improve prediction performance. Future research is recommended to explore hybrid approaches, hyperparameter optimization, and the use of external variables to improve the accuracy and generalization of predictive models.\u003c/p\u003e","manuscriptTitle":"Comparison of Sarima and Lstm Model to Forecast the Comprehensive Income (Loss) of Life Insurances in Indonesia","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-03-23 06:47:48","doi":"10.21203/rs.3.rs-9180930/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"eb2b1633-56fd-437f-98c5-ff64ffbcf286","owner":[],"postedDate":"March 23rd, 2026","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[],"tags":[],"updatedAt":"2026-03-23T06:47:48+00:00","versionOfRecord":[],"versionCreatedAt":"2026-03-23 06:47:48","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-9180930","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-9180930","identity":"rs-9180930","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.