AI-Based Personalised Portfolio Allocation Engine for Indian Retail Investors: A Multi-Algorithm Optimisation Approach with Explainable AI

doi:10.21203/rs.3.rs-9509094/v1

AI-Based Personalised Portfolio Allocation Engine for Indian Retail Investors: A Multi-Algorithm Optimisation Approach with Explainable AI

2026 · doi:10.21203/rs.3.rs-9509094/v1

preprint OA: closed CC-BY-4.0

🔓 Open OA copy Full text JSON View at publisher

Full text 95,020 characters · extracted from preprint-html · click to expand

AI-Based Personalised Portfolio Allocation Engine for Indian Retail Investors: A Multi-Algorithm Optimisation Approach with Explainable AI | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article AI-Based Personalised Portfolio Allocation Engine for Indian Retail Investors: A Multi-Algorithm Optimisation Approach with Explainable AI Tej Bachhav, Tejas Parekh, Kapil Rathor, Ami Munshi, Moumita Roy This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-9509094/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted 7 You are reading this latest preprint version Abstract India has seen a rapid rise in retail investment accounts — exceeding 180 million mutual fund folios and 150 million demat accounts by 2024 — has not been met with accessible, personalised investment advisory tools. This paper presents an AI-Based Personalised Portfolio Allocation Engine that closes this gap by combining rule-based financial logic with four machine learning optimisation methods: Mean-Variance Optimisation (MVO), Risk Parity, Maximum Sharpe Ratio, and the Black-Litterman model. The engine builds a five-parameter Composite Risk Score (CRS) to classify investors into Conservative, Moderate, or Aggressive profiles and uses a three-signal market regime detector employing moving average crossover, momentum (rate of change), and volatility (India VIX) indicators. Individual equities are scored on combined fundamental and technical criteria, while mutual funds are ranked via Jensen Alpha-based scoring across six subcategories. Portfolio weights are then fine-tuned using Monte Carlo simulation with 5,000 scenarios spanning 20-year wealth projection horizons. An Explainable AI (XAI) module converts all algorithmic decisions into plain-language investment rationales, and a tax-aware rebalancing engine employing FIFO lot tracking and tax-loss harvesting reduced simulated rebalancing tax liability by 39.8%. Across three representative investor profiles, the system generated annualised simulated returns of 8.2%, 12.6%, and 16.1% (Conservative, Moderate, Aggressive) with Sharpe ratios of 0.62, 0.89, and 1.04 respectively. The system is implemented entirely in Python 3.10+ using open-source libraries and tackles five specific gaps in the Indian retail robo-advisory landscape. Portfolio optimisation Robo-advisory Machine learning in finance Explainable AI Indian capital markets Composite risk scoring Black-Litterman model Tax-loss harvesting Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 1 Introduction The retail investment environment in India has undergone a remarkable transformation during the last ten years. The Association of Mutual Funds in India (AMFI) states that the number of mutual fund folios crossed one hundred and eighty million in 2024 and Securities and Exchange Board of India (SEBI) data shows that the number of demat accounts opened went beyond one hundred and fifty million (AMFI 2024 ). Though there is mass participation, the level of analytical sophistications that individual investors have remains very basic. The majority of retail investors are following general asset allocation rules of thumb, taking advice from a commission-based advisor or using basic online tools. None of these get to know the investor's unique risk profile, the market regime prevailing or the tax impact of portfolio activities. Markowitz ( 1952 ) via his mean-variance framework set up a mathematical foundation for creating efficient portfolios. The progress made after him-Bank-Litterman model (Black and Litterman 1992 ), Risk Parity (Maillard et al. 2010 ), and Monte Carlo simulation are regularly being used by institutional asset managers. However, due to their complexity and high cost, ordinary investors are not able to use them. Machine learning (ML) along with financial data APIs that are open to public are playing an important role in continuing to break down this barrier (Heaton et al. 2017 ; Jiang et al. 2017 ; Lopez de Prado 2018 ). Indian fintech companies like Scrip box, ET Money, and Zerodha Coin have made some steps in automating the mutual fund selection process but none of them offers a fully integrated, multi-asset, data-driven advisory service for equity selection, fund scoring, portfolio optimization, regime adaptation, tax-aware rebalancing, and explainability. Our system aims at addressing such a need. This paper is organised as follows. Section 2 surveys related work. Section 3 describes the system architecture. Section 4 details the key algorithms. Section 5 presents experimental results. Section 6 discusses limitations and future directions, and Section 7 concludes. 2 Related Work 2.1 Portfolio Optimisation Theory Markowitz ( 1952 ) introduced the efficient frontier concept, which comprises portfolios achieving the highest expected return for a given level of risk (variance). This work laid the foundation for what we now refer to as modern portfolio theory (MPT). On the other hand, the notorious sensitivity of the framework to the errors in input estimations prompted the development of Black and Litterman ( 1992 ), which applies a Bayesian method to tie portfolio weights to CAPM equilibrium returns while giving room to investor views. It also generates allocations that seem to be more stable and more intuitive. Maillard et al. ( 2010 ) gave a theoretical basis to Risk Parity, proving that Equal Risk Contribution (ERC) portfolios mitigate drawdown during equity bear markets as a result of evenly distributing risk among asset classes. Ledoit and Wolf ( 2004 ) came up with the shrinkage estimator for covariance matrices in large dimensions, drastically lowering estimation noise. These methods are all implemented in the current system. 2.2 Machine Learning in Finance Heaton et al. ( 2017 ) utilized deep autoencoders for discovering the hidden latent factor structures in the financial time series, achieving better results in portfolio construction than linear factor models. Jiang et al. ( 2017 ) showed that deep reinforcement learning agents are able to learn dynamic portfolio rebalancing policies by interacting with a simulated cryptocurrency market and this work leaves a great scope for the further continuation of our research in line with their findings. Lopez de Prado ( 2018 ) presents a comprehensive overview of ML applications in finance and introduces the concept of meta-labelling as a method of reducing look-ahead bias - an issue that is tackled in our walk-forward evaluation methodology. 2.3 Behavioural Finance and Risk Profiling Kahneman and Tversky ( 1979 ) laid the foundation for prospect theory, supporting the fact that investors give the weight of loss more than double compared to the weight they give to gains of the same amount. Shefrin and Statman ( 2000 ) discovered behavioural portfolio theory which became a foundation in the field of investor psychology, and showed the non-professional investors do not think of their portfolio as one mean-variance optimal portfolio but as different mental accounts with different risk tolerances which in return affect the type of portfolios that they assemble. These results are the basis of our risk questionnaire and the communication strategy of our XAI module, which presents the recommendations in terms of clearly defined downside scenarios. 2.4 Robo-Advisory Platforms Fein ( 2015 ) and Sironi ( 2016 ) have done thorough surveys of US first-generation robo-advisors (Betterment, Wealth front), and amongst other things, they have found that these passive ETF-based strategies are devoid of active instrument selection. Indian-centric studies (Pandey 2020 ; NSE Research 2021 ; AMFI 2024 ; SEBI 2020 ) discuss that the peculiarities of Indian tax treatment (LTCG/STCG), market microstructure, and mutual funds as retail investment vehicles are predominant in India and therefore, the Western platforms are missing the India specific adaptations. Our platform attempts to fill five of the main gaps that have been identified in this literature - an India-market orientation, active instrument selection, market regime adaptation, tax-aware rebalancing, and explainability. 3 System Architecture The engine is implemented in Python 3.10 + as eleven loosely coupled modules that exchange data through a shared contract of pandas Data Frames and typed Python data classes. The pipeline is structured as a directed acyclic graph: User Profiler → [Market Regime | Equity Health] → Allocation Engine → [Stock Selector | Fund Selector | REIT Selector] → ML Optimiser → Rebalancing Engine → [Portfolio Manager | XAI | Dashboard] This modular layout keeps each component focused, makes unit testing straightforward, and lets developers swap out individual algorithms without breaking the rest of the system. Module descriptions are provided in Table 1 . Table 1 Module descriptions Module File Responsibility User Profiler user_profiler.py Investor questionnaire; CRS computation Market Regime market_regime.py BULL/BEAR/SIDEWAYS regime detection Equity Health equity_health.py Fundamental + technical stock scoring Allocation Engine allocation_engine_v2.py Target allocation percentages per asset class Stock Selector stock_selector_v3.py Multi-factor Indian equity ranking Fund Selector fund_selector_v3.py Jensen Alpha mutual fund scoring REIT/InvIT Selector reit_invit_selector.py Real-estate instrument screening ML Optimiser ml_optimizer.py MVO, Risk Parity, Max Sharpe, Black-Litterman Rebalancing Engine rebalancing.py Drift detection; FIFO tax-lot optimisation Portfolio Manager portfolio_manager.py Holdings tracking; performance metrics XAI / Dashboard explainable_ai.py html_dashboard.py Plain-language narratives; interactive charts 4 Key Algorithms 4.1 Composite Risk Scoring The Composite Risk Score (CRS) is a weighted linear combination of four normalised sub-scores computed from investor profile inputs: CRS = 0.25 × Age Score + 0.20 × Income Score + 0.30 × Risk Tolerance Score + 0.25 × Horizon Score Older investors get lower age scores, dropping gradually as they grow older. Scores for income are trimmed by monthly payments and financial responsibilities. Risk tolerance is built from a 1-to-10 self-rating, stretched out to 0–100. Horizon score climbs with how long someone plans to invest. CRS ranges and matching approaches appear in Table 2 - realistically, and the numbers don't always align perfectly. Table 2 CRS bands and corresponding strategy profiles CRS Range Strategy Equity Allocation Debt Allocation 0–40 Conservative 20% – 35% 50% – 65% 41–65 Moderate 45% – 60% 25% – 40% 66–100 Aggressive 65% – 80% 10% – 25% 4.2 Market Regime Detection The regime detection tool interprets the movement of Nifty 50 index by combining three separate binary signals. (i) Trend Signal: 50 day moving average is compared with 200 day moving average; a Golden Cross signals a bullish market whereas a Death Cross signals a bearish market. (ii) Momentum Signal: 20-day rate of change (ROC); ROC > 3% is bullish, ROC < 3% is bearish. (iii) Volatility Signal: India VIX; a reading below 15 is bullish, a reading above 25 is bearish. The regime is decided by a weighted majority vote among the three signals. CRS Equity allocation will be increased by + 10 percentage points under a BULL regime, decreased by 15 pp under a BEAR regime, and remained unchanged under a SIDEWAYS regime with reference to the CRS-derived baseline. This kind of dynamic adjustment is a distinguishing factor of the system compared to fixed robo-advisory platforms. 4.3 Equity Health Scoring Each potential stock is given a composite health score, which is equally split between fundamentals (50%) and technicals (50%) sub-scores. Among the fundamental factors, the ones most aligned with Indian market scenarios as per Pandey ( 2020 ) and NSE Research ( 2021 ) are: Price-to-Earnings (P/E), Price-to-Book (P/B), Return on Capital Employed (ROCE), Return on Equity (ROE), and 3-year EPS growth CAGR. Technical factors, in this case, are Relative Strength Index (RSI, preferred range 45–70), MACD signal cross over direction, price relative to 50-DMA and 200-DMA, and Average True Range (ATR) normalized volatility. The final stock health score affects the allocation engine results: a low overall health score means that the stock allocation will be less than the regime-adjusted baseline. 4.4 Portfolio Optimisation 4.4.1 Mean-Variance Optimisation This approach was used for the Aggressive investor profiles. The optimiser reduces portfolio variance to the lowest level (written mathematically as Var_p = w'Cw, where w stands for the vector of weights and C for the covariance matrix) while keeping the constraints w'mu = mu_target, sum(w_i) = 1, and w_i > = 0 for all i. The covariance matrix C is computed by the Ledoit-Wolf shrinkage estimator (Ledoit and Wolf 2004 ) used on three years of daily return data, significantly lowering the estimation error compared to the sample covariance matrix. 4.4.2 Risk Parity Applied to Conservative profiles only. The optimiser finds a weight vector w such that the risk contribution of each asset to the overall portfolio is the same. Technically, the Equal Risk Contribution (ERC) condition states that w_i times (Cw)_i divided by (w'Cw) equals 1/n should hold for all assets i, where C is the covariance matrix. This is tackled numerically with scipy. optimize. minimize that employs a squared-deviation objective. ERC portfolios, according to research (Maillard et al. 2010 ), tend to have smaller drawdowns during equity bear markets hence they are very suitable for capital-preservation profiles. 4.4.3 Maximum Sharpe Ratio This passage relates to Moderate investor profiles. The optimizer seeks to maximize the Sharpe ratio, which is given by the equation S = (Rp - Rf) / Sp, where Rp is the expected return of the portfolio, Rf = 6. 5% (the approximate yield on 10-year Indian Government Securities), and Sp is the portfolio volatility. In a geometrical sense, it finds the tangency portfolio on the efficient frontier, i.e. the portfolio that lies on the Capital Market Line. 4.4.4 Black-Litterman Model Intended for experienced investors who have very specific market views. The model uses a Bayesian update method to blend CAPM equilibrium returns with investor-specified views. Posterior expected returns (mu_BL) are obtained by merging the prior equilibrium return vector (Pi, calculated from market-cap weights) with the set of investor view vectors (P) and their expected outperformance values (Q). These are weighted by the view-uncertainty matrix (Omega) and scaling factor tau being applied to the covariance matrix. The posterior returns obtained in this manner are subsequently used as inputs to the MVO solver, thereby generating allocations that are significantly more stable and intuitive as compared to those from unconstrained MVO (Black and Litterman 1992 ). 4.5 Fund Selection via Jensen Alpha Scoring Mutual funds receive their evaluation through a 100-point system. The Alpha Score (35 points) gives 35 points for Jensen Alpha > 3%, 30 points for 1. 53%, 20 points for any positive alpha, and 0 for negative alpha. Beta Score (25 points) grants 25 points for beta < = 0. 7, 18 points for beta 1. 2. Downside Protection Score (20 points) is the result of a weighted combination of the Sortino ratio (Sortino and Price 1994 ) and the maximum drawdown. A Consistency Bonus (10 points) goes to the funds on the AMFI whitelist of persistent outperformers. Sharpe Score (10 points) is a measure of risk-adjusted return compared to category peers. From each subcategory (Large Cap, Flexi Cap, Mid Cap, Small Cap ELSS Balanced Advantage, International), top two or three funds are recommended with proportionate allocations. 4.6 Tax-Aware Rebalancing Four modes are available for the rebalancing engine: Threshold-based (5% drift from target allocation as default trigger), Time-based (e. g. monthly, quarterly, or annual fixed intervals), Calendar-based (e. g. start of each financial quarter), and Adaptive (the threshold tightens during low-VIX environments and relaxes during high-VIX periods to reduce unnecessary trading costs). Tax lots are recorded by FIFO accounting, Indian capital-gains law requires. Before producing sell orders the engine locates unrealised-loss positions eligible for tax-loss harvesting (TLH), thus capital losses are netted against realised gains. Tax is calculated as STCG (held < 12 months) at 15% and LTCG (held 12 months, gains exceeding 1 lakh) at 10%, in line with post-Union Budget 2024 rates (SEBI 2013 ). 5 Experimental Results 5.1 Simulated Investor Profiles We built three representative investor profiles that span the full CRS range to test how the system performs. All simulated returns draw on three years of historical data for the selected instruments, as of March 2026; Sharpe ratios are computed at r_f = 6.5%. Results are reported in Table 3 . Table 3 Simulated portfolio outputs for representative investor profiles Parameter Conservative (25F) Moderate (35M) Aggressive (28M) CRS 32 58 75 Market Regime SIDEWAYS BULL BULL Large Cap Equity 18% 28% 38% Mid/Small Cap Equity 5% 16% 28% Debt 55% 32% 14% Gold / Silver 12% 8% 6% REIT / InvIT 5% 8% 6% International Equity 5% 8% 8% Stocks recommended 3 8 12 Funds recommended 4 8 10 Sharpe ratio (simulated) 0.62 0.89 1.04 Ann. return (simulated) 8.2% 12.6% 16.1% The Conservative profile’s heavy debt allocation (55%) under a SIDEWAYS regime reflects appropriate capital-preservation positioning. The Aggressive profile’s 66% combined equity allocation under a BULL regime is consistent with classical aggressive-growth strategy. Simulated Sharpe ratios are consistent with published benchmarks for equivalent strategy types in the Indian mutual fund industry. 5.2 Portfolio Optimisation Results The Moderate investor’s portfolio was processed through the Maximum Sharpe Ratio optimiser. The recommended portfolio lies at the tangency point between the efficient frontier and the Capital Market Line. Monte Carlo simulation (5,000 paths; 5-year calibration window) with an initial lump sum of ₹5,00,000 and a monthly SIP of ₹25,000 produced the wealth projections reported in Table 4 . Table 4 Monte Carlo wealth projections — Moderate investor (₹5 L lump sum + ₹25,000 monthly SIP) Horizon 10th Percentile 50th Percentile (Median) 90th Percentile 1 Year ₹5.8 L ₹7.1 L ₹8.5 L 3 Years ₹12.4 L ₹17.2 L ₹23.8 L 5 Years ₹21.6 L ₹33.5 L ₹49.2 L 10 Years ₹52.3 L ₹98.7 L ₹1.72 Cr 20 Years ₹1.24 Cr ₹3.86 Cr ₹9.41 Cr The large gap between the 10th and 90th percentile wealth outcomes at the 20-year mark (₹1.24 Cr vs ₹9.41 Cr) shows how return uncertainty compounds and grows substantially over long periods. The XAI module makes this clear to users in plain language, alongside the median projection, helping investors form realistic expectations. 5.3 Rebalancing and Tax Efficiency A simulation was conducted in which the Moderate investor’s equity allocation drifted from 44% to 52% during a bull-market period. The 5% drift threshold triggered rebalancing when equity reached 49%. The engine identified three equity positions for partial disposal, generating ₹12,400 in short-term capital gains (STCG, held < 12 months) and ₹34,600 in long-term capital gains (LTCG). One position carrying a ₹6,800 unrealised loss was identified for tax-loss harvesting (TLH). Results are presented in Table 5 . Table 5 Rebalancing tax impact analysis Metric Without Tax Optimisation With Tax Optimisation (TLH) Short-term capital gains ₹12,400 ₹5,600 (after TLH) Long-term capital gains ₹34,600 ₹34,600 Total tax liability ₹11,940 ₹7,180 Tax saving — ₹4,760 (39.8%) Transaction costs ₹890 ₹1,120 (one extra trade) Net benefit of TLH — ₹3,640 The 39.8% reduction in tax liability demonstrates the material financial benefit of TLH-aware rebalancing. Results were cross-validated against manual calculations for five test portfolios, with numerical agreement to within 0.01% (consistent with IEEE Std 829–2008 for software testing documentation). 6 Discussion 6.1 Advantages The system has a range of advantages, including the ability to deal with a very broad selection of different types of investment assets (six asset classes that are not very highly correlated with each other); use of a regime-switching model to dynamically adapt investment allocations, which is not a feature of most 'static' retail advisory tools; tax-loss harvesting which can be quantified and result in meaningful tax savings; use of 'explainable AI' to provide investors with reasoning in an easy and comprehensible manner and therefore the trust in investment decisions can be achieved, which is the point of the explainability that has been questioned by Fein ( 2015 ); as well as the whole implementation being done by means of completely open-source Python libraries which, on the one hand, keep the system very inexpensive and, on the other hand, also allow for the system to be checked (audited) by any interested party. 6.2 Limitations However, there are some important downsides to be aware of. First, reliance on public data APIs (yfinance, nselib) can lead to errors in data quality and timeliness, especially for small-cap stocks and some mutual fund data points. Second, estimating returns based on historical data will obviously not work where there have been major changes in the economic or structural environment that are outside the historical data used for calibration. Third, while the system offers suggestions for investments it does not actually perform trades and would need to be connected with a brokerage API to be able to do trades. Fourth, the system does not take into consideration behavioural tendencies such as panic selling or recency bias which may cause investor actions in the real world to differ significantly from what the model predicts. Lastly, this platform is not registered as a SEBI Investment Adviser under SEBI ( 2013 ) regulations; all outputs are for education only and one should obtain a qualified professional's review before making decisions based on them. 6.3 Future Work Majorly focus on future development directions should be on: (i) real-time brokerage API integration (Zerodha Kite Groww Upstox) for single-click trade execution; (ii) LSTM and transformer-based return forecasting to replace historical mean estimates; (iii) reinforcement learning agents for dynamic rebalancing policy optimisation, extending the approach of Jiang et al. ( 2017 ); (iv) a goal-based investing module supporting multi-bucket portfolios for retirement, education, and housing; (v) NLP-driven news-sentiment integration for real-time expected return adjustment; and (vi) a SEBI regulatory compliance module enabling formal advisory registration. 7 Conclusion Recently, we have developed an AI-powered Personalised Portfolio Allocation Engine that integrates five-parameter risk profiling, three-signal market regime detection, multi-factor equity and fund scoring, four ML portfolio optimisation algorithms, and tax-aware rebalancing into one unified platform for Indian retail investors. Across three investor profiles, simulation results showed Sharpe ratios of 0. 62 to 1. 04 and annualised simulated returns of 8. 2% to 16. 1%, which is consistent with financial theory predictions for each risk level. By using systematic TLH, our tax-aware rebalancing engine was able to reduce the rebalancing tax liability by 39. 8%. Moreover, the easy-to-understand XAI narratives help explain the transparency gap that is widely discussed in the robo-advisory literature. Our solution is filling five major research gaps - focus on the Indian market, active selection of instruments, adaptation to market regimes, tax-aware rebalancing, and explanation - and thereby making institutional-quality portfolio management significantly more accessible to the everyday Indian investor. Declarations Funding No funding was received to assist with the preparation of this manuscript. Conflict of interest The authors have no relevant financial or non-financial interests to disclose. Ethics approval and consent to participate Not applicable. Data availability Market data are sourced from publicly available APIs (Yahoo Finance via yfinance v1.0; NSElib v2.4.2). Mutual fund data are derived from AMFI public disclosures. No proprietary datasets were used. Code is available from the corresponding author on reasonable request. Code availability Source code is available in the project repository. Key dependencies are listed in the Appendix. Author contribution Tej Bachhav: Conceptualisation, system architecture, ML optimisation module, rebalancing engine, writing — original draft. Tejas Parekh: Equity and fund selection modules, XAI module, HTML dashboard, writing — review and editing. Kapil Rathor: Supervision, Validation, Project administration. Ami Munshi: Supervision, Validation, Project administration, writing — review and editing. Moumita Roy: Supervision, Validation, Project administration, writing — review and editing. All authors have read and approved the final manuscript References AMFI (2024) Industry Data and Analytics Reports 2023–2024. Association of Mutual Funds in India. https://www.amfiindia.com Black F, Litterman R (1992) Global portfolio optimization. Financ Anal J 48(5):28–43 Fama EF (1970) Efficient capital markets: a review of theory and empirical work. J Finance 25(2):383–417 Fein ML (2015) Robo-advisers: a closer look. SSRN Working Paper 2658701. https://ssrn.com/abstract=2658701 Heaton JB, Polson NG, Witte JH (2017) Deep learning for finance: deep portfolios. Appl Stoch Model Bus Ind 33(1):3–12. https://doi.org/10.1002/asmb.2209 Jiang Z, Xu D, Liang J (2017) A deep reinforcement learning framework for the financial portfolio management problem. arXiv preprint arXiv:1706.10059 Kahneman D, Tversky A (1979) Prospect theory: an analysis of decision under risk. Econometrica 47(2):263–291 Ledoit O, Wolf M (2004) A well-conditioned estimator for large-dimensional covariance matrices. J Multivar Anal 88(2):365–411. https://doi.org/10.1016/S0047-259X(03)00096-4 Lopez de Prado M (2018) Advances in Financial Machine Learning. Wiley, Hoboken NJ Maillard S, Roncalli T, Teiletche J (2010) The properties of equally weighted risk contribution portfolios. J Portf Manag 36(4):60–70 Markowitz H (1952) Portfolio selection. J Finance 7(1):77–91. https://doi.org/10.2307/2975974 NSE Research (2021) Factor Investing in Indian Equities. National Stock Exchange of India, Mumbai Pandey IM (2020) Financial Management, 12th edn. Vikas Publishing House, New Delhi SEBI (2013) SEBI (Investment Advisers) Regulations. Securities and Exchange Board of India, Mumbai SEBI (2020) Circular on Portfolio Management Services. Securities and Exchange Board of India, Mumbai Sharpe WF (1964) Capital asset prices: a theory of market equilibrium under conditions of risk. J Finance 19(3):425–442 Shefrin H, Statman M (2000) Behavioral portfolio theory. J Financ Quant Anal 35(2):127–151 Sironi P (2016) FinTech Innovation: From Robo-Advisors to Goal Based Investing and Gamification. Wiley, Chichester Sortino FA, Price LN (1994) Performance measurement in a downside risk framework. J Investing 3(3):59–64 yfinance (2024) Yahoo Finance market data downloader, v1.0. https://pypi.org/project/yfinance/ Additional Declarations No competing interests reported. Supplementary Files AppendixAThird.docx Cite Share Download PDF Status: Under Review Version 1 posted Reviewers agreed at journal 06 May, 2026 Reviews received at journal 06 May, 2026 Reviewers agreed at journal 06 May, 2026 Reviewers invited by journal 04 May, 2026 Editor assigned by journal 01 May, 2026 Submission checks completed at journal 29 Apr, 2026 First submitted to journal 23 Apr, 2026 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-9509094","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":635736343,"identity":"98de6132-ff7b-4a80-99dc-0b5f35e3cd96","order_by":0,"name":"Tej Bachhav","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA5ElEQVRIie2PsQrCMBBAI4V0iWS9UvEbCoVWUfBXBCFTBD9AxKmTdM5nODkHgplaXLsXOoiDIDiKKQ5uNW6CecPdcdzj7hByOH6QCCHsvUolTYCBtQII6XmbyTcKidrGZyX1VVOv1mpDe8XtXK1HBPnquO9SxjuWxkIrCLblYcK1OYwwVnUeJjkOCWYQyfIQc2wUIEm3croY5cFgJosm5g8bpTJb+tkUIlR49TKzUMaiScJ+Pg2E1Im3zIHgT7+kdNGE5A6UClXf+H0zpL7SncobkBjajO3GW+jWu9pPOxwOxz/xBOLWRFvpyRXmAAAAAElFTkSuQmCC","orcid":"","institution":"Narsee Monjee Institute of Management Studies","correspondingAuthor":true,"prefix":"","firstName":"Tej","middleName":"","lastName":"Bachhav","suffix":""},{"id":635736344,"identity":"9ac6b975-7446-42d8-a289-56b0717e86cb","order_by":1,"name":"Tejas Parekh","email":"","orcid":"","institution":"Narsee Monjee Institute of Management Studies","correspondingAuthor":false,"prefix":"","firstName":"Tejas","middleName":"","lastName":"Parekh","suffix":""},{"id":635736345,"identity":"81b8cbdb-3068-4636-a98e-777a2c994643","order_by":2,"name":"Kapil Rathor","email":"","orcid":"","institution":"Narsee Monjee Institute of Management Studies","correspondingAuthor":false,"prefix":"","firstName":"Kapil","middleName":"","lastName":"Rathor","suffix":""},{"id":635736346,"identity":"bc373d05-992e-4354-8925-222165ad82cb","order_by":3,"name":"Ami Munshi","email":"","orcid":"","institution":"Narsee Monjee Institute of Management Studies","correspondingAuthor":false,"prefix":"","firstName":"Ami","middleName":"","lastName":"Munshi","suffix":""},{"id":635736347,"identity":"004d9c45-0986-41df-8a4d-7fc0422b06fd","order_by":4,"name":"Moumita Roy","email":"","orcid":"","institution":"Investorie","correspondingAuthor":false,"prefix":"","firstName":"Moumita","middleName":"","lastName":"Roy","suffix":""}],"badges":[],"createdAt":"2026-04-23 16:38:30","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-9509094/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-9509094/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":109121869,"identity":"79931c47-eabd-44fb-a311-9130b696ce46","added_by":"auto","created_at":"2026-05-12 17:31:34","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":135756,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cem\u003eSystem architecture of the AI Portfolio Allocation Engine, showing directed acyclic graph pipeline from User Profiler through to XAI and Dashboard.\u003c/em\u003e\u003c/p\u003e","description":"","filename":"Figure1SystemArchitecture.png","url":"https://assets-eu.researchsquare.com/files/rs-9509094/v1/f7f2609e629cbf8fc938fdad.png"},{"id":109121874,"identity":"fa692aca-88d6-4881-a005-3ea28e85d443","added_by":"auto","created_at":"2026-05-12 17:31:35","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":225161,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cem\u003eLeft: Composite Risk Score (CRS) computation pipeline showing four weighted sub-scores and output investor categories. Right: Three-signal market regime detection logic and equity allocation adjustments.\u003c/em\u003e\u003c/p\u003e","description":"","filename":"Figure2CRSandRegimeDetection.png","url":"https://assets-eu.researchsquare.com/files/rs-9509094/v1/ecc679ab48045935a3b4bc2c.png"},{"id":109121876,"identity":"419e29cf-da15-46e2-b250-4b38101a48bc","added_by":"auto","created_at":"2026-05-12 17:31:35","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":223211,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cem\u003eSimulated portfolio asset allocation across three representative investor profiles: Conservative (25F, CRS=32), Moderate (35M, CRS=58), and Aggressive (28M, CRS=75).\u003c/em\u003e\u003c/p\u003e","description":"","filename":"Figure3PortfolioAllocation.png","url":"https://assets-eu.researchsquare.com/files/rs-9509094/v1/cbe1bf3893c63e0daa4c27cc.png"},{"id":109121871,"identity":"faa3b556-e2de-4b88-9086-5c9d6b37db06","added_by":"auto","created_at":"2026-05-12 17:31:34","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":308061,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cem\u003eLeft: Efficient frontier with portfolio positions for three investor profiles and Capital Market Line. Right: Monte Carlo wealth projection for Moderate investor (5,000 paths; ₹5L lump sum + ₹25,000/month SIP).\u003c/em\u003e\u003c/p\u003e","description":"","filename":"Figure4EfficientFrontierandMonteCarlo.png","url":"https://assets-eu.researchsquare.com/files/rs-9509094/v1/babce3d7589029dc52d5448e.png"},{"id":109121870,"identity":"d72f337f-4e7c-4156-a320-9c3a015af312","added_by":"auto","created_at":"2026-05-12 17:31:34","extension":"png","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":223035,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cem\u003eLeft: Jensen Alpha-based mutual fund scoring framework across six fund categories. Right: Tax impact comparison of rebalancing with and without tax-loss harvesting (TLH).\u003c/em\u003e\u003c/p\u003e","description":"","filename":"Figure5FundScoringandTaxImpact.png","url":"https://assets-eu.researchsquare.com/files/rs-9509094/v1/c9a104b47c9a5557e32c3356.png"},{"id":109207250,"identity":"418b8ce0-1134-45d9-8344-fc6dc0403f73","added_by":"auto","created_at":"2026-05-13 15:18:48","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":1179806,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-9509094/v1/c3ef7e66-8d8d-41e2-9035-ca368c8096e1.pdf"},{"id":109204671,"identity":"d7557a65-7a69-4299-9437-01bd4e984ed7","added_by":"auto","created_at":"2026-05-13 15:01:47","extension":"docx","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":15900,"visible":true,"origin":"","legend":"","description":"","filename":"AppendixAThird.docx","url":"https://assets-eu.researchsquare.com/files/rs-9509094/v1/4ecdd773f65bf2f7c2808430.docx"}],"financialInterests":"No competing interests reported.","formattedTitle":"AI-Based Personalised Portfolio Allocation Engine for Indian Retail Investors: A Multi-Algorithm Optimisation Approach with Explainable AI","fulltext":[{"header":"1 Introduction","content":"\u003cp\u003eThe retail investment environment in India has undergone a remarkable transformation during the last ten years. The Association of Mutual Funds in India (AMFI) states that the number of mutual fund folios crossed one hundred and eighty million in 2024 and Securities and Exchange Board of India (SEBI) data shows that the number of demat accounts opened went beyond one hundred and fifty million (AMFI \u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e2024\u003c/span\u003e). Though there is mass participation, the level of analytical sophistications that individual investors have remains very basic. The majority of retail investors are following general asset allocation rules of thumb, taking advice from a commission-based advisor or using basic online tools. None of these get to know the investor's unique risk profile, the market regime prevailing or the tax impact of portfolio activities. Markowitz (\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e1952\u003c/span\u003e) via his mean-variance framework set up a mathematical foundation for creating efficient portfolios.\u003c/p\u003e \u003cp\u003eThe progress made after him-Bank-Litterman model (Black and Litterman \u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e1992\u003c/span\u003e), Risk Parity (Maillard et al. \u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e2010\u003c/span\u003e), and Monte Carlo simulation are regularly being used by institutional asset managers. However, due to their complexity and high cost, ordinary investors are not able to use them. Machine learning (ML) along with financial data APIs that are open to public are playing an important role in continuing to break down this barrier (Heaton et al. \u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e2017\u003c/span\u003e; Jiang et al. \u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e2017\u003c/span\u003e; Lopez de Prado \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e2018\u003c/span\u003e). Indian fintech companies like Scrip box, ET Money, and Zerodha Coin have made some steps in automating the mutual fund selection process but none of them offers a fully integrated, multi-asset, data-driven advisory service for equity selection, fund scoring, portfolio optimization, regime adaptation, tax-aware rebalancing, and explainability. Our system aims at addressing such a need.\u003c/p\u003e \u003cp\u003eThis paper is organised as follows. Section \u003cspan refid=\"Sec2\" class=\"InternalRef\"\u003e2\u003c/span\u003e surveys related work. Section \u003cspan refid=\"Sec7\" class=\"InternalRef\"\u003e3\u003c/span\u003e describes the system architecture. Section \u003cspan refid=\"Sec8\" class=\"InternalRef\"\u003e4\u003c/span\u003e details the key algorithms. Section \u003cspan refid=\"Sec19\" class=\"InternalRef\"\u003e5\u003c/span\u003e presents experimental results. Section \u003cspan refid=\"Sec23\" class=\"InternalRef\"\u003e6\u003c/span\u003e discusses limitations and future directions, and Section \u003cspan refid=\"Sec27\" class=\"InternalRef\"\u003e7\u003c/span\u003e concludes.\u003c/p\u003e"},{"header":"2 Related Work","content":"\u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003e2.1 Portfolio Optimisation Theory\u003c/h2\u003e \u003cp\u003eMarkowitz (\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e1952\u003c/span\u003e) introduced the efficient frontier concept, which comprises portfolios achieving the highest expected return for a given level of risk (variance). This work laid the foundation for what we now refer to as modern portfolio theory (MPT). On the other hand, the notorious sensitivity of the framework to the errors in input estimations prompted the development of Black and Litterman (\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e1992\u003c/span\u003e), which applies a Bayesian method to tie portfolio weights to CAPM equilibrium returns while giving room to investor views. It also generates allocations that seem to be more stable and more intuitive. Maillard et al. (\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e2010\u003c/span\u003e) gave a theoretical basis to Risk Parity, proving that Equal Risk Contribution (ERC) portfolios mitigate drawdown during equity bear markets as a result of evenly distributing risk among asset classes. Ledoit and Wolf (\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e2004\u003c/span\u003e) came up with the shrinkage estimator for covariance matrices in large dimensions, drastically lowering estimation noise. These methods are all implemented in the current system.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec4\" class=\"Section2\"\u003e \u003ch2\u003e2.2 Machine Learning in Finance\u003c/h2\u003e \u003cp\u003eHeaton et al. (\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e2017\u003c/span\u003e) utilized deep autoencoders for discovering the hidden latent factor structures in the financial time series, achieving better results in portfolio construction than linear factor models. Jiang et al. (\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e2017\u003c/span\u003e) showed that deep reinforcement learning agents are able to learn dynamic portfolio rebalancing policies by interacting with a simulated cryptocurrency market and this work leaves a great scope for the further continuation of our research in line with their findings. Lopez de Prado (\u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e2018\u003c/span\u003e) presents a comprehensive overview of ML applications in finance and introduces the concept of meta-labelling as a method of reducing look-ahead bias - an issue that is tackled in our walk-forward evaluation methodology.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec5\" class=\"Section2\"\u003e \u003ch2\u003e2.3 Behavioural Finance and Risk Profiling\u003c/h2\u003e \u003cp\u003eKahneman and Tversky (\u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e1979\u003c/span\u003e) laid the foundation for prospect theory, supporting the fact that investors give the weight of loss more than double compared to the weight they give to gains of the same amount. Shefrin and Statman (\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e2000\u003c/span\u003e) discovered behavioural portfolio theory which became a foundation in the field of investor psychology, and showed the non-professional investors do not think of their portfolio as one mean-variance optimal portfolio but as different mental accounts with different risk tolerances which in return affect the type of portfolios that they assemble. These results are the basis of our risk questionnaire and the communication strategy of our XAI module, which presents the recommendations in terms of clearly defined downside scenarios.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec6\" class=\"Section2\"\u003e \u003ch2\u003e2.4 Robo-Advisory Platforms\u003c/h2\u003e \u003cp\u003eFein (\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e2015\u003c/span\u003e) and Sironi (\u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e2016\u003c/span\u003e) have done thorough surveys of US first-generation robo-advisors (Betterment, Wealth front), and amongst other things, they have found that these passive ETF-based strategies are devoid of active instrument selection. Indian-centric studies (Pandey \u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e2020\u003c/span\u003e; NSE Research \u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e2021\u003c/span\u003e; AMFI \u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e2024\u003c/span\u003e; SEBI \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e2020\u003c/span\u003e) discuss that the peculiarities of Indian tax treatment (LTCG/STCG), market microstructure, and mutual funds as retail investment vehicles are predominant in India and therefore, the Western platforms are missing the India specific adaptations. Our platform attempts to fill five of the main gaps that have been identified in this literature - an India-market orientation, active instrument selection, market regime adaptation, tax-aware rebalancing, and explainability.\u003c/p\u003e \u003c/div\u003e"},{"header":"3 System Architecture","content":"\u003cp\u003eThe engine is implemented in Python 3.10\u0026thinsp;+\u0026thinsp;as eleven loosely coupled modules that exchange data through a shared contract of pandas Data Frames and typed Python data classes. The pipeline is structured as a directed acyclic graph:\u003c/p\u003e \u003cp\u003e \u003cem\u003eUser Profiler \u0026rarr; [Market Regime | Equity Health] \u0026rarr; Allocation Engine \u0026rarr; [Stock Selector | Fund Selector | REIT Selector] \u0026rarr; ML Optimiser \u0026rarr; Rebalancing Engine \u0026rarr; [Portfolio Manager | XAI | Dashboard]\u003c/em\u003e \u003c/p\u003e \u003cp\u003eThis modular layout keeps each component focused, makes unit testing straightforward, and lets developers swap out individual algorithms without breaking the rest of the system. Module descriptions are provided in Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab1\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 1\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eModule descriptions\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"3\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eModule\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eFile\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eResponsibility\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eUser Profiler\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003euser_profiler.py\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eInvestor questionnaire; CRS computation\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eMarket Regime\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003emarket_regime.py\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eBULL/BEAR/SIDEWAYS regime detection\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eEquity Health\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eequity_health.py\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eFundamental\u0026thinsp;+\u0026thinsp;technical stock scoring\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eAllocation Engine\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eallocation_engine_v2.py\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eTarget allocation percentages per asset class\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eStock Selector\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003estock_selector_v3.py\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eMulti-factor Indian equity ranking\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eFund Selector\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003efund_selector_v3.py\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eJensen Alpha mutual fund scoring\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eREIT/InvIT Selector\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003ereit_invit_selector.py\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eReal-estate instrument screening\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eML Optimiser\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eml_optimizer.py\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eMVO, Risk Parity, Max Sharpe, Black-Litterman\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eRebalancing Engine\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003erebalancing.py\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eDrift detection; FIFO tax-lot optimisation\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003ePortfolio Manager\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eportfolio_manager.py\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eHoldings tracking; performance metrics\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eXAI / Dashboard\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eexplainable_ai.py html_dashboard.py\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003ePlain-language narratives; interactive charts\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003e \u003c/p\u003e"},{"header":"4 Key Algorithms","content":"\u003cdiv id=\"Sec9\" class=\"Section2\"\u003e \u003ch2\u003e4.1 Composite Risk Scoring\u003c/h2\u003e \u003cp\u003eThe Composite Risk Score (CRS) is a weighted linear combination of four normalised sub-scores computed from investor profile inputs:\u003c/p\u003e \u003cp\u003e \u003cem\u003eCRS\u0026thinsp;=\u0026thinsp;0.25 \u0026times; Age Score\u0026thinsp;+\u0026thinsp;0.20 \u0026times; Income Score\u0026thinsp;+\u0026thinsp;0.30 \u0026times; Risk Tolerance Score\u0026thinsp;+\u0026thinsp;0.25 \u0026times; Horizon Score\u003c/em\u003e \u003c/p\u003e \u003cp\u003eOlder investors get lower age scores, dropping gradually as they grow older. Scores for income are trimmed by monthly payments and financial responsibilities. Risk tolerance is built from a 1-to-10 self-rating, stretched out to 0\u0026ndash;100. Horizon score climbs with how long someone plans to invest. CRS ranges and matching approaches appear in Table \u003cspan refid=\"Tab2\" class=\"InternalRef\"\u003e2\u003c/span\u003e - realistically, and the numbers don't always align perfectly.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab2\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 2\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eCRS bands and corresponding strategy profiles\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"4\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eCRS Range\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eStrategy\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eEquity Allocation\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eDebt Allocation\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e0\u0026ndash;40\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eConservative\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e20% \u0026ndash; 35%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e50% \u0026ndash; 65%\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e41\u0026ndash;65\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eModerate\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e45% \u0026ndash; 60%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e25% \u0026ndash; 40%\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e66\u0026ndash;100\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eAggressive\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e65% \u0026ndash; 80%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e10% \u0026ndash; 25%\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec10\" class=\"Section2\"\u003e \u003ch2\u003e4.2 Market Regime Detection\u003c/h2\u003e \u003cp\u003eThe regime detection tool interprets the movement of Nifty 50 index by combining three separate binary signals. (i) Trend Signal: 50 day moving average is compared with 200 day moving average; a Golden Cross signals a bullish market whereas a Death Cross signals a bearish market. (ii) Momentum Signal: 20-day rate of change (ROC); ROC\u0026thinsp;\u0026gt;\u0026thinsp;3% is bullish, ROC\u0026thinsp;\u0026lt;\u0026thinsp;3% is bearish. (iii) Volatility Signal: India VIX; a reading below 15 is bullish, a reading above 25 is bearish. The regime is decided by a weighted majority vote among the three signals. CRS Equity allocation will be increased by +\u0026thinsp;10 percentage points under a BULL regime, decreased by 15 pp under a BEAR regime, and remained unchanged under a SIDEWAYS regime with reference to the CRS-derived baseline. This kind of dynamic adjustment is a distinguishing factor of the system compared to fixed robo-advisory platforms.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec11\" class=\"Section2\"\u003e \u003ch2\u003e4.3 Equity Health Scoring\u003c/h2\u003e \u003cp\u003eEach potential stock is given a composite health score, which is equally split between fundamentals (50%) and technicals (50%) sub-scores. Among the fundamental factors, the ones most aligned with Indian market scenarios as per Pandey (\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e2020\u003c/span\u003e) and NSE Research (\u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e2021\u003c/span\u003e) are: Price-to-Earnings (P/E), Price-to-Book (P/B), Return on Capital Employed (ROCE), Return on Equity (ROE), and 3-year EPS growth CAGR. Technical factors, in this case, are Relative Strength Index (RSI, preferred range 45\u0026ndash;70), MACD signal cross over direction, price relative to 50-DMA and 200-DMA, and Average True Range (ATR) normalized volatility. The final stock health score affects the allocation engine results: a low overall health score means that the stock allocation will be less than the regime-adjusted baseline.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec12\" class=\"Section2\"\u003e \u003ch2\u003e4.4 Portfolio Optimisation\u003c/h2\u003e \u003cdiv id=\"Sec13\" class=\"Section3\"\u003e \u003ch2\u003e4.4.1 Mean-Variance Optimisation\u003c/h2\u003e \u003cp\u003eThis approach was used for the Aggressive investor profiles. The optimiser reduces portfolio variance to the lowest level (written mathematically as Var_p\u0026thinsp;=\u0026thinsp;w'Cw, where w stands for the vector of weights and C for the covariance matrix) while keeping the constraints w'mu\u0026thinsp;=\u0026thinsp;mu_target, sum(w_i)\u0026thinsp;=\u0026thinsp;1, and w_i\u0026thinsp;\u0026gt;\u0026thinsp;=\u0026thinsp;0 for all i. The covariance matrix C is computed by the Ledoit-Wolf shrinkage estimator (Ledoit and Wolf \u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e2004\u003c/span\u003e) used on three years of daily return data, significantly lowering the estimation error compared to the sample covariance matrix.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec14\" class=\"Section3\"\u003e \u003ch2\u003e4.4.2 Risk Parity\u003c/h2\u003e \u003cp\u003eApplied to Conservative profiles only. The optimiser finds a weight vector w such that the risk contribution of each asset to the overall portfolio is the same. Technically, the Equal Risk Contribution (ERC) condition states that w_i times (Cw)_i divided by (w'Cw) equals 1/n should hold for all assets i, where C is the covariance matrix. This is tackled numerically with scipy. optimize. minimize that employs a squared-deviation objective. ERC portfolios, according to research (Maillard et al. \u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e2010\u003c/span\u003e), tend to have smaller drawdowns during equity bear markets hence they are very suitable for capital-preservation profiles.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec15\" class=\"Section3\"\u003e \u003ch2\u003e4.4.3 Maximum Sharpe Ratio\u003c/h2\u003e \u003cp\u003eThis passage relates to Moderate investor profiles. The optimizer seeks to maximize the Sharpe ratio, which is given by the equation S = (Rp - Rf) / Sp, where Rp is the expected return of the portfolio, Rf\u0026thinsp;=\u0026thinsp;6. 5% (the approximate yield on 10-year Indian Government Securities), and Sp is the portfolio volatility. In a geometrical sense, it finds the tangency portfolio on the efficient frontier, i.e. the portfolio that lies on the Capital Market Line.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec16\" class=\"Section3\"\u003e \u003ch2\u003e4.4.4 Black-Litterman Model\u003c/h2\u003e \u003cp\u003eIntended for experienced investors who have very specific market views. The model uses a Bayesian update method to blend CAPM equilibrium returns with investor-specified views. Posterior expected returns (mu_BL) are obtained by merging the prior equilibrium return vector (Pi, calculated from market-cap weights) with the set of investor view vectors (P) and their expected outperformance values (Q). These are weighted by the view-uncertainty matrix (Omega) and scaling factor tau being applied to the covariance matrix. The posterior returns obtained in this manner are subsequently used as inputs to the MVO solver, thereby generating allocations that are significantly more stable and intuitive as compared to those from unconstrained MVO (Black and Litterman \u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e1992\u003c/span\u003e).\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv id=\"Sec17\" class=\"Section2\"\u003e \u003ch2\u003e4.5 Fund Selection via Jensen Alpha Scoring\u003c/h2\u003e \u003cp\u003eMutual funds receive their evaluation through a 100-point system. The Alpha Score (35 points) gives 35 points for Jensen Alpha\u0026thinsp;\u0026gt;\u0026thinsp;3%, 30 points for 1. 53%, 20 points for any positive alpha, and 0 for negative alpha. Beta Score (25 points) grants 25 points for beta\u0026thinsp;\u0026lt;\u0026thinsp;=\u0026thinsp;0. 7, 18 points for beta\u0026thinsp;\u0026lt;\u0026thinsp;=\u0026thinsp;1. 0, and 0 for beta\u0026thinsp;\u0026gt;\u0026thinsp;1. 2. Downside Protection Score (20 points) is the result of a weighted combination of the Sortino ratio (Sortino and Price \u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e1994\u003c/span\u003e) and the maximum drawdown. A Consistency Bonus (10 points) goes to the funds on the AMFI whitelist of persistent outperformers. Sharpe Score (10 points) is a measure of risk-adjusted return compared to category peers. From each subcategory (Large Cap, Flexi Cap, Mid Cap, Small Cap ELSS Balanced Advantage, International), top two or three funds are recommended with proportionate allocations.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec18\" class=\"Section2\"\u003e \u003ch2\u003e4.6 Tax-Aware Rebalancing\u003c/h2\u003e \u003cp\u003eFour modes are available for the rebalancing engine: Threshold-based (5% drift from target allocation as default trigger), Time-based (e. g. monthly, quarterly, or annual fixed intervals), Calendar-based (e. g. start of each financial quarter), and Adaptive (the threshold tightens during low-VIX environments and relaxes during high-VIX periods to reduce unnecessary trading costs). Tax lots are recorded by FIFO accounting, Indian capital-gains law requires. Before producing sell orders the engine locates unrealised-loss positions eligible for tax-loss harvesting (TLH), thus capital losses are netted against realised gains. Tax is calculated as STCG (held\u0026thinsp;\u0026lt;\u0026thinsp;12 months) at 15% and LTCG (held 12 months, gains exceeding 1 lakh) at 10%, in line with post-Union Budget 2024 rates (SEBI \u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e2013\u003c/span\u003e).\u003c/p\u003e \u003c/div\u003e"},{"header":"5 Experimental Results","content":"\u003cdiv id=\"Sec20\" class=\"Section2\"\u003e \u003ch2\u003e5.1 Simulated Investor Profiles\u003c/h2\u003e \u003cp\u003eWe built three representative investor profiles that span the full CRS range to test how the system performs. All simulated returns draw on three years of historical data for the selected instruments, as of March 2026; Sharpe ratios are computed at r_f\u0026thinsp;=\u0026thinsp;6.5%. Results are reported in Table\u0026nbsp;\u003cspan refid=\"Tab3\" class=\"InternalRef\"\u003e3\u003c/span\u003e.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab3\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 3\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eSimulated portfolio outputs for representative investor profiles\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"4\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eParameter\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eConservative (25F)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eModerate (35M)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eAggressive (28M)\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eCRS\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e32\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e58\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e75\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eMarket Regime\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eSIDEWAYS\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eBULL\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003eBULL\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eLarge Cap Equity\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e18%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e28%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e38%\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eMid/Small Cap Equity\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e5%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e16%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e28%\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eDebt\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e55%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e32%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e14%\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eGold / Silver\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e12%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e8%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e6%\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eREIT / InvIT\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e5%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e8%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e6%\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eInternational Equity\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e5%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e8%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e8%\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eStocks recommended\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e3\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e8\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e12\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eFunds recommended\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e4\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e8\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e10\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eSharpe ratio (simulated)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e0.62\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e0.89\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e1.04\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eAnn. return (simulated)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e8.2%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e12.6%\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e16.1%\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003eThe Conservative profile\u0026rsquo;s heavy debt allocation (55%) under a SIDEWAYS regime reflects appropriate capital-preservation positioning. The Aggressive profile\u0026rsquo;s 66% combined equity allocation under a BULL regime is consistent with classical aggressive-growth strategy. Simulated Sharpe ratios are consistent with published benchmarks for equivalent strategy types in the Indian mutual fund industry.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec21\" class=\"Section2\"\u003e \u003ch2\u003e5.2 Portfolio Optimisation Results\u003c/h2\u003e \u003cp\u003eThe Moderate investor\u0026rsquo;s portfolio was processed through the Maximum Sharpe Ratio optimiser. The recommended portfolio lies at the tangency point between the efficient frontier and the Capital Market Line. Monte Carlo simulation (5,000 paths; 5-year calibration window) with an initial lump sum of ₹5,00,000 and a monthly SIP of ₹25,000 produced the wealth projections reported in Table\u0026nbsp;\u003cspan refid=\"Tab5\" class=\"InternalRef\"\u003e4\u003c/span\u003e.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab4\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 4\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eMonte Carlo wealth projections \u0026mdash; Moderate investor (₹5 L lump sum + ₹25,000 monthly SIP)\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"4\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eHorizon\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003e10th Percentile\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003e50th Percentile (Median)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003e90th Percentile\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e1 Year\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e₹5.8 L\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e₹7.1 L\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e₹8.5 L\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e3 Years\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e₹12.4 L\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e₹17.2 L\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e₹23.8 L\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e5 Years\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e₹21.6 L\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e₹33.5 L\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e₹49.2 L\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e10 Years\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e₹52.3 L\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e₹98.7 L\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e₹1.72 Cr\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e20 Years\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e₹1.24 Cr\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e₹3.86 Cr\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e₹9.41 Cr\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003eThe large gap between the 10th and 90th percentile wealth outcomes at the 20-year mark (₹1.24 Cr vs ₹9.41 Cr) shows how return uncertainty compounds and grows substantially over long periods. The XAI module makes this clear to users in plain language, alongside the median projection, helping investors form realistic expectations.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec22\" class=\"Section2\"\u003e \u003ch2\u003e5.3 Rebalancing and Tax Efficiency\u003c/h2\u003e \u003cp\u003eA simulation was conducted in which the Moderate investor\u0026rsquo;s equity allocation drifted from 44% to 52% during a bull-market period. The 5% drift threshold triggered rebalancing when equity reached 49%. The engine identified three equity positions for partial disposal, generating ₹12,400 in short-term capital gains (STCG, held\u0026thinsp;\u0026lt;\u0026thinsp;12 months) and ₹34,600 in long-term capital gains (LTCG). One position carrying a ₹6,800 unrealised loss was identified for tax-loss harvesting (TLH). Results are presented in Table\u0026nbsp;\u003cspan refid=\"Tab6\" class=\"InternalRef\"\u003e5\u003c/span\u003e.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab6\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 5\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eRebalancing tax impact analysis\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"3\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eMetric\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eWithout Tax Optimisation\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eWith Tax Optimisation (TLH)\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eShort-term capital gains\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e₹12,400\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e₹5,600 (after TLH)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eLong-term capital gains\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e₹34,600\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e₹34,600\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eTotal tax liability\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e₹11,940\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e₹7,180\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eTax saving\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e\u0026mdash;\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e₹4,760 (39.8%)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eTransaction costs\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e₹890\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e₹1,120 (one extra trade)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eNet benefit of TLH\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003e\u0026mdash;\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003e₹3,640\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003eThe 39.8% reduction in tax liability demonstrates the material financial benefit of TLH-aware rebalancing. Results were cross-validated against manual calculations for five test portfolios, with numerical agreement to within 0.01% (consistent with IEEE Std 829\u0026ndash;2008 for software testing documentation).\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e"},{"header":"6 Discussion","content":"\u003cdiv id=\"Sec24\" class=\"Section2\"\u003e \u003ch2\u003e6.1 Advantages\u003c/h2\u003e \u003cp\u003eThe system has a range of advantages, including the ability to deal with a very broad selection of different types of investment assets (six asset classes that are not very highly correlated with each other); use of a regime-switching model to dynamically adapt investment allocations, which is not a feature of most 'static' retail advisory tools; tax-loss harvesting which can be quantified and result in meaningful tax savings; use of 'explainable AI' to provide investors with reasoning in an easy and comprehensible manner and therefore the trust in investment decisions can be achieved, which is the point of the explainability that has been questioned by Fein (\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e2015\u003c/span\u003e); as well as the whole implementation being done by means of completely open-source Python libraries which, on the one hand, keep the system very inexpensive and, on the other hand, also allow for the system to be checked (audited) by any interested party.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec25\" class=\"Section2\"\u003e \u003ch2\u003e6.2 Limitations\u003c/h2\u003e \u003cp\u003eHowever, there are some important downsides to be aware of. First, reliance on public data APIs (yfinance, nselib) can lead to errors in data quality and timeliness, especially for small-cap stocks and some mutual fund data points. Second, estimating returns based on historical data will obviously not work where there have been major changes in the economic or structural environment that are outside the historical data used for calibration. Third, while the system offers suggestions for investments it does not actually perform trades and would need to be connected with a brokerage API to be able to do trades. Fourth, the system does not take into consideration behavioural tendencies such as panic selling or recency bias which may cause investor actions in the real world to differ significantly from what the model predicts. Lastly, this platform is not registered as a SEBI Investment Adviser under SEBI (\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e2013\u003c/span\u003e) regulations; all outputs are for education only and one should obtain a qualified professional's review before making decisions based on them.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec26\" class=\"Section2\"\u003e \u003ch2\u003e6.3 Future Work\u003c/h2\u003e \u003cp\u003eMajorly focus on future development directions should be on: (i) real-time brokerage API integration (Zerodha Kite Groww Upstox) for single-click trade execution; (ii) LSTM and transformer-based return forecasting to replace historical mean estimates; (iii) reinforcement learning agents for dynamic rebalancing policy optimisation, extending the approach of Jiang et al. (\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e2017\u003c/span\u003e); (iv) a goal-based investing module supporting multi-bucket portfolios for retirement, education, and housing; (v) NLP-driven news-sentiment integration for real-time expected return adjustment; and (vi) a SEBI regulatory compliance module enabling formal advisory registration.\u003c/p\u003e \u003c/div\u003e"},{"header":"7 Conclusion","content":"\u003cp\u003eRecently, we have developed an AI-powered Personalised Portfolio Allocation Engine that integrates five-parameter risk profiling, three-signal market regime detection, multi-factor equity and fund scoring, four ML portfolio optimisation algorithms, and tax-aware rebalancing into one unified platform for Indian retail investors. Across three investor profiles, simulation results showed Sharpe ratios of 0. 62 to 1. 04 and annualised simulated returns of 8. 2% to 16. 1%, which is consistent with financial theory predictions for each risk level. By using systematic TLH, our tax-aware rebalancing engine was able to reduce the rebalancing tax liability by 39. 8%. Moreover, the easy-to-understand XAI narratives help explain the transparency gap that is widely discussed in the robo-advisory literature. Our solution is filling five major research gaps - focus on the Indian market, active selection of instruments, adaptation to market regimes, tax-aware rebalancing, and explanation - and thereby making institutional-quality portfolio management significantly more accessible to the everyday Indian investor.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eFunding\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eNo funding was received to assist with the preparation of this manuscript.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConflict of interest\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe authors have no relevant financial or non-financial interests to disclose.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eEthics approval and consent to participate\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eNot applicable.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eData availability\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eMarket data are sourced from publicly available APIs (Yahoo Finance via yfinance v1.0; NSElib v2.4.2). Mutual fund data are derived from AMFI public disclosures. No proprietary datasets were used. Code is available from the corresponding author on reasonable request.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eCode availability\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eSource code is available in the project repository. Key dependencies are listed in the Appendix.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAuthor contribution\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eTej Bachhav: Conceptualisation, system architecture, ML optimisation module, rebalancing engine, writing \u0026mdash; original draft.\u003c/p\u003e\n\u003cp\u003eTejas Parekh: Equity and fund selection modules, XAI module, HTML dashboard, writing \u0026mdash; review and editing.\u003c/p\u003e\n\u003cp\u003eKapil Rathor: Supervision, Validation, Project administration.\u003c/p\u003e\n\u003cp\u003eAmi Munshi: Supervision, Validation, Project administration, writing \u0026mdash; review and editing.\u003c/p\u003e\n\u003cp\u003eMoumita Roy: Supervision, Validation, Project administration, writing \u0026mdash; review and editing.\u003c/p\u003e\n\u003cp\u003eAll authors have read and approved the final manuscript\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\n\u003cli\u003eAMFI (2024) Industry Data and Analytics Reports 2023\u0026ndash;2024. Association of Mutual Funds in India. https://www.amfiindia.com\u003c/li\u003e\n\u003cli\u003eBlack F, Litterman R (1992) Global portfolio optimization. Financ Anal J 48(5):28\u0026ndash;43\u003c/li\u003e\n\u003cli\u003eFama EF (1970) Efficient capital markets: a review of theory and empirical work. J Finance 25(2):383\u0026ndash;417\u003c/li\u003e\n\u003cli\u003eFein ML (2015) Robo-advisers: a closer look. SSRN Working Paper 2658701. https://ssrn.com/abstract=2658701\u003c/li\u003e\n\u003cli\u003eHeaton JB, Polson NG, Witte JH (2017) Deep learning for finance: deep portfolios. Appl Stoch Model Bus Ind 33(1):3\u0026ndash;12. https://doi.org/10.1002/asmb.2209\u003c/li\u003e\n\u003cli\u003eJiang Z, Xu D, Liang J (2017) A deep reinforcement learning framework for the financial portfolio management problem. arXiv preprint arXiv:1706.10059\u003c/li\u003e\n\u003cli\u003eKahneman D, Tversky A (1979) Prospect theory: an analysis of decision under risk. Econometrica 47(2):263\u0026ndash;291\u003c/li\u003e\n\u003cli\u003eLedoit O, Wolf M (2004) A well-conditioned estimator for large-dimensional covariance matrices. J Multivar Anal 88(2):365\u0026ndash;411. https://doi.org/10.1016/S0047-259X(03)00096-4\u003c/li\u003e\n\u003cli\u003eLopez de Prado M (2018) Advances in Financial Machine Learning. Wiley, Hoboken NJ\u003c/li\u003e\n\u003cli\u003eMaillard S, Roncalli T, Teiletche J (2010) The properties of equally weighted risk contribution portfolios. J Portf Manag 36(4):60\u0026ndash;70\u003c/li\u003e\n\u003cli\u003eMarkowitz H (1952) Portfolio selection. J Finance 7(1):77\u0026ndash;91. https://doi.org/10.2307/2975974\u003c/li\u003e\n\u003cli\u003eNSE Research (2021) Factor Investing in Indian Equities. National Stock Exchange of India, Mumbai\u003c/li\u003e\n\u003cli\u003ePandey IM (2020) Financial Management, 12th edn. Vikas Publishing House, New Delhi\u003c/li\u003e\n\u003cli\u003eSEBI (2013) SEBI (Investment Advisers) Regulations. Securities and Exchange Board of India, Mumbai\u003c/li\u003e\n\u003cli\u003eSEBI (2020) Circular on Portfolio Management Services. Securities and Exchange Board of India, Mumbai\u003c/li\u003e\n\u003cli\u003eSharpe WF (1964) Capital asset prices: a theory of market equilibrium under conditions of risk. J Finance 19(3):425\u0026ndash;442\u003c/li\u003e\n\u003cli\u003eShefrin H, Statman M (2000) Behavioral portfolio theory. J Financ Quant Anal 35(2):127\u0026ndash;151\u003c/li\u003e\n\u003cli\u003eSironi P (2016) FinTech Innovation: From Robo-Advisors to Goal Based Investing and Gamification. Wiley, Chichester\u003c/li\u003e\n\u003cli\u003eSortino FA, Price LN (1994) Performance measurement in a downside risk framework. J Investing 3(3):59\u0026ndash;64\u003c/li\u003e\n\u003cli\u003eyfinance (2024) Yahoo Finance market data downloader, v1.0. https://pypi.org/project/yfinance/\u003c/li\u003e\n\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"international-journal-of-data-science-and-analytics","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"jdsa","sideBox":"Learn more about [International Journal of Data Science and Analytics](http://link.springer.com/journal/41060)","snPcode":"41060","submissionUrl":"https://submission.nature.com/new-submission/41060/3","title":"International Journal of Data Science and Analytics","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"em","reportingPortfolio":"Springer Hybrid","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"Portfolio optimisation, Robo-advisory, Machine learning in finance, Explainable AI, Indian capital markets, Composite risk scoring, Black-Litterman model, Tax-loss harvesting","lastPublishedDoi":"10.21203/rs.3.rs-9509094/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-9509094/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eIndia has seen a rapid rise in retail investment accounts — exceeding 180 million mutual fund folios and 150 million demat accounts by 2024 — has not been met with accessible, personalised investment advisory tools. This paper presents an AI-Based Personalised Portfolio Allocation Engine that closes this gap by combining rule-based financial logic with four machine learning optimisation methods: Mean-Variance Optimisation (MVO), Risk Parity, Maximum Sharpe Ratio, and the Black-Litterman model. The engine builds a five-parameter Composite Risk Score (CRS) to classify investors into Conservative, Moderate, or Aggressive profiles and uses a three-signal market regime detector employing moving average crossover, momentum (rate of change), and volatility (India VIX) indicators. Individual equities are scored on combined fundamental and technical criteria, while mutual funds are ranked via Jensen Alpha-based scoring across six subcategories. Portfolio weights are then fine-tuned using Monte Carlo simulation with 5,000 scenarios spanning 20-year wealth projection horizons. An Explainable AI (XAI) module converts all algorithmic decisions into plain-language investment rationales, and a tax-aware rebalancing engine employing FIFO lot tracking and tax-loss harvesting reduced simulated rebalancing tax liability by 39.8%. Across three representative investor profiles, the system generated annualised simulated returns of 8.2%, 12.6%, and 16.1% (Conservative, Moderate, Aggressive) with Sharpe ratios of 0.62, 0.89, and 1.04 respectively. The system is implemented entirely in Python 3.10+ using open-source libraries and tackles five specific gaps in the Indian retail robo-advisory landscape.\u003c/p\u003e","manuscriptTitle":"AI-Based Personalised Portfolio Allocation Engine for Indian Retail Investors: A Multi-Algorithm Optimisation Approach with Explainable AI","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-05-12 17:31:30","doi":"10.21203/rs.3.rs-9509094/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"reviewerAgreed","content":"124272911170247076525020357285211716789","date":"2026-05-06T19:51:12+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-05-06T09:54:15+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"181913795999149090277468532381138596596","date":"2026-05-06T08:46:59+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2026-05-04T23:58:35+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2026-05-01T18:26:32+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2026-04-30T03:55:02+00:00","index":"","fulltext":""},{"type":"submitted","content":"International Journal of Data Science and Analytics","date":"2026-04-23T16:27:17+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"international-journal-of-data-science-and-analytics","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"jdsa","sideBox":"Learn more about [International Journal of Data Science and Analytics](http://link.springer.com/journal/41060)","snPcode":"41060","submissionUrl":"https://submission.nature.com/new-submission/41060/3","title":"International Journal of Data Science and Analytics","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"em","reportingPortfolio":"Springer Hybrid","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"6d3e6337-8152-4376-a20c-b362930641d5","owner":[],"postedDate":"May 12th, 2026","published":true,"recentEditorialEvents":[{"type":"reviewerAgreed","content":"124272911170247076525020357285211716789","date":"2026-05-06T19:51:12+00:00","index":21,"fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-05-06T09:54:15+00:00","index":20,"fulltext":""},{"type":"reviewerAgreed","content":"181913795999149090277468532381138596596","date":"2026-05-06T08:46:59+00:00","index":19,"fulltext":""},{"type":"reviewersInvited","content":"10","date":"2026-05-04T23:58:35+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2026-05-01T18:26:32+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2026-04-30T03:55:02+00:00","index":"","fulltext":""}],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[],"tags":[],"updatedAt":"2026-05-12T17:31:30+00:00","versionOfRecord":[],"versionCreatedAt":"2026-05-12 17:31:30","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-9509094","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-9509094","identity":"rs-9509094","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

⚙ Ask this paper AI returns verbatim quotes from the full text · source: preprint-html ⓘ

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc: last seen: 2026-05-20T01:45:00.602351+00:00
unpaywall: last seen: 2026-05-23T02:00:01.238055+00:00

License: CC-BY-4.0