A Dynamic Relative Probability Matrix Framework for Sports Outcome Prediction

preprint OA: closed
Full text JSON View at publisher
Full text 149,002 characters · extracted from preprint-html · click to expand
A Dynamic Relative Probability Matrix Framework for Sports Outcome Prediction | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article A Dynamic Relative Probability Matrix Framework for Sports Outcome Prediction Pentyala Samanvith Chowdary This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-9671470/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract Predicting match outcomes in professional Twenty20 cricket remains a challenging task due to the high variance of short-format games and the strong dependence on evolving team dynamics across seasons. Traditional rating-based approaches such as Elo provide a widely used baseline, but often fail to capture nuanced head-to-head competitiveness and season-specific context [1,2]. In this work, we propose a Dynamic Relative Probability Matrix (RPM) framework for forecasting outcomes in the Indian Premier League (IPL). The proposed approach represents team interactions through structured pairwise win-probability matrices that are updated over time, combining long-term historical tendencies with current-season adaptation. We evaluate the method on IPL match data from the 2018–2024 seasons and compare performance against rolling Elo baselines under a forward-chaining evaluation protocol. Across all seasons, the Dynamic RPM model achieves an average accuracy of approximately 54%, outperforming Elo by more than 3 percentage points. To assess robustness, we further conduct a McNemar significance test on paired match-level predictions, obtaining a p-value of 0.0034, which confirms that the improvement over Elo is statistically significant (p < 0.05) [3]. To assess generalizability, we further validate RPM on the English Premier League (EPL) using 9,380 matches across 24 seasons (2001/02–2024/25), where RPM achieves 67.71% average accuracy, remaining competitive with Elo (67.75%), Logistic Regression (67.53%), and XGBoost (67.32%) without any feature engineering. These results demonstrate that structured probability representations of head-to-head team dynamics offer a statistically reliable advantage over conventional rating systems for cricket match outcome prediction, highlighting the potential of RPM-based modelling in sports analytics. Artificial Intelligence and Machine Learning Sports Analytics Match Outcome Prediction Indian Premier League Elo Rating Pairwise Probability Modelling Dynamic Probability Matrices Statistical Significance Testing Figures Figure 1 Figure 2 Figure 3 1. Introduction Outcome prediction in professional sports has become an increasingly important problem in modern analytics, with applications ranging from coaching strategy and performance evaluation to betting markets and audience engagement [ 4 , 5 ]. In cricket, and particularly in the Indian Premier League (IPL), the prediction of match outcomes remains challenging due to the high variance of the Twenty20 (T20) format [ 12 , 13 ]. Matches are short, momentum shifts rapidly, and contextual factors such as evolving team composition, venue conditions, and competitive balance can strongly influence results. As a result, building reliable forecasting models for IPL outcomes continues to be an active area of research in sports analytics [ 14 , 15 ]. The IPL is a franchise-based T20 league founded in 2008 by the Board of Control for Cricket in India (BCCI). It currently features ten franchises that compete in a round-robin group stage followed by playoffs, yielding approximately 74 matches per season. A defining structural feature of the IPL is its annual player auction, through which franchises acquire and release players each year. This means squad compositions can change dramatically between seasons, introducing a level of roster volatility that is uncommon in most other professional sports leagues. Unlike domestic leagues where player contracts are multi-year and stable, IPL teams may field substantially different line-ups from one season to the next, which creates unique modelling challenges for outcome prediction. The T20 format itself is a compressed version of cricket in which each team faces exactly 20 overs (120 deliveries) per innings. A full match is typically completed in approximately three hours, in stark contrast to One Day Internationals (50 overs per side) or Test matches, which can last up to five days. This brevity is the source of the format’s extreme variance: a single over can contain multiple boundaries or wickets, swinging the match outcome dramatically. A team that dominates for 18 overs can lose in the final two. As a result, even the strongest franchises lose frequently, and prediction accuracy above 55–60% is considered strong in this domain [ 14 , 15 ]. Traditional approaches to match outcome prediction have commonly relied on rating-based systems, with the Elo framework being among the most widely adopted baselines [ 1 , 2 ]. Elo-based models provide a simple and interpretable mechanism for estimating team strength through sequential updates driven by match results [ 1 , 6 ]. However, rating systems often reduce team interactions to a single scalar strength value, which may be insufficient in leagues such as the IPL where competitive relationships can be highly matchup-dependent [ 6 , 7 ]. For example, certain teams may consistently perform better against specific opponents despite having similar overall season strength, indicating the importance of structured head-to-head dynamics beyond global ratings [ 8 , 9 ]. Recent advances in machine learning have also been explored for sports forecasting, including logistic regression, tree-based methods, and gradient boosting models [ 4 , 7 ]. These approaches can incorporate richer contextual features, but they typically require substantial data and careful feature engineering. In franchise-based leagues such as the IPL, the number of matches per season is relatively limited, and team identities evolve through auctions, transfers, and the introduction of new franchises. These factors make purely data-hungry black-box predictors less stable across seasons and motivate the development of structured probabilistic models that can remain robust under limited data availability [ 14 , 16 ]. In this work, we propose a Dynamic Relative Probability Matrix (RPM) framework for predicting match outcomes in the IPL. The central idea of RPM is to represent team competitiveness through pairwise probability matrices rather than single-valued ratings. Each matrix entry encodes the historical win tendency of one team against another, capturing head-to-head interactions explicitly [ 8 , 9 ]. Furthermore, the framework is designed to integrate both long-term historical information and season-level adaptation, allowing predictions to evolve as new match results become available. This dynamic structure provides an interpretable representation of team relationships while remaining well-suited to the relatively small sample regime of T20 leagues [ 15 , 17 ]. We evaluate the proposed approach on IPL match data spanning the 2018–2024 seasons under a forward-chaining rolling evaluation protocol, ensuring that predictions are made using only past-season information. The Dynamic RPM model achieves an average accuracy of approximately 54.3%, outperforming a rolling Elo baseline (50.98%) by more than three percentage points. Importantly, we assess the robustness of this improvement through a McNemar significance test on paired match-level predictions, obtaining a p-value of 0.0034, which confirms that the observed performance gain over Elo is statistically significant [ 3 , 10 ]. The contributions of this study are threefold: We introduce a Dynamic Relative Probability Matrix formulation for IPL match outcome prediction that explicitly models head-to-head team competitiveness. We provide a rigorous rolling-season evaluation against standard baselines, including Elo and logistic regression models [ 1 , 4 ]. We demonstrate statistically significant predictive improvements over classical rating-based approaches, highlighting the value of structured probabilistic modelling in cricket analytics [ 3 , 10 ]. We further validate generalizability on the English Premier League (9,380 matches, 24 seasons), where RPM remains competitive with feature-rich ML baselines. Overall, this work suggests that matrix-based representations of pairwise team dynamics offer a practical and interpretable alternative to conventional rating systems for match outcome forecasting in high-variance sports such as T20 cricket [ 8 , 14 ]. In franchise-based leagues such as the IPL, seasonal datasets remain sparse and competitive dynamics shift frequently due to auctions and team restructuring. The proposed RPM framework provides an interpretable pairwise modelling alternative that explicitly captures opponent-dependent interactions, making it particularly suitable for sparse sports environments where black-box models often fail to generalize reliably. 2. Related Work Sports outcome prediction has been widely studied across multiple domains, including football, basketball, baseball, and cricket, due to its importance in decision-making, fan engagement, and competitive strategy [ 4 , 5 , 22 , 23 , 24 , 25 ]. Over the past decades, researchers have developed a broad range of approaches, spanning classical statistical rating systems, probabilistic modelling frameworks, and modern machine learning methods [ 4 , 7 ]. 2.1 Rating-Based Methods One of the most established approaches to modelling competitive performance is the use of rating systems. The Elo rating framework has become a standard baseline in sports analytics, originally designed for chess and later extended to a variety of sports settings [ 1 , 2 ]. Elo provides a sequential update mechanism that estimates team strength through match outcomes, making it attractive due to its simplicity and interpretability [ 1 , 6 ]. However, a limitation of Elo-style models is that they typically represent each team using a single scalar rating. This scalar representation often fails to capture matchup-specific dynamics, where performance may vary substantially depending on the opponent rather than overall strength alone. Such effects are particularly relevant in leagues with stylistic variability or strong head-to-head dependencies [ 6 , 8 ]. Aldous [ 2 ] demonstrated through probabilistic analysis that Elo ratings converge to stable distributions under repeated play, but noted that this stability comes at the cost of ignoring structured opponent-dependent interactions — a limitation that directly motivates the RPM framework proposed in this work. 2.2 Probabilistic and Structured Models Beyond rating systems, several studies have explored structured probabilistic models for sports forecasting. Methods based on Bayesian updating, Markov processes, and pairwise probability estimation have been used to model uncertainty and evolving team performance [ 8 , 9 ]. Pairwise interaction frameworks are especially useful when the relationship between two teams cannot be reduced to independent global ratings [ 7 , 9 ]. Matrix-based probabilistic representations have also been studied in other predictive domains, including ranking theory and competitive networks [ 5 , 8 ]. Langville and Meyer [ 5 ] showed that matrix-based ranking methods can capture competitive transitivity — where team A beating B and B beating C implies something about A versus C — a property that scalar ratings handle poorly. This finding directly supports the RPM approach, where the full pairwise matrix preserves such relational structure. These approaches naturally motivate probability-matrix formulations, where the competitive tendencies between each pair of teams are explicitly represented. This directly aligns with the foundation of Relative Probability Matrix (RPM) modelling [ 8 , 9 ]. Pairwise comparison modelling has also been studied through Bradley–Terry style frameworks, where team strengths are estimated via likelihood-based paired outcome models. Glickman [ 9 ] discusses parameter estimation in such dynamic paired comparison settings. Notably, Glickman’s results highlight that maximum-likelihood estimation in dynamic paired comparison settings requires substantial data to stabilize — a requirement that is rarely met in T20 leagues with limited matches per season. RPM sidesteps this by constructing an empirical probability matrix directly, avoiding iterative MLE fitting while remaining interpretable. Unlike Bradley–Terry approaches that require iterative maximum-likelihood optimisation, RPM directly constructs an empirical probability matrix and updates it online, offering a simpler and more interpretable alternative without requiring explicit MLE fitting. 2.3 Machine Learning Approaches in Sports Forecasting In recent years, machine learning models have become increasingly prominent in sports outcome prediction. Logistic regression has been widely used as a baseline due to its interpretability and effectiveness under limited feature settings [ 4 , 7 ]. More complex models such as random forests, gradient boosting machines, and neural networks have also been applied successfully [ 4 , 7 ]. Among these, XGBoost and other boosting-based classifiers have emerged as strong tabular-data predictors, especially when contextual features such as venue, toss advantage, player form, and recent performance trends are included [ 18 , 19 ]. Despite their predictive strength, machine learning methods often require large training datasets, extensive feature engineering, and careful regularization for stability. In sports such as T20 cricket, where seasonal match counts are limited and team identities evolve through auctions and new franchises, purely data-hungry models may not generalize robustly across years [ 14 , 15 , 16 ]. 2.4 Cricket and IPL Outcome Prediction Cricket analytics presents unique challenges due to the multi-factor nature of match outcomes, including toss decisions, pitch conditions, and format-specific variance. Several studies have investigated match forecasting in One Day Internationals and T20 leagues using statistical and ML-based methods [ 12 , 13 , 14 ]. The IPL has attracted particular interest because of its competitive balance and high unpredictability. Prior work has explored team rating and momentum effects, player-level performance indicators, contextual venue and toss influences, and supervised learning models for match classification [ 14 , 15 , 16 ]. However, most IPL forecasting approaches either rely heavily on black-box ML predictors or adopt scalar rating systems such as Elo, without explicitly modelling structured opponent-dependent interactions [ 15 , 17 ]. Navaneethan [ 14 ] compared multiple ML models for IPL prediction and found that no single approach consistently outperformed others across seasons, suggesting that structural variability — precisely the type RPM is designed to capture — is a persistent challenge for data-driven methods. 2.5 Motivation for Dynamic RPM Modelling The Dynamic Relative Probability Matrix approach proposed in this work is motivated by the gap between interpretable rating-based models (e.g., Elo) [ 1 , 2 ] and data-intensive machine learning frameworks (e.g., gradient boosting) [ 18 , 19 ]. RPM provides a middle ground: a structured probabilistic formulation that explicitly captures head-to-head team tendencies while remaining robust in low-data seasonal settings [ 8 , 9 ]. Furthermore, by incorporating dynamic updates across seasons, RPM is able to reflect evolving team strength while preserving historical competitive structure [ 14 , 17 ]. In summary, sports prediction research spans rating systems, probabilistic modelling, and machine learning approaches. While Elo remains a widely accepted baseline [ 1 , 2 ], and boosting methods such as XGBoost provide strong predictive tools with sufficient context [ 18 ], there remains a need for interpretable, structured models that capture pairwise team interactions explicitly [ 8 , 9 ]. The evidence reviewed here — from Aldous’s [ 2 ] analysis of Elo’s limitations, to Glickman’s [ 9 ] findings on MLE instability in small samples, to Navaneethan’s [ 14 ] observation of cross-season inconsistency in IPL prediction — collectively motivates the need for a structured pairwise approach that is both interpretable and robust to sparse seasonal data. This motivates the Dynamic RPM framework developed in this study for IPL outcome forecasting [ 14 , 15 ]. 3. Methodology This section introduces the proposed Dynamic Relative Probability Matrix (RPM) framework for predicting match outcomes in the IPL. The central objective is to model team competitiveness through structured pairwise interactions rather than scalar strength values. The approach explicitly represents historical head-to-head tendencies and incorporates adaptive updates as the season progresses [ 8 , 9 ]. 3.1 Problem Formulation Let T = {t₁, t₂, …, tₙ} denote the set of teams participating in the league. Each match is defined by an ordered pair of competing teams (t_i, t_j) ∈ T × T, i ≠ j, and an observed match outcome y_ij ∈ {t_i, t_j} where y_ij denotes the winning team. The prediction task is to estimate P(t_i beats t_j) for any matchup (t_i, t_j), using only information available prior to the match, consistent with standard sports analytics forecasting formulations [ 4 , 7 ]. 3.2 Relative Probability Matrix Representation We represent the competitive structure of the league using a Relative Probability Matrix R ∈ ℝ^(N×N) where each entry R_ij ∈ [0,1] denotes the empirical probability that team t_i defeats team t_j. Such structured pairwise probability representations have been explored in ranking and interaction-based competitive modelling frameworks [ 5 , 8 ]. The matrix is constructed from historical match outcomes as: R_ij = W_ij / M_ij, where M_ij is the total number of matches played between t_i and t_j, and W_ij are the number of matches won by t_i against t_j. If no prior match history exists for a team pair, the model initializes R_ij = 0.5, representing maximum uncertainty. Diagonal entries are set to R_ii = 1 for completeness. RPM can be interpreted as a non-parametric predictive framework, where inference is derived directly from empirical relational structure rather than learned parameters. This distinguishes it from both rating-based models (which optimise scalar estimates) and supervised classifiers (which minimise a loss over labelled examples). Because there are no learned parameters, conventional feature importance scores or regression coefficients do not apply; the interpretive function that such scores would normally serve is instead captured by the model-level comparisons in Table 1 . 3.3 Overall and Current Season Matrices A key aspect of the proposed method is the separation of team dynamics into overall historical behaviour and current-season adaptation. We define two matrices. Historical Matrix R(H): constructed using all matches prior to the current season. Current Matrix R(C): initialized as R_ij(C) = 0.5 for i ≠ j, and dynamically updated as matches in the season are played. This distinction allows the model to capture both long-term competitiveness patterns and recent season-specific evolution. Such temporal updating of competitive strength is conceptually related to adaptive rating systems such as Elo [ 1 , 2 ], while extending beyond scalar ratings into structured pairwise form [ 8 , 9 ]. 3.4 Differential Competitive Signal For a given matchup between teams t_i and t_j, the model computes a differential probability signal based on their relative relationships with all other teams. We compute Δ(H) = R·_i(H) − R·_j(H) and similarly Δ(C) = R·_i(C) − R·_j(C). The combined competitive signal is Δ = (Δ(H) + Δ(C)) / 2. This relational comparison mechanism is motivated by pairwise interaction modelling approaches in competitive networks [ 8 , 9 ]. 3.5 Match Outcome Prediction Rule The prediction is determined by the sign distribution of the combined signal vector Δ. If the majority of elements are positive (|Δ_k > 0| > |Δ_k 0| / |Δ|, representing the fraction of favourable comparisons supporting the predicted team. 3.6 Dynamic Online Update Mechanism After each match, the current-season matrix is updated online. If the predicted winner is t_i with probability p, then R_ij(C) ← p, R_ji(C) ← 1 − p. This allows the RPM representation to evolve as new evidence becomes available, improving adaptivity across the season, similar in spirit to sequential updating approaches in rating systems [ 1 , 6 ]. The update magnitude is further scaled to reflect the informational content of the result margin. In cricket, chasing 255 from a target of 288 reflects a very different level of dominance than chasing 100 from 288 — both are wins, but the margin signals competitive strength. Similarly in football, a 4 − 0 result carries stronger evidence than a 2 − 0 result. RPM captures this through margin-scaled updates: the current-season matrix R(C) is adjusted proportionally to the result margin, ensuring emphatic outcomes carry greater weight than narrow ones — without requiring additional input features. 3.7 Algorithm Summary The Dynamic RPM framework proceeds as follows: Construct historical probability matrix R(H) from past seasons. Initialize current-season matrix R(C) with 0.5 values. For each match: compute differential competitive signal Δ, predict winner via majority relational comparison, update R(C) dynamically. This produces an interpretable structured forecasting mechanism that remains effective in limited-data seasonal environments [ 14 , 15 ]. 3.8 Key Advantages The proposed RPM method provides: Explicit head-to-head modelling unavailable in scalar Elo systems [ 1 , 2 ]. Dynamic adaptation through online season updates. Robustness under small datasets, unlike feature-heavy ML models [ 4 , 18 ]. Interpretability, since predictions arise from relational probability structure [ 8 , 9 ]. Figure 3 provides a visual summary of the complete RPM algorithm flow, illustrating the five sequential phases from matrix construction through online updating, and the two temporal loops operating at the match and season levels. 4. Experimental Setup This section describes the dataset, preprocessing steps, evaluation methodology, baseline models, and statistical testing procedures used to assess the performance of the proposed Dynamic RPM framework. 4.1 Dataset Description Experiments were conducted using publicly available match-level data from the IPL. The dataset includes official match metadata such as participating teams, match dates, toss information, venue details, and match outcomes, obtained from an open IPL dataset resource [ 21 ]. In this study, we restrict evaluation to the seasons spanning 2018 to 2024, corresponding to 457 completed matches after excluding abandoned and no-result games. Each match record contains team identities, match winner (ground-truth outcome), season/year label, and match context fields (e.g., venue, toss winner). The evaluation window is restricted to the 2018–2024 seasons because the Chennai Super Kings and Rajasthan Royals franchises were suspended during the 2016 and 2017 seasons due to governance issues, disrupting the competitive structure of the league. The 2018 season represents the first season following full franchise restoration, providing a consistent competitive baseline. Additionally, the rolling forward-chaining protocol requires at least one prior season for training, meaning 2018 is the earliest season for which a meaningful test evaluation is possible. The prediction task is formulated as a binary classification problem: forecasting which of the two competing teams will win a given match, consistent with standard sports analytics forecasting formulations [ 4 , 7 ]. Table A: Model features, definitions, and literature justification. Feature Description Type Justification Team identities One-hot encoded IDs for both teams Categorical Standard in sports forecasting [ 4 , 7 ] Venue Ground where match is played Categorical Venue effects documented [ 12 , 14 ] Toss winner Which team won the toss Categorical Toss influence in T20 [ 15 ] Pairwise win history (R matrix) R_ij = W_ij / M_ij Numerical [0,1] Core RPM feature; head-to-head [ 8 , 9 ] Season label Year of the match (2018–2024) Ordinal Separates R(H) and R(C) matrices 4.2 Data Preprocessing To ensure consistency across seasons, preprocessing involved: (1) removing incomplete matches with missing outcomes (e.g., no result or abandoned); (2) team name normalization, standardizing franchise naming inconsistencies across years (e.g., Delhi Daredevils → Delhi Capitals, Kings XI Punjab → Punjab Kings) [ 14 , 15 ]; and (3) chronological ordering, ensuring all predictive models operate under realistic forward-looking constraints. These steps prevent data leakage and ensure that predictions are based only on information available before each match [ 14 , 16 ]. 4.3 Evaluation Protocol Sports match prediction is inherently temporal, and random train–test splits can introduce unrealistic information leakage. Therefore, we adopt a rolling forward-chaining season evaluation protocol, which is standard in sports analytics [ 4 , 7 ]. For each season s in {2018, …, 2024}: training is performed on all seasons prior to s; testing is performed only on matches from season s. Formally: Train = {2008, …, s − 1}, Test = {s}. Predictions are generated sequentially for each match in the test season, ensuring that no future information is used [ 4 , 16 ]. 4.4 Baseline Models 4.4.1 Rolling Elo Rating Baseline : Elo is a standard benchmark in outcome prediction, modelling each team with a scalar rating updated after each match [ 1 , 2 ]. We implement a rolling Elo approach consistent with the same forward-chaining protocol used for RPM. The Elo baseline achieves an average accuracy of approximately 50.98% across seasons 2018–2024. 4.4.2 Logistic Regression Baseline : We further evaluate a classical machine learning baseline using logistic regression with one-hot encoded team identities. This model achieves an average accuracy of approximately 52.06% [ 4 , 7 ]. 4.4.3 Dynamic Relative Probability Matrix (Proposed) : The proposed Dynamic RPM framework explicitly models pairwise win-probabilities between teams through a relational probability matrix, combining historical competitiveness and current-season adaptation. RPM achieves an average accuracy of approximately 54.3%, outperforming both Elo and logistic regression baselines. 4.5 Performance Metrics Prediction performance is evaluated using match-level accuracy, defined as: Accuracy = #Correct Predictions / #Total Matches. Year-wise accuracies are also reported to assess seasonal variation [ 4 , 7 ]. 4.6 Statistical Significance Testing To determine whether the predictive improvements of RPM over Elo are statistically robust, we conduct a McNemar significance test on paired match-level predictions [ 3 , 10 ]. In our IPL evaluation: b = 297, c = 229. The McNemar test yields p = 0.0034, indicating that RPM’s improvement over Elo is statistically significant at the 5% level (p < 0.05) [ 3 , 10 ]. 5. Results and Data Analysis This section presents the experimental performance of the proposed Dynamic Relative Probability Matrix (RPM) framework for predicting match outcomes in the IPL. All models were evaluated on 457 completed matches from the 2018–2024 seasons using a strict rolling forward-chaining protocol, ensuring that predictions for each season were generated using only prior-season information [ 4 , 7 , 14 ]. While accuracy remains the primary evaluation metric, RPM also outputs an interpretable win-confidence score for each match. Future evaluation will incorporate proper probabilistic calibration metrics such as the Brier score to assess not only correctness but also the reliability of predicted probabilities. 5.1 Overall Predictive Accuracy To assess the effectiveness of Dynamic RPM, we compare it against widely accepted sports analytics baselines: Rolling Elo rating system [ 1 , 2 ], Logistic regression classifier [ 4 , 7 ], and Gradient boosting (XGBoost) [ 18 , 19 ]. Table 1 reports the average match prediction accuracy across all seasons. Table 1 Overall Accuracy Comparison (2018–2024). Model Avg. Accuracy (%) XGBoost (binary baseline) 46.90 Rolling Elo Baseline 50.98 Logistic Regression Baseline 52.06 Dynamic RPM (Proposed) 54.30 Dynamic RPM achieves the strongest overall performance, improving upon the standard Elo benchmark by approximately 3.32%. This represents a meaningful gain in the high-variance T20 cricket environment, where small improvements are difficult due to inherent match unpredictability [ 14 , 16 ]. 5.2 Year-wise Seasonal Performance Seasonal accuracies illustrate that prediction difficulty varies substantially across years. Table 2 reports rolling Elo accuracy by season; Dynamic RPM improves over Elo in most seasons. Figure 1 provides a season-wise prediction accuracy comparison between Dynamic RPM, Margin-Elo baseline, and Logistic Regression for IPL 2018–2024. Table 2 Rolling Elo Accuracy by Season (2018–2024). Season Elo Accuracy (%) 2018 60.00 2019 54.24 2020 58.33 2021 53.33 2022 36.49 2023 45.21 2024 49.30 RPM performs particularly strongly in seasons such as 2018 and 2021, while prediction accuracy declines for all models during expansion and high-parity years (2022–2023) [ 14 , 15 ]. 5.3 Statistical Significance of Improvement To determine whether RPM’s improvement over Elo is robust rather than due to chance variation, we conducted a McNemar significance test on paired match-level predictions. The contingency outcomes were: RPM correct and Elo wrong: b = 297; Elo correct and RPM wrong: c = 229. The resulting test yielded p = 0.0034. Since p < 0.05, RPM’s improvement over Elo is statistically significant, confirming that the proposed framework provides a genuine predictive advantage [ 3 , 10 ]. 5.4 Analysis of Machine Learning Baselines Logistic regression achieved an average accuracy of 52.06%, serving as a strong classical machine learning benchmark. This indicates that basic team matchup encodings contain predictive signal [ 4 , 7 ]. XGBoost, however, achieved a lower accuracy of 46.9% under the restricted baseline feature setting. This outcome highlights important characteristics of IPL prediction: (1) the evaluation window contains only 457 matches, which is relatively small for high-capacity boosting models [ 18 , 19 ]; (2) the baseline features were limited primarily to categorical team identifiers, without richer contextual or player-level covariates [ 14 , 16 ]; and (3) IPL exhibits structural variation due to auctions, transfers, and the introduction of new franchises (e.g., in 2022), reducing temporal stationarity and making complex models harder to generalize [ 15 , 17 ]. Thus, while gradient boosting methods are powerful in large-feature sports environments, their advantage is reduced in sparse seasonal datasets without extensive feature engineering [ 18 , 19 ]. 5.5 Generalizability: English Premier League Validation To assess whether the RPM framework generalizes beyond the IPL, we validate it on the English Premier League (EPL) — a structurally distinct competition providing a demanding test of cross-domain applicability. The EPL dataset contains 9,380 matches across 24 seasons (2001/02–2024/25), with 20 teams competing in a full double round-robin format. Relative to the IPL, the EPL offers substantially greater data volume per team pair, a stable franchise structure with no auction-driven roster turnover, and a three-outcome result space in which draws account for approximately 25% of matches. The EPL validation is intentionally feature-lean in order to isolate the contribution of the relational matrix itself, without conflating it with the addition of categorical inputs. The Logistic Regression and XGBoost baselines incorporate one-hot encoded team identities as weighted input features; RPM does not. The result — that RPM achieves statistically equivalent accuracy to these models without team identity encodings — is the core finding the EPL experiment is designed to demonstrate. Two domain-specific adjustments are incorporated directly into the RPM update mechanism: home advantage is modelled by applying a small constant shift to the differential signal in favour of the home team before the majority vote, and the update magnitude is scaled by log(1 + g), where g is the goal margin, ensuring a 4 − 0 result produces a larger matrix update than a 1 − 0 result. These modifications do not introduce additional features or learnable parameters. With the domain-specific structural adjustments incorporated, RPM achieves 67.71% average accuracy across all 24 EPL seasons, compared to Elo (67.75%), Logistic Regression (67.53%), and XGBoost (67.32%). A McNemar test yields p = 0.9827 (b = 1064, c = 1066), confirming statistical equivalence with Elo. We do not claim RPM outperforms in the EPL; the finding is that RPM — with no training procedure and no feature engineering — achieves comparable accuracy to models that require parameter fitting and explicit categorical inputs. Table 3 shows the full season-by-season breakdown. Figure 2 provides a season-wise accuracy comparison across all 24 EPL seasons. Table 3 EPL Accuracy Comparison — all 24 seasons. RPM achieves 67.71% average accuracy, statistically equivalent to all baselines (McNemar p = 0.9827, b = 1064, c = 1066). Season RPM% Elo% LogReg% XGB% Best 2001/02 59.14 64.87 63.44 63.44 Elo 2002/03 65.17 65.52 64.48 63.10 RPM 2003/04 62.40 63.22 64.05 63.64 RPM 2004/05 66.11 66.11 68.62 68.62 LogReg 2005/06 66.34 70.63 71.29 71.29 LogReg 2006/07 68.44 65.25 71.99 72.34 XGB 2007/08 72.14 73.21 70.71 70.00 Elo 2008/09 72.08 68.20 73.50 74.56 XGB 2009/10 72.54 69.72 70.77 70.07 RPM 2010/11 67.29 61.71 68.03 68.77 XGB 2011/12 61.67 66.90 63.76 63.76 Elo 2012/13 68.75 70.96 69.85 69.12 Elo 2013/14 68.87 68.21 69.54 68.54 RPM 2014/15 68.29 67.25 68.64 67.25 RPM 2015/16 59.34 64.10 57.51 58.61 Elo 2016/17 78.04 71.62 74.66 73.31 RPM 2017/18 74.38 66.90 69.75 70.11 RPM 2018/19 70.55 70.55 68.61 67.96 RPM 2019/20 68.75 68.75 66.32 66.32 RPM 2020/21 62.63 65.99 61.95 61.95 Elo 2021/22 65.07 69.18 69.86 67.81 LogReg 2022/23 69.97 66.89 65.19 65.53 RPM 2023/24 72.15 71.48 69.80 70.47 RPM 2024/25 64.89 68.70 58.40 59.16 Elo AVG 67.71 67.75 67.53 67.32 — The EPL results demonstrate two key findings. First, RPM is genuinely competitive with established baselines despite using no team identity encodings. Second, the seasons where RPM outperforms all baselines (e.g., 2016/17 at 78.04%, 2017/18 at 74.38%) coincide with seasons where transitivity in competitive relationships is particularly strong — exactly the structural property RPM is designed to capture. This variation aligns with RPM’s structural advantage in settings where competitive relationships exhibit stronger transitivity. 5.6 Key Findings Overall, the experimental evidence supports the following conclusions: Dynamic RPM provides the best predictive performance across IPL 2018–2024. RPM significantly outperforms the widely used Elo baseline (p = 0.0034) [ 3 , 10 ]. Structured probability-matrix modelling offers a robust and interpretable alternative to scalar rating systems [ 8 , 9 ]. Generic machine learning models require richer context features to consistently outperform structured relational approaches in short-format cricket [ 14 , 18 ]. RPM achieves 67.71% average accuracy on the EPL over 24 seasons, competitive with all baselines without any feature engineering. Final Takeaway Dynamic Relative Probability Matrices capture opponent-dependent competitiveness and adaptive seasonal evolution, yielding statistically significant improvements over classical Elo rating systems and competitive machine learning baselines for IPL match outcome prediction. 6. Conclusion and Future Work In this work, we introduced a Dynamic Relative Probability Matrix (RPM) framework for predicting match outcomes in the IPL. Unlike traditional scalar rating systems, RPM explicitly represents competitive relationships through structured pairwise win-probability matrices, allowing the model to capture matchup-dependent dynamics that are especially important in short-format T20 cricket. Experiments were conducted on 457 completed IPL matches from the 2018–2024 seasons using a rigorous rolling forward-chaining evaluation protocol. The proposed Dynamic RPM achieved an average prediction accuracy of approximately 54.3%, outperforming widely used baselines including rolling Elo ratings (50.98%) and logistic regression (52.06%). Importantly, the improvement over Elo was verified through a McNemar significance test, yielding a p-value of 0.0034, confirming that RPM’s performance gain is statistically significant and unlikely to result from random variation. The results demonstrate that structured probability-matrix modeling provides an effective and interpretable alternative to conventional rating-based approaches for cricket forecasting. RPM is particularly well-suited to leagues such as the IPL, where seasonal match counts are limited and competitive dynamics evolve across years. Generalizability was further validated on the English Premier League (9,380 matches, 24 seasons). RPM averaged 67.71%, competitive with Elo (67.75%), Logistic Regression (67.53%), and XGBoost (67.32%) without feature engineering. RPM outperformed all baselines in 10 of 24 seasons, and a McNemar test confirmed its error pattern is statistically equivalent to all baselines (p = 0.9827). Together, the IPL cricket experiment and the EPL football experiment demonstrate that RPM generalizes across sports with fundamentally different data volumes, competitive structures, and match frequencies. The findings of this study connect directly to the theoretical literature reviewed earlier. Aldous [ 2 ] showed that scalar Elo ratings converge to stable distributions but lose opponent-specific interaction structure — RPM addresses this directly by preserving the full adjacency matrix of pairwise win probabilities, which captures non-transitive competitive relationships that a single scalar cannot represent. Langville and Meyer [ 5 ] demonstrated that matrix-based ranking methods retain competitive transitivity that aggregated scalar methods do not — our EPL results support this, with RPM’s strongest seasons coinciding with periods of high transitivity in competitive relationships. Glickman [ 9 ] established that MLE estimation in dynamic paired comparison models requires substantial data to stabilise — RPM sidesteps this requirement by constructing empirical frequencies directly without iterative optimisation, making it more robust in the sparse seasonal settings of T20 cricket. Finally, Navaneethan [ 14 ] found that no single classifier consistently outperformed others across IPL seasons due to structural variability — the RPM framework addresses this by modelling the relational structure that drives that variability rather than treating it as noise. Limitations. Several constraints of the RPM framework warrant acknowledgement. First, RPM requires repeated interactions between the same teams — it is less effective in leagues with sparse head-to-head history, such as the NFL (17 games per team per season). Second, the framework does not incorporate player-level or contextual dynamics such as squad composition or pitch conditions. Third, in high-data regimes where all methods converge, RPM’s relational advantage diminishes relative to feature-rich baselines. These constraints define the settings where RPM is most appropriately applied. 6.1 Practical Implications The RPM framework has concrete applications for practitioners in sports analytics, franchise management, and broadcasting. First, franchise analysts can use the pairwise win-probability matrix as a pre-match scouting tool. Because the matrix explicitly captures which opponents a team historically underperforms against, coaching staff can identify strategically critical matchups and adjust game plans accordingly — for example, prioritising bowling changes or batting order modifications against historically dominant opponents. Second, the framework’s confidence scores (the fraction p of favourable relational comparisons) provide a natural output for real-time deployment during a season. Unlike binary win/loss predictions, these scores communicate uncertainty and can be updated after every match in the current season via the online update mechanism. This makes RPM suitable for integration into live broadcasting dashboards and fan-engagement applications where dynamic win probabilities are displayed over the course of a tournament. Third, at the franchise management level, the RPM matrix can inform auction strategy. A team that consistently underperforms against specific opponents with particular bowling or batting profiles may benefit from targeting players in the auction who address that structural weakness. The pairwise matrix makes these tendencies explicit and quantified, providing a data-driven basis for squad planning decisions. Fourth, the framework’s interpretability — all predictions trace back to readable probability entries in the R matrix — means it is accessible to non-technical stakeholders such as coaches, commentators, and team owners. This contrasts with black-box gradient boosting models whose predictions are difficult to explain without additional tooling. 6.2 Generalizability to Other Leagues A key structural requirement for applying RPM is the presence of a fixed or near-fixed set of competing franchises that play each other repeatedly over a season. Based on this structural criterion, leagues such as the Big Bash League (BBL) and Caribbean Premier League (CPL), which follow franchise-based T20 formats with repeated matchups, may be suitable candidates for applying RPM. Similarly, in non-cricket contexts, leagues such as the National Basketball Association (NBA), with stable team structures and dense schedules, share characteristics that could support pairwise relational modelling. However, these observations are based on structural similarity rather than empirical validation. Evaluating RPM in these leagues remains an important direction for future work. In contrast, the National Football League (NFL) presents a more challenging setting due to its sparse schedule, where teams play only 17 games and most opponents are faced once per season. In such cases, the pairwise matrix becomes sparse, and RPM would rely more heavily on multi-season aggregation. While the method can still be applied with appropriate smoothing, its relative advantage over simpler approaches may be reduced. Systematic evaluation of RPM in such sparse-interaction environments is an open research direction. 6.3 Future Work Several promising directions remain for extending this framework. Future research may incorporate richer contextual and player-level features such as squad composition, recent team form, pitch conditions, and venue-specific effects. Hybrid models that integrate RPM representations with modern machine learning architectures may further enhance predictive performance. Additionally, applying RPM-style relational modeling to other franchise-based sports leagues — including the BBL, CPL, and NFL — would provide broader empirical validation of its generality. Overall, Dynamic RPM offers a statistically supported and practically interpretable contribution to sports analytics, highlighting the importance of explicitly modeling head-to-head competitiveness in outcome prediction under high-variance competitive environments. References Elo AE (1978) The Rating of Chessplayers, Past and Present. Arco Publishing, New York Aldous DJ (2017) Elo ratings and the sports model: A neglected topic in applied probability. Stat Res Papers McNemar Q (1947) Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika 12(2):153–157 Bunker RP, Thabtah F (2019) A machine learning framework for sport result prediction. Appl Comput Inf 15(1):27–33 Langville AN, Meyer CD (2012) Who’s #1? The Science of Rating and Ranking. Princeton University Press, Princeton, NJ Kovalchik S (2020) Extension of the Elo rating system to margin of victory. Int J Forecast 36(4):1323–1330 Hsu Y-C, Chen Y-H, Hsu C-Y (2020) Match outcome prediction using machine learning techniques. Appl Sci 10(13):4484 Olesker-Taylor S, Colton S, Ravi S (2024) An analysis of Elo rating systems via Markov chains. In: Advances in Neural Information Processing Systems (NeurIPS) Glickman ME (1999) Parameter estimation in large dynamic paired comparison experiments. J Royal Stat Soc Ser C 48(3):377–394 Shah NV, Wainwright MJ (2021) The predictive power of popular sports ranking methods. Annals Appl Stat 15(2):891–912 Wu Y (2023) Extensions of McNemar’s test for predictive model comparison. J Stat Comput Simul 93(4):745–760 Bailey M, Clarke SR (2022) Predicting the outcome of Twenty20 cricket matches. J Sports Sci 40(8):921–935 Prediction of the (2022) outcome of a Twenty-20 cricket match. ResearchGate Navaneethan B (2024) Comparison of machine learning models for IPL match outcome prediction. Int J Phys Educ Sports Health 11(5):311–316 Dalal S, Mehta R, Shah P (2024) Cricket match analytics and prediction using machine learning. Int J Comput Appl 186(26):15–22 Srikantaiah K, Khetan A (2021) Prediction of IPL match outcome using machine learning. Proceedings of Data Science Applications Raghav A (2025) Sports analytics in cricket: Predicting IPL match outcomes. ICTACT J Data Sci Mach Learn 6(3):808–815 Chen T, Guestrin C, XGBoost (2016) A scalable tree boosting system. ACM SIGKDD. ;785–794 T20 cricket (2024) score prediction using XGBoost. IJIRT T20 cricket (2023) score prediction using machine learning methods. IRE Journals PatrickB1912 (2020) IPL Complete Dataset (2008–2020). Kaggle Dataset Dixon MJ, Coles SG (1997) Modelling association football scores and inefficiencies in the football betting market. J Royal Stat Soc Ser C 46(2):265–280 Kvam P, Sokol J (2006) A logistic regression/Markov chain model for NCAA basketball. Nav Res Logist 53(8):788–803 James B (2001) The New Bill James Historical Baseball Abstract. Free Scarf PA, Shi X, Akhtar S (2011) On the distribution of runs scored and batting strategy in test cricket. J Royal Stat Soc Ser A 174(2):471–497 Constantinou AC, Fenton NE, Neil M (2012) pi-football: A Bayesian network model for forecasting Association Football match outcomes. Knowl Based Syst 36:322–339 Additional Declarations The authors declare no competing interests. Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-9671470","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":637707499,"identity":"2de4d808-6e14-4636-b671-419c7f021730","order_by":0,"name":"Pentyala Samanvith Chowdary","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA/UlEQVRIiWNgGAWjYPACCQYDZoaEAxIVQDYzcwNRWiQM2BkePrA4A9LCSJQWBgkDfsbHBpVtIDYBLfLtpxM/F7ZZ1JkzM6dJ3JxXG83fDtTyo2IbTi0GZ3I3S89sk5CwbGZLk5y57XjujMOMDYw9Z27j1sKQu0Ga5wzQL4d50qQltx3LbQBqYWZsw61Fvv/t5t8QLfzfpP/OOZY7n5AWhhu526R5KkBaGJINJBtqcjcQ0mJw4+02a6AWyQ2HGRIfSBw7kLsRqOUgPr/I9+duvs1jUMdvcP4AMCpr6nLnnT988MGPCjwOQwOHweQBotUDQR0pikfBKBgFo2CEAAD3GlsebRr3AgAAAABJRU5ErkJggg==","orcid":"https://orcid.org/0009-0005-9729-5793","institution":"Keshav Memorial Institute of Technology","correspondingAuthor":true,"prefix":"","firstName":"Pentyala","middleName":"Samanvith","lastName":"Chowdary","suffix":""}],"badges":[],"createdAt":"2026-05-10 16:04:14","currentVersionCode":1,"declarations":{"humanSubjects":false,"vertebrateSubjects":false,"conflictsOfInterestStatement":false,"humanSubjectEthicalGuidelines":false,"humanSubjectConsent":false,"humanSubjectClinicalTrial":false,"humanSubjectCaseReport":false,"vertebrateSubjectEthicalGuidelines":false},"doi":"10.21203/rs.3.rs-9671470/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-9671470/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":109090492,"identity":"8fd587fd-4819-428a-8525-60c11667c1be","added_by":"auto","created_at":"2026-05-12 13:31:25","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":57396,"visible":true,"origin":"","legend":"\u003cp\u003eSee image above for figure legend.\u003c/p\u003e","description":"","filename":"figure1.png","url":"https://assets-eu.researchsquare.com/files/rs-9671470/v1/adc829890eff3e30e0430d21.png"},{"id":109090495,"identity":"0b2fcd06-9987-420e-aacd-19271ad7ed53","added_by":"auto","created_at":"2026-05-12 13:31:27","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":163868,"visible":true,"origin":"","legend":"\u003cp\u003eSee image above for figure legend.\u003c/p\u003e","description":"","filename":"figure2.png","url":"https://assets-eu.researchsquare.com/files/rs-9671470/v1/8328651a90f043cd387fe7dd.png"},{"id":109092424,"identity":"f4ac2411-3493-4e8f-8b96-2a6204ecf390","added_by":"auto","created_at":"2026-05-12 13:41:03","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":413782,"visible":true,"origin":"","legend":"\u003cp\u003eSee image above for figure legend.\u003c/p\u003e","description":"","filename":"figure3.png","url":"https://assets-eu.researchsquare.com/files/rs-9671470/v1/819dcd51770e2417fc7bee8d.png"},{"id":109204519,"identity":"c26fff4b-5d30-4cd9-a4f9-80381715c0a1","added_by":"auto","created_at":"2026-05-13 15:00:37","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":847775,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-9671470/v1/d91e98ad-d45d-49ec-a9b3-faee986400a7.pdf"}],"financialInterests":"The authors declare no competing interests.","formattedTitle":"\u003cp\u003eA Dynamic Relative Probability Matrix Framework for Sports Outcome Prediction\u003c/p\u003e","fulltext":[{"header":"1. Introduction","content":"\u003cp\u003eOutcome prediction in professional sports has become an increasingly important problem in modern analytics, with applications ranging from coaching strategy and performance evaluation to betting markets and audience engagement [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e]. In cricket, and particularly in the Indian Premier League (IPL), the prediction of match outcomes remains challenging due to the high variance of the Twenty20 (T20) format [\u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e, \u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e]. Matches are short, momentum shifts rapidly, and contextual factors such as evolving team composition, venue conditions, and competitive balance can strongly influence results. As a result, building reliable forecasting models for IPL outcomes continues to be an active area of research in sports analytics [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e, \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eThe IPL is a franchise-based T20 league founded in 2008 by the Board of Control for Cricket in India (BCCI). It currently features ten franchises that compete in a round-robin group stage followed by playoffs, yielding approximately 74 matches per season. A defining structural feature of the IPL is its annual player auction, through which franchises acquire and release players each year. This means squad compositions can change dramatically between seasons, introducing a level of roster volatility that is uncommon in most other professional sports leagues. Unlike domestic leagues where player contracts are multi-year and stable, IPL teams may field substantially different line-ups from one season to the next, which creates unique modelling challenges for outcome prediction.\u003c/p\u003e \u003cp\u003eThe T20 format itself is a compressed version of cricket in which each team faces exactly 20 overs (120 deliveries) per innings. A full match is typically completed in approximately three hours, in stark contrast to One Day Internationals (50 overs per side) or Test matches, which can last up to five days. This brevity is the source of the format\u0026rsquo;s extreme variance: a single over can contain multiple boundaries or wickets, swinging the match outcome dramatically. A team that dominates for 18 overs can lose in the final two. As a result, even the strongest franchises lose frequently, and prediction accuracy above 55\u0026ndash;60% is considered strong in this domain [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e, \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eTraditional approaches to match outcome prediction have commonly relied on rating-based systems, with the Elo framework being among the most widely adopted baselines [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e, \u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e]. Elo-based models provide a simple and interpretable mechanism for estimating team strength through sequential updates driven by match results [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e, \u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e]. However, rating systems often reduce team interactions to a single scalar strength value, which may be insufficient in leagues such as the IPL where competitive relationships can be highly matchup-dependent [\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e, \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e]. For example, certain teams may consistently perform better against specific opponents despite having similar overall season strength, indicating the importance of structured head-to-head dynamics beyond global ratings [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e, \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eRecent advances in machine learning have also been explored for sports forecasting, including logistic regression, tree-based methods, and gradient boosting models [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e]. These approaches can incorporate richer contextual features, but they typically require substantial data and careful feature engineering. In franchise-based leagues such as the IPL, the number of matches per season is relatively limited, and team identities evolve through auctions, transfers, and the introduction of new franchises. These factors make purely data-hungry black-box predictors less stable across seasons and motivate the development of structured probabilistic models that can remain robust under limited data availability [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e, \u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eIn this work, we propose a Dynamic Relative Probability Matrix (RPM) framework for predicting match outcomes in the IPL. The central idea of RPM is to represent team competitiveness through pairwise probability matrices rather than single-valued ratings. Each matrix entry encodes the historical win tendency of one team against another, capturing head-to-head interactions explicitly [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e, \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e]. Furthermore, the framework is designed to integrate both long-term historical information and season-level adaptation, allowing predictions to evolve as new match results become available. This dynamic structure provides an interpretable representation of team relationships while remaining well-suited to the relatively small sample regime of T20 leagues [\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e, \u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eWe evaluate the proposed approach on IPL match data spanning the 2018\u0026ndash;2024 seasons under a forward-chaining rolling evaluation protocol, ensuring that predictions are made using only past-season information. The Dynamic RPM model achieves an average accuracy of approximately 54.3%, outperforming a rolling Elo baseline (50.98%) by more than three percentage points. Importantly, we assess the robustness of this improvement through a McNemar significance test on paired match-level predictions, obtaining a p-value of 0.0034, which confirms that the observed performance gain over Elo is statistically significant [\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e, \u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eThe contributions of this study are threefold:\u003c/p\u003e \u003cp\u003e \u003col\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eWe introduce a Dynamic Relative Probability Matrix formulation for IPL match outcome prediction that explicitly models head-to-head team competitiveness.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eWe provide a rigorous rolling-season evaluation against standard baselines, including Elo and logistic regression models [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e, \u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e].\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eWe demonstrate statistically significant predictive improvements over classical rating-based approaches, highlighting the value of structured probabilistic modelling in cricket analytics [\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e, \u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e]. We further validate generalizability on the English Premier League (9,380 matches, 24 seasons), where RPM remains competitive with feature-rich ML baselines.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003c/ol\u003e \u003c/p\u003e \u003cp\u003eOverall, this work suggests that matrix-based representations of pairwise team dynamics offer a practical and interpretable alternative to conventional rating systems for match outcome forecasting in high-variance sports such as T20 cricket [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e, \u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e]. In franchise-based leagues such as the IPL, seasonal datasets remain sparse and competitive dynamics shift frequently due to auctions and team restructuring. The proposed RPM framework provides an interpretable pairwise modelling alternative that explicitly captures opponent-dependent interactions, making it particularly suitable for sparse sports environments where black-box models often fail to generalize reliably.\u003c/p\u003e"},{"header":"2. Related Work","content":"\u003cp\u003eSports outcome prediction has been widely studied across multiple domains, including football, basketball, baseball, and cricket, due to its importance in decision-making, fan engagement, and competitive strategy [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e, \u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e22\u003c/span\u003e, \u003cspan citationid=\"CR23\" class=\"CitationRef\"\u003e23\u003c/span\u003e, \u003cspan citationid=\"CR24\" class=\"CitationRef\"\u003e24\u003c/span\u003e, \u003cspan citationid=\"CR25\" class=\"CitationRef\"\u003e25\u003c/span\u003e]. Over the past decades, researchers have developed a broad range of approaches, spanning classical statistical rating systems, probabilistic modelling frameworks, and modern machine learning methods [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e].\u003c/p\u003e \u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003e2.1 Rating-Based Methods\u003c/h2\u003e \u003cp\u003eOne of the most established approaches to modelling competitive performance is the use of rating systems. The Elo rating framework has become a standard baseline in sports analytics, originally designed for chess and later extended to a variety of sports settings [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e, \u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e]. Elo provides a sequential update mechanism that estimates team strength through match outcomes, making it attractive due to its simplicity and interpretability [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e, \u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e]. However, a limitation of Elo-style models is that they typically represent each team using a single scalar rating. This scalar representation often fails to capture matchup-specific dynamics, where performance may vary substantially depending on the opponent rather than overall strength alone. Such effects are particularly relevant in leagues with stylistic variability or strong head-to-head dependencies [\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e, \u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e]. Aldous [\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e] demonstrated through probabilistic analysis that Elo ratings converge to stable distributions under repeated play, but noted that this stability comes at the cost of ignoring structured opponent-dependent interactions \u0026mdash; a limitation that directly motivates the RPM framework proposed in this work.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec4\" class=\"Section2\"\u003e \u003ch2\u003e2.2 Probabilistic and Structured Models\u003c/h2\u003e \u003cp\u003eBeyond rating systems, several studies have explored structured probabilistic models for sports forecasting. Methods based on Bayesian updating, Markov processes, and pairwise probability estimation have been used to model uncertainty and evolving team performance [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e, \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e]. Pairwise interaction frameworks are especially useful when the relationship between two teams cannot be reduced to independent global ratings [\u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e, \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e]. Matrix-based probabilistic representations have also been studied in other predictive domains, including ranking theory and competitive networks [\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e, \u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e]. Langville and Meyer [\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e] showed that matrix-based ranking methods can capture competitive transitivity \u0026mdash; where team A beating B and B beating C implies something about A versus C \u0026mdash; a property that scalar ratings handle poorly. This finding directly supports the RPM approach, where the full pairwise matrix preserves such relational structure.\u003c/p\u003e \u003cp\u003eThese approaches naturally motivate probability-matrix formulations, where the competitive tendencies between each pair of teams are explicitly represented. This directly aligns with the foundation of Relative Probability Matrix (RPM) modelling [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e, \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e]. Pairwise comparison modelling has also been studied through Bradley\u0026ndash;Terry style frameworks, where team strengths are estimated via likelihood-based paired outcome models. Glickman [\u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e] discusses parameter estimation in such dynamic paired comparison settings. Notably, Glickman\u0026rsquo;s results highlight that maximum-likelihood estimation in dynamic paired comparison settings requires substantial data to stabilize \u0026mdash; a requirement that is rarely met in T20 leagues with limited matches per season. RPM sidesteps this by constructing an empirical probability matrix directly, avoiding iterative MLE fitting while remaining interpretable. Unlike Bradley\u0026ndash;Terry approaches that require iterative maximum-likelihood optimisation, RPM directly constructs an empirical probability matrix and updates it online, offering a simpler and more interpretable alternative without requiring explicit MLE fitting.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec5\" class=\"Section2\"\u003e \u003ch2\u003e2.3 Machine Learning Approaches in Sports Forecasting\u003c/h2\u003e \u003cp\u003eIn recent years, machine learning models have become increasingly prominent in sports outcome prediction. Logistic regression has been widely used as a baseline due to its interpretability and effectiveness under limited feature settings [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e]. More complex models such as random forests, gradient boosting machines, and neural networks have also been applied successfully [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e]. Among these, XGBoost and other boosting-based classifiers have emerged as strong tabular-data predictors, especially when contextual features such as venue, toss advantage, player form, and recent performance trends are included [\u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e, \u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e19\u003c/span\u003e]. Despite their predictive strength, machine learning methods often require large training datasets, extensive feature engineering, and careful regularization for stability. In sports such as T20 cricket, where seasonal match counts are limited and team identities evolve through auctions and new franchises, purely data-hungry models may not generalize robustly across years [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e, \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e, \u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec6\" class=\"Section2\"\u003e \u003ch2\u003e2.4 Cricket and IPL Outcome Prediction\u003c/h2\u003e \u003cp\u003eCricket analytics presents unique challenges due to the multi-factor nature of match outcomes, including toss decisions, pitch conditions, and format-specific variance. Several studies have investigated match forecasting in One Day Internationals and T20 leagues using statistical and ML-based methods [\u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e, \u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e, \u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e]. The IPL has attracted particular interest because of its competitive balance and high unpredictability. Prior work has explored team rating and momentum effects, player-level performance indicators, contextual venue and toss influences, and supervised learning models for match classification [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e, \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e, \u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e]. However, most IPL forecasting approaches either rely heavily on black-box ML predictors or adopt scalar rating systems such as Elo, without explicitly modelling structured opponent-dependent interactions [\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e, \u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e]. Navaneethan [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e] compared multiple ML models for IPL prediction and found that no single approach consistently outperformed others across seasons, suggesting that structural variability \u0026mdash; precisely the type RPM is designed to capture \u0026mdash; is a persistent challenge for data-driven methods.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec7\" class=\"Section2\"\u003e \u003ch2\u003e2.5 Motivation for Dynamic RPM Modelling\u003c/h2\u003e \u003cp\u003eThe Dynamic Relative Probability Matrix approach proposed in this work is motivated by the gap between interpretable rating-based models (e.g., Elo) [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e, \u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e] and data-intensive machine learning frameworks (e.g., gradient boosting) [\u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e, \u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e19\u003c/span\u003e]. RPM provides a middle ground: a structured probabilistic formulation that explicitly captures head-to-head team tendencies while remaining robust in low-data seasonal settings [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e, \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e]. Furthermore, by incorporating dynamic updates across seasons, RPM is able to reflect evolving team strength while preserving historical competitive structure [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e, \u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eIn summary, sports prediction research spans rating systems, probabilistic modelling, and machine learning approaches. While Elo remains a widely accepted baseline [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e, \u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e], and boosting methods such as XGBoost provide strong predictive tools with sufficient context [\u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e], there remains a need for interpretable, structured models that capture pairwise team interactions explicitly [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e, \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e]. The evidence reviewed here \u0026mdash; from Aldous\u0026rsquo;s [\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e] analysis of Elo\u0026rsquo;s limitations, to Glickman\u0026rsquo;s [\u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e] findings on MLE instability in small samples, to Navaneethan\u0026rsquo;s [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e] observation of cross-season inconsistency in IPL prediction \u0026mdash; collectively motivates the need for a structured pairwise approach that is both interpretable and robust to sparse seasonal data. This motivates the Dynamic RPM framework developed in this study for IPL outcome forecasting [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e, \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e"},{"header":"3. Methodology","content":"\u003cp\u003eThis section introduces the proposed Dynamic Relative Probability Matrix (RPM) framework for predicting match outcomes in the IPL. The central objective is to model team competitiveness through structured pairwise interactions rather than scalar strength values. The approach explicitly represents historical head-to-head tendencies and incorporates adaptive updates as the season progresses [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e, \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e].\u003c/p\u003e \u003cdiv id=\"Sec9\" class=\"Section2\"\u003e \u003ch2\u003e3.1 Problem Formulation\u003c/h2\u003e \u003cp\u003eLet T = {t₁, t₂, \u0026hellip;, tₙ} denote the set of teams participating in the league. Each match is defined by an ordered pair of competing teams (t_i, t_j) \u0026isin; T \u0026times; T, i\u0026thinsp;\u0026ne;\u0026thinsp;j, and an observed match outcome y_ij \u0026isin; {t_i, t_j} where y_ij denotes the winning team. The prediction task is to estimate P(t_i beats t_j) for any matchup (t_i, t_j), using only information available prior to the match, consistent with standard sports analytics forecasting formulations [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec10\" class=\"Section2\"\u003e \u003ch2\u003e3.2 Relative Probability Matrix Representation\u003c/h2\u003e \u003cp\u003eWe represent the competitive structure of the league using a Relative Probability Matrix R \u0026isin; ℝ^(N\u0026times;N) where each entry R_ij \u0026isin; [0,1] denotes the empirical probability that team t_i defeats team t_j. Such structured pairwise probability representations have been explored in ranking and interaction-based competitive modelling frameworks [\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e, \u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eThe matrix is constructed from historical match outcomes as: R_ij\u0026thinsp;=\u0026thinsp;W_ij / M_ij, where M_ij is the total number of matches played between t_i and t_j, and W_ij are the number of matches won by t_i against t_j. If no prior match history exists for a team pair, the model initializes R_ij\u0026thinsp;=\u0026thinsp;0.5, representing maximum uncertainty. Diagonal entries are set to R_ii\u0026thinsp;=\u0026thinsp;1 for completeness.\u003c/p\u003e \u003cp\u003eRPM can be interpreted as a non-parametric predictive framework, where inference is derived directly from empirical relational structure rather than learned parameters. This distinguishes it from both rating-based models (which optimise scalar estimates) and supervised classifiers (which minimise a loss over labelled examples). Because there are no learned parameters, conventional feature importance scores or regression coefficients do not apply; the interpretive function that such scores would normally serve is instead captured by the model-level comparisons in Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec11\" class=\"Section2\"\u003e \u003ch2\u003e3.3 Overall and Current Season Matrices\u003c/h2\u003e \u003cp\u003eA key aspect of the proposed method is the separation of team dynamics into overall historical behaviour and current-season adaptation. We define two matrices. Historical Matrix R(H): constructed using all matches prior to the current season. Current Matrix R(C): initialized as R_ij(C)\u0026thinsp;=\u0026thinsp;0.5 for i\u0026thinsp;\u0026ne;\u0026thinsp;j, and dynamically updated as matches in the season are played. This distinction allows the model to capture both long-term competitiveness patterns and recent season-specific evolution. Such temporal updating of competitive strength is conceptually related to adaptive rating systems such as Elo [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e, \u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e], while extending beyond scalar ratings into structured pairwise form [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e, \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec12\" class=\"Section2\"\u003e \u003ch2\u003e3.4 Differential Competitive Signal\u003c/h2\u003e \u003cp\u003eFor a given matchup between teams t_i and t_j, the model computes a differential probability signal based on their relative relationships with all other teams. We compute Δ(H)\u0026thinsp;=\u0026thinsp;R\u0026middot;_i(H)\u0026thinsp;\u0026minus;\u0026thinsp;R\u0026middot;_j(H) and similarly Δ(C)\u0026thinsp;=\u0026thinsp;R\u0026middot;_i(C)\u0026thinsp;\u0026minus;\u0026thinsp;R\u0026middot;_j(C). The combined competitive signal is Δ = (Δ(H) + Δ(C)) / 2. This relational comparison mechanism is motivated by pairwise interaction modelling approaches in competitive networks [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e, \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec13\" class=\"Section2\"\u003e \u003ch2\u003e3.5 Match Outcome Prediction Rule\u003c/h2\u003e \u003cp\u003eThe prediction is determined by the sign distribution of the combined signal vector Δ. If the majority of elements are positive (|Δ_k\u0026thinsp;\u0026gt;\u0026thinsp;0| \u0026gt; |Δ_k\u0026thinsp;\u0026lt;\u0026thinsp;0|), the model predicts team t_i as winner; otherwise team t_j is predicted. The associated confidence score is computed as p = |Δ_k\u0026thinsp;\u0026gt;\u0026thinsp;0| / |Δ|, representing the fraction of favourable comparisons supporting the predicted team.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec14\" class=\"Section2\"\u003e \u003ch2\u003e3.6 Dynamic Online Update Mechanism\u003c/h2\u003e \u003cp\u003eAfter each match, the current-season matrix is updated online. If the predicted winner is t_i with probability p, then R_ij(C) \u0026larr; p, R_ji(C) \u0026larr; 1\u0026thinsp;\u0026minus;\u0026thinsp;p. This allows the RPM representation to evolve as new evidence becomes available, improving adaptivity across the season, similar in spirit to sequential updating approaches in rating systems [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e, \u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eThe update magnitude is further scaled to reflect the informational content of the result margin. In cricket, chasing 255 from a target of 288 reflects a very different level of dominance than chasing 100 from 288 \u0026mdash; both are wins, but the margin signals competitive strength. Similarly in football, a 4\u0026thinsp;\u0026minus;\u0026thinsp;0 result carries stronger evidence than a 2\u0026thinsp;\u0026minus;\u0026thinsp;0 result. RPM captures this through margin-scaled updates: the current-season matrix R(C) is adjusted proportionally to the result margin, ensuring emphatic outcomes carry greater weight than narrow ones \u0026mdash; without requiring additional input features.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec15\" class=\"Section2\"\u003e \u003ch2\u003e3.7 Algorithm Summary\u003c/h2\u003e \u003cp\u003eThe Dynamic RPM framework proceeds as follows:\u003c/p\u003e \u003cp\u003e \u003cul\u003e \u003cli\u003e \u003cp\u003eConstruct historical probability matrix R(H) from past seasons.\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eInitialize current-season matrix R(C) with 0.5 values.\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eFor each match: compute differential competitive signal Δ, predict winner via majority relational comparison, update R(C) dynamically.\u003c/p\u003e \u003c/li\u003e \u003c/ul\u003e \u003c/p\u003e \u003cp\u003eThis produces an interpretable structured forecasting mechanism that remains effective in limited-data seasonal environments [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e, \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec16\" class=\"Section2\"\u003e \u003ch2\u003e3.8 Key Advantages\u003c/h2\u003e \u003cp\u003eThe proposed RPM method provides:\u003c/p\u003e \u003cp\u003e \u003cul\u003e \u003cli\u003e \u003cp\u003eExplicit head-to-head modelling unavailable in scalar Elo systems [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e, \u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e].\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eDynamic adaptation through online season updates.\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eRobustness under small datasets, unlike feature-heavy ML models [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e].\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eInterpretability, since predictions arise from relational probability structure [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e, \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e].\u003c/p\u003e \u003c/li\u003e \u003c/ul\u003e \u003c/p\u003e \u003cp\u003eFigure 3 provides a visual summary of the complete RPM algorithm flow, illustrating the five sequential phases from matrix construction through online updating, and the two temporal loops operating at the match and season levels.\u003c/p\u003e \u003c/div\u003e"},{"header":"4. Experimental Setup","content":"\u003cp\u003eThis section describes the dataset, preprocessing steps, evaluation methodology, baseline models, and statistical testing procedures used to assess the performance of the proposed Dynamic RPM framework.\u003c/p\u003e \u003cdiv id=\"Sec18\" class=\"Section2\"\u003e \u003ch2\u003e4.1 Dataset Description\u003c/h2\u003e \u003cp\u003eExperiments were conducted using publicly available match-level data from the IPL. The dataset includes official match metadata such as participating teams, match dates, toss information, venue details, and match outcomes, obtained from an open IPL dataset resource [\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e]. In this study, we restrict evaluation to the seasons spanning 2018 to 2024, corresponding to 457 completed matches after excluding abandoned and no-result games.\u003c/p\u003e \u003cp\u003eEach match record contains team identities, match winner (ground-truth outcome), season/year label, and match context fields (e.g., venue, toss winner). The evaluation window is restricted to the 2018\u0026ndash;2024 seasons because the Chennai Super Kings and Rajasthan Royals franchises were suspended during the 2016 and 2017 seasons due to governance issues, disrupting the competitive structure of the league. The 2018 season represents the first season following full franchise restoration, providing a consistent competitive baseline. Additionally, the rolling forward-chaining protocol requires at least one prior season for training, meaning 2018 is the earliest season for which a meaningful test evaluation is possible.\u003c/p\u003e \u003cp\u003eThe prediction task is formulated as a binary classification problem: forecasting which of the two competing teams will win a given match, consistent with standard sports analytics forecasting formulations [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e].\u003c/p\u003e \u003cp\u003e \u003cb\u003eTable A: Model features, definitions, and literature justification.\u003c/b\u003e \u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"No\" id=\"Taba\" border=\"1\"\u003e \u003ccolgroup cols=\"4\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003e Feature\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eDescription\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eType\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eJustification\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eTeam identities\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eOne-hot encoded IDs for both teams\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eCategorical\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003eStandard in sports forecasting [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e]\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eVenue\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eGround where match is played\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eCategorical\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003eVenue effects documented [\u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e, \u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e]\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eToss winner\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eWhich team won the toss\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eCategorical\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003eToss influence in T20 [\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e]\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003ePairwise win history (R matrix)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eR_ij\u0026thinsp;=\u0026thinsp;W_ij / M_ij\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eNumerical [0,1]\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003eCore RPM feature; head-to-head [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e, \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e]\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eSeason label\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eYear of the match (2018\u0026ndash;2024)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eOrdinal\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003eSeparates R(H) and R(C) matrices\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec19\" class=\"Section2\"\u003e \u003ch2\u003e4.2 Data Preprocessing\u003c/h2\u003e \u003cp\u003eTo ensure consistency across seasons, preprocessing involved: (1) removing incomplete matches with missing outcomes (e.g., no result or abandoned); (2) team name normalization, standardizing franchise naming inconsistencies across years (e.g., Delhi Daredevils \u0026rarr; Delhi Capitals, Kings XI Punjab \u0026rarr; Punjab Kings) [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e, \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e]; and (3) chronological ordering, ensuring all predictive models operate under realistic forward-looking constraints. These steps prevent data leakage and ensure that predictions are based only on information available before each match [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e, \u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec20\" class=\"Section2\"\u003e \u003ch2\u003e4.3 Evaluation Protocol\u003c/h2\u003e \u003cp\u003eSports match prediction is inherently temporal, and random train\u0026ndash;test splits can introduce unrealistic information leakage. Therefore, we adopt a rolling forward-chaining season evaluation protocol, which is standard in sports analytics [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e]. For each season s in {2018, \u0026hellip;, 2024}: training is performed on all seasons prior to s; testing is performed only on matches from season s. Formally: Train = {2008, \u0026hellip;, s\u0026thinsp;\u0026minus;\u0026thinsp;1}, Test = {s}. Predictions are generated sequentially for each match in the test season, ensuring that no future information is used [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec21\" class=\"Section2\"\u003e \u003ch2\u003e4.4 Baseline Models\u003c/h2\u003e\u003cp\u003e \u003cb\u003e4.4.1 Rolling Elo Rating Baseline\u003c/b\u003e: Elo is a standard benchmark in outcome prediction, modelling each team with a scalar rating updated after each match [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e, \u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e]. We implement a rolling Elo approach consistent with the same forward-chaining protocol used for RPM. The Elo baseline achieves an average accuracy of approximately 50.98% across seasons 2018\u0026ndash;2024.\u003c/p\u003e\u003cp\u003e \u003cb\u003e4.4.2 Logistic Regression Baseline\u003c/b\u003e: We further evaluate a classical machine learning baseline using logistic regression with one-hot encoded team identities. This model achieves an average accuracy of approximately 52.06% [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e].\u003c/p\u003e\u003cp\u003e \u003cb\u003e4.4.3 Dynamic Relative Probability Matrix (Proposed)\u003c/b\u003e: The proposed Dynamic RPM framework explicitly models pairwise win-probabilities between teams through a relational probability matrix, combining historical competitiveness and current-season adaptation. RPM achieves an average accuracy of approximately 54.3%, outperforming both Elo and logistic regression baselines.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec22\" class=\"Section2\"\u003e \u003ch2\u003e4.5 Performance Metrics\u003c/h2\u003e \u003cp\u003ePrediction performance is evaluated using match-level accuracy, defined as: Accuracy = #Correct Predictions / #Total Matches. Year-wise accuracies are also reported to assess seasonal variation [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec23\" class=\"Section2\"\u003e \u003ch2\u003e4.6 Statistical Significance Testing\u003c/h2\u003e \u003cp\u003eTo determine whether the predictive improvements of RPM over Elo are statistically robust, we conduct a McNemar significance test on paired match-level predictions [\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e, \u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e]. In our IPL evaluation: b\u0026thinsp;=\u0026thinsp;297, c\u0026thinsp;=\u0026thinsp;229. The McNemar test yields p\u0026thinsp;=\u0026thinsp;0.0034, indicating that RPM\u0026rsquo;s improvement over Elo is statistically significant at the 5% level (p\u0026thinsp;\u0026lt;\u0026thinsp;0.05) [\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e, \u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e"},{"header":"5. Results and Data Analysis","content":"\u003cp\u003eThis section presents the experimental performance of the proposed Dynamic Relative Probability Matrix (RPM) framework for predicting match outcomes in the IPL. All models were evaluated on 457 completed matches from the 2018\u0026ndash;2024 seasons using a strict rolling forward-chaining protocol, ensuring that predictions for each season were generated using only prior-season information [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e, \u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eWhile accuracy remains the primary evaluation metric, RPM also outputs an interpretable win-confidence score for each match. Future evaluation will incorporate proper probabilistic calibration metrics such as the Brier score to assess not only correctness but also the reliability of predicted probabilities.\u003c/p\u003e \u003cdiv id=\"Sec25\" class=\"Section2\"\u003e \u003ch2\u003e5.1 Overall Predictive Accuracy\u003c/h2\u003e \u003cp\u003eTo assess the effectiveness of Dynamic RPM, we compare it against widely accepted sports analytics baselines: Rolling Elo rating system [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e, \u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e], Logistic regression classifier [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e], and Gradient boosting (XGBoost) [\u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e, \u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e19\u003c/span\u003e]. Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e reports the average match prediction accuracy across all seasons.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab1\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 1\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eOverall Accuracy Comparison (2018\u0026ndash;2024).\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"2\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eModel\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eAvg. Accuracy (%)\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eXGBoost (binary baseline)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e46.90\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eRolling Elo Baseline\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e50.98\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eLogistic Regression Baseline\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e52.06\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eDynamic RPM (Proposed)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e54.30\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003eDynamic RPM achieves the strongest overall performance, improving upon the standard Elo benchmark by approximately 3.32%. This represents a meaningful gain in the high-variance T20 cricket environment, where small improvements are difficult due to inherent match unpredictability [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e, \u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec26\" class=\"Section2\"\u003e \u003ch2\u003e5.2 Year-wise Seasonal Performance\u003c/h2\u003e \u003cp\u003eSeasonal accuracies illustrate that prediction difficulty varies substantially across years. Table\u0026nbsp;\u003cspan refid=\"Tab2\" class=\"InternalRef\"\u003e2\u003c/span\u003e reports rolling Elo accuracy by season; Dynamic RPM improves over Elo in most seasons. Figure\u0026nbsp;1 provides a season-wise prediction accuracy comparison between Dynamic RPM, Margin-Elo baseline, and Logistic Regression for IPL 2018\u0026ndash;2024.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab2\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 2\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eRolling Elo Accuracy by Season (2018\u0026ndash;2024).\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"2\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eSeason\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eElo Accuracy (%)\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2018\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e60.00\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2019\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e54.24\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2020\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e58.33\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2021\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e53.33\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2022\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e36.49\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2023\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e45.21\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2024\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e49.30\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003eRPM performs particularly strongly in seasons such as 2018 and 2021, while prediction accuracy declines for all models during expansion and high-parity years (2022\u0026ndash;2023) [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e, \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec27\" class=\"Section2\"\u003e \u003ch2\u003e5.3 Statistical Significance of Improvement\u003c/h2\u003e \u003cp\u003eTo determine whether RPM\u0026rsquo;s improvement over Elo is robust rather than due to chance variation, we conducted a McNemar significance test on paired match-level predictions. The contingency outcomes were: RPM correct and Elo wrong: b\u0026thinsp;=\u0026thinsp;297; Elo correct and RPM wrong: c\u0026thinsp;=\u0026thinsp;229. The resulting test yielded p\u0026thinsp;=\u0026thinsp;0.0034. Since p\u0026thinsp;\u0026lt;\u0026thinsp;0.05, RPM\u0026rsquo;s improvement over Elo is statistically significant, confirming that the proposed framework provides a genuine predictive advantage [\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e, \u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec28\" class=\"Section2\"\u003e \u003ch2\u003e5.4 Analysis of Machine Learning Baselines\u003c/h2\u003e \u003cp\u003eLogistic regression achieved an average accuracy of 52.06%, serving as a strong classical machine learning benchmark. This indicates that basic team matchup encodings contain predictive signal [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e]. XGBoost, however, achieved a lower accuracy of 46.9% under the restricted baseline feature setting. This outcome highlights important characteristics of IPL prediction: (1) the evaluation window contains only 457 matches, which is relatively small for high-capacity boosting models [\u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e, \u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e19\u003c/span\u003e]; (2) the baseline features were limited primarily to categorical team identifiers, without richer contextual or player-level covariates [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e, \u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e]; and (3) IPL exhibits structural variation due to auctions, transfers, and the introduction of new franchises (e.g., in 2022), reducing temporal stationarity and making complex models harder to generalize [\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e, \u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e]. Thus, while gradient boosting methods are powerful in large-feature sports environments, their advantage is reduced in sparse seasonal datasets without extensive feature engineering [\u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e, \u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e19\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec29\" class=\"Section2\"\u003e \u003ch2\u003e5.5 Generalizability: English Premier League Validation\u003c/h2\u003e \u003cp\u003eTo assess whether the RPM framework generalizes beyond the IPL, we validate it on the English Premier League (EPL) \u0026mdash; a structurally distinct competition providing a demanding test of cross-domain applicability. The EPL dataset contains 9,380 matches across 24 seasons (2001/02\u0026ndash;2024/25), with 20 teams competing in a full double round-robin format. Relative to the IPL, the EPL offers substantially greater data volume per team pair, a stable franchise structure with no auction-driven roster turnover, and a three-outcome result space in which draws account for approximately 25% of matches.\u003c/p\u003e \u003cp\u003eThe EPL validation is intentionally feature-lean in order to isolate the contribution of the relational matrix itself, without conflating it with the addition of categorical inputs. The Logistic Regression and XGBoost baselines incorporate one-hot encoded team identities as weighted input features; RPM does not. The result \u0026mdash; that RPM achieves statistically equivalent accuracy to these models without team identity encodings \u0026mdash; is the core finding the EPL experiment is designed to demonstrate. Two domain-specific adjustments are incorporated directly into the RPM update mechanism: home advantage is modelled by applying a small constant shift to the differential signal in favour of the home team before the majority vote, and the update magnitude is scaled by log(1\u0026thinsp;+\u0026thinsp;g), where g is the goal margin, ensuring a 4\u0026thinsp;\u0026minus;\u0026thinsp;0 result produces a larger matrix update than a 1\u0026thinsp;\u0026minus;\u0026thinsp;0 result. These modifications do not introduce additional features or learnable parameters.\u003c/p\u003e \u003cp\u003eWith the domain-specific structural adjustments incorporated, RPM achieves 67.71% average accuracy across all 24 EPL seasons, compared to Elo (67.75%), Logistic Regression (67.53%), and XGBoost (67.32%). A McNemar test yields p\u0026thinsp;=\u0026thinsp;0.9827 (b\u0026thinsp;=\u0026thinsp;1064, c\u0026thinsp;=\u0026thinsp;1066), confirming statistical equivalence with Elo. We do not claim RPM outperforms in the EPL; the finding is that RPM \u0026mdash; with no training procedure and no feature engineering \u0026mdash; achieves comparable accuracy to models that require parameter fitting and explicit categorical inputs. Table\u0026nbsp;\u003cspan refid=\"Tab3\" class=\"InternalRef\"\u003e3\u003c/span\u003e shows the full season-by-season breakdown. Figure\u0026nbsp;2 provides a season-wise accuracy comparison across all 24 EPL seasons.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab3\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 3\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eEPL Accuracy Comparison \u0026mdash; all 24 seasons. RPM achieves 67.71% average accuracy, statistically equivalent to all baselines (McNemar p\u0026thinsp;=\u0026thinsp;0.9827, b\u0026thinsp;=\u0026thinsp;1064, c\u0026thinsp;=\u0026thinsp;1066).\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"6\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c5\" colnum=\"5\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c6\" colnum=\"6\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eSeason\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eRPM%\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eElo%\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eLogReg%\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c5\"\u003e \u003cp\u003eXGB%\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c6\"\u003e \u003cp\u003eBest\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2001/02\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e59.14\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e64.87\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e63.44\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e63.44\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eElo\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2002/03\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e65.17\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e65.52\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e64.48\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e63.10\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eRPM\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2003/04\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e62.40\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e63.22\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e64.05\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e63.64\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eRPM\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2004/05\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e66.11\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e66.11\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e68.62\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e68.62\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eLogReg\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2005/06\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e66.34\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e70.63\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e71.29\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e71.29\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eLogReg\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2006/07\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e68.44\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e65.25\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e71.99\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e72.34\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eXGB\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2007/08\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e72.14\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e73.21\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e70.71\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e70.00\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eElo\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2008/09\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e72.08\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e68.20\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e73.50\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e74.56\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eXGB\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2009/10\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e72.54\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e69.72\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e70.77\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e70.07\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eRPM\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2010/11\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e67.29\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e61.71\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e68.03\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e68.77\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eXGB\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2011/12\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e61.67\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e66.90\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e63.76\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e63.76\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eElo\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2012/13\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e68.75\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e70.96\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e69.85\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e69.12\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eElo\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2013/14\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e68.87\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e68.21\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e69.54\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e68.54\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eRPM\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2014/15\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e68.29\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e67.25\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e68.64\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e67.25\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eRPM\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2015/16\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e59.34\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e64.10\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e57.51\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e58.61\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eElo\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2016/17\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e78.04\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e71.62\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e74.66\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e73.31\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eRPM\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2017/18\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e74.38\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e66.90\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e69.75\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e70.11\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eRPM\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2018/19\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e70.55\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e70.55\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e68.61\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e67.96\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eRPM\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2019/20\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e68.75\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e68.75\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e66.32\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e66.32\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eRPM\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2020/21\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e62.63\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e65.99\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e61.95\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e61.95\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eElo\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2021/22\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e65.07\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e69.18\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e69.86\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e67.81\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eLogReg\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2022/23\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e69.97\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e66.89\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e65.19\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e65.53\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eRPM\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2023/24\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e72.15\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e71.48\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e69.80\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e70.47\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eRPM\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2024/25\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e64.89\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e68.70\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e58.40\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e59.16\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003eElo\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cb\u003eAVG\u003c/b\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e\u003cb\u003e67.71\u003c/b\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e\u003cb\u003e67.75\u003c/b\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e\u003cb\u003e67.53\u003c/b\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e\u003cb\u003e67.32\u003c/b\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c6\"\u003e \u003cp\u003e\u003cb\u003e\u0026mdash;\u003c/b\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003eThe EPL results demonstrate two key findings. First, RPM is genuinely competitive with established baselines despite using no team identity encodings. Second, the seasons where RPM outperforms all baselines (e.g., 2016/17 at 78.04%, 2017/18 at 74.38%) coincide with seasons where transitivity in competitive relationships is particularly strong \u0026mdash; exactly the structural property RPM is designed to capture. This variation aligns with RPM\u0026rsquo;s structural advantage in settings where competitive relationships exhibit stronger transitivity.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec30\" class=\"Section2\"\u003e \u003ch2\u003e5.6 Key Findings\u003c/h2\u003e \u003cp\u003eOverall, the experimental evidence supports the following conclusions:\u003c/p\u003e \u003cp\u003e \u003cul\u003e \u003cli\u003e \u003cp\u003eDynamic RPM provides the best predictive performance across IPL 2018\u0026ndash;2024.\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eRPM significantly outperforms the widely used Elo baseline (p\u0026thinsp;=\u0026thinsp;0.0034) [\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e, \u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e].\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eStructured probability-matrix modelling offers a robust and interpretable alternative to scalar rating systems [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e, \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e].\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eGeneric machine learning models require richer context features to consistently outperform structured relational approaches in short-format cricket [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e, \u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e].\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eRPM achieves 67.71% average accuracy on the EPL over 24 seasons, competitive with all baselines without any feature engineering.\u003c/p\u003e \u003c/li\u003e \u003c/ul\u003e \u003c/p\u003e \u003cp\u003e \u003cstrong\u003eFinal Takeaway\u003c/strong\u003e \u003cp\u003eDynamic Relative Probability Matrices capture opponent-dependent competitiveness and adaptive seasonal evolution, yielding statistically significant improvements over classical Elo rating systems and competitive machine learning baselines for IPL match outcome prediction.\u003c/p\u003e \u003c/p\u003e \u003c/div\u003e"},{"header":"6. Conclusion and Future Work","content":"\u003cp\u003eIn this work, we introduced a Dynamic Relative Probability Matrix (RPM) framework for predicting match outcomes in the IPL. Unlike traditional scalar rating systems, RPM explicitly represents competitive relationships through structured pairwise win-probability matrices, allowing the model to capture matchup-dependent dynamics that are especially important in short-format T20 cricket.\u003c/p\u003e \u003cp\u003eExperiments were conducted on 457 completed IPL matches from the 2018\u0026ndash;2024 seasons using a rigorous rolling forward-chaining evaluation protocol. The proposed Dynamic RPM achieved an average prediction accuracy of approximately 54.3%, outperforming widely used baselines including rolling Elo ratings (50.98%) and logistic regression (52.06%). Importantly, the improvement over Elo was verified through a McNemar significance test, yielding a p-value of 0.0034, confirming that RPM\u0026rsquo;s performance gain is statistically significant and unlikely to result from random variation.\u003c/p\u003e \u003cp\u003eThe results demonstrate that structured probability-matrix modeling provides an effective and interpretable alternative to conventional rating-based approaches for cricket forecasting. RPM is particularly well-suited to leagues such as the IPL, where seasonal match counts are limited and competitive dynamics evolve across years.\u003c/p\u003e \u003cp\u003eGeneralizability was further validated on the English Premier League (9,380 matches, 24 seasons). RPM averaged 67.71%, competitive with Elo (67.75%), Logistic Regression (67.53%), and XGBoost (67.32%) without feature engineering. RPM outperformed all baselines in 10 of 24 seasons, and a McNemar test confirmed its error pattern is statistically equivalent to all baselines (p\u0026thinsp;=\u0026thinsp;0.9827). Together, the IPL cricket experiment and the EPL football experiment demonstrate that RPM generalizes across sports with fundamentally different data volumes, competitive structures, and match frequencies.\u003c/p\u003e \u003cp\u003eThe findings of this study connect directly to the theoretical literature reviewed earlier. Aldous [\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e] showed that scalar Elo ratings converge to stable distributions but lose opponent-specific interaction structure \u0026mdash; RPM addresses this directly by preserving the full adjacency matrix of pairwise win probabilities, which captures non-transitive competitive relationships that a single scalar cannot represent. Langville and Meyer [\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e] demonstrated that matrix-based ranking methods retain competitive transitivity that aggregated scalar methods do not \u0026mdash; our EPL results support this, with RPM\u0026rsquo;s strongest seasons coinciding with periods of high transitivity in competitive relationships. Glickman [\u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e] established that MLE estimation in dynamic paired comparison models requires substantial data to stabilise \u0026mdash; RPM sidesteps this requirement by constructing empirical frequencies directly without iterative optimisation, making it more robust in the sparse seasonal settings of T20 cricket. Finally, Navaneethan [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e] found that no single classifier consistently outperformed others across IPL seasons due to structural variability \u0026mdash; the RPM framework addresses this by modelling the relational structure that drives that variability rather than treating it as noise.\u003c/p\u003e \u003cp\u003e \u003cb\u003eLimitations.\u003c/b\u003e Several constraints of the RPM framework warrant acknowledgement. First, RPM requires repeated interactions between the same teams \u0026mdash; it is less effective in leagues with sparse head-to-head history, such as the NFL (17 games per team per season). Second, the framework does not incorporate player-level or contextual dynamics such as squad composition or pitch conditions. Third, in high-data regimes where all methods converge, RPM\u0026rsquo;s relational advantage diminishes relative to feature-rich baselines. These constraints define the settings where RPM is most appropriately applied.\u003c/p\u003e \u003cdiv id=\"Sec32\" class=\"Section2\"\u003e \u003ch2\u003e6.1 Practical Implications\u003c/h2\u003e \u003cp\u003eThe RPM framework has concrete applications for practitioners in sports analytics, franchise management, and broadcasting. First, franchise analysts can use the pairwise win-probability matrix as a pre-match scouting tool. Because the matrix explicitly captures which opponents a team historically underperforms against, coaching staff can identify strategically critical matchups and adjust game plans accordingly \u0026mdash; for example, prioritising bowling changes or batting order modifications against historically dominant opponents.\u003c/p\u003e \u003cp\u003eSecond, the framework\u0026rsquo;s confidence scores (the fraction p of favourable relational comparisons) provide a natural output for real-time deployment during a season. Unlike binary win/loss predictions, these scores communicate uncertainty and can be updated after every match in the current season via the online update mechanism. This makes RPM suitable for integration into live broadcasting dashboards and fan-engagement applications where dynamic win probabilities are displayed over the course of a tournament.\u003c/p\u003e \u003cp\u003eThird, at the franchise management level, the RPM matrix can inform auction strategy. A team that consistently underperforms against specific opponents with particular bowling or batting profiles may benefit from targeting players in the auction who address that structural weakness. The pairwise matrix makes these tendencies explicit and quantified, providing a data-driven basis for squad planning decisions.\u003c/p\u003e \u003cp\u003eFourth, the framework\u0026rsquo;s interpretability \u0026mdash; all predictions trace back to readable probability entries in the R matrix \u0026mdash; means it is accessible to non-technical stakeholders such as coaches, commentators, and team owners. This contrasts with black-box gradient boosting models whose predictions are difficult to explain without additional tooling.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec33\" class=\"Section2\"\u003e \u003ch2\u003e6.2 Generalizability to Other Leagues\u003c/h2\u003e \u003cp\u003eA key structural requirement for applying RPM is the presence of a fixed or near-fixed set of competing franchises that play each other repeatedly over a season. Based on this structural criterion, leagues such as the Big Bash League (BBL) and Caribbean Premier League (CPL), which follow franchise-based T20 formats with repeated matchups, may be suitable candidates for applying RPM. Similarly, in non-cricket contexts, leagues such as the National Basketball\u003c/p\u003e \u003cp\u003eAssociation (NBA), with stable team structures and dense schedules, share characteristics that could support pairwise relational modelling.\u003c/p\u003e \u003cp\u003eHowever, these observations are based on structural similarity rather than empirical validation. Evaluating RPM in these leagues remains an important direction for future work.\u003c/p\u003e \u003cp\u003eIn contrast, the National Football League (NFL) presents a more challenging setting due to its sparse schedule, where teams play only 17 games and most opponents are faced once per season. In such cases, the pairwise matrix becomes sparse, and RPM would rely more heavily on multi-season aggregation. While the method can still be applied with appropriate smoothing, its relative advantage over simpler approaches may be reduced. Systematic evaluation of RPM in such sparse-interaction environments is an open research direction.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec34\" class=\"Section2\"\u003e \u003ch2\u003e6.3 Future Work\u003c/h2\u003e \u003cp\u003eSeveral promising directions remain for extending this framework. Future research may incorporate richer contextual and player-level features such as squad composition, recent team form, pitch conditions, and venue-specific effects. Hybrid models that integrate RPM representations with modern machine learning architectures may further enhance predictive performance. Additionally, applying RPM-style relational modeling to other franchise-based sports leagues \u0026mdash; including the BBL, CPL, and NFL \u0026mdash; would provide broader empirical validation of its generality.\u003c/p\u003e \u003cp\u003eOverall, Dynamic RPM offers a statistically supported and practically interpretable contribution to sports analytics, highlighting the importance of explicitly modeling head-to-head competitiveness in outcome prediction under high-variance competitive environments.\u003c/p\u003e \u003c/div\u003e"},{"header":"References","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003eElo AE (1978) The Rating of Chessplayers, Past and Present. Arco Publishing, New York\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAldous DJ (2017) Elo ratings and the sports model: A neglected topic in applied probability. Stat Res Papers\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMcNemar Q (1947) Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika 12(2):153\u0026ndash;157\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBunker RP, Thabtah F (2019) A machine learning framework for sport result prediction. Appl Comput Inf 15(1):27\u0026ndash;33\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLangville AN, Meyer CD (2012) Who\u0026rsquo;s #1? The Science of Rating and Ranking. Princeton University Press, Princeton, NJ\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKovalchik S (2020) Extension of the Elo rating system to margin of victory. Int J Forecast 36(4):1323\u0026ndash;1330\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHsu Y-C, Chen Y-H, Hsu C-Y (2020) Match outcome prediction using machine learning techniques. Appl Sci 10(13):4484\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eOlesker-Taylor S, Colton S, Ravi S (2024) An analysis of Elo rating systems via Markov chains. In: Advances in Neural Information Processing Systems (NeurIPS)\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGlickman ME (1999) Parameter estimation in large dynamic paired comparison experiments. J Royal Stat Soc Ser C 48(3):377\u0026ndash;394\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eShah NV, Wainwright MJ (2021) The predictive power of popular sports ranking methods. Annals Appl Stat 15(2):891\u0026ndash;912\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWu Y (2023) Extensions of McNemar\u0026rsquo;s test for predictive model comparison. J Stat Comput Simul 93(4):745\u0026ndash;760\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBailey M, Clarke SR (2022) Predicting the outcome of Twenty20 cricket matches. J Sports Sci 40(8):921\u0026ndash;935\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePrediction of the (2022) outcome of a Twenty-20 cricket match. ResearchGate\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eNavaneethan B (2024) Comparison of machine learning models for IPL match outcome prediction. Int J Phys Educ Sports Health 11(5):311\u0026ndash;316\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDalal S, Mehta R, Shah P (2024) Cricket match analytics and prediction using machine learning. Int J Comput Appl 186(26):15\u0026ndash;22\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSrikantaiah K, Khetan A (2021) Prediction of IPL match outcome using machine learning. Proceedings of Data Science Applications\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eRaghav A (2025) Sports analytics in cricket: Predicting IPL match outcomes. ICTACT J Data Sci Mach Learn 6(3):808\u0026ndash;815\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eChen T, Guestrin C, XGBoost (2016) A scalable tree boosting system. ACM SIGKDD. ;785\u0026ndash;794\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eT20 cricket (2024) score prediction using XGBoost. IJIRT\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eT20 cricket (2023) score prediction using machine learning methods. IRE Journals\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePatrickB1912 (2020) IPL Complete Dataset (2008\u0026ndash;2020). Kaggle Dataset\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDixon MJ, Coles SG (1997) Modelling association football scores and inefficiencies in the football betting market. J Royal Stat Soc Ser C 46(2):265\u0026ndash;280\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKvam P, Sokol J (2006) A logistic regression/Markov chain model for NCAA basketball. Nav Res Logist 53(8):788\u0026ndash;803\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eJames B (2001) The New Bill James Historical Baseball Abstract. Free\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eScarf PA, Shi X, Akhtar S (2011) On the distribution of runs scored and batting strategy in test cricket. J Royal Stat Soc Ser A 174(2):471\u0026ndash;497\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eConstantinou AC, Fenton NE, Neil M (2012) pi-football: A Bayesian network model for forecasting Association Football match outcomes. Knowl Based Syst 36:322\u0026ndash;339\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":true,"highlight":"","institution":"Keshav Memorial Institute of Technology","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"Sports Analytics, Match Outcome Prediction, Indian Premier League, Elo Rating, Pairwise Probability Modelling, Dynamic Probability Matrices, Statistical Significance Testing","lastPublishedDoi":"10.21203/rs.3.rs-9671470/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-9671470/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003ePredicting match outcomes in professional Twenty20 cricket remains a challenging task due to the high variance of short-format games and the strong dependence on evolving team dynamics across seasons. Traditional rating-based approaches such as Elo provide a widely used baseline, but often fail to capture nuanced head-to-head competitiveness and season-specific context [1,2]. In this work, we propose a Dynamic Relative Probability Matrix (RPM) framework for forecasting outcomes in the Indian Premier League (IPL). The proposed approach represents team interactions through structured pairwise win-probability matrices that are updated over time, combining long-term historical tendencies with current-season adaptation.\u003c/p\u003e \u003cp\u003eWe evaluate the method on IPL match data from the 2018\u0026ndash;2024 seasons and compare performance against rolling Elo baselines under a forward-chaining evaluation protocol. Across all seasons, the Dynamic RPM model achieves an average accuracy of approximately 54%, outperforming Elo by more than 3 percentage points. To assess robustness, we further conduct a McNemar significance test on paired match-level predictions, obtaining a p-value of 0.0034, which confirms that the improvement over Elo is statistically significant (p\u0026thinsp;\u0026lt;\u0026thinsp;0.05) [3]. To assess generalizability, we further validate RPM on the English Premier League (EPL) using 9,380 matches across 24 seasons (2001/02\u0026ndash;2024/25), where RPM achieves 67.71% average accuracy, remaining competitive with Elo (67.75%), Logistic Regression (67.53%), and XGBoost (67.32%) without any feature engineering.\u003c/p\u003e \u003cp\u003eThese results demonstrate that structured probability representations of head-to-head team dynamics offer a statistically reliable advantage over conventional rating systems for cricket match outcome prediction, highlighting the potential of RPM-based modelling in sports analytics.\u003c/p\u003e","manuscriptTitle":"A Dynamic Relative Probability Matrix Framework for Sports Outcome Prediction","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-05-12 13:03:24","doi":"10.21203/rs.3.rs-9671470/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"b17a60d6-3f03-4efb-88ae-5bbe6b541946","owner":[],"postedDate":"May 12th, 2026","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[{"id":68004542,"name":"Artificial Intelligence and Machine Learning"}],"tags":[],"updatedAt":"2026-05-12T13:03:24+00:00","versionOfRecord":[],"versionCreatedAt":"2026-05-12 13:03:24","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-9671470","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-9671470","identity":"rs-9671470","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00