A Two-Phase Spatio-Temporal Interpolation Framework for Reconstructing Block Group Level Housing Values in California | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article A Two-Phase Spatio-Temporal Interpolation Framework for Reconstructing Block Group Level Housing Values in California Shuang Tian¹, Fang Qiu¹ This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-8674976/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted 10 You are reading this latest preprint version Abstract High-resolution historical housing value data are essential for urban analysis, socioeconomic research, and policy evaluation. However, housing values reported by the U.S. Census are only available at decennial intervals and are subject to changes in administrative boundaries, resulting in substantial spatial and temporal gaps. This study proposes a two-phase spatio-temporal interpolation framework to reconstruct annual block-group-level housing values in California by explicitly decoupling spatial and temporal dependence. In the first phase, ordinary kriging is applied independently to each census year (1990, 2000, 2010, and 2020) to reconstruct spatially continuous housing values on a unified 2020 block-group geometry, leveraging strong spatial autocorrelation in housing markets. In the second phase, temporal interpolation and extrapolation are performed independently at each block group using a locally constrained inverse time-distance weighting (ITDW) approach that exploits short-range temporal autocorrelation while avoiding unrealistic global temporal trends. Validation against American Community Survey data demonstrates robust predictive performance, with R² values ranging from 0.64 to 0.95 for interpolated years (2013–2019) and from 0.68 to 0.84 for the extrapolated years (2021–2023). The results indicate that housing values are more strongly influenced by nearest temporal neighbors than by long-range temporal trends, and that explicitly separating spatial and temporal processes enables stable and accurate reconstruction of fine-scale housing values from sparse historical observations. Earth and environmental sciences/Environmental sciences Physical sciences/Mathematics and computing Spatio-temporal interpolation Ordinary kriging Inverse time-distance weighting Census block groups Housing values Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 1. Introduction Housing values exhibit pronounced spatial and temporal heterogeneity shaped by local market conditions, accessibility, demographic composition, and broader macroeconomic dynamics. Understanding the historical evolution of housing values at fine spatial scales is critical for urban planning, housing affordability assessment, and research on socioeconomic inequality [ 1 – 3 ]. Despite its importance, high-resolution historical housing data remain limited in the United States because census-based housing values are reported only at decennial intervals and are further complicated by frequent changes in census geographies, leading to substantial spatial and temporal gaps [ 4 , 5 ]. To address these limitations, previous studies have explored a range of spatial and spatio-temporal modeling approaches, including areal interpolation, spatial regression, space–time distance-weighted interpolation, and global trend fitting [ 6 – 10 ]. While these methods have proven useful in certain contexts, many implicitly assume smooth global temporal trends or continuous space–time dependence. Such assumptions are often violated in housing markets, where abrupt regime shifts—such as economic recessions, housing booms, and policy interventions—can produce non-linear and locally heterogeneous temporal dynamics [ 11 , 12 ]. Moreover, empirical evidence consistently shows that housing values exhibit strong spatial autocorrelation at fixed time points and strong short-range temporal autocorrelation at fixed locations, suggesting that spatial and temporal processes may operate at distinct scales rather than as a single coupled process [ 6 , 13 ]. Motivated by these observations, this study proposes a two-phase spatio-temporal interpolation framework that explicitly separates spatial and temporal dependency modeling. In the first phase, spatial structure is reconstructed independently at each census year using ordinary kriging to fully capture contemporaneous spatial autocorrelation. In the second phase, temporal interpolation and extrapolation are performed independently at fixed block-group locations using a locally constrained inverse time-distance weighting (ITDW) strategy that prioritizes nearest temporal neighbors. By design, this framework avoids unrealistic global temporal trends and reduces spatio-temporal leakage that can arise in fully coupled space–time interpolation methods. Using California block-group housing values as a case study, we demonstrate that the proposed framework yields substantially improved reconstruction accuracy for both interpolated and extrapolated years when compared with traditional spatial residual-based and space–time distance-weighted approaches. The results highlight the importance of explicitly decoupling spatial and temporal processes when reconstructing long-term socioeconomic data from sparse historical observations. 2. Study Area and Data This study focuses on the state of California, which exhibits pronounced spatial heterogeneity in housing markets, ranging from dense coastal metropolitan regions to suburban, exurban, and rural inland areas ( Fig. 1 ). California’s diverse economic structure, large population, and substantial regional housing disparities make it an ideal case for evaluating spatio-temporal reconstruction methods at fine spatial scales [ 1 , 11 ]. Median housing values at the census block-group level were obtained from U.S. Census sources [ 4 ] for four decennial years: 1990, 2000, 2010, and 2020 (Fig. 2 ). These data represent the most spatially detailed and consistently available housing value information over long historical periods in the United States [ 4 , 5 ]. However, differences in census block-group boundaries across decades introduce spatial inconsistencies that prevent direct temporal comparison. To ensure spatial consistency across time, all historical housing values were reconstructed onto a unified 2020 block-group geometry using areal harmonization techniques commonly adopted in small-area estimation and spatial interpolation studies [ 6 , 14 ]. Block-group centroids were used to represent spatial locations during interpolation, enabling point-based spatial modeling while preserving block-group-level interpretation. This approach eliminates inconsistencies arising from boundary changes and allows direct temporal comparison of reconstructed housing values across the full study period. 3. Methodology 3.1 Overview of the Two-Phase Framework The proposed framework consists of two sequential and explicitly decoupled phases designed (Fig. 2 ) to separately model spatial and temporal dependence in census-based housing data: Spatial Reconstruction (Phase 1) : Ordinary kriging is applied independently to each census year to reconstruct spatially continuous housing values across a unified block-group geometry, explicitly capturing spatial autocorrelation. Temporal Interpolation (Phase 2) : At each fixed block group, housing values for target years are estimated using a local inverse time-distance weighting (ITDW) approach that exploits short-range temporal autocorrelation. By separating spatial and temporal modeling, the framework avoids imposing strong assumptions of continuous space–time smoothness and allows each dependency structure to be modeled at its appropriate scale [ 7 , 8 ]. 3.2 Phase 1: Spatial Reconstruction Using Ordinary Kriging In the first phase, spatial reconstruction of housing values is performed independently for each census year (1990, 2000, 2010, and 2020) using ordinary kriging (OK). Ordinary kriging is a geostatistical interpolation method that explicitly models spatial autocorrelation and provides best linear unbiased predictions under second-order stationarity assumptions [ 7 , 9 ]. Its application is particularly appropriate for housing markets, which are well known to exhibit strong spatial dependence consistent with Tobler’s First Law of Geography [ 15 ]. For each census year, observed block-group median housing values are represented at block-group centroid locations. An empirical variogram is computed to quantify how variance between housing values changes as a function of spatial separation distance. The empirical variogram characterizes the spatial autocorrelation structure by relating semivariance to distance [ 7 ]. A theoretical variogram model is then fitted and used within the ordinary kriging system to derive optimal interpolation weights. The kriging weights are obtained by minimizing the estimation variance subject to an unbiasedness constraint, ensuring that predicted housing values reflect the dominant influence of nearby block groups while accounting for spatial redundancy. Spatial reconstruction is conducted independently for each census year to avoid imposing assumptions about temporal continuity during spatial estimation. This design yields spatially continuous housing value surfaces on a unified 2020 block-group geometry while preserving year-specific spatial patterns. 3.3 Phase 2: Temporal Interpolation Using Inverse Time-Distance Weighting Following spatial reconstruction, each block group possesses reconstructed housing values at four discrete census years. Temporal interpolation and extrapolation are then performed independently at each block group using a local inverse time-distance weighting (ITDW) approach. Rather than fitting global temporal trends or regression-based curves, ITDW exploits short-range temporal autocorrelation by assigning greater influence to temporally proximate observations [ 10 ]. For a target year \(\:{t}_{0}\) , the estimated housing value \(\:\widehat{H}\left({t}_{0}\right)\) is computed as: $$\:\widehat{H}\left({t}_{0}\right)=\frac{\sum\:_{i=1}^{n}\frac{H\left({t}_{i}\right)}{\mid\:{t}_{0}-{t}_{i}{\mid\:}^{p}}}{\sum\:_{i=1}^{n}\frac{1}{\mid\:{t}_{0}-{t}_{i}{\mid\:}^{p}}}$$ where \(\:H\left({t}_{i}\right)\) denotes the reconstructed housing value at census year \(\:{t}_{i}\) , \(\:p\) is a temporal decay parameter controlling the rate at which influence decreases with temporal distance, and \(\:n\) denotes the number of available census anchors. Temporal interpolation is restricted to a local temporal neighborhood to ensure that estimates are dominated by nearest census years, thereby avoiding unrealistic influence from distant historical observations. This local temporal constraint allows abrupt market shifts and regime changes to be accommodated without enforcing smooth global trends. By decoupling spatial reconstruction and temporal interpolation, the framework preserves spatial and temporal autocorrelation structures independently and reduces spatio-temporal leakage commonly observed in fully coupled space–time interpolation methods [ 8 ]. 3.4 Baseline Methods for Comparison To evaluate the effectiveness of the proposed two-phase framework, its performance is compared against two commonly used baseline approaches: Spatial Residual-Based Interpolation : Housing values are estimated by interpolating spatial residuals derived from a single reference year and applying them across time, implicitly assuming temporal stationarity of spatial patterns. Spatio-Temporal Distance-Weighted Interpolation (ST-IDW) : Spatial and temporal distances are combined into a unified distance metric, assuming continuous space–time smoothness and equal coupling between spatial and temporal dependence [ 10 ]. These baselines represent widely adopted approaches for reconstructing missing spatio-temporal socioeconomic data and provide appropriate benchmarks for evaluating reconstruction accuracy. 3.5 Spatial Autocorrelation Validation Using Moran’s I To assess whether reconstructed housing value surfaces preserve realistic spatial dependence, spatial autocorrelation was evaluated using Global Moran’s I, a widely used statistic for diagnosing spatial structure in socioeconomic variables such as housing prices [ 6 , 15 ]. Global Moran’s I is defined as: $$\:I=\frac{N}{W}\frac{\sum\:_{i}\sum\:_{j}{w}_{ij}({x}_{i}-\stackrel{\prime }{x})({x}_{j}-\stackrel{\prime }{x})}{\sum\:_{i}({x}_{i}-\stackrel{\prime }{x}{)}^{2}}$$ where \(\:N\) is the number of spatial units, \(\:{x}_{i}\) and \(\:{x}_{j}\) are housing values at locations \(\:i\) and \(\:j\) , \(\:\stackrel{\prime }{x}\) is the mean housing value, \(\:{w}_{ij}\) denotes the spatial weight between units \(\:i\) and \(\:j\) , and \(\:W={\sum\:}_{i}{\sum\:}_{j}{w}_{ij}\) . Moran’s I values range from − 1 to + 1, with positive values indicating spatial clustering, negative values indicating spatial dispersion, and values near zero indicating spatial randomness. Statistical significance was assessed using permutation-based tests. Spatial autocorrelation was computed for reconstructed housing value surfaces for 2015 (interpolated year) and 2023 (extrapolated year) using a row-standardized contiguity-based spatial weights matrix at the block-group level. These years were selected to evaluate whether spatial dependence is preserved both within and beyond the temporal range of observed census data. By validating spatial autocorrelation in reconstructed outputs, this analysis ensures that the proposed framework not only minimizes prediction error but also maintains the intrinsic spatial structure characteristic of housing markets. 4. Results 4.1 Reconstruction Performance Across Census and Non-Census Years The performance of the proposed two-phase spatio-temporal interpolation framework was evaluated for both interpolated and extrapolated non-census years using observed American Community Survey (ACS) median house values as reference data. Validation was conducted at the census block-group level using three commonly adopted accuracy metrics: coefficient of determination (R²), root mean square error (RMSE), and mean absolute error (MAE). Overall, the framework demonstrates consistently strong predictive performance across California, indicating its ability to effectively capture both spatial variability and temporal dynamics in housing values (Figs. 3 – 5 ). 4.2 Validation for Interpolated and Near-Term Extrapolated Years For interpolated years, reconstructed housing values exhibit strong agreement with observed ACS data, confirming the effectiveness of the proposed framework in capturing short-range temporal dynamics. Among the interpolated years, 2017 , located near the midpoint between the 2010 and 2020 census anchors, achieves high validation accuracy with an R² of 0.79 across 21,546 block groups (RMSE = 166,628; MAE = 171,319). Performance improves further for 2019 , which lies closer to the 2020 census anchor, achieving an R² of 0.84 across 21,535 block groups (RMSE = 161,151; MAE = 96,550). Earlier interpolated years exhibit slightly lower but still robust performance. Reconstructed housing values for 2015 yield an R² of 0.704 across 21,593 block groups (RMSE = 176,866; MAE = 139,015), while 2013 , which is more temporally distant from the nearest census anchor, achieves an R² of 0.643 across 22,060 block groups (RMSE = 147,055; MAE = 126,118). This progressive improvement in accuracy as target years approach census anchors highlights the importance of nearest temporal neighbors in housing value reconstruction. 4.3 Validation for Extrapolated Year To assess extrapolation performance, reconstructed housing values for 2021, 2022, and 2023 were compared with corresponding ACS observations. Despite the absence of a census anchor beyond 2020, the proposed framework maintains strong predictive capability for near-term extrapolation. 2021 , which is immediately adjacent to the 2020 census anchor, achieves an exceptionally high R² of 0.95 across 23,089 block groups (RMSE = 97,493; MAE = 59,130), reflecting minimal temporal drift. As extrapolation distance increases, predictive performance declines in a controlled and expected manner. 2022 achieves an R² of 0.79 across 23,096 block groups (RMSE = 205,000; MAE = 158,421), while 2023 yields an R² of 0.68 across 23,047 block groups (RMSE = 260,638; MAE = 209,427). Despite increased uncertainty, reconstructed values for these years continue to preserve coherent spatial patterns and large-scale temporal trends (Fig. 5 ), underscoring the robustness of the nearest-time anchoring strategy. 4.4 Summary of Validation Results Across all evaluated non-census years, R² values range from 0.64 to 0.95 (Table 1 ), demonstrating stable and reliable performance under both interpolation and extrapolation scenarios. Interpolated years consistently achieve higher accuracy than more distant extrapolated years, while near-term extrapolation immediately following census anchors remains highly reliable. These findings indicate that resolving spatial autocorrelation first using ordinary kriging, followed by temporally constrained inverse time-distance weighting, provides a robust and interpretable strategy for reconstructing long-term block-group-level socioeconomic data from sparse census observations. Table 1 Validation metrics for reconstructed block-group housing values Year Validation Type Block Groups (N) R² RMSE MAE 2013 Interpolation 22,060 0.643 147,055 126,118 2015 Interpolation 21,593 0.704 176,866 139,015 2017 Interpolation 21,546 0.79 166,628 171,319 2019 Interpolation 21535 0.84 161151 96550 2021 Extrapolation 23089 0.95 97493 59130 2022 Extrapolation 23096 0.79 205000 158421 2023 Extrapolation 23,047 0.680 260,638 209,427 Note : Validation is performed using observed ACS median house value data at the census block-group level. RMSE and MAE are reported in U.S. dollars. 4.5 Spatial Autocorrelation Assessment To evaluate whether reconstructed housing values preserve realistic spatial dependence, Global Moran’s I statistics were computed for both census and non-census years. Moran’s I provides a quantitative measure of spatial autocorrelation and is widely used to assess spatial clustering in socioeconomic variables. For non-census years, Moran’s I values remain strongly positive and statistically significant ( p = 0.001 ) for all evaluated years (Table 2 ). Moran’s I values derived from ITDW-based reconstructions are consistently higher than those computed from temporally interpolated values alone, reflecting the re-imposition and preservation of spatial dependence through the spatial reconstruction phase. Spatial autocorrelation peaks for interpolated years between census anchors (2013–2019) and declines modestly for extrapolated years (2021–2023), indicating increasing temporal uncertainty while maintaining coherent spatial structure. For census years, Moran’s I values computed from observed and reconstructed housing values are identical and statistically significant (Table 3 ), confirming that the spatial reconstruction phase preserves intrinsic spatial autocorrelation without introducing smoothing artifacts. Together, these results demonstrate that the proposed two-phase framework achieves not only strong predictive accuracy but also faithful preservation of the spatial organization characteristic of housing markets. Table 2 Global Moran’s I statistics for reconstructed housing values in non-census years Year Moran’s I (Observed) p-value Moran’s I (ITWD) p-value 2013 0.7956 0.001 0.8557 0.001 2015 0.7768 0.001 0.8968 0.001 2017 0.7921 0.001 0.9134 0.001 2019 0.8119 0.001 0.9144 0.001 2021 0.8025 0.001 0.8171 0.001 2022 0.8018 0.001 0.82420 0.001 2023 0.7981 0.001 0.8304 0.001 Note : Moran’s I statistics were computed using a row-standardized contiguity-based spatial weights matrix at the census block-group level. “Observed” values correspond to housing values obtained through temporal interpolation or extrapolation using inverse time-distance weighting (ITDW) prior to spatial reconstruction, while “ITDW” values represent outputs after applying the ITDW-based temporal interpolation or extrapolation. Statistical significance was assessed using permutation-based tests (p = 0.001). Table 3 Global Moran’s I statistics for reconstructed housing values in census years Year Moran’s I (Observed) p-value Moran’s I (Reconstructed) p-value 1990 0.6737 0.001 0.6737 0.001 2000 0.7374 0.001 0.7374 0.001 2010 0.7303 0.001 0.7303 0.001 2020 0.8006 0.001 0.8006 0.001 Note : Moran’s I statistics were computed using a row-standardized contiguity-based spatial weights matrix at the census block-group level. Identical Moran’s I values for observed and reconstructed data confirm that the spatial reconstruction phase preserves intrinsic spatial autocorrelation without introducing smoothing artifacts. Statistical significance was assessed using permutation-based tests (p = 0.001). 5. Discussion The results underscore the importance of aligning interpolation strategies with the intrinsic spatio-temporal dynamics of housing markets. Housing values exhibit strong spatial autocorrelation at fixed time points and pronounced short-range temporal autocorrelation at fixed locations. Methods that impose linear, smooth, or globally continuous temporal trends across extended periods are therefore poorly suited to housing data, as they fail to capture abrupt market shifts, cyclical behavior, and regime changes driven by macroeconomic conditions and policy interventions. By explicitly decoupling spatial and temporal processes, the proposed two-phase framework mitigates spatio-temporal leakage and enhances reconstruction robustness. Spatial dependence is first resolved independently within each census year using ordinary kriging, ensuring realistic spatial structure is preserved. Temporal interpolation and extrapolation are then constrained to nearest temporal neighbors at fixed locations using inverse time-distance weighting. The strong predictive performance achieved using only four decennial observations demonstrates that local temporal weighting can effectively recover annual housing dynamics when spatial structure is properly reconstructed. Beyond empirical accuracy, the proposed framework offers a transparent and interpretable baseline for more advanced spatio-temporal learning approaches. By explicitly separating spatial and temporal effects, the framework facilitates clearer diagnosis of model behavior and uncertainty, and provides a principled benchmark for evaluating graph-based and deep learning models. This separation is particularly valuable for hybrid modeling strategies that seek to integrate geostatistical methods with data-driven learning while maintaining interpretability. 6. Implications The proposed two-phase spatio-temporal interpolation framework has several important implications for geospatial analysis, urban studies, and the reconstruction of long-term socioeconomic data. First , from a methodological perspective, the results demonstrate that explicitly aligning interpolation strategies with the intrinsic dynamics of the target phenomenon is critical for reliable spatio-temporal reconstruction. Housing values exhibit strong spatial dependence at fixed time points and dominant short-range temporal dependence at fixed locations. The strong performance of the proposed framework highlights that separating spatial and temporal modeling—rather than enforcing fully coupled space–time smoothness—can substantially improve both accuracy and robustness when observations are temporally sparse. Second , the framework provides a practical and scalable solution for reconstructing fine-scale historical housing data in contexts where annual observations are unavailable or inconsistent. By relying solely on decennial census data and a transparent interpolation strategy, the approach enables the generation of annual block-group-level housing estimates suitable for longitudinal urban analysis, housing affordability assessment, and studies of neighborhood change. The use of a unified spatial geometry further ensures temporal comparability, addressing a long-standing challenge in census-based socioeconomic research. Third , the interpretability of the two-phase design has important implications for model transparency and reproducibility. Unlike black-box spatio-temporal learning approaches, the framework explicitly separates spatial and temporal effects, allowing researchers to diagnose sources of uncertainty, evaluate spatial realism using established metrics, and understand how temporal proximity influences reconstructed values. This transparency is particularly valuable for policy-oriented urban research and comparative regional studies, where methodological clarity and interpretability are essential. Fourth , the framework demonstrates strong potential for near-term forecasting of housing values when recent census anchors are available. Anchored by the 2020 census, the proposed approach achieves high predictive accuracy for subsequent years, including 2021, 2022, and 2023. This result suggests that locally constrained temporal weighting, when combined with robust spatial reconstruction, can effectively support short-term housing market analysis without requiring complex forecasting models or extensive auxiliary data. Fifth , the framework offers a powerful solution for spatial interpolation under conditions of severe data sparsity. Across multiple non-census years, a substantial number of block groups lack observed housing values (e.g., over 1,000–2,500 missing block groups per year between 2013 and 2023). The proposed framework successfully reconstructs housing values for these missing spatial units while preserving spatial autocorrelation and minimizing interpolation artifacts. This capability highlights the framework’s utility for recovering massive spatial gaps in fine-scale socioeconomic datasets. Finally , the proposed framework establishes a conceptual and computational baseline for future spatio-temporal modeling efforts. By resolving spatial structure prior to temporal interpolation, the approach can serve as a benchmark or initialization strategy for more advanced methods, including graph-based and hybrid spatio-temporal learning models. As such, the implications of this work extend beyond housing data reconstruction, offering a generalizable strategy for recovering fine-scale socioeconomic variables from sparse historical observations across diverse geographic and thematic contexts. 7. Limitations and Future Work Despite its strong performance and interpretability, the proposed two-phase framework has several limitations that suggest directions for future research. First , the temporal interpolation component relies on a local inverse time-distance weighting (ITDW) scheme with a fixed temporal decay parameter. While this design effectively captures short-range temporal autocorrelation and avoids imposing unrealistic global temporal trends, it does not explicitly account for heterogeneous temporal dynamics across different housing submarkets. As illustrated in Fig. 5 , predicted values for 2015 exhibit a saturation effect at the upper end of the value range, where extremely high housing prices are systematically underestimated and form an apparent ceiling relative to observed ACS values. This behavior reflects the averaging nature of fixed-parameter temporal weighting, which dampens extreme values when nearby temporal anchors do not exhibit comparable magnitudes. Future work could explore adaptive temporal weighting strategies, locally varying decay parameters, or data-driven temporal kernels to better accommodate region-specific housing dynamics, market segmentation, and nonlinear growth regimes. Second , the framework assumes that spatial autocorrelation within each census year can be adequately characterized using block-group centroids and stationary variogram models. Although this assumption is common in geostatistical applications, housing markets may exhibit non-stationary spatial dependence due to urban–rural gradients, localized development patterns, or policy-driven effects. Extending the framework to incorporate non-stationary kriging approaches or multi-scale spatial modeling may further enhance spatial reconstruction accuracy. Third , the current implementation focuses on univariate housing values and does not explicitly incorporate auxiliary socioeconomic or environmental covariates, such as median income, population density, or accessibility measures. Integrating such covariates through co-kriging or hybrid spatial–statistical models represents a promising extension that could improve reconstruction performance, particularly in regions with sparse or noisy observations. Finally , while the proposed framework is intended as a transparent and interpretable baseline, it does not directly model complex nonlinear spatio-temporal interactions. Future research could integrate this two-phase design with advanced spatio-temporal learning approaches, such as graph neural networks, by using the reconstructed spatial and temporal structures as informed priors or initialization strategies. Such hybrid models may combine the interpretability and stability of geostatistical methods with the representational flexibility of deep learning, while maintaining robustness under sparse temporal observations. 8. Conclusion This study proposes a two-phase spatio-temporal interpolation framework for reconstructing block-group-level housing values from sparse historical observations. By explicitly separating spatial reconstruction and temporal interpolation, the framework leverages spatial and temporal autocorrelation more effectively than conventional approaches that impose globally smooth space–time dependence. Spatial structure is first reconstructed independently for each census year using ordinary kriging, followed by locally constrained temporal interpolation at fixed block-group locations, resulting in stable and accurate annual housing value estimates. Empirical validation using California housing data demonstrates that the proposed framework achieves strong predictive performance for both interpolated and extrapolated years while preserving realistic spatial dependence patterns. The results confirm that housing values are more strongly influenced by nearest temporal neighbors than by long-range temporal trends, and that decoupling spatial and temporal processes reduces spatio-temporal leakage and improves reconstruction robustness under temporally sparse observations. Beyond empirical accuracy, the proposed framework offers a scalable and interpretable solution for reconstructing fine-scale historical socioeconomic data from census sources. It also establishes a transparent methodological baseline that can support subsequent integration with advanced spatio-temporal learning models, such as graph-based approaches, enabling systematic comparison and hybrid modeling in future research. Declarations Data Availability Statement The census-based socioeconomic data used in this study are publicly available from the IPUMS National Historical Geographic Information System (NHGIS) [4]. Derived datasets, graph structures, and model inputs generated during the study are available from the corresponding author upon reasonable request. Ethics Statement The authors confirm that they have read and complied with the ethical requirements for publication in the Scientific Reports . This study does not involve human participants, animal subjects, or data collected from social media platforms. Generative artificial intelligence tools, specifically ChatGPT (OpenAI) , were used during manuscript preparation for language editing, text refinement, and clarity enhancement. All content generated with the assistance of this tool was carefully reviewed, verified, and edited by the authors, who take full responsibility for the accuracy, integrity, and originality of the final manuscript. Author Contributions Shuang Tian was responsible for data curation, data analysis, methodology, resources, visualization and writing original drafts. Dr. Fang Qiu was responsible for conceptualization, investigation, project administration, supervision, validation, review and editing. Competing Interests The author declares no competing interests. Funding Statement This research did not receive funding. References Anselin, L. Spatial Econometrics: Methods and Models (Kluwer Academic, 1988). Glaeser, E. L., Gyourko, J. & Saks, R. Why is Manhattan so expensive? Regulation and the rise in housing prices. J. Law Econ. 48 (2), 331–369 (2005). Bailey, N., van Gent, W. & Musterd, S. Remaking urban segregation. Urban Stud. 54 (14), 3284–3302 (2017). Jonathan Schroeder, D. et al. IPUMS National Historical Geographic Information System: Version 20.0 [dataset]. Minneapolis, MN: IPUMS. (2025). http://doi.org/10.18128/D050.V20.0 Logan, J. R., Xu, Z. & Stults, B. Interpolating U.S. decennial census tract data from as far back as 1970. Geographical Anal. 46 (4), 412–434 (2014). Qiu, F. & Cromley, R. Introduction: Areal interpolation and dasymetric modeling. Geographical Anal. 45 (3), 213–215. https://doi.org/10.2747/1548-1603.49.5.644 (2013). Kyriakidis, P. C. & Journel, A. G. Geostatistical space–time models. Math. Geol. 31 (6), 651–684 (1999). Huang, B., Wu, B. & Barry, M. Geographically and temporally weighted regression. Int. J. Geogr. Inf. Sci. 24 (3), 383–401. https://doi.org/10.1080/13658810802672469 (2010). Li, S. et al. Spatiotemporal Analysis of Housing Prices in China: A Big Data Perspective. Appl. Spat. Anal. 10 , 421–433. https://doi.org/10.1007/s12061-016-9185-3 (2017). Gao, S., Janowicz, K. & Couclelis, H. Extracting urban functional regions from points of interest. Trans. GIS . 21 (3), 446–467. https://doi.org/10.1111/tgis.12289 (2017). Case, K. E. & Shiller, R. J. Is there a bubble in the housing market? Brook. Papers Econ. Act. 2 , 299–362 (2003). Gyourko, J., Mayer, C. & Sinai, T. Superstar cities. Am. Economic Journal: Economic Policy . 5 (4), 167–199 (2013). Pace, R. K. & Barry, R. Sparse spatial autoregressions. Stat. Probab. Lett. 33 (3), 291–297. https://doi.org/10.1016/S0167-7152(96)00140-X (1997). Qiu, F., Zhang, C. & Zhou, Y. The development of an areal interpolation ArcGIS extension and a comparative study. GIScience Remote Sens. 49 (5), 644–663. https://doi.org/10.2747/1548-1603.49.5.644 (2012). Tobler, W. R. A computer movie simulating urban growth in the Detroit region. Econ. Geogr. 46 (Suppl.), 234–240. https://doi.org/10.2307/143141 (1970). Additional Declarations No competing interests reported. Cite Share Download PDF Status: Under Review Version 1 posted Editorial decision: Revision requested 21 Mar, 2026 Reviews received at journal 19 Mar, 2026 Reviewers agreed at journal 22 Feb, 2026 Reviews received at journal 19 Feb, 2026 Reviewers agreed at journal 29 Jan, 2026 Reviewers invited by journal 29 Jan, 2026 Editor invited by journal 27 Jan, 2026 Editor assigned by journal 24 Jan, 2026 Submission checks completed at journal 24 Jan, 2026 First submitted to journal 22 Jan, 2026 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-8674976","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":582712687,"identity":"9e66a706-2f3f-4ee0-86b1-b5ebd9f822ab","order_by":0,"name":"Shuang Tian¹","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAABDElEQVRIie3PMUsDMRTA8Rce5JZcXSOH3FfIcZAlFb/KlcLdcogglIKCN8Wl2LVCv0U/QqFT3Q+6CK43BLoULWJ6dRJydRTMH0IgvB/kAfh8f7B4drwfNAB5BUDoBe0DOomojzejdkpAhkDxFOHfxB7Kf0eiSbK9eQfWC15Wd+FOxRTxzcBYDSrXLvN1Gj0/2Y+x63zDsiLRSCWHdeEkUJciCieHXUppyZJoBAlEL7tI+tGSs0beWnKlMdga8tlJZMR2lvBSoiUDjUxwUrmJqPORCituSZOez/NiaMmIZ6sida4/Gy42bN9n8bRMTNNXl9PgcWHMvbpwfuwQ0fzHS9Y13rY/OeHz+Xz/uS8uhEux3AgVEAAAAABJRU5ErkJggg==","orcid":"","institution":"The University of Texas at Dallas","correspondingAuthor":true,"prefix":"","firstName":"Shuang","middleName":"","lastName":"Tian¹","suffix":""},{"id":582712688,"identity":"a0413548-4922-4f27-b450-fc88c9cbb9bf","order_by":1,"name":"Fang Qiu¹","email":"","orcid":"","institution":"The University of Texas at Dallas","correspondingAuthor":false,"prefix":"","firstName":"Fang","middleName":"","lastName":"Qiu¹","suffix":""}],"badges":[],"createdAt":"2026-01-23 04:39:26","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-8674976/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-8674976/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":101671571,"identity":"537e12ea-8a98-4036-b197-31d651c418df","added_by":"auto","created_at":"2026-02-02 12:52:24","extension":"jpg","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":154207,"visible":true,"origin":"","legend":"\u003cp\u003eStudy area and spatial distribution of census block-group median housing values in California for the 1990, 2000, 2010, and 2020 decennial censuses. Blank areas indicate missing data.\u003c/p\u003e","description":"","filename":"1.jpg","url":"https://assets-eu.researchsquare.com/files/rs-8674976/v1/7d2881a526d9b726839eca97.jpg"},{"id":101671568,"identity":"38854810-e7b5-4456-935a-f635511bff14","added_by":"auto","created_at":"2026-02-02 12:52:24","extension":"jpg","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":120879,"visible":true,"origin":"","legend":"\u003cp\u003eWorkflow of the proposed two-phase spatio-temporal interpolation framework, illustrating spatial reconstruction using ordinary kriging (Phase 1) and temporal interpolation using inverse time-distance weighting (Phase 2)\u003c/p\u003e","description":"","filename":"2.jpg","url":"https://assets-eu.researchsquare.com/files/rs-8674976/v1/1c6b8a0d8e825889ce25e842.jpg"},{"id":101671569,"identity":"1594b48a-9605-4092-b09d-ff82f97db5e4","added_by":"auto","created_at":"2026-02-02 12:52:24","extension":"jpg","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":206236,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eTwo-phase\u003c/strong\u003e Spatio-temporal interpolated census block-group housing values in California for census years: \u0026nbsp;1990, 2000, 2010, and 2020 decennial census years. The results illustrate pronounced spatial heterogeneity and long-term temporal evolution, with consistently higher housing values along the coastal regions. Major metropolitan areas, including San Francisco and Los Angeles, exhibit the highest housing values across all census years.\u003c/p\u003e","description":"","filename":"3.jpg","url":"https://assets-eu.researchsquare.com/files/rs-8674976/v1/99264d9db0401c209cf7c13d.jpg"},{"id":101671570,"identity":"5fd6641f-1a76-4162-badf-f6a708df1b26","added_by":"auto","created_at":"2026-02-02 12:52:24","extension":"jpg","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":206735,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eTwo-phase\u003c/strong\u003e Spatio-temporal interpolated census block-group housing values in California for non- census years: 2013, 2015,2017, and ,2023 interpolated and extrapolated housing values. The results illustrate pronounced spatial heterogeneity and long-term temporal evolution, with consistently higher housing values along the coastal regions. Major metropolitan areas, including San Francisco and Los Angeles, exhibit the highest housing values across all census years.\u003c/p\u003e","description":"","filename":"4.jpg","url":"https://assets-eu.researchsquare.com/files/rs-8674976/v1/29f39b344d4931af8bc182f0.jpg"},{"id":101753911,"identity":"02fab1c7-6ded-41c1-9e6d-04b5c4a66ef0","added_by":"auto","created_at":"2026-02-03 10:41:10","extension":"jpg","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":192339,"visible":true,"origin":"","legend":"\u003cp\u003eScatter plots comparing reconstructed block-group-level median housing values against observed American Community Survey (ACS) data for non-census years 2015, 2019, 2021 (interpolated), and 2022 (extrapolated). Each point represents a census block group. The dashed line denotes the 1:1 reference line. Coefficients of determination (R²) are reported for each year. The results demonstrate strong agreement between reconstructed and observed values across both interpolated and extrapolated years, with slightly increased dispersion for the extrapolated year reflecting greater temporal uncertainty.\u003c/p\u003e","description":"","filename":"5.jpg","url":"https://assets-eu.researchsquare.com/files/rs-8674976/v1/86d563a597b38d2d8aa68d94.jpg"},{"id":102397214,"identity":"211f869a-a010-4d76-ab8d-024a2614283b","added_by":"auto","created_at":"2026-02-11 10:10:56","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":1902633,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-8674976/v1/592eb3e2-80ba-404d-a4b6-90ea2586f1e9.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"A Two-Phase Spatio-Temporal Interpolation Framework for Reconstructing Block Group Level Housing Values in California","fulltext":[{"header":"1. Introduction","content":"\u003cp\u003eHousing values exhibit pronounced spatial and temporal heterogeneity shaped by local market conditions, accessibility, demographic composition, and broader macroeconomic dynamics. Understanding the historical evolution of housing values at fine spatial scales is critical for urban planning, housing affordability assessment, and research on socioeconomic inequality [\u003cspan additionalcitationids=\"CR2\" citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e]. Despite its importance, high-resolution historical housing data remain limited in the United States because census-based housing values are reported only at decennial intervals and are further complicated by frequent changes in census geographies, leading to substantial spatial and temporal gaps [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eTo address these limitations, previous studies have explored a range of spatial and spatio-temporal modeling approaches, including areal interpolation, spatial regression, space\u0026ndash;time distance-weighted interpolation, and global trend fitting [\u003cspan additionalcitationids=\"CR7 CR8 CR9\" citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e]. While these methods have proven useful in certain contexts, many implicitly assume smooth global temporal trends or continuous space\u0026ndash;time dependence. Such assumptions are often violated in housing markets, where abrupt regime shifts\u0026mdash;such as economic recessions, housing booms, and policy interventions\u0026mdash;can produce non-linear and locally heterogeneous temporal dynamics [\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e, \u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e]. Moreover, empirical evidence consistently shows that housing values exhibit strong spatial autocorrelation at fixed time points and strong short-range temporal autocorrelation at fixed locations, suggesting that spatial and temporal processes may operate at distinct scales rather than as a single coupled process [\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e, \u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eMotivated by these observations, this study proposes a two-phase spatio-temporal interpolation framework that explicitly separates spatial and temporal dependency modeling. In the first phase, spatial structure is reconstructed independently at each census year using ordinary kriging to fully capture contemporaneous spatial autocorrelation. In the second phase, temporal interpolation and extrapolation are performed independently at fixed block-group locations using a locally constrained inverse time-distance weighting (ITDW) strategy that prioritizes nearest temporal neighbors. By design, this framework avoids unrealistic global temporal trends and reduces spatio-temporal leakage that can arise in fully coupled space\u0026ndash;time interpolation methods.\u003c/p\u003e \u003cp\u003eUsing California block-group housing values as a case study, we demonstrate that the proposed framework yields substantially improved reconstruction accuracy for both interpolated and extrapolated years when compared with traditional spatial residual-based and space\u0026ndash;time distance-weighted approaches. The results highlight the importance of explicitly decoupling spatial and temporal processes when reconstructing long-term socioeconomic data from sparse historical observations.\u003c/p\u003e"},{"header":"2. Study Area and Data","content":"\u003cp\u003eThis study focuses on the state of California, which exhibits pronounced spatial heterogeneity in housing markets, ranging from dense coastal metropolitan regions to suburban, exurban, and rural inland areas (\u003cb\u003eFig.\u0026nbsp;1\u003c/b\u003e). California\u0026rsquo;s diverse economic structure, large population, and substantial regional housing disparities make it an ideal case for evaluating spatio-temporal reconstruction methods at fine spatial scales [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e, \u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eMedian housing values at the census block-group level were obtained from U.S. Census sources [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e] for four decennial years: 1990, 2000, 2010, and 2020 (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003e). These data represent the most spatially detailed and consistently available housing value information over long historical periods in the United States [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e]. However, differences in census block-group boundaries across decades introduce spatial inconsistencies that prevent direct temporal comparison.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eTo ensure spatial consistency across time, all historical housing values were reconstructed onto a unified 2020 block-group geometry using areal harmonization techniques commonly adopted in small-area estimation and spatial interpolation studies [\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e, \u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e]. Block-group centroids were used to represent spatial locations during interpolation, enabling point-based spatial modeling while preserving block-group-level interpretation. This approach eliminates inconsistencies arising from boundary changes and allows direct temporal comparison of reconstructed housing values across the full study period.\u003c/p\u003e"},{"header":"3. Methodology","content":"\u003cdiv id=\"Sec4\" class=\"Section2\"\u003e \u003ch2\u003e3.1 Overview of the Two-Phase Framework\u003c/h2\u003e \u003cp\u003eThe proposed framework consists of two sequential and explicitly decoupled phases designed (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003e) to separately model spatial and temporal dependence in census-based housing data:\u003c/p\u003e \u003cp\u003e \u003col\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003e \u003cb\u003eSpatial Reconstruction (Phase 1)\u003c/b\u003e: Ordinary kriging is applied independently to each census year to reconstruct spatially continuous housing values across a unified block-group geometry, explicitly capturing spatial autocorrelation.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003e \u003cb\u003eTemporal Interpolation (Phase 2)\u003c/b\u003e: At each fixed block group, housing values for target years are estimated using a local inverse time-distance weighting (ITDW) approach that exploits short-range temporal autocorrelation.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003c/ol\u003e \u003c/p\u003e \u003cp\u003eBy separating spatial and temporal modeling, the framework avoids imposing strong assumptions of continuous space\u0026ndash;time smoothness and allows each dependency structure to be modeled at its appropriate scale [\u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e, \u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e].\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec5\" class=\"Section2\"\u003e \u003ch2\u003e3.2 Phase 1: Spatial Reconstruction Using Ordinary Kriging\u003c/h2\u003e \u003cp\u003eIn the first phase, spatial reconstruction of housing values is performed independently for each census year (1990, 2000, 2010, and 2020) using ordinary kriging (OK). Ordinary kriging is a geostatistical interpolation method that explicitly models spatial autocorrelation and provides best linear unbiased predictions under second-order stationarity assumptions [\u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e, \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e]. Its application is particularly appropriate for housing markets, which are well known to exhibit strong spatial dependence consistent with Tobler\u0026rsquo;s First Law of Geography [\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eFor each census year, observed block-group median housing values are represented at block-group centroid locations. An empirical variogram is computed to quantify how variance between housing values changes as a function of spatial separation distance. The empirical variogram characterizes the spatial autocorrelation structure by relating semivariance to distance [\u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e]. A theoretical variogram model is then fitted and used within the ordinary kriging system to derive optimal interpolation weights.\u003c/p\u003e \u003cp\u003eThe kriging weights are obtained by minimizing the estimation variance subject to an unbiasedness constraint, ensuring that predicted housing values reflect the dominant influence of nearby block groups while accounting for spatial redundancy. Spatial reconstruction is conducted independently for each census year to avoid imposing assumptions about temporal continuity during spatial estimation. This design yields spatially continuous housing value surfaces on a unified 2020 block-group geometry while preserving year-specific spatial patterns.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec6\" class=\"Section2\"\u003e \u003ch2\u003e3.3 Phase 2: Temporal Interpolation Using Inverse Time-Distance Weighting\u003c/h2\u003e \u003cp\u003eFollowing spatial reconstruction, each block group possesses reconstructed housing values at four discrete census years. Temporal interpolation and extrapolation are then performed independently at each block group using a local inverse time-distance weighting (ITDW) approach.\u003c/p\u003e \u003cp\u003eRather than fitting global temporal trends or regression-based curves, ITDW exploits short-range temporal autocorrelation by assigning greater influence to temporally proximate observations [\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e]. For a target year \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\\(\\:{t}_{0}\\)\u003c/span\u003e\u003c/span\u003e, the estimated housing value \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\\(\\:\\widehat{H}\\left({t}_{0}\\right)\\)\u003c/span\u003e\u003c/span\u003eis computed as:\u003cdiv id=\"Equa\" class=\"Equation\"\u003e\u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Equa\" name=\"EquationSource\"\u003e\n$$\\:\\widehat{H}\\left({t}_{0}\\right)=\\frac{\\sum\\:_{i=1}^{n}\\frac{H\\left({t}_{i}\\right)}{\\mid\\:{t}_{0}-{t}_{i}{\\mid\\:}^{p}}}{\\sum\\:_{i=1}^{n}\\frac{1}{\\mid\\:{t}_{0}-{t}_{i}{\\mid\\:}^{p}}}$$\u003c/div\u003e\u003c/div\u003e\u003c/p\u003e \u003cp\u003ewhere \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\\(\\:H\\left({t}_{i}\\right)\\)\u003c/span\u003e\u003c/span\u003edenotes the reconstructed housing value at census year \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\\(\\:{t}_{i}\\)\u003c/span\u003e\u003c/span\u003e, \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\\(\\:p\\)\u003c/span\u003e\u003c/span\u003eis a temporal decay parameter controlling the rate at which influence decreases with temporal distance, and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\\(\\:n\\)\u003c/span\u003e\u003c/span\u003edenotes the number of available census anchors.\u003c/p\u003e \u003cp\u003eTemporal interpolation is restricted to a local temporal neighborhood to ensure that estimates are dominated by nearest census years, thereby avoiding unrealistic influence from distant historical observations. This local temporal constraint allows abrupt market shifts and regime changes to be accommodated without enforcing smooth global trends. By decoupling spatial reconstruction and temporal interpolation, the framework preserves spatial and temporal autocorrelation structures independently and reduces spatio-temporal leakage commonly observed in fully coupled space\u0026ndash;time interpolation methods [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec7\" class=\"Section2\"\u003e \u003ch2\u003e3.4 Baseline Methods for Comparison\u003c/h2\u003e \u003cp\u003eTo evaluate the effectiveness of the proposed two-phase framework, its performance is compared against two commonly used baseline approaches:\u003c/p\u003e \u003cp\u003e \u003col\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003e \u003cb\u003eSpatial Residual-Based Interpolation\u003c/b\u003e: Housing values are estimated by interpolating spatial residuals derived from a single reference year and applying them across time, implicitly assuming temporal stationarity of spatial patterns.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003e \u003cb\u003eSpatio-Temporal Distance-Weighted Interpolation (ST-IDW)\u003c/b\u003e: Spatial and temporal distances are combined into a unified distance metric, assuming continuous space\u0026ndash;time smoothness and equal coupling between spatial and temporal dependence [\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e].\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003c/ol\u003e \u003c/p\u003e \u003cp\u003eThese baselines represent widely adopted approaches for reconstructing missing spatio-temporal socioeconomic data and provide appropriate benchmarks for evaluating reconstruction accuracy.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec8\" class=\"Section2\"\u003e \u003ch2\u003e3.5 Spatial Autocorrelation Validation Using Moran\u0026rsquo;s I\u003c/h2\u003e \u003cp\u003eTo assess whether reconstructed housing value surfaces preserve realistic spatial dependence, spatial autocorrelation was evaluated using Global Moran\u0026rsquo;s I, a widely used statistic for diagnosing spatial structure in socioeconomic variables such as housing prices [\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e, \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eGlobal Moran\u0026rsquo;s I is defined as:\u003cdiv id=\"Equb\" class=\"Equation\"\u003e\u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Equb\" name=\"EquationSource\"\u003e\n$$\\:I=\\frac{N}{W}\\frac{\\sum\\:_{i}\\sum\\:_{j}{w}_{ij}({x}_{i}-\\stackrel{\\prime }{x})({x}_{j}-\\stackrel{\\prime }{x})}{\\sum\\:_{i}({x}_{i}-\\stackrel{\\prime }{x}{)}^{2}}$$\u003c/div\u003e\u003c/div\u003e\u003c/p\u003e \u003cp\u003ewhere \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\\(\\:N\\)\u003c/span\u003e\u003c/span\u003eis the number of spatial units, \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\\(\\:{x}_{i}\\)\u003c/span\u003e\u003c/span\u003eand \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\\(\\:{x}_{j}\\)\u003c/span\u003e\u003c/span\u003eare housing values at locations \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\\(\\:i\\)\u003c/span\u003e\u003c/span\u003eand \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\\(\\:j\\)\u003c/span\u003e\u003c/span\u003e, \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\\(\\:\\stackrel{\\prime }{x}\\)\u003c/span\u003e\u003c/span\u003eis the mean housing value, \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\\(\\:{w}_{ij}\\)\u003c/span\u003e\u003c/span\u003edenotes the spatial weight between units \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\\(\\:i\\)\u003c/span\u003e\u003c/span\u003eand \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\\(\\:j\\)\u003c/span\u003e\u003c/span\u003e, and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\\(\\:W={\\sum\\:}_{i}{\\sum\\:}_{j}{w}_{ij}\\)\u003c/span\u003e\u003c/span\u003e.\u003c/p\u003e \u003cp\u003eMoran\u0026rsquo;s I values range from \u0026minus;\u0026thinsp;1 to +\u0026thinsp;1, with positive values indicating spatial clustering, negative values indicating spatial dispersion, and values near zero indicating spatial randomness. Statistical significance was assessed using permutation-based tests.\u003c/p\u003e \u003cp\u003eSpatial autocorrelation was computed for reconstructed housing value surfaces for \u003cb\u003e2015\u003c/b\u003e (interpolated year) and \u003cb\u003e2023\u003c/b\u003e (extrapolated year) using a row-standardized contiguity-based spatial weights matrix at the block-group level. These years were selected to evaluate whether spatial dependence is preserved both within and beyond the temporal range of observed census data. By validating spatial autocorrelation in reconstructed outputs, this analysis ensures that the proposed framework not only minimizes prediction error but also maintains the intrinsic spatial structure characteristic of housing markets.\u003c/p\u003e \u003c/div\u003e"},{"header":"4. Results","content":"\u003cdiv id=\"Sec10\" class=\"Section2\"\u003e \u003ch2\u003e4.1 Reconstruction Performance Across Census and Non-Census Years\u003c/h2\u003e \u003cp\u003eThe performance of the proposed two-phase spatio-temporal interpolation framework was evaluated for both interpolated and extrapolated non-census years using observed American Community Survey (ACS) median house values as reference data. Validation was conducted at the census block-group level using three commonly adopted accuracy metrics: coefficient of determination (R\u0026sup2;), root mean square error (RMSE), and mean absolute error (MAE). Overall, the framework demonstrates consistently strong predictive performance across California, indicating its ability to effectively capture both spatial variability and temporal dynamics in housing values (Figs.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003e\u0026ndash;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003e).\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec11\" class=\"Section2\"\u003e \u003ch2\u003e4.2 Validation for Interpolated and Near-Term Extrapolated Years\u003c/h2\u003e \u003cp\u003eFor interpolated years, reconstructed housing values exhibit strong agreement with observed ACS data, confirming the effectiveness of the proposed framework in capturing short-range temporal dynamics. Among the interpolated years, \u003cb\u003e2017\u003c/b\u003e, located near the midpoint between the 2010 and 2020 census anchors, achieves high validation accuracy with an \u003cb\u003eR\u0026sup2; of 0.79\u003c/b\u003e across \u003cb\u003e21,546 block groups\u003c/b\u003e (RMSE\u0026thinsp;=\u0026thinsp;166,628; MAE\u0026thinsp;=\u0026thinsp;171,319). Performance improves further for \u003cb\u003e2019\u003c/b\u003e, which lies closer to the 2020 census anchor, achieving an \u003cb\u003eR\u0026sup2; of 0.84\u003c/b\u003e across \u003cb\u003e21,535 block groups\u003c/b\u003e (RMSE\u0026thinsp;=\u0026thinsp;161,151; MAE\u0026thinsp;=\u0026thinsp;96,550).\u003c/p\u003e \u003cp\u003eEarlier interpolated years exhibit slightly lower but still robust performance. Reconstructed housing values for \u003cb\u003e2015\u003c/b\u003e yield an \u003cb\u003eR\u0026sup2; of 0.704\u003c/b\u003e across \u003cb\u003e21,593 block groups\u003c/b\u003e (RMSE\u0026thinsp;=\u0026thinsp;176,866; MAE\u0026thinsp;=\u0026thinsp;139,015), while \u003cb\u003e2013\u003c/b\u003e, which is more temporally distant from the nearest census anchor, achieves an \u003cb\u003eR\u0026sup2; of 0.643\u003c/b\u003e across \u003cb\u003e22,060 block groups\u003c/b\u003e (RMSE\u0026thinsp;=\u0026thinsp;147,055; MAE\u0026thinsp;=\u0026thinsp;126,118). This progressive improvement in accuracy as target years approach census anchors highlights the importance of nearest temporal neighbors in housing value reconstruction.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec12\" class=\"Section2\"\u003e \u003ch2\u003e4.3 Validation for Extrapolated Year\u003c/h2\u003e \u003cp\u003eTo assess extrapolation performance, reconstructed housing values for \u003cb\u003e2021, 2022, and 2023\u003c/b\u003e were compared with corresponding ACS observations. Despite the absence of a census anchor beyond 2020, the proposed framework maintains strong predictive capability for near-term extrapolation. \u003cb\u003e2021\u003c/b\u003e, which is immediately adjacent to the 2020 census anchor, achieves an exceptionally high \u003cb\u003eR\u0026sup2; of 0.95\u003c/b\u003e across \u003cb\u003e23,089 block groups\u003c/b\u003e (RMSE\u0026thinsp;=\u0026thinsp;97,493; MAE\u0026thinsp;=\u0026thinsp;59,130), reflecting minimal temporal drift.\u003c/p\u003e \u003cp\u003eAs extrapolation distance increases, predictive performance declines in a controlled and expected manner. \u003cb\u003e2022\u003c/b\u003e achieves an \u003cb\u003eR\u0026sup2; of 0.79\u003c/b\u003e across \u003cb\u003e23,096 block groups\u003c/b\u003e (RMSE\u0026thinsp;=\u0026thinsp;205,000; MAE\u0026thinsp;=\u0026thinsp;158,421), while \u003cb\u003e2023\u003c/b\u003e yields an \u003cb\u003eR\u0026sup2; of 0.68\u003c/b\u003e across \u003cb\u003e23,047 block groups\u003c/b\u003e (RMSE\u0026thinsp;=\u0026thinsp;260,638; MAE\u0026thinsp;=\u0026thinsp;209,427). Despite increased uncertainty, reconstructed values for these years continue to preserve coherent spatial patterns and large-scale temporal trends (Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003e), underscoring the robustness of the nearest-time anchoring strategy.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec13\" class=\"Section2\"\u003e \u003ch2\u003e4.4 Summary of Validation Results\u003c/h2\u003e \u003cp\u003eAcross all evaluated non-census years, \u003cb\u003eR\u0026sup2; values range from 0.64 to 0.95\u003c/b\u003e (Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e), demonstrating stable and reliable performance under both interpolation and extrapolation scenarios. Interpolated years consistently achieve higher accuracy than more distant extrapolated years, while near-term extrapolation immediately following census anchors remains highly reliable. These findings indicate that resolving spatial autocorrelation first using ordinary kriging, followed by temporally constrained inverse time-distance weighting, provides a robust and interpretable strategy for reconstructing long-term block-group-level socioeconomic data from sparse census observations.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab1\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 1\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eValidation metrics for reconstructed block-group housing values\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"6\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c5\" colnum=\"5\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c6\" colnum=\"6\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eYear\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eValidation Type\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eBlock Groups (N)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eR\u0026sup2;\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c5\"\u003e \u003cp\u003eRMSE\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c6\"\u003e \u003cp\u003eMAE\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2013\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eInterpolation\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e22,060\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.643\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e147,055\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e126,118\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2015\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eInterpolation\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e21,593\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.704\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e176,866\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e139,015\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2017\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eInterpolation\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e21,546\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.79\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e166,628\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e171,319\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2019\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eInterpolation\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e21535\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.84\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e161151\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e96550\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2021\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eExtrapolation\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e23089\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.95\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e97493\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e59130\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2022\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eExtrapolation\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e23096\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.79\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e205000\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e158421\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2023\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eExtrapolation\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e23,047\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.680\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e260,638\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e209,427\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003ctfoot\u003e \u003ctr\u003e\u003ctd colspan=\"6\"\u003e\u003cb\u003eNote\u003c/b\u003e: \u003cem\u003eValidation is performed using observed ACS median house value data at the census block-group level. RMSE and MAE are reported in U.S. dollars.\u003c/em\u003e\u003c/td\u003e\u003c/tr\u003e \u003c/tfoot\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec14\" class=\"Section2\"\u003e \u003ch2\u003e4.5 Spatial Autocorrelation Assessment\u003c/h2\u003e \u003cp\u003eTo evaluate whether reconstructed housing values preserve realistic spatial dependence, \u003cb\u003eGlobal Moran\u0026rsquo;s I\u003c/b\u003e statistics were computed for both census and non-census years. Moran\u0026rsquo;s I provides a quantitative measure of spatial autocorrelation and is widely used to assess spatial clustering in socioeconomic variables.\u003c/p\u003e \u003cp\u003eFor non-census years, Moran\u0026rsquo;s I values remain strongly positive and statistically significant (\u003cb\u003ep\u0026thinsp;=\u0026thinsp;0.001\u003c/b\u003e) for all evaluated years (Table\u0026nbsp;\u003cspan refid=\"Tab2\" class=\"InternalRef\"\u003e2\u003c/span\u003e). Moran\u0026rsquo;s I values derived from ITDW-based reconstructions are consistently higher than those computed from temporally interpolated values alone, reflecting the re-imposition and preservation of spatial dependence through the spatial reconstruction phase. Spatial autocorrelation peaks for interpolated years between census anchors (2013\u0026ndash;2019) and declines modestly for extrapolated years (2021\u0026ndash;2023), indicating increasing temporal uncertainty while maintaining coherent spatial structure.\u003c/p\u003e \u003cp\u003eFor census years, Moran\u0026rsquo;s I values computed from observed and reconstructed housing values are identical and statistically significant (Table\u0026nbsp;\u003cspan refid=\"Tab3\" class=\"InternalRef\"\u003e3\u003c/span\u003e), confirming that the spatial reconstruction phase preserves intrinsic spatial autocorrelation without introducing smoothing artifacts. Together, these results demonstrate that the proposed two-phase framework achieves not only strong predictive accuracy but also faithful preservation of the spatial organization characteristic of housing markets.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab2\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 2\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eGlobal Moran\u0026rsquo;s I statistics for reconstructed housing values in non-census years\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"5\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c5\" colnum=\"5\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eYear\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eMoran\u0026rsquo;s I (Observed)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003ep-value\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eMoran\u0026rsquo;s I (ITWD)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c5\"\u003e \u003cp\u003ep-value\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2013\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.7956\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.8557\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2015\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.7768\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.8968\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2017\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.7921\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.9134\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2019\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.8119\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.9144\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2021\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.8025\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.8171\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2022\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.8018\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.82420\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2023\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.7981\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.8304\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003ctfoot\u003e \u003ctr\u003e\u003ctd colspan=\"5\"\u003e\u003cb\u003eNote\u003c/b\u003e: \u003cem\u003eMoran\u0026rsquo;s I statistics were computed using a row-standardized contiguity-based spatial weights matrix at the census block-group level. \u0026ldquo;Observed\u0026rdquo; values correspond to housing values obtained through temporal interpolation or extrapolation using inverse time-distance weighting (ITDW) prior to spatial reconstruction, while \u0026ldquo;ITDW\u0026rdquo; values represent outputs after applying the ITDW-based temporal interpolation or extrapolation. Statistical significance was assessed using permutation-based tests (p\u0026thinsp;=\u0026thinsp;0.001).\u003c/em\u003e\u003c/td\u003e\u003c/tr\u003e \u003c/tfoot\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab3\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 3\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eGlobal Moran\u0026rsquo;s I statistics for reconstructed housing values in census years\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"5\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c5\" colnum=\"5\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eYear\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eMoran\u0026rsquo;s I (Observed)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003ep-value\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eMoran\u0026rsquo;s I (Reconstructed)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c5\"\u003e \u003cp\u003ep-value\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e1990\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.6737\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.6737\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2000\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.7374\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.7374\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2010\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.7303\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.7303\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e2020\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e0.8006\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e0.8006\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e0.001\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003ctfoot\u003e \u003ctr\u003e\u003ctd colspan=\"5\"\u003e\u003cb\u003eNote\u003c/b\u003e: \u003cem\u003eMoran\u0026rsquo;s I statistics were computed using a row-standardized contiguity-based spatial weights matrix at the census block-group level. Identical Moran\u0026rsquo;s I values for observed and reconstructed data confirm that the spatial reconstruction phase preserves intrinsic spatial autocorrelation without introducing smoothing artifacts. Statistical significance was assessed using permutation-based tests (p\u0026thinsp;=\u0026thinsp;0.001).\u003c/em\u003e\u003c/td\u003e\u003c/tr\u003e \u003c/tfoot\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003c/div\u003e"},{"header":"5. Discussion","content":"\u003cp\u003eThe results underscore the importance of aligning interpolation strategies with the intrinsic spatio-temporal dynamics of housing markets. Housing values exhibit strong spatial autocorrelation at fixed time points and pronounced short-range temporal autocorrelation at fixed locations. Methods that impose linear, smooth, or globally continuous temporal trends across extended periods are therefore poorly suited to housing data, as they fail to capture abrupt market shifts, cyclical behavior, and regime changes driven by macroeconomic conditions and policy interventions.\u003c/p\u003e \u003cp\u003eBy explicitly decoupling spatial and temporal processes, the proposed two-phase framework mitigates spatio-temporal leakage and enhances reconstruction robustness. Spatial dependence is first resolved independently within each census year using ordinary kriging, ensuring realistic spatial structure is preserved. Temporal interpolation and extrapolation are then constrained to nearest temporal neighbors at fixed locations using inverse time-distance weighting. The strong predictive performance achieved using only four decennial observations demonstrates that local temporal weighting can effectively recover annual housing dynamics when spatial structure is properly reconstructed.\u003c/p\u003e \u003cp\u003eBeyond empirical accuracy, the proposed framework offers a transparent and interpretable baseline for more advanced spatio-temporal learning approaches. By explicitly separating spatial and temporal effects, the framework facilitates clearer diagnosis of model behavior and uncertainty, and provides a principled benchmark for evaluating graph-based and deep learning models. This separation is particularly valuable for hybrid modeling strategies that seek to integrate geostatistical methods with data-driven learning while maintaining interpretability.\u003c/p\u003e"},{"header":"6. Implications","content":"\u003cp\u003eThe proposed two-phase spatio-temporal interpolation framework has several important implications for geospatial analysis, urban studies, and the reconstruction of long-term socioeconomic data.\u003c/p\u003e \u003cp\u003e \u003cb\u003eFirst\u003c/b\u003e, from a methodological perspective, the results demonstrate that explicitly aligning interpolation strategies with the intrinsic dynamics of the target phenomenon is critical for reliable spatio-temporal reconstruction. Housing values exhibit strong spatial dependence at fixed time points and dominant short-range temporal dependence at fixed locations. The strong performance of the proposed framework highlights that separating spatial and temporal modeling\u0026mdash;rather than enforcing fully coupled space\u0026ndash;time smoothness\u0026mdash;can substantially improve both accuracy and robustness when observations are temporally sparse.\u003c/p\u003e \u003cp\u003e \u003cb\u003eSecond\u003c/b\u003e, the framework provides a practical and scalable solution for reconstructing fine-scale historical housing data in contexts where annual observations are unavailable or inconsistent. By relying solely on decennial census data and a transparent interpolation strategy, the approach enables the generation of annual block-group-level housing estimates suitable for longitudinal urban analysis, housing affordability assessment, and studies of neighborhood change. The use of a unified spatial geometry further ensures temporal comparability, addressing a long-standing challenge in census-based socioeconomic research.\u003c/p\u003e \u003cp\u003e \u003cb\u003eThird\u003c/b\u003e, the interpretability of the two-phase design has important implications for model transparency and reproducibility. Unlike black-box spatio-temporal learning approaches, the framework explicitly separates spatial and temporal effects, allowing researchers to diagnose sources of uncertainty, evaluate spatial realism using established metrics, and understand how temporal proximity influences reconstructed values. This transparency is particularly valuable for policy-oriented urban research and comparative regional studies, where methodological clarity and interpretability are essential.\u003c/p\u003e \u003cp\u003e \u003cb\u003eFourth\u003c/b\u003e, the framework demonstrates strong potential for near-term forecasting of housing values when recent census anchors are available. Anchored by the 2020 census, the proposed approach achieves high predictive accuracy for subsequent years, including 2021, 2022, and 2023. This result suggests that locally constrained temporal weighting, when combined with robust spatial reconstruction, can effectively support short-term housing market analysis without requiring complex forecasting models or extensive auxiliary data.\u003c/p\u003e \u003cp\u003e \u003cb\u003eFifth\u003c/b\u003e, the framework offers a powerful solution for spatial interpolation under conditions of severe data sparsity. Across multiple non-census years, a substantial number of block groups lack observed housing values (e.g., over 1,000\u0026ndash;2,500 missing block groups per year between 2013 and 2023). The proposed framework successfully reconstructs housing values for these missing spatial units while preserving spatial autocorrelation and minimizing interpolation artifacts. This capability highlights the framework\u0026rsquo;s utility for recovering massive spatial gaps in fine-scale socioeconomic datasets.\u003c/p\u003e \u003cp\u003e \u003cb\u003eFinally\u003c/b\u003e, the proposed framework establishes a conceptual and computational baseline for future spatio-temporal modeling efforts. By resolving spatial structure prior to temporal interpolation, the approach can serve as a benchmark or initialization strategy for more advanced methods, including graph-based and hybrid spatio-temporal learning models. As such, the implications of this work extend beyond housing data reconstruction, offering a generalizable strategy for recovering fine-scale socioeconomic variables from sparse historical observations across diverse geographic and thematic contexts.\u003c/p\u003e"},{"header":"7. Limitations and Future Work","content":"\u003cp\u003eDespite its strong performance and interpretability, the proposed two-phase framework has several limitations that suggest directions for future research.\u003c/p\u003e \u003cp\u003e \u003cb\u003eFirst\u003c/b\u003e, the temporal interpolation component relies on a local inverse time-distance weighting (ITDW) scheme with a fixed temporal decay parameter. While this design effectively captures short-range temporal autocorrelation and avoids imposing unrealistic global temporal trends, it does not explicitly account for heterogeneous temporal dynamics across different housing submarkets. As illustrated in Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003e, predicted values for 2015 exhibit a saturation effect at the upper end of the value range, where extremely high housing prices are systematically underestimated and form an apparent ceiling relative to observed ACS values. This behavior reflects the averaging nature of fixed-parameter temporal weighting, which dampens extreme values when nearby temporal anchors do not exhibit comparable magnitudes. Future work could explore adaptive temporal weighting strategies, locally varying decay parameters, or data-driven temporal kernels to better accommodate region-specific housing dynamics, market segmentation, and nonlinear growth regimes.\u003c/p\u003e \u003cp\u003e \u003cb\u003eSecond\u003c/b\u003e, the framework assumes that spatial autocorrelation within each census year can be adequately characterized using block-group centroids and stationary variogram models. Although this assumption is common in geostatistical applications, housing markets may exhibit non-stationary spatial dependence due to urban\u0026ndash;rural gradients, localized development patterns, or policy-driven effects. Extending the framework to incorporate non-stationary kriging approaches or multi-scale spatial modeling may further enhance spatial reconstruction accuracy.\u003c/p\u003e \u003cp\u003e \u003cb\u003eThird\u003c/b\u003e, the current implementation focuses on univariate housing values and does not explicitly incorporate auxiliary socioeconomic or environmental covariates, such as median income, population density, or accessibility measures. Integrating such covariates through co-kriging or hybrid spatial\u0026ndash;statistical models represents a promising extension that could improve reconstruction performance, particularly in regions with sparse or noisy observations.\u003c/p\u003e \u003cp\u003e \u003cb\u003eFinally\u003c/b\u003e, while the proposed framework is intended as a transparent and interpretable baseline, it does not directly model complex nonlinear spatio-temporal interactions. Future research could integrate this two-phase design with advanced spatio-temporal learning approaches, such as graph neural networks, by using the reconstructed spatial and temporal structures as informed priors or initialization strategies. Such hybrid models may combine the interpretability and stability of geostatistical methods with the representational flexibility of deep learning, while maintaining robustness under sparse temporal observations.\u003c/p\u003e"},{"header":"8. Conclusion","content":"\u003cp\u003eThis study proposes a two-phase spatio-temporal interpolation framework for reconstructing block-group-level housing values from sparse historical observations. By explicitly separating spatial reconstruction and temporal interpolation, the framework leverages spatial and temporal autocorrelation more effectively than conventional approaches that impose globally smooth space\u0026ndash;time dependence. Spatial structure is first reconstructed independently for each census year using ordinary kriging, followed by locally constrained temporal interpolation at fixed block-group locations, resulting in stable and accurate annual housing value estimates.\u003c/p\u003e \u003cp\u003eEmpirical validation using California housing data demonstrates that the proposed framework achieves strong predictive performance for both interpolated and extrapolated years while preserving realistic spatial dependence patterns. The results confirm that housing values are more strongly influenced by nearest temporal neighbors than by long-range temporal trends, and that decoupling spatial and temporal processes reduces spatio-temporal leakage and improves reconstruction robustness under temporally sparse observations.\u003c/p\u003e \u003cp\u003eBeyond empirical accuracy, the proposed framework offers a scalable and interpretable solution for reconstructing fine-scale historical socioeconomic data from census sources. It also establishes a transparent methodological baseline that can support subsequent integration with advanced spatio-temporal learning models, such as graph-based approaches, enabling systematic comparison and hybrid modeling in future research.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003eData Availability Statement\u003c/p\u003e\n\u003cp\u003eThe census-based socioeconomic data used in this study are publicly available from the IPUMS National Historical Geographic Information System (NHGIS) [4]. Derived datasets, graph structures, and model inputs generated during the study are available from the corresponding author upon reasonable request.\u003c/p\u003e\n\u003cp\u003eEthics Statement\u003c/p\u003e\n\u003cp\u003eThe authors confirm that they have read and complied with the ethical requirements for publication in the\u0026nbsp;\u003cem\u003eScientific Reports\u003c/em\u003e. This study does not involve human participants, animal subjects, or data collected from social media platforms.\u003c/p\u003e\n\u003cp\u003eGenerative artificial intelligence tools, specifically\u0026nbsp;\u003cstrong\u003eChatGPT (OpenAI)\u003c/strong\u003e, were used during manuscript preparation for language editing, text refinement, and clarity enhancement. All content generated with the assistance of this tool was carefully reviewed, verified, and edited by the authors, who take full responsibility for the accuracy, integrity, and originality of the final manuscript.\u003c/p\u003e\n\u003cp\u003eAuthor Contributions\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eShuang Tian\u003c/strong\u003e was responsible for data curation, data analysis, methodology, resources, visualization and writing original drafts.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eDr. Fang Qiu\u003c/strong\u003e was responsible for conceptualization, investigation, project administration, supervision, validation, review and editing.\u003c/p\u003e\n\u003cp\u003eCompeting Interests\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eThe author declares no competing interests.\u003c/p\u003e\n\u003cp\u003eFunding Statement\u003c/p\u003e\n\u003cp\u003eThis research did not receive funding.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003eAnselin, L. \u003cem\u003eSpatial Econometrics: Methods and Models\u003c/em\u003e (Kluwer Academic, 1988).\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGlaeser, E. L., Gyourko, J. \u0026amp; Saks, R. Why is Manhattan so expensive? Regulation and the rise in housing prices. \u003cem\u003eJ. Law Econ.\u003c/em\u003e \u003cb\u003e48\u003c/b\u003e (2), 331\u0026ndash;369 (2005).\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBailey, N., van Gent, W. \u0026amp; Musterd, S. Remaking urban segregation. \u003cem\u003eUrban Stud.\u003c/em\u003e \u003cb\u003e54\u003c/b\u003e (14), 3284\u0026ndash;3302 (2017).\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eJonathan Schroeder, D. et al. IPUMS National Historical Geographic Information System: Version 20.0 [dataset]. Minneapolis, MN: IPUMS. (2025). \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttp://doi.org/10.18128/D050.V20.0\u003c/span\u003e\u003cspan address=\"10.18128/D050.V20.0\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLogan, J. R., Xu, Z. \u0026amp; Stults, B. Interpolating U.S. decennial census tract data from as far back as 1970. \u003cem\u003eGeographical Anal.\u003c/em\u003e \u003cb\u003e46\u003c/b\u003e (4), 412\u0026ndash;434 (2014).\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eQiu, F. \u0026amp; Cromley, R. Introduction: Areal interpolation and dasymetric modeling. \u003cem\u003eGeographical Anal.\u003c/em\u003e \u003cb\u003e45\u003c/b\u003e (3), 213\u0026ndash;215. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.2747/1548-1603.49.5.644\u003c/span\u003e\u003cspan address=\"10.2747/1548-1603.49.5.644\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e (2013).\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKyriakidis, P. C. \u0026amp; Journel, A. G. Geostatistical space\u0026ndash;time models. \u003cem\u003eMath. Geol.\u003c/em\u003e \u003cb\u003e31\u003c/b\u003e (6), 651\u0026ndash;684 (1999).\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHuang, B., Wu, B. \u0026amp; Barry, M. Geographically and temporally weighted regression. \u003cem\u003eInt. J. Geogr. Inf. Sci.\u003c/em\u003e \u003cb\u003e24\u003c/b\u003e (3), 383\u0026ndash;401. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1080/13658810802672469\u003c/span\u003e\u003cspan address=\"10.1080/13658810802672469\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e (2010).\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLi, S. et al. Spatiotemporal Analysis of Housing Prices in China: A Big Data Perspective. \u003cem\u003eAppl. Spat. Anal.\u003c/em\u003e \u003cb\u003e10\u003c/b\u003e, 421\u0026ndash;433. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/s12061-016-9185-3\u003c/span\u003e\u003cspan address=\"10.1007/s12061-016-9185-3\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e (2017).\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGao, S., Janowicz, K. \u0026amp; Couclelis, H. Extracting urban functional regions from points of interest. \u003cem\u003eTrans. GIS\u003c/em\u003e. \u003cb\u003e21\u003c/b\u003e (3), 446\u0026ndash;467. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1111/tgis.12289\u003c/span\u003e\u003cspan address=\"10.1111/tgis.12289\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e (2017).\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCase, K. E. \u0026amp; Shiller, R. J. Is there a bubble in the housing market? \u003cem\u003eBrook. Papers Econ. Act.\u003c/em\u003e \u003cb\u003e2\u003c/b\u003e, 299\u0026ndash;362 (2003).\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGyourko, J., Mayer, C. \u0026amp; Sinai, T. Superstar cities. \u003cem\u003eAm. Economic Journal: Economic Policy\u003c/em\u003e. \u003cb\u003e5\u003c/b\u003e (4), 167\u0026ndash;199 (2013).\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePace, R. K. \u0026amp; Barry, R. Sparse spatial autoregressions. \u003cem\u003eStat. Probab. Lett.\u003c/em\u003e \u003cb\u003e33\u003c/b\u003e (3), 291\u0026ndash;297. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/S0167-7152(96)00140-X\u003c/span\u003e\u003cspan address=\"10.1016/S0167-7152(96)00140-X\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e (1997).\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eQiu, F., Zhang, C. \u0026amp; Zhou, Y. The development of an areal interpolation ArcGIS extension and a comparative study. \u003cem\u003eGIScience Remote Sens.\u003c/em\u003e \u003cb\u003e49\u003c/b\u003e (5), 644\u0026ndash;663. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.2747/1548-1603.49.5.644\u003c/span\u003e\u003cspan address=\"10.2747/1548-1603.49.5.644\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e (2012).\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eTobler, W. R. A computer movie simulating urban growth in the Detroit region. \u003cem\u003eEcon. Geogr.\u003c/em\u003e \u003cb\u003e46\u003c/b\u003e (Suppl.), 234\u0026ndash;240. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.2307/143141\u003c/span\u003e\u003cspan address=\"10.2307/143141\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e (1970).\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"scientific-reports","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"scirep","sideBox":"Learn more about [Scientific Reports](http://www.nature.com/srep/)","snPcode":"","submissionUrl":"","title":"Scientific Reports","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"Scientific Reports","inReviewEnabled":true,"inReviewRevisionsEnabled":true},"keywords":"Spatio-temporal interpolation, Ordinary kriging, Inverse time-distance weighting, Census block groups, Housing values","lastPublishedDoi":"10.21203/rs.3.rs-8674976/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-8674976/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eHigh-resolution historical housing value data are essential for urban analysis, socioeconomic research, and policy evaluation. However, housing values reported by the U.S. Census are only available at decennial intervals and are subject to changes in administrative boundaries, resulting in substantial spatial and temporal gaps. This study proposes a two-phase spatio-temporal interpolation framework to reconstruct annual block-group-level housing values in California by explicitly decoupling spatial and temporal dependence. In the first phase, ordinary kriging is applied independently to each census year (1990, 2000, 2010, and 2020) to reconstruct spatially continuous housing values on a unified 2020 block-group geometry, leveraging strong spatial autocorrelation in housing markets. In the second phase, temporal interpolation and extrapolation are performed independently at each block group using a locally constrained inverse time-distance weighting (ITDW) approach that exploits short-range temporal autocorrelation while avoiding unrealistic global temporal trends. Validation against American Community Survey data demonstrates robust predictive performance, with R\u0026sup2; values ranging from 0.64 to 0.95 for interpolated years (2013\u0026ndash;2019) and from 0.68 to 0.84 for the extrapolated years (2021\u0026ndash;2023). The results indicate that housing values are more strongly influenced by nearest temporal neighbors than by long-range temporal trends, and that explicitly separating spatial and temporal processes enables stable and accurate reconstruction of fine-scale housing values from sparse historical observations.\u003c/p\u003e","manuscriptTitle":"A Two-Phase Spatio-Temporal Interpolation Framework for Reconstructing Block Group Level Housing Values in California","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-02-02 12:52:19","doi":"10.21203/rs.3.rs-8674976/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"decision","content":"Revision requested","date":"2026-03-21T10:36:29+00:00","index":"","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-03-19T12:16:20+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"244466282561766627488457683060773117235","date":"2026-02-22T09:34:49+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-02-19T15:01:29+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"235475276745593456348505876874642908928","date":"2026-01-29T12:16:43+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2026-01-29T10:54:58+00:00","index":"","fulltext":""},{"type":"editorInvited","content":"","date":"2026-01-27T15:28:50+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2026-01-24T05:06:19+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2026-01-24T05:04:59+00:00","index":"","fulltext":""},{"type":"submitted","content":"Scientific Reports","date":"2026-01-23T04:33:21+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"scientific-reports","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"scirep","sideBox":"Learn more about [Scientific Reports](http://www.nature.com/srep/)","snPcode":"","submissionUrl":"","title":"Scientific Reports","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"Scientific Reports","inReviewEnabled":true,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"19692956-fc0e-41b6-937d-72a7ff6d1f0f","owner":[],"postedDate":"February 2nd, 2026","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[{"id":61997271,"name":"Earth and environmental sciences/Environmental sciences"},{"id":61997272,"name":"Physical sciences/Mathematics and computing"}],"tags":[],"updatedAt":"2026-05-05T09:23:24+00:00","versionOfRecord":[],"versionCreatedAt":"2026-02-02 12:52:19","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-8674976","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-8674976","identity":"rs-8674976","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.