A comprehensive computational pipeline for foramen magnum shape analysis integrating elliptic Fourier transform, polygon approximation, and Procrustes heatmaps from CT images: application to forensic sex estimation | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article A comprehensive computational pipeline for foramen magnum shape analysis integrating elliptic Fourier transform, polygon approximation, and Procrustes heatmaps from CT images: application to forensic sex estimation Yasin Etli, Erhan Kartal, Mahmut Asirdizer, Yavuz Hekimoglu, Sıddık Keskin, and 3 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-8928788/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted 13 You are reading this latest preprint version Abstract Background The foramen magnum (FM) exhibits well-documented size-based sexual dimorphism, yet whether its shape independently differs between sexes remains unresolved. Previous attempts to characterize FM shape have relied on subjective visual classification into categorical types, producing inconsistent and irreproducible results. This study aimed to apply a comprehensive suite of computational geometric methods to CT-derived FM contours in order to objectively quantify shape variation and evaluate its discriminative capacity for sex estimation. Methods FM boundaries were automatically extracted from axial CT images of 473 adults (234 males, 239 females) from an Eastern Turkish population. Seven complementary geometric analyses—classical morphometry, Fitzgibbon and genetic-algorithm-optimized ellipse fitting, systematic polygon approximation (3–13 sides), normalized elliptic Fourier transform (20 harmonics), radial distance profiling, and Procrustes-based point-density heatmap construction—yielded 231 quantitative features per specimen. Sex differences were assessed using Mann–Whitney U tests with Bonferroni correction and Cohen’s d effect sizes. Classification performance was evaluated with linear discriminant analysis, support vector machines, and random forest under stratified 10-fold cross-validation. Within-sex morphological subtypes were explored using Gaussian mixture model clustering. Results Of 234 morphometric features, 92 (39.3%) showed significant sex differences after Bonferroni correction. Dimorphism was overwhelmingly size-driven: all six size measurements showed large effect sizes (Cohen’s d = 1.13–1.28, perimeter strongest at d = 1.284), while only 17 of 146 pure shape features (11.6%) reached significance and the FM index showed no sex difference (d = 0.088). The highest classification accuracy was 80.9% ± 4.9% (sensitivity 76.4%, specificity 85.3%), achieved by forward-selected shape features with LDA, outperforming size-only classification (75.6%). A novel Procrustes heatmap-based method yielded near-chance accuracy (54.1%), confirming size dominance. Gaussian mixture model analysis revealed greater male morphological heterogeneity (25.2% atypical variants) compared to more homogeneous female FM morphology (71.5% single dominant subtype). Conclusions FM sexual dimorphism is predominantly size-mediated, but computationally derived shape features provide incremental discriminative value that exceeds what size alone can achieve. This study introduces a fully automated, reproducible geometric morphometric pipeline that replaces subjective shape categorization with objective, continuous measurements. The framework is openly available and readily transferable to other skeletal structures amenable to CT-based contour analysis. Foramen magnum Sexual dimorphism Geometric morphometrics Elliptic Fourier transform Polygon approximation Computed tomography Forensic anthropology Sex estimation Procrustes analysis Machine learning Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Figure 6 Background The foramen magnum (FM), the largest opening in the skull base formed by the occipital bone, serves as the primary conduit between the cranial cavity and the vertebral canal, transmitting the medulla oblongata, vertebral arteries, spinal accessory nerves, and associated meningeal membranes [ 1 , 2 ]. Beyond its well-established clinical significance in cranio-cervical surgery and neuropathology, the FM has attracted considerable research interest due to its reported morphological variation across individuals, populations, and between sexes [ 3 – 6 ]. A central question that has persisted in the anatomical and forensic literature is whether sex-related differences in the FM are limited to overall size—with males exhibiting larger dimensions—or whether genuine shape differences also exist between the sexes [ 7 – 9 ]. The dimensional sexual dimorphism of the FM has been well documented across diverse populations. Numerous studies using dry skulls, radiographs, and computed tomography (CT) have consistently demonstrated that the anteroposterior diameter, transverse diameter, circumference, and area of the FM are significantly greater in males than in females [ 5 , 10 – 14 ]. A recent systematic review and meta-analysis confirmed these dimensional differences across multiple populations with sex estimation accuracies ranging from approximately 62% to 90% depending on the population and analytical method employed [ 15 ]. In the Turkish population specifically, Meral et al. reported a maximum multivariate accuracy of 75% [ 16 ], while Kartal et al. achieved up to 86.7% with discriminant analysis and 88.2% with artificial neural networks using dimensional measurements from 720 individuals [ 17 ]. These findings collectively establish that the FM exhibits moderate but consistent size-based sexual dimorphism that is primarily driven by the overall larger cranial dimensions in males. In contrast, the question of whether the FM also exhibits sex-related differences in shape—independent of size—remains substantially less resolved. Morphological descriptions in the forensic and anatomical literature have long suggested that the male FM tends to display a "rougher" or more irregular contour, while the female FM is described as "more oval" and "more regular" [ 4 , 18 ]. However, these descriptions are based on qualitative, observer-dependent assessments and have not been quantitatively validated. The most common approach to characterizing FM shape has been subjective visual classification into categorical types such as oval, round, egg-shaped, tetragonal, pentagonal, hexagonal, and irregular forms [ 18 – 21 ]. This approach suffers from several well-documented limitations. First, the shape categories themselves are not standardized across studies, leading to marked inconsistencies in the reported prevalence of each type; for instance, the FM has been reported as oval in 87.3% of crania in one study but in only 7.39% in another that utilized the same visual categorization approach [ 22 ]. Second, studies that have investigated whether these categorical shape types differ between sexes have generally concluded that FM shape is not a reliable sex discriminator [ 7 , 8 ], with the FM index (anteroposterior/transverse diameter ratio) showing no significant sex difference in several populations [ 6 , 7 , 23 ]. Zdilla et al. critically observed that previous shape studies utilized "ambiguous and subjective descriptors" that are "difficult to reproduce" and called for more reproducible methods of characterizing FM morphology [ 22 ]. The reliance on subjective categorical classification or simple ratios to characterize a complex, irregularly bounded anatomical contour fundamentally limits the ability to detect subtle but potentially meaningful shape differences. A two-dimensional closed contour such as the FM boundary contains considerably more morphological information than can be captured by two perpendicular diameters or a single aspect ratio [ 24 , 25 ]. Modern computational imaging and geometric analysis methods offer the potential to extract this richer shape information objectively. Elliptic Fourier analysis (EFA), introduced by Kuhl and Giardina [ 24 ], enables the mathematical decomposition of any closed contour into a series of harmonic ellipses, capturing both global shape properties and fine local boundary details in a compact numerical representation. This approach has been successfully applied to various anatomical contours, including the FM of the tropical raccoon, where Samuel and Bhagwat demonstrated that higher harmonics revealed fine morphological details relevant to sexual dimorphism that were invisible to conventional measurements [ 26 ]. Hayashizaki et al. applied Fourier analysis to pelvic contours from postmortem CT images and showed that frequency-domain shape descriptors could discriminate sex with greater objectivity than traditional morphometric methods [ 27 ]. Similarly, Gonzalez et al. utilized semi-landmark-based geometric morphometrics on the greater sciatic notch and ischiopubic complex to achieve over 90% sex estimation accuracy from shape variables alone [ 28 ]. In our own previous work, we applied advanced contour-based geometric analysis methods—including ellipse fitting—to pelvic CT images and demonstrated that computational shape features could objectively capture morphological variations that had previously been described only qualitatively [ 29 ]. These studies collectively suggest that computational geometric approaches, when applied to anatomical contour data extracted from CT images, can reveal shape information that is inaccessible to conventional linear measurements and subjective visual classification. Despite the clear potential of these computational shape analysis methods, no study to date has applied a comprehensive suite of such techniques to the FM contour for the purpose of objectively quantifying and validating the morphological descriptions that have persisted in the anatomical literature. Specifically, it remains unknown whether computationally derived metrics—such as circularity, boundary roughness, contour regularity, left-right and antero-posterior asymmetry, and harmonic shape descriptors—differ significantly between male and female FM contours when size effects are controlled for. Furthermore, whether any such pure shape features, once objectively defined, can contribute to or improve upon size-based sex estimation has not been systematically evaluated. Resolving these questions is important not only for advancing evidence-based anatomical knowledge, but also for practical applications in forensic medicine and anthropology, where the ability to estimate sex from fragmentary cranial remains—particularly the skull base, which is resistant to taphonomic destruction—can be critical in identification cases [ 30 , 31 ]. The present study aims to address these gaps through two integrated objectives. The first objective is to apply a comprehensive array of computational geometric methods—including Fitzgibbon ellipse fitting, genetic algorithm-optimized inner and outer bounding ellipses, systematic polygon approximation from 3-sided to 13-sided polygons, normalized elliptic Fourier transform with 20 harmonics, radial profile analysis, and point density heatmap construction—to FM boundary coordinates extracted from CT images of a large contemporary Turkish sample (n = 473), in order to objectively characterize and quantify morphological features that correspond to classical qualitative descriptions such as "oval," "round," "irregular," and "rough." The second objective is to evaluate the discriminative capacity of these objectively defined shape features for sex estimation, both independently and in combination with conventional size-based measurements, using multiple classification approaches including linear discriminant analysis, support vector machines, and random forest, with rigorous cross-validation; and to investigate whether unsupervised clustering reveals sex-specific morphological subtypes that may further elucidate the nature and extent of FM shape dimorphism. Methods Study design and ethical approval This retrospective cross-sectional study utilized cranial computed tomography (CT) images originally acquired for a prior investigation of foramen magnum (FM) sexual dimorphism in an Eastern Turkish population [ 17 ]. The original study protocol received approval from the Non-Interventional Clinical Research Ethics Committee of Van Yuzuncu Yil University Faculty of Medicine (decision number: 02, dated: 19.02.2017) and was conducted in accordance with the Declaration of Helsinki and the principles of Good Clinical Practice. Since the present study represents a secondary analysis of the same anonymized imaging dataset using computational geometric methods, no additional ethical approval was required. Informed consent was waived due to the retrospective design and full anonymization of all data. Study sample The source dataset comprised 720 cranial CT examinations (360 males and 360 females) from adult patients who presented to Yüzüncü Yıl University Dursun Odabaş Medical Center between January and December 2016 for various clinical indications unrelated to craniocervical pathology [ 17 ]. The original sample was constructed using stratified sampling across six adult age groups (21–30, 31–40, 41–50, 51–60, 61–70, and ≥ 71 years) with equal sex distribution to control for potential age-related confounding effects. Exclusion criteria in the original study included skull trauma causing cranial deformities, congenital or acquired cranial bone diseases, any malignancy affecting the skull, and cases with missing or defective FM boundaries [ 17 ]. For the present study, all 720 CT examinations were re-evaluated to determine their suitability for automated contour extraction. Cases were included if the axial CT slice displaying the widest FM cross-section yielded a boundary amenable to automated edge detection—specifically, sufficient contrast at the bone–soft tissue interface and a complete, uninterrupted FM margin. Cases with excessive image noise, partial volume artifacts obscuring the FM border, or incomplete boundary delineation after edge detection were excluded. After this additional selection, 473 individuals (234 males and 239 females) met the criteria and constituted the final study sample for geometric morphometric analysis. CT image acquisition All CT examinations had been performed on a 16-detector multislice CT system (Somatom Emotion 16-slice, CT2012E; Siemens AG, Berlin and Munich, Germany). Imaging parameters included a tube voltage of 120 kV with automatic tube current modulation ranging between 80 and 120 mA, a section thickness of 1.2 mm, a reconstruction interval of 1.0 mm, a pitch factor of 0.8, a field of view of 220 mm, and a matrix size of 256 × 256. Data acquisition was performed in the caudocranial direction without gantry tilt. Three-dimensional volume-rendered reformations were generated using Syngo VE.52A software (Siemens Healthcare) to minimize potential measurement errors arising from asymmetry or obliquity that may occur on conventional axial images [ 17 ]. The spatial calibration factor (mm per pixel) was individually extracted from the DICOM metadata of each examination and ranged from 0.271 to 0.513 mm/pixel (mean: 0.371 mm/pixel), ensuring accurate conversion of all pixel-based coordinate measurements to absolute millimeter values. Foramen magnum boundary extraction For each specimen, the axial CT slice displaying the widest cross-section of the FM was identified. The FM boundary was extracted using automated edge detection applied to the bone-windowed CT image at this level. The resulting contour coordinates (sequential X, Y pixel positions along the FM margin) were exported as individual comma-separated value (CSV) files. Each raw contour comprised between 36 and 187 boundary points (mean: 85.5). A key methodological advantage of this approach is that the FM boundary was derived through computational image processing rather than manual tracing. As a consequence, the contour extraction procedure is fully deterministic: processing the same CT image repeatedly yields identical boundary coordinates. This eliminates inter-observer and intra-observer variability, which are inherent limitations of manual measurement approaches and have been documented as a source of error in conventional FM morphometric studies [ 5 , 8 ]. To ensure comparability across specimens with different numbers of raw boundary points, all contours were resampled to 200 equally spaced points using cubic spline interpolation along the cumulative arc length prior to subsequent analysis [ 32 ]. Computational geometric analyses All computational analyses were implemented in Python 3.8 + using NumPy, SciPy, scikit-learn, and Matplotlib [ 33 – 36 ]. The complete analysis pipeline is publicly available as a single, self-contained script (FM_Comprehensive_Analysis.py) in the Zenodo repository accompanying this paper (DOI: 10.5281/zenodo.18716445 ), along with all anonymized boundary coordinate data, enabling full reproducibility. Classical morphometric measurements Eleven standard morphometric parameters were computed from each resampled contour: anteroposterior (AP) diameter (maximum extent along the Y-axis), transverse diameter (maximum extent along the X-axis), maximum and minimum diameters (derived from the convex hull), perimeter (sum of consecutive point-to-point Euclidean distances along the closed contour), area (computed using the shoelace formula [ 37 ]), FM index (AP/transverse diameter ratio), circularity (4πA/P², where A is area and P is perimeter), eccentricity (maximum/minimum diameter ratio), area-to-perimeter ratio (A/P), and compactness (P/√A). These parameters correspond to those commonly reported in the FM literature [ 5 , 6 , 8 , 16 , 17 ], thereby facilitating direct comparison with previously published dimensional data. Ellipse fitting Three ellipse models were fitted to each FM contour using distinct optimization strategies, together with a best-fit circle. The Fitzgibbon direct least-squares ellipse was obtained by minimizing the algebraic distance from contour points to a general conic equation, with an ellipticity constraint ensuring a valid ellipse solution [ 38 ]. Initial estimates were derived from the centroid and coordinate covariance, followed by geometric refinement via the Nelder-Mead simplex algorithm [ 39 ]. For each fitted ellipse, the following were recorded: center coordinates (cx, cy), semi-major axis (a), semi-minor axis (b), rotation angle (θ), four distance metrics (mean, standard deviation, and maximum Euclidean distance from each contour point to its nearest point on the ellipse boundary, and RMSE), the fitted ellipse area, the corresponding FM area, and the area ratio between the two. This yielded 12 features per ellipse model. The genetic algorithm (GA)-optimized inner bounding ellipse (the largest ellipse fully inscribed within the FM contour) and the GA-optimized outer bounding ellipse (the smallest ellipse fully circumscribing the FM contour) were estimated using a stochastic optimization approach. An initial population of 60 candidate ellipses (parameterized by center coordinates, semi-axes, and rotation angle) was evaluated over 150 generations, followed by local refinement using the Nelder-Mead algorithm. These bounding ellipses capture the deviation of the FM contour from an ideal elliptical shape: a large gap between inner and outer ellipses indicates a more irregular boundary. Each bounding ellipse model produced 12 features identical in structure to the Fitzgibbon model. The best-fit circle was obtained via algebraic least-squares fitting [ 40 ]. Ten features were extracted: center coordinates, radius, four distance metrics (mean, standard deviation, maximum, RMSE), circle area, FM area, and area ratio. In total, the three ellipse models and the circle model yielded 46 features. Polygon approximation To capture FM boundary irregularity at multiple geometric resolution levels, systematic polygon approximation was performed for polygons with 3 to 13 sides (11 polygon orders). For each polygon order n, the optimal n-sided polygon was determined by angular sectoring from the FM centroid, with the vertex of each sector placed at the farthest contour point from the centroid within that sector. Twenty offset rotations were evaluated per polygon order and the configuration yielding the minimum RMSE was selected. For each polygon order, six features were extracted: RMSE between the contour and the polygon edges (point-to-segment distance), mean distance, standard deviation of distance, maximum distance, polygon area, and the polygon-to-FM area ratio. This yielded 66 polygon features in total. The rationale for this multi-scale approach is that low-sided polygons (3–5 sides) capture gross shape asymmetry and angularity, while high-sided polygons (9–13 sides) approximate the contour more closely and their residual errors primarily reflect fine boundary roughness and local irregularity. The progression of RMSE across polygon orders thus provides an objective, multi-resolution characterization of contour complexity. Polygon elbow (optimal complexity) analysis To identify the polygon order at which additional sides yield diminishing returns in contour approximation accuracy, an elbow-point analysis was performed on each specimen's RMSE curve using the Kneedle algorithm [ 53 ]. For each specimen, the RMSE values for polygon orders 3 through 13 were extracted and the curve was normalized to the unit interval on both axes. A reference diagonal line connecting the first and last normalized points was constructed, and the deviation of the normalized RMSE curve from this diagonal was computed at each polygon order. The elbow point was defined as the polygon order at which this deviation was maximal, representing the point of maximum curvature—i.e., the transition from rapid to marginal improvement in contour fit with each additional polygon side [ 53 ]. For each specimen, five derived metrics were computed at the identified elbow point: (1) the optimal polygon order (number of sides), (2) the RMSE at the optimal polygon, (3) the total improvement in RMSE from 3-sided to the optimal polygon as a percentage of the 3-sided RMSE, (4) the remaining improvement from the optimal polygon to 13-sided as a percentage, and (5) an efficiency ratio defined as the total improvement up to the elbow divided by the remaining improvement beyond it. The elbow analysis was performed at both the individual specimen level and the group level. At the group level, RMSE values were averaged across all specimens within each sex and across the entire sample, and the Kneedle algorithm was applied to these mean curves. The distribution of individually determined optimal polygon orders was compared between males and females using the Mann–Whitney U test and Cohen's d to assess whether the two sexes differ in the geometric complexity required to adequately represent their FM boundaries. The step-by-step marginal improvement table—quantifying the percentage RMSE reduction at each polygon-order increment for males, females, and the pooled sample—was also computed and reported. Elliptic Fourier transform The normalized elliptic Fourier transform (EFT) was applied to each FM contour following the method of Kuhl and Giardina [ 24 ]. The EFT decomposes a closed two-dimensional contour into a series of harmonically related ellipses, where the first harmonic captures the gross elliptical shape and higher harmonics capture progressively finer morphological details. Twenty harmonics were computed, yielding 80 Fourier coefficients (four per harmonic: aₙ, bₙ, cₙ, dₙ). These coefficients were normalized for size, rotation, and starting-point invariance using the first-harmonic parameters, ensuring that the resulting descriptors reflect pure shape information independent of scale, orientation, and the arbitrary starting location of contour tracing [ 24 , 25 ]. In addition to the 80 normalized coefficients, 14 summary descriptors were derived: individual power ratios for harmonics 1–10 (the proportion of total spectral power contributed by each harmonic), a higher-harmonic energy ratio (the proportion of total power from harmonics 4–20, reflecting fine morphological detail relative to gross shape), total spectral power, a first-harmonic power ratio (power of the first harmonic relative to total power), and a symmetry index (ratio of odd-harmonic to even-harmonic power, reflecting bilateral symmetry [ 41 ]). This yielded 94 EFT features per specimen. Radial distance profile analysis To provide intuitive, physically interpretable shape descriptors that correspond directly to classical qualitative morphological descriptions, a radial distance profile was computed for each FM contour. Radial distances from the centroid to each contour point were calculated and sorted by angular position. From this profile, six descriptors were extracted: radial mean and standard deviation (in mm), reflecting overall size and radial variability; radial coefficient of variation (CV), reflecting boundary regularity independent of size; roughness (mean absolute value of the second derivative of the normalized radial profile), quantifying high-frequency boundary undulation; left–right asymmetry (mean absolute difference in normalized radial distances between corresponding points in the left and right halves); and anterior–posterior (AP) asymmetry (the analogous measure for the anterior and posterior quarters). Roughness, left–right asymmetry, and AP asymmetry were computed from size-normalized (unit-mean) radial profiles to ensure they reflect pure shape properties. Procrustes normalization and point density heatmaps To enable shape comparison independent of size, position, and orientation, all resampled FM contours were subjected to Procrustes superimposition [ 42 ]. Each contour was: (1) translated to place its centroid at the origin, (2) scaled to unit centroid size, and (3) rotated to align its principal axis (determined by eigendecomposition of the coordinate covariance matrix) with the horizontal axis. A reflection correction ensured consistent orientation across all specimens. Following Procrustes normalization, sex-specific point density heatmaps were generated by pooling all normalized contour points for each sex onto a 200 × 200 grid. Gaussian kernel smoothing (σ = 2 grid units) was applied to produce continuous density surfaces (Fig. 1). For each specimen, a male heatmap score and a female heatmap score were computed as the mean density value at the specimen's normalized contour point locations within the respective sex-specific heatmap. The heatmap difference score (male score minus female score) served as an additional descriptive feature, yielding 3 heatmap-derived features per specimen. These global heatmaps were used solely for descriptive visualization and feature extraction; heatmap-based classification employed a separate cross-validated procedure described below. Statistical analysis Sex comparison of individual features All 234 numeric variables were compared between males and females using the Mann–Whitney U test, chosen for its robustness to non-normality [ 43 ]. Effect sizes were quantified using Cohen's d, computed as the difference in group means divided by the pooled standard deviation [ 44 ]. The Bonferroni correction was applied to all p-values to control the familywise error rate at α = 0.05 across all 234 simultaneous comparisons. Features were considered statistically significant if the Bonferroni-corrected p-value was less than 0.05. Sex classification The discriminative capacity of the extracted features for sex estimation was evaluated using four classifiers: linear discriminant analysis (LDA) [ 45 ], support vector machine with linear kernel (SVM-linear), support vector machine with radial basis function kernel (SVM-RBF) [ 46 ], and random forest (RF, 100 trees) [ 47 ]. All classifiers were evaluated using 10-fold stratified cross-validation with a fixed random seed (42) to ensure reproducibility. Within each fold, features were standardized to zero mean and unit variance using the training set parameters only, and the same standardization was then applied to the test set to prevent data leakage. Classification performance was assessed using overall accuracy (proportion of correctly classified specimens), sensitivity (proportion of males correctly identified), and specificity (proportion of females correctly identified). Feature sets To systematically evaluate the contribution of size-based versus shape-based features, classification was performed on the following feature sets: S1 (Size): Six classical size measurements—AP diameter, transverse diameter, maximum and minimum diameters, perimeter, and area—that directly capture FM dimensions in millimeters. S2 (Shape): All non-size shape features (174 features), including FM index, circularity, eccentricity, area-to-perimeter ratio, compactness, all EFT coefficients and summary descriptors, radial CV, roughness, asymmetry indices, polygon distance metrics, polygon area ratios, ellipse and circle distance metrics (RMSE, mean/std/max distances), and ellipse area ratios. Excluded from this set were absolute area measurements in mm² (polygon areas, ellipse areas, FM areas), ellipse geometric parameters (center coordinates, semi-axes, rotation angles), circle radius, radial mean and standard deviation in mm, and polygon elbow-derived metrics, as these either directly encode size information or represent pipeline metadata rather than morphometric descriptors. S3 (Combined): The union of S1 and S2 (180 features). S4 (Forward-selected Shape): The minimal subset of S2 features identified by sequential forward selection (see below). S5 (Forward-selected Combined): The minimal subset of S3 features identified by forward selection. S6 (Forward-selected All): The minimal subset from all available numeric features (including absolute area measurements and polygon elbow metrics) identified by forward selection. In addition, two heatmap-based classification methods were evaluated as separate, feature-free approaches (see Heatmap-based classification below). Forward feature selection Sequential forward feature selection (SFS) was performed to identify minimal, non-redundant feature subsets that maximize classification accuracy [ 48 ]. Starting from an empty set, at each step the feature producing the greatest improvement in 10-fold stratified LDA cross-validation accuracy was added. The process terminated when no additional feature improved accuracy. Three separate forward selections were conducted corresponding to feature sets S4, S5, and S6, selecting from shape features exclusively (S2 pool), the combined size–shape pool (S3), and all available features, respectively. Heatmap-based sex classification A novel heatmap-based classification approach was developed as an alternative to conventional feature-based methods. Instead of extracting discrete features and applying a classifier, this method operates directly on the spatial distribution of Procrustes-normalized FM boundary points, comparing each test specimen's contour against sex-specific population density templates. Two scoring strategies were evaluated under proper 10-fold stratified cross-validation with strict train/test separation—heatmaps were constructed exclusively from training-fold specimens, and test specimens were never included in the heatmaps against which they were scored, thereby preventing data leakage. Method A — Point-sampling: For each fold, sex-specific density heatmaps were constructed from training specimens by accumulating their Procrustes-normalized contour points onto a shared 200 × 200 grid and applying Gaussian smoothing (σ = 2 grid units). Each test specimen's normalized contour was then overlaid on both the male and female training heatmaps, and the mean density value at the specimen's boundary point positions was computed. Classification was assigned to the sex whose heatmap yielded the higher score. Method B — Kernel density estimation (KDE) with overlap integral scoring: This method addresses a limitation of point-sampling, which evaluates only the exact grid cells occupied by the test contour and is therefore sensitive to minor positional shifts and pixel-level gaps. In the KDE approach, each contour point—in both the training heatmaps and the test specimen—is treated as the center of a two-dimensional Gaussian kernel, producing continuous density fields rather than discrete point samples. The kernel has full intensity (1.0) at its center, diminishing to approximately half intensity at one pixel distance, one-quarter at two pixels, and so on—following a Gaussian decay profile. This ensures that nearby but not exactly coincident boundary points still contribute to the similarity measure. The similarity between a test specimen's density field and each sex-specific reference field was quantified using the overlap integral—the element-wise product of the two fields summed over the entire grid. This metric is related to the Bhattacharyya coefficient [54] and provides a continuous, smooth similarity score that is robust to minor positional shifts. Classification was assigned to the sex whose reference field yielded the higher overlap score. The kernel bandwidth (controlling the spatial extent of each Gaussian kernel in grid-pixel units) was optimized by evaluating values of 3, 5, 7, 9, and 11 pixels and selecting the value that maximized cross-validated accuracy within the same fold structure. Within-sex morphological subtype discovery To investigate whether the FM exhibits distinct morphological subtypes within each sex, unsupervised clustering was performed using Gaussian mixture models (GMM) [49]. For each sex independently: (1) Procrustes-normalized contour coordinates were flattened into feature vectors (200 points × 2 coordinates = 400 dimensions), (2) principal component analysis (PCA) was applied to reduce dimensionality to the first 20 components (retaining > 95% of variance), and (3) GMM with full covariance matrices was fitted for k = 2, 3, and 4 clusters (10 random initializations per k). The optimal number of clusters was selected based on the silhouette score [50] and the Bayesian information criterion (BIC) [51]. Each cluster was characterized by computing its mean values for eight shape descriptors: circularity, radial CV, roughness, left–right asymmetry, AP asymmetry, FM index, Fitzgibbon ellipse fitting RMSE, and area-to-perimeter ratio. For visualization, each subtype was represented by individual contour overlays (with adaptive transparency based on cluster size), mean contour shapes with ±1 SD and ±2 SD radial variation bands, density heatmaps, and sex-matched subtype comparisons showing the mean contour of the corresponding male and female cluster side by side. Software and reproducibility All analyses were implemented in Python 3 using the following libraries: NumPy 1.21+ for numerical computation [33], SciPy 1.7+ for statistical tests and optimization [34], scikit-learn 1.0+ for classification, clustering, and cross-validation [35], Matplotlib 3.4+ for visualization [36], and pandas 1.3+ for data management [52]. The complete analysis code, all 473 anonymized FM boundary coordinate files (500 equally-spaced points per specimen, resampled via periodic cubic spline interpolation), and associated metadata are publicly available in a Zenodo repository (DOI: 10.5281/zenodo.18716445) to ensure full reproducibility. The repository is currently under embargo and will be made publicly accessible upon publication; reviewer access can be arranged upon request to the corresponding author. Per-specimen morphometric measurements are provided in Table S1. Results Study sample A total of 473 adult CT scans met the inclusion criteria. The sample comprised 234 males (49.5%) and 239 females (50.5%). Automated contour extraction yielded between 36 and 187 boundary points per foramen magnum (mean: 85.5), which were resampled to 200 equidistant points for all subsequent analyses. Six specimens were excluded from ellipse fitting due to convergence failure, leaving 467 specimens for the polygon elbow analysis and the full feature set. Descriptive morphometry Classical linear and area measurements revealed pronounced sexual dimorphism in all size-related parameters (Table 1). Male FM perimeters averaged 107.13 ± 11.16 mm compared to 95.02 ± 7.36 mm in females (Cohen's d = 1.284, p < 10⁻³⁶). FM area showed a comparable effect (males: 830.1 ± 159.9 mm², females: 665.3 ± 99.0 mm²; d = 1.242). Anteroposterior and transverse diameters were both significantly larger in males (AP: 36.53 ± 3.48 vs. 32.68 ± 2.77 mm, d = 1.225; transverse: 31.35 ± 3.16 vs. 28.23 ± 2.31 mm, d = 1.132). In contrast, shape descriptors showed minimal dimorphism. The FM index did not differ significantly between sexes (males: 1.17 ± 0.08, females: 1.16 ± 0.09; d = 0.088, p > 0.05 after Bonferroni correction). Circularity showed a small, non-significant trend toward rounder foramina in females (females: 0.92 ± 0.04, males: 0.91 ± 0.07; d = − 0.264, p > 0.05). Left–right and anteroposterior asymmetry indices likewise showed no significant sex differences (d = 0.034 and d = 0.088, respectively). Feature significance Of 234 morphometric features extracted per specimen, 92 (39.3%) showed statistically significant sex differences after Bonferroni correction (p < 0.05) (Table 1; Table S2 ). The top five features by effect size were all area-derived measures: polygon approximation areas (d = 1.285 for 10-sided polygon area), perimeter (d = 1.284), Fitzgibbon ellipse area (d = 1.279), and total FM area (d = 1.242). All six classical size measurements were significant. Among pure shape features (dimensionless ratios and normalized coefficients), only 17 of 146 (11.6%) reached significance. All 11 polygon approximation areas in mm² were significant (d = 1.237–1.285), but none of the 14 polygon area ratios achieved significance, confirming that the discriminative information resided in absolute size rather than shape proportions. Elliptic Fourier transform (EFT) coefficients showed limited dimorphism: only the total power spectrum (d = 0.855, p < 10⁻¹⁵) reached significance among the 94 EFT-derived features. The best-fitted ellipse semi-axes were significant (semi-major axis d = 0.818, semi-minor axis d = 0.828), again reflecting size. Polygon approximation analysis The Kneedle-based elbow detection identified the optimal polygon as 6-sided for both sexes (mode = 6, mean = 5.60 ± 0.94 sides) (Table S3 ). The distribution was: 6-sided (n = 241, 51.6%), 5-sided (n = 84, 18.0%), 4-sided (n = 83, 17.8%), 7-sided (n = 54, 11.6%), and 8-sided (n = 5, 1.1%) (Fig. 4). The transition from 3 to 4 sides provided the largest marginal RMSE improvement (50.1% for males, 50.7% for females), followed by 5→6 sides (37.6% and 35.5%, respectively) (Table 2; Fig. 3). Beyond 6 sides, each additional vertex contributed diminishing returns (< 25%). Mean optimal polygon RMSE was 1.80 ± 0.47 mm, representing 74.0% improvement over the initial 3-sided approximation. Males and females showed no significant difference in optimal polygon selection (male mean = 5.68, female mean = 5.53; p > 0.05), indicating that the geometric complexity of the FM boundary is sex-invariant. Sex classification Classification performance was evaluated using seven feature sets across four classifiers with stratified 10-fold cross-validation (Table 3; Table S4 ; Fig. 4). The highest accuracy was achieved by forward-selected shape features (S4) with LDA: 80.9% ± 4.9% (sensitivity = 76.4%, specificity = 85.3%). The forward selection procedure identified seven optimal shape features: area-to-perimeter ratio, circularity, Fitzgibbon ellipse RMSE, and four EFT coefficients (harmonics 48, 21, 13, and 51) (Table 4). Notably, the first feature alone (area-to-perimeter ratio) achieved 74.9%, and the addition of circularity raised accuracy to 77.7%. Forward-selected combined features (S5, LDA) and the full feature set (S6, LDA) both reached 80.5% ± 6.8%, with higher specificity (85.7%) but lower sensitivity (75.1%) than S4 (Table S5 ). The combined forward selection started with perimeter (78.1% alone) and added five shape refinements to reach the final accuracy. Size features alone (S1) reached 75.6% ± 5.7% with LDA, and 77.3% ± 7.0% with SVM-rbf. The full shape feature set without selection (S2) achieved 67.9% with LDA but 77.1% with Random Forest, suggesting non-linear shape relationships captured better by ensemble methods (Fig. 4). Among classifiers, LDA consistently outperformed others on forward-selected features, while Random Forest showed superior performance on unselected high-dimensional feature sets (S2: RF 77.1% vs. LDA 67.9%). Heatmap-based density classification A novel classification approach was implemented using Procrustes-normalized point-density heatmaps. Two scoring strategies were compared under proper 10-fold cross-validation with strict train/test separation. Point-sampling — where each test contour point was scored against the reference heatmap at its exact grid position — achieved only 50.3% ± 3.2%, effectively at chance level. The method exhibited extreme sensitivity bias (96.2% for males) with near-zero specificity (5.4% for females), indicating systematic classification toward one sex. KDE overlap scoring — where both the reference and test contours were rendered as Gaussian density fields (optimal bandwidth = 9 pixels) and their element-wise product integrated — improved accuracy to 54.1% ± 7.1% with more balanced sensitivity (70.5%) and specificity (38.1%). The near-chance performance of both heatmap methods is attributable to Procrustes normalization, which removes all size information. Since the dominant sexual dimorphism in FM morphology is size-based (top features: d = 1.28), the normalized contours retain only subtle shape differences insufficient for density-based discrimination. This negative finding is consistent with the observation that pure shape features without feature selection achieved only 67.9% accuracy even with LDA, a method with far greater discriminative capacity than density overlap (Fig. 1). Within-sex morphological subtypes GMM clustering on Procrustes-normalized contour coordinates, evaluated for k = 2, 3, and 4 clusters, revealed distinct patterns of within-sex variation (Table 5; Fig. 5; Table S6 ; Table S7 , Figure S1 ). For males, the optimal cluster number was k = 2 based on silhouette score (0.72) and BIC (Fig. 6). The dominant subtype (Subtype-1, n = 175, 74.8%) showed higher circularity (0.914) and a more elliptical FM index (1.177), representing the typical male FM morphology. The minor subtype (Subtype-0, n = 59, 25.2%) exhibited lower circularity (0.884) and higher radial variability (CV = 7.7%), indicating a distinct, more irregular morphological pattern (Fig. 5; Figure S2 ). For females, the silhouette score peaked at k = 3 (0.78), though k = 2 also performed well (0.74) (Fig. 6). At k = 2, the dominant subtype (Subtype-0, n = 171, 71.5%) showed high circularity (0.923) with an FM index of 1.164. The secondary subtype (Subtype-1, n = 68, 28.5%) had comparable circularity (0.920) but slightly higher asymmetry (0.058 vs. 0.050), suggesting size-related rather than shape-related differentiation (Fig. 5; Figure S3 ). The subtype analysis revealed that male FM morphology exhibits greater heterogeneity than female. At k = 4, male subtypes included a clearly atypical group (Subtype-1, n = 16, 6.8%) with the highest radial CV (10.5%) and lowest circularity, while female subtypes remained more morphologically coherent. These findings suggest that the male foramen magnum encompasses a wider range of morphological variants (Figure S4 - S17 ). Summary of key findings The foramen magnum shows clear sexual dimorphism, but this dimorphism is overwhelmingly driven by size rather than shape. Males have FM perimeters approximately 12.7% longer and areas roughly 24.8% larger than females, yielding large effect sizes (d ≈ 1.28). However, when size is removed through normalization, male and female FM boundaries are remarkably similar — shape-only features achieve only modest classification gains over chance. The most effective sex estimation (80.9% accuracy) required a combination of shape features selected through forward selection, including geometric ratios, ellipse fitting residuals, and specific Fourier harmonics. This suggests that while no single shape feature strongly discriminates sex, certain combinations of subtle boundary irregularities can collectively approach the discriminative power of size-based measurements. The finding that shape features outperform size features at optimal selection (80.9% vs. 75.6%) indicates that shape encodes complementary — though individually weak — dimorphic information. In practical forensic terms, these results mean that: (1) the foramen magnum can correctly predict biological sex in approximately 4 out of 5 individuals; (2) female FMs are better classified (specificity 85.3%) than male FMs (sensitivity 76.4%); and (3) when only fragmentary remains are available where size cannot be assessed, automated shape analysis can still provide meaningful, though less precise, sex estimation. Discussion Principal findings and novelty This study presents the first comprehensive computational geometric morphometric analysis of the foramen magnum that integrates multiple complementary outline-based approaches — elliptic Fourier transform, systematic polygon approximation, ellipse fitting, radial distance profiling, Procrustes-based density heatmaps, and Gaussian kernel density estimation — to jointly characterize FM shape and its relationship to biological sex. While conventional FM studies have relied on a small number of linear measurements and subjective shape classification [ 4 , 8 , 17 ], the present work extracted 231 quantitative features per specimen from automatically detected contour coordinates, enabling an objective, reproducible, and high-dimensional evaluation of FM morphology. The best classification accuracy of 80.9% was achieved using forward-selected shape features with LDA — a result that compares favorably with the reported range of 62.5%–86.7% for conventional FM morphometry across various populations [ 5 , 6 , 10 , 17 , 55 ]. Our most consequential finding, however, is not the classification accuracy itself but rather the quantitative demonstration that FM sexual dimorphism is overwhelmingly driven by size rather than shape. When size information was removed through Procrustes normalization, classification accuracy dropped sharply, and the novel density-heatmap approach — which operates exclusively on size-normalized contours — achieved only 54.1%. This provides the first direct, quantitative evidence that residual FM shape differences between sexes are minimal once allometric scaling is accounted for. Size dominates FM sexual dimorphism All six classical size measurements differed significantly between sexes with large effect sizes (Cohen's d = 1.13–1.28). Perimeter showed the strongest discriminative capacity (d = 1.284, p < 10⁻³⁶), followed by polygon area (d = 1.285) and total FM area (d = 1.242). Males had FM perimeters approximately 12.7% longer and areas roughly 24.8% larger than females. These findings are consistent with the broader FM literature. Gapert et al. [ 5 ] reported that FM width and area were among the best univariate discriminating parameters in a British sample, achieving a maximum multivariate accuracy of 70.3%. Madadin et al. [ 6 ] similarly found that all FM dimensions were significantly larger in males in a Saudi population, with width yielding the highest univariate accuracy of 65.0%. A recent systematic review and meta-analysis by Fernandes et al. [ 15 ] confirmed that FM dimensions are consistently larger in males across populations, though classification accuracy varies substantially by population. In contrast, shape indices showed negligible dimorphism in our sample. The FM index — the most widely used shape descriptor in the literature — did not differ significantly between sexes (d = 0.088, p > 0.05). Circularity showed only a weak, non-significant trend (d = − 0.264), and left–right and AP asymmetry were virtually identical between sexes. These results align with Toneva et al. [ 8 ], who found that FM shape did not provide substantial sex discrimination in a Bulgarian population, and with our previous study [ 17 ], which reported that the H/W index and FMI had no significant sex difference (p = 0.622 and p = 0.440, respectively) and yielded univariate accuracies of only 51.9% and 52.8%. From subjective categories to objective shape quantification A particularly notable contribution of this work concerns the long-standing problem of FM shape classification. Numerous studies have attempted to characterize FM morphology using categorical descriptors such as "oval," "round," "tetragonal," "pentagonal," "egg-shaped," "hexagonal," and "irregular" [ 4 , 18 , 56 ]. As Zdilla et al. [ 22 ] noted in their comprehensive review, these shape categories rely on ambiguous terminology that is inconsistent between studies, observer-dependent, and difficult to reproduce. Indeed, visual shape classification is inherently subjective: the same foramen may be classified as "oval" by one observer and "egg-shaped" by another, and studies vary widely in how many and which shape categories they employ [ 22 ]. Our polygon approximation analysis offers a direct, objective alternative. By fitting optimal polygons of 3 through 13 sides to each FM contour and identifying the elbow point where additional sides yield diminishing accuracy improvements, we determined that the typical FM is best approximated by a 6-sided polygon (mode = 6 in both sexes, median = 6, mean = 5.60 ± 0.94). This finding objectively confirms and quantifies what the traditional literature has described qualitatively: the FM boundary is broadly hexagonal with substantial individual variation spanning 4 to 8 sides (Fig. 2). Furthermore, the absence of a significant sex difference in optimal polygon order (male mean = 5.68, female mean = 5.53; p > 0.05) provides the first quantitative evidence that the geometric complexity of the FM boundary is sex-invariant — a finding that could not have been established through subjective visual categorization. Similarly, the elliptic Fourier transform decomposed each FM contour into 20 harmonics, enabling objective quantification of boundary features at multiple spatial scales. Of the 94 EFT-derived features, only the total spectral power (d = 0.855) reached significance, reflecting the global size component captured by the power spectrum. Individual harmonic power ratios and normalized coefficients showed minimal dimorphism, confirming that fine boundary details — the "roughness," undulations, and local irregularities that distinguish one FM shape from another — do not differ systematically between sexes. The value of shape features despite modest individual effects Although no single shape feature showed strong dimorphism, the forward selection procedure demonstrated that particular combinations of individually weak shape features can collectively approach and even exceed the discriminative capacity of size-based measurements. The optimal shape feature subset (area-to-perimeter ratio, circularity, Fitzgibbon ellipse RMSE, and four EFT coefficients) achieved 80.9% accuracy — surpassing size features alone (75.6%) (Table 3). This pattern suggests that shape encodes complementary information: each shape feature captures a subtle, partially independent aspect of FM morphology, and their combination reveals dimorphic patterns that individual features or simple size measurements cannot. This finding has practical implications for forensic anthropology. In cases involving fragmentary cranial remains — such as fire-damaged or blast-affected specimens — size measurements may be compromised by incomplete margins, while shape features derived from even a partially preserved FM contour could still contribute to sex estimation [ 5 , 30 ]. Furthermore, the forward selection identified specific features — area-to-perimeter ratio, circularity, and ellipse fitting residuals — that are computationally simple and could be readily integrated into automated forensic analysis tools. Heatmap-based classification: an informative negative result The density heatmap approach, particularly the novel KDE overlap scoring method, represents a conceptually distinct strategy that bypasses discrete feature extraction entirely, instead comparing continuous spatial distributions of boundary points. The near-chance performance of this method (54.1%) is itself an informative finding. Because heatmap classification operates on Procrustes-normalized contours — from which all size information has been removed — its failure provides direct confirmation that the spatial distribution of FM boundary points does not differ meaningfully between sexes after size normalization. This result is consistent with the statistical finding that pure shape features collectively explain only modest variance in sex (LDA on the full shape feature set yielded 67.9%), and offers a population-level visualization of this principle. We note that this approach may prove more effective for anatomical structures where shape differences between groups are more pronounced. For example, EFA applied to the greater sciatic notch — a structure with well-documented qualitative shape dimorphism — has achieved classification rates exceeding 80% [ 57 ]. The heatmap methodology developed here could be readily extended to such structures, where it may provide superior discrimination precisely because the shape differences are larger. Within-sex morphological heterogeneity The GMM-based subtype analysis revealed an asymmetry in within-sex morphological variation. Males showed greater morphological heterogeneity, with k = 2 as the optimal cluster number but a clearly identifiable atypical minority subtype (25.2%) characterized by lower circularity and higher radial variability (Figure S5 ). Females, by contrast, exhibited more morphological homogeneity, with the majority (71.5%) belonging to a single dominant subtype. This asymmetry in within-sex variability is also reflected in the descriptive statistics reported in our previous study [ 17 ], where males showed higher standard deviations than females in most FM measurements (e.g., FM area SD: 121.5 mm² vs 101.6 mm²; perimeter SD: 8.15 mm vs 6.8 mm). The differential classification performance observed in the present study — females more reliably classified (specificity 85.3%) than males (sensitivity 76.4%) — is likely a consequence of this narrower distribution of female FM morphology providing a more distinct separation from the male distribution. Methodological contributions and broader applicability Beyond the specific findings regarding the FM, this study demonstrates the feasibility and value of applying a comprehensive, automated geometric morphometric pipeline to skeletal structures that lack well-defined anatomical landmarks. The FM is particularly challenging for traditional landmark-based morphometrics because its margin is a smooth, continuous curve without discrete, reproducible points that would qualify as Type I or Type II landmarks [ 58 ]. Our approach — treating the entire contour as the unit of analysis through resampling, Fourier decomposition, polygon approximation, and density mapping — circumvents this limitation entirely. Caple et al. [ 59 ] highlighted the potential of EFA in forensic anthropology, noting that outline-based methods can capture a large amount of shape information in contrast to landmark approaches where information falling between landmarks fails to be acquired. Similarly, Kilmer and Garvin [ 57 ] demonstrated that EFA of skeletal outlines can support qualitative descriptions of sex differences with objective, quantitative data. Our work extends this paradigm by combining EFA with multiple additional geometric analysis methods, providing a more complete morphometric characterization than any single technique alone. The complete analysis pipeline — from automated contour extraction through feature computation, statistical testing, classification, and subtype discovery — is implemented as a single, self-contained, publicly available script. This design facilitates direct application to other skeletal structures where shape variation has traditionally been described only qualitatively, such as the obturator foramen [ 57 , 60 ] or piriform aperture. The objective quantification of shapes that are currently assessed visually (e.g., "oval vs. triangular obturator foramen") could enable more reproducible forensic assessments and contribute to a more precise anatomical vocabulary for morphological description. Comparison with the source study Our dataset derives from the same CT image collection used in our previous study [ 17 ], which reported an accuracy of 86.7% using linear discriminant function analysis with leave-one-out cross-validation on the full 720-case sample, and 88.2% using artificial neural networks. The apparent difference from our best accuracy of 80.9% in the present study likely reflects sampling effects rather than methodological inferiority. First, 247 cases were excluded in the present study due to insufficient contour quality for automated edge detection, reducing the sample to 473. This exclusion was not random with respect to morphology: cases with lower image quality, partial volume artifacts, or less distinct FM margins were preferentially removed, potentially yielding a subsample that — by chance — captured a less dimorphic subset of the original population. Second, the present study employed 10-fold rather than leave-one-out cross-validation, which can yield slightly different accuracy estimates depending on fold composition. Despite this reduction in sample size, the present study achieved a comparable cross-validated accuracy (80.9%) while simultaneously demonstrating that shape features, when optimally selected, can match and even marginally exceed the performance of classical size-based measurements — a finding that was not addressed in the original study. Limitations Several limitations should be acknowledged. First, the study sample was drawn from a single Eastern Turkish population, and the population-specificity of FM morphometry is well established [ 6 , 15 ]. As noted in our previous study [ 17 ], even within Turkey, discriminant formulas from western Turkish samples showed variable performance when applied to the eastern Turkish sample, reflecting potential ancestral differences between regions. The classification models and feature importance rankings derived here may therefore not generalize directly to other populations. Second, the contour extraction relied on 2D axial CT slices rather than 3D surface reconstructions, potentially missing three-dimensional shape information such as FM depth, curvature, and condylar morphology. Third, the automated edge detection required sufficient CT image quality, which led to the exclusion of 247 cases from the original dataset; this introduces a potential selection bias toward cases with clearer FM margins. Fourth, age-related variation was not explicitly modeled. Although our previous study found no significant correlation between FM measurements and age apart from a weak trend in perimeter [ 17 ], age effects on shape features remain unexplored. Finally, the heatmap and KDE methods employed fixed grid resolutions and kernel bandwidths that may not be optimal for all contour morphologies; adaptive or multi-scale approaches could potentially improve performance. Future directions The computational framework developed here opens several avenues for future research. Application to skeletal structures with more pronounced qualitative shape dimorphism — such as the greater sciatic notch, obturator foramen, or frontal sinus — could test whether the polygon approximation and heatmap methods achieve higher discriminative accuracy when shape differences are more substantial. Multi-population studies using the same pipeline would clarify which FM shape features, if any, transcend population boundaries. Integration of 3D surface data, enabled by modern CT reconstruction capabilities, could capture additional morphological information inaccessible from 2D contours. Finally, the subtype discovery approach could be extended to investigate whether FM morphological variants correlate with other craniometric traits, functional anatomy, or developmental influences. Conclusions This study demonstrates that the sexual dimorphism of the foramen magnum is predominantly mediated by overall size rather than by outline shape. Among 234 CT-derived morphometric features, all six conventional size measurements produced large, highly significant sex differences, whereas only 11.6% of pure shape features reached significance after correction for multiple testing. Perimeter emerged as the single strongest discriminator (Cohen’s d = 1.284), and size features alone achieved 75.6% classification accuracy. Importantly, however, a forward-selected combination of seven shape features—spanning ellipse fitting residuals, polygon approximation metrics, Fourier harmonic descriptors, and radial asymmetry indices—surpassed size-based classification, reaching 80.9% under rigorous 10-fold cross-validation. This finding establishes that individually weak shape signals, when optimally combined, carry discriminative information that is not redundant with overall size. From a methodological standpoint, this study replaces the subjective and non-reproducible categorical shape classification that has prevailed in the FM literature with a fully automated, deterministic pipeline that extracts objective and continuous shape descriptors from standard axial CT images. The seven complementary geometric analyses—classical morphometry, Fitzgibbon and genetic-algorithm-optimized ellipse fitting, systematic polygon approximation, normalized elliptic Fourier transform, radial distance profiling, and Procrustes-based density heatmap construction—collectively provide a multiscale characterization of the FM boundary that is reproducible across observers and institutions. The polygon approximation analysis, in particular, revealed that both sexes are best described by a hexagonal template, offering a data-driven, objective alternative to the inconsistent subjective shape taxonomies reported in the literature. The pipeline is distributed as a single open-source Python script with no proprietary software dependencies, facilitating independent replication and extension. The unsupervised clustering analysis further revealed a previously unreported asymmetry in within-sex morphological variation: male foramina magna display greater heterogeneity, with a distinct atypical subtype comprising approximately one quarter of male specimens, whereas female FM morphology is substantially more homogeneous. This observation suggests that the developmental and biomechanical constraints shaping the FM may act more uniformly in females, and it warrants further investigation across geographically diverse samples. In practical forensic and clinical terms, the foramen magnum can predict biological sex correctly in approximately four out of five individuals using the computational approach presented here. As 80.9% accuracy approaches the upper boundary reported in the conventional FM literature, these results suggest that further gains from two-dimensional FM morphometry alone may be limited. Future studies should validate these findings across multiple populations, extend the pipeline to three-dimensional surface data enabled by modern CT reconstruction capabilities, and apply the framework to anatomical structures where shape dimorphism is expected to be more pronounced, such as the greater sciatic notch and the pelvic inlet. Abbreviations FM Foramen magnum CT Computed tomography AP Anteroposterior EFT Elliptic Fourier transform GA Genetic algorithm LDA Linear discriminant analysis SVM Support vector machine RF Random forest KDE Kernel density estimation GMM Gaussian mixture model BIC Bayesian information criterion RMSE Root mean square error CV Coefficient of variation SD Standard deviation SFS Sequential forward selection Declarations Ethics approval and consent to participate The original study protocol received approval from the Non-Interventional Clinical Research Ethics Committee of Van Yuzuncu Yil University Faculty of Medicine (decision number: 02, dated: 19.02.2017) and was conducted in accordance with the Declaration of Helsinki and the principles of Good Clinical Practice. Since the present study represents a secondary analysis of the same anonymized imaging dataset using computational geometric methods, no additional ethical approval was required. Informed consent was waived due to the retrospective design and full anonymization of all data. Consent for publication Not applicable. Availability of data and materials The complete analysis pipeline is publicly available as a single, self-contained Python script (FM_Comprehensive_Analysis.py). Per-specimen morphometric measurements are provided in the supplementary materials (Table S1 in Additional file 1). The source CT images cannot be shared publicly due to institutional data protection regulations but are available from the corresponding author upon reasonable request. Competing interests The authors declare that they have no competing interests. Funding This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors. Authors’ contributions YE conceived the study, developed the computational pipeline and software, performed programming and data analyses, conducted the literature review, and wrote the original draft of the manuscript. EK contributed to image acquisition and selection, literature review, ethics committee procedures, and radiological image analysis. MA contributed to project organization, supervision, conceptualization, and critical revision of the final manuscript. YH contributed to supervision and conceptualization. SK contributed to data curation and statistical analysis. UD contributed to the literature review, preparation of radiological images for the study, and critical revision of the final manuscript. AY contributed to radiological image acquisition, development of analysis methodology, and preparation of images for final analysis. OC contributed to project organization, supervision, data analysis, conceptualization, and critical revision of the final manuscript. All authors read and approved the final manuscript. Acknowledgements Not applicable. References Standring S. Gray's Anatomy: The Anatomical Basis of Clinical Practice. 42nd ed. Amsterdam: Elsevier; 2020. Tubbs RS, Griessenauer CJ, Loukas M, Shoja MM, Cohen-Gadol AA. Morphometric analysis of the foramen magnum: an anatomic study. Neurosurgery. 2010;67(5):1479–84. Catalina-Herrera CJ. Study of the anatomic metric values of the foramen magnum and its relation to sex. Acta Anat (Basel). 1987;130(4):344–7. Murshed KA, Çiçekcibaşi AE, Tuncer I. Morphometric evaluation of the foramen magnum and variations in its shape: a study on computerized tomographic images of normal adults. Turk J Med Sci. 2003;33(5):301–6. Gapert R, Black S, Last J. Sex determination from the foramen magnum: discriminant function analysis in an eighteenth and nineteenth century British sample. Int J Legal Med. 2009;123(1):25–33. Madadin M, Menezes RG, Al Saif HS, Abu Alola H, Al Muhanna A, Gullenpet AH, et al. Morphometric evaluation of the foramen magnum for sex determination: a study from Saudi Arabia. J Forensic Leg Med. 2017;46:66–71. Aljarrah K, Packirisamy V, Al Anazi N, Nayak SB. Morphometric analysis of foramen magnum and occipital condyle using CT images for sex determination in a Saudi Arabian population. Morphologie. 2022;106(355):260–8. Toneva D, Nikolova S, Harizanov S, Georgiev I, Zlatareva D, Hadjidekov V, et al. Sex estimation by size and shape of foramen magnum based on CT imaging. Leg Med (Tokyo). 2018;35:50–60. Ajanovic Z, Dervisevic L, Dervisevic E, Lujinovic A, Ajanovic U, Biscevic-Tokic J, et al. Sex estimation based on foramen magnum: a three-dimensional geometric morphometrics approach. Int J Morphol. 2023;41(2):410–6. Uthman AT, Al-Rawi NH, Al-Timimi JF. Evaluation of foramen magnum in gender determination using helical CT scanning. Dentomaxillofac Radiol. 2012;41(3):197–202. Jaitley M, Phulambrikar T, Kode M, Gupta A, Singh SK. Foramen magnum as a tool for sexual dimorphism: a cone beam computed tomography study. Indian J Dent Res. 2016;27(5):458–62. Singh PK, Tamrakar D, Paudel S, Timsinha S, Pokhrel RP, Shrestha R, et al. Morphometric analysis of the foramen magnum in sex estimation: an additional 3DCT study from Nepal on a larger sample. Anat Cell Biol. 2022;55(4):401–11. Mehta M, Saini V, Patel MN, Menon SK. Applicability and reliability of foramen magnum for sex determination in contemporary Western Indian population: a computed tomographic study. J Forensic Radiol Imaging. 2019;17:13–9. Akay G, Güngör K, Peker İ. Morphometric analysis of the foramen magnum using cone beam computed tomography. Turk J Med Sci. 2017;47(6):1715–22. Fernandes ALVC, Shetti A, Lagali-Jirge V, Keluskar V. Accuracy of sex estimation by morphometric evaluation of foramen magnum using computed tomography – a systematic review and meta-analysis. Forensic Sci Med Pathol. 2024;20(1):268–79. Meral O, Toklu BB, Meydan R, Kaya A, Karadayı B, Acar T. Sex estimation from foramen magnum parameters in adult Turkish population: a computed tomography study. Leg Med (Tokyo). 2020;47:101775. Kartal E, Etli Y, Asirdizer M, Hekimoglu Y, Keskin S, Demir U, Yavuz A, Celbis O. Sex estimation using foramen magnum measurements, discriminant analyses and artificial neural networks on an Eastern Turkish population sample. Leg Med (Tokyo). 2022;59:102143. Chethan P, Prakash KG, Murlimanju BV, Prashanth KU, Prabhu LV, Saralaya VV, et al. Morphological analysis and morphometry of the foramen magnum: an anatomical investigation. Turk Neurosurg. 2012;22(4):416–9. Aragão JA, de Oliveira Pereira R, da, Cruz de Moraes RZ, Reis FP. Morphological types and the dimorphism of the foramen magnum of dry skulls from northeastern Brazil. Int J Morphol. 2014;32(4):1372–8. Gargi V, Yadav PK, Kumar Y. Foramen magnum as a tool for sexual dimorphism. Natl J Med Res. 2018;8:28–31. Singh G, Gargi V. Discriminant function analysis of foramen magnum variables in South Indian population: a study of computerised tomographic images. J Clin Diagn Res. 2018;12(5):HC01–5. Zdilla MJ, Russell ML, Bliss KN, Mangus KR, Koons AW. The size and shape of the foramen magnum in man. J Craniovertebr Junction Spine. 2017;8(3):205–21. Singh G, Talwar I. Morphometric analysis of foramen magnum for sex determination using CT scan. Indian J Forensic Med Toxicol. 2013;7(2):220–3. Kuhl FP, Giardina CR. Elliptic Fourier features of a closed contour. Comput Graph Image Process. 1982;18(3):236–58. Haines AJ, Crampton JS. Improvements to the method of Fourier shape analysis as applied in morphometric studies. Palaeontology. 2000;43(4):765–83. Samuel OM, Bhagwat AA. Elliptical Fourier descriptors of outline and morphological analysis in caudal view of foramen magnum of the tropical raccoon (Procyon cancrivorus) (Linnaeus, 1758). J Morphol Sci. 2018;35(1):24–32. Hayashizaki Y, Usui A, Hosokai Y, Sakai J, Funayama M. Sex determination of the pelvis using Fourier analysis of postmortem CT images. Forensic Sci Int. 2015;246:e1221–9. Gonzalez PN, Bernal V, Perez SI. Geometric morphometric approach to sex estimation of human pelvis. Forensic Sci Int. 2009;189(1–3):68–74. Etli Y, Kartal E, Demir U, Etli E, Hekimoğlu Y, Aşirdizer M. Sex determination using image processing reveals elliptical features of the female pelvic inlet. Sci Rep. 2025;15:33800. Krishan K, Chatterjee PM, Kanchan T, Kaur S, Baryah N, Singh RK. A review of sex estimation techniques during examination of skeletal remains in forensic anthropology casework. Forensic Sci Int. 2016;261:e1651–8. Tsiouris C, Triantafyllou G, Karangeli N, Botis GG, Papadopoulos-Manolarakis P, Kalamatianos T, et al. Morphometric assessment of occipital condyles and foramen magnum reveals enhanced sexual dimorphism detection via 3D imaging: a systematic review and meta-analysis utilizing classification and regression trees. Diagnostics (Basel). 2025;15(11):1359. De Boor C. A Practical Guide to Splines. Rev. ed. New York: Springer; 2001. Harris CR, Millman KJ, van der Walt SJ, Gommers R, Virtanen P, Cournapeau D, et al. Array programming with NumPy. Nature. 2020;585(7825):357–62. Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods. 2020;17(3):261–72. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011;12:2825–30. Hunter JD. Matplotlib: a 2D graphics environment. Comput Sci Eng. 2007;9(3):90–5. Braden B. The surveyor's area formula. Coll Math J. 1986;17(4):326–37. Fitzgibbon A, Pilu M, Fisher RB. Direct least square fitting of ellipses. IEEE Trans Pattern Anal Mach Intell. 1999;21(5):476–80. Nelder JA, Mead R. A simplex method for function minimization. Comput J. 1965;7(4):308–13. Kasa I. A circle fitting procedure and its error analysis. IEEE Trans Instrum Meas. 1976;25(1):8–14. Lestrel PE. Fourier Descriptors and Their Applications in Biology. Cambridge: Cambridge University Press; 1997. Rohlf FJ, Slice D. Extensions of the Procrustes method for the optimal superimposition of landmarks. Syst Biol. 1990;39(1):40–59. Mann HB, Whitney DR. On a test of whether one of two random variables is stochastically larger than the other. Ann Math Stat. 1947;18(1):50–60. Cohen J. Statistical Power Analysis for the Behavioral Sciences. 2nd ed. Hillsdale: Lawrence Erlbaum Associates; 1988. Fisher RA. The use of multiple measurements in taxonomic problems. Ann Eugen. 1936;7(2):179–88. Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20(3):273–97. Breiman L. Random forests. Mach Learn. 2001;45(1):5–32. Whitney AW. A direct method of nonparametric measurement selection. IEEE Trans Comput. 1971;C–20(9):1100–3. McLachlan GJ, Peel D. Finite Mixture Models. New York: Wiley; 2000. Rousseeuw PJ. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math. 1987;20:53–65. Schwarz G. Estimating the dimension of a model. Ann Stat. 1978;6(2):461–4. McKinney W. Data structures for statistical computing in Python. Proc 9th Python Sci Conf. 2010;445:51–6. Satopaa V, Albrecht J, Irwin D, Raghavan B. Finding a kneedle in a haystack: detecting knee points in system behavior. In: 2011 31st International Conference on Distributed Computing Systems Workshops. IEEE; 2011. pp. 166–71. Bhattacharyya A. On a measure of divergence between two statistical populations defined by their probability distributions. Bull Calcutta Math Soc. 1943;35:99–109. Atreya A, Shrestha R, Bhandari K, Malla SK, Acharya S, Menezes RG. Morphometric analysis of the foramen magnum in sex estimation: an additional 3DCT study from Nepal on a larger sample. Health Sci Rep. 2023;6(1):e999. Vinutha SP, Suresh V, Shubha R. Discriminant Function Analysis of Foramen Magnum Variables in South Indian Population: A Study of Computerised Tomographic Images. Anat Res Int. 2018;2018:2056291. Kilmer K, Garvin HM. Outline analysis of sex and population variation in greater sciatic notch and obturator foramen morphology with implications for sex estimation. Forensic Sci Int. 2020;314:110346. Bookstein FL. Landmark methods for forms without landmarks: morphometrics of group differences in outline shape. Med Image Anal. 1997;1(3):225–43. Caple J, Byrd J, Stephan CN. Elliptical Fourier analysis: fundamentals, applications, and value for forensic anthropology. Int J Legal Med. 2017;131(6):1675–90. Bierry G, Le Minor JM, Schmittbuhl M. Oval in males and triangular in females? A quantitative evaluation of sexual dimorphism in the human obturator foramen. Am J Phys Anthropol. 2010;141(4):626–31. Tables Tables are available in the Supplementary Files section. Additional Declarations No competing interests reported. Supplementary Files SupplementaryFigureCaptions.docx FIGURES1.png FIGURES4.png FIGURES5.png FIGURES6.png FIGURES7.png FIGURES8.png FIGURES16.png FIGURES17.png TABLES3.xlsx TABLES4.xlsx TABLES2.xlsx TABLES6.xlsx TABLES7.xlsx TABLES5.xlsx TABLES1.xlsx FIGURES2.png FIGURES3.png FIGURES9.png FIGURES10.png FIGURES11.png FIGURES12.png FIGURES13.png FIGURES14.png FIGURES15.png Tables.docx Cite Share Download PDF Status: Under Review Version 1 posted Reviews received at journal 21 Apr, 2026 Reviews received at journal 07 Apr, 2026 Reviews received at journal 01 Apr, 2026 Reviews received at journal 01 Apr, 2026 Reviewers agreed at journal 29 Mar, 2026 Reviewers agreed at journal 27 Mar, 2026 Reviewers agreed at journal 24 Mar, 2026 Reviewers agreed at journal 23 Mar, 2026 Reviewers invited by journal 17 Mar, 2026 Editor invited by journal 23 Feb, 2026 Editor assigned by journal 23 Feb, 2026 Submission checks completed at journal 23 Feb, 2026 First submitted to journal 20 Feb, 2026 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-8928788","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":608095954,"identity":"00d80325-23e0-4e9f-80a0-5a8f6f7695e8","order_by":0,"name":"Yasin Etli","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA3klEQVRIiWNgGAWjYJCCA0Asw8beAOYwNhCpxYCHjecATAszURYZ8DBIJBCpxeD48YeHCyr+8PBJvjGT5mGwkd1wgP/YB7xazuQYHJ5xBugw6RyQljTjDQeYmWfg1XIgh+Ewbxtcy+FEkBb8Djv//MFh3n9ALZJnQFr+E6HlRoLBYd4GoBYJHpCWA4S1SN54Y3CY55gxMJDTii3nGCQbzzzMbIxXC9/59MefeWrk5OTbD2+88abCTrbveONjvFoUDsCZHCYSDAZAmlBMyjfAmeyP8UbHKBgFo2AUjFwAAKC/RBQabv09AAAAAElFTkSuQmCC","orcid":"","institution":"Van Yuzuncu Yil University","correspondingAuthor":true,"prefix":"","firstName":"Yasin","middleName":"","lastName":"Etli","suffix":""},{"id":608095956,"identity":"c1b3761c-2bb2-4953-9016-91675eb8fe4c","order_by":1,"name":"Erhan Kartal","email":"","orcid":"","institution":"Van Yuzuncu Yil University","correspondingAuthor":false,"prefix":"","firstName":"Erhan","middleName":"","lastName":"Kartal","suffix":""},{"id":608095957,"identity":"df33143b-ebf4-4f1a-a1bc-0e3f3f7cf515","order_by":2,"name":"Mahmut Asirdizer","email":"","orcid":"","institution":"Bahcesehir University","correspondingAuthor":false,"prefix":"","firstName":"Mahmut","middleName":"","lastName":"Asirdizer","suffix":""},{"id":608095958,"identity":"053585b4-cac5-4018-8a6f-f7114812f391","order_by":3,"name":"Yavuz Hekimoglu","email":"","orcid":"","institution":"Ankara City Hospital, Health Sciences University","correspondingAuthor":false,"prefix":"","firstName":"Yavuz","middleName":"","lastName":"Hekimoglu","suffix":""},{"id":608095959,"identity":"4074cc18-c6b8-4a07-9b9a-a0c02d071745","order_by":4,"name":"Sıddık Keskin","email":"","orcid":"","institution":"Van Yuzuncu Yil University","correspondingAuthor":false,"prefix":"","firstName":"Sıddık","middleName":"","lastName":"Keskin","suffix":""},{"id":608095960,"identity":"cac5fd14-a687-4f4b-9262-614931369056","order_by":5,"name":"Uğur Demir","email":"","orcid":"","institution":"Harran University","correspondingAuthor":false,"prefix":"","firstName":"Uğur","middleName":"","lastName":"Demir","suffix":""},{"id":608095961,"identity":"fd85dd2b-d16c-4b79-82ca-a751bdea608a","order_by":6,"name":"Alparslan Yavuz","email":"","orcid":"","institution":"Antalya Training and Research Hospital, Health Sciences University","correspondingAuthor":false,"prefix":"","firstName":"Alparslan","middleName":"","lastName":"Yavuz","suffix":""},{"id":608095962,"identity":"1c776e06-4ffc-4ed3-9ad2-4f79a7956d41","order_by":7,"name":"Osman Celbiş","email":"","orcid":"","institution":"Alanya Alaaddin Keykubat University","correspondingAuthor":false,"prefix":"","firstName":"Osman","middleName":"","lastName":"Celbiş","suffix":""}],"badges":[],"createdAt":"2026-02-20 19:54:17","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-8928788/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-8928788/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":105039670,"identity":"05daf51b-2245-4bf4-a44f-5ca5c6b99242","added_by":"auto","created_at":"2026-03-20 07:46:50","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":204199,"visible":true,"origin":"","legend":"\u003cp\u003eProcrustes-normalized foramen magnum point density heatmaps for males (left) and females (right). Gaussian kernel smoothing (σ = 2 grid units) was applied to pooled contour points on a 200 × 200 grid. Color intensity represents point density, with warmer colors indicating higher concentrations.\u003c/p\u003e","description":"","filename":"FIGURE1.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/1d11d02d461355e54d977f13.png"},{"id":105562804,"identity":"88752f67-6b63-4d34-9a39-d17e5d4aa229","added_by":"auto","created_at":"2026-03-27 12:44:47","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":473708,"visible":true,"origin":"","legend":"\u003cp\u003eDistribution of individually determined optimal polygon sides using the Kneedle algorithm. (A) Histogram of optimal polygon order by sex (mode = 6 for both). (B) Box plot comparison between males and females. (C) Cumulative RMSE improvement as a function of polygon sides, with star indicating the elbow point.\u003c/p\u003e","description":"","filename":"FIGURE2.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/e7f3ed82bcac203de34b3b7c.png"},{"id":105040033,"identity":"8509ce8c-6bcc-44c9-b00f-bd4d2e1cad4c","added_by":"auto","created_at":"2026-03-20 07:47:54","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":605961,"visible":true,"origin":"","legend":"\u003cp\u003ePolygon approximation elbow analysis for optimal side count. (A) Mean RMSE versus number of polygon sides by sex, with stars indicating optimal elbow points. (B) Step-wise marginal RMSE improvement showing diminishing returns beyond 6 sides. (C) Kneedle algorithm difference curve with peak at 6 sides.\u003c/p\u003e","description":"","filename":"FIGURE3.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/cc5e3356518f69d54e027244.png"},{"id":105039600,"identity":"0048ea51-36b2-4cae-822b-1ed846d83674","added_by":"auto","created_at":"2026-03-20 07:46:46","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":167260,"visible":true,"origin":"","legend":"\u003cp\u003eSex classification performance comparison across feature sets using LDA. Horizontal bars show 10-fold cross-validation accuracy with error bars indicating standard deviation. The dashed red line indicates chance level (50%). Feature sets are ordered by accuracy.\u003c/p\u003e","description":"","filename":"FIGURE4.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/0f30274714d4ec44f18dcb26.png"},{"id":105039133,"identity":"38797927-9679-4d1a-b54e-c282b6325382","added_by":"auto","created_at":"2026-03-20 07:45:07","extension":"png","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":397132,"visible":true,"origin":"","legend":"\u003cp\u003eWithin-sex subtype mean contours at k = 2. Male subtypes (left, blue) and female subtypes (right, pink) with error bars showing ±1 SD at sampled contour points. Male Subtype-0 (n = 59) shows a distinctly irregular, less circular shape compared to Subtype-1 (n = 175). Female subtypes show minimal shape differentiation.\u003c/p\u003e","description":"","filename":"FIGURE5.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/435067e7201eeb261afe1342.png"},{"id":105039994,"identity":"39a9c682-f9e4-4531-b17b-289cc0be5bdb","added_by":"auto","created_at":"2026-03-20 07:47:32","extension":"png","order_by":6,"title":"Figure 6","display":"","copyAsset":false,"role":"figure","size":263318,"visible":true,"origin":"","legend":"\u003cp\u003eOptimal cluster number selection for within-sex morphological subtypes. Left: Silhouette score versus number of clusters (k). Right: Bayesian information criterion (BIC) versus k. For males, k = 2 is optimal based on silhouette score. For females, silhouette score peaks at k = 3 though k = 2 also performs well.\u003c/p\u003e","description":"","filename":"FIGURE6.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/8ebb2c48fab14a067798c202.png"},{"id":105568853,"identity":"35495fbf-7330-4ac8-bf6a-b438fd967203","added_by":"auto","created_at":"2026-03-27 13:10:34","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":3339685,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/4737bd02-6918-4137-915e-6e370797ae61.pdf"},{"id":105038752,"identity":"81be6662-bcb7-478d-8ac4-9443360703d2","added_by":"auto","created_at":"2026-03-20 07:44:22","extension":"docx","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":17489,"visible":true,"origin":"","legend":"","description":"","filename":"SupplementaryFigureCaptions.docx","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/e5f80e1840606d163b19f2f3.docx"},{"id":105039478,"identity":"a5bab5ce-a9bf-463f-b5fe-e0aa1742bcc0","added_by":"auto","created_at":"2026-03-20 07:46:22","extension":"png","order_by":2,"title":"","display":"","copyAsset":false,"role":"supplement","size":355795,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES1.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/76ad22b14dd461ae11ec68e2.png"},{"id":105039179,"identity":"31624b87-96bb-402f-ba51-7668e92228c1","added_by":"auto","created_at":"2026-03-20 07:45:18","extension":"png","order_by":3,"title":"","display":"","copyAsset":false,"role":"supplement","size":542242,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES4.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/28e8c75d6e4fcd9bc9f7c4f5.png"},{"id":105038687,"identity":"25d2a57d-9ce2-48ad-99fa-bda290c2e2cf","added_by":"auto","created_at":"2026-03-20 07:44:17","extension":"png","order_by":4,"title":"","display":"","copyAsset":false,"role":"supplement","size":630778,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES5.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/5cc8443f0cb68c3cc170e3d2.png"},{"id":105039484,"identity":"65889f3b-b46b-4796-87c4-f51dc87ab06d","added_by":"auto","created_at":"2026-03-20 07:46:24","extension":"png","order_by":5,"title":"","display":"","copyAsset":false,"role":"supplement","size":727781,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES6.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/2ec108b4ed06a9f8571cf990.png"},{"id":105039697,"identity":"1d83a33b-e392-488e-96b3-3f58c5b0680e","added_by":"auto","created_at":"2026-03-20 07:46:54","extension":"png","order_by":6,"title":"","display":"","copyAsset":false,"role":"supplement","size":474953,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES7.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/30374f920f155ed4db9e3e95.png"},{"id":105038457,"identity":"cd3729c4-d733-4f8f-a7c0-09d2b786f22f","added_by":"auto","created_at":"2026-03-20 07:43:43","extension":"png","order_by":7,"title":"","display":"","copyAsset":false,"role":"supplement","size":537586,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES8.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/dee9cac9445afdadf6a032a3.png"},{"id":105039532,"identity":"b4c6c5de-41de-4d79-b6da-68689ba6331c","added_by":"auto","created_at":"2026-03-20 07:46:28","extension":"png","order_by":8,"title":"","display":"","copyAsset":false,"role":"supplement","size":349029,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES16.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/ea10c6b4efcbb632a683f246.png"},{"id":105038660,"identity":"7e95dedf-3939-4510-8147-9859d87f339f","added_by":"auto","created_at":"2026-03-20 07:44:16","extension":"png","order_by":9,"title":"","display":"","copyAsset":false,"role":"supplement","size":481183,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES17.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/fb7c148191b28672596ccfdc.png"},{"id":105039472,"identity":"ad96ccd7-7643-4b9e-9d09-5d2ed782d100","added_by":"auto","created_at":"2026-03-20 07:46:16","extension":"xlsx","order_by":10,"title":"","display":"","copyAsset":false,"role":"supplement","size":42063,"visible":true,"origin":"","legend":"","description":"","filename":"TABLES3.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/44b56fdaad68565471d36433.xlsx"},{"id":105039333,"identity":"4ead666c-46dc-4744-8756-e39e00d474ca","added_by":"auto","created_at":"2026-03-20 07:45:52","extension":"xlsx","order_by":11,"title":"","display":"","copyAsset":false,"role":"supplement","size":9655,"visible":true,"origin":"","legend":"","description":"","filename":"TABLES4.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/852bcbbaf1e1e51dd7ea30c2.xlsx"},{"id":105038802,"identity":"16529542-e1cf-4af4-a8b8-68125314e3e0","added_by":"auto","created_at":"2026-03-20 07:44:35","extension":"xlsx","order_by":12,"title":"","display":"","copyAsset":false,"role":"supplement","size":34048,"visible":true,"origin":"","legend":"","description":"","filename":"TABLES2.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/ab6aafc602de6af68869e989.xlsx"},{"id":105039989,"identity":"0a99fc5e-4317-4e7c-8d13-ca84934b0f9f","added_by":"auto","created_at":"2026-03-20 07:47:31","extension":"xlsx","order_by":13,"title":"","display":"","copyAsset":false,"role":"supplement","size":16837,"visible":true,"origin":"","legend":"","description":"","filename":"TABLES6.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/27eb33eb9fbd57f57616c550.xlsx"},{"id":105038789,"identity":"40816b2c-0c67-48f4-b3a3-70ba1c4adcc3","added_by":"auto","created_at":"2026-03-20 07:44:33","extension":"xlsx","order_by":14,"title":"","display":"","copyAsset":false,"role":"supplement","size":30111,"visible":true,"origin":"","legend":"","description":"","filename":"TABLES7.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/d4276a692e05f9d9ea17d725.xlsx"},{"id":105039209,"identity":"e2887f28-8304-4e76-847a-67d107b3a609","added_by":"auto","created_at":"2026-03-20 07:45:25","extension":"xlsx","order_by":15,"title":"","display":"","copyAsset":false,"role":"supplement","size":9245,"visible":true,"origin":"","legend":"","description":"","filename":"TABLES5.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/b7684adcd126f512f421aa96.xlsx"},{"id":105040173,"identity":"1923240f-fb8a-40e9-badf-cc55e55198ca","added_by":"auto","created_at":"2026-03-20 07:48:50","extension":"xlsx","order_by":16,"title":"","display":"","copyAsset":false,"role":"supplement","size":104382,"visible":true,"origin":"","legend":"","description":"","filename":"TABLES1.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/28b1cbdffd637a32d6e886c4.xlsx"},{"id":105038657,"identity":"f7055264-b2df-401c-a369-0841a105b3e0","added_by":"auto","created_at":"2026-03-20 07:44:15","extension":"png","order_by":17,"title":"","display":"","copyAsset":false,"role":"supplement","size":1270570,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES2.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/e979c8889f958e2a136827b1.png"},{"id":105039022,"identity":"36aa4b6c-d24e-46e1-b465-40abc2680d1a","added_by":"auto","created_at":"2026-03-20 07:44:54","extension":"png","order_by":18,"title":"","display":"","copyAsset":false,"role":"supplement","size":1348137,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES3.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/5cca3a1df2f576a1fa639040.png"},{"id":105039948,"identity":"e0778975-2415-4367-8969-3d970ae26f97","added_by":"auto","created_at":"2026-03-20 07:47:20","extension":"png","order_by":19,"title":"","display":"","copyAsset":false,"role":"supplement","size":1884837,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES9.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/5e21a58e0db959c85716955e.png"},{"id":105040000,"identity":"178b2225-21c7-479e-8d3d-ca7e7ec7fd08","added_by":"auto","created_at":"2026-03-20 07:47:34","extension":"png","order_by":20,"title":"","display":"","copyAsset":false,"role":"supplement","size":2278006,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES10.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/bd0bfe7aabb91d958849386f.png"},{"id":105038452,"identity":"e5e6cc8e-8784-4c32-9b1d-b32076420903","added_by":"auto","created_at":"2026-03-20 07:43:37","extension":"png","order_by":21,"title":"","display":"","copyAsset":false,"role":"supplement","size":2016016,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES11.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/7205da32bb8950a8a7696eb0.png"},{"id":105038791,"identity":"9ba19ff1-d5d7-4169-8dab-24e6c6db4f79","added_by":"auto","created_at":"2026-03-20 07:44:33","extension":"png","order_by":22,"title":"","display":"","copyAsset":false,"role":"supplement","size":2490044,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES12.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/67547c0bf8200717b7864a0b.png"},{"id":105038690,"identity":"8cade9d4-abc9-4f7b-850a-e3fb3d31a341","added_by":"auto","created_at":"2026-03-20 07:44:18","extension":"png","order_by":23,"title":"","display":"","copyAsset":false,"role":"supplement","size":3102446,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES13.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/27840d19aa1c6cae5e8556ff.png"},{"id":105038785,"identity":"1a5157ab-2b08-4101-9f4c-d813ecb272d8","added_by":"auto","created_at":"2026-03-20 07:44:32","extension":"png","order_by":24,"title":"","display":"","copyAsset":false,"role":"supplement","size":1756102,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES14.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/e93e707147e050251b7462d4.png"},{"id":105039744,"identity":"311c8e05-6dc9-4f98-a7f2-84ea3e80000a","added_by":"auto","created_at":"2026-03-20 07:47:01","extension":"png","order_by":25,"title":"","display":"","copyAsset":false,"role":"supplement","size":2254288,"visible":true,"origin":"","legend":"","description":"","filename":"FIGURES15.png","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/2c2cef01a4bea0e97380e336.png"},{"id":105039657,"identity":"606847f8-9de3-4a03-8e96-55ba79344a85","added_by":"auto","created_at":"2026-03-20 07:46:46","extension":"docx","order_by":26,"title":"","display":"","copyAsset":false,"role":"supplement","size":28023,"visible":true,"origin":"","legend":"","description":"","filename":"Tables.docx","url":"https://assets-eu.researchsquare.com/files/rs-8928788/v1/76c73d90a435616a565591e3.docx"}],"financialInterests":"No competing interests reported.","formattedTitle":"A comprehensive computational pipeline for foramen magnum shape analysis integrating elliptic Fourier transform, polygon approximation, and Procrustes heatmaps from CT images: application to forensic sex estimation","fulltext":[{"header":"Background","content":"\u003cp\u003eThe foramen magnum (FM), the largest opening in the skull base formed by the occipital bone, serves as the primary conduit between the cranial cavity and the vertebral canal, transmitting the medulla oblongata, vertebral arteries, spinal accessory nerves, and associated meningeal membranes [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e, \u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e]. Beyond its well-established clinical significance in cranio-cervical surgery and neuropathology, the FM has attracted considerable research interest due to its reported morphological variation across individuals, populations, and between sexes [\u003cspan additionalcitationids=\"CR4 CR5\" citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e]. A central question that has persisted in the anatomical and forensic literature is whether sex-related differences in the FM are limited to overall size\u0026mdash;with males exhibiting larger dimensions\u0026mdash;or whether genuine shape differences also exist between the sexes [\u003cspan additionalcitationids=\"CR8\" citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eThe dimensional sexual dimorphism of the FM has been well documented across diverse populations. Numerous studies using dry skulls, radiographs, and computed tomography (CT) have consistently demonstrated that the anteroposterior diameter, transverse diameter, circumference, and area of the FM are significantly greater in males than in females [\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e, \u003cspan additionalcitationids=\"CR11 CR12 CR13\" citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e]. A recent systematic review and meta-analysis confirmed these dimensional differences across multiple populations with sex estimation accuracies ranging from approximately 62% to 90% depending on the population and analytical method employed [\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e]. In the Turkish population specifically, Meral et al. reported a maximum multivariate accuracy of 75% [\u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e], while Kartal et al. achieved up to 86.7% with discriminant analysis and 88.2% with artificial neural networks using dimensional measurements from 720 individuals [\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e]. These findings collectively establish that the FM exhibits moderate but consistent size-based sexual dimorphism that is primarily driven by the overall larger cranial dimensions in males.\u003c/p\u003e \u003cp\u003eIn contrast, the question of whether the FM also exhibits sex-related differences in shape\u0026mdash;independent of size\u0026mdash;remains substantially less resolved. Morphological descriptions in the forensic and anatomical literature have long suggested that the male FM tends to display a \"rougher\" or more irregular contour, while the female FM is described as \"more oval\" and \"more regular\" [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e]. However, these descriptions are based on qualitative, observer-dependent assessments and have not been quantitatively validated. The most common approach to characterizing FM shape has been subjective visual classification into categorical types such as oval, round, egg-shaped, tetragonal, pentagonal, hexagonal, and irregular forms [\u003cspan additionalcitationids=\"CR19 CR20\" citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e]. This approach suffers from several well-documented limitations. First, the shape categories themselves are not standardized across studies, leading to marked inconsistencies in the reported prevalence of each type; for instance, the FM has been reported as oval in 87.3% of crania in one study but in only 7.39% in another that utilized the same visual categorization approach [\u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e22\u003c/span\u003e]. Second, studies that have investigated whether these categorical shape types differ between sexes have generally concluded that FM shape is not a reliable sex discriminator [\u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e, \u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e], with the FM index (anteroposterior/transverse diameter ratio) showing no significant sex difference in several populations [\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e, \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e, \u003cspan citationid=\"CR23\" class=\"CitationRef\"\u003e23\u003c/span\u003e]. Zdilla et al. critically observed that previous shape studies utilized \"ambiguous and subjective descriptors\" that are \"difficult to reproduce\" and called for more reproducible methods of characterizing FM morphology [\u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e22\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eThe reliance on subjective categorical classification or simple ratios to characterize a complex, irregularly bounded anatomical contour fundamentally limits the ability to detect subtle but potentially meaningful shape differences. A two-dimensional closed contour such as the FM boundary contains considerably more morphological information than can be captured by two perpendicular diameters or a single aspect ratio [\u003cspan citationid=\"CR24\" class=\"CitationRef\"\u003e24\u003c/span\u003e, \u003cspan citationid=\"CR25\" class=\"CitationRef\"\u003e25\u003c/span\u003e]. Modern computational imaging and geometric analysis methods offer the potential to extract this richer shape information objectively. Elliptic Fourier analysis (EFA), introduced by Kuhl and Giardina [\u003cspan citationid=\"CR24\" class=\"CitationRef\"\u003e24\u003c/span\u003e], enables the mathematical decomposition of any closed contour into a series of harmonic ellipses, capturing both global shape properties and fine local boundary details in a compact numerical representation. This approach has been successfully applied to various anatomical contours, including the FM of the tropical raccoon, where Samuel and Bhagwat demonstrated that higher harmonics revealed fine morphological details relevant to sexual dimorphism that were invisible to conventional measurements [\u003cspan citationid=\"CR26\" class=\"CitationRef\"\u003e26\u003c/span\u003e]. Hayashizaki et al. applied Fourier analysis to pelvic contours from postmortem CT images and showed that frequency-domain shape descriptors could discriminate sex with greater objectivity than traditional morphometric methods [\u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e27\u003c/span\u003e]. Similarly, Gonzalez et al. utilized semi-landmark-based geometric morphometrics on the greater sciatic notch and ischiopubic complex to achieve over 90% sex estimation accuracy from shape variables alone [\u003cspan citationid=\"CR28\" class=\"CitationRef\"\u003e28\u003c/span\u003e]. In our own previous work, we applied advanced contour-based geometric analysis methods\u0026mdash;including ellipse fitting\u0026mdash;to pelvic CT images and demonstrated that computational shape features could objectively capture morphological variations that had previously been described only qualitatively [\u003cspan citationid=\"CR29\" class=\"CitationRef\"\u003e29\u003c/span\u003e]. These studies collectively suggest that computational geometric approaches, when applied to anatomical contour data extracted from CT images, can reveal shape information that is inaccessible to conventional linear measurements and subjective visual classification.\u003c/p\u003e \u003cp\u003eDespite the clear potential of these computational shape analysis methods, no study to date has applied a comprehensive suite of such techniques to the FM contour for the purpose of objectively quantifying and validating the morphological descriptions that have persisted in the anatomical literature. Specifically, it remains unknown whether computationally derived metrics\u0026mdash;such as circularity, boundary roughness, contour regularity, left-right and antero-posterior asymmetry, and harmonic shape descriptors\u0026mdash;differ significantly between male and female FM contours when size effects are controlled for. Furthermore, whether any such pure shape features, once objectively defined, can contribute to or improve upon size-based sex estimation has not been systematically evaluated. Resolving these questions is important not only for advancing evidence-based anatomical knowledge, but also for practical applications in forensic medicine and anthropology, where the ability to estimate sex from fragmentary cranial remains\u0026mdash;particularly the skull base, which is resistant to taphonomic destruction\u0026mdash;can be critical in identification cases [\u003cspan citationid=\"CR30\" class=\"CitationRef\"\u003e30\u003c/span\u003e, \u003cspan citationid=\"CR31\" class=\"CitationRef\"\u003e31\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eThe present study aims to address these gaps through two integrated objectives. The first objective is to apply a comprehensive array of computational geometric methods\u0026mdash;including Fitzgibbon ellipse fitting, genetic algorithm-optimized inner and outer bounding ellipses, systematic polygon approximation from 3-sided to 13-sided polygons, normalized elliptic Fourier transform with 20 harmonics, radial profile analysis, and point density heatmap construction\u0026mdash;to FM boundary coordinates extracted from CT images of a large contemporary Turkish sample (n\u0026thinsp;=\u0026thinsp;473), in order to objectively characterize and quantify morphological features that correspond to classical qualitative descriptions such as \"oval,\" \"round,\" \"irregular,\" and \"rough.\" The second objective is to evaluate the discriminative capacity of these objectively defined shape features for sex estimation, both independently and in combination with conventional size-based measurements, using multiple classification approaches including linear discriminant analysis, support vector machines, and random forest, with rigorous cross-validation; and to investigate whether unsupervised clustering reveals sex-specific morphological subtypes that may further elucidate the nature and extent of FM shape dimorphism.\u003c/p\u003e"},{"header":"Methods","content":"\u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003eStudy design and ethical approval\u003c/h2\u003e \u003cp\u003eThis retrospective cross-sectional study utilized cranial computed tomography (CT) images originally acquired for a prior investigation of foramen magnum (FM) sexual dimorphism in an Eastern Turkish population [\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e]. The original study protocol received approval from the Non-Interventional Clinical Research Ethics Committee of Van Yuzuncu Yil University Faculty of Medicine (decision number: 02, dated: 19.02.2017) and was conducted in accordance with the Declaration of Helsinki and the principles of Good Clinical Practice. Since the present study represents a secondary analysis of the same anonymized imaging dataset using computational geometric methods, no additional ethical approval was required. Informed consent was waived due to the retrospective design and full anonymization of all data.\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eStudy sample\u003c/h3\u003e\n\u003cp\u003eThe source dataset comprised 720 cranial CT examinations (360 males and 360 females) from adult patients who presented to Y\u0026uuml;z\u0026uuml;nc\u0026uuml; Yıl University Dursun Odabaş Medical Center between January and December 2016 for various clinical indications unrelated to craniocervical pathology [\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e]. The original sample was constructed using stratified sampling across six adult age groups (21\u0026ndash;30, 31\u0026ndash;40, 41\u0026ndash;50, 51\u0026ndash;60, 61\u0026ndash;70, and \u0026ge;\u0026thinsp;71 years) with equal sex distribution to control for potential age-related confounding effects. Exclusion criteria in the original study included skull trauma causing cranial deformities, congenital or acquired cranial bone diseases, any malignancy affecting the skull, and cases with missing or defective FM boundaries [\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eFor the present study, all 720 CT examinations were re-evaluated to determine their suitability for automated contour extraction. Cases were included if the axial CT slice displaying the widest FM cross-section yielded a boundary amenable to automated edge detection\u0026mdash;specifically, sufficient contrast at the bone\u0026ndash;soft tissue interface and a complete, uninterrupted FM margin. Cases with excessive image noise, partial volume artifacts obscuring the FM border, or incomplete boundary delineation after edge detection were excluded. After this additional selection, 473 individuals (234 males and 239 females) met the criteria and constituted the final study sample for geometric morphometric analysis.\u003c/p\u003e\n\u003ch3\u003eCT image acquisition\u003c/h3\u003e\n\u003cp\u003eAll CT examinations had been performed on a 16-detector multislice CT system (Somatom Emotion 16-slice, CT2012E; Siemens AG, Berlin and Munich, Germany). Imaging parameters included a tube voltage of 120 kV with automatic tube current modulation ranging between 80 and 120 mA, a section thickness of 1.2 mm, a reconstruction interval of 1.0 mm, a pitch factor of 0.8, a field of view of 220 mm, and a matrix size of 256 \u0026times; 256. Data acquisition was performed in the caudocranial direction without gantry tilt. Three-dimensional volume-rendered reformations were generated using Syngo VE.52A software (Siemens Healthcare) to minimize potential measurement errors arising from asymmetry or obliquity that may occur on conventional axial images [\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e]. The spatial calibration factor (mm per pixel) was individually extracted from the DICOM metadata of each examination and ranged from 0.271 to 0.513 mm/pixel (mean: 0.371 mm/pixel), ensuring accurate conversion of all pixel-based coordinate measurements to absolute millimeter values.\u003c/p\u003e\n\u003ch3\u003eForamen magnum boundary extraction\u003c/h3\u003e\n\u003cp\u003eFor each specimen, the axial CT slice displaying the widest cross-section of the FM was identified. The FM boundary was extracted using automated edge detection applied to the bone-windowed CT image at this level. The resulting contour coordinates (sequential X, Y pixel positions along the FM margin) were exported as individual comma-separated value (CSV) files. Each raw contour comprised between 36 and 187 boundary points (mean: 85.5).\u003c/p\u003e \u003cp\u003eA key methodological advantage of this approach is that the FM boundary was derived through computational image processing rather than manual tracing. As a consequence, the contour extraction procedure is fully deterministic: processing the same CT image repeatedly yields identical boundary coordinates. This eliminates inter-observer and intra-observer variability, which are inherent limitations of manual measurement approaches and have been documented as a source of error in conventional FM morphometric studies [\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e, \u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e]. To ensure comparability across specimens with different numbers of raw boundary points, all contours were resampled to 200 equally spaced points using cubic spline interpolation along the cumulative arc length prior to subsequent analysis [\u003cspan citationid=\"CR32\" class=\"CitationRef\"\u003e32\u003c/span\u003e].\u003c/p\u003e\n\u003ch3\u003eComputational geometric analyses\u003c/h3\u003e\n\u003cp\u003eAll computational analyses were implemented in Python 3.8\u0026thinsp;+\u0026thinsp;using NumPy, SciPy, scikit-learn, and Matplotlib [\u003cspan additionalcitationids=\"CR34 CR35\" citationid=\"CR33\" class=\"CitationRef\"\u003e33\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR36\" class=\"CitationRef\"\u003e36\u003c/span\u003e]. The complete analysis pipeline is publicly available as a single, self-contained script (FM_Comprehensive_Analysis.py) in the Zenodo repository accompanying this paper (DOI: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.5281/zenodo.18716445\u003c/span\u003e\u003cspan address=\"10.5281/zenodo.18716445\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e), along with all anonymized boundary coordinate data, enabling full reproducibility.\u003c/p\u003e \u003cdiv id=\"Sec8\" class=\"Section2\"\u003e \u003ch2\u003eClassical morphometric measurements\u003c/h2\u003e \u003cp\u003eEleven standard morphometric parameters were computed from each resampled contour: anteroposterior (AP) diameter (maximum extent along the Y-axis), transverse diameter (maximum extent along the X-axis), maximum and minimum diameters (derived from the convex hull), perimeter (sum of consecutive point-to-point Euclidean distances along the closed contour), area (computed using the shoelace formula [\u003cspan citationid=\"CR37\" class=\"CitationRef\"\u003e37\u003c/span\u003e]), FM index (AP/transverse diameter ratio), circularity (4πA/P\u0026sup2;, where A is area and P is perimeter), eccentricity (maximum/minimum diameter ratio), area-to-perimeter ratio (A/P), and compactness (P/\u0026radic;A). These parameters correspond to those commonly reported in the FM literature [\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e, \u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e, \u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e, \u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e, \u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e], thereby facilitating direct comparison with previously published dimensional data.\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eEllipse fitting\u003c/h3\u003e\n\u003cp\u003eThree ellipse models were fitted to each FM contour using distinct optimization strategies, together with a best-fit circle.\u003c/p\u003e \u003cp\u003eThe Fitzgibbon direct least-squares ellipse was obtained by minimizing the algebraic distance from contour points to a general conic equation, with an ellipticity constraint ensuring a valid ellipse solution [\u003cspan citationid=\"CR38\" class=\"CitationRef\"\u003e38\u003c/span\u003e]. Initial estimates were derived from the centroid and coordinate covariance, followed by geometric refinement via the Nelder-Mead simplex algorithm [\u003cspan citationid=\"CR39\" class=\"CitationRef\"\u003e39\u003c/span\u003e]. For each fitted ellipse, the following were recorded: center coordinates (cx, cy), semi-major axis (a), semi-minor axis (b), rotation angle (θ), four distance metrics (mean, standard deviation, and maximum Euclidean distance from each contour point to its nearest point on the ellipse boundary, and RMSE), the fitted ellipse area, the corresponding FM area, and the area ratio between the two. This yielded 12 features per ellipse model.\u003c/p\u003e \u003cp\u003eThe genetic algorithm (GA)-optimized inner bounding ellipse (the largest ellipse fully inscribed within the FM contour) and the GA-optimized outer bounding ellipse (the smallest ellipse fully circumscribing the FM contour) were estimated using a stochastic optimization approach. An initial population of 60 candidate ellipses (parameterized by center coordinates, semi-axes, and rotation angle) was evaluated over 150 generations, followed by local refinement using the Nelder-Mead algorithm. These bounding ellipses capture the deviation of the FM contour from an ideal elliptical shape: a large gap between inner and outer ellipses indicates a more irregular boundary. Each bounding ellipse model produced 12 features identical in structure to the Fitzgibbon model.\u003c/p\u003e \u003cp\u003eThe best-fit circle was obtained via algebraic least-squares fitting [\u003cspan citationid=\"CR40\" class=\"CitationRef\"\u003e40\u003c/span\u003e]. Ten features were extracted: center coordinates, radius, four distance metrics (mean, standard deviation, maximum, RMSE), circle area, FM area, and area ratio.\u003c/p\u003e \u003cp\u003eIn total, the three ellipse models and the circle model yielded 46 features.\u003c/p\u003e\n\u003ch3\u003ePolygon approximation\u003c/h3\u003e\n\u003cp\u003eTo capture FM boundary irregularity at multiple geometric resolution levels, systematic polygon approximation was performed for polygons with 3 to 13 sides (11 polygon orders). For each polygon order n, the optimal n-sided polygon was determined by angular sectoring from the FM centroid, with the vertex of each sector placed at the farthest contour point from the centroid within that sector. Twenty offset rotations were evaluated per polygon order and the configuration yielding the minimum RMSE was selected.\u003c/p\u003e \u003cp\u003eFor each polygon order, six features were extracted: RMSE between the contour and the polygon edges (point-to-segment distance), mean distance, standard deviation of distance, maximum distance, polygon area, and the polygon-to-FM area ratio. This yielded 66 polygon features in total.\u003c/p\u003e \u003cp\u003eThe rationale for this multi-scale approach is that low-sided polygons (3\u0026ndash;5 sides) capture gross shape asymmetry and angularity, while high-sided polygons (9\u0026ndash;13 sides) approximate the contour more closely and their residual errors primarily reflect fine boundary roughness and local irregularity. The progression of RMSE across polygon orders thus provides an objective, multi-resolution characterization of contour complexity.\u003c/p\u003e \u003cdiv id=\"Sec11\" class=\"Section2\"\u003e \u003ch2\u003ePolygon elbow (optimal complexity) analysis\u003c/h2\u003e \u003cp\u003eTo identify the polygon order at which additional sides yield diminishing returns in contour approximation accuracy, an elbow-point analysis was performed on each specimen's RMSE curve using the Kneedle algorithm [\u003cspan citationid=\"CR53\" class=\"CitationRef\"\u003e53\u003c/span\u003e]. For each specimen, the RMSE values for polygon orders 3 through 13 were extracted and the curve was normalized to the unit interval on both axes. A reference diagonal line connecting the first and last normalized points was constructed, and the deviation of the normalized RMSE curve from this diagonal was computed at each polygon order. The elbow point was defined as the polygon order at which this deviation was maximal, representing the point of maximum curvature\u0026mdash;i.e., the transition from rapid to marginal improvement in contour fit with each additional polygon side [\u003cspan citationid=\"CR53\" class=\"CitationRef\"\u003e53\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eFor each specimen, five derived metrics were computed at the identified elbow point: (1) the optimal polygon order (number of sides), (2) the RMSE at the optimal polygon, (3) the total improvement in RMSE from 3-sided to the optimal polygon as a percentage of the 3-sided RMSE, (4) the remaining improvement from the optimal polygon to 13-sided as a percentage, and (5) an efficiency ratio defined as the total improvement up to the elbow divided by the remaining improvement beyond it.\u003c/p\u003e \u003cp\u003eThe elbow analysis was performed at both the individual specimen level and the group level. At the group level, RMSE values were averaged across all specimens within each sex and across the entire sample, and the Kneedle algorithm was applied to these mean curves. The distribution of individually determined optimal polygon orders was compared between males and females using the Mann\u0026ndash;Whitney U test and Cohen's d to assess whether the two sexes differ in the geometric complexity required to adequately represent their FM boundaries. The step-by-step marginal improvement table\u0026mdash;quantifying the percentage RMSE reduction at each polygon-order increment for males, females, and the pooled sample\u0026mdash;was also computed and reported.\u003c/p\u003e \u003cp\u003eElliptic Fourier transform\u003c/p\u003e \u003cp\u003eThe normalized elliptic Fourier transform (EFT) was applied to each FM contour following the method of Kuhl and Giardina [\u003cspan citationid=\"CR24\" class=\"CitationRef\"\u003e24\u003c/span\u003e]. The EFT decomposes a closed two-dimensional contour into a series of harmonically related ellipses, where the first harmonic captures the gross elliptical shape and higher harmonics capture progressively finer morphological details. Twenty harmonics were computed, yielding 80 Fourier coefficients (four per harmonic: aₙ, bₙ, cₙ, dₙ). These coefficients were normalized for size, rotation, and starting-point invariance using the first-harmonic parameters, ensuring that the resulting descriptors reflect pure shape information independent of scale, orientation, and the arbitrary starting location of contour tracing [\u003cspan citationid=\"CR24\" class=\"CitationRef\"\u003e24\u003c/span\u003e, \u003cspan citationid=\"CR25\" class=\"CitationRef\"\u003e25\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eIn addition to the 80 normalized coefficients, 14 summary descriptors were derived: individual power ratios for harmonics 1\u0026ndash;10 (the proportion of total spectral power contributed by each harmonic), a higher-harmonic energy ratio (the proportion of total power from harmonics 4\u0026ndash;20, reflecting fine morphological detail relative to gross shape), total spectral power, a first-harmonic power ratio (power of the first harmonic relative to total power), and a symmetry index (ratio of odd-harmonic to even-harmonic power, reflecting bilateral symmetry [\u003cspan citationid=\"CR41\" class=\"CitationRef\"\u003e41\u003c/span\u003e]). This yielded 94 EFT features per specimen.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec12\" class=\"Section2\"\u003e \u003ch2\u003eRadial distance profile analysis\u003c/h2\u003e \u003cp\u003eTo provide intuitive, physically interpretable shape descriptors that correspond directly to classical qualitative morphological descriptions, a radial distance profile was computed for each FM contour. Radial distances from the centroid to each contour point were calculated and sorted by angular position. From this profile, six descriptors were extracted: radial mean and standard deviation (in mm), reflecting overall size and radial variability; radial coefficient of variation (CV), reflecting boundary regularity independent of size; roughness (mean absolute value of the second derivative of the normalized radial profile), quantifying high-frequency boundary undulation; left\u0026ndash;right asymmetry (mean absolute difference in normalized radial distances between corresponding points in the left and right halves); and anterior\u0026ndash;posterior (AP) asymmetry (the analogous measure for the anterior and posterior quarters).\u003c/p\u003e \u003cp\u003eRoughness, left\u0026ndash;right asymmetry, and AP asymmetry were computed from size-normalized (unit-mean) radial profiles to ensure they reflect pure shape properties.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec13\" class=\"Section2\"\u003e \u003ch2\u003eProcrustes normalization and point density heatmaps\u003c/h2\u003e \u003cp\u003eTo enable shape comparison independent of size, position, and orientation, all resampled FM contours were subjected to Procrustes superimposition [\u003cspan citationid=\"CR42\" class=\"CitationRef\"\u003e42\u003c/span\u003e]. Each contour was: (1) translated to place its centroid at the origin, (2) scaled to unit centroid size, and (3) rotated to align its principal axis (determined by eigendecomposition of the coordinate covariance matrix) with the horizontal axis. A reflection correction ensured consistent orientation across all specimens.\u003c/p\u003e \u003cp\u003eFollowing Procrustes normalization, sex-specific point density heatmaps were generated by pooling all normalized contour points for each sex onto a 200 \u0026times; 200 grid. Gaussian kernel smoothing (σ\u0026thinsp;=\u0026thinsp;2 grid units) was applied to produce continuous density surfaces (Fig.\u0026nbsp;1). For each specimen, a male heatmap score and a female heatmap score were computed as the mean density value at the specimen's normalized contour point locations within the respective sex-specific heatmap. The heatmap difference score (male score minus female score) served as an additional descriptive feature, yielding 3 heatmap-derived features per specimen. These global heatmaps were used solely for descriptive visualization and feature extraction; heatmap-based classification employed a separate cross-validated procedure described below.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec14\" class=\"Section2\"\u003e \u003ch2\u003eStatistical analysis\u003c/h2\u003e \u003cdiv id=\"Sec15\" class=\"Section3\"\u003e \u003ch2\u003eSex comparison of individual features\u003c/h2\u003e \u003cp\u003eAll 234 numeric variables were compared between males and females using the Mann\u0026ndash;Whitney U test, chosen for its robustness to non-normality [\u003cspan citationid=\"CR43\" class=\"CitationRef\"\u003e43\u003c/span\u003e]. Effect sizes were quantified using Cohen's d, computed as the difference in group means divided by the pooled standard deviation [\u003cspan citationid=\"CR44\" class=\"CitationRef\"\u003e44\u003c/span\u003e]. The Bonferroni correction was applied to all p-values to control the familywise error rate at α\u0026thinsp;=\u0026thinsp;0.05 across all 234 simultaneous comparisons. Features were considered statistically significant if the Bonferroni-corrected p-value was less than 0.05.\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv id=\"Sec16\" class=\"Section2\"\u003e \u003ch2\u003eSex classification\u003c/h2\u003e \u003cp\u003eThe discriminative capacity of the extracted features for sex estimation was evaluated using four classifiers: linear discriminant analysis (LDA) [\u003cspan citationid=\"CR45\" class=\"CitationRef\"\u003e45\u003c/span\u003e], support vector machine with linear kernel (SVM-linear), support vector machine with radial basis function kernel (SVM-RBF) [\u003cspan citationid=\"CR46\" class=\"CitationRef\"\u003e46\u003c/span\u003e], and random forest (RF, 100 trees) [\u003cspan citationid=\"CR47\" class=\"CitationRef\"\u003e47\u003c/span\u003e]. All classifiers were evaluated using 10-fold stratified cross-validation with a fixed random seed (42) to ensure reproducibility. Within each fold, features were standardized to zero mean and unit variance using the training set parameters only, and the same standardization was then applied to the test set to prevent data leakage.\u003c/p\u003e \u003cp\u003eClassification performance was assessed using overall accuracy (proportion of correctly classified specimens), sensitivity (proportion of males correctly identified), and specificity (proportion of females correctly identified).\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec17\" class=\"Section2\"\u003e \u003ch2\u003eFeature sets\u003c/h2\u003e \u003cp\u003eTo systematically evaluate the contribution of size-based versus shape-based features, classification was performed on the following feature sets:\u003c/p\u003e \u003cp\u003eS1 (Size): Six classical size measurements\u0026mdash;AP diameter, transverse diameter, maximum and minimum diameters, perimeter, and area\u0026mdash;that directly capture FM dimensions in millimeters.\u003c/p\u003e \u003cp\u003eS2 (Shape): All non-size shape features (174 features), including FM index, circularity, eccentricity, area-to-perimeter ratio, compactness, all EFT coefficients and summary descriptors, radial CV, roughness, asymmetry indices, polygon distance metrics, polygon area ratios, ellipse and circle distance metrics (RMSE, mean/std/max distances), and ellipse area ratios. Excluded from this set were absolute area measurements in mm\u0026sup2; (polygon areas, ellipse areas, FM areas), ellipse geometric parameters (center coordinates, semi-axes, rotation angles), circle radius, radial mean and standard deviation in mm, and polygon elbow-derived metrics, as these either directly encode size information or represent pipeline metadata rather than morphometric descriptors.\u003c/p\u003e \u003cp\u003eS3 (Combined): The union of S1 and S2 (180 features).\u003c/p\u003e \u003cp\u003eS4 (Forward-selected Shape): The minimal subset of S2 features identified by sequential forward selection (see below).\u003c/p\u003e \u003cp\u003eS5 (Forward-selected Combined): The minimal subset of S3 features identified by forward selection.\u003c/p\u003e \u003cp\u003eS6 (Forward-selected All): The minimal subset from all available numeric features (including absolute area measurements and polygon elbow metrics) identified by forward selection.\u003c/p\u003e \u003cp\u003eIn addition, two heatmap-based classification methods were evaluated as separate, feature-free approaches (see Heatmap-based classification below).\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec18\" class=\"Section2\"\u003e \u003ch2\u003eForward feature selection\u003c/h2\u003e \u003cp\u003eSequential forward feature selection (SFS) was performed to identify minimal, non-redundant feature subsets that maximize classification accuracy [\u003cspan citationid=\"CR48\" class=\"CitationRef\"\u003e48\u003c/span\u003e]. Starting from an empty set, at each step the feature producing the greatest improvement in 10-fold stratified LDA cross-validation accuracy was added. The process terminated when no additional feature improved accuracy. Three separate forward selections were conducted corresponding to feature sets S4, S5, and S6, selecting from shape features exclusively (S2 pool), the combined size\u0026ndash;shape pool (S3), and all available features, respectively.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec19\" class=\"Section2\"\u003e \u003ch2\u003eHeatmap-based sex classification\u003c/h2\u003e \u003cp\u003eA novel heatmap-based classification approach was developed as an alternative to conventional feature-based methods. Instead of extracting discrete features and applying a classifier, this method operates directly on the spatial distribution of Procrustes-normalized FM boundary points, comparing each test specimen's contour against sex-specific population density templates.\u003c/p\u003e \u003cp\u003eTwo scoring strategies were evaluated under proper 10-fold stratified cross-validation with strict train/test separation\u0026mdash;heatmaps were constructed exclusively from training-fold specimens, and test specimens were never included in the heatmaps against which they were scored, thereby preventing data leakage.\u003c/p\u003e \u003c/div\u003e\u003cp\u003e\u003cstrong\u003eMethod A \u0026mdash; Point-sampling:\u003c/strong\u003e For each fold, sex-specific density heatmaps were constructed from training specimens by accumulating their Procrustes-normalized contour points onto a shared 200 \u0026times; 200 grid and applying Gaussian smoothing (\u0026sigma; = 2 grid units). Each test specimen\u0026apos;s normalized contour was then overlaid on both the male and female training heatmaps, and the mean density value at the specimen\u0026apos;s boundary point positions was computed. Classification was assigned to the sex whose heatmap yielded the higher score.\u003c/p\u003e\n\n\u003cp\u003e\u003cstrong\u003eMethod B \u0026mdash; Kernel density estimation (KDE) with overlap integral scoring:\u003c/strong\u003e This method addresses a limitation of point-sampling, which evaluates only the exact grid cells occupied by the test contour and is therefore sensitive to minor positional shifts and pixel-level gaps. In the KDE approach, each contour point\u0026mdash;in both the training heatmaps and the test specimen\u0026mdash;is treated as the center of a two-dimensional Gaussian kernel, producing continuous density fields rather than discrete point samples. The kernel has full intensity (1.0) at its center, diminishing to approximately half intensity at one pixel distance, one-quarter at two pixels, and so on\u0026mdash;following a Gaussian decay profile. This ensures that nearby but not exactly coincident boundary points still contribute to the similarity measure.\u003c/p\u003e\n\n\u003cp\u003eThe similarity between a test specimen\u0026apos;s density field and each sex-specific reference field was quantified using the overlap integral\u0026mdash;the element-wise product of the two fields summed over the entire grid. This metric is related to the Bhattacharyya coefficient [54] and provides a continuous, smooth similarity score that is robust to minor positional shifts. Classification was assigned to the sex whose reference field yielded the higher overlap score.\u003c/p\u003e\n\n\u003cp\u003eThe kernel bandwidth (controlling the spatial extent of each Gaussian kernel in grid-pixel units) was optimized by evaluating values of 3, 5, 7, 9, and 11 pixels and selecting the value that maximized cross-validated accuracy within the same fold structure.\u003c/p\u003e\n\n\u003cp\u003e\u003cstrong\u003eWithin-sex morphological subtype discovery\u003c/strong\u003e\u003c/p\u003e\n\n\u003cp\u003eTo investigate whether the FM exhibits distinct morphological subtypes within each sex, unsupervised clustering was performed using Gaussian mixture models (GMM) [49]. For each sex independently: (1) Procrustes-normalized contour coordinates were flattened into feature vectors (200 points \u0026times; 2 coordinates = 400 dimensions), (2) principal component analysis (PCA) was applied to reduce dimensionality to the first 20 components (retaining \u0026gt; 95% of variance), and (3) GMM with full covariance matrices was fitted for k = 2, 3, and 4 clusters (10 random initializations per k).\u003c/p\u003e\n\n\u003cp\u003eThe optimal number of clusters was selected based on the silhouette score [50] and the Bayesian information criterion (BIC) [51]. Each cluster was characterized by computing its mean values for eight shape descriptors: circularity, radial CV, roughness, left\u0026ndash;right asymmetry, AP asymmetry, FM index, Fitzgibbon ellipse fitting RMSE, and area-to-perimeter ratio. For visualization, each subtype was represented by individual contour overlays (with adaptive transparency based on cluster size), mean contour shapes with \u0026plusmn;1 SD and \u0026plusmn;2 SD radial variation bands, density heatmaps, and sex-matched subtype comparisons showing the mean contour of the corresponding male and female cluster side by side.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eSoftware and reproducibility\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eAll analyses were implemented in Python 3 using the following libraries: NumPy 1.21+ for numerical computation [33], SciPy 1.7+ for statistical tests and optimization [34], scikit-learn 1.0+ for classification, clustering, and cross-validation [35], Matplotlib 3.4+ for visualization [36], and pandas 1.3+ for data management [52]. The complete analysis code, all 473 anonymized FM boundary coordinate files (500 equally-spaced points per specimen, resampled via periodic cubic spline interpolation), and associated metadata are publicly available in a Zenodo repository (DOI: 10.5281/zenodo.18716445) to ensure full reproducibility. The repository is currently under embargo and will be made publicly accessible upon publication; reviewer access can be arranged upon request to the corresponding author. Per-specimen morphometric measurements are provided in Table S1.\u003c/p\u003e"},{"header":"Results","content":"\u003cdiv id=\"Sec23\" class=\"Section2\"\u003e \u003ch2\u003eStudy sample\u003c/h2\u003e \u003cp\u003eA total of 473 adult CT scans met the inclusion criteria. The sample comprised 234 males (49.5%) and 239 females (50.5%). Automated contour extraction yielded between 36 and 187 boundary points per foramen magnum (mean: 85.5), which were resampled to 200 equidistant points for all subsequent analyses. Six specimens were excluded from ellipse fitting due to convergence failure, leaving 467 specimens for the polygon elbow analysis and the full feature set.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec24\" class=\"Section2\"\u003e \u003ch2\u003eDescriptive morphometry\u003c/h2\u003e \u003cp\u003eClassical linear and area measurements revealed pronounced sexual dimorphism in all size-related parameters (Table\u0026nbsp;1). Male FM perimeters averaged 107.13\u0026thinsp;\u0026plusmn;\u0026thinsp;11.16 mm compared to 95.02\u0026thinsp;\u0026plusmn;\u0026thinsp;7.36 mm in females (Cohen's d\u0026thinsp;=\u0026thinsp;1.284, p\u0026thinsp;\u0026lt;\u0026thinsp;10⁻\u0026sup3;⁶). FM area showed a comparable effect (males: 830.1\u0026thinsp;\u0026plusmn;\u0026thinsp;159.9 mm\u0026sup2;, females: 665.3\u0026thinsp;\u0026plusmn;\u0026thinsp;99.0 mm\u0026sup2;; d\u0026thinsp;=\u0026thinsp;1.242). Anteroposterior and transverse diameters were both significantly larger in males (AP: 36.53\u0026thinsp;\u0026plusmn;\u0026thinsp;3.48 vs. 32.68\u0026thinsp;\u0026plusmn;\u0026thinsp;2.77 mm, d\u0026thinsp;=\u0026thinsp;1.225; transverse: 31.35\u0026thinsp;\u0026plusmn;\u0026thinsp;3.16 vs. 28.23\u0026thinsp;\u0026plusmn;\u0026thinsp;2.31 mm, d\u0026thinsp;=\u0026thinsp;1.132).\u003c/p\u003e \u003cp\u003eIn contrast, shape descriptors showed minimal dimorphism. The FM index did not differ significantly between sexes (males: 1.17\u0026thinsp;\u0026plusmn;\u0026thinsp;0.08, females: 1.16\u0026thinsp;\u0026plusmn;\u0026thinsp;0.09; d\u0026thinsp;=\u0026thinsp;0.088, p\u0026thinsp;\u0026gt;\u0026thinsp;0.05 after Bonferroni correction). Circularity showed a small, non-significant trend toward rounder foramina in females (females: 0.92\u0026thinsp;\u0026plusmn;\u0026thinsp;0.04, males: 0.91\u0026thinsp;\u0026plusmn;\u0026thinsp;0.07; d\u0026thinsp;=\u0026thinsp;\u0026minus;\u0026thinsp;0.264, p\u0026thinsp;\u0026gt;\u0026thinsp;0.05). Left\u0026ndash;right and anteroposterior asymmetry indices likewise showed no significant sex differences (d\u0026thinsp;=\u0026thinsp;0.034 and d\u0026thinsp;=\u0026thinsp;0.088, respectively).\u003c/p\u003e \u003cdiv id=\"Sec25\" class=\"Section3\"\u003e \u003ch2\u003eFeature significance\u003c/h2\u003e \u003cp\u003eOf 234 morphometric features extracted per specimen, 92 (39.3%) showed statistically significant sex differences after Bonferroni correction (p\u0026thinsp;\u0026lt;\u0026thinsp;0.05) (Table\u0026nbsp;1; Table \u003cspan refid=\"MOESM2\" class=\"InternalRef\"\u003eS2\u003c/span\u003e). The top five features by effect size were all area-derived measures: polygon approximation areas (d\u0026thinsp;=\u0026thinsp;1.285 for 10-sided polygon area), perimeter (d\u0026thinsp;=\u0026thinsp;1.284), Fitzgibbon ellipse area (d\u0026thinsp;=\u0026thinsp;1.279), and total FM area (d\u0026thinsp;=\u0026thinsp;1.242). All six classical size measurements were significant.\u003c/p\u003e \u003cp\u003eAmong pure shape features (dimensionless ratios and normalized coefficients), only 17 of 146 (11.6%) reached significance. All 11 polygon approximation areas in mm\u0026sup2; were significant (d\u0026thinsp;=\u0026thinsp;1.237\u0026ndash;1.285), but none of the 14 polygon area ratios achieved significance, confirming that the discriminative information resided in absolute size rather than shape proportions. Elliptic Fourier transform (EFT) coefficients showed limited dimorphism: only the total power spectrum (d\u0026thinsp;=\u0026thinsp;0.855, p\u0026thinsp;\u0026lt;\u0026thinsp;10⁻\u0026sup1;⁵) reached significance among the 94 EFT-derived features. The best-fitted ellipse semi-axes were significant (semi-major axis d\u0026thinsp;=\u0026thinsp;0.818, semi-minor axis d\u0026thinsp;=\u0026thinsp;0.828), again reflecting size.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec26\" class=\"Section3\"\u003e \u003ch2\u003ePolygon approximation analysis\u003c/h2\u003e \u003cp\u003eThe Kneedle-based elbow detection identified the optimal polygon as 6-sided for both sexes (mode\u0026thinsp;=\u0026thinsp;6, mean\u0026thinsp;=\u0026thinsp;5.60\u0026thinsp;\u0026plusmn;\u0026thinsp;0.94 sides) (Table \u003cspan refid=\"MOESM3\" class=\"InternalRef\"\u003eS3\u003c/span\u003e). The distribution was: 6-sided (n\u0026thinsp;=\u0026thinsp;241, 51.6%), 5-sided (n\u0026thinsp;=\u0026thinsp;84, 18.0%), 4-sided (n\u0026thinsp;=\u0026thinsp;83, 17.8%), 7-sided (n\u0026thinsp;=\u0026thinsp;54, 11.6%), and 8-sided (n\u0026thinsp;=\u0026thinsp;5, 1.1%) (Fig.\u0026nbsp;4).\u003c/p\u003e \u003cp\u003eThe transition from 3 to 4 sides provided the largest marginal RMSE improvement (50.1% for males, 50.7% for females), followed by 5\u0026rarr;6 sides (37.6% and 35.5%, respectively) (Table\u0026nbsp;2; Fig.\u0026nbsp;3). Beyond 6 sides, each additional vertex contributed diminishing returns (\u0026lt;\u0026thinsp;25%). Mean optimal polygon RMSE was 1.80\u0026thinsp;\u0026plusmn;\u0026thinsp;0.47 mm, representing 74.0% improvement over the initial 3-sided approximation. Males and females showed no significant difference in optimal polygon selection (male mean\u0026thinsp;=\u0026thinsp;5.68, female mean\u0026thinsp;=\u0026thinsp;5.53; p\u0026thinsp;\u0026gt;\u0026thinsp;0.05), indicating that the geometric complexity of the FM boundary is sex-invariant.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec27\" class=\"Section3\"\u003e \u003ch2\u003eSex classification\u003c/h2\u003e \u003cp\u003eClassification performance was evaluated using seven feature sets across four classifiers with stratified 10-fold cross-validation (Table\u0026nbsp;3; Table \u003cspan refid=\"MOESM4\" class=\"InternalRef\"\u003eS4\u003c/span\u003e; Fig.\u0026nbsp;4).\u003c/p\u003e \u003cp\u003eThe highest accuracy was achieved by forward-selected shape features (S4) with LDA: 80.9% \u0026plusmn; 4.9% (sensitivity\u0026thinsp;=\u0026thinsp;76.4%, specificity\u0026thinsp;=\u0026thinsp;85.3%). The forward selection procedure identified seven optimal shape features: area-to-perimeter ratio, circularity, Fitzgibbon ellipse RMSE, and four EFT coefficients (harmonics 48, 21, 13, and 51) (Table\u0026nbsp;4). Notably, the first feature alone (area-to-perimeter ratio) achieved 74.9%, and the addition of circularity raised accuracy to 77.7%.\u003c/p\u003e \u003cp\u003eForward-selected combined features (S5, LDA) and the full feature set (S6, LDA) both reached 80.5% \u0026plusmn; 6.8%, with higher specificity (85.7%) but lower sensitivity (75.1%) than S4 (Table \u003cspan refid=\"MOESM5\" class=\"InternalRef\"\u003eS5\u003c/span\u003e). The combined forward selection started with perimeter (78.1% alone) and added five shape refinements to reach the final accuracy.\u003c/p\u003e \u003cp\u003eSize features alone (S1) reached 75.6% \u0026plusmn; 5.7% with LDA, and 77.3% \u0026plusmn; 7.0% with SVM-rbf. The full shape feature set without selection (S2) achieved 67.9% with LDA but 77.1% with Random Forest, suggesting non-linear shape relationships captured better by ensemble methods (Fig.\u0026nbsp;4).\u003c/p\u003e \u003cp\u003eAmong classifiers, LDA consistently outperformed others on forward-selected features, while Random Forest showed superior performance on unselected high-dimensional feature sets (S2: RF 77.1% vs. LDA 67.9%).\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv id=\"Sec28\" class=\"Section2\"\u003e \u003ch2\u003eHeatmap-based density classification\u003c/h2\u003e \u003cp\u003eA novel classification approach was implemented using Procrustes-normalized point-density heatmaps. Two scoring strategies were compared under proper 10-fold cross-validation with strict train/test separation.\u003c/p\u003e \u003cp\u003ePoint-sampling \u0026mdash; where each test contour point was scored against the reference heatmap at its exact grid position \u0026mdash; achieved only 50.3% \u0026plusmn; 3.2%, effectively at chance level. The method exhibited extreme sensitivity bias (96.2% for males) with near-zero specificity (5.4% for females), indicating systematic classification toward one sex.\u003c/p\u003e \u003cp\u003eKDE overlap scoring \u0026mdash; where both the reference and test contours were rendered as Gaussian density fields (optimal bandwidth\u0026thinsp;=\u0026thinsp;9 pixels) and their element-wise product integrated \u0026mdash; improved accuracy to 54.1% \u0026plusmn; 7.1% with more balanced sensitivity (70.5%) and specificity (38.1%).\u003c/p\u003e \u003cp\u003eThe near-chance performance of both heatmap methods is attributable to Procrustes normalization, which removes all size information. Since the dominant sexual dimorphism in FM morphology is size-based (top features: d\u0026thinsp;=\u0026thinsp;1.28), the normalized contours retain only subtle shape differences insufficient for density-based discrimination. This negative finding is consistent with the observation that pure shape features without feature selection achieved only 67.9% accuracy even with LDA, a method with far greater discriminative capacity than density overlap (Fig.\u0026nbsp;1).\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec29\" class=\"Section2\"\u003e \u003ch2\u003eWithin-sex morphological subtypes\u003c/h2\u003e \u003cp\u003eGMM clustering on Procrustes-normalized contour coordinates, evaluated for k\u0026thinsp;=\u0026thinsp;2, 3, and 4 clusters, revealed distinct patterns of within-sex variation (Table\u0026nbsp;5; Fig.\u0026nbsp;5; Table \u003cspan refid=\"MOESM6\" class=\"InternalRef\"\u003eS6\u003c/span\u003e; Table \u003cspan refid=\"MOESM7\" class=\"InternalRef\"\u003eS7\u003c/span\u003e, Figure \u003cspan refid=\"MOESM1\" class=\"InternalRef\"\u003eS1\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eFor males, the optimal cluster number was k\u0026thinsp;=\u0026thinsp;2 based on silhouette score (0.72) and BIC (Fig.\u0026nbsp;6). The dominant subtype (Subtype-1, n\u0026thinsp;=\u0026thinsp;175, 74.8%) showed higher circularity (0.914) and a more elliptical FM index (1.177), representing the typical male FM morphology. The minor subtype (Subtype-0, n\u0026thinsp;=\u0026thinsp;59, 25.2%) exhibited lower circularity (0.884) and higher radial variability (CV\u0026thinsp;=\u0026thinsp;7.7%), indicating a distinct, more irregular morphological pattern (Fig.\u0026nbsp;5; Figure \u003cspan refid=\"MOESM2\" class=\"InternalRef\"\u003eS2\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eFor females, the silhouette score peaked at k\u0026thinsp;=\u0026thinsp;3 (0.78), though k\u0026thinsp;=\u0026thinsp;2 also performed well (0.74) (Fig.\u0026nbsp;6). At k\u0026thinsp;=\u0026thinsp;2, the dominant subtype (Subtype-0, n\u0026thinsp;=\u0026thinsp;171, 71.5%) showed high circularity (0.923) with an FM index of 1.164. The secondary subtype (Subtype-1, n\u0026thinsp;=\u0026thinsp;68, 28.5%) had comparable circularity (0.920) but slightly higher asymmetry (0.058 vs. 0.050), suggesting size-related rather than shape-related differentiation (Fig.\u0026nbsp;5; Figure \u003cspan refid=\"MOESM3\" class=\"InternalRef\"\u003eS3\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eThe subtype analysis revealed that male FM morphology exhibits greater heterogeneity than female. At k\u0026thinsp;=\u0026thinsp;4, male subtypes included a clearly atypical group (Subtype-1, n\u0026thinsp;=\u0026thinsp;16, 6.8%) with the highest radial CV (10.5%) and lowest circularity, while female subtypes remained more morphologically coherent. These findings suggest that the male foramen magnum encompasses a wider range of morphological variants (Figure \u003cspan refid=\"MOESM4\" class=\"InternalRef\"\u003eS4\u003c/span\u003e-\u003cspan refid=\"MOESM17\" class=\"InternalRef\"\u003eS17\u003c/span\u003e).\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eSummary of key findings\u003c/h3\u003e\n\u003cp\u003eThe foramen magnum shows clear sexual dimorphism, but this dimorphism is overwhelmingly driven by size rather than shape. Males have FM perimeters approximately 12.7% longer and areas roughly 24.8% larger than females, yielding large effect sizes (d\u0026thinsp;\u0026asymp;\u0026thinsp;1.28). However, when size is removed through normalization, male and female FM boundaries are remarkably similar \u0026mdash; shape-only features achieve only modest classification gains over chance.\u003c/p\u003e \u003cp\u003eThe most effective sex estimation (80.9% accuracy) required a combination of shape features selected through forward selection, including geometric ratios, ellipse fitting residuals, and specific Fourier harmonics. This suggests that while no single shape feature strongly discriminates sex, certain combinations of subtle boundary irregularities can collectively approach the discriminative power of size-based measurements. The finding that shape features outperform size features at optimal selection (80.9% vs. 75.6%) indicates that shape encodes complementary \u0026mdash; though individually weak \u0026mdash; dimorphic information.\u003c/p\u003e \u003cp\u003eIn practical forensic terms, these results mean that: (1) the foramen magnum can correctly predict biological sex in approximately 4 out of 5 individuals; (2) female FMs are better classified (specificity 85.3%) than male FMs (sensitivity 76.4%); and (3) when only fragmentary remains are available where size cannot be assessed, automated shape analysis can still provide meaningful, though less precise, sex estimation.\u003c/p\u003e"},{"header":"Discussion","content":"\u003cdiv id=\"Sec32\" class=\"Section2\"\u003e \u003ch2\u003ePrincipal findings and novelty\u003c/h2\u003e \u003cp\u003eThis study presents the first comprehensive computational geometric morphometric analysis of the foramen magnum that integrates multiple complementary outline-based approaches \u0026mdash; elliptic Fourier transform, systematic polygon approximation, ellipse fitting, radial distance profiling, Procrustes-based density heatmaps, and Gaussian kernel density estimation \u0026mdash; to jointly characterize FM shape and its relationship to biological sex. While conventional FM studies have relied on a small number of linear measurements and subjective shape classification [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e, \u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e], the present work extracted 231 quantitative features per specimen from automatically detected contour coordinates, enabling an objective, reproducible, and high-dimensional evaluation of FM morphology.\u003c/p\u003e \u003cp\u003eThe best classification accuracy of 80.9% was achieved using forward-selected shape features with LDA \u0026mdash; a result that compares favorably with the reported range of 62.5%\u0026ndash;86.7% for conventional FM morphometry across various populations [\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e, \u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e, \u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e, \u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e, \u003cspan citationid=\"CR55\" class=\"CitationRef\"\u003e55\u003c/span\u003e]. Our most consequential finding, however, is not the classification accuracy itself but rather the quantitative demonstration that FM sexual dimorphism is overwhelmingly driven by size rather than shape. When size information was removed through Procrustes normalization, classification accuracy dropped sharply, and the novel density-heatmap approach \u0026mdash; which operates exclusively on size-normalized contours \u0026mdash; achieved only 54.1%. This provides the first direct, quantitative evidence that residual FM shape differences between sexes are minimal once allometric scaling is accounted for.\u003c/p\u003e \u003cdiv id=\"Sec33\" class=\"Section3\"\u003e \u003ch2\u003eSize dominates FM sexual dimorphism\u003c/h2\u003e \u003cp\u003eAll six classical size measurements differed significantly between sexes with large effect sizes (Cohen's d\u0026thinsp;=\u0026thinsp;1.13\u0026ndash;1.28). Perimeter showed the strongest discriminative capacity (d\u0026thinsp;=\u0026thinsp;1.284, p\u0026thinsp;\u0026lt;\u0026thinsp;10⁻\u0026sup3;⁶), followed by polygon area (d\u0026thinsp;=\u0026thinsp;1.285) and total FM area (d\u0026thinsp;=\u0026thinsp;1.242). Males had FM perimeters approximately 12.7% longer and areas roughly 24.8% larger than females. These findings are consistent with the broader FM literature. Gapert et al. [\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e] reported that FM width and area were among the best univariate discriminating parameters in a British sample, achieving a maximum multivariate accuracy of 70.3%. Madadin et al. [\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e] similarly found that all FM dimensions were significantly larger in males in a Saudi population, with width yielding the highest univariate accuracy of 65.0%. A recent systematic review and meta-analysis by Fernandes et al. [\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e] confirmed that FM dimensions are consistently larger in males across populations, though classification accuracy varies substantially by population.\u003c/p\u003e \u003cp\u003eIn contrast, shape indices showed negligible dimorphism in our sample. The FM index \u0026mdash; the most widely used shape descriptor in the literature \u0026mdash; did not differ significantly between sexes (d\u0026thinsp;=\u0026thinsp;0.088, p\u0026thinsp;\u0026gt;\u0026thinsp;0.05). Circularity showed only a weak, non-significant trend (d\u0026thinsp;=\u0026thinsp;\u0026minus;\u0026thinsp;0.264), and left\u0026ndash;right and AP asymmetry were virtually identical between sexes. These results align with Toneva et al. [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e], who found that FM shape did not provide substantial sex discrimination in a Bulgarian population, and with our previous study [\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e], which reported that the H/W index and FMI had no significant sex difference (p\u0026thinsp;=\u0026thinsp;0.622 and p\u0026thinsp;=\u0026thinsp;0.440, respectively) and yielded univariate accuracies of only 51.9% and 52.8%.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec34\" class=\"Section3\"\u003e \u003ch2\u003eFrom subjective categories to objective shape quantification\u003c/h2\u003e \u003cp\u003eA particularly notable contribution of this work concerns the long-standing problem of FM shape classification. Numerous studies have attempted to characterize FM morphology using categorical descriptors such as \"oval,\" \"round,\" \"tetragonal,\" \"pentagonal,\" \"egg-shaped,\" \"hexagonal,\" and \"irregular\" [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e, \u003cspan citationid=\"CR56\" class=\"CitationRef\"\u003e56\u003c/span\u003e]. As Zdilla et al. [\u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e22\u003c/span\u003e] noted in their comprehensive review, these shape categories rely on ambiguous terminology that is inconsistent between studies, observer-dependent, and difficult to reproduce. Indeed, visual shape classification is inherently subjective: the same foramen may be classified as \"oval\" by one observer and \"egg-shaped\" by another, and studies vary widely in how many and which shape categories they employ [\u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e22\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eOur polygon approximation analysis offers a direct, objective alternative. By fitting optimal polygons of 3 through 13 sides to each FM contour and identifying the elbow point where additional sides yield diminishing accuracy improvements, we determined that the typical FM is best approximated by a 6-sided polygon (mode\u0026thinsp;=\u0026thinsp;6 in both sexes, median\u0026thinsp;=\u0026thinsp;6, mean\u0026thinsp;=\u0026thinsp;5.60\u0026thinsp;\u0026plusmn;\u0026thinsp;0.94). This finding objectively confirms and quantifies what the traditional literature has described qualitatively: the FM boundary is broadly hexagonal with substantial individual variation spanning 4 to 8 sides (Fig.\u0026nbsp;2). Furthermore, the absence of a significant sex difference in optimal polygon order (male mean\u0026thinsp;=\u0026thinsp;5.68, female mean\u0026thinsp;=\u0026thinsp;5.53; p\u0026thinsp;\u0026gt;\u0026thinsp;0.05) provides the first quantitative evidence that the geometric complexity of the FM boundary is sex-invariant \u0026mdash; a finding that could not have been established through subjective visual categorization.\u003c/p\u003e \u003cp\u003eSimilarly, the elliptic Fourier transform decomposed each FM contour into 20 harmonics, enabling objective quantification of boundary features at multiple spatial scales. Of the 94 EFT-derived features, only the total spectral power (d\u0026thinsp;=\u0026thinsp;0.855) reached significance, reflecting the global size component captured by the power spectrum. Individual harmonic power ratios and normalized coefficients showed minimal dimorphism, confirming that fine boundary details \u0026mdash; the \"roughness,\" undulations, and local irregularities that distinguish one FM shape from another \u0026mdash; do not differ systematically between sexes.\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e\n\u003ch3\u003eThe value of shape features despite modest individual effects\u003c/h3\u003e\n\u003cp\u003eAlthough no single shape feature showed strong dimorphism, the forward selection procedure demonstrated that particular combinations of individually weak shape features can collectively approach and even exceed the discriminative capacity of size-based measurements. The optimal shape feature subset (area-to-perimeter ratio, circularity, Fitzgibbon ellipse RMSE, and four EFT coefficients) achieved 80.9% accuracy \u0026mdash; surpassing size features alone (75.6%) (Table\u0026nbsp;3). This pattern suggests that shape encodes complementary information: each shape feature captures a subtle, partially independent aspect of FM morphology, and their combination reveals dimorphic patterns that individual features or simple size measurements cannot.\u003c/p\u003e \u003cp\u003eThis finding has practical implications for forensic anthropology. In cases involving fragmentary cranial remains \u0026mdash; such as fire-damaged or blast-affected specimens \u0026mdash; size measurements may be compromised by incomplete margins, while shape features derived from even a partially preserved FM contour could still contribute to sex estimation [\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e, \u003cspan citationid=\"CR30\" class=\"CitationRef\"\u003e30\u003c/span\u003e]. Furthermore, the forward selection identified specific features \u0026mdash; area-to-perimeter ratio, circularity, and ellipse fitting residuals \u0026mdash; that are computationally simple and could be readily integrated into automated forensic analysis tools.\u003c/p\u003e\n\u003ch3\u003eHeatmap-based classification: an informative negative result\u003c/h3\u003e\n\u003cp\u003eThe density heatmap approach, particularly the novel KDE overlap scoring method, represents a conceptually distinct strategy that bypasses discrete feature extraction entirely, instead comparing continuous spatial distributions of boundary points. The near-chance performance of this method (54.1%) is itself an informative finding. Because heatmap classification operates on Procrustes-normalized contours \u0026mdash; from which all size information has been removed \u0026mdash; its failure provides direct confirmation that the spatial distribution of FM boundary points does not differ meaningfully between sexes after size normalization. This result is consistent with the statistical finding that pure shape features collectively explain only modest variance in sex (LDA on the full shape feature set yielded 67.9%), and offers a population-level visualization of this principle.\u003c/p\u003e \u003cp\u003eWe note that this approach may prove more effective for anatomical structures where shape differences between groups are more pronounced. For example, EFA applied to the greater sciatic notch \u0026mdash; a structure with well-documented qualitative shape dimorphism \u0026mdash; has achieved classification rates exceeding 80% [\u003cspan citationid=\"CR57\" class=\"CitationRef\"\u003e57\u003c/span\u003e]. The heatmap methodology developed here could be readily extended to such structures, where it may provide superior discrimination precisely because the shape differences are larger.\u003c/p\u003e \u003cdiv id=\"Sec37\" class=\"Section2\"\u003e \u003ch2\u003eWithin-sex morphological heterogeneity\u003c/h2\u003e \u003cp\u003eThe GMM-based subtype analysis revealed an asymmetry in within-sex morphological variation. Males showed greater morphological heterogeneity, with k\u0026thinsp;=\u0026thinsp;2 as the optimal cluster number but a clearly identifiable atypical minority subtype (25.2%) characterized by lower circularity and higher radial variability (Figure \u003cspan refid=\"MOESM5\" class=\"InternalRef\"\u003eS5\u003c/span\u003e). Females, by contrast, exhibited more morphological homogeneity, with the majority (71.5%) belonging to a single dominant subtype. This asymmetry in within-sex variability is also reflected in the descriptive statistics reported in our previous study [\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e], where males showed higher standard deviations than females in most FM measurements (e.g., FM area SD: 121.5 mm\u0026sup2; vs 101.6 mm\u0026sup2;; perimeter SD: 8.15 mm vs 6.8 mm). The differential classification performance observed in the present study \u0026mdash; females more reliably classified (specificity 85.3%) than males (sensitivity 76.4%) \u0026mdash; is likely a consequence of this narrower distribution of female FM morphology providing a more distinct separation from the male distribution.\u003c/p\u003e \u003cdiv id=\"Sec38\" class=\"Section3\"\u003e \u003ch2\u003eMethodological contributions and broader applicability\u003c/h2\u003e \u003cp\u003eBeyond the specific findings regarding the FM, this study demonstrates the feasibility and value of applying a comprehensive, automated geometric morphometric pipeline to skeletal structures that lack well-defined anatomical landmarks. The FM is particularly challenging for traditional landmark-based morphometrics because its margin is a smooth, continuous curve without discrete, reproducible points that would qualify as Type I or Type II landmarks [\u003cspan citationid=\"CR58\" class=\"CitationRef\"\u003e58\u003c/span\u003e]. Our approach \u0026mdash; treating the entire contour as the unit of analysis through resampling, Fourier decomposition, polygon approximation, and density mapping \u0026mdash; circumvents this limitation entirely.\u003c/p\u003e \u003cp\u003eCaple et al. [\u003cspan citationid=\"CR59\" class=\"CitationRef\"\u003e59\u003c/span\u003e] highlighted the potential of EFA in forensic anthropology, noting that outline-based methods can capture a large amount of shape information in contrast to landmark approaches where information falling between landmarks fails to be acquired. Similarly, Kilmer and Garvin [\u003cspan citationid=\"CR57\" class=\"CitationRef\"\u003e57\u003c/span\u003e] demonstrated that EFA of skeletal outlines can support qualitative descriptions of sex differences with objective, quantitative data. Our work extends this paradigm by combining EFA with multiple additional geometric analysis methods, providing a more complete morphometric characterization than any single technique alone.\u003c/p\u003e \u003cp\u003eThe complete analysis pipeline \u0026mdash; from automated contour extraction through feature computation, statistical testing, classification, and subtype discovery \u0026mdash; is implemented as a single, self-contained, publicly available script. This design facilitates direct application to other skeletal structures where shape variation has traditionally been described only qualitatively, such as the obturator foramen [\u003cspan citationid=\"CR57\" class=\"CitationRef\"\u003e57\u003c/span\u003e, \u003cspan citationid=\"CR60\" class=\"CitationRef\"\u003e60\u003c/span\u003e] or piriform aperture. The objective quantification of shapes that are currently assessed visually (e.g., \"oval vs. triangular obturator foramen\") could enable more reproducible forensic assessments and contribute to a more precise anatomical vocabulary for morphological description.\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv id=\"Sec39\" class=\"Section2\"\u003e \u003ch2\u003eComparison with the source study\u003c/h2\u003e \u003cp\u003eOur dataset derives from the same CT image collection used in our previous study [\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e], which reported an accuracy of 86.7% using linear discriminant function analysis with leave-one-out cross-validation on the full 720-case sample, and 88.2% using artificial neural networks. The apparent difference from our best accuracy of 80.9% in the present study likely reflects sampling effects rather than methodological inferiority. First, 247 cases were excluded in the present study due to insufficient contour quality for automated edge detection, reducing the sample to 473. This exclusion was not random with respect to morphology: cases with lower image quality, partial volume artifacts, or less distinct FM margins were preferentially removed, potentially yielding a subsample that \u0026mdash; by chance \u0026mdash; captured a less dimorphic subset of the original population. Second, the present study employed 10-fold rather than leave-one-out cross-validation, which can yield slightly different accuracy estimates depending on fold composition. Despite this reduction in sample size, the present study achieved a comparable cross-validated accuracy (80.9%) while simultaneously demonstrating that shape features, when optimally selected, can match and even marginally exceed the performance of classical size-based measurements \u0026mdash; a finding that was not addressed in the original study.\u003c/p\u003e \u003cdiv id=\"Sec40\" class=\"Section3\"\u003e \u003ch2\u003eLimitations\u003c/h2\u003e \u003cp\u003eSeveral limitations should be acknowledged. First, the study sample was drawn from a single Eastern Turkish population, and the population-specificity of FM morphometry is well established [\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e, \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e]. As noted in our previous study [\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e], even within Turkey, discriminant formulas from western Turkish samples showed variable performance when applied to the eastern Turkish sample, reflecting potential ancestral differences between regions. The classification models and feature importance rankings derived here may therefore not generalize directly to other populations. Second, the contour extraction relied on 2D axial CT slices rather than 3D surface reconstructions, potentially missing three-dimensional shape information such as FM depth, curvature, and condylar morphology. Third, the automated edge detection required sufficient CT image quality, which led to the exclusion of 247 cases from the original dataset; this introduces a potential selection bias toward cases with clearer FM margins. Fourth, age-related variation was not explicitly modeled. Although our previous study found no significant correlation between FM measurements and age apart from a weak trend in perimeter [\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e], age effects on shape features remain unexplored. Finally, the heatmap and KDE methods employed fixed grid resolutions and kernel bandwidths that may not be optimal for all contour morphologies; adaptive or multi-scale approaches could potentially improve performance.\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e\n\u003ch3\u003eFuture directions\u003c/h3\u003e\n\u003cp\u003eThe computational framework developed here opens several avenues for future research. Application to skeletal structures with more pronounced qualitative shape dimorphism \u0026mdash; such as the greater sciatic notch, obturator foramen, or frontal sinus \u0026mdash; could test whether the polygon approximation and heatmap methods achieve higher discriminative accuracy when shape differences are more substantial. Multi-population studies using the same pipeline would clarify which FM shape features, if any, transcend population boundaries. Integration of 3D surface data, enabled by modern CT reconstruction capabilities, could capture additional morphological information inaccessible from 2D contours. Finally, the subtype discovery approach could be extended to investigate whether FM morphological variants correlate with other craniometric traits, functional anatomy, or developmental influences.\u003c/p\u003e"},{"header":"Conclusions","content":"\u003cp\u003eThis study demonstrates that the sexual dimorphism of the foramen magnum is predominantly mediated by overall size rather than by outline shape. Among 234 CT-derived morphometric features, all six conventional size measurements produced large, highly significant sex differences, whereas only 11.6% of pure shape features reached significance after correction for multiple testing. Perimeter emerged as the single strongest discriminator (Cohen\u0026rsquo;s d\u0026thinsp;=\u0026thinsp;1.284), and size features alone achieved 75.6% classification accuracy. Importantly, however, a forward-selected combination of seven shape features\u0026mdash;spanning ellipse fitting residuals, polygon approximation metrics, Fourier harmonic descriptors, and radial asymmetry indices\u0026mdash;surpassed size-based classification, reaching 80.9% under rigorous 10-fold cross-validation. This finding establishes that individually weak shape signals, when optimally combined, carry discriminative information that is not redundant with overall size.\u003c/p\u003e \u003cp\u003eFrom a methodological standpoint, this study replaces the subjective and non-reproducible categorical shape classification that has prevailed in the FM literature with a fully automated, deterministic pipeline that extracts objective and continuous shape descriptors from standard axial CT images. The seven complementary geometric analyses\u0026mdash;classical morphometry, Fitzgibbon and genetic-algorithm-optimized ellipse fitting, systematic polygon approximation, normalized elliptic Fourier transform, radial distance profiling, and Procrustes-based density heatmap construction\u0026mdash;collectively provide a multiscale characterization of the FM boundary that is reproducible across observers and institutions. The polygon approximation analysis, in particular, revealed that both sexes are best described by a hexagonal template, offering a data-driven, objective alternative to the inconsistent subjective shape taxonomies reported in the literature. The pipeline is distributed as a single open-source Python script with no proprietary software dependencies, facilitating independent replication and extension.\u003c/p\u003e \u003cp\u003eThe unsupervised clustering analysis further revealed a previously unreported asymmetry in within-sex morphological variation: male foramina magna display greater heterogeneity, with a distinct atypical subtype comprising approximately one quarter of male specimens, whereas female FM morphology is substantially more homogeneous. This observation suggests that the developmental and biomechanical constraints shaping the FM may act more uniformly in females, and it warrants further investigation across geographically diverse samples.\u003c/p\u003e \u003cp\u003eIn practical forensic and clinical terms, the foramen magnum can predict biological sex correctly in approximately four out of five individuals using the computational approach presented here. As 80.9% accuracy approaches the upper boundary reported in the conventional FM literature, these results suggest that further gains from two-dimensional FM morphometry alone may be limited. Future studies should validate these findings across multiple populations, extend the pipeline to three-dimensional surface data enabled by modern CT reconstruction capabilities, and apply the framework to anatomical structures where shape dimorphism is expected to be more pronounced, such as the greater sciatic notch and the pelvic inlet.\u003c/p\u003e"},{"header":"Abbreviations","content":"\u003cdiv class=\"DefinitionList\"\u003e \u003cdiv class=\"DefinitionListEntry\"\u003e \u003cdiv class=\"Term\"\u003eFM\u003c/div\u003e \u003cdiv class=\"Description\"\u003e \u003cp\u003eForamen magnum\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv class=\"DefinitionListEntry\"\u003e \u003cdiv class=\"Term\"\u003eCT\u003c/div\u003e \u003cdiv class=\"Description\"\u003e \u003cp\u003eComputed tomography\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv class=\"DefinitionListEntry\"\u003e \u003cdiv class=\"Term\"\u003eAP\u003c/div\u003e \u003cdiv class=\"Description\"\u003e \u003cp\u003eAnteroposterior\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv class=\"DefinitionListEntry\"\u003e \u003cdiv class=\"Term\"\u003eEFT\u003c/div\u003e \u003cdiv class=\"Description\"\u003e \u003cp\u003eElliptic Fourier transform\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv class=\"DefinitionListEntry\"\u003e \u003cdiv class=\"Term\"\u003eGA\u003c/div\u003e \u003cdiv class=\"Description\"\u003e \u003cp\u003eGenetic algorithm\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv class=\"DefinitionListEntry\"\u003e \u003cdiv class=\"Term\"\u003eLDA\u003c/div\u003e \u003cdiv class=\"Description\"\u003e \u003cp\u003eLinear discriminant analysis\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv class=\"DefinitionListEntry\"\u003e \u003cdiv class=\"Term\"\u003eSVM\u003c/div\u003e \u003cdiv class=\"Description\"\u003e \u003cp\u003eSupport vector machine\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv class=\"DefinitionListEntry\"\u003e \u003cdiv class=\"Term\"\u003eRF\u003c/div\u003e \u003cdiv class=\"Description\"\u003e \u003cp\u003eRandom forest\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv class=\"DefinitionListEntry\"\u003e \u003cdiv class=\"Term\"\u003eKDE\u003c/div\u003e \u003cdiv class=\"Description\"\u003e \u003cp\u003eKernel density estimation\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv class=\"DefinitionListEntry\"\u003e \u003cdiv class=\"Term\"\u003eGMM\u003c/div\u003e \u003cdiv class=\"Description\"\u003e \u003cp\u003eGaussian mixture model\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv class=\"DefinitionListEntry\"\u003e \u003cdiv class=\"Term\"\u003eBIC\u003c/div\u003e \u003cdiv class=\"Description\"\u003e \u003cp\u003eBayesian information criterion\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv class=\"DefinitionListEntry\"\u003e \u003cdiv class=\"Term\"\u003eRMSE\u003c/div\u003e \u003cdiv class=\"Description\"\u003e \u003cp\u003eRoot mean square error\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv class=\"DefinitionListEntry\"\u003e \u003cdiv class=\"Term\"\u003eCV\u003c/div\u003e \u003cdiv class=\"Description\"\u003e \u003cp\u003eCoefficient of variation\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv class=\"DefinitionListEntry\"\u003e \u003cdiv class=\"Term\"\u003eSD\u003c/div\u003e \u003cdiv class=\"Description\"\u003e \u003cp\u003eStandard deviation\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv class=\"DefinitionListEntry\"\u003e \u003cdiv class=\"Term\"\u003eSFS\u003c/div\u003e \u003cdiv class=\"Description\"\u003e \u003cp\u003eSequential forward selection\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003c/div\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eEthics approval and consent to participate\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe original study protocol received approval from the Non-Interventional Clinical Research Ethics Committee of Van Yuzuncu Yil University Faculty of Medicine (decision number: 02, dated: 19.02.2017) and was conducted in accordance with the Declaration of Helsinki and the principles of Good Clinical Practice. Since the present study represents a secondary analysis of the same anonymized imaging dataset using computational geometric methods, no additional ethical approval was required. Informed consent was waived due to the retrospective design and full anonymization of all data.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConsent for publication\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eNot applicable.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAvailability of data and materials\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe complete analysis pipeline is publicly available as a single, self-contained Python script (FM_Comprehensive_Analysis.py). Per-specimen morphometric measurements are provided in the supplementary materials (Table S1 in Additional file 1). The source CT images cannot be shared publicly due to institutional data protection regulations but are available from the corresponding author upon reasonable request.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eCompeting interests\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe authors declare that they have no competing interests.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eFunding\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThis research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAuthors\u0026rsquo; contributions\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eYE conceived the study, developed the computational pipeline and software, performed programming and data analyses, conducted the literature review, and wrote the original draft of the manuscript. EK contributed to image acquisition and selection, literature review, ethics committee procedures, and radiological image analysis. MA contributed to project organization, supervision, conceptualization, and critical revision of the final manuscript. YH contributed to supervision and conceptualization. SK contributed to data curation and statistical analysis. UD contributed to the literature review, preparation of radiological images for the study, and critical revision of the final manuscript. AY contributed to radiological image acquisition, development of analysis methodology, and preparation of images for final analysis. OC contributed to project organization, supervision, data analysis, conceptualization, and critical revision of the final manuscript. All authors read and approved the final manuscript.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAcknowledgements\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eNot applicable.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003eStandring S. Gray's Anatomy: The Anatomical Basis of Clinical Practice. 42nd ed. Amsterdam: Elsevier; 2020.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eTubbs RS, Griessenauer CJ, Loukas M, Shoja MM, Cohen-Gadol AA. Morphometric analysis of the foramen magnum: an anatomic study. Neurosurgery. 2010;67(5):1479\u0026ndash;84.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCatalina-Herrera CJ. Study of the anatomic metric values of the foramen magnum and its relation to sex. Acta Anat (Basel). 1987;130(4):344\u0026ndash;7.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMurshed KA, \u0026Ccedil;i\u0026ccedil;ekcibaşi AE, Tuncer I. Morphometric evaluation of the foramen magnum and variations in its shape: a study on computerized tomographic images of normal adults. Turk J Med Sci. 2003;33(5):301\u0026ndash;6.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGapert R, Black S, Last J. Sex determination from the foramen magnum: discriminant function analysis in an eighteenth and nineteenth century British sample. Int J Legal Med. 2009;123(1):25\u0026ndash;33.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMadadin M, Menezes RG, Al Saif HS, Abu Alola H, Al Muhanna A, Gullenpet AH, et al. Morphometric evaluation of the foramen magnum for sex determination: a study from Saudi Arabia. J Forensic Leg Med. 2017;46:66\u0026ndash;71.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAljarrah K, Packirisamy V, Al Anazi N, Nayak SB. Morphometric analysis of foramen magnum and occipital condyle using CT images for sex determination in a Saudi Arabian population. Morphologie. 2022;106(355):260\u0026ndash;8.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eToneva D, Nikolova S, Harizanov S, Georgiev I, Zlatareva D, Hadjidekov V, et al. Sex estimation by size and shape of foramen magnum based on CT imaging. Leg Med (Tokyo). 2018;35:50\u0026ndash;60.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAjanovic Z, Dervisevic L, Dervisevic E, Lujinovic A, Ajanovic U, Biscevic-Tokic J, et al. Sex estimation based on foramen magnum: a three-dimensional geometric morphometrics approach. Int J Morphol. 2023;41(2):410\u0026ndash;6.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eUthman AT, Al-Rawi NH, Al-Timimi JF. Evaluation of foramen magnum in gender determination using helical CT scanning. Dentomaxillofac Radiol. 2012;41(3):197\u0026ndash;202.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eJaitley M, Phulambrikar T, Kode M, Gupta A, Singh SK. Foramen magnum as a tool for sexual dimorphism: a cone beam computed tomography study. Indian J Dent Res. 2016;27(5):458\u0026ndash;62.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSingh PK, Tamrakar D, Paudel S, Timsinha S, Pokhrel RP, Shrestha R, et al. Morphometric analysis of the foramen magnum in sex estimation: an additional 3DCT study from Nepal on a larger sample. Anat Cell Biol. 2022;55(4):401\u0026ndash;11.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMehta M, Saini V, Patel MN, Menon SK. Applicability and reliability of foramen magnum for sex determination in contemporary Western Indian population: a computed tomographic study. J Forensic Radiol Imaging. 2019;17:13\u0026ndash;9.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAkay G, G\u0026uuml;ng\u0026ouml;r K, Peker İ. Morphometric analysis of the foramen magnum using cone beam computed tomography. Turk J Med Sci. 2017;47(6):1715\u0026ndash;22.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFernandes ALVC, Shetti A, Lagali-Jirge V, Keluskar V. Accuracy of sex estimation by morphometric evaluation of foramen magnum using computed tomography \u0026ndash; a systematic review and meta-analysis. Forensic Sci Med Pathol. 2024;20(1):268\u0026ndash;79.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMeral O, Toklu BB, Meydan R, Kaya A, Karadayı B, Acar T. Sex estimation from foramen magnum parameters in adult Turkish population: a computed tomography study. Leg Med (Tokyo). 2020;47:101775.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKartal E, Etli Y, Asirdizer M, Hekimoglu Y, Keskin S, Demir U, Yavuz A, Celbis O. Sex estimation using foramen magnum measurements, discriminant analyses and artificial neural networks on an Eastern Turkish population sample. Leg Med (Tokyo). 2022;59:102143.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eChethan P, Prakash KG, Murlimanju BV, Prashanth KU, Prabhu LV, Saralaya VV, et al. Morphological analysis and morphometry of the foramen magnum: an anatomical investigation. Turk Neurosurg. 2012;22(4):416\u0026ndash;9.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eArag\u0026atilde;o JA, de Oliveira Pereira R, da, Cruz de Moraes RZ, Reis FP. Morphological types and the dimorphism of the foramen magnum of dry skulls from northeastern Brazil. Int J Morphol. 2014;32(4):1372\u0026ndash;8.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGargi V, Yadav PK, Kumar Y. Foramen magnum as a tool for sexual dimorphism. Natl J Med Res. 2018;8:28\u0026ndash;31.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSingh G, Gargi V. Discriminant function analysis of foramen magnum variables in South Indian population: a study of computerised tomographic images. J Clin Diagn Res. 2018;12(5):HC01\u0026ndash;5.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eZdilla MJ, Russell ML, Bliss KN, Mangus KR, Koons AW. The size and shape of the foramen magnum in man. J Craniovertebr Junction Spine. 2017;8(3):205\u0026ndash;21.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSingh G, Talwar I. Morphometric analysis of foramen magnum for sex determination using CT scan. Indian J Forensic Med Toxicol. 2013;7(2):220\u0026ndash;3.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKuhl FP, Giardina CR. Elliptic Fourier features of a closed contour. Comput Graph Image Process. 1982;18(3):236\u0026ndash;58.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHaines AJ, Crampton JS. Improvements to the method of Fourier shape analysis as applied in morphometric studies. Palaeontology. 2000;43(4):765\u0026ndash;83.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSamuel OM, Bhagwat AA. Elliptical Fourier descriptors of outline and morphological analysis in caudal view of foramen magnum of the tropical raccoon (Procyon cancrivorus) (Linnaeus, 1758). J Morphol Sci. 2018;35(1):24\u0026ndash;32.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHayashizaki Y, Usui A, Hosokai Y, Sakai J, Funayama M. Sex determination of the pelvis using Fourier analysis of postmortem CT images. Forensic Sci Int. 2015;246:e1221\u0026ndash;9.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGonzalez PN, Bernal V, Perez SI. Geometric morphometric approach to sex estimation of human pelvis. Forensic Sci Int. 2009;189(1\u0026ndash;3):68\u0026ndash;74.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eEtli Y, Kartal E, Demir U, Etli E, Hekimoğlu Y, Aşirdizer M. Sex determination using image processing reveals elliptical features of the female pelvic inlet. Sci Rep. 2025;15:33800.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKrishan K, Chatterjee PM, Kanchan T, Kaur S, Baryah N, Singh RK. A review of sex estimation techniques during examination of skeletal remains in forensic anthropology casework. Forensic Sci Int. 2016;261:e1651\u0026ndash;8.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eTsiouris C, Triantafyllou G, Karangeli N, Botis GG, Papadopoulos-Manolarakis P, Kalamatianos T, et al. Morphometric assessment of occipital condyles and foramen magnum reveals enhanced sexual dimorphism detection via 3D imaging: a systematic review and meta-analysis utilizing classification and regression trees. Diagnostics (Basel). 2025;15(11):1359.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDe Boor C. A Practical Guide to Splines. Rev. ed. New York: Springer; 2001.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHarris CR, Millman KJ, van der Walt SJ, Gommers R, Virtanen P, Cournapeau D, et al. Array programming with NumPy. Nature. 2020;585(7825):357\u0026ndash;62.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eVirtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods. 2020;17(3):261\u0026ndash;72.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011;12:2825\u0026ndash;30.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHunter JD. Matplotlib: a 2D graphics environment. Comput Sci Eng. 2007;9(3):90\u0026ndash;5.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBraden B. The surveyor's area formula. Coll Math J. 1986;17(4):326\u0026ndash;37.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFitzgibbon A, Pilu M, Fisher RB. Direct least square fitting of ellipses. IEEE Trans Pattern Anal Mach Intell. 1999;21(5):476\u0026ndash;80.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eNelder JA, Mead R. A simplex method for function minimization. Comput J. 1965;7(4):308\u0026ndash;13.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKasa I. A circle fitting procedure and its error analysis. IEEE Trans Instrum Meas. 1976;25(1):8\u0026ndash;14.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLestrel PE. Fourier Descriptors and Their Applications in Biology. Cambridge: Cambridge University Press; 1997.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eRohlf FJ, Slice D. Extensions of the Procrustes method for the optimal superimposition of landmarks. Syst Biol. 1990;39(1):40\u0026ndash;59.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMann HB, Whitney DR. On a test of whether one of two random variables is stochastically larger than the other. Ann Math Stat. 1947;18(1):50\u0026ndash;60.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCohen J. Statistical Power Analysis for the Behavioral Sciences. 2nd ed. Hillsdale: Lawrence Erlbaum Associates; 1988.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFisher RA. The use of multiple measurements in taxonomic problems. Ann Eugen. 1936;7(2):179\u0026ndash;88.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20(3):273\u0026ndash;97.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBreiman L. Random forests. Mach Learn. 2001;45(1):5\u0026ndash;32.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWhitney AW. A direct method of nonparametric measurement selection. IEEE Trans Comput. 1971;C\u0026ndash;20(9):1100\u0026ndash;3.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMcLachlan GJ, Peel D. Finite Mixture Models. New York: Wiley; 2000.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eRousseeuw PJ. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math. 1987;20:53\u0026ndash;65.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSchwarz G. Estimating the dimension of a model. Ann Stat. 1978;6(2):461\u0026ndash;4.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMcKinney W. Data structures for statistical computing in Python. Proc 9th Python Sci Conf. 2010;445:51\u0026ndash;6.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSatopaa V, Albrecht J, Irwin D, Raghavan B. Finding a kneedle in a haystack: detecting knee points in system behavior. In: 2011 31st International Conference on Distributed Computing Systems Workshops. IEEE; 2011. pp. 166\u0026ndash;71.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBhattacharyya A. On a measure of divergence between two statistical populations defined by their probability distributions. Bull Calcutta Math Soc. 1943;35:99\u0026ndash;109.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAtreya A, Shrestha R, Bhandari K, Malla SK, Acharya S, Menezes RG. Morphometric analysis of the foramen magnum in sex estimation: an additional 3DCT study from Nepal on a larger sample. Health Sci Rep. 2023;6(1):e999.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eVinutha SP, Suresh V, Shubha R. Discriminant Function Analysis of Foramen Magnum Variables in South Indian Population: A Study of Computerised Tomographic Images. Anat Res Int. 2018;2018:2056291.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKilmer K, Garvin HM. Outline analysis of sex and population variation in greater sciatic notch and obturator foramen morphology with implications for sex estimation. Forensic Sci Int. 2020;314:110346.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBookstein FL. Landmark methods for forms without landmarks: morphometrics of group differences in outline shape. Med Image Anal. 1997;1(3):225\u0026ndash;43.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCaple J, Byrd J, Stephan CN. Elliptical Fourier analysis: fundamentals, applications, and value for forensic anthropology. Int J Legal Med. 2017;131(6):1675\u0026ndash;90.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBierry G, Le Minor JM, Schmittbuhl M. Oval in males and triangular in females? A quantitative evaluation of sexual dimorphism in the human obturator foramen. Am J Phys Anthropol. 2010;141(4):626\u0026ndash;31.\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"},{"header":"Tables","content":"\u003cp\u003eTables are available in the Supplementary Files section.\u003c/p\u003e\n"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"bmc-medical-imaging","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"bmim","sideBox":"Learn more about [BMC Medical Imaging](http://bmcmedimaging.biomedcentral.com/)","snPcode":"","submissionUrl":"https://www.editorialmanager.com/bmim/default.aspx","title":"BMC Medical Imaging","twitterHandle":"BMC_series","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"em","reportingPortfolio":"BMC Series","inReviewEnabled":true,"inReviewRevisionsEnabled":true},"keywords":"Foramen magnum, Sexual dimorphism, Geometric morphometrics, Elliptic Fourier transform, Polygon approximation, Computed tomography, Forensic anthropology, Sex estimation, Procrustes analysis, Machine learning","lastPublishedDoi":"10.21203/rs.3.rs-8928788/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-8928788/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003ch2\u003eBackground\u003c/h2\u003e \u003cp\u003eThe foramen magnum (FM) exhibits well-documented size-based sexual dimorphism, yet whether its shape independently differs between sexes remains unresolved. Previous attempts to characterize FM shape have relied on subjective visual classification into categorical types, producing inconsistent and irreproducible results. This study aimed to apply a comprehensive suite of computational geometric methods to CT-derived FM contours in order to objectively quantify shape variation and evaluate its discriminative capacity for sex estimation.\u003c/p\u003e\u003ch2\u003eMethods\u003c/h2\u003e \u003cp\u003eFM boundaries were automatically extracted from axial CT images of 473 adults (234 males, 239 females) from an Eastern Turkish population. Seven complementary geometric analyses\u0026mdash;classical morphometry, Fitzgibbon and genetic-algorithm-optimized ellipse fitting, systematic polygon approximation (3\u0026ndash;13 sides), normalized elliptic Fourier transform (20 harmonics), radial distance profiling, and Procrustes-based point-density heatmap construction\u0026mdash;yielded 231 quantitative features per specimen. Sex differences were assessed using Mann\u0026ndash;Whitney U tests with Bonferroni correction and Cohen\u0026rsquo;s d effect sizes. Classification performance was evaluated with linear discriminant analysis, support vector machines, and random forest under stratified 10-fold cross-validation. Within-sex morphological subtypes were explored using Gaussian mixture model clustering.\u003c/p\u003e\u003ch2\u003eResults\u003c/h2\u003e \u003cp\u003eOf 234 morphometric features, 92 (39.3%) showed significant sex differences after Bonferroni correction. Dimorphism was overwhelmingly size-driven: all six size measurements showed large effect sizes (Cohen\u0026rsquo;s d\u0026thinsp;=\u0026thinsp;1.13\u0026ndash;1.28, perimeter strongest at d\u0026thinsp;=\u0026thinsp;1.284), while only 17 of 146 pure shape features (11.6%) reached significance and the FM index showed no sex difference (d\u0026thinsp;=\u0026thinsp;0.088). The highest classification accuracy was 80.9% \u0026plusmn; 4.9% (sensitivity 76.4%, specificity 85.3%), achieved by forward-selected shape features with LDA, outperforming size-only classification (75.6%). A novel Procrustes heatmap-based method yielded near-chance accuracy (54.1%), confirming size dominance. Gaussian mixture model analysis revealed greater male morphological heterogeneity (25.2% atypical variants) compared to more homogeneous female FM morphology (71.5% single dominant subtype).\u003c/p\u003e\u003ch2\u003eConclusions\u003c/h2\u003e \u003cp\u003eFM sexual dimorphism is predominantly size-mediated, but computationally derived shape features provide incremental discriminative value that exceeds what size alone can achieve. This study introduces a fully automated, reproducible geometric morphometric pipeline that replaces subjective shape categorization with objective, continuous measurements. The framework is openly available and readily transferable to other skeletal structures amenable to CT-based contour analysis.\u003c/p\u003e","manuscriptTitle":"A comprehensive computational pipeline for foramen magnum shape analysis integrating elliptic Fourier transform, polygon approximation, and Procrustes heatmaps from CT images: application to forensic sex estimation","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-03-20 07:13:56","doi":"10.21203/rs.3.rs-8928788/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"editorInvitedReview","content":"","date":"2026-04-21T12:27:57+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-04-08T03:31:52+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-04-01T09:10:19+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-04-01T08:49:10+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"57023475745769703183585040916069454030","date":"2026-03-29T08:36:57+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"207341181263966604630374019918710207739","date":"2026-03-27T09:08:10+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"160850361723189363248226397655160553032","date":"2026-03-24T07:42:36+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"163651996306300557466163813919876266847","date":"2026-03-23T12:41:14+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2026-03-17T13:34:53+00:00","index":"","fulltext":""},{"type":"editorInvited","content":"","date":"2026-02-23T16:06:25+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2026-02-23T07:08:34+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2026-02-23T07:03:09+00:00","index":"","fulltext":""},{"type":"submitted","content":"BMC Medical Imaging","date":"2026-02-20T19:41:57+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"bmc-medical-imaging","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"bmim","sideBox":"Learn more about [BMC Medical Imaging](http://bmcmedimaging.biomedcentral.com/)","snPcode":"","submissionUrl":"https://www.editorialmanager.com/bmim/default.aspx","title":"BMC Medical Imaging","twitterHandle":"BMC_series","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"em","reportingPortfolio":"BMC Series","inReviewEnabled":true,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"99be69c9-3728-4ae6-8fb1-4145efb7a8cf","owner":[],"postedDate":"March 20th, 2026","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[],"tags":[],"updatedAt":"2026-03-20T07:13:57+00:00","versionOfRecord":[],"versionCreatedAt":"2026-03-20 07:13:56","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-8928788","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-8928788","identity":"rs-8928788","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.