Results
Overall, the mean CA125 values were 17.3 U/ mL in 815 NEC controls and 14.9 U/ mL in 473 EPIC controls after recalibration. The baseline characteristics of NEC and EPIC premenopausal women were similar except age at blood draw, race, age at menarche, OC use, current hormone use, infertility, parity, and tubal ligation were significantly different. ( Supplementary Table S1 ).
We recalibrated the CA125 values in EPIC using the recalibration model based on 187 NEC premenopausal controls with both CA125II and MSD assay measurements. These two measurements were highly correlated (r=0.96, 95%CI 0.94, 0.97). After recalibration, the measured and recalibrated CA125 values also showed high correlation (r=0.95, 95%CI 0.93, 0.96). The recalibration model showed high performance in general.
Age at blood draw was non-linearly associated with CA125, with women younger than 30 or more than 50 years old having significantly lower CA125 than those aged 30–39 years ( Table 1 ). In age-adjusted models, menstrual phase at blood draw was significantly associated with CA125 levels, with early follicular phase levels being 8 to 21% higher than in other menstrual phases. Endometriosis and fibroids were associated with significantly higher CA125 levels, with 21 and 13% difference, respectively, compared to those who did not have the condition. Current hormonal contraception use and tubal ligation were significantly associated with lower CA125 levels, with −16% and −11% difference, respectively. Cycle length, days with menstrual bleeding, dysmenorrhea, age at first live birth, age at last live birth, and years since last live birth were not significantly associated with CA125 levels in premenopausal women. Similar predictors were significantly associated with CA125 levels in the dichotomous model ( Supplementary Table S2 ).
The final linear CA125 prediction model included age at blood draw, race, tubal ligation, endometriosis, menstrual phase at blood draw, and fibroids, with an r-squared of 0.07 (95%CI 0.02, 0.09) ( Table 2 ). The association between individual predictors and CA125 were similar in univariate and multivariate adjusted models. The r-squared of this full linear model when conducting 5-fold cross-validation was 0.02. When we restricted the analysis to the 498 controls with complete information on all predictors and applied the final linear CA125 prediction model, the r-squared was 0.12 (95%CI 0.05, 0.15) ( Supplementary Table S3 ). When restricting to women with complete information on all predictors, the same variables were retained in the final model. For all the models, the delta r-squared, which subtracts the variance attributable to study phase and center from the total variance, was similar to the r-squared reported above. When evaluating the final continuous model in multiple imputed datasets in NEC, the beta coefficients, standard errors, and the r-squared were similar to the original model ( Supplementary Table S3 ). The small differences in the measures of association when running the final model in the dataset using missing indicators, dataset restricted to those with complete information on all potential predictors, and multiple imputed datasets suggest that the missingness of menstrual phase at blood draw do not largely influence the results. We also observed similar performance of the model when including all significant predictors in the univariate analyses, suggesting that the final model included important key predictors. Predicted log-transformed CA125 calculated based on the final model and the observed log-transformed CA125 were weakly correlated with a Pearson correlation coefficient of 0.26 (95%CI 0.19, 0.33) ( Figure 2A ).
For external validation, we developed an abridged linear CA125 prediction model which included age at blood draw, race, and menstrual phase at blood draw with r-squared of 0.05 (95%CI 0.01, 0.07) in NEC ( Table 2 ). Using the measures of association from this abridged model, we calculated the predicted log-transformed CA125 values in EPIC. The predicted log-transformed CA125 values had a similar correlation with the observed log-transformed CA125 values in EPIC (r=0.22 (95%CI 0.13, 0.31)) as in the NEC abridged linear model (r=0.22 (95%CI 0.15, 0.29)) ( Figure 2B , 2C ). The spread of the predicted CA125 values in Figure 2 are much smaller than the spread of the observed CA125 values because the linear prediction model only explains a small proportion of the total variance of the observed CA125 values.
The final dichotomous prediction model to predict women with CA125 ≥ 35 U/ mL included age at blood draw, tubal ligation, endometriosis, prior personal cancer diagnosis, family history of ovarian cancer, number of miscarriages, menstrual phase at blood draw, and smoking status and duration with an AUC of 0.83 (95%CI 0.77, 0.89) ( Table 3 , Figure 3 ). For menstrual phase at blood draw, we collapsed the other phase and irregular menstruation categories because few individuals had CA125 ≥ 35 U/mL in these groups. The association between individual predictors and CA125 were similar in univariate and multivariate adjusted models. The AUC of this full dichotomous model when conducting 5-fold cross-validation was 0.67. When we restricted the analysis to the 498 controls with complete information on all predictors and applied the final dichotomous model, the AUC was 0.84 (95%CI 0.76, 0.93) ( Supplementary Table S4 ). When we conducted variable selection process using stepwise regression among women with complete information on all predictors, similar predictors were retained except number of miscarriages and smoking status, resulting with an AUC of 0.79 (95%CI 0.69, 0.89). When evaluating the model in the multiple imputed datasets in NEC, the odds ratios and the AUC were largely similar to the primary analysis ( Supplementary Table S4 ). We also observed a similar performance of the model when including all significant predictors from the univariate analyses, suggesting that the final model included important key predictors. We also considered using 65 U/mL cutoff which has been proposed for premenopausal women( 26 ), but were limited with five controls who had CA125 greater than 65 U/mL so were not able to investigate further.
For external validation, we developed an abridged model, which included age at blood draw, number of miscarriages, menstrual phase (collapsing those on hormones, blood draw at other phase, and having irregular menstruation due to power), and smoking status with an AUC of 0.73 (95%CI 0.65, 0.81) in NEC ( Table 3 , Figure3 ). When we applied this model to EPIC using recalibrated CA125 value of 35 U/ mL as cutoff, the AUC was 0.78 (95%CI 0.67, 0.89) ( Figure 3 ).
Materials
The New England Case Control Study (NEC) is a population-based case-control study of ovarian cancer with 2,100 population-based controls enrolled from New Hampshire and eastern Massachusetts over the three study phases (1992–97, 1998–2002, 2003–2008). Details on the study design have been described previously( 12 – 14 ). Briefly, controls were identified using random-digit dialing, town book selection, and drivers’ license lists and frequency matched on age and state of residence. Approximately half (54%) of the eligible controls that were contacted agreed to participate. We restricted the study population to controls (n=2,100) and excluded women without CA125 measurements (n=96), women postmenopausal at time of blood draw (n=1,130), women who had hysterectomy due to unknown menopausal status (n=30), women who were pregnant or breastfeeding at time of blood draw (n=25), and women with extreme CA125 values ranging from 115.3U/ mL to 411.7 U/mL (n=4) identified based on the generalized extreme studentized deviated many-outlier detection approach applied to log-transformed values( 15 ). In sum, our analysis included 815 premenopausal NEC controls.
The European Prospective Investigation into Cancer and Nutrition (EPIC) study is a multicenter prospective cohort including participants from ten Western European countries developed to evaluate the association between nutrition and cancer. Briefly, 519,978 participants (366,521 women) were enrolled between 1991 and 1998 across 23 research centers. Details on the study design have been described previously( 16 ). A nested case-control study of ovarian cancer was designed within the cohort( 17 ). For each ovarian cancer case, up to four controls were randomly selected using incidence density sampling for a total of 1,939 controls( 17 ). We excluded women without CA125 measurements (n=12), women who were either postmenopausal (n=1,416), or had a hysterectomy or unknown menopausal status (n=38). There were no outlying values in these EPIC controls. In sum, our analysis included a total of 473 premenopausal EPIC controls.
In NEC controls we measured CA125 using the CA125II radioimmmunoassay (Centocor, Malvern, PA) at the CER Lab at Boston Children’s Hospitals. We assessed the reproducibility of the assay by including five blinded aliquots of a uniform quality control pool in each of the 46 assay batches. The average of the coefficients of variation (CV) was 1%. In EPIC controls and in a subset of NEC controls, we previously measured CA125 using the volume-effective highly sensitive multiplex platform (Meso Scale Discovery (MSD), Gaithersburg, MD) in the Genital Tract Biology Laboratory at Brigham and Women’s Hospital( 17 ). The average CV across the assay batches was 19%.
We selected factors that have been previously reported to be associated with CA125 in at least one prior study( 7 – 9 , 11 ), ovarian cancer risk factors( 18 ), as well as several factors which were biologically plausible to be associated with CA125( 10 ). Those included age at blood draw, race, body mass index (BMI, kg/m 2 ), smoking status (never, former, current), pack-years calculated by number of packs of cigarettes per day multiplied by the number of years a person had smoked, age at menarche, oral contraceptive use and its duration (months), parity, self-reported endometriosis, tubal ligation, family history of ovarian cancer, prior personal cancer diagnosis, caffeine intake (mg), genital powder use, infertility, number of miscarriages, ectopic pregnancy, ever use of intrauterine device (IUD), fibroids, menstrual cycle regularity and days between last menstrual period (LMP) and blood draw( 7 – 11 ). Furthermore, we evaluated additional variables related to menstrual characteristics and pregnancy timing: cycle length, days with menstrual bleeding, dysmenorrhea, age at first live birth, age at last live birth, and years since last live birth.
We log-transformed CA125 values to achieve a normal distribution. With this transformation, the distribution of log-transformed CA125 was normally distributed with skewness of 0.35 and kurtosis of 0.34, with a bell-shaped histogram.
Since the EPIC samples had CA125 measured using an alternate assay (MSD assay) with a different scale, we used recalibration to rescale these measurement results to be comparable to the CA125II assay values. We recalibrated the EPIC CA125 values based on 187 NEC premenopausal controls with CA125 measurements on both CA125II and MSD assays using the drift correction method( 19 ). We regressed the log-transformed MSD assay values to the log-transformed CA125II assay values and used the intercept and effect estimates of the model to calculate the recalibrated CA125II assay values based on the measured MSD assay values for all premenopausal controls in EPIC and used the recalibrated values in our analyses.
First, we evaluated the association between individual candidate predictors and CA125 using linear or logistic regression adjusted for continuous age. We used effect estimates of the linear regression for each predictor to calculate the percent change in CA125 levels, calculated as [exp (beta) - 1] x100 for a 1-unit change in the predictor. We determined the optimal modeling of continuous variables (age, BMI, age at menarche, duration of OC use, parity, and smoking pack-years) using restricted cubic splines to test for linearity( 20 ). We used categorical variables for age, dichotomous variable for age at menarche, and piecewise linear spline with single knot for BMI since these variables were non-linearly associated with log-transformed CA125. We created composite categorical variables for OC use and duration and smoking status and pack-years, and compared nested models using likelihood ratio test and non-nested models using the Akaike information criterion and Vuong test( 21 ). Based on these evaluations, candidate predictors were modeled as follows: age at blood draw (categorical, by 10 year intervals from age 30), race (white, non-white), BMI (piecewise linear spline model with single knot at 27), height (continuous, centered at 165), smoking status(categorical, never, former, current) and pack-years (continuous, never smokers, pack-years among former smokers, pack-years among current smokers), age at menarche (age 12 and under, above 12), duration of OC use (continuous, including never users), parity (continuous), endometriosis (no, yes), tubal ligation (no, yes), family history of ovarian cancer (no, yes), prior personal cancer diagnosis (no, yes), caffeine intake (quartiles), genital powder use (no use, body use, genital use), infertility (no, yes), number of miscarriages (0, 1, 2, 3 or more), ectopic pregnancy (no, yes), intrauterine device use (never, ever), fibroids (no, yes), menstrual cycle regularity (regular, irregular) and predicted phase of the menstrual cycle (early follicular, late follicular, peri-ovulatory, luteal, long cycle, irregular, missing) based on the number of days between the last menstrual period and blood draw.
Overall, we developed CA125 prediction models (linear and dichotomous) in NEC and conducted external validation in EPIC ( Figure 1 ). We used cross-validation to conduct internal validation of the model developed in NEC. Since information on some of the predictors were not collected in EPIC, we developed an abridged model restricted to variables available in EPIC from the final model, and then validated the abridged model in EPIC.
First, we developed a linear CA125 prediction model of log-transformed CA125 in NEC. We used stepwise linear regression analysis using p<0.15 as significance level for entry and retention in the model. In our primary prediction modeling, we used missing indicators for menstrual phase at blood draw due to a proportion of missing values. For variables with a limited number of missing values (age at menarche (n=1), caffeine intake (n=23), menstrual cycle length (n=23)), women with missing values were excluded. Age was forced in the model and the r-squared was calculated for the final prediction model, adjusted for study phase (1992–1997, 1998–2003, 2003–2008) and center (Massachusetts, New Hampshire). In addition, we calculated a delta r-squared that excluded the variability explained by study phase and center as these were matching factors in NEC and were forced into the model( 22 ). The predicted log-transformed CA125 values in NEC were calculated using the effect estimates from the final prediction model. We evaluated the performance of the model by calculating the Pearson correlation coefficient to assess how well the predicted and the observed CA125 values agreed (i.e. calibration). We used 5-fold cross-validation to assess for overfitting in NEC and calculated the average r-squared across all sampled datasets( 23 ). To evaluate potential bias due to missing data of candidate predictors, we conducted a sensitivity analysis restricted to women who had no missing predictors. We also conducted multiple imputation by chained equations (MICE) to impute the missing variables conditional on all of the predictors and outcome( 24 ). We allowed 100 iterations and generated 20 imputed datasets. We applied the final prediction model in the 20 imputed datasets using the methods described and pooled the results of the model estimates using the Rubin’s rules( 25 ).
For external validation, we sought to validate our linear CA125 prediction model in EPIC. However, some of our key predictors (endometriosis and fibroids) were not collected in EPIC or were missing in majority of women (tubal ligation), thus, we validated an abridged model restricted to variables available in EPIC. First, among the predictors selected in the final model developed in NEC, we identified predictors available both in NEC and EPIC. We next ran the abridged model in NEC restricting to those variables available in both NEC and EPIC. We used the effect estimates from this model to calculate the predicted value of log-transformed CA125 in the EPIC samples. We calculated the Pearson correlation coefficient between the predicted and the observed log-transformed CA125 to assess agreement and compare to that in the discovery dataset. We plotted the predicted versus the observed log-transformed CA125 for visual assessment.
Next, we developed and validated a dichotomous prediction model of elevated CA125 defined as having CA125 ≥ 35 U/ mL following the same method used for developing the linear CA125 prediction model described above but using logistic stepwise regression analysis. We evaluated the performance of the model by calculating the area under the curve (AUC) in the NEC (discovery) and EPIC (validation).
All statistical analyses were performed using SAS version 9.4 (SAS Institute Inc., Cary, NC), STATA version 12.1 (StataCorp, College Station, TX), and R version 3.4.3.
Discussion
This is the largest population-based study to develop and validate CA125 prediction models among healthy premenopausal women considering both continuous levels as well as those over current clinical threshold of 35 U/ mL. Although, the model predicting continuous CA125 only explained a small percent of the total variability, the model did show comparable correlations between predicted and observed levels in EPIC, suggesting the validity of the model. Conversely, the AUC for predicting elevated CA125 (≥ 35 U/ mL) was relatively high in NEC and validated in EPIC.
Age was non-linearly associated with CA125 in our study, which is consistent with our prior study in EPIC in which we observed an inverse U-shaped association between age and CA125 levels among premenopausal women( 9 ). Similarly, non-white race was associated with significantly lower CA125, which was consistent with prior studies in postmenopausal women( 7 , 8 ), suggesting the need for different thresholds for minority populations. Unfortunately, we were underpowered to evaluate differences in prediction models between racial subgroups, though others have described differences in CA125 levels between Black and Asian women( 7 , 8 ).
Factors related to menstruation were strongly related to CA125. Specifically, an early follicular phase blood draw was significantly associated with higher CA125 levels and strong predictor of CA125 in our final model, which was consistent with previous reports( 27 ). This association is likely driven by MUC16 expression on the endometrium and endometrial shedding during early follicular phase which may lead to higher circulating CA125 levels( 10 ). This could explain the increased CA125 levels in women with fibroids, since fibroids are known to increase menstrual bleeding( 28 ). In contrast, MUC16 expression on the endometrium may explain lower CA125 levels among women with a tubal ligation as this procedure would prevent retrograde menstruation, which occurs in approximately 85% of women during menstruation( 29 ), leading to systemic exposure to the antigen. Factors related to infertility, particularly endometriosis, were also related to substantially higher CA125 levels, consistent with prior studies( 30 , 31 ). A similar mechanism is likely responsible as endometriosis leads to ectopic endometrial tissue usually in the peritoneal cavity.
Our linear CA125 prediction model explained 7% of the variability in CA125 but showed moderate validation in EPIC, whereas our dichotomous CA125 prediction model had better predictive ability with good validation. These results suggest that the variability of CA125 may be small in general but change dramatically by certain factors such as menstrual phase and endometriosis, and therefore the dichotomous prediction model performed better. We decided to use a standard log-linear model for developing the linear CA125 prediction model because the distribution of log-transformed CA125 was normally distributed with low kurtosis and skewness. When we included all significant predictors in the univariate analyses, both linear and dichotomous models showed similar performance compared to our final model having fewer predictors, suggesting some predictors were correlated.
Interestingly, some factors, such as fibroids and race were only significantly associated with continuous CA125 and some factors, such as prior personal cancer diagnosis and family history of ovarian cancer were only significantly associated with elevated CA125 (above 35 U/ mL). We suspect more predictors were selected in the final dichotomous CA125 prediction model because the association between exposures and CA125 were non-linear.
The major strength of our study is that we had two large independent population-based studies with detailed information on candidate predictors of CA125 to develop and validate CA125 prediction models among premenopausal women. However, there are several limitations to our study. First, we had missing data on several variables. While we used missing indicators for our main analysis, our sensitivity analyses restricting to those with complete information on all predictors and using multiple imputation showed similar results, suggesting that the method for handling missing data did not influence overall results. In addition, we evaluated the performance of our prediction models using cross-validation and conducting external validation in an independent dataset, in which all the results were similar, suggesting a parsimonious model. Secondly, we were not able to validate the full prediction models in the independent dataset. Although we were only able to validate an abridged model in EPIC lacking tubal ligation and endometriosis, we expect the model performance to be better and closer to what we would have observed in NEC if we had information on all predictors. Thirdly, our model could be missing unknown predictors of CA125 since we restricted the candidate predictors to those previously described, which were mostly conducted among postmenopausal women. The relatively low r-squared of the final linear CA125 prediction model suggest that other candidate predictors may exist, such as genetic factors, common medications, and dietary factors, opening new opportunities for future studies. While hysterectomy has been previously described as a predictor of CA125, only few participants in NEC had hysterectomy. Given their ambiguous menopausal status we excluded them from current analysis of premenopausal women. Lastly, the model performance in EPIC could be underestimated because NEC and EPIC used different assays to measure CA125. However, the CA125 values of the two assays were highly correlated (r=0.96) and the predicted CA125 values calculated using the recalibration model were also very highly correlated with the observed CA125II assay values (r =0.95).
In summary, we developed and validated CA125 prediction models among premenopausal women in two independent studies that further our understanding of factors that influence CA125 levels and can therefore be used to optimize ovarian cancer screening with CA125. While performance of population-level screening for ovarian cancer in premenopausal women may be limited due to the lower incidence of ovarian cancer in this age range, approximately 30% of ovarian cancers are diagnosed before age 55. Furthermore, the impact of ovarian cancer in younger women results in potentially greater social, emotional, and economic impact. Further studies are needed to identify new predictors of CA125 to improve the model and to understand the predictors of changes in CA125 over time based on personal characteristics.
Introduction
Ovarian cancer is the eighth leading cause of cancer death in 2012 with 151,900 deaths worldwide due, in part, to lack of specific symptoms leading to diagnosis at late stage when prognosis is poor( 1 , 2 ). More than 80% of ovarian cancer patients have elevated cancer antigen 125 (CA125), a membrane bound glycosylated mucin (MUC16), which is used clinically as a prognostic biomarker and to monitor response to therapy( 3 , 4 ). However, results from two large randomized screening trials in primarily postmenopausal women using transvaginal ultrasound and CA125 (either using 35 U/ml as a cutoff, or the risk of ovarian cancer algorithm (ROCA)) showed no clinically significant benefit( 5 , 6 ). MUC16 is expressed on a variety of tissues, including the lung, pancreas, stomach, liver, endometrium, and breast, and levels vary between individuals based on demographic, reproductive and lifestyle characteristics( 7 – 11 ). Therefore, identifying personal characteristics that are associated with CA125 levels could be used to create personalized thresholds for CA125 instead of a single 35 U/mL cutoff, thereby improving the interpretation of measured CA125 and its performance as a screening biomarker and ultimately leading to decreased ovarian cancer mortality.
However, prior studies examining factors associated with CA125 have focused on postmenopausal women( 7 , 8 ). Thus, we evaluated factors associated with CA125 in premenopausal women and developed and validated CA125 prediction models (linear and dichotomous) among premenopausal women without ovarian cancer from the New England Case-Control Study and the European Prospective Investigation into Nutrition and Cancer study.
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.