Results
During the review period, we identified 491,048 women with a diagnosis of endometriosis across the CCAE (N=236,594), MDCD (N=70,426), Optum ® EHR (N=181,743), and CUIMC EHR (N=2,285) datasets ( Figure S1 ). Across the four databases, participants ranged in age from 15–19 years old in 2–3% of cases, 20–24 years in 5–9%, 25–29 years in 10–14%, 30–34 years in 15–19%, 35–39 years in 20–22%, 40–44 years in 20–25%, and 45–49 years old in 14–23% of cases ( Table 1 , Tables S1 – S3 ). Women were identified as 1% Asian, 7–8% Black or African American, and 37–40% white depending on the data source; 52–55% were of unknown race and 3% were of Hispanic or Latina ethnicity. The CCAE dataset did not provide race or ethnicity data.
In total, phenotyping approaches identified 278,971 women with endometriosis in Cohort A, 157,959 in Cohort B, 363,049 in Cohort C, 373,201 in Cohort D, and 377,036 in Cohort E ( Figure S1 ). A sizable proportion of women (15–20% of endometriosis cases) were identified across all 5 cohorts regardless of phenotype definition ( Figure 1 , Table S4 ). More than one-fourth (26–30%) were represented in symptom-only Cohorts C-E but not in Cohorts A and B ( Table S4 ). We also note that 2–4.6% of cases (14,795 women total) showed only pelvic and/or abdominal pain but not other symptoms noted in clinical guidelines ( Table S4 ). Largely, our findings are consistent across each dataset. In the interest of brevity, remaining findings presented in the main text are from our largest EHR dataset (Optum ® EHR). Additional tables and figures, including those based on data from other sources, are provided as Supplemental Materials ( Tables S1 – S11 , Figures S2 – S22 ).
Women with a diagnosis of endometriosis based on surgical confirmation (Cohort A) were older at time of diagnosis (mean age = 38 years, SD = 8) compared to women diagnosed based on imaging and symptoms (Cohort B mean age = 35, SD = 9; p<0.001, Figure 2b ). We found a similar trend when comparing age at diagnosis between Cohort A and Cohorts C-E (mean age = 36, average SD = 8) with a mean age difference of 2 years (p<0.001 for all comparisons). This pattern was consistent across all four data sources ( Figure 2 , Figures S5 – S7 ).
Among women identified with endometriosis, we also compared their date of entry into Cohort A with date of entry into Cohorts B-E ( Figure 3 , Figures S8 – S10 ). For most women, there was little to no difference in date of cohort entry between Cohort A and Cohort B (median difference = 0 days; IQR = 0–45 days). Similarly, we found almost no difference between entry into Cohort A and Cohorts C-E (median difference = 0 days for all, IQR = 0–15 days for Cohort C, 0–14 days for Cohort D, and 0–13 days for Cohort E).
The top 25 conditions with the greatest prevalence difference between Cohort A and Cohort B and between Cohort A and Cohort E are shown in Figure 4a – b (see Figures S11 – S14 for comparisons between Cohort A and Cohorts C and D). The most common symptom experienced by women prior to receipt of an endometriosis diagnosis was pain. Pain was reported among 91% of women in Cohort A and 100% of women in Cohorts B-E. Among all five cohorts, localized pain was most frequently reported as abdominal pain and pain in the pelvis with prevalence ranging from 69% to 70% of women in Cohort A compared to 83% to 90% in Cohorts B-E.
Genitourinary pain (43% vs 68–69%), pain in female genitalia (43% vs 67–68%), and pain of uterus (42% vs 67–68%) were also more commonly reported among patients identified based on imaging and/or symptom presentation (Cohorts B-E). We also found that urinary tract pain was documented more than twice as much among women in Cohorts B-E (36–45% of women) as Cohort A (18%). Condition codes for infectious disease of genitourinary system (30% in Cohort A, 49% in Cohort B, 39–41% in Cohorts C-E) and urinary tract infectious disease (22% in Cohort A, 39% in Cohort B, 31–32% in Cohorts C-E) were also more frequently reported in cohorts based on imaging and/or symptoms.
Similarly, we found substantial documentation of dysuria (17% in Cohort A, 37%–44% in Cohorts B through E) and dysmenorrhea (42% of patients in Cohort A, 67%–68% in Cohorts B-E) in these cohorts. Dyspareunia was recorded in nearly one-sixth to one-fourth of cases (16% in Cohort A, 24–27% in Cohorts B-E) and fatigue in one-sixth to one-fifth of cases (15% in Cohort A, 18–22% in Cohorts B-E). The non-specific diagnosis of acute inflammatory disease was also more common among these cohorts (42% vs 49–57%) as were conditions not traditionally associated with endometriosis such as inflammatory disorder of the respiratory system (46% vs 53–61%) and acute respiratory disease (41% vs 48–56%).
In contrast, we found more frequent documentation of lesions, masses, and neoplasms in Cohort A compared to Cohorts B-E. For example, lesions of the genitalia (prevalence range: 73–90%), ovary (46–59%), and uterus (50–68%), abdominal mass (61–78%), mass of urogenital structure (61–78%), pelvic mass (60–77%), and mass of uterus (33–49%) were more commonly identified among women diagnosed by surgical confirmation (Cohort A) than the other phenotype definitions (Cohorts B-E). We also noted significant differences between neoplasms of the abdomen (33–49%), female genital organs (31–47%), intra-abdominal organs (32–49%), uterus (29–44%), and pelvis (31–48%). Reporting of peritoneal adhesions was twice as common in Cohort A (32%) compared to Cohorts B-E (16–18%).
The top 25 medications with the greatest prevalence difference between Cohort A and Cohorts B through E are shown in Figure 5 and Figures S15 – S18 . Among treatments recommended by the ESHRE guidelines for endometriosis-related symptoms, women in Cohorts B-E were significantly more likely to receive hormones and related agents (44% in Cohort A, 61% in Cohort B, 50–51% in Cohorts C-E), including hormonal contraceptives for systemic use (50% in Cohort A, 67% in Cohort B, 56–57% in Cohorts C-E), compared to women in Cohort A. We also found that opioids had high levels of utilization among all cohorts but were most common among women in Cohort A (96% in Cohort A, 82% in Cohort B, 77–78% in Cohorts C-E). The use of antiemetics and antinauseants (71–93%), antipruritics (68–88%), antacids (62–89%), and corticosteroids (70–87%) were also high overall and highest in Cohort A. Antidiarrheal microorganisms (67–90%), other antidiarrheals (57–87%), and urologicals (62–88%) were also most prevalent among women in Cohort A.
Notably, routine gynecologic examination was only recorded in slightly more than one-third of patients (34% in Cohort A, 44% in Cohort B, and 35–36% in Cohorts C-E). The top procedures with the greatest prevalence difference between cohorts are illustrated in Figure 6 and Figures S19 – S22 . We found that women with a diagnosis of endometriosis based on surgical confirmation (Cohort A) were more likely to have received laparoscopy with fulguration or excision of lesions of the lower abdomen (38% vs 15–21%), laparoscopy with total hysterectomy for uterus 250g or less (30% vs 7–9%), and surgical hysteroscopy with biopsy of endometrium (12% vs 7–8%). We also note that surgical pathology with microscopic evaluation was performed more than twice as often in Cohort A (48% in Cohort A, 16% in Cohort B, and 21% in Cohorts C-E).
Transvaginal ultrasound was performed the most among procedures conducted, especially among patients in Cohort B (60% in Cohort A, 84% in Cohort B, and 53–55% in Cohorts C-E). Non-obstetric pelvic ultrasound was conducted for slightly more than one-half of patients in Cohort B but only roughly one-third in other cohorts (39% in Cohort A, 56% in Cohort B, and 37% in Cohorts C-E). We found that abdominal ultrasound was performed infrequently though most often among patients in Cohort B (5% in Cohort A, 9% in Cohort B, 6% in Cohorts C-E). Therapeutic, prophylactic, or diagnostic intravenous injection was also more common in Cohort B than all other cohorts (40% vs 31–32%).
Before receiving an endometriosis diagnosis, nearly all participants in each cohort had at least one outpatient visit ( Tables S8 – S11 ). Compared to women diagnosed with endometriosis based on surgical confirmation, more women diagnosed based on imaging and symptom presentation had emergency room (ER) visits (41% of women in Cohort B vs 30% in Cohort A, p<0.001). They also used the ER with greater frequency prior to diagnosis (mean number of ER visits = 1.4, SD = 4.3 in Cohort B vs mean of 0.8 ER visits, SD=3.2 in Cohort A; p<0.001). While nearly half of women (45%) in Cohort A had at least one hospitalization prior to diagnosis (mean of 0.6 visits, SD=1.1), this was the case for only one-fourth (26%) in Cohort B (mean of 0.4 visits, SD=1.8). Similar trends held when comparing Cohorts C-E to Cohort A and across all datasets. Notably, women in the MDCD dataset had consistently higher levels of healthcare utilization overall.
Materials
We performed a retrospective, observational cohort study of women in the United States (US) aged 15–49 years with a diagnosis date of endometriosis between January 1, 2013 and December 31, 2023. This study was approved by the institutional review board of Columbia University Irving Medical Center (Protocol AAAO7805) and reporting follows the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guideline.
Our data sources included US insurance claims from the Merative ™ MarketScan ® Commercial Database (CCAE) and Merative ™ MarketScan ® Multi-State Medicaid (MDCD) databases as well as administrative and electronic health record (EHR) data from Optum ® de-identified EHR (Optum ® EHR) and Columbia University Irving Medical Center (CUIMC) databases. To standardize the data, each de-identified dataset was transformed to the Observational Health Data Sciences and Informatics (OHDSI) Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). 10 , 11
The CCAE dataset includes health insurance claims across the care continuum (i.e., inpatient, outpatient, outpatient pharmacy, and carve-out behavioral healthcare) as well as enrollment data from large employers and health plans across the US who provide private healthcare coverage for more than 170 million employees, their spouses, and dependents. This administrative claims database includes a variety of fee-for-service, preferred provider organizations, and capitated health plans.
The MDCD dataset reflects the healthcare service use of more than 35 million individuals covered by Medicaid programs in numerous geographically dispersed states. The database contains the pooled healthcare experience of Medicaid enrollees, covered under fee-for-service and managed care plans. It includes records of inpatient services, inpatient admissions, outpatient services, and prescription drug claims, as well as information on long-term care.
The Optum ® EHR dataset contains clinical, claims, and other administrative data on more than 111 million patients from more than 111,000 sites of care spanning dozens of healthcare provider organizations across the US. Data are obtained from inpatient and ambulatory EHRs, practice management systems, and other internal systems. Data elements include demographics, prescribed and administered medications, immunizations, allergies, laboratory results, vital signs and other observable measurements, clinical and inpatient stay administrative data and coded diagnoses and procedures.
The CUIMC EHR dataset comprises 6.7 million patients with data collection starting in 1985. CUIMC is a quaternary care center in the northeast U.S. with primary care practices in northern Manhattan and surrounding areas. The CUIMC database currently holds information about patient demographics, conditions, symptoms and other observations, laboratory and vital sign measurements, medications, medical devices, and both inpatient and outpatient encounters.
Based on a combination of coded diagnoses, observations, procedures, and measurements, we constructed five different cohorts — (i) women who received a diagnosis of endometriosis based on surgical confirmation (Cohort A); (ii) women who received a diagnosis of endometriosis based on a combination of diagnostic imaging and presentation of guideline-recognized endometriosis symptoms (Cohort B); (iii) women who received a diagnosis of endometriosis and presented with guideline-recognized endometriosis symptoms, regardless of whether diagnostic imaging was performed (Cohort C); (iv) women who received a diagnosis of endometriosis and presented with guideline-recognized endometriosis symptoms and/or pelvic pain (Cohort D); and (v) women who received a diagnosis of endometriosis and presented with guideline-recognized endometriosis symptoms, pelvic pain, and/or abdominal pain (Cohort E). For each cohort, we restricted our analysis to participants with at least one year of continuous observation prior to cohort entry to ensure we capture incident events. Each cohort definition was translated into a phenotype, 12 a computable algorithm incorporating temporal logic, to query the CCAE, MDCD, Optum ® EHR and CUIMC databases.
For Cohort A, women with an endometriosis-related surgical procedure (e.g., laparoscopic surgery), we required an accompanying endometriosis diagnosis within 30 days of the procedure. For this group, aligning with previous guidelines, the date of cohort entry was the date of surgery.
Based on the updated diagnosis guidelines, our definition for Cohort B, women with an endometriosis diagnosis based on imaging required the occurrence of (i) at least one medical imaging code (e.g., transvaginal ultrasound); (ii) at least two endometriosis diagnosis codes on the date of or following imaging; (iii) at least one code for dysmenorrhea, dyspareunia, or dysuria prior to diagnosis; and (iv) at least one other code for an endometriosis-related symptom prior to diagnosis, including additional symptoms such as constipation and cough to account for episodes of dyschezia and cyclical cough, respectively, which are known indications of endometriosis. Due to data limitations, more granular symptom descriptors were not available in the source data. Additionally, cyclical scar swelling and pain was the only symptom from the updated guidelines without an associated code in the source data and thus not included in the phenotype. For this group, the date of cohort entry was the date of diagnosis.
For women who received a diagnosis of endometriosis and presented with endometriosis-related symptoms, we created three cohorts by successively expanding the number of potential symptoms included. For Cohort C, we required the occurrence of (i) at least two endometriosis diagnosis codes; (ii) at least one code for dysmenorrhea, dyspareunia, or dysuria prior to diagnosis; and (iii) at least one other code for an endometriosis-related symptom recognized in the clinical guidelines prior to diagnosis. For Cohort D, we required the same criteria as Cohort C but included pelvic pain. Cohort E builds upon Cohort D further by including abdominal pain.
We chose to include Cohorts D and E in our analysis based on patient-reported outcomes compiled via the Phendo mobile app ( https://citizenendo.org/phendo/ ). Phendo is a research-oriented app that has 17,000+ users across the world who submit information about their experiences with endometriosis. Approximately 60% of Phendo users report experiencing chronic pelvic pain and/or abdominal pain.
To validate and compare the relative performance of our five endometriosis cohort definitions, two physicians specializing in gynecology (MNA, SG) independently conducted a detailed chart review of CUIMC EHRs for a sample of patients. Full details regarding phenotype evaluation are provided in Supplementary Materials . In summary, all five cohort definitions achieved high levels of positive predictive value (PPV range: 0.84–0.96) but there was poor concordance among the patients identified in each cohort. Complete agreement among all five cohort definitions was only found among 16% of cases (Fleiss’ kappa = 0.13).
The primary outcomes of interest were the age at diagnosis, the number and proportion of patients shared among and unique to each cohort, and the prevalence of conditions, medications, and procedures documented prior to receipt of an endometriosis diagnosis. We standardized medications based on their anatomical therapeutic chemical (ATC) level three classification. Secondary outcomes included measures of healthcare utilization including the number of clinical encounters and emergency department visits and/or inpatient hospitalizations occurring before diagnosis. Among participants identified by multiple cohort definitions, we also assessed differences in their date of entry to each cohort.
We calculated descriptive statistics using count, percentage, mean, median, standard deviation, and interquartile range (IQR). We performed pairwise comparisons of proportion between cohorts using Chi-squared tests, then applied Bonferroni corrections to account for multiple comparisons. To compare means, we used z-tests. We conducted statistical analyses using the R programming environment, version 4.2.3 (R Foundation for Statistical Computing, Vienna, Austria).
Discussion
Our study of four large national insurance claims and EHR databases characterized differences in clinical histories and demographics for nearly half a million women diagnosed with endometriosis in the US. Findings from our analysis suggest that the latest shift in clinical guidelines results in more comprehensive diagnosis of women with the disease, but updated ESHRE diagnostic criteria still do not capture a sizable percentage of women with endometriosis. Most notably, despite using phenotypes with high levels of precision (PPV), only 15–20% of all endometriosis cases were shared among all five cohorts. This illustrates both the heterogeneous presentation of the disease as well as the importance of expanding the criteria used for diagnosis.
Our analysis focuses on comparison of cohorts based on five complementary phenotype definitions, two of which operationalize the original and updated diagnostic criteria delineated in the ESHRE clinical guidelines (Cohorts A and B, respectively). Importantly, the most recent ESHRE guideline asserts that, while imaging via ultrasound or MRI are recommended in a diagnostic work-up, a negative finding does not necessarily rule out endometriosis as a potential diagnosis. 9 , 14 – 18 In light of this, we took a data-driven approach that identified a substantial cohort of patients whose literal pain would have been ignored by either cohort definition. This manifested as three additional symptom-only cohorts based on phenotype definitions that first built upon the current guideline by incorporating only ESHRE-recognized symptoms (Cohort C) with successive expansion to include either one or both of the two most commonly reported symptoms of endometriosis – pelvic pain and/or abdominal pain (Cohorts D and E, respectively).
Importantly, cohorts derived from the newer guidelines and its expanded iterations identified patients who were younger at the time of diagnosis. Given that the disease is suspected to be congenital and prior studies suggest a diagnostic delay of four to eleven years from first symptom onset, this finding is not inconsiderable. While the new guidelines were only released a short time ago, this also nonetheless illustrates substantial room for improvement. Further demonstrating this is our comparison of differences in the timing of cohort entry. We found little-to-no difference in entry date between symptom cohorts, with some indication that entry to the surgical cohort skewed slightly toward later dates, representing a diagnostic delay. Additionally, the racial and ethnic makeups of patients identified under the new guidelines were largely unchanged, though notably women who identified as African American or Black routinely represented a quarter of patients in each cohort segment in the Medicaid dataset while only comprising 15% of the US population of women.
In light of its heterogenous presentation, flexible guidelines are especially necessary to prevent under- and delayed diagnosis and any potential disparities that may have arisen in the past by maintaining surgical confirmation as a strict requirement. 13 Given the complexity of endometriosis, it is also paramount that as additional evidence is generated, novel insights are further incorporated into clinical practice. This transformation also has important implications for how observational health research is conducted. Changes in clinical documentation will require updating existing approaches to identify cohorts of patients with endometriosis and those cohort definitions, if not rigorously examined, may in turn result in skewed characterizations of women with the disease.
We found substantial differences in the prevalence of conditions, medications, and procedures associated with patients in each cohort. Primarily these significant differences were between Cohort A and all the other cohorts. Nonetheless, despite the similarities in composition of Cohorts B-E, there were still many women not represented under the new guidelines. Our investigation found that nearly one-fourth of cases were consistently identified only when using the symptom-based phenotypes (Cohorts C-E), suggesting that imposing surgery or imaging-related constraints to diagnosis may miss a substantial portion of women with endometriosis. Further, a small but sizable percentage of women (nearly 15,000 cases) presented with only pelvic and/or abdominal pain and none of the other guideline-recognized symptoms.
Among conditions for which there was significant difference between Cohort A and Cohorts B-E, we noted that urinary tract pain was both commonly documented among women in Cohorts B-E and at rates more than twice that of women in Cohort A. Additionally, condition codes for infectious disease of genitourinary system and urinary tract infectious disease were much more frequently reported in Cohorts B-E. While we cannot be certain based on available data, one potential hypothesis is that when women present with urogenital pain, in the absence of surgical, imaging, or laboratory test confirmation, clinicians may first suspect urinary tract infection instead of endometriosis, resulting in diagnostic delay.
Our analysis also revealed differences in patterns of healthcare utilization among women diagnosed with endometriosis. For example, we observed higher rates of ER visits among women in cohorts based on imaging and/or symptom presentation compared to surgical confirmation. These findings underscore the continued need for improved access to timely and appropriate care for women with endometriosis, particularly those with non-classical symptoms, different care-seeking preferences, or those lacking access to surgical intervention.
While our study provides valuable insights regarding the clinical characteristics of endometriosis among women in the US, we recognize several limitations that should be acknowledged. First, our retrospective observational study design inherently limits our ability to establish causality. As secondary data sources, insurance claims and EHR data also introduce potential biases and inaccuracies inherent to administrative data coding and documentation practices. Second, while we largely evaluate differences among cohorts within each data source separately to preserve internal consistency, there is non-zero possibility that a woman represented in one data source may also be represented in another and we do not account for potential correlation. Third, while we did not detect any statistically significant differences in race or ethnicity between cohorts, there were a large percentage of women of unknown race or ethnicity. Fourth, our phenotyping algorithms rely on condition, observation, and procedure codes that did not necessarily capture the full spectrum of diagnostic criteria noted in the ESHRE guidelines (i.e., dyschezia, cyclical cough, and cyclical scar swelling and pain). Further, algorithmic performance may also differ across healthcare settings and populations. Fifth, by design, our inclusion criteria for each cohort were based on specific combinations of diagnostic criteria and may limit the generalizability of our findings to all women with endometriosis, particularly among those with atypical presentations. Lastly, each of our phenotypes required diagnosis codes for endometriosis. Our approach likely underestimates the true prevalence of endometriosis, as the condition is known to be underdiagnosed and underreported in clinical practice.
Our study provides novel insights into the complex interplay between diagnostic criteria, clinical characteristics, and healthcare utilization patterns among women diagnosed with endometriosis in the US. By elucidating these relationships, our findings contribute to a deeper understanding of the heterogeneity of endometriosis and underscore the importance of personalized approaches to diagnosis and management. Moving forward, efforts to optimize diagnostic algorithms, expand symptom recognition, and enhance access to comprehensive care are critical to addressing the unmet needs of women affected by this condition.
Introduction
Endometriosis is a chronic gynecological condition characterized by the growth of endometrial tissue outside of the uterus. 1 , 2 Marked by a heterogeneous set of symptoms that vary widely among women, the complex disorder can impact a range of body systems and often involves chronic pain, dysmenorrhea, dyspareunia, dysuria, and fatigue. Half of women with infertility are diagnosed with endometriosis. 1 , 2 Given the enigmatic nature of the disease and high likelihood for underdiagnosis, estimates of prevalence can vary widely. Nonetheless, the statistic most often cited is that 10% of women have endometriosis. 3
Historically, laparoscopy was the gold standard for diagnosing endometriosis. 4 While effective in confirming the presence of the disease, laparoscopy poses potential risks and is not always preferred by patients. The correlation between the extent of surgical findings and severity of symptoms also remains complex, underscoring the need for a comprehensive diagnostic strategy. Additionally, there is growing concern that emphasis on laparoscopy contributes to delays and underdiagnosis, particularly in cases where surgical intervention is not feasible or not covered by insurance. Notably, the average delay between disease onset and formal diagnosis of endometriosis is considered to be between four and eleven years. 5 – 7
In 2022 the European Society of Human Reproduction and Embryology (ESHRE) shifted its clinical guidelines regarding endometriosis toward a more multimodal approach to diagnosis, emphasizing assessment of indicative symptoms and the use of diagnostic imaging such as transvaginal sonography (TVS) and magnetic resonance imaging (MRI) as a complement to laparoscopic confirmation. 8 , 9 The shift in clinical guidelines acknowledges high variability in diagnosing endometriosis and aims to enhance accuracy and expedite diagnosis.
Given the potential for reducing delayed and underdiagnosis, the rise of new guidelines also suggests prior characterizations may not accurately reflect the full composition, care patterns, and spectrum of experiences associated with endometriosis, especially prior to receipt of a formal diagnosis. It is not yet clear how the expansion of diagnostic criteria will impact the composition of women diagnosed with the disorder. Moreover, these new guidelines may still not reflect the breadth of symptoms commonly experienced by women with endometriosis. Importantly, it is critical to investigate how such changes and considerations are likely to influence future observational health research given the impact on both clinical practice and documentation of care.
In this study, we sought to examine differences among women diagnosed with endometriosis, including their clinical histories and demographics. Specifically, we compared five cohort definitions based on different sets of diagnostic criteria involving combinations of surgical confirmation, diagnostic imaging, guideline-recognized symptoms, and other symptoms commonly reported among women with endometriosis.
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.