Automated extraction of ESGO operative report fields from free-text surgical notes using large language models in advanced ovarian cancer
article
OA: hybrid
CC0
AI-generated summary
Large language models accurately extracted organ-level disease involvement from surgical notes for advanced ovarian cancer, achieving near-human performance after optimization strategies.
One-sentence paraphrase of the abstract; not a substitute for reading it. No clinical advice. How this works
Abstract
OBJECTIVE: To determine whether large language models (LLMs) can automatically extract organ-level disease involvement to populate the Surgical Findings section of the European Society of Gynaecological Oncology (ESGO) Operative Report for advanced ovarian cancer. METHODS: We retrospectively collected 300 operative notes from cytoreductive surgeries performed at a tertiary ESGO-accredited center. Each note was interrogated to identify disease involvement across 35 pre-defined ESGO anatomical sites. For each site, LLMs were tasked with classifying whether disease was present. Their accuracy was compared with expert annotations using F1 scores. Four modern models were selected based on their state-of-the-art performance and suitability for clinical text interpretation. Operative notes were converted into sets of binary (yes/no) questions corresponding to each anatomical site. Models were tested both in their basic form and after targeted enhancement strategies to reduce common errors. These enhancements included adding a clinical terminology list, providing clearer task instructions, and showing a small number of examples. RESULTS: The models showed good baseline accuracy, with the two top-performing systems achieving F1 scores of 0.851 (95% confidence interval [CI] 0.841 to 0.861) and 0.864 (95% CI 0.854 to 0.873). Following optimization strategies, accuracy increased further, reaching 0.897 (95% CI 0.888 to 0.906) and 0.875 (95% CI 0.866 to 0.884). Performance was highest for key clinical sites, including the omentum, right diaphragm (95%), and ovaries (92%). Lower accuracy was observed for complex anatomical sites such as bowel (small bowel 73%, large bowel 61%) and peritoneal sites (pouch of Douglas 82%, abdominal wall 68%). Frequent errors involved laterality, overlapping anatomical regions, and ambiguous abbreviations. Optimization strategies improved distinction between closely related sites (rectosigmoid vs large bowel/mesentery) and reduced left/right errors. CONCLUSIONS: With enhancement strategies, LLMs demonstrated near-human performance in extracting ESGO-compliant operative information. Integrating model-assisted extraction into surgical workflows may reduce reporting time, improve completeness, and help standardize operative documentation.
My notes (saved in your browser only)
Citation neighborhood (no data yet)
We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.
References (23)
- doi:10.1177/10732748231209892 via openalex
- doi:10.4103/ijnmr.ijnmr_413_23 via openalex
- doi:10.1007/s10115-022-01779-1 via openalex
- doi:10.1007/s00404-016-4080-3 via openalex
- doi:10.1016/j.compbiomed.2011.04.014 via openalex
- doi:10.1097/igc.0000000000000767 via openalex
- doi:10.1038/s41746-024-01233-2 via openalex
- doi:10.3390/jcm14072223 via openalex
- doi:10.1038/s41746-025-01810-z via openalex
- doi:10.1002/ijgo.15869 via openalex
- doi:10.1038/s41746-024-01239-w via openalex
- doi:10.2196/50638 via openalex
- doi:10.1001/jamapediatrics.2023.2373 via openalex
- doi:10.1016/j.artmed.2024.102900 via openalex
- doi:10.2196/65984 via openalex
- doi:10.3390/cancers15225386 via openalex
- doi:10.1093/biomethods/bpae072 via openalex
- doi:10.1088/1361-6560/adcb17 via openalex
- doi:10.1371/journal.pdig.0000604 via openalex
- doi:10.3390/cancers15030966 via openalex
- doi:10.1148/radiol.241554 via openalex
- doi:10.1186/s13048-023-01303-1 via openalex
- doi:10.1016/j.ygyno.2011.04.041 via openalex
Source provenance
- openalex
- last seen: 2026-05-13T19:46:01.794608+00:00
License: CC0
· commercial use OK