⚙
AI-generated deep summary
by claude@2026-06, 2026-06-15
· read from full text
ⓘ
This author correction to a Scientific Reports article about self-report symptom-based endometriosis prediction using machine learning addresses an error in how a Jaccard heatmap in Figure 1 was interpreted under “Symptom importance.” The correction clarifies that the heatmap reflects 1–Jaccard index (Jaccard distance) values, where darker cells indicate smaller distances and greater symptom similarity, and it states that all calculated distances exceeded 0.25, leading to no feature columns being eliminated for redundancy. It also notes removal of a related sentence in the symptom importance analysis, updates the Figure 1 legend, and removes Table 3 from the corrected version. This paper is centrally about endometriosis — it corrects interpretation details in a machine-learning model intended to predict endometriosis from self-reported symptoms.
Full text
3,827 characters
· extracted from
oa-pdf
· click to expand
1
Vol.:(0123456789)Scientific Reports | (2024) 14:10443 | https://doi.org/10.1038/s41598-024-61280-3
www.nature.com/scientificreports
Author Correction: Self‑report
symptom‑based endometriosis
prediction using machine learning
Anat Goldstein & Shani Cohen
Correction to: Scientific Reports https:// doi. org/ 10. 1038/ s41598- 023- 32761-8, published online 04 April 2023
The original version of this Article contained an error in the interpretation of the heatmap presented as Figure 1.
As a result, in the Results section under the subheading, ‘Symptom importance’ ,
“Figure 1 shows a heatmap of the Jaccard Index values, indicating the correlation between each pair of symptom
values. In this figure, the yellow rows/columns indicate that the symptom is highly correlated with many other
symptoms. We identified six symptoms that are highly correlated (Jaccard Index > 0.8) with more than 30% of
the symptoms: fever, abnormal uterine bleeding, syncope (fainting, passing out), infertility, constant bleeding,
and malaise/sickness. Five of these symptoms appear at the bottom of Table 1. To investigate whether removing
these potentially redundant features improves the models’ classification performance, we trained the models again
without these six symptoms. Table 3 present the performance results of the different models. After removing the
highly correlated symptoms, the performance of the Decision Tree model improved, whereas the performance
of the remaining models diminished slightly. ”
now reads:
“Figure 1 shows a heatmap of the Jaccard Index distance values, derived as 1-Jaccard index, which reflect the
correlation levels between symptom pairs. In this figure, darker cells signify smaller distances, indicating a higher
degree of similarity between the symptoms. Notably, all calculated Jaccard distance values exceeded 0.25, and
following this analysis, no columns were eliminated due to redundancy. ”
Additionally, in section “Symptom importance analysis” , the sentence “We analyzed the performance of the
models after removing symptoms that are highly correlated with other symptoms (Jaccard Index close to 1). ”
was removed.
Thirdly, in the legend of Figure 1:
“ A heatmap that shows Jaccard Indices between each pair of symptom value vectors. A lighter color indicates
a higher Jaccard Index, or a strong similarity between values. We use the Jaccard Index to identify potentially
redundant symptoms. ”
now reads:
“ A heatmap that shows Jaccard Indices between each pair of symptom value vectors. A darker color indicates
a lower Jaccard Index distance, or a strong similarity between symptoms. We use the Jaccard Index to identify
potentially redundant symptoms. ”
Lastly, Table 3 was removed.
The original Article has been corrected.
OPEN
2
Vol:.(1234567890)Scientific Reports | (2024) 14:10443 | https://doi.org/10.1038/s41598-024-61280-3
www.nature.com/scientificreports/
Open Access This article is licensed under a Creative Commons Attribution 4.0 International
License, which permits use, sharing, adaptation, distribution and reproduction in any medium or
format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the
Creative Commons licence, and indicate if changes were made. The images or other third party material in this
article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the
material. If material is not included in the article’s Creative Commons licence and your intended use is not
permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from
the copyright holder. To view a copy of this licence, visit http:// creat iveco mmons. org/ licen ses/ by/4. 0/.
© The Author(s) 2024
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.