Overall survival prediction models for gynecological endometrioid adenocarcinoma with squamous differentiation (GE-ASqD) using machine-learning algorithms

Liu, Xiangmei; Jin, Shuai; Zi, Dan

doi:10.1038/s41598-023-33748-1

Download PDF

Article
Open access
Published: 24 May 2023

Overall survival prediction models for gynecological endometrioid adenocarcinoma with squamous differentiation (GE-ASqD) using machine-learning algorithms

Xiangmei Liu^1,3^na1,
Shuai Jin²^na1 &
Dan Zi^3,4

Scientific Reports volume 13, Article number: 8395 (2023) Cite this article

626 Accesses
Metrics details

Subjects

Abstract

The actual 5-year survival rates for Gynecological Endometrioid Adenocarcinoma with Squamous Differentiation (GE-ASqD) are rarely reported. The purpose of this study was to evaluate how histological subtypes affected long-term survivors of GE-ASqD (> 5 years). We conducted a retrospective analysis of patients diagnosed GE-ASqD from the Surveillance, Epidemiology, and End Results database (2004–2015). In order to conduct the studies, we employed the chi-square test, univariate cox regression, and multivariate cox proportional hazards model. A total of 1131 patients with GE-ASqD were included in the survival study from 2004 to 2015 after applying the inclusion and exclusion criteria and the sample randomly split into a training set and a test set at a ratio of 7:3. Five machine learning algorithms were trained based on nine clinical variables to predict the 5-year overall survival. The AUC of the training group for the LR, Decision Tree, forest, Gbdt, and gbm algorithms were 0.809, 0.336, 0.841, 0.823, and 0.856 respectively. The AUC of the testing group was 0.779, 0.738, 0.753, 0.767 and 0.734, respectively. The calibration curves confirmed good performance of the five machine learning algorithms. Finally, five algorithms were combined to create a machine learning model that forecasts the 5-year overall survival rate of patients with GE-ASqD.

A nomogram for predicting cancer-specific survival in patients with uterine clear cell carcinoma: a population-based study

Article Open access 07 June 2023

Predictive model for the preoperative assessment and prognostic modeling of lymph node metastasis in endometrial cancer

Article Open access 08 November 2022

Development and validation of a nomogram for predicting recurrence-free survival in endometrial cancer: a multicenter study

Article Open access 20 November 2023

Introduction

Gynecologic malignancies are key and more common diseases that affect women's survival¹. The most frequent histological type of endometrial cancer, endometrioid adenocarcinoma, accounting for 80% of all cases of endometrial cancer². Endometrioid ovarian cancer is a rare epithelial ovarian cancer with pathological features similar to endometrial cancer, accounting for only 10% of epithelial ovarian cancers³.

Squamous differentiation is defined as any kind of squamous metaplasia, including morular metaplasia. Typically, endometrioid features include evidence of endometrioid differentiation, including squamous differentiation, complex atypical hyperplasia, and low-grade endometrioid components^4,5. Studies have revealed that the classification of squamous differentiation components as low-grade or high-grade differentiation is important in order to accurately predict tumor prognosis^6,7. Squamous differentiation of endometrial cancer has drawn particular attention from researchers because the prognostic variables are still unclear⁸. According to data from a different study, squamous differentiation may operate as a poor prognostic factor in patients with low to moderate endometrioid endometrial cancer by raising the probability of recurrence by a factor of 5.6 times⁹.

Machine learning (ML), an area of artificial intelligence that allows mining the relationships from complex datasets, has been used to make predictions about future outcomes among gynecologic oncology. A deep learning-based automatic staging technique for early endometrial cancer was developed by Mao et al. using MRI scans¹⁰. According to a multicenter study by Wu et al., an artificial intelligence-based preoperative prediction system could identify and forecast the prognosis of epithelial ovarian cancer¹¹. Grimley et al. also utilized a machine learning model to forecast prognostic of patients with epithelial ovarian carcinomas¹².

Herein, the main purpose of this work was to construct machine learning models and forecast the 5-year survival rate of GE-AsqD patients.

Methods

Data collection

The datasets analysed during the current study are available in the SEER databases repository, SEER* Stat 8.3.6, https://seer.cancer.gov/. SEER belong to public databases. It was not necessary to get written informed consent for participating in the present research as the information contained in the SEER database has been de-identified and is publically available following authorization. Users can download relevant data for free for research and publish relevant articles. Our study is based on open source data, so there are no ethical issues and other conflicts of interest.

Patient and variable selection

We extracted patients diagnosed with gynecological endometrioid adenocarcinoma with squamous differentiation (GE-ASqD) data from the Surveillance, Epidemiology, and End Results (SEER) database. The inclusion criteria were applied: (I) diagnosed between 2004 and 2015; (II) primary site in the endometrium and ovary [International Classification of Diseases for Oncology, third edition (ICD-O-3) code, C54.1, C56.9]; (III) histologically proven malignant carcinosarcoma (ICD-O-3 codes 8570/3). The exclusion criteria were applied: (I) age < 18 year-old, (II) not the primary tumor; (III) unknown information about race, stage, regional nodes examined, tumor size, T, N, M; (IV) For futher training and validation prognostic model analysis, survival time less than 60 months would be excluded. The following clinical pathologic variables were selected: age at diagnosis, race, sequence number, marital status, stage, surgery status, radiation status, chemotherapy status, regional nodes examined (RN Examined), AJCC T, N, M stage, primary site. All patients were staged according to the SEER stage: localized, regional, and distant. We employed the sixth edition of the Derived AJCC Stage Group. It is worth mentioning that the X-tile software (https://medicine.yale.edu/lab/rimm/research/software/) converted continuous variables (age at diagnosis) into categorical variables by determining the optimal cutoff points for each variable¹³. We divided the age at diagnosis into the 18–66, and 67–95-year categories using 66- and 95-year as the cutoff values. The main endpoint was overall survival (OS), which was calculated as the period from diagnosis to death from any cause. The sample was randomly split into a training set and a test set at a ratio of 7:3. The patient selection flowchart is shown in Fig. 1.

Machine learning models

In this study, we have used several supervised ensemble-based machine learning algorithms, including Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), Light Gradient Boosting Machine (LGBM), and Gradient Boosting (Gbdt) separately to build classification models to stratify GE-ASqD patients., and we searched for the models with the best performance.

In machine learning, a random forest (forest) is a classifier that includes multiple decision trees. The categories of its output are determined by the modes of categories output by individual trees.

The LightGBM (gbm) algorithm is a lifting machine learning algorithm. It is a fast, distributed and high-performing gradient lifting framework based on a decision tree algorithm. It can sort, classify, run regressions, and perform many other machine learning tasks.

The construction of a decision tree model has two steps: induction and pruning. Induction is the step of constructing a decision tree by setting all hierarchical decision boundaries based on data at hand. However, the tree model is subject to severe over-fitting due to the nature of the training decision tree, and this is when pruning is required. Pruning is the process of removing unnecessary branch structures from the decision tree, simplifying the process of overcoming over-fitting and making it easier to interpret.

Elevation is a machine learning technique that can be used for regression and classification problems. It produces a weak prediction model (like a decision tree) at each step and weights it into the total model. If the weak prediction model of each step generates consistent loss function gradient direction, then it is called gradient boosting (Gbdt).

For all machine learning studies, the Python (Python 3.7.13) programming language has been utilized.We have utilized Python libraries such as pandas and numpy for basic data processing and sklearn for machine learning.

The coefficients for the machine learning technique were trained and tested. Evaluation and comparison were completed with the prediction accuracy of a model constructed by machine learning and the area under the curve (AUC). F1-Measure evaluation indicators are used in information retrieval and natural language processing. Precision rate indicates the proportion of correctly classified cases of the sample. Accuracy rate refers to the number of paired cases split by the total number of cases. Recall rate relates to the positive cases in the sample which were predicted correctly. MSE (Mean Squared Error) measures the amount of error in statistical models. Missing data were estimated through multiple imputations.

Statistical analysis

All statistical analyses were conducted using R version 3.6.1 (www.r-project.org). The association among demographic, clinicopathological, and treatment variables for the histological subtypes was compared using the chi-square test and the Fisher exact test.

Univariate cox regression analysis demonstrated potential prognostic factors with P values < 0.1. Multivariate cox proportional hazards model was used to evaluate the prognostic factors associated with OS. Prognostic factors with P values < 0.1 on univariate analyses were entered into multivariate analyses. Then, a set of machine learning models were developed base on the independent prognostic factors associated with OS for GE-ASqD patients.

Results

Baseline characteristics

After screening data from the SEER database (Fig. 1), we selected a total of 1131 GE-ASqD patients, including 1079 cases of endometrium (EE-ASqD), meanwhile 52 cases of ovary (OE-ASqD). Most of the variables were similarly distributed between EE-AsqD and OE-ASqD. In comparison, approximately 12% and 33% of patients were 1st of 2 or more primaries in the EE-AsqD and OE-AsqD sets, respectively, 75.1% vs. 46.2% as localized, 20.5% vs. 44.2% were regional; radiation (27% vs. 1.9%), chemotherapy (11% vs. 54%) in both sets.The characteristics between the two cancers are shown in detail in Table 1.

Table 1 The baseline characteristics of the GE-ASqD patients in SEER database.

Full size table

According to the results of univariate cox regression analysis, we found that race, age at diagnosis, sequence number, stage, surgery status, radiation status, chemotherapy status, regional nodes, T, N, M stage were potentially correlated with the OS of GE-ASqD (P < 0.05).

These potential prognostic factors were evaluated through multivariate regression analysis, which indicated that race [other race vs. black : hazard ratio (HR) = 0.42, 95% confidence interval (CI) 0.21–0.085, P = 0.016 ], age at diagnosis [18–66 years vs. 67–95 years: hazard ratio (HR) = 0.27, 95% confidence interval (CI) 0.21–0.35, P < 0.001], sequence number[1st of 2 or more primaries vs. 1st of 2 or more primaries: hazard ratio (HR) = 0.58, 95% confidence interval (CI) 0.41–0.8, P = 0.001], surgical status (yes vs. none : HR = 0.28, 95% CI 0.16–0.49, P < 0.001), radiation status (radiation vs. no radiation: HR = 1.44, 95% CI 1.07–1.94, P = 0.016), chemotherapy status (chemotherapy vs. no/unknown: HR = 0.6, 95% CI 0.4–0.91, P = 0.016), RN Examined (yes vs. no : HR = 0.54, 95% CI 0.4–0.73, P < 0.001), T stage (T3 vs. T1 : HR = 3.45, 95% CI 2.32–5.12, P < 0.001), N stage (N1 vs. N0 : HR = 2.9, 95% CI 1.91–4.39, P < 0.001), and M stage (M1 vs. M0: HR = 3.03, 95% CI 1.81–5.01, P < 0.001) were independent prognostic factors for GE-ASqD (P < 0.05). The results of the univariate and multivariate cox regression analysis are listed in detail in Table 2.

Table 2 The baseline characteristics, univariate and multivariate cox analysis.

Full size table

Alive but survival months < 60 months were excluded, finally 907 patients remain for further analysis. Table 3 summarizes the baseline characteristics of the training and validation sets. All variables were similarly distributed between the two sets, with EE-ASqD (95.9% vs. 94.9%) and OE-ASqD (4.1% vs. 5.1%) in the training and validation sets. In both sets, almost all patients sequence number were the one primary only (86% vs. 88%). Most of the patients in the training and validation sets were white (83.8% vs. 84.3%), 18–66-year (78% vs. 81%), and married (51% vs. 58%). The clinical data demonstrated a relatively localized (71.6% vs. 75.8%) malignancy; In comparison, approximately 77% and 80.2% of patients in the training and validation sets, respectively, was T1 stage, 89% vs.92.7% of N0, and 95% vs. 96% as M0. In both sets, almost all patients received surgery (96.5% vs. 96.3%), for regional nodes examination (RN Examined) were done (63% vs. 61%) in the training and validation sets furthermore. whereas only a few patients received chemotherapy (13%) and radiation (26% vs. 23%) in the training and validation sets.

Table 3 The baseline characteristics of the training and validation sets used in the prognostic model.

Full size table

Prognostic model construction and model performance

In this study, the dataset consisted of 907 individual patients’ information. We divided the whole dataset into 70% for training and 30% for testing. Accuracy, Precision, Recall, F1-score, AUC, and MSE evaluation metrics were employed to test the classifier performance. Figure 2A shows the associated independent risk factors based on a multiple linear regression model. Multiple linear regression models are used to quantify the relationship between predictor variables and a response variable takes on a continuous value. Two of the most important values in a regression table are the regression coefficients and their corresponding p-values. The p-values inform whether or not there is a statistically significant relationship between each predictor variable and the response variable. Because the output of a linear regression model is continuous value. It is possible to get negative values as well as the output. It is different from logistic regression model, which returns probability as the output varies between 0 and 1. The most significant descending order parameters were age at diagnosis, N stage, T2 stage, RN Examined, and surgery status in Fig. 2B of DT model. The age at diagnosis, N stage, T3 stage, RN Examined, and radiation status are the most critical attribute in descending order in the RF model. The highest vital features in descending order are age at diagnosis, N stage, T3 stage, surgery and RN Examined, status in the GB model. The radiation, RN Examined, chemotherapy status, age at diagnosis and sequence number are the most critical attribute in descending order in the LGBM model.

As show in Figs. 3A and 4A, the models constructed by the five machine learning algorithms in the training group are compared. Among the five machine learning algorithms, gbm and random forest have the highest accuracy, 0.836 and 0.822 respectively. The highest precision of the five algorithms was random forest, 0.742. The highest recall rate was that of the gbm algorithm (0.613). Among the five algorithms, gbm had the highest accuracy, recall rate and f1 score, and Auc, 0.836, 0.613, 0.671 and 0.856, respectively. The AUC values for the four algorithms were: gbm (0.856), forest (0.841), DecisionTree (0.836). Gbdt (0.823) and LR (0.809). Among the five algorithms, gbm had the lowest MSE value (0.164).

The models constructed by four machine learning algorithms in the test group are compared (Figs. 3B and 4B). LR had the highest accuracy (0.799), precision (0.559) and Auc (0.779). The recall rate and f1-score for the gbm algorithm was 0.407 and 0.44. The lowest f1 score was that of decision tree at 0.059. The AUC values of the five algorithms were: LR (0.779), Gbdt (0.767), forest (0.753), DecisionTree (0.738) and gbm(0.734). Among the five algorithms, LR had the lowest MSE value at 0.201. The calibration curves confirm good performance of the five machine learning algorithms (Fig. 5A and 5B).

Discussion

In the era of “personalized medicine,” the use of prediction models has gained increasing interest among clinicians to guide treatment planning, individualized treatment aims to minimize unnecessary exposure to therapy-related morbidity and at the same time offer proper management for high-risk patients. The combination of PARP inhibitors and immune checkpoint inhibitors considerably improves the prognosis of gynecologic cancer patients and promotes the long-term benefits following maintenance therapy in the era of personalized precision medicine where targeted and immunotherapy are common¹⁴. Furthermore, Zhang et al. and Chen et al. demonstrated the predictive importance of PD-L1 expression in endometrial serous cancer^15,16.

It is now evident that OC and EC are not single disease, but is a category comprised of several distinct histotypes. The medical community’s understanding of OC and EC has changed significantly over the past few years^17,18. Katelyn et al. also suggest that each morphologic subtype has potential therapeutic implications¹⁹. Additionally, the likelihood of passing away varied noticeably among those with various histological subtypes²⁰. Less research has been focused on gynecologic endometrial adenocarcinoma with histological subtypes of squamous differentiation in gynecologic oncology, but this needs to be examined because of the unique prognostic determinant and association pattern our findings suggest throughout the survival trajectory. Data from the SEER program provides a unique opportunity to study a rare disease given the large, nationally-representative sample of cancer patients, extensive follow-up information, and availability of detailed histomorphologic data. According to the data extracted from the SEER database in our current study, GE-ASqD is more common among young whites, generally in the early stages of the disease, and most patients underwent surgical treatment and intraoperative lymph node examination.

Although the demographic characteristics of the present study suggest that non-white populations account for a very small proportion of GE-ASqD, there are many studies on ovarian cancer that suggest poor survival for black compared with other ethnic groups. Thus, the recently established African ancestry women's ovarian cancer (OCWAA) consortium²¹ analyzed key differential factors by analyzing various characteristics of patients in their large national cancer databases or medical databases. Studies have also noted worse survival in black patients with endometrial cancer (EC) compared with white patients, and higher staging and grading, histological risk, and worse survival in black women^22,23. Studies have also shown that black women are 2.5 times more likely to die from endometrial cancer²⁴. In the multivariate analysis of our study, age, surgery, chemoradiotherapy, lymph node examination, and the presence or absence of node-positive metastases were statistically significantly different from patient prognosis analysis. Our analysis based on LR models and tree models showed that T stage and presence of positive lymph node metastasis were significantly associated with 5-year survival. Especially in tree-based models, age and Nodal properties have been shown to be significantly associated with disease survival.

According to international guidelines^25,26, the fundamental management of gynecological oncology is achieved through standard surgery or cytoreductive surgery performed by a team of trained gynecological oncologists. Most patients present with early-stage disease are cured by surgery. In this study, we did not further detail the classification of surgical procedures for patients, but the results of this study suggest that surgery is an important determinant of patient survival. In the current study, N was assessed as a very important and meaningful clinical variable factor in each model for risk assessment and prognostic prediction, and the model suggested the importance of N intraoperative assessment. Many previous studies and clinical guidelines have indicated that Positivity for pelvic and/or paraaortic lymph node metastasis (LNM) is an important aspect of predicting a worse prognostic outcome, and current guidelines of the National Comprehensive Cancer Network (NCCN) recommend observation for patients with substage IA or IB, grade 1 EEOC^27,28,29. The significance of conventional lymphadenectomy for improving outcomes in early clinical endometrial cancer is controversial, but it is strongly associated with a 15% to 20% surgically related morbidity³⁰. Few attempts have been made to predict the risk of LNM before surgical treatment^31,32,33. In recent years, gynecologic oncologists have chosen to use dye-injected tracer in the first few minutes of the operation, combining with Fluorescence microscope to accurately assess whether there are first-stop Sentinel lymph node and distant lymph node metastases, when the results were positive, the decision was made to perform systemic lymph node dissection and para-aortic lymph node dissection to improve the patient's prognosis³⁴. At the same time, the implementation of this technology has also had a positive impact on reducing healthcare costs. Among the patients in this study, ovarian cancer patients receiving radiotherapy and endometrioid carcinoma patients receiving chemotherapy were a minority. It has been demonstrated that radiation therapy increases survival in individuals with high risk of endometrial cancer but not in those with intermediate risk³⁵. It also increases costs and a higher risk of morbidity. It is advised that patients with mild endometrial cancer refrain from radiation therapy for the time being, and that they instead undergo follow-up observation. The study points out that by dividing gynecologic tumors into clinically meaningful subgroups, we can better understand the pathological development and pathogenesis of tumors, thus adapting to the era of individualized and precise treatment, to select and improve individualized treatment based on machine learning and other prognostic scientific prediction.

However, there are some limitations to our study. Firstly, SEER lacks data on chemotherapy treatments and patterns and timing of recurrence. Second, outcome data for individuals receiving targeted therapies were not included in the sample, which may have made the prediction model less comprehensive. Finally, it should be admitted that further external validation in different geographic regions and etiology is of necessity.

Conclusions

In this work, five machine learning algorithms were used to build predictive models after analyzing the clinical traits and prognosis of patients with GE-ASqD. The 5-year OS of patients with GE-ASqD could be accurately predicted using the machine learning model, which may aid clinicians in making more precise and individualized therapeutic decisions. This is especially crucial to boost the long-term prognosis of high-risk patients with histological subtype squamous differentiation.

Condensation

Machine Learning Algorithm Predictive Model for the 5-year OS of GE-ASqD.

Data availability

The code to perform all presented studies is written in R or python and is freely available on GitHub: https://github.com/users/mimimay-cpu/projects/3/views/1?pane=issue&itemId=24276328.

References

Sung, H. et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 71, 209–249. https://doi.org/10.3322/caac.21660 (2021).
Article PubMed Google Scholar
Horn, L. C., Meinel, A., Handzel, R. & Einenkel, J. Histopathology of endometrial hyperplasia and endometrial carcinoma: An update. Ann. Diagn. Pathol. 11, 297–311. https://doi.org/10.1016/j.anndiagpath.2007.05.002 (2007).
Article PubMed Google Scholar
Prat, J. New insights into ovarian cancer pathology. Ann. Oncol. 23(Suppl 10), x111-117. https://doi.org/10.1093/annonc/mds300 (2012).
Article PubMed Google Scholar
Long, M. E. & Taylor, H. C. Jr. Endometrioid carcinoma of the ovary. Am. J. Obstet. Gynecol. 90, 936–950. https://doi.org/10.1016/0002-9378(64)90790-2 (1964).
Article CAS PubMed Google Scholar
Nicolae, A., Preda, O. & Nogales, F. F. Endometrial metaplasias and reactive changes: A spectrum of altered differentiation. J. Clin. Pathol. 64, 97–106. https://doi.org/10.1136/jcp.2010.085555 (2011).
Article PubMed Google Scholar
Kline, R. C. et al. Endometrioid carcinoma of the ovary: Retrospective review of 145 cases. Gynecol. Oncol. 39, 337–346. https://doi.org/10.1016/0090-8258(90)90263-k (1990).
Article CAS PubMed Google Scholar
Hoang, L. N. et al. Histotype-genotype correlation in 36 high-grade endometrial carcinomas. Am. J. Surg. Pathol. 37, 1421–1432. https://doi.org/10.1097/PAS.0b013e31828c63ed (2013).
Article PubMed Google Scholar
Zaino, R. J. et al. The significance of squamous differentiation in endometrial carcinoma. Data from a Gynecologic Oncology Group study. Cancer 68, 2293–2302. https://doi.org/10.1002/1097-0142(19911115)68:10%3c2293::aid-cncr2820681032%3e3.0.co;2-v (1991).
Article CAS PubMed Google Scholar
Andrade, D. A. P. et al. Squamous differentiation portends poor prognosis in low and intermediate-risk endometrioid endometrial cancer. PLoS One 14, e0220086. https://doi.org/10.1371/journal.pone.0220086 (2019).
Article CAS PubMed PubMed Central Google Scholar
Mao, W., Chen, C., Gao, H., Xiong, L. & Lin, Y. A deep learning-based automatic staging method for early endometrial cancer on MRI images. Front. Physiol. 13, 974245. https://doi.org/10.3389/fphys.2022.974245 (2022).
Article PubMed PubMed Central Google Scholar
Wu, M. et al. Artificial intelligence-based preoperative prediction system for diagnosis and prognosis in epithelial ovarian cancer: A multicenter study. Front. Oncol. 12, 975703. https://doi.org/10.3389/fonc.2022.975703 (2022).
Article PubMed PubMed Central Google Scholar
Grimley, P. M. et al. A prognostic system for epithelial ovarian carcinomas using machine learning. Acta Obstet. Gynecol. Scand. 100, 1511–1519. https://doi.org/10.1111/aogs.14137 (2021).
Article PubMed Google Scholar
Camp, R. L., Dolled-Filhart, M. & Rimm, D. L. X-tile: A new bio-informatics tool for biomarker assessment and outcome-based cut-point optimization. Clin. Cancer Res. 10, 7252–7259. https://doi.org/10.1158/1078-0432.Ccr-04-0713 (2004).
Article CAS PubMed Google Scholar
Musacchio, L. et al. PARP inhibitors in endometrial cancer: Current status and perspectives. Cancer Manag. Res. 12, 6123–6135. https://doi.org/10.2147/cmar.S221001 (2020).
Article CAS PubMed PubMed Central Google Scholar
Zhang, T. et al. PD-L1 expression in endometrial serous carcinoma and its prognostic significance. Cancer Manag. Res. 13, 9157–9165. https://doi.org/10.2147/cmar.S337271 (2021).
Article CAS PubMed PubMed Central Google Scholar
Chen, H. et al. Prevalence and prognostic significance of PD-L1, TIM-3 and B7–H3 expression in endometrial serous carcinoma. Mod. Pathol. https://doi.org/10.1038/s41379-022-01131-6 (2022).
Article PubMed PubMed Central Google Scholar
McMullen, M., Karakasis, K., Rottapel, R. & Oza, A. M. Advances in ovarian cancer, from biology to treatment. Nat. Cancer 2, 6–8. https://doi.org/10.1038/s43018-020-00166-5 (2021).
Article CAS PubMed Google Scholar
Travaglino, A. et al. Impact of endometrial carcinoma histotype on the prognostic value of the TCGA molecular subgroups. Arch. Gynecol. Obstet. 301, 1355–1363. https://doi.org/10.1007/s00404-020-05542-1 (2020).
Article PubMed Google Scholar
Handley, K. F. et al. Classification of high-grade serous ovarian cancer using tumor morphologic characteristics. JAMA Netw. Open 5, e2236626. https://doi.org/10.1001/jamanetworkopen.2022.36626 (2022).
Article PubMed PubMed Central Google Scholar
Yang, S. P. et al. Long-term survival among histological subtypes in advanced epithelial ovarian cancer: Population-based study using the surveillance, epidemiology, and end results database. JMIR Public Health Surveill. 7, e25976. https://doi.org/10.2196/25976 (2021).
Article PubMed PubMed Central Google Scholar
Schildkraut, J. M. et al. Ovarian cancer in women of African ancestry (OCWAA) consortium: A resource of harmonized data from eight epidemiologic studies of African American and white women. Cancer Causes Control 30, 967–978. https://doi.org/10.1007/s10552-019-01199-7 (2019).
Article PubMed PubMed Central Google Scholar
Long, B., Liu, F. W. & Bristow, R. E. Disparities in uterine cancer epidemiology, treatment, and survival among African Americans in the United States. Gynecol. Oncol. 130, 652–659. https://doi.org/10.1016/j.ygyno.2013.05.020 (2013).
Article CAS PubMed PubMed Central Google Scholar
Farley, J., Risinger, J. I., Rose, G. S. & Maxwell, G. L. Racial disparities in blacks with gynecologic cancers. Cancer 110, 234–243. https://doi.org/10.1002/cncr.22797 (2007).
Article PubMed Google Scholar
Jemal, A., Siegel, R., Xu, J. & Ward, E. Cancer statistics, 2010. CA Cancer J. Clin. 60, 277–300. https://doi.org/10.3322/caac.20073 (2010).
Article PubMed Google Scholar
Abu-Rustum, N. R. et al. NCCN Guidelines® insights: Uterine neoplasms, Version 3.2021. J. Natl. Compr. Canc. Netw. 19, 888–895. https://doi.org/10.6004/jnccn.2021.0038 (2021).
Article PubMed Google Scholar
Ledermann, J. A. et al. Newly diagnosed and relapsed epithelial ovarian carcinoma: ESMO Clinical Practice Guidelines for diagnosis, treatment and follow-up. Ann. Oncol. 29, iv259. https://doi.org/10.1093/annonc/mdy157 (2018).
Article CAS PubMed Google Scholar
Armstrong, D. K. et al. Ovarian cancer, Version 2.2020, NCCN Clinical practice guidelines in oncology. J. Natl. Compr. Canc. Netw. 19, 191–226. https://doi.org/10.6004/jnccn.2021.0007 (2021).
Article CAS PubMed Google Scholar
Matei, D. et al. Adjuvant chemotherapy plus radiation for locally advanced endometrial cancer. N. Engl. J. Med. 380, 2317–2326. https://doi.org/10.1056/NEJMoa1813181 (2019).
Article CAS PubMed PubMed Central Google Scholar
de Boer, S. M. et al. Adjuvant chemoradiotherapy versus radiotherapy alone in women with high-risk endometrial cancer (PORTEC-3): Patterns of recurrence and post-hoc survival analysis of a randomised phase 3 trial. Lancet Oncol. 20, 1273–1285. https://doi.org/10.1016/s1470-2045(19)30395-x (2019).
Article PubMed PubMed Central Google Scholar
Kitchener, H., Swart, A. M., Qian, Q., Amos, C. & Parmar, M. K. Efficacy of systematic pelvic lymphadenectomy in endometrial cancer (MRC ASTEC trial): A randomised study. Lancet 373, 125–136. https://doi.org/10.1016/s0140-6736(08)61766-3 (2009).
Article CAS PubMed Google Scholar
Kang, S. et al. Preoperative identification of a low-risk group for lymph node metastasis in endometrial cancer: A Korean gynecologic oncology group study. J. Clin. Oncol. 30, 1329–1334. https://doi.org/10.1200/jco.2011.38.2416 (2012).
Article PubMed Google Scholar
Todo, Y. et al. Combined use of magnetic resonance imaging, CA 125 assay, histologic type, and histologic grade in the prediction of lymph node metastasis in endometrial carcinoma. Am. J. Obstet. Gynecol. 188, 1265–1272. https://doi.org/10.1067/mob.2003.318 (2003).
Article PubMed Google Scholar
Lee, J. Y. et al. Preoperative prediction model of lymph node metastasis in endometrial cancer. Int. J. Gynecol. Cancer 20, 1350–1355. https://doi.org/10.1111/IGC.0b013e3181f44f5a (2010).
Article PubMed Google Scholar
Soliman, P. T. et al. A prospective validation study of sentinel lymph node mapping for high-risk endometrial cancer. Gynecol. Oncol. 146, 234–239. https://doi.org/10.1016/j.ygyno.2017.05.016 (2017).
Article PubMed PubMed Central Google Scholar
Colombo, N. et al. ESMO-ESGO-ESTRO consensus conference on endometrial cancer: Diagnosis, treatment and follow-up. Ann. Oncol. https://doi.org/10.1093/annonc/mdv484 (2016).
Article PubMed PubMed Central Google Scholar

Download references

Funding

This work was supported by the Nature Scientific Foundation of China (grant number: 82060564), Science and Technology Fund of Guizhou Province (No.ZK[2022]270), Guizhou Provincial People’s Hospital Doctoral Fund (No.GZSYBSA[2021]01), and Non-profit Central Research Institute Fund of the Chinese Academy of Medical Science (No.2018PT31048 and 2019PT310013).

Author information

These authors contributed equally: Xiangmei Liu and Shuai Jin.

Authors and Affiliations

Guizhou Medical University, Guiyang, China
Xiangmei Liu
School of Big Health, Guizhou Medical University, Guiyang, China
Shuai Jin
Department of Gynecology and Obstetrics, Guizhou Provincial People’s Hospital, Guiyang, China
Xiangmei Liu & Dan Zi
Department of Gynecology and Obstetrics, The Affiliated People’s Hospital of Guizhou Medical University, Guiyang, China
Dan Zi

Authors

Xiangmei Liu
View author publications
You can also search for this author in PubMed Google Scholar
Shuai Jin
View author publications
You can also search for this author in PubMed Google Scholar
Dan Zi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

XM.L. wrote the main manuscript text and prepared Tables 1–3 and S.J. prepared Figs. 1–5. All authors reviewed the manuscript.

Corresponding author

Correspondence to Dan Zi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Liu, X., Jin, S. & Zi, D. Overall survival prediction models for gynecological endometrioid adenocarcinoma with squamous differentiation (GE-ASqD) using machine-learning algorithms. Sci Rep 13, 8395 (2023). https://doi.org/10.1038/s41598-023-33748-1

Download citation

Received: 27 December 2022
Accepted: 18 April 2023
Published: 24 May 2023
DOI: https://doi.org/10.1038/s41598-023-33748-1

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.