ESTRO 2022

Session Item

Breast

Session Type: Poster (digital)

Track: Clinical

Machine learning to predict locoregional relapse in pT1-2pN0-1 breast cancer following mastectomy

Stefania Volpe, Italy

Presentation Number: PO-1190

Abstract

Authors:

Stefania Volpe¹, Federica Bellerba², Mattia Zaffaroni¹, Matteo Pepa¹, Lars Johannes Isaksson¹, Giorgia Maimone³, Bianca Menzani³, Ilaria Monaco³, Patrick Maisonneuve⁴, Ida Rosalia Scognamiglio¹, Samantha Dicuonzo¹, Maria Alessia Zerella¹, Damaris Patricia Rojas¹, Giulia Marvaso¹, Cristiana Fodor¹, Sara Gandini², Elena De Momi³, Paolo Veronesi⁵, Giovanni Corso⁵, Viviana Enrica Galimberti⁵, Maria Cristina Leonardi¹, Barbara Alicja Jereczek-Fossa¹

¹Istituto Europeo di Oncologia IRCCS, Radiation Oncology, Milan, Italy; ²Istituto Europeo di Oncologia IRCCS, Experimental Oncology, Milan, Italy; ³Politecnico di Milano, Electronics, Information and Bioengineering, Milan, Italy; ⁴Istituto Europeo di Oncologia IRCCS, Epidemiology and Biostatistics, Milan, Italy; ⁵Istituto Europeo di Oncologia IRCCS, Breast Surgery, Milan, Italy

Purpose or Objective

While post-mastectomy radiotherapy is a mainstay of treatment for patients with locally advanced breast cancer, indications for early stages (namely, pT1-2 pN0-1) are less well defined, and a clear understanding of the factors that predict locoregional relapse (LRR) is needed to better establish clinical indications. This study explores the potential of machine learning (ML)-based algorithms in this clinical setting.

Material and Methods

A total of 2632 patients who underwent mastectomy without subsequent radiotherapy at the European Institute of Oncology IRCCS, Milan, Italy, between 1998 and 2006 were considered for the analysis. Three ML- and statistics-based regression models were trained to predict LRR and to estimate the hazard ratios for all predictor variables. For the ML models, the importance of each clinical feature for the outcome was estimated by permuting out-of-bag (OOB) cases. The concordance index (c-index) was used to compare model performance.
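As an illustration of the two quantities named above, the following is a minimal sketch of Harrell's c-index and of a permutation-based importance score. It is not the authors' implementation: the function names, the `predict_risk` callable, and the toy data layout are hypothetical. Pairs with tied follow-up times are skipped for simplicity, and the feature is permuted across a generic evaluation set rather than across the out-of-bag cases of each tree.

```python
import random
from itertools import combinations


def concordance_index(times, events, risk_scores):
    """Harrell's c-index for right-censored data (simplified tie handling)."""
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Let a be the subject with the shorter follow-up time.
        a, b = (i, j) if times[i] <= times[j] else (j, i)
        # A pair is comparable only if the earlier subject had the event;
        # pairs with identical times are skipped in this sketch.
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if risk_scores[a] > risk_scores[b]:
            concordant += 1.0   # earlier relapse was given the higher risk
        elif risk_scores[a] == risk_scores[b]:
            concordant += 0.5   # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


def permutation_importance(predict_risk, rows, times, events, column, seed=0):
    """Drop in c-index after shuffling one feature column (hypothetical helper)."""
    baseline = concordance_index(times, events, [predict_risk(r) for r in rows])
    shuffled = [r[column] for r in rows]
    random.Random(seed).shuffle(shuffled)
    permuted_rows = []
    for row, value in zip(rows, shuffled):
        new_row = list(row)
        new_row[column] = value
        permuted_rows.append(new_row)
    permuted = concordance_index(
        times, events, [predict_risk(r) for r in permuted_rows]
    )
    return baseline - permuted
```

A c-index of 0.5 corresponds to random ranking and 1.0 to perfect ranking. An importance near zero suggests that the model does not rely on the permuted feature.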

Results

A total of 1823 patients with no missing clinical values were selected for the analysis and randomly split into training and validation sets (1367 and 456 patients, representing 75% and 25% of the included population, respectively). In the test set, the c-index of the Cox proportional hazards (CPH) model was 0.71, while those of the Random Survival Forest (SRF) and the Survival Support Vector Machine (SSVM) were 0.65 and 0.67, respectively. In the validation set, the performance of the CPH model was comparable to that of the SRF and SSVM, with c-indexes of 0.65, 0.65, and 0.67, respectively.
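The 75%/25% split reported above can be reproduced arithmetically with a short sketch. The function name, the seed, and the use of integer patient identifiers are assumptions made for illustration; the abstract does not describe how the random split was implemented.

```python
import random


def split_cohort(patient_ids, train_fraction=0.75, seed=0):
    """Randomly split identifiers into training and validation lists."""
    ids = list(patient_ids)
    random.Random(seed).shuffle(ids)          # reproducible shuffle
    n_train = int(len(ids) * train_fraction)  # rounds down: 1823 * 0.75 -> 1367
    return ids[:n_train], ids[n_train:]


# Hypothetical identifiers for the 1823 complete cases.
train_ids, validation_ids = split_cohort(range(1823))
print(len(train_ids), len(validation_ids))
```

Rounding the training size down leaves the remaining 456 patients for validation, which matches the counts given in the text.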

The most significant contributions to the CPH model are shown in Figure 1A. The SRF confirmed the statistically significant contribution of elevated Ki-67 (>20%), primary tumor stage at surgery (pT), and the administration of any systemic treatment. The combination of risk factors and molecular subtypes also contributed significantly to the model, together with young age (<35 years). A graphical representation of variable importance in the SRF is reported in Figure 1B.

Conclusion

The prediction accuracy of the CPH and ML algorithms, measured by the c-index, was comparable in both the test and validation sets. Overall, the results of the CPH model were largely confirmed by those of the SRF, with clinically meaningful estimates of each variable's contribution to the prediction of LRR. Quantifying the importance of individual parameters in the SSVM is more challenging. External validation would help to confirm these results.