Copenhagen, Denmark
Onsite/Online

ESTRO 2022

Session Item

Monday
May 09
10:30 - 11:30
Room D2
Big data, AI
Ben Heijmen, The Netherlands;
Eduard Gershkevitsh, Estonia
Proffered Papers are presented at one of the sessions scheduled in the main session halls. Each author will present orally for 7 minutes, followed by 3 minutes for discussion. Sessions will be streamed.
Proffered Papers
Interdisciplinary
11:00 - 11:10
TRIPOD level-4 validation for a larynx cancer survival model using distributed learning
OC-0754

Abstract

TRIPOD level-4 validation for a larynx cancer survival model using distributed learning
Authors:

Christian Rønn Hansen1,2,3,4, Matthew Field5, Gareth Price6, Nis Sarup1, Ruta Zukauskaite7, Jørgen Johansen7, Jesper Grau Eriksen8, Farhannah Aly9, Andrew McPartlin6, Lois Holloway10, David Ian Thwaites4, Carsten Brink1,11

1Odense University Hospital, Laboratory of Radiation Physics, Odense, Denmark; 2University of Southern Denmark, Department of Clinical Research, Odense C, Denmark; 3Aarhus University Hospital, Danish Centre for Particle Therapy, Aarhus, Denmark; 4University of Sydney, Institute of Medical Physics, School of Physics, Sydney, Australia; 5University of New South Wales, SouthWest Sydney Clinical School, Sydney, Australia; 6University of Manchester, The Christie NHS Foundation Trust, Manchester, United Kingdom; 7Odense University Hospital, Department of Oncology, Odense, Denmark; 8Aarhus University Hospital, Department of Oncology, Aarhus, Denmark; 9Ingham Institute, Applied Medical Research, Sydney, Australia; 10Liverpool and Macarthur Cancer Therapy Centres, Department of Oncology, Sydney, Australia; 11University of Southern Denmark, Department of Clinical Research, Odense , Denmark

Show Affiliations
Purpose or Objective

Prediction models are needed to support clinical decision making; however, models need to be robustly validated in diverse cohorts to demonstrate generalisability to the clinical community. Healthcare providers also have a competing responsibility to protect sensitive patient data. The current study used distributed learning to validate a larynx cancer survival model in an international multi-centre setting without patient data leaving their own host institute.

Material and Methods

Patients receiving radiotherapy for larynx cancer from 2005-2018 at three international centres were identified to validate the overall survival (OS) model of Egelmeer et al. (Radiother. Oncol. 2011). This model utilises the parameters: time corrected EQD2 tumour dose, haemoglobin at treatment start, sex, age, site (glottic vs non-glottic), tumour and nodal stage. Data imputation for a maximum of one missing variable was allowed. An institution-stratified Cox regression model was developed utilising an open-source privacy-by-design distributed learning network. The validation aimed to test whether the hazard predicted from the original model would benefit from multiplication by a recalibration (RCA) factor. The study is a TRIPOD level 4 validation, where the model is fully supported if RCA=1. During the entire model optimisation, no patient data left their own hospital.

Results

1930 patients were identified, with 1278 suitable for use in the evaluation. The RCA factor determined across the centres was 0.76 [95%CI 0.62-0.91], i.e. showing the original model would benefit from recalibration. The three centres' Harrell C-indices were 0.68±0.06, 0.74±0.02 and 0.70±0.04 (95%CI), indicating a generally acceptable model performance. The distributed learning system produced centre-specific calibration plots and comparisons between observed and predicted Kaplan-Meier curves split by risk group. Following RCA, the data in the calibration plot is close to the identity line, indicating the model's general applicability (fig 1).


The Kaplan-Meier plots (fig 2) show that the gain from the RCA factor is centre-dependent. Due to the stratified approach, baseline risks can also be calculated per centre. Differences between centres are observed, indicating that differences in OS cannot be fully accounted for based only on the included model parameters. Additional parameters are needed to improve the original model’s centre-specific performance to account for this.

 

Conclusion

A TRIPOD type 4 evaluation has been performed of the Egelmeer et al. model using distributed learning to protect patient privacy. It is shown that the model needed RCA to increase the predictive accuracy. However, the improvement in prediction power was institution-dependent, indicating that differences within the cohorts exist beyond those accounted for by the original model parameters. This indicates the need to evaluate the regression value for the included model parameter or include additional parameters, e.g. smoking status and tumour volume.