Vienna, Austria

ESTRO 2023

Session Item

Monday
May 15
09:00 - 10:00
Stolz 1
Head & neck
Francesca De Felice, Italy;
Nadine Vatterodt, Denmark
3140
Mini-Oral
Clinical
09:00 - 10:00
Inter-expert observer variance of organs at risk according to the DAHANCA guidelines
Kenneth Jensen, Denmark
MO-0713

Abstract

Inter-expert observer variance of organs at risk according to the DAHANCA guidelines
Authors:

Kenneth Jensen1, Ebbe Lorentzen2, Jesper Grau Eriksen3, Jørgen Johansen4, Niels Gyldenkerne4, Ruta Zukauskaite4, Martin Kjellgren5, Elo Andersen6, Christian Maare6, Camilla Kjær Lønkvist6, Kinga Nowicka-Matus7, Weronicka Szejniuk7, Mohammad Farhadi8, Zaza Ujmajuridze8, Kirsten Marienhagen9, Tanja Stagaard Johansen10, Bente SOmmer Kristensen10, Jeppe Friborg10, Jens Overgaard3, Christian Rønn Hansen2

1Aarhus University Hospital, Danish Center for Particle Therapy, Aarhus, Denmark; 2Odense University Hospital, Laboratory of Radiation Physics, Odense, Denmark; 3Aarhus University Hospital, Department of Experimental Clinical Oncology, Aarhus, Denmark; 4Odense University Hospital, Department of Oncology, Odense, Denmark; 5Aarhus University Hospital, Department of Oncology, Aarhus, Denmark; 6Copenhagen University Hospital Herlev, Department of Oncology, Herlev, Denmark; 7Aalborg University Hospital, Department of Oncology, Aalborg, Denmark; 8Zealand University Hospital, Department of Oncology, Næstved, Denmark; 9Universitetssykehus Nord-Norge, Department of Radiotherapy, Tromsø, Norway; 10Rigshospitalet, Department of Oncology, Copenhagen, Denmark

Show Affiliations
Purpose or Objective

Organ at risk contouring is an important but time-consuming radiotherapy process that has become increasingly automated in recent years through computers and artificial intelligence. This has led to the development of automated contouring algorithms. However, to validate such contouring algorithms, guidelines are needed, and quantification of inter-expert observer variance can help to comprehend the quality of automated contours.

The project aimed to quantify the inter-observer variance of OAR in the head and neck region across DAHANCA oncology experts.

Material and Methods

The DAHANCA OAR contouring guidelines were clarified and discussed through a two-day workshop. The discussions were based on a pre-workshop national contouring audit where 20 expert oncologists contoured 17 OARs of the same patient. Subsequently, 15 experts were asked to contour 26 patients at the workshop to collect multiple contours on the same patient.

For each patient, all contours were compared in pairs using the Dice index, mean surface distance (MSD), and 95% Hausdorff distance. To illustrate 3D spatial areas of larger contouring variance, all OAR were deformable registered to a reference patient, and the average contouring variance was displayed (fig 1).

Results

In total, 3545 OAR were delineated in 26 patients, 17 per patient. The median number of observers per organ was 9 (IQR 7-9). The most familiar organs, like the brain stem, oral cavity, parotid and submandibular glands, esophagus and thyroid, showed high median dice, above 0.8. The most recent guidelines with new OAR, like pharynx constrictor muscles (PCM), lips, and buccal mucosa, had the lowest median dice, below 0.6.

The structures with the highest tissue contrast, like submandibular and thyroid glands, showed median MSD of just above 1 mm, while most other OAR had median MSD below 3 mm (fig 2). The spinal cord showed a very large variance linked to observers' inconsistency in following the guidelines of contouring the full length in the caudal direction.

Most information on inter-observer variation is found in the variance plots (fig 1), from which it is clear that the deep part of the parotid gland had the highest uncertainty. The transition between PCM low and esophagus was not well defined. Likewise, there is roughly one CT slice of variance for the transition between the glottic larynx and supraglottic larynx.  

Conclusion

A national auto-segmentation validation repository of 26 patients has been created. Areas of low image contrast present more contouring variance, and for OAR split into substructures, the transition causes a significantly increased variance.