Cracking the code: How we tackle image segmentation in radiation oncology
SP-0368
Abstract
Dealing with public datasets
1UC San Diego Health, Radiation Medicine & Applied Sciences, La Jolla, USA
Show Affiliations
Hide Affiliations
Abstract Text
Introduction: The incorporation of publicly available data is not only useful for providing additional data for model training/evaluation, but also the fundamental base of comparison with any past or future work. Despite this, finding data, or converting the data into a usable form for the research being performed, can be particularly time consuming. Furthermore, the evaluation and curation of the data is often difficult, leaving many disheartened or disinterested.
Methods: In this talk, we will provide a brief background on the importance of incorporating publicly available data, a list of resources available (TCIA, WHO, etc.) for data acquisition, and examples of best practices/pit falls. To illustrate the necessary steps in this process, we will include simple, real-world examples of best practices that can be implemented by the viewer: from data searching, to final curation.
Hopeful takeaways: The viewer should leave with an understanding of the importance of using publicly available data, an ability to confidently find data matching their own desired use cases and have the tools to quickly and efficiently curate the acquired data.