The impact of multicentric datasets for the automated tumor delineation in primary prostate cancer using convolutional neural networks on 18F-PSMA-1007 PET

Holzschuh, Julius C.; Mix, Michael; Freitag, Martin T.; Hölscher, Tobias; Braune, Anja; Kotzerke, Jörg; Vrachimis, Alexis; Doolan, Paul; Ilhan, Harun; Marinescu, Ioana M.; Spohn, Simon K. B.; Fechter, Tobias; Kuhn, Dejan; Gratzke, Christian; Grosu, Radu; Grosu, Anca-Ligia; Zamboglou, C.

doi:10.1186/s13014-024-02491-w

Research
Open access
Published: 07 August 2024

The impact of multicentric datasets for the automated tumor delineation in primary prostate cancer using convolutional neural networks on ¹⁸F-PSMA-1007 PET

Julius C. Holzschuh^1,2,
Michael Mix³,
Martin T. Freitag³,
Tobias Hölscher⁴,
Anja Braune⁵,
Jörg Kotzerke⁵,
Alexis Vrachimis⁶,
Paul Doolan⁷,
Harun Ilhan⁸,
Ioana M. Marinescu¹,
Simon K. B. Spohn¹,
Tobias Fechter^1,9,
Dejan Kuhn^1,9,
Christian Gratzke¹⁰,
Radu Grosu^11,12,
Anca-Ligia Grosu¹ &
…
C. Zamboglou^1,13

Radiation Oncology volume 19, Article number: 106 (2024) Cite this article

467 Accesses
Metrics details

Abstract

Purpose

Convolutional Neural Networks (CNNs) have emerged as transformative tools in the field of radiation oncology, significantly advancing the precision of contouring practices. However, the adaptability of these algorithms across diverse scanners, institutions, and imaging protocols remains a considerable obstacle. This study aims to investigate the effects of incorporating institution-specific datasets into the training regimen of CNNs to assess their generalization ability in real-world clinical environments. Focusing on a data-centric analysis, the influence of varying multi- and single center training approaches on algorithm performance is conducted.

Methods

nnU-Net is trained using a dataset comprising 161 ¹⁸F-PSMA-1007 PET images collected from four distinct institutions (Freiburg: n = 96, Munich: n = 19, Cyprus: n = 32, Dresden: n = 14). The dataset is partitioned such that data from each center are systematically excluded from training and used solely for testing to assess the model's generalizability and adaptability to data from unfamiliar sources. Performance is compared through a 5-Fold Cross-Validation, providing a detailed comparison between models trained on datasets from single centers to those trained on aggregated multi-center datasets. Dice Similarity Score, Hausdorff distance and volumetric analysis are used as primary evaluation metrics.

Results

The mixed training approach yielded a median DSC of 0.76 (IQR: 0.64–0.84) in a five-fold cross-validation, showing no significant differences (p = 0.18) compared to models trained with data exclusion from each center, which performed with a median DSC of 0.74 (IQR: 0.56–0.86). Significant performance improvements regarding multi-center training were observed for the Dresden cohort (multi-center median DSC 0.71, IQR: 0.58–0.80 vs. single-center 0.68, IQR: 0.50–0.80, p < 0.001) and Cyprus cohort (multi-center 0.74, IQR: 0.62–0.83 vs. single-center 0.72, IQR: 0.54–0.82, p < 0.01). While Munich and Freiburg also showed performance improvements with multi-center training, results showed no statistical significance (Munich: multi-center DSC 0.74, IQR: 0.60–0.80 vs. single-center 0.72, IQR: 0.59–0.82, p > 0.05; Freiburg: multi-center 0.78, IQR: 0.53–0.87 vs. single-center 0.71, IQR: 0.53–0.83, p = 0.23).

Conclusion

CNNs trained for auto contouring intraprostatic GTV in ¹⁸F-PSMA-1007 PET on a diverse dataset from multiple centers mostly generalize well to unseen data from other centers. Training on a multicentric dataset can improve performance compared to training exclusively with a single-center dataset regarding intraprostatic ¹⁸F-PSMA-1007 PET GTV segmentation. The segmentation performance of the same CNN can vary depending on the dataset employed for training and testing.

Introduction

Primary prostate cancer (PCa) represents one of the most common cancer types in men with a prevalence of up to 15% in industrialized nations [1]. In the treatment of localized prostate cancer, radiotherapy holds a pivotal position. As there has been an upsurge in more patient-centric approaches such as the use of focal radiation dose escalation, a meticulous delineation of the intraprostatic tumor burden is essential [2].

¹⁸F-PSMA-1007 PET imaging has emerged as a powerful tool for characterizing intraprostatic tumor lesions [3], exhibiting, in some aspects even higher sensitivities compared to prostate multiparametric magnetic resonance imaging (mpMRI) [4].

While manual segmentation can be used for delineating the intraprostatic Gross Tumor Volume (GTV) [5], modern deep learning algorithms, especially convolutional neural networks (CNNs), are significantly transforming the contouring process in radiation oncology across various dimensions [6,7,8,9,10].

In light of this critical shift towards automated, deep learning-based contouring, the feasibility of CNN-based autocontouring has been demonstrated for both ⁶⁸ Ga-PSMA-11 and ¹⁸F-PSMA-1007 PET in prior work [11,12,13].

However, these studies primarily focused on the practical aspects of intraprostatic GTV segmentation, neglecting crucial technical and data-centric considerations necessary for the successful integration of CNNs into clinical workflows.

Undoubtedly, external validation remains a crucial aspect in the evaluation of deep learning models [14, 15], as the heterogeneity of imaging data across different clinical centers poses a significant barrier to the universal adoption and effectiveness of these technologies. The consideration of multicentric training datasets emerges as an important factor in addressing these challenges, aiming to enhance the robustness and accuracy of CNN-based autocontouring solutions. Therefore, the motivation behind our study is twofold: to critically analyze the performance implications of employing a multicentric approach in the training of deep learning models for intraprostatic GTV delineation while also investigating the potential benefits of integrating institution-specific nuances into the training process. By doing so, we strive to contribute valuable insights into the optimization of deep learning applications in the domain of radiation oncology, ultimately facilitating the advancement of patient-centric treatment strategies that are both effective and adaptable to varied clinical settings.

In this study, we systematically assess the importance of using a multicentric training dataset, evaluating the generalizability and performance of CNNs for intraprostatic GTV delineation on ¹⁸F-PSMA-1007 PET across different clinical settings inside the nnUNet Framework [16].

Initially, models are trained on datasets from all centers except one and subsequently tested on a cohort from the excluded center. In a second phase, the models are trained using data from a single center and tested on datasets from all other centers. Comparative analysis of the outcomes is performed against results obtained from a dataset that includes mixed data from all centers.

Through the comparison of different training strategies and validation datasets, our research aims to delineate the critical factors that contribute to the success of CNN-based auto contouring in a clinical context, ultimately guiding future developments in the field of radiation oncology.

Methods

Patients

A total of 161 patients diagnosed with primary prostate cancer (PCa) from four different medical centers (Freiburg: n = 96, Munich: n = 19, Cyprus: n = 32, Dresden: n = 14) were included in this study. Inclusion criteria were biopsy-proven diagnosis of primary PCa without having received prior treatment at the time of imaging. Local ethics committees from all participating centers granted approval or exemption for the study. Patients with high and intermediate-risk PCa were included retrospectively. The detailed characteristics are shown in Table 1 and have been described in prior work [11]. Imaging characteristics can be found in the supplementary.

Table 1 Patient characteristics

Full size table

Preprocessing

For preprocessing, body weight-adapted standardized uptake value (SUV) PET scans were resampled to a voxel size of 2 × 2 × 2 mm³ using B-spline interpolation. The focus was narrowed to the prostate area by cropping to a size of 64 × 64 × 64 voxels based on the prostate contour, enhancing computational efficiency. Input SUV-PET images were further normalized by applying value clipping at a SUV of 20, followed by regular nnU-Net preprocessing [16]. Voxels outside of the prostate were not changed.

Architecture and training

The nnUNet architecture [16] was utilized, taking SUV PET and prostate contour as input while outputting intraprostatic GTV contours. Training the network involved a specialized approach to accommodate the sparsely labeled dataset. Emphasis was placed on intraprostatic lesions using a weighted cross-entropy (CE) loss, thereby refining the model's focus and accuracy for intraprostatic GTV delineation based on the contour of the prostate. Besides training for 200 epochs, nnUNet default schedule was used, leveraging its established efficacy in medical image segmentation without the need for further hyperparameter optimization.

Data splitting

To compare different types of training data we used five-fold cross-validation as depicted in Fig. 1. For single center training comparison to mixed data, we first performed a split excluding the specific center. To ensure a fair comparison, we further partitioned the split into five subsets that are collectively exhaustive and mutually exclusive of the respective split and added training data from the other centers to reach the same amount of training data. We compared the respective results for the leave-one-center-out approach to an adjusted five-fold cross-validation on the whole dataset for the same respective validation set.

Evaluation

Evaluation metrics include volumetric Dice Similarity Coefficient (DSC) [17] and Hausdorff Distance 95% (HD95) [18, 19], providing a comprehensive assessment of segmentation accuracy and following metric selection guidelines [18]. Statistical analysis was conducted using the Wilcoxon signed-rank test and a paired t-test with Bonferroni correction, based on the scipy 1.10.1 and statannot 0.6.0 library.

Contouring

The contouring methodology built upon previously established techniques and the exact process has been described in previous works [5, 11]. A consensus contour was derived from the expertise of two board-certified radiation oncologists, both experienced in intraprostatic GTV contouring and PET image analysis.

Results

Comparing single-center to multicentric training, adjusted for sample size, significant performance improvements regarding multi-center training were observed for the Dresden cohort (multi-center median DSC 0.71, IQR: 0.58–0.8 vs. single-center 0.68, IQR: 0.50–0.8, p < 0.001) and Cyprus cohort (multi-center 0.74, IQR: 0.62–0.83 vs. single-center 0.72, IQR: 0.54–0.82, p < 0.01). While Munich and Freiburg cohorts also indicated enhanced performance for multi-center training over single-center training, results showed no statistical significance (Munich: multi-center DSC 0.74, IQR: 0.60–0.80 vs. single-center 0.72, IQR: 0.59–0.82, p > 0.05; Freiburg: multi-center 0.78, IQR: 0.53–0.87 vs. single-center 0.71, IQR: 0.53–0.83, p > 0.05). A comparative analysis is depicted in Fig. 2.

Regarding the 95% Hausdorff Distance (HD95) metric, comparisons between multi-center and single-center training revealed no significant difference across each comparison (p > 0.05). Specifically, the median HD95 values for Center Dresden were 1.94 (IQR: 1.35–3.22) for multi-center training and 1.73 (IQR: 1.00–3.70) for single-center training. For Center Munich, the values were 1.71 (IQR: 1.20–2.90) for multi-center training versus 1.73 (IQR: 1.01–4.10) for single-center training. Center Cyprus reported HD95 values of 1.60 (IQR: 1.14–2.71) for multi-center training compared to 1.41 (IQR: 1.0–4.15) for single-center training. Lastly, Center Freiburg exhibited median HD95 values of 1.41 (IQR: 1.41–2.95) for multi-center training and 1.85 (IQR: 1.14–3.83) for single-center training.

Utilizing a five-fold cross-validation on the entire cohort of all centers yielded a median DSC of 0.76 (IQR: 0.64–0.84). When CNNs were trained excluding data from each specific center, results indicated a median DSC of 0.74 (IQR: 0.56–0.86). Statistical analysis revealed no significant difference in DSC values between these two methodologies (p = 0.18). A detailed center-wise analysis is depicted in Fig. 3.

Concerning HD95 metric, cross-validation exhibited a median HD95 of 1.73 mm (IQR: 1–2). Models trained while excluding one center at a time demonstrated a median HD95 of 1.41 mm (IQR: 1–4) on the whole dataset, with these differences reaching statistical significance (p < 0.01).

Volumetric analysis showed a median of 0.87 ml (IQR: 0.41–1.63) for manual contouring, a median of 1 ml (IQR: 0.58–1.60) for cross-validation and a median of 0.86 ml (IQR: 0.43–1.61) for CNNs trained on utilizing a leave-one-center-out approach respectively, showing no statistically significant differences (p > 0.05).

Model training required approximately 40–60 s per epoch. Inference for a single prediction on resampled data was achieved in under one second.

Discussion

In the present study, we undertook a comprehensive analysis evaluating the ability of CNNs to generalize to novel datasets in the context of intraprostatic GTV delineation for ¹⁸F-PSMA-1007 PET imaging. Our findings indicate that training the model with datasets from multiple centers significantly improves performance compared to training solely on data from a single center, when compared on an equivalent amount of training data. This improvement is anticipated, as exposure to multicentric data during training enables the model to encounter a broader variety of data representations, thereby enhancing its generalization capacity. Additionally, although the difference in performance between the Leave-One-Center-Out approach and mixed training did not reach the required significance level, the mixed training methodology also exhibited slightly superior performance. Overall, results suggest that integrating data from all centers in the training process, despite sometimes only yielding a small benefit, can contribute to a more robust model by providing a more diverse training dataset.

The difficulty of training deep learning models that perform well on novel, unseen data represents a fundamental challenge within the current landscape of machine learning [23]. Factors such as AI bias [21], shortcut learning [21], distribution shifts [22] and heterogenous acquisition and data annotation [19] further complicate this issue. Even when performing the same method on similar tasks, results can vary greatly depending on the dataset used.

This phenomenon is evident across various studies regarding automated tumor segmentation in PSMA-PET. For intraprostatic GTV segmentation, Kostyszyn et al. (2021) [12] demonstrated that a CNN achieved median DSC of 0.81 to 0.84 across internal and external independent validation cohorts on ⁶⁸ Ga-PSMA-11 and ¹⁸F-PSMA-1007 PET scans. Adding to these findings, Ghezzo et al. (2023) [13], conducted an independent external validation of Kostyszyn et al.'s method, observing lower median DSC values ranging from 0.72 to 0.77, with mean DSC values between 0.69 and 0.71 for ⁶⁸ Ga-PSMA-11 PET.

Holzschuh et al. (2023) [11], reported a range of median DSC values, from 0.70 to 0.82, for ¹⁸F-PSMA-1007, ¹⁸F-DCFPyL, and ⁶⁸ Ga-PSMA-11. Notably, in this study an external validation was conducted independently by another institute.

Leung et al. reported a mean DSC of 0.7 for ¹⁸F-DCFPyL PET [23], though it is unknown if only intraprostatic or whole-body lesions are considered.

Regarding whole body PSMA PET, Kendrick et al. (2022) [24] reported a median DSC of 0.5 in a single-center study. Huang et al. (2023) [25] report mean DSC values ranging from 0.59 to 0.63 on ⁶⁸ Ga-PSMA-11 PET. Jafari et al. [26] presented results for whole-body ⁶⁸ Ga-PSMA-11 PET, showing voxel-level mean DSC values of 0.65 to 0.7 for different independent centers.

Notably, our results are consistent with previously observed data ranges for automated tumor segmentation in PSMA-PET imaging. Our analysis also reveals that the performance of CNNs in delineating GTV is influenced by the dataset employed for training and testing. This variance also underscores the complexity of machine learning models in adapting to new, unseen data, highlighting the critical importance of well annotated, diverse and representative training datasets to improve model generalization, which is particularly relevant in the context of medical imaging.

However, despite the potential for slight performance decrements, our study also demonstrates that models can achieve commendable performance in certain cases when trained with mixed data from multiple centers, even if the quantity of data is small (n = 19), yielding a median DSC of 0.74. This underscores the data-dependent nature of AI experiments, which can lead to over- or under-estimation of the trained segmentation model's final performance. While the nnU-Net employed in our study aims to maintain hyperparameter invariance to data due to its end-to-end design, this aspect gains particular significance in the context of comparing models that have undergone individual hyperparameter optimization.

Regarding limitations, our study's conclusions are inherently confined to the delineation of intraprostatic GTV using the ¹⁸F-PSMA-1007 PET tracer within the nnUNet framework. Although, to the best of our knowledge, this represents one of the most extensive cohorts to date concerning ¹⁸F-PSMA-1007 PET imaging, it is imperative to incorporate more data in future research as cohorts from individual centers are relatively small, necessitating verification of these results across larger cohorts. Moreover, the inherent challenges associated with image segmentation metrics must be acknowledged [18, 27]. Also, tumor size can represent a potential source of bias that could affect segmentation results. In our study, the cohorts comprised patients at different tumor stages, which were not homogenous across different centers. For instance, no stage I patients were included in the Freiburg and Munich cohorts while present in other cohorts. This resulted in potential variability in tumor sizes across the groups.

Additionally, the exploration of alternative deep learning architectures is warranted in subsequent studies, given that our analysis was limited to the nnU-Net architecture. While our cohorts offer a diverse clinical spectrum, results may also vary across different patient collectives.

Overall, our research highlights the importance of multicentric training datasets in enhancing the generalization capabilities of CNNs, underscoring the relationship between dataset diversity and the performance of machine learning models.

Conclusion

CNNs trained for auto contouring intraprostatic GTV in ¹⁸F-PSMA-1007 PET on a diverse dataset from multiple centers mostly generalize well to unseen data from other centers. Training with a multicentric dataset can improve performance compared to training exclusively on single-center datasets regarding intraprostatic ¹⁸F-PSMA-1007 PET GTV segmentation. The segmentation performance of the same CNN can vary depending on the dataset employed for training and testing.

Data availability

No datasets were generated or analysed during the current study.

Abbreviations

AI:: Artificial intelligence
CNN:: Convolutional neural network
CE:: Cross-entropy
CT:: Computer tomography
DSC:: Dice similarity coefficient
GTV:: Gross tumor volume
HD95:: Hausdorff distance 95%
IQR:: Interquartile range
mpMRI:: Multiparametric magnetic resonance imaging
PCa:: Primary prostate cancer
PET:: Positron emission tomography
SUV:: Standardized uptake value

References

Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. “Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68(6):394–424. https://doi.org/10.3322/caac.21492.
Article PubMed Google Scholar
Spohn SKB, et al. “The maximum standardized uptake value in patients with recurrent or persistent prostate cancer after radical prostatectomy and PSMA-PET-guided salvage radiotherapy-a multicenter retrospective analysis. Eur J Nucl Med Mol Imaging. 2022;50(1):218–27. https://doi.org/10.1007/s00259-022-05931-5.
Article CAS PubMed PubMed Central Google Scholar
Lapa C, et al. “Value of PET imaging for radiation therapy. Strahlenther Onkol. 2021;197(9):1–23. https://doi.org/10.1007/s00066-021-01812-2.
Article PubMed Google Scholar
Eiber M, et al. Simultaneous 68Ga-PSMA HBED-CC PET/MRI Improves the Localization of Primary Prostate Cancer. Eur Urol. 2016;70(5):829–36. https://doi.org/10.1016/j.eururo.2015.12.053.
Article CAS PubMed Google Scholar
Spohn SKB, et al. Comparison of manual and semi-automatic [18F]PSMA-1007 PET based contouring techniques for intraprostatic tumor delineation in patients with primary prostate cancer and validation with histopathology as standard of reference. Front Oncol. 2020;10: 600690. https://doi.org/10.3389/fonc.2020.600690.
Article PubMed PubMed Central Google Scholar
Hindocha S, et al. Artificial intelligence for radiotherapy auto-contouring: current use, perceptions of and barriers to implementation. Clin Oncol. 2023;35(4):219–26. https://doi.org/10.1016/j.clon.2023.01.014.
Article CAS Google Scholar
Tillman-Schwartz E, Tillman G, Tansky JY, Paly JJ, Efstathiou JA, Wang Y. Prospective analysis of a deep learning auto-contouring model for definitive radiation of localized prostate cancer. Int J Radiat Oncol Biol Phys. 2022;114(3):e105–6. https://doi.org/10.1016/j.ijrobp.2022.07.905.
Article Google Scholar
Yousefirizi F, Jha AK, Brosch-Lenz J, Saboury B, Rahmim A. Toward high-throughput artificial intelligence-based segmentation in oncological PET imaging. PET Clinics. 2021;16(4):577–96. https://doi.org/10.1016/j.cpet.2021.06.001.
Article PubMed Google Scholar
Hoque SMH, et al. Clinical use of a commercial artificial intelligence-based software for autocontouring in radiation therapy: geometric performance and dosimetric impact. Cancers. 2023. https://doi.org/10.3390/cancers15245735.
Article PubMed PubMed Central Google Scholar
Doolan PJ, et al. A clinical evaluation of the performance of five commercial artificial intelligence contouring systems for radiotherapy. Front Oncol. 2023. https://doi.org/10.3389/fonc.2023.1213068.
Article PubMed PubMed Central Google Scholar
Holzschuh JC, et al. “Deep learning based automated delineation of the intraprostatic gross tumour volume in PSMA-PET for patients with primary prostate cancer. Radiother Oncol. 2023;188: 109774. https://doi.org/10.1016/j.radonc.2023.109774.
Article CAS PubMed Google Scholar
Kostyszyn D, et al. Intraprostatic tumor segmentation on PSMA PET images in patients with primary prostate cancer with a convolutional neural network. J Nucl Med. 2021;62(6):823–8. https://doi.org/10.2967/jnumed.120.254623.
Article CAS PubMed PubMed Central Google Scholar
S. Ghezzo et al., “External validation of a convolutional neural network for the automatic segmentation of intraprostatic tumor lesions on 68Ga-PSMA PET images. Frontiers in Medicine, vol. 10, 2023, Accessed 13 Jan 2024. https://www.frontiersin.org/articles/https://doi.org/10.3389/fmed.2023.1133269
Varoquaux G, Cheplygina V. Machine learning for medical imaging: methodological failures and recommendations for the future. NPJ Digit Med. 2022. https://doi.org/10.1038/s41746-022-00592-y.
Article PubMed PubMed Central Google Scholar
Steyerberg EW, Harrell FE. Prediction models need appropriate internal, internal–external, and external validation. J Clin Epidemiol. 2016;69:245–7. https://doi.org/10.1016/j.jclinepi.2015.04.005.
Article PubMed Google Scholar
Isensee F, Jaeger PF, Kohl SAA, Petersen J, Maier-Hein KH. nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nat Methods. 2021. https://doi.org/10.1038/s41592-020-01008-z.
Article PubMed Google Scholar
Dice LR. Measures of the amount of ecologic association between species. Ecology. 1945;26(3):297–302. https://doi.org/10.2307/1932409.
Article Google Scholar
Maier-Hein L, et al. Metrics reloaded: recommendations for image analysis validation. Nat Methods. 2024. https://doi.org/10.1038/s41592-023-02151-z.
Article PubMed Google Scholar
Huttenlocher DP, Klanderman GA, Rucklidge WJ. Comparing images using the Hausdorff distance. IEEE Trans Pattern Anal Mach Intell. 1993;15(9):850–63. https://doi.org/10.1109/34.232073.
Article Google Scholar
Gichoya JW, et al. AI pitfalls and what not to do: mitigating bias in AI. Br J Radiol. 2023;96(1150):20230023. https://doi.org/10.1259/bjr.20230023.
Article PubMed PubMed Central Google Scholar
Geirhos R, et al. Shortcut learning in deep neural networks. Nat Mach Intell. 2020. https://doi.org/10.1038/s42256-020-00257-z.
Article Google Scholar
Choi Y, et al. Translating AI to clinical practice: overcoming data shift with explainability. Radiographics. 2023;43(5): e220105. https://doi.org/10.1148/rg.220105.
Article PubMed Google Scholar
Leung K, et al. A fully automated deep-learning based method for lesion segmentation in 18F-DCFPyL PSMA PET images of patients with prostate cancer. J Nucl Med. 2019;60(supplement 1):399–399.
Google Scholar
Kendrick J, Francis RJ, Hassan GM, Rowshanfarzad P, Ong JSL, Ebert MA. Fully automatic prognostic biomarker extraction from metastatic prostate lesion segmentations in whole-body [68Ga]Ga-PSMA-11 PET/CT images. Eur J Nucl Med Mol Imaging. 2022;50(1):67–79. https://doi.org/10.1007/s00259-022-05927-1.
Article CAS PubMed PubMed Central Google Scholar
Huang B, et al. Deep learning–based whole-body characterization of prostate cancer lesions on [68Ga]Ga-PSMA-11 PET/CT in patients with post-prostatectomy recurrence. Eur J Nucl Med Mol Imaging. 2023. https://doi.org/10.1007/s00259-023-06551-3.
Article PubMed PubMed Central Google Scholar
Jafari E, et al. A convolutional neural network–based system for fully automatic segmentation of whole-body [68Ga]Ga-PSMA PET images in prostate cancer. Eur J Nucl Med Mol Imaging. 2023. https://doi.org/10.1007/s00259-023-06555-z.
Article PubMed PubMed Central Google Scholar
Reinke A, et al. Understanding metric-related pitfalls in image analysis validation. Nat Methods. 2024. https://doi.org/10.1038/s41592-023-02150-0.
Article PubMed Google Scholar

Download references

Acknowledgements

This work was partially funded by the Agora 3.0 project (Cypriot Research and Innovation Foundation, STRATEGIC INFRASTRUCTURES/1222/0186). JCH has been funded by a fellowship of the DKFZ Clinician Scientist Program, supported by the Dieter Morszeck Foundation.

Funding

Open Access funding enabled and organized by Projekt DEAL. JCH has been funded by a fellowship of the DKFZ Clinician Scientist Program, supported by the Dieter Morszeck Foundation. This work was partially funded by the Agora 3.0 project (Cypriot Research and Innovation Foundation, STRATEGIC INFRASTRUCTURES/1222/0186).

Author information

Authors and Affiliations

Department of Radiation Oncology, Faculty of Medicine, Medical Center – University of Freiburg, University of Freiburg, German Cancer Consortium (DKTK), Partner Site DKTK, Freiburg, Germany
Julius C. Holzschuh, Ioana M. Marinescu, Simon K. B. Spohn, Tobias Fechter, Dejan Kuhn, Anca-Ligia Grosu & C. Zamboglou
Division of Radiology, German Cancer Research Center (DKFZ), Heidelberg, Germany
Julius C. Holzschuh
Department of Nuclear Medicine, Faculty of Medicine, Medical Center – University of Freiburg, Freiburg, Germany
Michael Mix & Martin T. Freitag
Department of Radiotherapy and Radiation Oncology, Faculty of Medicine, University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany
Tobias Hölscher
Department of Nuclear Medicine, Faculty of Medicine, University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany
Anja Braune & Jörg Kotzerke
Department of Nuclear Medicine, German Oncology Center, European University Cyprus, Limassol, Cyprus
Alexis Vrachimis
Department of Medical Physics, German Oncology Center, European University Cyprus, Limassol, Cyprus
Paul Doolan
Department of Nuclear Medicine, University Hospital - Ludwig-Maximilians-Universität, Munich, Germany
Harun Ilhan
Division of Medical Physics, Department of Radiation Oncology, Faculty of Medicine, Medical Center–University of Freiburg, German Cancer Consortium (DKTK), Partner Site DKTK, Freiburg, Germany
Tobias Fechter & Dejan Kuhn
Department of Urology, Medical Center–University of Freiburg, Freiburg, Germany
Christian Gratzke
Cyber-Physical Systems Division, Institute of Computer Engineering and Faculty of Informatics, Technical University of Vienna, Vienna, Austria
Radu Grosu
Department of Computer Science, State University of New York at Stony Brook, Stony Brook, NY, USA
Radu Grosu
Department of Radiation Oncology, German Oncology Center, European University Cyprus, Limassol, Cyprus
C. Zamboglou

Authors

Julius C. Holzschuh
View author publications
You can also search for this author in PubMed Google Scholar
Michael Mix
View author publications
You can also search for this author in PubMed Google Scholar
Martin T. Freitag
View author publications
You can also search for this author in PubMed Google Scholar
Tobias Hölscher
View author publications
You can also search for this author in PubMed Google Scholar
Anja Braune
View author publications
You can also search for this author in PubMed Google Scholar
Jörg Kotzerke
View author publications
You can also search for this author in PubMed Google Scholar
Alexis Vrachimis
View author publications
You can also search for this author in PubMed Google Scholar
Paul Doolan
View author publications
You can also search for this author in PubMed Google Scholar
Harun Ilhan
View author publications
You can also search for this author in PubMed Google Scholar
Ioana M. Marinescu
View author publications
You can also search for this author in PubMed Google Scholar
Simon K. B. Spohn
View author publications
You can also search for this author in PubMed Google Scholar
Tobias Fechter
View author publications
You can also search for this author in PubMed Google Scholar
Dejan Kuhn
View author publications
You can also search for this author in PubMed Google Scholar
Christian Gratzke
View author publications
You can also search for this author in PubMed Google Scholar
Radu Grosu
View author publications
You can also search for this author in PubMed Google Scholar
Anca-Ligia Grosu
View author publications
You can also search for this author in PubMed Google Scholar
C. Zamboglou
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

CZ and JCH oversaw the study. JCH performed experiments and analysis. JCH wrote the draft of the manuscript. CZ, MF, AB, TH, JK, AV, HI, CG, ALG, provided the patient data. MM, PD, IM, SS, TF, DK, RG, ALG helped with advisory and visualization. All authors corrected and edited the final manuscript. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Julius C. Holzschuh.

Ethics declarations

Ethics

Local ethics committees from all participating centers granted approval or exemption for the study.

Consent for publication

Not Applicable.

Conflict of Interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Holzschuh, J.C., Mix, M., Freitag, M.T. et al. The impact of multicentric datasets for the automated tumor delineation in primary prostate cancer using convolutional neural networks on ¹⁸F-PSMA-1007 PET. Radiat Oncol 19, 106 (2024). https://doi.org/10.1186/s13014-024-02491-w

Download citation

Received: 04 April 2024
Accepted: 17 July 2024
Published: 07 August 2024
DOI: https://doi.org/10.1186/s13014-024-02491-w

The impact of multicentric datasets for the automated tumor delineation in primary prostate cancer using convolutional neural networks on 18F-PSMA-1007 PET

Abstract

Purpose

Methods

Results

Conclusion

Introduction

Methods

Patients

Preprocessing

Architecture and training

Data splitting

Evaluation

Contouring

Results

Discussion

Conclusion

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics

Consent for publication

Conflict of Interest

Additional information

Publisher's Note

Supplementary Information

Additional file 1

Rights and permissions

About this article

Cite this article

Share this article

Radiation Oncology

Contact us

The impact of multicentric datasets for the automated tumor delineation in primary prostate cancer using convolutional neural networks on ¹⁸F-PSMA-1007 PET