Skip to main content

Table 6 P-value obtained from Wilcoxon signed-rank test between baseline models and facility-specific training models. Significant results (p<0.05) are denoted with an asterisk

From: Comparison of deep learning networks for fully automated head and neck tumor delineation on multi-centric PET/CT images

 

Regular Unet

2D Retina Unet

3D Retina Unet

DSC

HDavg (mm)

HD95 (mm)

DSC

HDavg (mm)

HD95 (mm)

DSC

HDavg (mm)

HD95 (mm)

MAASTRO

0.15

1.7e-3*

0.01*

7.9e-4*

0.47

0.80

4.8e-3*

0.96

0.69

CRO

3.7e-3*

< 1e-4*

< 1e-4*

0.16

0.04*

0.13

0.03*

0.21

0.39

BERLIN

0.048*

< 1e-4*

9.3e-4*

0.11

4.1e-3*

0.74

0.06

0.04*

0.22