Clinical evaluation of multi-atlas based segmentation of lymph node regions in head and neck and prostate cancer patients
- Carl Sjöberg1, 2Email author,
- Martin Lundmark†3,
- Christoffer Granberg†4,
- Silvia Johansson1,
- Anders Ahnesjö1 and
- Anders Montelius1, 3
© Sjöberg et al.; licensee BioMed Central Ltd. 2013
Received: 24 June 2013
Accepted: 28 September 2013
Published: 3 October 2013
Semi-automated segmentation using deformable registration of selected atlas cases consisting of expert segmented patient images has been proposed to facilitate the delineation of lymph node regions for three-dimensional conformal and intensity-modulated radiotherapy planning of head and neck and prostate tumours. Our aim is to investigate if fusion of multiple atlases will lead to clinical workload reductions and more accurate segmentation proposals compared to the use of a single atlas segmentation, due to a more complete representation of the anatomical variations.
Atlases for lymph node regions were constructed using 11 head and neck patients and 15 prostate patients based on published recommendations for segmentations. A commercial registration software (Velocity AI) was used to create individual segmentations through deformable registration. Ten head and neck patients, and ten prostate patients, all different from the atlas patients, were randomly chosen for the study from retrospective data. Each patient was first delineated three times, (a) manually by a radiation oncologist, (b) automatically using a single atlas segmentation proposal from a chosen atlas and (c) automatically by fusing the atlas proposals from all cases in the database using the probabilistic weighting fusion algorithm. In a subsequent step a radiation oncologist corrected the segmentation proposals achieved from step (b) and (c) without using the result from method (a) as reference. The time spent for editing the segmentations was recorded separately for each method and for each individual structure. Finally, the Dice Similarity Coefficient and the volume of the structures were used to evaluate the similarity between the structures delineated with the different methods.
For the single atlas method, the time reduction compared to manual segmentation was 29% and 23% for head and neck and pelvis lymph nodes, respectively, while editing the fused atlas proposal resulted in time reductions of 49% and 34%. The average volume of the fused atlas proposals was only 74% of the manual segmentation for the head and neck cases and 82% for the prostate cases due to a blurring effect from the fusion process. After editing of the proposals the resulting volume differences were no longer statistically significant, although a slight influence by the proposals could be noticed since the average edited volume was still slightly smaller than the manual segmentation, 9% and 5%, respectively.
Segmentation based on fusion of multiple atlases reduces the time needed for delineation of lymph node regions compared to the use of a single atlas segmentation. Even though the time saving is large, the quality of the segmentation is maintained compared to manual segmentation.
In radiotherapy for local tumour control all anatomical structures of interest for dose quantification must be delineated before the treatment can start. Small structures with well-defined edges can be delineated manually in a short period of time without large inter-observer variation. Structures or organs such as lymph node regions that are hard to discriminate in CT images implies use of segmentation protocols based on indirect characteristics of nearby, visible anatomical structures [1, 2]. Manual delineation of these structures can be very time-consuming, especially if they are large and irregularly shaped. Atlas based segmentation, where deformable registration is used to create delineations of regions of interest for a new image by transforming pre-made delineations of the corresponding structures in existing images , is assumed to reduce segmentation workload and time compared to manual segmentation under such conditions [4–6].
When the anatomy change of a patient between two image sets is small a single registration of an image set might be adequate to transform delineations that can be clinically approved with little or no editing. This can be the case for example if a patient is repeatedly imaged at different fractions of a radiotherapy treatment, or imaged by different modalities using similar patient setup procedure. For new patients where no previous delineated images exist, deformable registration to the new image using images from other patients as an atlas can be performed. In the latter case, minor or major editing of the resulting delineations is usually required to achieve a clinically acceptable result [4–6]. However, the required editing workload can be significantly shorter than to manually segment the new image from scratch.
In this work we will use the term “proposal” to denote a segmentation result before any manual editing is used to improve it. Several studies have shown that multi-atlas segmentation, where proposals from several different atlas patients are fused to yield a resulting proposal, can improve the result compared to the use of a single atlas [7–9]. We compare the segmentation time gains for editing of single atlas and multi-atlas segmentation proposals versus a manual segmentation. For multi-atlas fusion we use the recently introduced probabilistic weighting fusion algorithm (PWF) . We focus on the target volume of lymph node regions for head and neck cancer (HNC) patients, and pelvic lymph node regions for prostate cancer patients. Besides work load analysis the similarity of the structures from the different methods are analysed using the Dice similarity coefficient and the volume of the structures.
The atlas database for the HNC cases consisted of image sets of 11 patients with different diagnoses in various stages randomly chosen from the clinical data. For use as a single atlas, one male and one female patient image set were subjectively selected by a radiation oncologist as being the most representative out of the available cases. The same male atlas was used for all male patients, and the same female atlas for all female patients. For all HNC atlas cases the lymph node regions of the patients were segmented in accordance with a generally adapted guideline for head and neck target and risk organ delineation .
For the prostate patients, an atlas database was established containing 15 prostate patient image sets randomly chosen from the clinical data. Again, a representative case with an anatomy that was least deformed by malignancies was selected for use as a single atlas. The pelvic lymph node regions of the prostate patients were delineated consistently following the protocol proposed by Taylor et al. .
For the HNC patients, two female and eight male patients not part of the atlas material were randomly selected from the available clinical data and used as test cases. Ten prostate cancer patients, also different from the patients used for the atlas database, with pelvic lymph node involvement were included in this study. The same segmentation guidelines as used for the atlas images were used for the manual segmentation of the test patient images. All delineations were created in a commercial treatment planning system (Oncentra 4.0).
Probabilistic weighting fusion
All atlas segmentations were exported from VelocityAI and imported into an in-house developed implementation of the probabilistic weighting fusion algorithm . The resulting fused proposal from this multi-atlas application is a weighted mean shape using weights based on the local registration success, as measured by an image similarity measure. The normalized cross correlation was used as image similarity measure, calculated for a volume consisting of the deformed structure including a uniform dilation margin of 10 mm for the head and neck patients and 50 mm for the prostate patients. The PWF method uses as input parameter the ratio k/s, where k is the proportionality of segmentation quality to image registration quality, and s is the expected spread of segmentation quality. The fused segmentations are constructed by weighted superposition of distance maps with weights calculated from the k/s parameter according to section 2.5 of . For optimal performance k/s should be optimized from a large material. However, in lack of such data, preliminary testing indicated that k/s could be set to 0.5 for the head and neck patients and to 20 for the prostate patients. The lower limit of k/s is zero, which is equivalent to use of equal, unbiased weights, while an infinitely high value of k/s results in selection of the most similar registration as output.
For each test patient, both the single atlas proposal and the fused multi-atlas proposal were imported into the treatment planning system where all editing of the delineations were performed. This assured use of identical editing tools, familiar to the radiation oncologist and independent of registration method. The same radiation oncologist performed all delineations as well as all segmentation editing. The structures delineated from scratch and used as reference were created using a “freehand drawing tool” and a “polygon drawing tool”. For editing of the atlas based proposals mainly the “pearl drawing tool” was used that provides a circular brush (“pearl”) of variable diameter which can be moved to “push” the contour lines to enlarge or diminish the segmented region. This tool was found to be particularly effective for adjusting smoothly curved contour lines. The atlas proposals and the edited segmentations can be seen in the right panels of Figures 1 and 2.
The main objective for this study was to investigate how much time the radiation oncologist potentially could save by using atlas based segmentation tools. For this purpose, the radiation oncologist recorded the manual contouring time as well as the time needed for editing of each individual structure for every patient. To reduce bias from editing the same structure multiple times, a minimum of one week was spent between each of the three segmentation occasions for a patient. Also, the different segmentation editing options were performed in varying order. The auxiliary time for studying medical records, MRI and PET images, etc., was omitted from the scored time.
Evaluation of structure volumes and overlap
The commonly used Dice Similarity Coefficient (DSC)  was used for evaluation of segmentation quality, and to quantify the similarity between different segmentations of a structure. DSC is a measure of the spatial overlap between structures ranging from 0 to 1, where 0 means no overlap and 1 equals a perfect match. The DSC was calculated for the clinical data based on the number of voxels contained in every structure. A voxel was deemed to be inside the structure if the voxel centre was located inside the structure.
A second evaluation tool was to determine the volumes of the segmented structures. The volumes could be extracted from both the treatment planning system and the registration software but, since the two systems gave slightly different results due to different handling of partial voxel volumes, the volumes from the registration software only were used to ensure consistency.
Editing single atlas
Editing fused atlas
Head & Neck
Segmentation time gain
Editing single atlas vs. manual
Editing fused atlas vs. manual
Editing fused atlas vs. editing single atlas
Head & Neck
29%*, p = 0.044
49%*, p = 1.5e-4
31%*, p = 0.045
34%*, p = 0.012
Edited single atlas
Edited fused atlas
Head & Neck
0.74*, p = 0.047
1.26*, p = 0.026
0.82*, p = 0.019
Dice similarity coefficient
In this work, one single atlas was selected per anatomical site and sex. If instead this single atlas would be selected manually for each registration, a better registration result might be possible. To select this atlas quickly might however be difficult. Another problem with this approach is that since this single atlas might be more similar in some areas than others, segmentation similarities will still probably be lower than using a fused atlas method.
Of interest is to note that the volume of the fused atlas proposal was on average smaller than the single atlas proposal. The pearl editing tool was regarded by the radiation oncologist as somewhat easier to use when starting with a small volume where the borders are pushed outwards to the desired position rather than the opposite, which would mean that smaller proposals would be preferred over larger.
The segmentation time reduction was largest for the head and neck lymph nodes. This is most likely due to the complexity and individual variation for these structures. The pelvic lymph nodes are closely linked to the neighbouring bone structures. This facilitates both manual segmentation and editing, leading to a smaller reduction of segmentation times than for the head and neck nodes. However, even if the magnitude of time saved per patient for the pelvic lymph nodes is modest, for a large throughput of patients the gain can still be of importance.
The accuracy measures showed that on average most atlas based proposals were reasonable but no segmentation proposal was approved by the radiation oncologist without any further editing. Thus, fully automated segmentation may still not be feasible. Volume measures and DSC values together gave a good picture of the segmentation accuracy results, which was confirmed in visual inspections.
Atlas based segmentations of lymph node regions for prostate and head and neck patients significantly saves time, on average, for the radiation oncologists compared to manually segmenting each patient. This is demonstrated even when segmentation proposals need to be extensively edited. Fused atlas proposals are generally superior to single atlas proposals, both as measured by a reduction in segmentation time and as measured by a higher binary overlap.
The authors thank Ulf Isacsson for valuable help with the study.
- Grégoire V, Levendag P, Ang KK, Bernier J, Braaksma M, Budach V, Chao C, Coche E, Cooper JS, Cosnard G, et al.: CT-based delineation of lymph node levels and related CTVs in the node-negative neck: DAHANCA, EORTC, GORTEC, NCIC, RTOG consensus guidelines. Radiother Oncol 2003, 69: 227-236. 10.1016/j.radonc.2003.09.011View ArticlePubMedGoogle Scholar
- Taylor A, Rockall AG, Powell MEB: An atlas of the pelvic lymph node regions to aid radiotherapy target volume definition. Clin Oncol 2007, 19: 542-550. 10.1016/j.clon.2007.05.002View ArticleGoogle Scholar
- Rohlfing T, Brandt R, Menzel R, Russakoff DB, Maurer J, Calvin R: Quo Vadis, Atlas-Based Segmentation? In The Handbook of Medical Image Analysis - Volume III: Registration Models. New York, NY: Kluwer Academic /Plenum Publishers; 2005:435-486.View ArticleGoogle Scholar
- Young AV, Wortham A, Wernick I, Evans A, Ennis RD: Atlas-Based Segmentation Improves Consistency and Decreases Time Required for Contouring Postoperative Endometrial Cancer Nodal Volumes. International Journal of Radiation Oncology*Biology*Physics 2011, 79: 943-947. 10.1016/j.ijrobp.2010.04.063View ArticleGoogle Scholar
- Teguh DN, Levendag PC, Voet PWJ, Al-Mamgani A, Han X, Wolf TK, Hibbard LS, Nowak P, Akhiat H, Dirkx MLP, et al.: Clinical Validation of Atlas-Based Auto-Segmentation of Multiple Target Volumes and Normal Tissue (Swallowing/Mastication) Structures in the Head and Neck. International journal of radiation oncology, biology, physics 2011, 81: 950-957. 10.1016/j.ijrobp.2010.07.009View ArticlePubMedGoogle Scholar
- Stapleford LJ, Lawson JD, Perkins C, Edelman S, Davis L, McDonald MW, Waller A, Schreibmann E, Fox T: Evaluation of automatic atlas-based lymph node segmentation for head-and-neck cancer. Int J Radiat Oncol Biol Phys 2010, 77: 959-966. 10.1016/j.ijrobp.2009.09.023View ArticlePubMedGoogle Scholar
- Rohlfing T, Brandt R, Menzel R, Maurer CR Jr: Evaluation of atlas selection strategies for atlas-based image segmentation with application to confocal microscopy images of bee brains. Neuroimage 2004, 21: 1428-1442. 10.1016/j.neuroimage.2003.11.010View ArticlePubMedGoogle Scholar
- Isgum I, Staring M, Rutten A, Prokop M, Viergever MA, Van Ginneken B: Multi-atlas-based segmentation with local decision fusion–application to cardiac and aortic segmentation in CT scans. IEEE Trans Med Imaging 2009, 28: 1000-1010.View ArticlePubMedGoogle Scholar
- Heckemann RA, Hajnal JV, Aljabar P, Rueckert D, Hammers A: Automatic anatomical brain MRI segmentation combining label propagation and decision fusion. Neuroimage 2006, 33: 115-126. 10.1016/j.neuroimage.2006.05.061View ArticlePubMedGoogle Scholar
- Sjöberg C, Ahnesjö A: Multi-atlas based segmentation using probabilistic label fusion with adaptive weighting of image similarity measures. Comput Methods Programs Biomed 2013, 110: 308-319. 10.1016/j.cmpb.2012.12.006View ArticlePubMedGoogle Scholar
- Mattes D, Haynor DR, Vesselle H, Lewellyn TK, Eubank W: Nonrigid multimodality image registration. Proc SPIE Medical Imaging 2001, 4322: 1609-1620. 10.1117/12.431046View ArticleGoogle Scholar
- Dice LR: Measures of the Amount of Ecologic Association Between Species. Ecology 1945, 26: 297-302. 10.2307/1932409View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.