Deep learning in CT image segmentation of cervical cancer: a systematic review and meta-analysis

Yang, Chongze; Qin, Lan-hui; Xie, Yu-en; Liao, Jin-yuan

doi:10.1186/s13014-022-02148-6

Research
Open access
Published: 07 November 2022

Deep learning in CT image segmentation of cervical cancer: a systematic review and meta-analysis

Chongze Yang¹^na1,
Lan-hui Qin¹^na1,
Yu-en Xie¹ &
…
Jin-yuan Liao^1,2

Radiation Oncology volume 17, Article number: 175 (2022) Cite this article

4981 Accesses
14 Citations
1 Altmetric
Metrics details

Abstract

Background

This paper attempts to conduct a systematic review and meta-analysis of deep learning (DLs) models for cervical cancer CT image segmentation.

Methods

Relevant studies were systematically searched in PubMed, Embase, The Cochrane Library, and Web of science. The literature on DLs for cervical cancer CT image segmentation were included, a meta-analysis was performed on the dice similarity coefficient (DSC) of the segmentation results of the included DLs models. We also did subgroup analyses according to the size of the sample, type of segmentation (i.e., two dimensions and three dimensions), and three organs at risk (i.e., bladder, rectum, and femur). This study was registered in PROSPERO prior to initiation (CRD42022307071).

Results

A total of 1893 articles were retrieved and 14 articles were included in the meta-analysis. The pooled effect of DSC score of clinical target volume (CTV), bladder, rectum, femoral head were 0.86(95%CI 0.84 to 0.87), 0.91(95%CI 0.89 to 0.93), 0.83(95%CI 0.79 to 0.88), and 0.92(95%CI 0.91to 0.94), respectively. For the performance of segmented CTV by two dimensions (2D) and three dimensions (3D) model, the DSC score value for 2D model was 0.87 (95%CI 0.85 to 0.90), while the DSC score for 3D model was 0.85 (95%CI 0.82 to 0.87). As for the effect of the capacity of sample on segmentation performance, no matter whether the sample size is divided into two groups: greater than 100 and less than 100, or greater than 150 and less than 150, the results show no difference (P > 0.05). Four papers reported the time for segmentation from 15 s to 2 min.

Conclusion

DLs have good accuracy in automatic segmentation of CT images of cervical cancer with a less time consuming and have good prospects for future radiotherapy applications, but still need public high-quality databases and large-scale research verification.

Background

Cervical cancer is the second most common cancer in women aged 15–44 years worldwide, second only to breast cancer in incidence, and the incidence rate and mortality rate are on the rise in recent years [1]. With an annual incidence of about 500,000, more than half of whom die from the disease, cervical cancer is the main reason for the worldwide cancer burden [2]. In many developing countries, most cases of cervical cancer are locally advanced cervical cancer (LACC) when diagnosed [3].

Radiation therapy (RT) was a non-surgical option for lots of varieties of cancer. Likewise, RT is an effective way to improve the survival rate of patients with cervical cancer [4, 5], especially for patients with LACC and those whose physical condition is not suitable for surgery. The preferred approach to radiotherapy for LACC is intensity-modulated radiotherapy (IMRT). To achieve optimal treatment efficacy, for the target area, the radiation dose needs to be increased; and for the surrounding normal tissues and organs, the damage needs to be reduced. Therefore, the key to successful implementation of IMRT is accurate mapping of clinical target volume (CTV) and organs at risk (OARs) [6, 7]. Today, manual segmentation of CTV by a physician is still the standard, but it is a time-consuming and fatiguing task that takes an experienced physician at least 30 min. Even with guidelines, different doctors have different habits, and even the same doctor may have different segmentation results at different times, and there was also literature reported interobserver differences [8]. It is important to note that most CTV does not have clear borders (unlike OAR, which mostly have clear borders) and their contours include not only the apparent lesion volume but also the regional lymph nodes and other suspected pathways of tumor spread [6, 7]. The CTV is largely dependent on individual differences, lesion location, and cancer stage, moreover, even patients with the same stage have different extents of tumor infiltration and lymphatic involvement. All of the above-mentioned will lead to different results of segmentation.

Compared to manual segmentation, automatic segmentation has shown great potential since it was proposed, such as reducing physician burden, decreasing patient waiting time, and improving cancer treatment. During IMRT for Cervical Cancer, the dramatic anatomical changes also require advanced adaptive radiotherapy (ART) strategies [9]. Meanwhile, in low-and middle-income areas and areas with limited medical care, it is difficult to implement radiotherapy according to the guidelines. In this case, the emergence of automatic segmentation can improve the level of local and global medical care [5].

Traditional automatic segmentation methods, such as traditional supervised machine learning and unsupervised machine learning approaches, based on Atlas models and based on statistical models [10], these two methods can obtain great segmentation results, but the segmentation results still require very time-consuming manual editions by the doctor. Unfortunately, both methods have a limitation, in which they cannot handle large differences between different images and different patients [11]. Although a large amount of data can solve this problem, we all know that medical databases, although large in volume, suffer from diversity (i.e., different types, different devices, large differences in data quality, and individual differences of patients), even, many factors can't be dealt with. Therefore we need to use a limited available sample size to achieve the expected results [12, 13]. Thus, all these factors above result in the development of deep learning (DL) networks.

Although DL networks have been around since the 1940s, it was only in 2006 that deep learning emerged as a branch of machine learning, and known as one of the top ten technological breakthroughs since 2013 [12]. Initially, image segmentation by DLs was done with the convolutional neural network (CNN). The CNN usually consists of convolutional layers, pooling layers, and fully connected layers, and its complex structure leads to a large enough sample size and a lot of time, as well as adequate computational capabilities to train the model. Furthermore, CNN has a limitation on image size because of the fixed number of nodes. This problem was then solved by the emergence of the fully convolutional network (FCN), which uses a convolutional layer instead of CNN’s fully-connected layer so that FCN model can handle any image size. In addition, FCN improves segmentation efficiency over CNN because of its skip-connections. Unfortunately, there is also a problem that the multiplier up-sampling in the model is too big, resulting in Insufficient contexts information integration and decrease in segmentation accuracy. The most popular medical image segmentation FCN architecture is the U-net today, which use equal number of up-sampling convolutional layers and down-sampling convolutional layers, and the up-sampling layer can accept the features extracted from the corresponding down-sampling layer because there is a skip-connection in each corresponding layer, so that the segmentation accuracy can be improved. The U-net enables end-to-end training without the need for large numbers of training samples and pre-training [14,15,16,17]. With the development of CNN, FCN, and U-net, the accuracy of medical image segmentation has been greatly improved, suggesting that we have entered the development of the fourth generation of segmentation algorithms [13].

Although many reviews have reported the performance of DLs for image segmentation, some previous authors also conducted meta-analyses on the performance of DLs in glioma [18] and head and neck tumor [19] segmentation. However, there is still a lack of comprehensive review and meta-analysis on cervical cancer segmentation. Therefore, this paper aims to investigate the performance of DLs in the segmentation of cervical cancer. Through this paper, highlights the current state and limitations of the field and makes recommendations for research in the future.

Methods

The systematic review and meta-analysis was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement. This study was registered in PROSPERO prior to initiation (CRD42022307071).

Search strategy

This systematic review and meta-analysis reviews papers on the development or validation of DLs for segmentation of cervical cancer computed tomography (CT) images of CTV or OARs. Publications were retrieved from MEDLINE (accessed via PubMed), The Cochrane Library, Embase, and Web of science, until November 2021. We used ("deep learning", OR "convolutional neural network") AND (Uterine Cervical Neoplasms OR Cervical cancer) as the search strategy, and followed specific search techniques for each database. Language is restricted to English. While there are no publication date restrictions. The full search strategies are available in the Additional file 1.

Selection criteria and data extraction

Screening of the title, abstract and full text of the paper by two researchers (YC) and (QL), respectively. Discussing and resolving disagreements between the two researchers, the remaining disagreements were resolved jointly with the participation of a third researcher (LJ). All researchers extracted data as follows: (a) publication date and first author; (b) size of training set; (c) whether there was an internal/external validation; (d) CT scan parameters; (e) architecture of the DLs; (f) study design, including segmentation strategy; (g) the DSC score of DLs; (h) time of segmentation. Then, the Cross-validation is performed after finishing extracting data. The inclusion criteria were as follows: (1) develop or validate DLs for segmentation of cervical cancer CT images of CTV and/or OARs; (2) the article reports the structure of DLs, the size of the training set, the size of the validation set, the size of the test set and the DSC score of segmentation; (3) evaluation of segmentation results of DLs by senior oncologists or radiologists. The exclusion criteria were as follows: (1) non-deep learning models; (2) no relevant data reported; (3) animal experiments were excluded; (4) reviews, conference, meta-analysis, and duplicate publication.

Quality assessment

We evaluated the included articles in this paper with reference to the transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) [20].

Statistical analysis

We used Stata software (version 15.1) for meta-analysis. When estimating the overall effect size of the current DLs, for the Meta-analysis, we used a random effects model. As for the normally distributed, data, it is shown as mean with ± SD. By contrast, the abnormally distributed data is displayed as the median with a range (min–max). Statistical tests were considered significant when p < 0.05.

The Dice Similarity Coefficient (DSC) [8, 21] was used to evaluate DLs models. The DSC is defined as follows:

$$DSC = \frac{{2\left| {X \cap Y} \right|}}{{\left| {\text{X}} \right| + \left| {\text{Y}} \right|}}$$

where X = {X1, …, Xn} and Y = {Y1, …, Yn} are two finite point sets. X is the predicted mask, and Y is the ground truth. ∣X ∩ Y∣represents the intersection of X and Y and∣X∣ + ∣Y∣represents the union of X and Y.

The Higgins I² test was used to examine the heterogeneity of the included studies. Higgins I² test was able to quantify inconsistency among included studies. When the values > 75% indicated a high degree of heterogeneity between groups. When the values between 25 and 50% indicated moderate heterogeneity between groups. When the value is less than 25% indicated no heterogeneity. We also did subgroup analyses according to the size of the sample, type of segmentation (two dimensions and three dimensions), and three OARs (bladder, rectum, and femur).

The funnel plot and the Egger’s publication bias test were generated with Stata 15.1 to show possible publication bias.

Results

Study selection and characteristics

A total of 1893 articles were retrieved, of which MEDLINE is 316 (accessed via PubMed), The Cochrane Library is 26, Embase is 478, and Web of science is 1073. After 673 duplicate articles were deleted, 1220 articles remained. Of these, 1120 articles were excluded as irrelevant after viewing the title and abstract. Then, two researchers conducted a full-text review of 100 articles, of which 86 records were excluded and 14 articles were included in the study, according to the inclusion and exclusion criteria (Fig. 1). These 14 articles all focus on segmentations cervical cancer CT images for CTV and/or OARs by DLs. The characteristics of the included article are shown in detail in Table 1.

Table 1 The characteristics of the included studies, and structure and outcome of DLs of the included studies

Full size table

Meta‑analysis results

CTV

10 of the 14 studies with a total of 12 DLs segmented the CTV for cervical cancer, among them, 4 papers reported time for segmentation from 15 s to 2 min. The pooled effect of DSC score was 0.86 (95%CI 0.84 to 0.87), Higgins I² is 47.9% that is between 25 and 50%, with moderate heterogeneity (Fig. 2). For the performance of segmented CTV by two dimensions (2D) and three dimensions (3D) model, the DSC score value for 2D model was 0.87 (95%CI 0.85 to 0.90), while the DSC score for 3D model was 0.85 (95%CI 0.82 to 0.87). There was significant difference between the two groups (p = 0.039) (Fig. 3a). To investigate the effect of the capacity of sample on segmentation performance, no matter whether the sample size is divided into two groups: greater than 100 and less than 100, or greater than 150 and less than 150, the results show no difference (P > 0.05).

OARs

For OARs, included articles had focused on the bladder, rectum, femur, L4 vertebral body, L5 vertebral body, sigmoid colon, etc. Due to the number of studies that reported OARs, this article focuses on only three OARs: bladder, rectum, and femur. Two of the included articles reported that splitting the OAR takes 1.5 s and 4.2 s.

Bladder

For the bladder, there were 10 articles in the included studies, of which 11 models mentioned segmentation of the bladder, the total effect of DSC score was 0.91 (95%CI 0.89 to 0.93). Higgins is I² = 48.5% with moderate heterogeneity. For the 2D model and 3D model of the segmented bladder, the pooled effect of DSC score of 2D model and 3D model were 0.93 (95%CI 0.91 to 0.96), and 0.90 (95%CI 0.87 to 0.92) respectively, with a significant difference between the two groups (p = 0.018) (Fig. 3b).

Rectum

For the rectum, there were 9 articles included, of which 10 models mentioned segmentation of the rectum, the pooled effect of DSC score was 0.83 (95%CI 0.79 to 0.88). Higgins I² is 86.0% with a high degree of heterogeneity. For the 2D model and 3D model of the segmented rectum, the total effect of DSC score of 2D model and 3D model were 0.85 (95%CI 0.77 to 0.94) and 0.82 (95%CI 0.80 to 0.84), with a significant difference between the two groups (p = 0.000) (Fig. 3b).

Femoral head

For the femoral head, there were 7 articles in the included studies, of which 8 models mentioned segmentation of the femoral head, the pooled effect of DSC score of the femoral head was 0.92 (95%CI 0.91 to 0.94). Higgins I² is 28.0% with moderate heterogeneity,

We also compare the performance of DLs segmentation of different OARs and the results show: (1) no difference in bilateral femoral head (P > 0.05); (2) performance of DLs for segmentation of the femoral head is superior to that for segmentation of the bladder and rectum (P < 0.05); (3) performance of DLs for segmentation of the bladder is superior to that for segmentation of the rectum (P < 0.05).

Risk of bias

The risk of bias was assessed in 14 included studies according to the TRIPOD tool. Results showed 12 studies were rated as high risk (Fig. 4), the main reasons are the following two: (1) validation results for models are not reported or validation groups are not set; (2) reported applicability of DLs segmentation results for clinical.

Publication bias

The studies included in the funnel plot were the 12 studies that reported CTV, the funnel plot had a symmetrically distributed shape and the Egger’s publication bias test show the P = 0.531 > 0.05, implying no publication bias in the included studies (Figs. 5, 6). For details of the publication bias of included studies that reported OARs are available in the Additional file 1.

Discussion

This paper systematically reviews various DLs models for the segmentation of cervical cancer CTV and OARs. in which the models are almost all U-net and its variants, and despite some heterogeneity, they still show excellent performance of DLs with no significant publication bias.

At present, manually delineating CTV is still a very time-consuming and discrepancy-prone task. For a physician, it often takes half an hour to delineate the CTV, while for DLs it takes only 15 s to 2 min to complete [4, 6, 7, 22]. The results of the evaluation of the metrics show that the current performance of deep learning segmentation of cervical cancer CTV and OARs can achieve good results (DSC > 0.8), and oncologists also say that the results of segmentation by DLs can be used directly or with minor modifications in clinical RT [5, 6, 8, 23, 24]. Therefore, with computer assistance, it will provide great convenience and optimize the work of clinical radiotherapy.

The results for both CTV and OARs segmentation show that the 2D model has a better performance than the 3D model. As you can imagine, this is an obvious problem for a few main reasons: (a) 3D models have more data to process and require more computational capabilities and more optimized algorithms to handle segmentation issues [26, 29, 30]; (b) Comparing to 3D models, 2D metrics typically provide less bias and do not account for under- contour or over-contouring in the 2D slice [6]; (c) 3D models demand a larger sample size and correctly encode the essential features in the image, and a large number of training samples complicates the computational process of the model and may increase the risk of overfitting [27, 31]. But the lack of training data makes the feature capture process of 3D models more difficult [28]. Thus, 2D models can show superior performance to 3D at this stage.

It is worth noting that 2D model has its disadvantages: the loss of data between slices, while 3D models use the entire volume of the image rather than individual slices. Oscar noted that in prostate cancer detection and segmentation tasks, 3D models tend to show better performance than 2D [32]. On the one hand, 2D models have started to appear saturated in many areas, on the other hand, 3D models have many advantages that 2D models do not have: (a) for anatomical structures and lesions with wide variations or irregular shapes, 3D models can show superior performance compared to 2D models if more kinds of data can be collected [6, 31]; (b) compared with 2D-based RT plans, 3D-based plans can reduce the dose to patients and improve their prognosis [22, 33,34,35,36]. These two advantages also show that 3D is the need for the advancement of medical images, and it is also the necessity of developing 3D models. In addition, it is noted that we are currently evaluating the segmentation results of the 3D model in cross-section with a 2D parameter DSC, because the oncologists are also evaluating it. In this way, which will underestimate the performance of the 3D model. But under such conditions, the 3D DL model can also reach a satisfactory segmentation result, and the performance of the 3D model in the future is worthy of our expectation.

For OARs, we are more concerned about the organs adjacent to the cervix, like the bladder and rectum, because of their anatomical location, they are prone to receive radiation and cause radiological sequelae, such as bladder fistula, cystitis, proctitis, etc. All of the above complications will affect the patient's quality of life afterward, and accurate delineation of OARs will reduce the probability of postoperative complications in patients [37,38,39,40]. In this article, DLs segmented both bladder and rectum with good results. The results indicated that segmentation of the bladder performed better than segmentation of the rectum. and the main reasons may be as follows: one is that the bladder contents are urine and the density is more homogeneous, and the other is that the anatomy of the bladder is more rounded and regular, the bladder has comparatively well-defined borders and the bladder has higher contrast with the surrounding tissue [16], so there are enough features for DLs learning and recognition to reach a good DSC. Dazhou Guo proposed to classify OARs into three difficulties by the contrast between OAR and surrounding, and also demonstrated that organs with high contrast with surrounding tissues could achieve higher DSC [41]. In contrast, the borders of the intestine are less clear than those of the bladder, and the main surrounding structures are fatty tissue, which made the intestine lacks contrast with the surrounding tissue [23]. The intestinal contents are not consistent (including air and fecal stones), which also affects the contour of the intestine and its internal density and increase the difficulty of segmentation [5], therefore, the effect of DL in dividing intestines is slightly worse and has a certain difference. Likewise, failure to delineate the femur may lead to complications such as ischemic necrosis of the femoral head and bone marrow suppression after radiotherapy. For segmenting the femoral head excellent results can be achieved, which is superior to the two soft tissues, bladder and rectum. We also observed that the Pelvic bone, L4 and L5 vertebral bodies, which are also bony structures, can also reach DSC 0.9 or higher. The main reason is the great contrast between the bone structure and the surrounding tissue, which is more conducive for DLs to learn and segment [23].

We found no significant effect of sample size on the segmentation results by comparison, suggesting that DLs (U-net) can achieve excellent segmentation performance using small sample training. This may be due to the technique of data augmentation (panning, flipping, deforming, rotating or using DL generation, etc.), or the advantages of the U-net algorithm, although this fact was not statistically verified in this paper [42, 43]. It still shows that U-net can achieve excellent segmentation results with DL models mapped out by training with small samples.

Furthermore, we have to focus on the robustness of U-net, once, deep learning was considered by clinicians to be a "black box algorithm" because of the uncertainty of its results [44]. Nowadays, the same excellent results (DSC > 0.8) have been achieved in several internal and external tests and even in multi-center blinded randomized controlled tests [5, 9, 24]. Moreover, we also noticed that the U-net achieved good segmentation results in glioma [18], head and neck tumors [19], prostate cancer [45], and breast cancer [46]. Despite the differences in segmentation results, as long as they are within the acceptable range of guidelines, it makes sense to have several more treatment options for physicians to choose from and to reduce patient waiting time on the other [47].

However, this paper also has certain limitations: (1) this study only focuses on the segmentation performance of CT images and does not elaborate on MR-based and PET-based, while we know that MRI T2-WI and DWI have very good lesion display power and contouring ability. Therefore, the performance of DLs for segmentation in MR images and PET images needs to be further investigated. (2) This study focuses on only one evaluation metric of DSC, while for segmentation, we have many more metrics to evaluate, such as the Hausdorff Distance (HD), Jaccard Distance (JD), the Deviation of Volume (ΔV), and Sensitivity Index (SI) [22], different metrics have different meanings, and we need to use more metrics to measure the performance of different models in future work. (3) There is a lack of publicly available quality data sets for the task of cervical cancer segmentation. (4) The existing training sets are almost all in the order of a hundred, and it is worthwhile to verify whether there are better results if they exceed the order of a thousand in future larger-scale or multi-center studies.

Nevertheless, we were able to see the very powerful potential of deep learning for cervical cancer image segmentation. Hassanzadeh [48] proposes to use enough 2D data to do 3D segmentation instead of the original 3D data, also achieving higher accuracy and saving at least 75% of time and computation. Linyan Gu proposed the fusion of 2d to 3d (2d UNet + + ASPP to 3D ResUNet) two-step model, this model not only utilizes the information between levels, but also reduces the rate of missed detections compared to a pure 2D model. It also reduces the training time, compared to a pure 3D model [49]. All of these experiences and methods are worth learning from in the future.

Conclusions

This systematic review and meta-analysis show that DLs have good accuracy in automatic segmentation of CT images of cervical cancer with a less time consuming and have good prospects for future radiotherapy applications, but still need public high-quality databases, and large-scale research verification.

Availability of data and materials

Not applicable.

Abbreviations

CCRT:: Concurrent chemoradiotherapy
FIGO:: The International Federation of Gynecology and Obstetrics
CT:: Computed tomography
LACC:: Locally advanced cervical cancer
RT:: Radiation therapy
IMRT:: Intensity-modulated radiotherapy
CTV:: Clinical target volume
OARs:: Organs at risk
ART:: Advanced adaptive radiotherapy
CNN:: Convolutional neural network
DL:: Deep learning
FCN:: Fully convolutional network
DSC:: Dice similarity coefficient
2D:: Two dimensions
3D:: Three dimensions
PRISMA:: The preferred reporting items for systematic reviews and meta-analyses
RECIST:: Response evaluation criteria in solid tumors

References

Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68(6):394–424. https://doi.org/10.3322/caac.21492.
Article PubMed Google Scholar
Fidler MM, Gupta S, Soerjomataram I, Ferlay J, Steliarova-Foucher E, Bray F. Cancer incidence and mortality among young adults aged 20–39 years worldwide in 2012: a population-based study. Lancet Oncol. 2017;18(12):1579–89. https://doi.org/10.1016/S1470-2045(17)30677-0.
Article PubMed Google Scholar
Sahiner B, Pezeshk A, Hadjiiski LM, Wang X, Drukker K, Cha KH, Summers RM, Giger ML. Deep learning in medical imaging and radiation therapy. Med Phys. 2019;46(1):e1–36. https://doi.org/10.1002/mp.13264.
Article PubMed Google Scholar
Wang Z, Chang Y, Peng Z, Lv Y, Shi W, Wang F, Pei X, Xu XG. Evaluation of deep learning-based auto-segmentation algorithms for delineating clinical target volume and organs at risk involving data for 125 cervical cancer patients. J Appl Clin Med Phys. 2020;21(12):272–9. https://doi.org/10.1002/acm2.13097.
Article PubMed PubMed Central Google Scholar
Rhee DJ, Jhingran A, Rigaud B, Netherton T, Cardenas CE, Zhang L, Vedam S, Kry S, Brock KK, Shaw W, O’Reilly F, Parkes J, Burger H, Fakie N, Trauernicht C, Simonds H, Court LE. Automatic contouring system for cervical cancer using convolutional neural networks. Med Phys. 2020;47(11):5648–58. https://doi.org/10.1002/mp.14467.
Article PubMed Google Scholar
Liu Z, Liu X, Guan H, Zhen H, Sun Y, Chen Q, Chen Y, Wang S, Qiu J. Development and validation of a deep learning algorithm for auto-delineation of clinical target volume and organs at risk in cervical cancer radiotherapy. Radiother Oncol. 2020;153:172–9. https://doi.org/10.1016/j.radonc.2020.09.060.
Article PubMed Google Scholar
Shi J, Ding X, Liu X, Li Y, Liang W, Wu J. Automatic clinical target volume delineation for cervical cancer in CT images using deep learning. Med Phys. 2021;48(7):3968–81. https://doi.org/10.1002/mp.14898.
Article PubMed Google Scholar
Liu Z, Liu X, Xiao B, Wang S, Miao Z, Sun Y, Zhang F. Segmentation of organs-at-risk in cervical cancer CT images with a convolutional neural network. Phys Med. 2020;69:184–91. https://doi.org/10.1016/j.ejmp.2019.12.008.
Article PubMed Google Scholar
Rigaud B, Anderson BM, Yu ZH, Gobeli M, Cazoulat G, Söderberg J, Samuelsson E, Lidberg D, Ward C, Taku N, Cardenas C, Rhee DJ, Venkatesan AM, Peterson CB, Court L, Svensson S, Löfman F, Klopp AH, Brock KK. Automatic segmentation using deep learning to enable online dose optimization during adaptive radiation therapy of cervical cancer. Int J Radiat Oncol Biol Phys. 2021;109(4):1096–110. https://doi.org/10.1016/j.ijrobp.2020.10.038.
Article PubMed Google Scholar
Shal K, Choudhry MS. Evolution of deep learning algorithms for MRI-based brain tumor image segmentation. Crit Rev Biomed Eng. 2021;49(1):77–94. https://doi.org/10.1615/CritRevBiomedEng.2021035557.
Article PubMed Google Scholar
Ju Z, Guo W, Gu S, Zhou J, Yang W, Cong X, Dai X, Quan H, Liu J, Qu B, Liu G. CT based automatic clinical target volume delineation using a dense-fully connected convolution network for cervical cancer radiation therapy. BMC Cancer. 2021;21(1):243. https://doi.org/10.1186/s12885-020-07595-6.
Article PubMed PubMed Central Google Scholar
Cai L, Gao J, Zhao D. A review of the application of deep learning in medical image classification and segmentation. Ann Transl Med. 2020;8(11):713. https://doi.org/10.21037/atm.2020.02.44.
Article PubMed PubMed Central Google Scholar
Cardenas CE, Yang J, Anderson BM, Court LE, Brock KB. Advances in auto-segmentation. Semin Radiat Oncol. 2019;29(3):185–97. https://doi.org/10.1016/j.semradonc.2019.02.001.
Article PubMed Google Scholar
Ronneberger O, Fischer P, Brox T. U-Net: convolutional networks for Biomedical image segmentation. In: Navab N, Hornegger J, Wells W, Frangi A, editors. Medical image computing and computer-assisted intervention– MICCAI 2015. MICCAI 2015. Lecture Notes in Computer Science, vol. 9351. Cham: Springer; 2015. https://doi.org/10.1007/978-3-319-24574-4_28
Çiçek Ö, Abdulkadir A, Lienkamp SS, et al. 3D U-net Learning dense volumetric segmentation from sparse annotation. Lect Notes Comput Sci. 2016;9901:424–32.
Article Google Scholar
Liu X, Li KW, Yang R, Geng LS. Review of deep learning based automatic segmentation for lung cancer radiotherapy. Front Oncol. 2021;8(11):717039. https://doi.org/10.3389/fonc.2021.717039.
Article Google Scholar
Shelhamer E, Long J, Darrell T. Fully convolutional networks for semantic segmentation. IEEE Trans Pattern Anal Mach Intell. 2017;39(4):640–51. https://doi.org/10.1109/TPAMI.2016.2572683.
Article PubMed Google Scholar
van Kempen EJ, Post M, Mannil M, Witkam RL, Ter Laan M, Patel A, Meijer FJA, Henssen D. Performance of machine learning algorithms for glioma segmentation of brain MRI: a systematic literature review and meta-analysis. Eur Radiol. 2021;31(12):9638–53. https://doi.org/10.1007/s00330-021-08035-0.
Article PubMed PubMed Central Google Scholar
Badrigilan S, Nabavi S, Abin AA, Rostampour N, Abedi I, Shirvani A, Ebrahimi MM. Deep learning approaches for automated classification and segmentation of head and neck cancers and brain tumors in magnetic resonance images: a meta-analysis study. Int J Comput Assist Radiol Surg. 2021;16(4):529–42. https://doi.org/10.1007/s11548-021-02326-z.
Article PubMed Google Scholar
Patzer RE, Kaji AH, Fong Y. TRIPOD reporting guidelines for diagnostic and prognostic studies. JAMA Surg. 2021;156(7):675–6. https://doi.org/10.1001/jamasurg.2021.0537.
Article PubMed Google Scholar
Dice LR. Measures of the amount of ecologic association between species. Ecology. 1945;26:297–302. https://doi.org/10.2307/1932409.
Article Google Scholar
Zhang D, Yang Z, Jiang S, Zhou Z, Meng M, Wang W. Automatic segmentation and applicator reconstruction for CT-based brachytherapy of cervical cancer using 3D convolutional neural networks. J Appl Clin Med Phys. 2020;21(10):158–69. https://doi.org/10.1002/acm2.13024.
Article PubMed PubMed Central Google Scholar
Sartor H, Minarik D, Enqvist O, Ulén J, Wittrup A, Bjurberg M, Trägårdh E. Auto-segmentations by convolutional neural network in cervical and anorectal cancer with clinical structure sets as the ground truth. Clin Transl Radiat Oncol. 2020;14(25):37–45. https://doi.org/10.1016/j.ctro.2020.09.004.
Article Google Scholar
Liu Z, Chen W, Guan H, Zhen H, Shen J, Liu X, Liu A, Li R, Geng J, You J, Wang W, Li Z, Zhang Y, Chen Y, Du J, Chen Q, Chen Y, Wang S, Zhang F, Qiu J. An adversarial deep-learning-based model for cervical cancer CTV segmentation with multicenter blinded randomized controlled validation. Front Oncol. 2021;19(11):702270. https://doi.org/10.3389/fonc.2021.702270.
Article Google Scholar
Hu H, Yang Q, Li J, Wang P, Tang B, Wang X, Lang J. Deep learning applications in automatic segmentation and reconstruction in CT-based cervix brachytherapy. J Contemp Brachytherapy. 2021;13(3):325–30. https://doi.org/10.5114/jcb.2021.106118.
Article PubMed PubMed Central Google Scholar
Chang J-H, Lin K-H, Wang T-H, Zhou Y-K, Chung P-C. Image segmentation in 3D brachytherapy using convolutional LSTM. J Med Biol Eng. 2021. https://doi.org/10.1007/s40846-021-00624-0.
Article Google Scholar
Mohammadi R, Shokatian I, Salehi M, Arabi H, Shiri I, Zaidi H. Deep learning-based auto-segmentation of organs at risk in high-dose rate brachytherapy of cervical cancer. Radiother Oncol. 2021;159:231–40. https://doi.org/10.1016/j.radonc.2021.03.030.
Article PubMed Google Scholar
Ju Z, Wu Q, Yang W, Gu S, Guo W, Wang J, Ge R, Quan H, Liu J, Qu B. Automatic segmentation of pelvic organs-at-risk using a fusion network model based on limited training samples. Acta Oncol. 2020;59(8):933–9. https://doi.org/10.1080/0284186X.2020.1775290.
Article PubMed Google Scholar
Noori M, Bahri A, Mohammadi K. Attention-guided version of 2D UNet for automatic brain tumor segmentation. In: 2019 9th international conference on computer and knowledge engineering (ICCKE), Mashhad, Iran; 2019. p. 269–75.
Akal O, Peng Z, Valadez GH. ComboNet: combined 2D & 3D architecture for aorta segmentation. arXiv:2006.05325
Shivdeo A, Lokwani R, Kulkarni V, Kharat A, Pant A. Comparative evaluation of 3D and 2D Deep learning techniques for semantic segmentation in CT scans. arXiv:2101.07612
Pellicer-Valero OJ, Marenco Jiménez JL, Gonzalez-Perez V, et al. Deep learning for fully automatic detection, segmentation, and Gleason Grade estimation of prostate cancer in multiparametric magnetic resonance images.arXiv:2103.12650
Tanderup K, Nielsen SK, Nyvang GB, et al. From point A to the sculpted pear: MR image guidance significantly improves tumour dose and sparing of organs at risk in brachytherapy of cervical cancer. Radiother Oncol. 2010;94:173–80.
Article PubMed Google Scholar
Potter R, Georg P, Dimopoulos JC, et al. Clinical outcome of protocol based image (MRI) guided adaptive brachytherapy combined with 3D conformal radiotherapy with or without chemotherapy in patients with locally advanced cervical cancer. Radiother Oncol.
Simpson DR, Scanderbeg DJ, Carmona R, et al. Clinical outcomes of computed tomography-based volumetric brachytherapy planning for cervical cancer. Int J Radiat Oncol Biol Phys. 2015;93:150–7.
Article PubMed Google Scholar
Charra-Brunaud C, Harter V, Delannes M, et al. Impact of 3D image-based PDR brachytherapy on outcome of patients treated for cervix carcinoma in France: results of the French STIC prospective study. Radiother Oncol. 2012;103:305–13.
Article PubMed Google Scholar
Beller HL, Rapp DE, Zillioux J, Abdalla B, Duska LR, Showalter TN, Krupski TL, Cisu T, Congleton JY, Schenkman NS. Urologic complications requiring intervention following high-dose pelvic radiation for cervical cancer. Urology. 2021;151:107–12. https://doi.org/10.1016/j.urology.2020.09.011.
Article PubMed Google Scholar
Spampinato S, Fokdal LU, Pötter R, Haie-Meder C, Lindegaard JC, Schmid MP, Sturdza A, Jürgenliemk-Schulz IM, Mahantshetty U, Segedin B, Bruheim K, Hoskin P, Rai B, Huang F, Cooper R, van der Steen-Banasik E, Van Limbergen E, Sundset M, Westerveld H, Nout RA, Jensen NBK, Kirisits C, Kirchheiner K, Tanderup K, EMBRACE Collaborative Group. Risk factors and dose-effects for bladder fistula, bleeding and cystitis after radiotherapy with imaged-guided adaptive brachytherapy for cervical cancer: an EMBRACE analysis. Radiother Oncol. 2021;158:312–20.
Article PubMed Google Scholar
Fokdal L, Pötter R, Kirchheiner K, Lindegaard JC, Jensen NBK, Kirisits C, Chargari C, Mahantshetty U, Jürgenliemk-Schulz IM, Segedin B, Hoskin P, Tanderup K. Physician assessed and patient reported urinary morbidity after radio-chemotherapy and image guided adaptive brachytherapy for locally advanced cervical cancer. Radiother Oncol. 2018;127(3):423–30. https://doi.org/10.1016/j.radonc.2018.05.002.
Article PubMed Google Scholar
Mansha MA, Sadaf T, Waheed A, Munawar A, Rashid A, Chaudry SJ. Long-term toxicity and efficacy of intensity-modulated radiation therapy in cervical cancers: experience of a cancer hospital in Pakistan. JCO Glob Oncol. 2020;6:1639–46. https://doi.org/10.1200/GO.20.00169.
Article PubMed Google Scholar
Guo D, Jin D, Zhu Z, Ho T-Y, Harrison AP, Chao CH, Xiao J, Yuille A, Lin C-Y, Lu L. Organ at risk segmentation for head and neck cancer using stratified learning and neural architecture search. arXiv:2004.08426
Yamanakkanavar N, Choi JY, Lee B. MRI segmentation and classification of human brain using deep learning for diagnosis of Alzheimer’s disease: a survey. Sensors. 2020;20(11):3243. https://doi.org/10.3390/s20113243.
Article PubMed Central Google Scholar
Zhao Y, Rhee DJ, Cardenas C, Court LE, Yang J. Training deep-learning segmentation models from severely limited data. Med Phys. 2021;48(4):1697–706. https://doi.org/10.1002/mp.14728 (Epub 2021 Feb 19).
Article PubMed Google Scholar
Kalantar R, Lin G, Winfield JM, Messiou C, Lalondrelle S, Blackledge MD, Koh DM. Automatic segmentation of pelvic cancers using deep learning: state-of-the-art approaches and challenges. Diagnostics. 2021;11(11):1964. https://doi.org/10.3390/diagnostics11111964.
Article PubMed PubMed Central Google Scholar
Almeida G, Tavares JMRS. Deep learning in radiation oncology treatment planning for prostate cancer: a systematic review. J Med Syst. 2020;44(10):179. https://doi.org/10.1007/s10916-020-01641-3.
Article PubMed Google Scholar
Reig B, Heacock L, Geras KJ, Moy L. Machine learning in breast MRI. J Magn Reson Imaging. 2020;52(4):998–1018. https://doi.org/10.1002/jmri.26852.
Article PubMed Google Scholar
Balagopal A, Morgan H, Dohopolski M, Timmerman R, Shan J, Heitjan DF, Liu W, Nguyen D, Hannan R, Garant A, Desai N, Jiang S. PSA-Net: deep learning-based physician style-aware segmentation network for postoperative prostate cancer clinical target volumes. Artif Intell Med. 2021;121:102195. https://doi.org/10.1016/j.artmed.2021.102195.
Article PubMed Google Scholar
Hassanzadeh T, Essam D, Sarker R. 2D to 3D evolutionary deep convolutional neural networks for medical Image segmentation. IEEE Trans Med Imaging. 2021;40(2):712–21. https://doi.org/10.1109/TMI.2020.3035555.
Article PubMed Google Scholar
Gu L, Cai XC. Fusing 2D and 3D convolutional neural networks for the segmentation of aorta and coronary arteries from CT images. Artif Intell Med. 2021;121:102189. https://doi.org/10.1016/j.artmed.2021.102189.
Article PubMed Google Scholar

Download references

Acknowledgements

Not applicable.

Funding

This study was granted from National Natural Science Foundation of China (No. 81360220).

Author information

Chongze Yang and Lan-hui Qin contributes equally

Authors and Affiliations

Department of Radiology, First Affiliated Hospital of Guangxi Medical University, Nanning, 530021, Guangxi Zhuang Autonomous Region, People’s Republic of China
Chongze Yang, Lan-hui Qin, Yu-en Xie & Jin-yuan Liao
Key Laboratory of Early Prevention and Treatment for Regional High Frequency Tumor (Gaungxi Medical University), Ministry of Education, Nanning, 530021, Guangxi Zhuang Autonomous Region, People’s Republic of China
Jin-yuan Liao

Authors

Chongze Yang
View author publications
You can also search for this author in PubMed Google Scholar
Lan-hui Qin
View author publications
You can also search for this author in PubMed Google Scholar
Yu-en Xie
View author publications
You can also search for this author in PubMed Google Scholar
Jin-yuan Liao
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C-zY and L-hQ drafted and prepared manuscript for final publication; C-zY, L-hQ, and Y-eX reviewed the literature and extracted the data; J-yL and Y-eX worked on the statistic part; J-yL performed consultation and revised the manuscript. All authors issued final approval for the version to be submitted.

Corresponding author

Correspondence to Jin-yuan Liao.

Ethics declarations

Ethical approval and consent to participate

The review did not require approval by an Ethical Committee.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

The search strategy and the additional figure.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Yang, C., Qin, Lh., Xie, Ye. et al. Deep learning in CT image segmentation of cervical cancer: a systematic review and meta-analysis. Radiat Oncol 17, 175 (2022). https://doi.org/10.1186/s13014-022-02148-6

Download citation

Received: 14 March 2022
Accepted: 16 October 2022
Published: 07 November 2022
DOI: https://doi.org/10.1186/s13014-022-02148-6

Deep learning in CT image segmentation of cervical cancer: a systematic review and meta-analysis

Abstract

Background

Methods

Results

Conclusion

Background

Methods

Search strategy

Selection criteria and data extraction

Quality assessment

Statistical analysis

Results

Study selection and characteristics

Meta‑analysis results

CTV

OARs

Bladder

Rectum

Femoral head

Risk of bias

Publication bias

Discussion

Conclusions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethical approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Radiation Oncology

Contact us