Retrospective dosimetry study of intensity-modulated radiation therapy for nasopharyngeal carcinoma: measurement-guided dose reconstruction and analysis
Radiation Oncologyvolume 13, Article number: 42 (2018)
The Correction to this article has been published in Radiation Oncology 2018 13:117
Conventional phantom-based planar dosimetry (2D-PBD) quality assurance (QA) using gamma pass rate (GP (%)) is inadequate to reflect clinically relevant dose error in intensity-modulated radiation therapy (IMRT), owing to a lack of information regarding patient anatomy and volumetric dose distribution. This study aimed to evaluate the dose distribution accuracy of IMRT delivery for nasopharyngeal carcinoma (NPC), which passed the 2D-PBD verification, using a measurement-guided 3D dose reconstruction (3D-MGR) method.
Radiation treatment plans of 30 NPC cases and their pre-treatment 2D-PBD data were analyzed. 3D dose distribution was reconstructed on patient computed tomography (CT) images using the 3DVH software and compared to the treatment plans. Global and organ-specific dose GP (%), and dose-volume histogram (DVH) deviation of each structure was evaluated. Interdependency between GP (%) and the deviation of the volumetric dose was studied through correlation analysis.
The 3D-MGR achieved global GP (%) similar to conventional 2D-PBD in the same criteria. However, structure-specific GP (%) significantly decreased under stricter criteria, including the planning target volume (PTV). The average deviation of all inspected dose volumes (DV) and volumetric dose (VD) parameters ranged from − 2.93% to 1.17%, with the largest negative deviation in V100% of the PTVnx of − 15.66% and positive deviation in D1cc of the spinal cord of 6.66%. There was no significant correlation between global GP (%) of 2D-PBD or 3D-MGR and the deviation of the most volumetric dosimetry parameters (DV or VD), when the Pearson’s coefficient value of 0.8 was used for correlation evaluation.
Even upon passing the pre-treatment phantom based dosimetric QA, there could still be risk of dose error like under-dose in PTVnx and overdose in critical structures. Measurement-guided 3D volumetric dosimetry QA is recommended as the more clinically efficient verification for the complicated NPC IMRT.
Intensity-modulated radiation therapy (IMRT) is capable of improving the overall survival and long-term quality of life in patients with nasopharyngeal carcinoma [1, 2]. Patient-specific pre-treatment quality assurance (QA) is necessary for the implementation of IMRT , and it has been a consensus of the researcher community that patient-specific QA can be done by film dosimetry combined with ionization chambers measurement [4,5,6], or by a 2D/3D detector arrays test in a phantom to compare and validate the dose accuracy of the treatment [7,8,9,10]. Most of these pre-treatment QA use the ‘γ evaluation method’ for the result analysis, which is a composite analysis of distance-to-agreement (DTA) and dose difference (DD) [11,12,13].
The phantom measure-based γ evaluation method provides a quantitative analysis of the degree of agreement between the measured and calculated dose distributions. It can be used to confirm or evaluate if the treatment plan was delivered with sufficient accuracy based on patient-specific quality assurance. The AAPM TG-119 report  recommended action levels of 88% and 90% for composite and per field gamma passing rate GP (%) analysis, respectively. However, it only determines the ratio of points out of tolerance without giving any information about the spatial location of points that the dose deviated from in the origin plan, including the volumetric dose deviation for planned target volumes (PTVs) and organ at risk (OAR) of the patient . Some of the recent research showed that γ passing rates of per beam planar IMRT QA did not predict clinically relevant dose errors , owing to a lack of correlation between the gamma passing rates (GP (%)and the volumetric dose errors in the anatomic regions-of-interest [17, 18]. It has, therefore, raised a question whether the patient OARs are safe or if the PTVs are covered by the prescribed dose when a higher passing rate is achieved.
Recently, a 3D dose reconstruction method was introduced in the IMRT QA; this method reconstructed the delivered 3D dose distribution on the patient CT image based on per beam measured doses. Olch  validated the software called 3DVH for 3D dose analysis of IMRT verification. In this study, the 3DVH was used to retrospectively analyze the 3D dose distribution of a group of NPC cases treated with IMRT at our center. Each treatment plan was validated with the pre-treatment 2D phantom QA and passed the 3 mm/3% GP (%) examination. The correlation between the 2D/3D GP (%) and the deviation in reconstructed DVHs were assessed as well.
The data of treatment plans for 30 NPC patients who finished IMRT treatment courses were randomized selected from our database and fully anonymized for the purpose of this retrospective analysis study. Of the total group of cases, 16 were males and 14 were females, with a sex ratio of 1.1:1. According to UICC 2009 staging criteria, there were 2, 19, 8 and 1 cases with stage II, III, IVa and IVc disease, respectively.
All the studied cases were treated with 9-field static IMRT using a linear accelerator (Synergy, Elekta AB, Stockholm, Sweden) with 1-cm MLC and a 6 MV photon beam. The primary gross target volumes (GTVnx), nodal gross target volumes (GTVnd), and clinical target volumes (CTV1 and CTV2) were delineated manually by radiation oncologists, and the relevant planning target volumes (PTVnx, PTVnd, PTV1 and PTV2) were generated by adding a set-up margin to the corresponding volumes in all directions according to the immobilization and localization uncertainties [20, 21]. The prescribed doses were 70 Gy to PTVnx, 60–66 Gy to PTVnd, 60 Gy to PTV1, and 54 Gy to PTV2, 5 times per week with a total of 30 fractions. The dose constraints for all PTVs were that over 95% of the PTV covered by the prescribed dose, The main constrained OARs included the spinal cord, brainstem, parotid gland, temporal lobes, and larynx. All planned dose distributions were optimized and calculated with an inverse treatment planning system (TPS) (Monaco V3.0 Elekta AB, Stockholm, Sweden) using the Monte Carlo (MC) algorithm. The calculation grid was 3 mm, and 3% statistic uncertainty was used.
All the 30 IMRT plans were validated with a 2D diode detector array (Mapcheck2, Sun Nuclear Corporation, Melbourne, FL). A QA plan was generated using a fractional treatment plan, and the dose distribution was recalculated in the QA phantom. The delivery of the QA plan was verified by a measurement using the diode array, and (3 mm/3%) GP (%) of greater than 90% was accepted for composite dose verification.
Review of 3D dose reconstruction
A commercialized 3D dose reconstruction system (3DVH, Sun Nuclear Corporation, Melbourne, FL) was used for the study, which can reconstruct 3D dose distribution in patients’ CT images based on the 2D dose distribution measured in the pre-treatment QA with a planned dose perturbation (PDP) algorithm . The 3DVH software uses the dose differences between the 2D array measurement and the TPS dose calculation for each beam to produce the PDP files and then projects it back into the TPS calculated 3D dose distribution to reconstruct the delivered dose. For comparing the difference between the measurement and the original plan dose calculated by the TPS, interpolation is needed for the dose between the diode detectors of the 2D array. A so called “Smarterpolation” method is built in the 3DVH software to interpolate the measured dose to the same resolution and voxel size as the TPS calculation. The Smarterpolation estimates the dose changes in the neighborhood of every detector according to the high spatial resolution dose distribution calculated by the TPS and uses these changes to interpolate the measurement data [23, 24]. After importing the patient CT sets, RT plan, RT dose, and RT structures, the PDP files will be applied directly by the 3DVH system to perturb the planned 3D dose to produce a new 3D dose distribution in patients’ CT images, and evaluate clinically relevant dose discrepancies for each OAR or PTVs.
Reconstructed 3D dose analysis
Using the reconstructed 3D dose distribution, the following dosimetry related parameters were analyzed.
Gamma pass rate comparison
In this study, 2D GP (%) was retrieved from recorded patient QA data. A 3D dose verification review for each plan was done by the above-described PDP algorithm, and the delivered dose distributions were reconstructed on the patient CT images. The global and each organ-specific 3D GP (%) between the reconstructed dose distribution and original treatment plan were calculated using the 3DVH software. Three different criteria were used for analysis: 3 mm/3%, 2 mm/2%, and 1 mm/1%. The percentage dose differences were normalized to the global maximum dose. The GP (%) was calculated for all dose points over a threshold of 10% of the maximum dose, indicating that the detectors whose values fell within 0 to 10% would be excluded from the statistic.
DVH parameters comparison
To evaluate the actual delivered dose distribution and DVH deviation in patients, the reconstructed and original planned DVH parameters were compared for each of the PTVs and OAR, including: (1) dose coverage for PTVs: percentage target volume received at least 100% and 95% of the prescription dose, V100% (%) and V95% (%); minimum dose covered 98% and 95% of the target volume, D98%, and D95%; and mean dose in target volume, Dmean. (2) dose for OARs: D1cc of the spinal cord, brainstem, and temporal lobe (the maximum dose covering 1 cm3 volume of the organ); V60Gy (%) of the brainstem (percentage volume that received at least 60 Gy); V30Gy (%) and Dmean of the parotid gland (percentage volume that received at least 30 Gy dose and mean dose of the parotids); and Dmean of the larynx.
The percentage deviation (%) of the absolute dose and the DVH parameters were calculated using the following equations:
Correlation analysis of DVH deviation with gamma pass rate
Statistical correlation of DVH deviation (absolute value) and GP (%) was studied with the Pearson’s coefficient (r), calculated using the SPSS (19) software. The Pearson’s coefficient value of 0.8 was considered to be a significant correlation.
Gamma pass rate comparison
For all studied cases, the GP (%) using three different criteria were evaluated for 2D, 3D, and organ-specific areas. Table 1 showed the average GP (%) of the 30 NPC cases; the maximum and minimum GP (%) values were also reported. Both the GP (%) using criteria of 3%/3 mm and 1%/1 mm for 2D planar phantom dose verification and the global 3D reconstructed dose verification were significantly different, based on the paired samples T test. Compared to the global 3D GP (%), the mean GP (%) was relatively lower in PTVs but relatively higher in the main OAR for the 3 mm/3% criterion. However, the GP (%) decreased a lot in both PTVs and some OAR when a stricter criterion (1 mm/1%) was used.
The average relative difference in the volumetric dose (DV) and dose volume (VD) between the 3D dose reconstruction and the planned dose ranged from − 2.93% to 0.02% for PTVs, and − 1.66% to 1.17% for OAR (Table 2). Although the average deviations were slight, clinically significant deviation was found in some individual cases. In Table 3, eight of the 30 cases were under-dosed with a discrepancy of − 5% in V70 Gy (V100%) of the PTVnx. One of the 30 cases received a 5% higher dose than the planned dose separately in D1cc of the spinal cord and the mean dose of the larynx. Fig 1 shows the two cases with the highest dose deviation in PTV and OAR, one with a largest negative deviation (− 15.66%) in V100% of the PTVnx and another case with a significant positive deviation (6.66%) in D1cc of the spinal cord.
Correlation analysis of DVH deviation and gamma pass rate
The results of statistical correlations between DV, VD, and GP (%), described by Pearson’s coefficient (r), are shown in Table 4. No obvious correlations (both criteria R > 0.8 and p < 0.05 were met) were found between all the DVH metrics and the global GP (%) got from the 2D QA measurement and the 3D reconstructed dose. In the measurement-based 3D dose verification, only the reconstructed D2% and the Dmean of the PTVnx showed a significant (p < 0.01) strong correlation with the organ-specific GP (%) for the PTVnx, when a Pearson’s coefficient value of 0.8 was used for the correlation evaluation. The plots of the correlation analysis with the R2 value is available in the additional figures files [see Additional file 1: Figures S1 to S10].
Phantom measurement and global GP (%) evaluation are widely accepted in the radiation therapy (RT) community as a routine IMRT QA procedure. According to the report of AAPM TG119, the 3 mm /3% criterion is suggested for this kind of verification. In this study, an average GP (%) of 96.4%, ranging from 89.1% to 99.7%, was achieved using the AAPM suggested criteria. However, the GP (%) significantly decreased using a stricter acceptance criterion, which is similar to the report of Benjamin E, et al. , although it did not reflect a volumetric dose deviation in the PTVs and OAR.
The results of the correlation analysis showed that all the coefficient values (r) were much lower than 0.8 for correlations between the global GP (%) and DV or VD for each of the PTVs and OAR. It indicated either no correlation or only very weak correlation existing between the global GP (%) and the deviation of DVH parameters. M. Stasi, et al. (17) have reported similar results in their study of 2 groups of IMRT cases (prostate and pelvic IMRT, and head and neck IMRT), wherein all coefficient values were smaller than 0.8, indicating a weak correlation between the GP (%) and the dose deviation.
In the organ-specific GP (%) analysis, the GP (%) of three different criteria all showed strong negative correlation with the deviation of mean dose in the PTVnx-specific evaluation. A coefficient value larger than 0.8, indicated that the higher the GP (%) in the PTVnx, the less the deviation in the mean dose of its volume. Also, the strength of the correlation coefficients (r) of the organ-specific GP (%) was higher than that of the global GP (%). These results are consistent with the findings of M. Cozzlino et al . In their study of a group of RapidArc treatment plans for the prostate, on using the COMPASS system (IBA Dosimetry, Germany) to reconstruct the delivered dose distribution, a stronger correlation was observed between the organ-specific GP (%) and dose deviation rather than with the global GP (%).
A high global GP (%) did not always mean a high organ-specific GP (%) (e.g. target volume specific GP (%)), and vice versa, a low global GP (%) did not always indicate a low GP (%) in the specific organ volumes. As depicted in Fig. 2, the case on the left one showed a high global GP (%) which meet the QA criteria, but not ensured the clinical concerned dose errors within tolerance. In fact, a significant low-dose area was located in the PTVnx leading to a large reduction (12.8%) in the V70Gy, which might reduce local control of the treatment. The case on the right showed a relatively low global GP (%), but the dose error all distributed out of the gross tumor, high risk and critic structure areas.
M. Stasi et al.  observed that the measurement-based reconstructed delivery doses to the PTVs were all negative discrepant in their analysis of a group of cases of prostate and head and neck cancers using the same 3DVH system. M. Cozzolino et al  reported the discrepancy between the measurement-guided dose reconstruction using a 3D QA system (COMPASS, IBA Dosimetry, Schwarzenbruck, Germany) and the original plan, in which the actual dose could be 5% greater than the planned value in some cases. In our review study, the deviation of the reconstructed DVH from the planned values ranged between 6.66% and − 15.66%. There were 27% (8/30) of cases in which coverage of the prescribed dosage in the gross tumor volume (V70 Gy of the PTVnx) decreased by 5% or more, implicating the possibility of a potential effect on local control of the treatment, which was concealed during the pretreatment 2D phantom verification. In addition, there were two cases with > 5% dose increment in critical structures, separately in the D1cc of the spinal cord and in the mean dose of the larynx, compared to the planned doses. In the case of the largest dose increase, in the spinal cord, the planned D1cc was 47.019 Gy, and the reconstructed D1cc was 50.149 Gy which was already beyond our clinically tolerated dose. This big discrepancy in the dose should be noticed before treatment and carefully re-evaluated, especially in cases where the planned dose was close to the tolerated dose. Except for the above-mentioned cases, all other OARs showed a very small deviation in DVHs. A carefully review of DVHs of the PTVs and OARs revealed that these kinds of dose deviations could be overlooked when only global GP (%) evaluation is used in the pretreatment QA. Hence, a volumetric dose verification and evaluation might be needed in clinical practice by means of 3D dose reconstruction based on delivery measurement.
In this report of our study, the gamma pass rate (GP (%)) evaluation was based on the percentage dose differences normalized to the global maximum dose. This is good for the high-dose regions close to the target. However, for some organs at risk which are found in the lower-dose region, this normalization might underestimate the real difference in dose, and a local dose difference might be helpful for understanding the sensitivity of the GP (%) in some cases. For this reason, we also analyzed the GP (%) using local dose normalization and found that it was lower than that using global maximum dose normalization. Nevertheless, both GP (%) of global maximum dose normalization and local dose normalization had the similar results in the DVH correlation analysis, having no significant strong correlation with the DVH errors, except in the PTVnx-specific GP% and the DVH error (detail data is available in an additional table file [see Additional file 2: Table S1-S3]).
Although the measurement-guided 3D dose reconstruction method can be used to predict the actually delivery dose distribution on patient before IMRT treatment, the actual delivered dose distribution in patient, during the whole treatment course underwent a long period of time, may be affected by many factors such as the change in multi-leaf collimator (MLC) position accuracy, beam energy fluctuation, gross machine monitor (MU) errors, tumor shrink and anatomy changes, etc. The accumulated actual delivered dose distribution on patient will be interesting in our future work.
Traditional 2D Phantom QA and global GP (%) evaluation is not sufficient for ensuring the clinically accurate volumetric dose for IMRT treatment, as there is no strong correlation between the global GP (%) and percentage deviation in DVH of both PTVs and OAR, even when a strict 1%/1 mm gamma criterion was used. According to the results of our study, 3D dose verification and organ-specific GP (%) evaluation is a more effective QA method, and the PTVnx specific GP (%) has a strong negative correlation with the mean dose of the PTVnx. Although the IMRT treatment plan passed a 2D phantom-based dosimetry QA of GP (%) evaluation, there is still a potential risk of volumetric dose deviation, such as lack of dose coverage in the target or an overdose in the OAR. Three-dimensional dose reconstruction based on measurement and DVH verification are recommended for IMRT QA, rather than taking the GP (%) evaluation only.
Conventional phantom-based planar dosimetry
Measurement-guided 3D dose reconstruction
- DV :
Volumetric dose and
- GP (%):
Gamma pass rate (%)
Organ at risk
Planning target volume
- VD :
Tai-Lin H, Chih-Yen C, Wen-Ling T, et al. Long-term late toxicities and quality of life for survivors of nasopharyngeal carcinoma treated with intensity-modulated radiotherapy versus non–intensity-modulated radiotherapy. Head Neck. 2015;26(8):929–33.
Tao S, Ming F, Xue-Bang Z, et al. Sustained improvement of quality of life for nasopharyngeal carcinoma treated by intensity modulated radiation therapy in long-term survivors. Int J Clin Exp Med. 2015;8(4):5658–66.
Gary AE, James MG, Daniel L, et al. Guidance document on delivery, treatment planning, and clinical implementation of IMRT: report of the IMRT subcommittee of the AAPM radiation therapy committee. Med Phys. 2003;30(8):2089–115.
Wilbert C, Ganesh N, Morgan R, et al. Patient specific IMRT quality assurance with film, ionization chamber and detector arrays: our institutional experience. Radiat Phys Chem. 2015;115:12–6.
Livia M, Margherita Z, Stefania P, et al. GafChromic® EBT3 films for patient specific IMRT QA using a multichannel approach. Phys Med. 2015;31(8):1035–42.
García-Garduño OA, Lárraga-Gutiérrez JM, Rodríguez-Villafuerte M, et al. Effect of correction methods of radiochromic EBT2 films on the accuracy of IMRT QA. Appl Radiat Isot. 2016;107:121–6.
EW K, DJL W, Peter C, der Hulst v, et al. Clinical introduction of a linac head-mounted 2D detector array based quality assurance system in head and neck IMRT. Radiother Oncol. 2011;100:444–52.
Rajesh T, Arunai N, Sujit Nath S, et al. Analyzing the performance of ArcCHECK diode array detector for VMAT plan. Rep Pract Oncol Radiother. 2016;21(1):50–6.
Mohammad H, Pejman R, Martin AE, et al. A comparison of the gamma index analysis in various commercial IMRT/VMAT QA systems. Radiother Oncol. 2013;109:370–6.
Guangjun L, Yingjie Z, Xiaoqin J, et al. Evaluation of the ArcCHECK QA system for IMRT and VMAT verification. Phys Med. May 2013;29(3):295–303.
Cozzolino M, Oliviero C, Califano G, et al. Clinically relevant quality assurance (QA) for prostate RapidArc plans: gamma maps and DVH-based evaluation. Phys Med. Jun 2014;30(4):462–72.
Laure V, Jeremy M, Thomas B, et al. Gamma index comparison of three VMAT QA systems and evaluation of their sensitivity to delivery errors. Phys Med. 2015;31:720–5.
Ruurd V, Wauben DJL, Martijn de G, et al. Evaluation of DVH-based treatment plan verification in addition to gamma passing rates for head and neck IMRT. Radiother Oncol. 2014;112(3):389–95.
Ezzell GA, Burmeister JW, Nesrin D, et al. IMRT commissioning: multiple institution planning and dosimetry comparisons, a report from AAPM task group 119. Med Phys. 2009;36(11):5359–73.
Sotirios S, Panayiotis M, Chengyu S. γ+ index: a new evaluation parameter for quantitative quality assurance. Comput Methods Prog Biomed. 2014;114:60–9.
Benjamin EN, Heming Z, Wolfgang AT. Per-beam, planar IMRT QA passing rates do not predict clinically relevant patient dose errors. Med Phys. 2011;38(2):1037–44.
Stasi M, Bresciani S, Miranti A, et al. Pretreatment patient-specific IMRT quality assurance: a correlation study between gamma index and patient clinical dose volume histogram. Med Phys Dec. 2012;39(12):7626–34.
Cozzolino M, Oliviero C, Califano G, et al. Clinically relevant quality assurance (QA) for prostate RapidArc plans: gamma maps and DVH-based evaluation. Phys Med. 2014;39(2):134–8.
Arthur JO. Evaluation of the accuracy of 3DVH software estimates of dose to virtual ion chamber and film in composite IMRT QA. Med Phys. 2012;39(1):81–6.
Xueming S, Shengfa S. Chunyan Chen, et al. long-term outcomes of intensity-modulated radiotherapy for 868 patients with nasopharyngeal carcinoma: an analysis of survival and treatment toxicities. Radiother Oncol. 2014;110(3):398–403.
Lei Z, Tian Y-M, Xue-Ming S, et al. Late toxicities after intensity-modulated radiotherapy for nasopharyngeal carcinoma: patient and treatment-related risk factors. Br J Cancer. 2014;110:49–54.
Nelms BE, Simon WE. Radiation therapy plan dose perturbation system and method. United States Patent 7945022. 2011. http://www.freepatentsonline.com/7945022.html(2017). Accessed 13 Feb 2018.
Pablo C, Núria J, Artur L, et al. 3D DVH-based metric analysis versus per-beam planar analysis in IMRT pretreatment verification. Med Phys. 2012;39(8):5040–9.
Heming Z, Benjamin EN, Wolfgang AT. Moving from gamma passing rates to patient DVH-based QA metrics in pretreatment dose QA. Med Phys. 2011;38(10):5477–89.
This work was jointly supported by the National Key R&D Program of China (2017YFC0113200), Science and Technology program of Guangdong Province, China (2015B020214002) and Science and Technology program of Guangzhou, China (201508020105).
Availability of data and materials
The datasets are backed up on the Research Data Deposit public platform (RDD, http://www.researchdata.org.cn/, approval number: RDDA2017000312) and are available on reasonable request.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original version of this article was revised: the original contained an error in the layout of Table 4.
Figure S1. Dose deviation (DD(%)) in PTVnx vs GP (%)-linear fits and R2 were reported. Figure S2. Dose deviation (DD(%)) in PTV1 vs GP (%)-linear fits and R2 were reported. Figure S3. Dose deviation (DD(%)) in PTV2 vs GP (%)-linear fits and R2 were reported. Figure S4. Dose deviation (DD(%)) in spinal cord vs GP (%)-linear fits and R2 were reported. Figure S5. Dose deviation (DD(%)) in Brain stem vs GP (%)-linear fits and R2 were reported. Figure S6. Dose deviation (DD(%)) in left Parotid gland vs GP (%)-linear fits and R2 were reported. Figure S7. Dose deviation (DD(%)) in right Parotid gland vs GP (%)-linear fits and R2 were reported. Figure S8. Dose deviation (DD(%)) in left Temporal lobe vs GP (%)-linear fits and R2 were reported. Figure S9. Dose deviation (DD(%)) in right Temporal lobe vs GP (%)-linear fits and R2 were reported. Figure S10. Dose deviation (DD(%)) in Larynx vs GP (%)-linear fits and R2 were reported. (ZIP 17676 kb)
Table S1. The comparison of the 3D globe, and organ-specific GP (%) calculated with local dose normalization for 30 NPC cases with different gamma criteria. Table S2. Pearson correlation coefficient with three type gamma pass rate calculated with local dose normalization and DV, VD. Table S3. Significant p-values for correlation between three type gamma pass rate and DV, VD. (ZIP 55 kb)