Skip to main content

Neural network and spline-based regression for the prediction of salivary hypofunction in patients undergoing radiation therapy

Abstract

Background

This study leverages a large retrospective cohort of head and neck cancer patients in order to develop machine learning models to predict radiation induced hyposalivation from dose-volume histograms of the parotid glands.

Methods

The pre and post-radiotherapy salivary flow rates of 510 head and neck cancer patients were used to fit three predictive models of salivary hypofunction, (1) the Lyman-Kutcher-Burman (LKB) model, (2) a spline-based model, (3) a neural network. A fourth LKB-type model using literature reported parameter values was included for reference. Predictive performance was evaluated using a cut-off dependent AUC analysis.

Results

The neural network model dominated the LKB models demonstrating better predictive performance at every cutoff with AUCs ranging from 0.75 to 0.83 depending on the cutoff selected. The spline-based model nearly dominated the LKB models with the fitted LKB model only performing better at the 0.55 cutoff. The AUCs for the spline model ranged from 0.75 to 0.84 depending on the cutoff chosen. The LKB models had the lowest predictive ability with AUCs ranging from 0.70 to 0.80 (fitted) and 0.67 to 0.77 (literature reported).

Conclusion

Our neural network model showed improved performance over the LKB and alternative machine learning approaches and provided clinically useful predictions of salivary hypofunction without relying on summary measures.

Background

Radiotherapy to the head and neck can result in a wide array of morbidities. One such morbidity, which can have a devastating effect on oral health, is radiation-induced salivary gland hypofunction [1]. Despite being a relatively slow-dividing type of cell, salivary glands demonstrate remarkable sensitivity to ionizing radiation. At low doses radiation damage may be reversible, but substantial decreases in salivary function have been noted with even moderate doses of radiation (30–40 Gy) [2, 3]. Cumulative therapeutic doses of radiation of 60–70 Gy often result in irreversible loss of function.

Saliva has diverse function in the oral cavity. Saliva moistens the oral mucosa, contains antibodies that facilitate immune response to oral microbes, buffers against changes in pH, contains minerals that allow for remineralization of damaged tooth structure, cleanses the teeth, moistens food to aid in swallowing, and is required for normal gustatory sensation. HNC survivors that experience hyposalivation consequently are at risk for an increase in dental disease, fungal infection, dysphagia, dysgeusia, and friability of the oral mucosa. Because these symptoms interfere with eating, taste, and speech they can contribute to social isolation and poor quality of life [4, 5].

Predicting salivary gland hypofunction after radiation therapy is complicated by the sophistication of modern radiation plans. The most common way to summarize these plans is using a dose-volume histogram (DVH), which is a step function relating the dosage (ranging from 0 to the maximum dose) and the fraction of the total organ volume that received at least the given dose. The DVH requires processing to be incorporated in predictive models as it has a high number of dimensions. Since its development in the 1980s, the Lyman-Kutcher-Burman (LKB) model has been the standard method for processing the DVH and assessing normal tissue complication probabilities (NTCP) during radiation treatment planning. This model is appealing in that it is estimated using parameters that are tangible, volumes of the organ in question, the dose that would result in 50% NTCP with uniform irradiation of the total volume (\({TD}_{50}(1)\)), an organ specific parameter describing how the organ is affected by partial irradiation (n), and the slope of the dose–response curve (m). Despite the clinical appeal of this class of models, they have a particular limitation as they often are not well calibrated, a criticism that was recognized by the authors in their original publication [6]. Another limitation is that the LKB model requires a binary input for what constitutes a complication necessitating an arbitrary cutoff to be established for the fitting process. Finally, the LKB does not allow for inclusion of other potentially relevant covariates. While this may be a lesser concern in radiation planning specifically, having a model that incorporates the radiation treatment information separate from any model that incorporates information on comorbidities and medications presents a problem when trying to holistically assess a patient’s risk of developing a complication. The LKB model has been applied to the parotid glands and data exists which estimate relevant parameters for model fitting [7].

The LKB model was the original prediction model for NTCPs and was applied to a wide variety of organs. Roesink et al. [7] performed a study in 2001 in which they used a cohort of 108 patients to estimate the parameters of the LKB for parotid glands and salivary hypofunction. More recently, Beetz et al. [8] and Li et al. [9] developed regression models that predict salivary hypofunction using a variety of clinical and demographic information in addition to a DVH-derived mean organ dose in cohorts of size 167 and 365 respectively. Still others have attempted to predict either xerostomia or salivary hypofunction without using any data on the radiation plan [10, 11] while others have turned to delta radiomics [12] for prediction, although these calculations cannot be performed in the radiation planning phase.

Among these attempts, the only studies that utilized the radiation plan as a predictor (the LKB model via Roesink and the regression-based models of Beetz and Li which relied on mean organ dose), only utilized a gross summary of the DVH. In this manuscript, we compare the LKB method for assessing the probability of radiation induced salivary gland hypofunction to two alternative methods of prediction, which incorporate the information from the radiation DVH in a way in which the full information about the distribution of radiation dose across the organ is preserved. The first method summarizes the DVH using a cubic spline basis and uses this as input to a standard regression model. The second method registers the value of the DVH at each dose and uses these as inputs into a neural network. This study arrived at new estimates of the LKB model parameters for the parotid glands which were derived in a larger cohort than those previously reported, developed and demonstrated two methods for incorporating the complete information contained within the DVH into a prediction model, and obtained some level of evidence that grossly summarizing the radiation plan may adversely impact prediction of NTCPs in the parotid glands.

Methods

Patients

510 patients undergoing radiotherapy for H&N cancers at BC Cancer between November 2004 and July 2015 were enrolled in this study. Patients were treated with either intensity modulated radiation therapy or volumetric modulated arc therapy. Radiation dose for all radiotherapy plans were calculated using the analytical anisotropic algorithm using the same planning system, dose prescribing convention, and dosimetric grid size. DVHs for each patient’s parotid glands (both ipsilateral and contralateral to the tumor site) were extracted using DICOMautomation. Patients were excluded if: they were unable to follow written saliva collection procedures; they received atypical chemotherapy agents (i.e. an agent other than cetuximab, cisplatin, carboplatin, or gemcitabine); they received electron therapy; or they had previous interfering radiotherapy. In addition to routine clinical quality assurance procedures prior to delivery of radiotherapy plans, a single senior H&N Radiation Oncologist (JW) validated the consistency and accuracy of salivary contours of the parotid glands specifically for research quality assurance purposes after plan delivery. Although salivary function can also be impacted by radiation to the other major salivary glands, stimulated salivary function is most impacted by the parotid glands. In addition, the LKB model requires gland specific parameters and therefore cannot be fit or otherwise compared to the candidate approaches in a mixed-gland framework.

Saliva collection

Stimulated saliva was measured prior to radiotherapy and one year following completion of treatment. Measurements comprise whole-mouth saliva collected with patients in prone or upright position over a five-minute period while chewing flavorless wax.

Radiation planning/data collection

All clinical plans were created according to institutional guidelines using the Varian Eclipse treatment planning system. Dose-volume histograms for the parotid glands were extracted from clinical plans using DICOMautomaton [13], an open source toolkit for radiotherapy analysis.

LKB model

The LKB model addresses the multidimensionality of a DVH as a predictor variable by reducing it to a single dose and volume in a process originally described by Lyman [6, 14]. This reduction of the multidimensional DVH to a single dose over an effective volume is justified by an assumed power law relationship, where \(i\) represents each step of the DVH.

$$V=\sum_{i}\Delta {V}_{i}{\left[\frac{{D}_{i}}{{D}_{max}}\right]}^\frac{1}{n}$$

Transformed single-step histograms are assumed to have the same complication probability as the original one. The newly transformed dose and volume are then used to normalize the dose with \(m\times {TD}_{50}(v)\) serving as an estimate of the standard deviation of dose, where V and D are the transformed values from the DVH and Vtotal is the total organ volume (in this case the parotid glands).

$$t=\frac{D-{TD}_{50}(v)}{m \times {TD}_{50}(v)}$$

where \({TD}_{50}(v)\) is attained from another assumed power law.

$$v=\frac{V}{{V}_{total}}$$
$${TD}_{50}\left(1\right)={TD}_{50}(v)\times {v}^{n}$$

The final model estimate is then calculated by plugging the estimated t into the cumulative distribution of a standard normal random variable.

$$NTCP=\frac{1}{\sqrt{2\pi }}{\int }_{-\infty }^{t}{e}^{\frac{{-t}^{2}}{2}}dt$$

Due to the LKB model requiring a binary definition of complication, patients were defined to have a complication if their post-radiation salivary flow rate was reduced to less than 25% of the preoperative rate (i.e., a ‘severe’ reduction). The model was fit using maximum likelihood. After transforming the DVHs to a single dose and volume, the ratio of post-treatment to pre-treatment whole salivary flow was dichotomized and the model fit resulting in the 3 organ specific parameters (TD50(1), m, n). Roesink et.al conducted a study of 93 patients in which these parameters were estimated to be 31 Gy, 0.54, and 1 respectively [7]. The fitted values as well as those reported by Roesink et al. were also used in the assessment of the candidate models fit with new methods.

Alternative models

The first alternative model addresses the high dimensionality of the DVH using a cubic spline basis. This procedure fits a polynomial function with a specified form to each of the DVHs in the dataset. For this application, it was decided that a spline function with 5 knots equally spaced across the range of observed doses imparted sufficient flexibility to adequately mimic the DVHs, Fig. 1. The resulting fitted model contains six coefficients which are then used as predictors in a logistic regression model, which also incorporates splines to improve model flexibility.

Fig. 1
figure 1

Three examples of spline approximation of dose-volume histograms and their approximation by cubic spline basis (red)

The second model extracts the volume recorded in the DVH at intervals of 1 Gy from 0 to 70 Gy. These values are then used directly as inputs into a neural network with a single hidden layer containing 12 nodes and a decay of 0.8, which is a form of regularization for the model. These model parameters were determined by using ten-fold cross validation to obtain optimal predictive performance.

Both of these candidate models contain tuning parameters, which were optimized using tenfold cross-validation of the AUC for predicting a decrease in salivary flow rate of 0.5*baseline. In the case of the regression-based approach, the cubic splines were applied to the model inputs with the number of knots being tuned by tenfold cross-validation. In the case of the neural network, the decay parameter was employed. The decay parameter regularizes the model penalizing the size of the weights to prevent overfitting and improves the performance of the model-fitting algorithm by reducing flat spots in the cost function by inducing a differential penalty between highly correlated inputs like those coming from the dose-volume histogram. Additionally, alternative architectures were tested in which the number of inputs were reduced to as few as 10-equally spaced readings from the DVH and hidden layer sizes ranging from 5 to 25. However, reducing the number of inputs did not improve performance with performance being negatively effected at the smallest number of inputs.

Evaluation

The data was partitioned into a training set containing 70% of the observations and a test set containing 30%. The predictive performances of the models were compared using area under the receiver operating characteristic curve in the test set. Sensitivity to the cutoff for reduction in salivary flow rate was examined by including a variety of other potential cutoffs. All analyses were conducted in the R statistical computing program [15].

Results

Patient demographics

Since salivary measurements were considered standard-of-care for dental monitoring at the study site, study participant demographics are representative of institutional-level head-and-neck radiotherapy patient demographics. 335 (65.7%) were male, 118 (23.1%) were female, and 57 (11.1%) were unknown or other gender. Average patient age when radiotherapy began was 59.8 years (standard deviation: 11.9 years; minimum: 18.8 years; maximum: 90.9 years).

Tumour primary site was: nasopharynx for 110 (21.6%) patients; tonsil for 94 (18.4%); base of tongue for 76 (14.9%); larynx for 27 (5.2%); thyroid for 13 (2.5%); and unknown or various other sites for the remaining patients (Table 1).

Table 1 Demographics for study participants

Salivary function

Patients baseline salivary function was measured prior to initiation of cancer therapy. The mean whole stimulated salivary flow rate was 1.47 g/min (95% CI (1.40, 1.55)). Post cancer therapy, salivary function was reassessed at 3 months and one year post radiation, with the one-year data being used to train the model. The post therapy mean whole stimulated salivary flow rates were 0.57 g/min (95% CI (0.52, 0.62)) and 0.77 g/min (95% CI (0.70, 0.83)) respectively.

Model fitting

The resulting LKB model parameter estimates from the data were 39.2 Gy, 1.1, and 1 for TD50(1), m, and n respectively. During the fitting of the candidate machine learning models, the number of inputs for the learning methods was considered. Models were fit using the ipsilateral parotid data only as well as combining data from the ipsilateral and contralateral glands. Given no substantial improvement in predictive ability, the data presented here represents the more parsimonious approach, which utilized only data from the ipsilateral parotid gland.

Comparison of predictive ability

The LKB, neural network, and the spline basis models were fit in the training set and predictions were derived for the test set. Predictions based on the parameter estimates found by Roesink et al. were also generated for the test set. Because model performance may differ depending on the definition of what constitutes a complication, predictive accuracy was evaluated using a cutoff dependent area under the ROC curve. The neural network model dominated the LKB models demonstrating better predictive performance at every cutoff with AUCs ranging from 0.75 to 0.83 depending on the cutoff selected. Similarly the spline based model nearly dominated the LKB models with the fitted LKB model only performing better at the 0.55 cutoff. The AUCs for the spline model ranged from 0.75 to 0.84 depending on the cutoff chosen. The LKB models had the lowest predictive ability with AUCs ranging from 0.70 to 0.80 (fitted) and 0.67 to 0.77 (Roesink et al.), Fig. 2.

Fig. 2
figure 2

Cutoff dependent AUC for 4-candidate models: This figure illustrates how the four models compared to each other depending on what cutoff was chosen to define a salivary function complication. For example 0.5 on the x-axis indicates the results obtained when patients whose salivary function decreased by half or more were considered a complication

The statistical significance of the difference between the models was dependent on the cutoff chosen to define the complication. Using a cutoff at 0.5*baseline, all differences were statistically significant ANN vs regression (p = 0.004); ANN versus LKB (p < 0.001); regression versus LKB (p = 0.031). The neural network maintained statistically significant p-values at cutoffs from 0.45*baseline to 0.7*baseline. Below 0.45*baseline and at 0.7–0.75*baseline, there was no significant difference between the regression and neural network’s performance. The difference between the neural network and the fitted LKB model was statistically significant at every cutoff. The regression approach was statistically superior to the LKB model at all cutoffs with the exception of 0.55–0.60*baseline at which point they performed similarly.

Discussion

Since the original formulation of the LKB model, its limitations have been recognized [6]. However, the convenient and clinically relevant parameterization of the LKB model coupled with a lack of compelling alternatives have made it a mainstay of radiation treatment planning. Although these parameters have proved to be clinically useful, they are only directly comparable between organs to the extent that the LKB model assumptions are uniformly satisfied for each. Parameters in the alternative approaches lack a readily apparent clinically relevant interpretation. In order to obtain similarly relevant organ specific information, the user must analyze how the predictions from the model change with varying inputs. For example, feeding a DVH representative of uniform irradiation at various levels can be used to determine TD50 by simply inputting dosage until the model returns a 50% complication probability. The effect of partial irradiation, typically associated with LKB parameter n, could similarly be determined by the feeding the models DVHs consistent with partial irradiation. Unlike the LKB model estimates, clinically useful measures obtained in this manner would likely be directly comparable under any circumstances.

Comparison of the predictive ability of the four candidate models suggests that alternative approaches to incorporating dose-volume histograms paired with modern machine learning approaches can provide improved discrimination for the prediction of post-radiation hyposalivation. These improvements are likely due to information loss in the way in which the LKB model and similar approaches which rely on simple low-dimensionality summaries of the DVH incorporate the DVH information, such as average organ dose [8, 9].

Minor differences between the LKB model parameters in the fitted LKB model and the model from Roesink et al. can likely be attributed to a combination of estimation error and differences in measuring and defining salivary hyposalivation. These differences also account for the model’s low performance. The Roesink study was relatively small (n = 108) resulting in a relatively large degree of random error. In addition, effective LKB parameter values may shift over time due to advances in therapeutic methodologies.

Models which start with the DVH are limited by the fact that they cannot account for the variable clinical impact of radiation delivered to specific anatomic regions; spatial location information is lost when the radiation is summarized as a DVH. There is some evidence to suggest that radiation to specific regions of the parotid glands, for example those containing stem and progenitor cells, may be particularly detrimental [16,17,18]. This limitation would apply to any DVH-based approach to an organ with such a region, where functional capacity was particularly dependent on a specific anatomical location. However, in these cases similar machine learning models could be applied directly to the three-dimensional dosimetry data, but these approaches would require extra steps to achieve spatial registration and extremely large amounts of data, likely on the order of tens of thousands of patients, to properly fit the predictive models.

Conclusions

One advantage of the alternative modeling approaches explored here is that they can easily incorporate other clinical information relevant to predicting patient outcomes. Although this study was limited by the lack of availability of other clinically meaningful information, future studies could incorporate other factors pertinent to predicting hyposalivation such as chemotherapies, comorbidities, use of other medications commonly associated with hyposalivation, and delta radiomics data, which have been shown to have predictive value beyond that of dosing information alone [19,20,21]. In addition, this study was limited by lack of access to the DVHs of other major salivary glands. However, it is unlikely that the addition of the information from these glands would have made a substantial difference in predictive ability as they almost certainly contain less predictive information than is contained in the contralateral parotid gland, whose addition did not improve predictive accuracy. It is possible that information about the other glands could prove more fruitful in even larger cohorts. While the model using the DVH alone is helpful for radiation planning, larger models that can accurately predict complications from any cause could help supportive care teams to quickly and accurately identify side effects and intervene proactively.

Availability of data and materials

Data access can be requested in collaboration with the BC cancer institute.

References

  1. Jellema AP, et al. Impact of radiation-induced xerostomia on quality of life after primary radiotherapy among patients with head and neck cancer. Int J Radiat Oncol Biol Phys. 2007;69(3):751–60.

    Article  PubMed  Google Scholar 

  2. Deasy JO, et al. Radiotherapy dose-volume effects on salivary gland function. Int J Radiat Oncol Biol Phys. 2010;76(3 Suppl):S58-63.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Bentzen SM, et al. Quantitative analyses of normal tissue effects in the Clinic (QUANTEC): an introduction to the scientific issues. Int J Radiat Oncol Biol Phys. 2010;76(3 Suppl):S3-9.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Abendstein H, et al. Quality of life and head and neck cancer: a 5 year prospective study. Laryngoscope. 2005;115(12):2183–92.

    Article  PubMed  Google Scholar 

  5. Nordgren M, et al. Quality of life in oral carcinoma: a 5-year prospective study. Head Neck. 2008;30(4):461–70.

    Article  PubMed  Google Scholar 

  6. Kutcher GJ, Burman C. Calculation of complication probability factors for non-uniform normal tissue irradiation: the effective volume method. Int J Radiat Oncol Biol Phys. 1989;16(6):1623–30.

    Article  CAS  PubMed  Google Scholar 

  7. Roesink JM, et al. Quantitative dose-volume response analysis of changes in parotid gland function after radiotherapy in the head-and-neck region. Int J Radiat Oncol Biol Phys. 2001;51(4):938–46.

    Article  CAS  PubMed  Google Scholar 

  8. Beetz I, et al. Development of NTCP models for head and neck cancer patients treated with three-dimensional conformal radiotherapy for xerostomia and sticky saliva: the role of dosimetric and clinical factors. Radiother Oncol. 2012;105(1):86–93.

    Article  PubMed  Google Scholar 

  9. Li M, et al. A prediction model for xerostomia in locoregionally advanced nasopharyngeal carcinoma patients receiving radical radiotherapy. BMC Oral Health. 2022;22(1):239.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Villa A, Nordio F, Gohel A. A risk prediction model for xerostomia: a retrospective cohort study. Gerodontology. 2016;33(4):562–8.

    Article  PubMed  Google Scholar 

  11. Yang K, et al. A nomogram for predicting late radiation-induced xerostomia among locoregionally advanced nasopharyngeal carcinoma in intensity modulated radiation therapy era. Aging. 2021;13(14):18645–57.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Liu Y, et al. Early prediction of acute xerostomia during radiation therapy for nasopharyngeal cancer based on delta radiomics from CT images. Quant Imaging Med Surg. 2019;9(7):1288–302.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Clark, H., DICOMautomaton. 2020: Zenodo.

  14. Lyman JT. Complication probability as assessed from dose-volume histograms. Radiat Res Suppl. 1985;8:S13–9.

    Article  CAS  PubMed  Google Scholar 

  15. R Core Team. R: A language and environment for statistical computing. 2017; Available from: https://www.R-project.org/.

  16. van Luijk P et al. Sparing the region of the salivary gland containing stem cells preserves saliva production after radiotherapy for head and neck cancer. Sci Transl Med, 2015; 7(305): 305ra147.

  17. Buettner F, et al. Novel approaches to improve the therapeutic index of head and neck radiotherapy: an analysis of data from the PARSPORT randomised phase III trial. Radiother Oncol. 2012;103(1):82–7.

    Article  PubMed  Google Scholar 

  18. Clark HD et al. Heterogeneous radiotherapy dose-outcomes response in parotid glands. Converg Sci Phys Oncol. 2018;4.

  19. van Dijk LV, et al. Delta-radiomics features during radiotherapy improve the prediction of late xerostomia. Sci Rep. 2019;9(1):12483.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Rosen BS, et al. Early changes in serial CBCT-measured parotid gland biomarkers predict chronic xerostomia after head and neck radiation therapy. Int J Radiat Oncol Biol Phys. 2018;102(4):1319–29.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Pota M, et al. Early prediction of radiotherapy-induced parotid shrinkage and toxicity based on CT radiomics and fuzzy classification. Artif Intell Med. 2017;81:41–53.

    Article  PubMed  Google Scholar 

Download references

Funding

This research was partially funded by a grant from the American Cancer Society (IRG# 19-139-60).

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to the conceptualization of the study. Authors JW AH and HC were primarily responsible for data collection and curation. DS was responsible for data analysis and model development. All authors contributed to the manuscript and editing.

Statement of clinical relevance

This study uses advanced machine learning techniques to predict post-radiotherapy hyposalivation in survivors of head and neck cancers. This study includes data from a larger number of patients and does not rely on summary measures of the dose-volume histogram.

Corresponding author

Correspondence to Derek K. Smith.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the internal review board of BC Cancer Institute. Consent to participate: All data salivary flow rate information was collected as standard of care. Consent was waived due to the study being minimal risk.

Consent for publication

No identifiable information was used in this manuscript.

Competing interests

The Authors Deny any conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Smith, D.K., Clark, H., Hovan, A. et al. Neural network and spline-based regression for the prediction of salivary hypofunction in patients undergoing radiation therapy. Radiat Oncol 18, 77 (2023). https://doi.org/10.1186/s13014-023-02274-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13014-023-02274-9

Keyword