Skip to main content

Predicting 2-year survival in stage I-III non-small cell lung cancer: the development and validation of a scoring system from an Australian cohort



There are limited data on survival prediction models in contemporary inoperable non-small cell lung cancer (NSCLC) patients. The objective of this study was to develop and validate a survival prediction model in a cohort of inoperable stage I-III NSCLC patients treated with radiotherapy.


Data from inoperable stage I-III NSCLC patients diagnosed from 1/1/2016 to 31/12/2017 were collected from three radiation oncology clinics. Patient, tumour and treatment-related variables were selected for model inclusion using univariate and multivariate analysis. Cox proportional hazards regression was used to develop a 2-year overall survival prediction model, the South West Sydney Model (SWSM) in one clinic (n = 117) and validated in the other clinics (n = 144). Model performance, assessed internally and on one independent dataset, was expressed as Harrell’s concordance index (c-index).


The SWSM contained five variables: Eastern Cooperative Oncology Group performance status, diffusing capacity of the lung for carbon monoxide, histological diagnosis, tumour lobe and equivalent dose in 2 Gy fractions. The SWSM yielded a c-index of 0.70 on internal validation and 0.72 on external validation. Survival probability could be stratified into three groups using a risk score derived from the model.


A 2-year survival model with good discrimination was developed. The model included tumour lobe as a novel variable and has the potential to guide treatment decisions. Further validation is needed in a larger patient cohort.


Lung cancer is the leading cause of cancer incidence and mortality globally, of which non-small cell lung cancer (NSCLC) comprises 85% [1]. In Australia, the majority of NSCLC patients are diagnosed at stage I-III, with guidelines recommending curative radiotherapy in for those who are medically inoperable or refuse surgery [2].

Over the last two decades, while there has been an increase in the use of curative treatment [3], significant patterns of radiotherapy underutilisation persist across Australia and internationally [4]. Despite optimal radiotherapy rates being 61–76% of NSCLC patients, in practice this falls to 42–43% [5] due to a multitude of reasons including patient comorbidity and clinician bias. One strategy for addressing such variation is the development of a survival prediction model that integrates individual, medical and environmental factors unaccounted for by guidelines that commonly influence treatment decisions. This would have the potential to objectively evaluate treatment benefits in individual patients to facilitate shared decision-making, tailor patient management and optimise outcomes [6].

At present, the tumour, node, and metastasis (TNM) classification is considered gold standard for NSCLC prognostication. However stage alone is a poor predictor of overall survival, accounting for less than half of prognostic variance [7]. NSCLC patients within the same anatomic stratification are inherently heterogenous, with actual prognosis depending on a complex interplay of patient, tumour and treatment characteristics [8]. To accurately predict NSCLC survival beyond TNM stage and clinical judgement alone [9], quantitative survival prediction models that can be applied to specific patient profiles must account for a range of predictive factors, reflect current practice and demonstrate higher concordance than existing prognostication methods.

While several models have been published, none have demonstrated superior performance, applicability or global utility [11]. There is considerable discordance in the factors included in prognostic tools, with a systematic review discussing incomplete coverage of established predictors and the incorporation of variables that are difficult to measure as key shortcomings of published models [10]. Additionally, the discriminatory accuracies of existing tools have generally been insufficient to justify deviation from conventional staging systems [11, 12]. Furthermore, with the development of newer radiotherapy protocols, targeted therapies and immunotherapy, earlier studies fail to capture contemporary approaches to NSCLC and are no longer clinically relevant [13]. There is a need for prediction models that incorporate comprehensive data from cohorts treated with modern radiotherapy techniques, and that encompass emerging factors such as mutation and programmed cell death-ligand-1 (PD- L1) status [14].

The primary aim of this study was to develop and validate a 2-year survival prediction model in a contemporary cohort of stage I–III NSCLC patients treated with radiotherapy. Secondary aims were to compare model survival predictions to those predicted by TNM stage and validate published survival prediction models in a comparable cohort of patients.



This retrospective cohort study included patients diagnosed with inoperable stage I-III NSCLC between January 2016 and December 2017 at three Australian radiotherapy treatment institutions. Tumour staging was performed based on multidisciplinary team recommendations. All patients treated radically were staged using positron emission tomography-computed tomography (PET-CT), with confirmation using endobronchial ultrasound and biopsy in instances of uncertainty or where there was the potential to influence management. Patients who received radiotherapy alone or chemoradiotherapy (concurrent or sequential) were eligible for inclusion. Patients treated surgically or for recurrent disease were excluded. The development cohort comprised patients from South Western Sydney Local Health District (SWSLHD). The validation cohort included patients from Blacktown Cancer and Haematology Centre and Illawarra Cancer Care Centre (BICC).


Retrospective data were retrieved through automated and manual extraction methods using the electronic medical record systems MOSAIQ (Elekta AB, Stockholm, Sweden), ARIA (Varian Medical Systems, Palo Alto, CA) and Cerner Powerchart (Cerner Corp, North Kansas City, MO). Gross tumour volume (GTV) data were obtained from radiotherapy planning systems for patients who received radiotherapy.

Data were collected for all available predictive variables for survival as identified from a prior literature review. Patient-related variables included: age at diagnosis, sex, current smoking status, pack years smoked, weight loss, pre-treatment pulmonary function (percent predicted values for forced expiratory volume in 1 s (FEV1) and diffusing capacity of the lung for carbon monoxide (DLCO)), Eastern Cooperative Oncology Group (ECOG) performance status [15] and comorbidities as defined by the Simplified Comorbidity Score (SCS) [16].

Tumour-related variables were: TNM stage according to the International Association for the Study of Lung Cancer (IASLC) 8th edition [17], histology, tumour grade, GTV, tumour location, mutation status (epidermal growth factor receptor (EGFR), anaplastic lymphoma kinase (ALK) and V-raf murine sarcoma viral oncogene homolog B (BRAF)) and PD-L1 status [18]. GTV was defined as the sum of the GTV primary and GTV lymph nodes.

Treatment-related variables included radiotherapy technique, radiotherapy treatment duration, equivalent dose in 2 Gy fractions (EQD2) and use of chemotherapy. Radiotherapy across the three cohorts included both conventional and stereotactic ablative body radiotherapy (SABR), with radiotherapy technique classified as conformal, intensity-modulated radiotherapy (IMRT) and volumetric modulated arc therapy (VMAT).

The primary outcome of overall survival was recorded as patient status at 31/12/2019 with all follow-up data obtained before this study end date. Survival time was defined as the period from the start of radiation therapy until date of death or until 31/12/2019 for living patients.

Statistical analysis

The Kaplan–Meier method was used to predict survival in the study population. Variables missing > 50% of data were excluded from univariate and multivariate analysis. In the development cohort, univariate Cox proportional hazards regression was used to evaluate the predictive value of variables, with those demonstrating an association with overall survival (p < 0.20) considered for inclusion in multivariate analysis. Backward stepwise regression was applied to select the variables retained in the final multivariate model [19]. Model fit was evaluated using the Hosmer–Lemeshow goodness of fit test.

A scoring system for the South West Sydney Model (SWSM) was generated using the logarithm of the odds ratio (OR) to allocate points for each variable. Risk groups were defined according to total score quartiles in the development cohort. Kaplan–Meier curves were generated and log-rank test used to evaluate significant survival differences between subgroups. The model was applied to the SWSLHD cohort to assess internal validity, with discrimination estimated using Harrell’s concordance index (c-index) [20]. To assess the impact of missing DLCO data on model performance, validation was performed twice, initially with missing data excluded and again with simple mean imputation. Model calibration was assessed graphically by plotting observed survival probabilities against predicted probabilities and calculating the calibration slope and intercept for the development and validation cohorts [21]. External validation was performed by applying the SWSM to the BICC cohort.

The performance of the multivariate model was compared with predictions based on TNM staging alone. In addition, an existing prediction model was externally validated in the development and validation cohorts and compared to the SWSM. The discrimination of the model was also assessed using Harrell’s c-index.

Statistical analyses were performed using IBM SPSS Statistics, version 26.0 (IBM Corp, Armonk, NY), Matlab, version 9.3 (MathWorks, Natick, MA) and Python, version 3.6 (Python Software Foundation, Wilmington, DE). Ethics approval was obtained from the SWSLHD Human Research Ethics Committee.


The patient, tumour and treatment characteristics of each study site are summarised in Table 1. There were a total of 261 patients included in the study. At the study end date, 47.5% of patients were alive. In the development cohort, mutation status was unknown in 65.0% and PD-L1 status in 85.5% of patients and these variables were excluded from analysis.

Table 1 Characteristics of the development and validation datasets

By univariate analysis, the variables predictive of survival in the study population were ECOG performance status, DLCO, overall stage, histological diagnosis, tumour lobe, radiotherapy technique, radiotherapy treatment duration and EQD2 (Table 2). On multivariate analysis, the variables predictive of survival were ECOG performance status, DLCO, histological diagnosis, tumour lobe and EQD2 (Table 3). The Hosmer–Lemeshow test demonstrated good model fit using the selected predictors (p = 0.65). The scoring system for the SWSM is presented in Table 4.

Table 2 Survival predictors in the development cohort by univariate and multivariate Cox proportional hazards regression
Table 3 Odds ratios of SWSM for overall survival by multivariate analysis
Table 4 Scoring system for the SWSM

Internal validation of the SWSM model demonstrated a c-index of 0.70, 95% CI [0.64, 0.75]. Discrimination on the external validation cohort resulted in a c-index of 0.72, 95% CI [0.68, 0.76] with imputation and 0.67, 95% CI [0.58, 0.74] with missing DLCO data excluded. Calibration of the SWSM yielded a calibration slope of 0.89, 95% CI [0.40, 1.34] and intercept of 0.19, 95% CI [0.01, 0.35] in the development cohort and a calibration slope of 0.98, 95% CI [0.77, 1.16] and intercept of 0.16, 95% CI [0.06, 0.26] in the validation cohort (Fig. 1).

Fig. 1
figure 1

Calibration plots for the SWSM. SWSM calibration plots in the A development cohort (n = 117) and B validation cohort (n = 144)

The formation of four risk groups in the SWSLHD cohort according to SWSM quartiles resulted in no differences between groups 2 and 3. Subsequently these groups were combined, producing a total of three risk groups. Kaplan–Meier survival curves by SWSM risk group in the development and validation cohorts are presented in Fig. 2. Log-rank test identified significant differences (p < 0.05) between risk groups in both development and validation cohorts. Using the SWSM, 2-year survival probability in the development cohort was 63.3% in group 1, 55.0% in group 2 and 20.0% in group 3.

Fig. 2
figure 2

Survival curves according to SWSM risk group. Kaplan–Meier curves for cumulative survival of NSCLC patients by SWSM risk group in A development cohort (n = 117) and B validation cohort (n = 144). SWSM = South West Sydney Model

The c-index of TNM staging alone for survival prediction was 0.53, 95% CI [0.48, 0.58] in the SWSLHD cohort and 0.59, 95% CI [0.55, 0.63] in the BICC cohort. An existing model from the MAASTRO clinic [13] was externally validated to give c-indexes of 0.62, 95% CI [0.56, 0.67] in the SWSLHD cohort and 0.66, 95% CI [0.61, 0.70]in the BICC cohort.


To our knowledge, this study is the first to develop and validate a survival prediction model in a cohort of inoperable but potentially curable stage I-III NSCLC patients irrespective of radiotherapy intent. Developing a survival prediction model in this cohort may impact on management decisions as although in theory all patients should be treated with curative radiotherapy, this does not happen in real world practice. The SWSM incorporates a combination of well-established and novel predictors using routinely collected data to predict survival in a radiotherapy cohort. Notably, our incorporation of EQD2 enables the SWSM to be applied as a predictive models with the potential to facilitate treatment decisions and radiotherapy planning.

When evaluating the predictive performance of a model, a c-index greater than 0.6 generally reflects helpful discrimination [22]. In the development cohort, the SWSM demonstrated good predictive performance, and achieved similar results on external validation in one independent cohort. There was minimal difference between the c-indexes obtained on external validation when missing data were excluded or imputed. Importantly, the SWSM maintained discrimination superior to survival predictions based on TNM staging alone. This is in accordance with published models whereby the addition of demographic and clinical covariates translated into more robust survival predictions [14, 23, 24].

In prediction models, calibration illustrates the association between predicted and observed outcomes. A slope of one and intercept of zero indicate perfect calibration [25]. While our model calibration plot showed good agreement between expected and actual survival probabilities, the slope and positive intercept are suggestive of survival underestimation. Validation in a larger population is required for the generation of a more precise calibration curve.

Several models predicting survival in inoperable NSCLC cohorts have been published [11, 14, 15], however none have examined the entire inoperable stage I-III radiotherapy cohort in a contemporary setting. The MAASTRO model is a prognostic tool for 2-year survival of stage I-III NSCLC patients treated with curative chemoradiotherapy. However, since its 2009 publication, staging classifications and radiotherapy techniques have been updated. The c-indexes obtained during external validation of the SWSM were higher than those obtained on validation of the MAASTRO model [13]. In addition, the advantages of our model include its reflection of current practice and potential to be applied to patients receiving curative or palliative therapy.

Similarly, models published more recently have been limited to patients receiving curative radiotherapy or with early or localised disease. The STEPS (sex, T stage, Staging EBUS, performance status, N stage) score was developed in the UK to predict 2-year risk of death in stage I-III NSCLC and also contained five variables [26]. However, only patient and tumour-related factors were retained in this multivariate model, precluding its use in treatment decision-making. Another model developed in Japan only included patient-related variables (age, performance status, body mass index and Charlson comorbidity index) to predict non-lung cancer death in an elderly cohort receiving definitive SABR [27]. In contrast to these models, we deliberately chose to include a potentially curable population of patients with stage I-III NSCLC who were managed heterogeneously in order to develop a prediction model which can help decision-making in the real world.

During the development of the SWSM, the predictive variables considered for model inclusion encompassed patient, tumour and treatment factors as supported by current evidence. The patient-related determinants most commonly included in NSCLC models are age, sex and performance status. While a poorer prognosis has been associated with older age [27] and male sex [8] overall findings have been inconclusive. In line with previous models, our study found no significant associations between age and sex on the survival outcomes of the study population [28, 29]. In contrast, performance status has been associated with NSCLC survival in patients receiving curative radiotherapy [23, 26, 28, 30] and was retained in our final model. However, a recognised limitation of performance status as a predictive variable is its subjectivity and inter-observer variability [27].

The only other patient-related variable identified as a survival predictor was pre-treatment DLCO. This has been demonstrated in a model involving NSCLC patients treated with SABR [30]. Insufficient lung function is a common reason for medical inoperability and frequently determines the suitability of treatment, with one study identifying pre-treatment DLCO as the pulmonary function measure most strongly associated with overall survival [31]. While DLCO has been reported to influence NSCLC survival in the surgical literature [32], few studies have analysed its influence on inoperable cohorts. The results of this study could be used to support further research exploring this association.

The tumour-related characteristics included in the SWSM were histological diagnosis and tumour lobe. Consistent with prior studies, non-squamous cell tumours demonstrated better survival probability than squamous cell carcinomas [17, 33]. The inclusion of tumour lobe in the SWSM is novel. A recent systematic review concluded that tumours located in the upper lobe conferred improved survival compared to those in the middle or lower lobes [34], consistent with our findings. The increased treatment toxicities from higher cardiac dose for middle lobe tumours and higher lung dose for lower lobe tumours may explain this result. Furthermore, the lower lobe has been associated with an increased proportion of non-adenocarcinoma tumours and a lower frequency of EGFR mutations, both of which are unfavourable survival characteristics [35]. However, at present, evidence supporting the significance of tumour location as an independent predictor of survival is less established.

The inclusion of the treatment variable EQD2 in the SWSM allows its application as a predictive rather than a prognostic model [36]. By including EQD2 in the prediction model, using the scoring in Table 4, one can calculate survival depending on dosage regimen in an individual patient and counsel patients accordingly about risks and benefits of treatment. Some patients who only derive a small survival benefit from more intensive radiotherapy may not wish to risk the toxicities of treatment [37]. Others may choose to undergo higher dose radiotherapy for any survival benefit no matter how small. The model attempts to provide some objectivity to aid decision-making rather than relying on clinician judgement alone. Few studies have considered EQD2 as a variable [23] as most have been developed in a specific population only receiving curative treatment [11, 37]. Furthermore, while the survival benefit of increasing radiotherapy dose has been demonstrated, its ability to improve quality of life is yet to be established [38].

There are limitations to this study. The SWSM was developed using single institution data with a relatively small sample size, although the sample size is similar to other studies [37, 39]. The model was developed on the population demographic of South West Sydney, which has a higher proportion of overseas-born individuals compared to Australia as a whole, hence this may impact generalisability. This was a retrospective study relying on information documented in medical records resulting in inevitable missing data. Information not routinely collected within oncology information systems such as blood parameters were not analysed. Furthermore, data on the staging procedures used for individual patients were not collected, although this is reflective of a clinic cohort of patients. Survival may also have been impacted on by treatment at relapse which was not accounted for in this study. However the greatest impact on survival is initial treatment, and our methodology is similar to other survival prediction modelling studies [13, 39].

The findings of this study have implications for further research. External validation studies should be conducted by applying the SWSM to larger datasets to confirm findings and assess model generalisability. The current model may potentially be improved by the addition of mutation and PD-L1 status. Unfortunately, the influence of these predictors was unable to be evaluated as data on these markers were not routinely collected in stage I-III NSCLC patients during the time period of the study. Likewise, global advances in laboratory biomarkers and genomic parameters [40, 41] have been recently highlighted and may transform future NSCLC prognostication, however at present lack systematic investigation. In addition, data were not collected on cardiac radiation dose, which has been identified as an independent risk factor for all-cause mortality after radiotherapy in locally advanced NSCLC [42, 43]. Finally, impact analysis studies evaluating the acceptability, cost-effectiveness and practicality of the SWSM is required prior to clinical implementation [44].

We plan to develop and validate a survival prediction model in patients with Stage I-III NSCLC patients undergoing radiotherapy in a larger cohort of patients with distributed learning across multiple centres using the AusCAT network [45]. The factors found to be significant in this work will be considered alongside newer variables. The ultimate aim is to develop a tool to support radiotherapy decision-making in NSCLC using objective parameters rather than subjective clinical judgment. This will facilitate shared decision-making between patients and clinicians and reduce variability in treatment recommendations between clinicians and between institutions.


In conclusion, our study developed a survival prediction model in a real-world contemporary cohort of inoperable stage I–III NSCLC patients treated with radiotherapy. The SWSM utilises readily obtainable data and is convenient and simple to use by clinicians. The model exhibited good discrimination on both internal and external validation, and has the potential to guide treatment decisions. Further validation of this model is needed in a larger cohort of patients.

Availability of data and materials

The datasets used during the current study are available from the corresponding author on reasonable request.



Non-small cell lung cancer


Tumour, node, and metastasis


Programmed cell death-ligand-1


Positron emission tomography-computed tomography


South Western Sydney Local Health District


Blacktown Cancer and Haematology Centre and Illawarra Cancer Care Centre


Gross tumour volume


Forced expiratory volume in 1 s


Diffusing capacity of the lung for carbon monoxide


Eastern Cooperative Oncology Group


Simplified Comorbidity Score


International Association for the Study of Lung Cancer


Epidermal growth factor receptor


Anaplastic lymphoma kinase


V-raf murine sarcoma viral oncogene homolog B


Equivalent dose in 2 Gy fractions


Stereotactic ablative body radiotherapy


Intensity-modulated radiotherapy


Volumetric modulated arc therapy


South West Sydney Model


Odds ratio


Concordance index


  1. Bray F, et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68(6):394–424.

    Article  PubMed  Google Scholar 

  2. Cancer Council Australia Lung Cancer Guidelines Working Party. Clinical practice guidelines for the treatment of lung cancer. 2020. Accessed 20 June 2021.

  3. Koshy M, et al. Disparities in treatment of patients with inoperable stage I non-small cell lung cancer: a population-based analysis. J Thorac Oncol. 2015;10(2):264–71.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Vinod SK. International patterns of radiotherapy practice for non–small cell lung cancer. Semin Radiat Oncol. 2015;25(2):143–50.

    Article  PubMed  Google Scholar 

  5. Vinod SK, et al. Underutilization of radiotherapy for lung cancer in New South Wales. Australia Cancer. 2010;116(3):686–94.

    Article  PubMed  Google Scholar 

  6. Nguyen AD, et al. Radiotherapy patterns of care for stage I and II non-small cell lung cancer in Sydney, Australia. J Med Imaging Radiat Oncol. 2019;63(1):131–41.

    Article  PubMed  Google Scholar 

  7. Detterbeck FC, et al. The IASLC Lung Cancer Staging Project: methodology and validation used in the development of proposals for revision of the stage classification of NSCLC in the forthcoming (eighth) edition of the TNM classification of lung cancer. J Thorac Oncol. 2016;11(9):1433–46.

    Article  PubMed  Google Scholar 

  8. Lin J, et al. A prognostic model to predict mortality among non-small-cell lung cancer patients in the U.S. military health system. J Thorac Oncol. 2015;10(12):1694–702.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Oberije C, et al. A prospective study comparing the predictions of doctors versus models for treatment outcome of lung cancer patients: a step toward individualized care and shared decision making. Radiother Oncol. 2014;112(1):37–43.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Mahar AL, et al. Refining prognosis in lung cancer: a report on the quality and relevance of clinical prognostic tools. J Thorac Oncol. 2015;10(11):1576–89.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Bouwmeester W, et al. Reporting and methods in clinical prediction research: a systematic review. PLoS Med. 2012;9(5):1–12.

    Article  PubMed  Google Scholar 

  12. Jochems A, et al. A prediction model for early death in non-small cell lung cancer patients following curative-intent chemoradiotherapy. Acta Oncol. 2018;57(2):226–30.

    Article  CAS  PubMed  Google Scholar 

  13. Dehing-Oberije C, et al. Development and external validation of prognostic model for 2-year survival of non-small-cell lung cancer patients treated with chemoradiotherapy. Int J Radiat Oncol Biol Phys. 2009;74(2):355–62.

    Article  PubMed  Google Scholar 

  14. Alexander M, et al. Lung cancer prognostic index: a risk score to predict overall survival after the diagnosis of non-small-cell lung cancer. Br J Cancer. 2017;117(5):744–51.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Oken MM, et al. Toxicity and response criteria of the Eastern Cooperative Oncology Group. Am J Clin Oncol. 1982;5(6):649–56.

    Article  CAS  PubMed  Google Scholar 

  16. Colinet B, et al. A new simplified comorbidity score as a prognostic factor in non-small-cell lung cancer patients: description and comparison with the Charlson’s index. Br J Cancer. 2005;93(10):1098–105.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Chansky K, et al. The IASLC Lung Cancer Staging Project external validation of the revision of the TNM stage groupings in the eighth edition of the TNM classification of lung cancer. J Thorac Oncol. 2017;12(7):1109–21.

    Article  PubMed  Google Scholar 

  18. Davis AA, Patel VG. The role of PD-L1 expression as a predictive biomarker: an analysis of all US Food and Drug Administration (FDA) approvals of immune checkpoint inhibitors. J Immunother Cancer. 2019;7(1):278.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Dunkler D, et al. Augmented backward elimination: a pragmatic and purposeful way to develop statistical models. PLoS ONE. 2014;9(11):e113677.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Harrell FE Jr, Lee KL, Mark DB. Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med. 1996;15(4):361–87.

    Article  PubMed  Google Scholar 

  21. Moons KGM, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med. 2015;162(1):W1–73.

    Article  PubMed  Google Scholar 

  22. Alba AC, et al. Discrimination and calibration of clinical prediction models: users’ guides to the medical literature. JAMA. 2017;318(14):1377–84.

    Article  PubMed  Google Scholar 

  23. Oberije C, et al. A validated prediction model for overall survival from stage III non-small cell lung cancer: toward survival prediction for individual patients. Int J Radiat Oncol Biol Phys. 2015;92(4):935–44.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Putila J, Remick SC, Guo NL. Combining clinical, pathological, and demographic factors refines prognosis of lung cancer: a population-based study. PLoS ONE. 2011;6(2):e17493.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Van Calster B, et al. Calibration: the Achilles heel of predictive analytics. BMC Med. 2019;17(1):230.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Evison M, et al. Predicting the risk of disease recurrence and death following curative-intent radiotherapy for non-small cell lung cancer: the development and validation of two scoring systems from a large multicentre UK cohort. Clin Oncol. 2021;33(3):145–54.

    Article  CAS  Google Scholar 

  27. Hanazawa H, et al. Development and validation of a prognostic model for non-lung cancer death in elderly patients treated with stereotactic body radiotherapy for non-small cell lung cancer. J Radiat Res. 2021;62(6):1029–38.

    Google Scholar 

  28. Louie AV, et al. Predicting overall survival after stereotactic ablative radiation therapy in early-stage lung cancer: development and external validation of the Amsterdam Prognostic Model. Int J Radiat Oncol Biol Phys. 2015;93(1):82–90.

    Article  PubMed  Google Scholar 

  29. Jochems A, et al. Developing and validating a survival prediction model for NSCLC patients through distributed learning across 3 countries. Int J Radiat Oncol Biol Phys. 2017;99(2):344–52.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Kang J, et al. Predicting 5-year progression and survival outcomes for early stage non-small cell lung cancer treated with stereotactic ablative radiation therapy: development and validation of robust prognostic nomograms. Int J Radiat Oncol Biol Phys. 2020;106(1):90–9.

    Article  CAS  PubMed  Google Scholar 

  31. Guckenberger M, et al. Is there a lower limit of pretreatment pulmonary function for safe and effective stereotactic body radiotherapy for early-stage non-small cell lung cancer? J Thorac Oncol. 2012;7(3):542–51.

    Article  PubMed  Google Scholar 

  32. Berry MF, et al. Impact of pulmonary function measurements on long-term survival after lobectomy for stage I non-small cell lung cancer. Ann Thorac Surg. 2015;100(1):271–6.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Yang Y, et al. Development and validation of a prediction model using molecular marker for long-term survival in unresectable stage III non-small cell lung cancer treated with chemoradiotherapy. Thorac Cancer. 2022;13(3):296–307.

    Article  CAS  PubMed  Google Scholar 

  34. Lee HW, Lee C-H, Park YS. Location of stage I-III non-small cell lung cancer and survival rate: systematic review and meta-analysis. Thorac Cancer. 2018;9(12):1614–22.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Lee HW, et al. Poor prognosis of NSCLC located in lower lobe is partly mediated by lower frequency of EGFR mutations. Sci Rep. 2020;10(1):14933.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Clark GM. Prognostic factors versus predictive factors: examples from a clinical trial of erlotinib. Mol Oncol. 2008;1(4):406–12.

    Article  PubMed  Google Scholar 

  37. Defraene G, et al. Multifactorial risk factors for mortality after chemotherapy and radiotherapy for non-small cell lung cancer. Radiother Oncol. 2020;152:117–25.

    Article  CAS  PubMed  Google Scholar 

  38. Lewis TS, et al. Palliative lung radiotherapy: higher dose leads to improved survival? Clin Oncol. 2020;32(10):674–84.

    Article  CAS  Google Scholar 

  39. Dekker A, et al. Rapid learning in practice: a lung cancer survival decision support system in routine patient care data. Radiother Oncol. 2014;113(1):47–53.

    Article  PubMed  PubMed Central  Google Scholar 

  40. Guo M, et al. Prognostic value of delta inflammatory biomarker-based nomograms in patients with inoperable locally advanced NSCLC. Int Immunopharmacol. 2019;72:395–401.

    Article  CAS  PubMed  Google Scholar 

  41. He R, Zuo S. A robust 8-gene prognostic signature for early-stage non-small cell lung cancer. Front Oncol. 2019.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Atkins KM, et al. Cardiac radiation dose, cardiac disease, and mortality in patients with lung cancer. J Am Coll Cardiol. 2019;73:2976–87.

    Article  PubMed  Google Scholar 

  43. Thor M, et al. Modeling the impact of cardiopulmonary irradiation on overall survival in NRG Oncology Trial RTOG 0617. Clin Can Res. 2020;26:4643–50.

    Article  CAS  Google Scholar 

  44. Cowley LE, et al. Methodological standards for the development and evaluation of clinical prediction rules: a review of the literature. Diagn Progn Res. 2019;3:16.

    Article  PubMed  PubMed Central  Google Scholar 

  45. Field M, et al. Implementation of the Australian Computer-Assisted Theragnostics (AusCAT) network for radiation oncology data extraction, reporting and distributed learning. J Med Imaging Radiat Oncol. 2021;65(5):627–36.

    Article  PubMed  Google Scholar 

Download references


The authors thank Dr Jonathan Sykes for the acquisition of tumour volume data at Blacktown Cancer and Haematology Centre, as well as Prof Andrew Miller for assisting data collection at Illawarra Cancer Care Centre. The authors additionally acknowledge Dr Farhannah Aly for her support in reviewing the manuscript.


No funding was involved in this study.

Author information

Authors and Affiliations



NSL collected data from the SWSLHD study site, and was a major contributor in analysing data and writing the manuscript. JS and MF contributed to data analysis and writing the manuscript. CF, SV, SG and EH were involved in data collection from the BICC study site. SKV supervised the project and was involved in data interpretation and writing the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Shalini Kavita Vinod.

Ethics declarations

Ethics approval and consent to participate

Ethics approval for this study was obtained from the SWSLHD Human Research Ethics Committee (2019/ETH11689). Because of the retrospective nature of this study, individual consent for study participation was not deemed necessary.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lee, N.SY., Shafiq, J., Field, M. et al. Predicting 2-year survival in stage I-III non-small cell lung cancer: the development and validation of a scoring system from an Australian cohort. Radiat Oncol 17, 74 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: