Skip to main content

Online prediction for respiratory movement compensation: a patient-specific gating control for MRI-guided radiotherapy

Abstract

Background

This study aims to validate the effectiveness of linear regression for motion prediction of internal organs or tumors on 2D cine-MR and to present an online gating signal prediction scheme that can improve the accuracy of MR-guided radiotherapy for liver and lung cancer.

Materials and methods

We collected 2D cine-MR sequences of 21 liver cancer patients and 10 lung cancer patients to develop a binary gating signal prediction algorithm that forecasts the crossing-time of tumor motion traces relative to the target threshold. Both 0.4 s and 0.6 s prediction windows were tested using three linear predictors and three recurrent neural networks (RNNs), given the system delay of 0.5 s. Furthermore, an adaptive linear regression model was evaluated using only the first 30 s as the burn-in period, during which the model parameters were adapted during the online prediction process. The accuracy of the predicted traces was measured using amplitude metrics (MAE, RMSE, and R2), and in addition, we proposed three temporal metrics, namely crossing error, gating error, and gating accuracy, which are more relevant to the nature of the gating signals.

Results

In both 0.6 s and 0.4 s prediction cases, linear regression outperformed other methods, demonstrating significantly smaller amplitude errors compared to the RNNs (P < 0.05). The proposed algorithm with adaptive linear regression had the best performance with an average gating accuracy of 98.3% and 98.0%, a gating error of 44 ms and 45 ms, for liver cancer and lung cancer patients, respectively.

Conclusion

A functional online gating control scheme was developed with an adaptive linear regression that is both more cost-efficient and accurate than sophisticated RNN based methods in all studied metrics.

Introduction

Gating technique involves synchronizing the delivery of radiation beams with specific phases of the patient's respiratory cycle to ensure precise targeting of the tumor while minimizing the dose to healthy tissues. The gating scheme relies on monitoring the patient's respiratory or motion signals, often through external markers or imaging modalities. These gating signals ensure that radiation is delivered only when the patient's motion is within the predetermined range, known as the “gating window” which is specific to each patient. Therefore, the accuracy of the gating signals plays a key role in the efficacy and reduction of side effects of the gating scheme to compensate for respiratory motions during radiotherapy [1, 2]. The recent advent of hybrid on-line magnetic resonance imaging (MRI)-guidance systems, represented by the Elekta Unity MR-Linac (Elekta AB, Stockholm, Sweden), provides higher accuracy in producing amplitude-based gating signals, as they directly visualization of tumors and internal structures without the need for implanted fiducials and/or external respiratory signals for target localization [3, 4].

Nevertheless, achieving real-time acquisition of 3D target volumes remains a challenge due to the trade-offs between spatial and temporal resolution. Previous studies reported that interleaved orthogonal slices (sagittal and coronal) can be obtained at adequate frequencies (4–8 Hz) for monitoring respiratory motion, enabling the reconstruction of the tumor's 3D position over time [5]. Amplitude-based gating performed superior to phase-based gating, as irregularities in breathing depth are better resolved when analyzing amplitudes instead of phases [6]. In MR-guided radiotherapy, the gating scheme for respiratory synchronization irradiation utilizes amplitude-based gating signals.

In order to compensate for the system delay of gating in Unity, it is vital to forecast the location of tumors and critical internal structures in real-time [7]. Klüter [8] reported latencies ranging from 300 to 436 ms for the Co60 version of the MRIdian using gating while Uijtewaal et al. [9] reported a latency of 330 ms for MLC-tracking on the Unity. Lamb et al. [10] however reported the overall system latency of gating was indicated to be within 0.5 s, including the image acquisition time, the contour-based target motion tracking and the beam on/off triggering. Thus, we fixed the latency of 0.5 s in our study to take into account the end-to-end system delay in the particular case of MRI guided gating application though it may not necessarily reflect the state-of-the-art for MRI-guided precision radiotherapy in the ideal scenario.

There are a number of alternative methods for predicting the trajectory of respiratory movements. Linear filters and their generalization have been largely used [11,12,13]. Probabilistic frameworks, including Bayesian inference [14], Kalman filters [15] and particle filters have also been widely proposed in several studies. More innovative and complex techniques such as support vector regression [11, 16], neural networks [12, 17, 18] and recurrent neural networks (RNN) [19, 20] have been explored for respiratory motion prediction and have demonstrated their effectiveness. In a recent review focusing on the primary prediction filters mentioned above, linear approaches were found to be sufficiently effective in prediction compared to more complex methods when using respiratory signals collected during Cyber-knife treatment [21]. However, it should be noted that directly translating prediction methods developed for respiratory signals guidance to MRI-guided treatments can be challenging due to the differences in imaging modalities (i.e., interleaved orthogonal slices instead of stereo X-ray images) and longer acquisition periods (e.g., 150–380 ms instead of 30 ms for X-ray fluoroscopy [22]), as well as overall system latencies (up to 500 ms [10]). Hence, it is crucial to validate the suitability of these prediction filters for internal organ or tumor position obtained from 2D cine-MR data, extending beyond just abdominal or thoracic amplitudes. Taking into account beam gating, an amplitude-based gating method was simulated in this study, where the beam is activated when the tumor is detected within “gating window”. Specifically, a binary [0,1] gating signal is generated to control the beam on/off by comparing the predicted tumor motion trajectory with the specified “gating window”.

In the current study, we focused on the comparison of linear models with three state-of-the-art deep RNN models, coupled with temporal metrics for the accuracy measure of the generated binary gating signal. Figure 1 shows the flow chart of the optimal gating signal prediction. Furthermore, we evaluated an adaptive regression model, in which the training on each patient’s online data lasted only for 30 s for the burn-in period and model parameters were updated during the online prediction process.

Fig. 1
figure 1

Flow chart of gating signals prediction for radiotherapy

Materials and methods

Subjects

The study enlisted 21 patients with liver cancer and 10 with lung cancer, all receiving Elekta Unity radiation therapy from October 2020 to May 2023. Among these patients, 22 males and 9 females, averaging 62.1 years in age (ranging from 45 to 83), participated. 2D cine-MR images were acquired at the beginning of each radiotherapy session, with an acquisition duration ranging from 313 to 483 s.

Data acquisition

Continuous MR images with 5 Hz imaging frequency were acquired by the Unity comprising an Achieva 1.5T MR scanner and a 7 MV flattening-filter-free linear accelerator. To strike a balance between acquisition time, signal-to-noise ratio, and resolution, we opted for an acquisition frequency of 5 Hz, which is considered to be reasonable [23]. The spatial resolution of 2D cine images is 3 × 3 mm2. The MRI-Linac allows real-time acquisition of three orthogonal planes (coronal, sagittal, and transverse), with motion in the left–right (LR) and superior-inferior (SI) directions measured from coronal slices, and motion in the anterior–posterior (AP) direction derived from sagittal slices.

Unlike the MRI sequences for lung cancer patients, the movement of the liver organ's is considered instead of that of the tumor since liver is not a hightly compressible organ while the tumor borders are extremely difficult to identify in practice. The gross tumor volumes (GTVs) of lung patients (Fig. 2a) and liver organ of liver patients (Fig. 2c) were outlined by an experienced radiologist, and then their trajectories (Fig. 2b, d) in three directions of the tumor/liver boundary were extracted from the 2D cine-MR. In amplitude-based gated radiotherapy, we combine the motion from these three different directions into a Euclidean distance called “3D total motion”. This approach computes SI, AP, and LR displacements as the length of a three-dimensional vector, ensuring a holistic consideration of motion. And it allows specific gating thresholds to be set based on the amplitude of 3D total tumor motion for each patient, helping to simplify treatment planning and reduce motion-related complexity. Table 1 shows the mean amplitude of motion and the min–max range (in millimeters), along with the mean respiratory cycle and the min–max range (in seconds) for liver organ and lung tumors.

Fig. 2
figure 2

a Labeled lung tumor in coronal and sagittal slices. b Tumor centroid trajectories in three directions and 3D total motion. c Labeled liver organ in coronal and sagittal slices. d Liver centroid trajectories in three directions and 3D total motion

Table 1 Movement characteristics of liver organs and lung tumors

Data interception

Trajectory of each patient was divided into training set (70%) and testing set (30%), and 20% of training set was a validation set to cross validate and monitor the performance of training and the optimization of the hyper-parameters for Ridge regression, L2–L1 regression and RNN models. The inputs and outputs of the predictor were segmented with a sliding window consisting of one pair of input and output data, which were denoted as \({x}_{i}\) and \({y}_{i}\). The successive input \({x}_{i+1}\) was generated by moving the sliding window by 1 sample forward, and the sliding window was moved forward until the last available observation in the training part was hit. The length of each \({x}_{i}\) represents the number of samples used to make prediction, we fixed \(m=15\) (about 3 s) in this study. The length of \({y}_{i}\) is determined by the prediction window. Since the sampling period is 0.2 s, \({y}_{(i,1)}\), \({y}_{(i,2)}\), \({y}_{(i,3)}\) represent the prediction windows of 0.2, 0.4 and 0.6 s, respectively. In this study, since the system delay is 0.5 s while the MRI sampling period is 0.2 s, we are confronted with the choice of either 0.4 s (under prediction) or 0.6 s (over prediction) for the prediction window.

Gating signals prediction algorithm

Linear predictors

The linear regression models assume the linear relationship between the future data (\({\widehat{y}}_{(i,j-1)}\) and \({\widehat{y}}_{(i,j)}\)) and the past available data \({x}_{i}\):

$$\left\{\begin{array}{c}{\widehat{y}}_{\left(i,j-1\right)}={x}_{i}^{T}{\beta }_{j-1}\\ {\widehat{y}}_{\left(i,j\right)}={x}_{i}^{T}{\beta }_{j}\end{array}\right.$$
(1)

where \(T\) denotes the transpose, the coefficient vector defined as \({\beta }_{j}=({\beta }_{j1}, \dots , {\beta }_{jm})\) for \(m=15\) and \(j=2\) for prediction window of 0.4 s or \(j=3\) for prediction window of 0.6 s, note that this rule applies to all \(j\) in the following.

The loss function of linear regression is to minimize the sum of the squares of the residuals, given by

$$\left\{\begin{array}{c}L({\beta }_{j-1})=\sum_{i=1}^{N}{({y}_{\left(i,j-1\right)}-{x}_{i}^{T}{\beta }_{j-1})}^{2}\\ L({\beta }_{j})=\sum_{i=1}^{N}{({y}_{\left(i,j\right)}-{x}_{i}^{T}{\beta }_{j})}^{2}\end{array}\right.$$
(2)

For ridge regression, the loss function to be minimized is the penalized residual sum of squares [24]:

$$\left\{\begin{array}{c}L({\beta }_{j-1})=\sum_{i=1}^{N}{({y}_{\left(i,j-1\right)}-{x}_{i}^{T}{\beta }_{j-1})}^{2}+{\lambda \Vert {\beta }_{j-1}\Vert }_{2}^{2}\\ L\left({\beta }_{j}\right)=\sum_{i=1}^{N}{\left({y}_{\left(i,j\right)}-{x}_{i}^{T}{\beta }_{j}\right)}^{2}+{\lambda \Vert {\beta }_{j}\Vert }_{2}^{2}\end{array}\right.$$
(3)

where the optimal parameter \(\lambda\) is implemented by Leave-One-Out cross-validation (LOOCV). The regularization term shrinks the magnitude of the coefficient vector \(\beta\), which leads to a reduction of the noise level of the prediction signals [25].

For the L2–L1 regression, the l1-norm of the parameters is used as the penalty term in the loss function [26]:

$$\left\{\begin{array}{c}L({\beta }_{j-1})=\sum_{i=1}^{N}{({y}_{\left(i,j-1\right)}-{x}_{i}^{T}{\beta }_{j-1})}^{2}+\lambda {\Vert {\beta }_{j-1}\Vert }_{1}\\ L\left({\beta }_{j}\right)=\sum_{i=1}^{N}{\left({y}_{\left(i,j\right)}-{x}_{i}^{T}{\beta }_{j}\right)}^{2}+\lambda {\Vert {\beta }_{j}\Vert }_{1}\end{array}\right.$$
(4)

where the optimal \(\lambda\) is chosen by the generalized cross-validation (GCV) [27], and the optimal solution \(\beta\) is obtained by the alternating direction method of multipliers (ADMM) [28].

Due to the lower complexity of linear regression methods, we also implemented and evaluated the adaptive linear regression with a burn-in period of the first 30 s (150 samples) for each patient. The minimization of \(\mathrm{L}(\upbeta )\) is performed continuously to update the model parameters (\({\beta }_{j-1}\) and \({\beta }_{j}\)) with the incoming data of the same patient.

RNN models

RNNs capture temporal dependencies in data by processing sequences, making them useful for predicting respiratory motion. Gradient vanishing and explosion caused RNNs to lose their grasp on nonlinear relationships. Solutions include long short-term memory (LSTM), bidirectional LSTM (Bi-LSTM), and gated recurrent unit networks (GRUs). Detailed description of the LSTM, Bi-LSTM and GRU and their performance for predicting respiratory motion can be found elsewhere [19, 20, 29].

A validation set was used to cross validation to monitor the performance of training and search the optimal hyper-parameters. During the training phase, the Adaptive Moment Estimation (Adam) optimizer was used to optimize the loss function due to its adaptability to varying learning rates, effective handling of sparse gradients common, robustness to noisy data, and efficient parameter updates [19]. The following hyperparameters were chosen based on a search conducted by Lombardo et al. [30]: the number of layers from {1, 3, 5, 10}, dropout rates from {0, 0.1, 0.2}, learning rates from {0.0001, 0.0005, 0.001, 0.005, 0.01}, and batch sizes from {16, 32, 64, 128}.

Crossing time

“Crossing time” refers to the moment when the tumor's position crosses a predefined threshold. When the tumor crosses below the predefined threshold, referred to as ‘crossing-on’, the radiation beam is activated. Conversely, when the tumor crosses above the thresh-old, termed as ‘crossing-off’, the radiation beam is deactivated. Since the sampling frequency of MR images is 5 Hz, it is reasonable to assume that the shape of the respiratory curve can be restored by connecting each frame of the continuous images. Therefore, we proposed a linear interpolation to predict the threshold crossing time:

$${\widehat{T}}_{cross}= \frac{Th-{\widehat{y}}_{(i,j)}}{{\widehat{y}}_{(i,j)}-{\widehat{y}}_{(i,j-1)}}\times\Delta T+{T}_{i}+j\times\Delta T$$
(5)

where \({T}_{i}\) is the current sample time, \({\widehat{y}}_{(i,j-1)}\) and \({\widehat{y}}_{(i,j)}\) the predictions by Eq. 1, \(\Delta T= 0.2\mathrm{ s}\) the MRI sampling period, and \(Th\) the threshold, set as the average of the respiratory trajectory in the SI direction during the burn-in period for simplicity. Note that a lower threshold means beaming during the more stagnant phase of exhalation and thus ensures better treatment margins while increasing the overall treatment time. The optimal trade-off between radiation precision and efficiency is beyond the scope of the present study. The reference crossing time (gold standard) could be calculated in a similar manner with Eq. 5, using \({y}_{(i,j-1)}\) and \({y}_{(i,j)}\) instead of their predictions.

Gating signals generation

Figure 3 shows the gating control scheme by thresholding the predicted tumor trajectory for prediction window of 0.6 s. The system latency was 0.5 s in this study, thus ideal gating signals should be 0.5 s earlier than ideal crossing time. In the experiment, taking the 0.6 s prediction window \((j = 3)\) as an example, there are two conditions for triggering gating: (1) when the motion curve crosses the threshold between \({y}_{(i,2)}\) and \({y}_{(i,3)}\) (point A in Fig. 3), or (2) when the time difference (\(dt\)) between \({\widehat{y}}_{(i,3)}\) and cross time is less than 0.1 s (point B in Fig. 3). For the prediction window of 0.4 s (\(j=2\)), the decision condition is changed from \({y}_{(i,2)}\) and \({y}_{(i,3)}\) to \({y}_{(i,1)}\) and \({y}_{(i,2)}\). The black and red binary control signals were generated by the reference and predicted crossing time, respectively. Additional file 1: Table S1 describes in detail the process of adaptive gating signals generation with an ever-increasing training data set.

Fig. 3
figure 3

Gating control scheme by thresholding the predicted tumor trajectory for prediction window of 0.6 s

Performance evaluation

Amplitude metrics

The mean absolute error (MAE) is the mean absolute difference between the predicted and observed values defined by:

$$MAE=\frac{1}{N}\sum_{i=1}^{N}\left|{y}_{i}-{\widehat{y}}_{i}\right|$$
(6)

where \({y}_{i}\) and \({\widehat{y}}_{i}\) are the actual and predicted respiration data respectively, and \(N\) the number of total points.

The root mean square error (RMSE) is a measure of accuracy, to compare forecasting errors of different models for a particular dataset, and is defined as:

$$RMSE= \sqrt{\frac{1}{N}\sum_{i=1}^{N}{\left({y}_{i}-{\widehat{y}}_{i}\right)}^{2}}$$
(7)

The coefficient of determination \({R}^{2}\), is the proportion of the variation in the dependent variable that is predictable from the independent variable(s):

$${R}^{2}=1-\frac{\sum_{i=1}^{N}{\left({y}_{i}-{\widehat{y}}_{i}\right)}^{2}}{\sum_{i=1}^{N}{\left({y}_{i}-{\overline{y} }_{i}\right)}^{2}}$$
(8)

Temporal metrics

For gated radiotherapy, it is essential in predicting accurately the beam on/off control when the target crosses threshold position rather than predicting the respiratory curve itself. We thus proposed the gating on/off error and gating accuracy to compare with the ideal gating signals.

We first define the crossing on/off error as the temporal difference of threshold-crossing between the true and predicted curves in both on/off directions. The gating on/off error, on the other hand, denotes the temporal difference of beam-on/off control between the ideal (0.5 s in advance of the real threshold-crossing) and the predicted gating signals.

The gating accuracy is the ratio of overlapped duration (of the ideal and predicted gating signals) over the entire therapy:

$${\text{Gating}}\;{\text{accuracy}} = \left[ {1 - \frac{{\mathop \sum \nolimits_{{{\text{i}} = 1}}^{{\text{N}}} \left( {{\text{Tgaterr}}} \right)}}{{{\text{Total}}\;{\text{time}}\;{\text{of}}\;{\text{therapy}}}}} \right] \times 100\%$$
(9)

where \(N\) is the number of breathing cycles during therapy of each patient and \(\mathrm{Tgaterr}\) the non-overlapping period of the ideal and predicted gating signals.

Statistical tests

We performed a Wilcoxon signed-rank test to analyze whether there is a statistically significant difference between the metrics (R2 and Gating accuracy) obtained from the different models on the testing set. A p value of less than 0.05 was considered significant.

Results

Linear versus RNN regression

Linear regression with and without its regularizations (Ridge and L2-L1) variants are compared to three classical RNN models and Kalman filter [15] for prediction performance evaluations. Since RNN requires a large amount of training data, we first use 70% of the data (more than 4 min) in these six models for training and the rest for testing.

Table 2 shows the mean and standard deviation of amplitude errors (MAE, RMSE, and R2) for different methods with 0.4 s and 0.6 s prediction window for liver organs of 21 liver cancer patients and lung tumors of 10 lung cancer patients. No matter the choice of prediction length, linear regression without regularization performed the best, followed by the Ridge, and linear methods have significantly smaller amplitude errors than those of the RNNs (P < 0.05), not to mention the unrealistic 70–30% data partition adopted in favor of the RNNs. Meanwhile, the magnitude-based results demonstrated the excellent predictive power of the linear regression for motion prediction of both liver organ and lung tumors.

Table 2 Mean and std deviation of amplitude errors with 0.4 s and 0.6 s prediction window for 21 liver cancer patients and 10 lung cancer patients

Table 3 shows the mean and standard deviation of gating errors with prediction length of 0.4 s and 0.6 s for liver organs of 21 liver cancer patients and lung tumors of 10 lung cancer patients, as well as calculation time for each model. For the gating accuracy, choosing over-prediction (0.6 s) is significantly better than under-prediction (0.4 s), all factors being equal otherwise (P < 0.05). When the prediction window is 0.6 s, the linear regression achieved the best performance with an average gating accuracy of 98.3% and 98.0%, a gating error of 56 ms and 45 ms, for liver cancer and lung cancer patients, respectively. Table 4 shows the P values obtained from Wilcoxon signed-rank test pairwise model comparisons with prediction length of 0.6 s. Performances of the linear regression are significantly higher than those of RNNs in both amplitude metrics (R2) and temporal metrics (Gating accuracy) (P < 0.05). To address the potential for overfitting in linear regression and ridge regression, we performed cross-validation on each respiratory curve. Although the temporal correlations within the time series data could potentially affect the cross-validation results, it is noteworthy that linear model achieved an average gating accuracy of 98.2% and 97.7% for liver cancer and lung cancer patients, respectively.

Table 3 Mean and std deviation of gating errors with prediction length of 0.4 s and 0.6 s for 21 liver cancer patients and 10 lung cancer patients, as well as calculation time for each model
Table 4 P values obtained from Wilcoxon signed-rank test pairwise model comparisons with prediction length of 0.6 s

Adaptive regression

Due to the lower complexity of linear regression methods, we also implemented and validated the approach of training the model with the first 30 s (150 samples) for each patient as the burn-in period and keep adapting the model parameters with the incoming data during the prediction and gating signals generation process. As shown in Table 3, the proposed framework with adaptive linear regression achieved convincing performances in real clinical scenarios with an average gating accuracy of 98.3% and 98.0%, a gating error of 44 ms and 45 ms, for liver cancer and lung cancer patients, respectively. The predictive performance of adaptive linear regression is not significantly different from that of non-adaptive linear regression using more than 4 min of training data (P > 0.05, see in Table 4).

Calculation time

All experiments were conducted using the Keras API with the TensorFlow backend and were executed on an Intel 4-core 2.4-GHz CPU, a NVIDIA GeForce GTX1660 Ti GPU, 512 GB SSD and 20 GB RAM machine. The time required to predict the gating signals was calculated for all methods mentioned above (see Table 3). For the non-adaptive mode, the calculation time of linear models are less than 0.1 ms and the calculation time of RNN models are between 3 and 8 ms. For the adaptive mode, the most time-consuming part was the update of the linear regression predictor, requiring an average of 1 ms, the rest being identical to the non-adaptive mode, i.e. less than 0.1 ms.

Discussion

The study introduced a linear interpolation for the prediction of the threshold-crossing time and achieved better temporal accuracy in the subsequent gating signals. A fixed online linear model was first used to predict tumor locations for 0.4 s and 0.6 s ahead and to update the predicted threshold-crossing time, while the gating signal was triggered 0.5 s (system delay) ahead of the predicted crossing time. Moreover, this study gave the certain criteria in temporal metrics. For the best performance, the error is 0, it is proved that the predicted gating signal is equal to the ideal gating signal. For dummy performance, the maximum error was latency time (0.5 s) even without predicting, and here gating time is equal to crossing time. Indeed, we observe identical crossing and gating errors in the case of over-prediction whereas in the case of under-prediction crossing errors seem to be reduced thanks to its higher precision, gating errors are getting worse since the gating decisions are made typically between 0.2 s and 0.4 s in advance with a system delay of 0.5 s. We thus conclude that over-prediction is a preferable choice compared to under-prediction when system delay is not a multiple of the MRI sampling period.

To verify the minimum number of training data required to train a stable linear regression model, we tested its adaptive and non-adaptive versions with different lengths of training data. Figure 4 shows the gating accuracy using adaptive and non-adaptive linear regression with different sizes of training data. At the beginning, the gating accuracy increased with the increase of training data, the average gating accuracy of the adaptive version reached 98% at 14 s and tended to increase slowly, while the non-adaptive version reached the stable prediction ability at 32 s. When the training data reached 80 s, the prediction ability of the adaptive version and the non-adaptive version are the same. According to the box plot, when the training data is less than 40 s, the gating accuracy of the adaptive version is obviously better than that of the non-adaptive version. For the patient-specific prediction model, the adaptive version can significantly reduce the burn-in time.

Fig. 4
figure 4

Gating accuracy using adaptive/non-adaptive linear regression with different burn-in period

As observed in several studies [5, 9, 15, 20], the predictive performance decreased with increasing forecasted time span. However, for the 0.4 s and 0.6 s prediction windows, linear regression still achieved sub-resolution accuracy (RMSE < 2.5 mm). The good performance of the linear regression for the 0.6 s prediction window shows that it can successfully account for the system latency found by Glitzner et al. [4] when performing MLC tracking on the Elekta Unity MR-linac. As shown in Table 1, the range of respiratory cycles for the 10 lung cancer patients was 2.4–5.4 s and the range of 3D movement amplitudes was 5.1–25.5 mm; the range of respiratory cycles for the 21 liver cancer patients was 2.9–7.4 s and the range of 3D movement amplitudes was 14.5–43 mm. These large ranges show that there are large differences in respiratory movements between each patient and that respiratory movements are patient-specific. Therefore, we proposed to use the first 30 s (burn-in period) of treatment for each patient to train a patient-specific adaptive linear model, which is used for real-time prediction of tumor location during subsequent radiotherapy for that patient and for gating signal generation. Based on the excellent experimental results, the patient-specific online gating signal prediction scheme based on the linear regression model proposed in this study can be widely applied in MRI-guided radiotherapy for lung and liver cancer.

A recent review study by Jöhl et al. [21] found that a continuously re-optimized (i.e., online) linear regression model performed best on average compared to other motion predictors such as artificial neural networks or Kalman filters. Sharp et al. [15] note the relatively worse performance of the Kalman filter predictor when compared with linear and ANN predictors. In this study, we also compared the traditional methods represented by Kalman filtering with linear regression. Interestingly, our findings consistently indicate that linear regression outperforms Kalman filtering. This may reflect the difficulty in estimating the state transition matrix from such a small amount of data.

Our experimental results on small sample sets verified the excellent performance of linear regression, considerably superior to that of the RNN. The major drawback of classic RNNs is that they are notoriously difficult to train. One important consideration that we addressed within this study is the effect of the non-stationary nature of breathing. Anetai et al. [31] clearly illustrated that the movement and pattern of breathing can easily change individually various. They developed and verified novel respiratory criteria for selecting optimal breathing for gating radiation treatment and defining numerical targets for respiratory gating. For the adaptive linear regression in the study, the predictive parameters were adjusted in real time based on the most recent tumor motion, which may address the issue of respiratory pattern variability. In the current experiment, despite the adaptive regression model continuously increasing its training samples, the average calculation speed is still less than 1 ms, as patients’ radiotherapy time on Unity is between 15–20 min.

Quantifying liver tumor motion is difficult because of the difficulty of imaging free-breathing tumors, the difficulty of delineating tumors, and the complexity of tumor motion: rigid (translation and rotation), non-rigid, and the combination of both types of motion [32]. The liver's most significant movement typically occurs along the SI direction, influencing motion in the AP and LR directions. While liver tissue isn't highly compressible, some degree of deformation is expected [33]. As liver tumors often reside within a larger liver volume, the extent of tumor displacement during the respiratory cycle depends on its specific location within the liver [34]. Prior research has indicated a strong correlation between tumor motion within the liver and the concurrent movements of the liver organ, its vasculature, and the diaphragm [35,36,37,38]. Yang et al. [36] highlighted that the consistency of tumor and diaphragm motion varies based on the distance separating them. Zhao et al. [34] employed a five-lobe classification scheme to subdivide the liver, noting that segments within distinct liver lobes display differing levels of motion. Consequently, when pinpointing the region of interest (ROI) for a tumor, it's important to not only encompass the tumor but also consider adjacent boundaries, neighboring organs, or any area anticipated to move in synchrony with the target [23]. The precise selection of the ROI representing tumor motion is of paramount importance and should be determined by experienced professionals based on the specific clinical context. In this study, to overcome the difficulties in liver tumor contouring on 2D cine-MR without the use of contrast agents and based on the high correlation properties of the two motions, we admittedly simplified the tracking problem to deal with the organ centroids in order to validate the proposed prediction system. It is beyond the scope of this study as to how to accurately select the ROI representing the tumor motion and it's tracking methods.

What’s more, the manual labelling of liver organs and tumors from 2D cine-MR is time-consuming and only valid for the current proof-of-concept study while not compatible with a real-time clinical application scenario. For the future work, we aim to combine automatic tumor localization and prediction to evaluate the potential dosimetric improvements. MRiPT (MR-guided proton therapy) has progressed significantly with the development of clinical prototypes expected in 5–10 years. It will need a dedicated workflow similarly to Unity, and gated treatments could be expected as the default treatment style [39,40,41]. Therefore, our proposed gating signals prediction algorithm is expected to be used for proton therapy in the future and will have better clinical application value.

In conclusion, the proposed algorithm could offset the system delays in beaming on/off switching and thus deliver the dose with better temporal accuracy. The Unity is potentially capable of performing more accurate radiotherapy procedures when coupled with the proposed gating signals prediction algorithm.

Availability of data and materials

The Unity database generated and analyzed during the current study are available from the corresponding author B. Li (Email: bsli@sdfmu.edu.cn). The data are not publicly available for patient privacy purposes.

References

  1. Gabryś D, Kulik R, Trela K, et al. Dosimetric comparison of liver tumour radiotherapy in all respiratory phases and in one phase using 4DCT. Radiother Oncol. 2011;100(3):360–4.

    Article  PubMed  Google Scholar 

  2. Paganelli C, Seregni M, Fattori G, et al. Magnetic resonance imaging–guided versus surrogate-based motion tracking in liver radiation therapy: a prospective comparative study. Int J Radiat Oncol Biol Phys. 2015;91(4):840–8.

    Article  PubMed  Google Scholar 

  3. Pappas E, Kalaitzakis G, Boursianis T, et al. Dosimetric performance of the Elekta Unity MR-linac system: 2D and 3D dosimetry in anthropomorphic inhomogeneous geometry. Phys Med Biol. 2019;64(22):225009.

    Article  CAS  PubMed  Google Scholar 

  4. Glitzner M, Woodhead PL, Borman PTS, et al. MLC-tracking performance on the Elekta unity MRI-linac. Phys Med Biol. 2019;64(15):15NT02.

    Article  CAS  PubMed  Google Scholar 

  5. Seregni M, Paganelli C, Lee D, et al. Motion prediction in MRI-guided radiotherapy based on interleaved orthogonal cine-MRI. Phys Med Biol. 2016;61(2):872.

    Article  CAS  PubMed  Google Scholar 

  6. Büther F, Heß M, Schäfers KP. Investigating the influence of baseline drifts of respiratory signals in amplitude-based gating for positron emission tomography. In: 2014 IEEE nuclear science symposium and medical imaging conference (NSS/MIC). IEEE; 2014. pp. 1–4.

  7. Li R, Mok E, Chang DT, et al. Intrafraction verification of gated RapidArc by using beam-level kilovoltage X-ray images. Int J Radiat Oncol Biol Phys. 2012;83(5):e709–15.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Klüter S. Technical design and concept of a 0.35 T MR-Linac. Clin Trans Radiat Oncol. 2019;18:98–101.

    Google Scholar 

  9. Uijtewaal P, Borman PTS, Woodhead PL, et al. First experimental demonstration of VMAT combined with MLC tracking for single and multi-fraction lung SBRT on an MR-linac. Radiother Oncol. 2022;174:149–57.

    Article  CAS  PubMed  Google Scholar 

  10. Lamb JM, Ginn JS, O’Connell DP, et al. Dosimetric validation of a magnetic resonance image gated radiotherapy system using a motion phantom and radiochromic film. J Appl Clin Med Phys. 2017;18(3):163–9.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Krauss A, Nill S, Oelfke U. The comparative performance of four respiratory motion predictors for real-time tumour tracking. Phys Med Biol. 2011;56(16):5303.

    Article  CAS  PubMed  Google Scholar 

  12. Murphy MJ, Dieterich S. Comparative performance of linear and nonlinear neural networks to predict irregular breathing. Phys Med Biol. 2006;51(22):5903.

    Article  PubMed  Google Scholar 

  13. Riaz N, Shanker P, Wiersma R, et al. Predicting respiratory tumor motion with multi-dimensional adaptive filters and support vector regression. Phys Med Biol. 2009;54(19):5735.

    Article  PubMed  Google Scholar 

  14. Remy C, Ahumada D, Labine A, et al. Potential of a probabilistic framework for target prediction from surrogate respiratory motion during lung radiotherapy. Phys Med Biol. 2021;66(10):105002.

    Article  Google Scholar 

  15. Sharp GC, Jiang SB, Shimizu S, et al. Prediction of respiratory tumour motion for real-time image-guided radiotherapy. Phys Med Biol. 2004;49(3):425.

    Article  PubMed  Google Scholar 

  16. Ernst F, Schweikard A. Forecasting respiratory motion with accurate online support vector regression (SVRpred). Int J Comput Ass Rad. 2009;4:439–47.

    Google Scholar 

  17. Murphy MJ, Pokhrel D. Optimization of an adaptive neural network to predict breathing. Med Phys. 2009;36(1):40–7.

    Article  PubMed  Google Scholar 

  18. Mafi M, Moghadam SM. Real-time prediction of tumor motion using a dynamic neural network. Med Biol Eng Comput. 2020;58:529–39.

    Article  PubMed  Google Scholar 

  19. Lin H, Shi C, Wang B, et al. Towards real-time respiratory motion prediction based on long short-term memory neural networks. Phys Med Biol. 2019;64(8):085010.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Wang R, Liang X, Zhu X, et al. A feasibility of respiration prediction based on deep Bi-LSTM for real-time tumor tracking. IEEE Access. 2018;6:51262–8.

    Article  Google Scholar 

  21. Jöhl A, Ehrbar S, Guckenberger M, et al. Performance comparison of prediction filters for respiratory motion tracking in radiotherapy. Med Phys. 2020;47(2):643–50.

    Article  PubMed  Google Scholar 

  22. Shirato H, Shimizu S, Kunieda T, et al. Physical aspects of a real-time tumor-tracking system for gated radiotherapy. Int J Radiat Oncol Biol Phys. 2000;48(4):1187–95.

    Article  CAS  PubMed  Google Scholar 

  23. Keiper TD, Tai A, Chen X, et al. Feasibility of real-time motion tracking using cine MRI during MR-guided radiation therapy for abdominal targets. Med Phys. 2020;47(8):3554–66.

    Article  PubMed  Google Scholar 

  24. Hoerl AE, Kennard RW. Ridge regression: biased estimation for nonorthogonal problems. Technometrics. 1970;12(1):55–67.

    Article  Google Scholar 

  25. Hastie T, Tibshirani R, Friedman JH, Friedman JH. The elements of statistical learning: data mining, inference, and prediction, vol. 2. New York: Springer; 2009. p. 1–758.

    Google Scholar 

  26. Tibshirani R. Regression shrinkage and selection via the lasso. R Stat Soc Series B Stat Methodol. 1996;58(1):267–88.

    Google Scholar 

  27. Golub GH, Heath M, Wahba G. Generalized cross-validation as a method for choosing a good ridge parameter. Technometrics. 1979;21(2):215–23.

    Article  Google Scholar 

  28. Boyd S, Parikh N, Chu E, et al. Distributed optimization and statistical learning via the alternating direction method of multipliers. Found Trends Mach Learn. 2011;3(1):1–122.

    Article  Google Scholar 

  29. Yoganathan SA, Paloor S, Torfeh T, et al. Predicting respiratory motion using a novel patient specific dual deep recurrent neural networks. Biomed Phys Eng Express. 2022;8(6):065013.

    Article  Google Scholar 

  30. Lombardo E, Rabe M, Xiong Y, et al. Offline and online LSTM networks for respiratory motion prediction in MR-guided radiotherapy. Phys Med Biol. 2022;67(9):095006.

    Article  Google Scholar 

  31. Anetai Y, Sumida I, Takahashi Y, et al. A concept for classification of optimal breathing pattern for use in radiotherapy tracking, based on respiratory tumor kinematics and minimum jerk analysis. Med Phys. 2016;43(6Part1):3168–77.

    Article  PubMed  Google Scholar 

  32. Xu Q, Hanna G, Grimm J, et al. Quantifying rigid and nonrigid motion of liver tumors during stereotactic body radiation therapy. Int J Radiat Oncol Biol Phys. 2014;90(1):94–101.

    Article  PubMed  Google Scholar 

  33. Tascón-Vidarte JD, Stick LB, Josipovic M, et al. Accuracy and consistency of intensity-based deformable image registration in 4DCT for tumor motion estimation in liver radiotherapy planning. PLoS ONE. 2022;17(7):e0271064.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Zhao YT, Liu ZK, Wu QW, et al. Observation of different tumor motion magnitude within liver and estimate of internal motion margins in postoperative patients with hepatocellular carcinoma. Cancer Manag Res. 2017;9:839–48.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Dick D, Wu X, Hatoum GF, et al. A fiducial-less tracking method for radiation therapy of liver tumors by diaphragm disparity analysis part 2: validation study by using clinical data. J Radiat Oncol. 2018;7:345–56.

    Article  CAS  Google Scholar 

  36. Yang J, Cai J, Wang H, et al. Is diaphragm motion a good surrogate for liver tumor motion? Int J Radiat Oncol Biol Phys. 2014;90(4):952–8.

    Article  PubMed  PubMed Central  Google Scholar 

  37. Shao HC, Huang X, Folkert MR, et al. Automatic liver tumor localization using deep learning-based liver boundary motion estimation and biomechanical modeling (DL-Bio). Med Phys. 2021;48(12):7790–805.

    Article  PubMed  Google Scholar 

  38. Brix L, Ringgaard S, Sørensen TS, et al. Three-dimensional liver motion tracking using real-time two-dimensional MRI. Med Phys. 2014;41(4):042302.

    Article  PubMed  Google Scholar 

  39. Hoffmann A, Oborn B, Moteabbed M, et al. MR-guided proton therapy: a review and a preview. Radiat Oncol. 2020;15(1):1–13.

    Article  Google Scholar 

  40. Liu HH, Balter P, Tutt T, et al. Assessing respiration-induced tumor motion and internal target volume using four-dimensional computed tomography for radiotherapy of lung cancer. Int J Radiat Oncol Biol Phys. 2007;68(2):531–40.

    Article  PubMed  Google Scholar 

  41. Li Y, Kardar L, Li X, et al. On the interplay effects with proton scanning beams in stage III lung cancer. Med Phys. 2014;41(2):021721.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

Not applicable.

Funding

This work is supported by the Key Research and Development Program of Shandong Province (2021LCZX04), Academic promotion program of Shandong First Medical University (2019LJ004), Major Basic Research Project of Shandong Provincial Natural Science Foundation (ZR2022ZD31), Special Fund for Guiding Local Science and Technology Development in Shandong Province (YDZX2022010), PHC XU GUANGQI, PROJET N°45637SM.

Author information

Authors and Affiliations

Authors

Contributions

Study design: BL, YL, DG; Patient recruitment and data collection: ZL, JZ; Motion trajectories extraction: YL; Data analysis and prediction algorithms: YL, DG; Writing of manuscript: YL, DG. All authors reviewed the final manuscript for important intellectual content and agreed to submit.

Corresponding authors

Correspondence to Baosheng Li, Huazhong Shu or Di Ge.

Ethics declarations

Ethics approval and consent to participate

The study adheres to the principles outlined in the Declaration of Helsinki and was conducted with the approval of an ethics committee. After a detailed description of the protocol, subjects provided their informed consent.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1. 

Algorithmic steps for generating adaptive gating signals.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, Y., Li, Z., Zhu, J. et al. Online prediction for respiratory movement compensation: a patient-specific gating control for MRI-guided radiotherapy. Radiat Oncol 18, 149 (2023). https://doi.org/10.1186/s13014-023-02341-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13014-023-02341-1

Keywords