 Research
 Open Access
 Published:
DeepBeam: a machine learning framework for tuning the primary electron beam of the PRIMO Monte Carlo software
Radiation Oncology volume 16, Article number: 124 (2021)
Abstract
Background
Any Monte Carlo simulation of dose delivery using medical acceleratorgenerated megavolt photon beams begins by simulating electrons of the primary electron beam interacting with a target. Because the electron beam characteristics of any single accelerator are unique and generally unknown, an appropriate model of an electron beam must be assumed before MC simulations can be run. The purpose of the present study is to develop a flexible framework with suitable regression models for estimating parameters of the model of primary electron beam in simulators of medical linear accelerators using real reference dose profiles measured in a water phantom.
Methods
All simulations were run using PRIMO MC simulator. Two regression models for estimating the parameters of the simulated primary electron beam, both based on machine learning, were developed. The first model applies Principal Component Analysis to measured dose profiles in order to extract principal features of the shapes of the these profiles. The PCAobtained features are then used by Support Vector Regressors to estimate the parameters of the model of the electron beam. The second model, based on deep learning, consists of a set of encoders processing measured dose profiles, followed by a sequence of fully connected layers acting together, which solve the regression problem of estimating values of the electron beam parameters directly from the measured dose profiles. Results of the regression are then used to reconstruct the dose profiles based on the PCA model. Agreement between the measured and reconstructed profiles can be further improved by an optimization procedure resulting in the final estimates of the parameters of the model of the primary electron beam. These final estimates are then used to determine dose profiles in MC simulations.
Results
Analysed were a set of actually measured (real) dose profiles of 6 MV beams from a real Varian 2300 C/D accelerator, a set of simulated training profiles, and a separate set of simulated testing profiles, both generated for a range of parameters of the primary electron beam of the Varian 2300 C/D PRIMO simulator. Application of the twostage procedure based on regression followed by reconstructionbased minimization of the difference between measured (real) and reconstructed profiles resulted in achieving consistent estimates of electron beam parameters and in a very good agreement between the measured and simulated photon beam profiles.
Conclusions
The proposed framework is a readily applicable and customizable tool which may be applied in tuning virtual primary electron beams of Monte Carlo simulators of linear accelerators. The codes, training and test data, together with readout procedures, are freely available at the site: https://github.com/taborzbislaw/DeepBeam.
Introduction
External photon beam therapy (EBT) is nowadays the most common cancer radiotherapy modality. The key factors which determine the success of EBT are correct and accurate treatment therapy planning and quality assurance procedures prior to delivery of the therapeutic dose. Designing a therapy plan is a multidimensional optimization problem. The treatment planning system (TPS) designs therapy plan to match the therapy goals within specified clinical tolerances. Only after assuring that these goals are met, may the quality assurance procedures of dose delivery be implemented and the patient treated.
Currently, treatment planning systems calculate the dose distribution within an irradiated volume based on approximate algorithms like Pencil Beam Convolution (PBC) [1], Anisotropic Analytical Algorithm (AAA) [2] or ACUROSE XB (AXB) algorithm [3]. Monte Carlo (MC) modelling is an alternative approach to calculate dose distributions [4]. An advantage of MC over most analyticalbased algorithms is that the latter are adequate for dose calculations in homogeneous media but they could be rather crude approximations whenever inhomogeneities are present. In contrast, MC simulations, while also based on approximations (mostly in the transport physics and cross sections estimation), have been demonstrated to be superior over analytical algorithms when it comes to dose distributions calculations in heterogeneous volumes [5]. However, while being potentially very accurate and extremely valuable in gaining thorough understanding of all phenomena related to dose deposition in various media, it is also a very challenging task [4]. This is due not only to the high computational effort required by MC modelling, but also because of the tuning process which must be carefully implemented to match MCcalculated doses and doses measured under controlled conditions. This tuning process involves finding an appropriate model of a primary electron beam of a medical linear accelerator being simulated.
Any MC simulation of dose delivery by megavolt photon beams from a medical linear accelerator commences by simulating the primary beam of highenergy electrons which leave the acceleration tube of the linac with an energy of several MeV impinging a tungsten target to generate megavolt photons via bremsstrahlung. The spatial distribution of dose delivered by the thus generated photon beam to a water phantom or to the patient’s tumour volume, crucially depends on the characteristics of the primary electron beam. Yet, the primary electron beam characteristics of any individual accelerator are unique. Moreover, the characteristics of their primary electron beams may vary not only between linacs of the same type, but may also vary in time within the same accelerator, due to ageing effects [6,7,8]. Unfortunately, the geometry and spectra of the primary electron beam are neither known exactly nor easily measurable, except by quite specialized equipment, which is not readily available in a typical clinical radiotherapy environment [9, 10].
For this reason, to run MC simulations, an appropriate model of the electron beam must be designed, which includes a model of its electron energy spectrum and model of its spatial distribution. Typically, no less than four parameters are needed to characterise a MCsimulated electron beam of a medical linear accelerator—namely the mean energy of electrons in the beam, the full width at halfmaximum of the electron energy spectrum in the beam by assuming the energy distribution is a Gaussian function, the radial distribution of electrons in the beam, and the angular divergence of this beam. Clearly, in order to generate a clinically realistic MC simulation of dose delivery from any individual accelerator, the parameters of the model of the electron beam must be determined specifically for that accelerator [11].
For abovediscussed reasons, availability of a welldefined and realistically executable procedure of specifying the parameters of the model of a primary electron beam in an individual linear accelerator is of major importance in the subsequent application of MCbased modelling of clinical procedures using this particular accelerator. Notably, parameters of a MC model of the electron beam of a linac can only be determined indirectly by analysing a number of depth and lateral doseprofiles measured in specified conditions, best in a standard water phantom.
Due to the fundamental importance of this issue, several studies have been published proposing various experimental setups and methods of such analysis [6,7,8,9, 11,12,13,14,15,16,17]. These studies are briefly reviewed in Related works section of Additional file 1 accompanying this paper. It follows that in most cases a trialanderror approach has been adopted to determine parameters of a model of a primary electron beam. No phenomenological model enabling the values of primary electron beam parameters to be estimated directly from the measured dose profiles has been proposed. The proposed tuning procedures are often very specific with respect to the data required to determine the parameters of a model of a primary electron beam.
In contrast to such studies, we propose a flexible framework, termed DeepBeam, such that its user may collect the profile data using dosimetry tools and protocols at her/his disposal and select the profiles to be measured according to her/his best experience. Then, a regression model is created which can be directly and routinely used to estimate the parameters of the model of the primary electron beam. The proposed framework, apart from tuning electron beams of Monte Carlo simulators of real linear accelerators may also be applied in routine quality assurance of an operating linear accelerator—not only to verify its beam stability via dose profile analysis, but also to indicate which of the beam parameters had likely changed and by how much. The complete code of this framework and the data used for training the regression models are freely available at https://github.com/taborzbislaw/DeepBeam.
Material and methods
The MC simulator and sources of data
While the proposed framework for tuning primary electron beams of MC simulators of linear accelerators can be used for any such simulator, the present study is based on data generated by the PRIMO simulator, version 0.1.5.1307 [18] (www.primoproject.net). PRIMO is a freelydistributed application used for simulating radiation transport during radiotherapy [8, 17, 18]. It is based on the PENELOPE 2011 [19] general purpose Monte Carlo engine and allows simulation of dose delivery to be performed for a few linear accelerator models, based on their geometry, as provided by their manufacturers. This last feature is especially important, as details of accelerator geometry are usually confidential and may not be available from the manufacturer, even upon request. Hence, using PRIMO, attention could be focussed on the primary goal of designing a framework for tuning the electron beam of this simulator without undue concern with simulation details related to the physics, materials or the geometry configuration of the simulated accelerator system.
All simulations were run using the PRIMO Varian Clinac 2300 C/D simulator operating in photon mode at a nominal energy of 6 MV. Electron beam simulation in PRIMO is configured by specifying values of four beam parameters: E—the initial electron beam energy (in MeV), σ_{E}—the fullwidthathalfmaximum (FWHM) of the primary beam energy distribution (in MeV), s—the focal spot FWHM (in cm), and α—the angular beam divergence (in degrees). The developed framework should however be readily adaptable if different primary beam parameters were specified in the PRIMO simulator, or if other MC simulators of linear accelerators were applied.
Simulated input data
To generate training data for the machine learning framework, the simulations were run for a total of 300 tuples (E, σ_{E}, s, α) within the set S such that:
At the first simulation stage, 10^{8} histories (a history corresponds to a single electron of the virtual primary beam) were simulated for each tuple (E, σ_{E}, s, α) and the phasespace file (PSF) above the secondary collimators was saved for further purposes. At this first stage, the splitting roulette variance reduction technique [20] was used with the size of the splitting region set to the largest region, i.e. to the 40 × 40 cm^{2} field. The saved PSFs were then used to simulate radiation transport to a homogeneous cubic water phantom for three fields: 3 × 3 cm^{2}, 10 × 10 cm^{2}, and 30 × 30 cm^{2}. The size of the phantom was set to 50 × 50 × 50 cm^{3}. The doses in the phantom were tallied within a regular grid of 0.5 × 0.5 × 0.5 cm^{3} voxels. The respective faces of the phantom were set parallel to the respective main axes of the coordinate frame of reference of the accelerator. The main axis of the phantom coincided with the photon beam axis. The sourcetosurface distance (SSD) was set at 100 cm, the isocentre being located at the front surface of the phantom. Splitting in the water phantom was selected as the variance reduction method [20] at this simulation stage, with a splitting factor of 300. The uncertainty of the dose values tallied in the water phantom always remained within 1.5% (which corresponds to two standard deviations of MC calculated dose). The calculated 3D spatial distribution of doses within the phantom was saved to a text file, separately for each tuple (E, σ_{E}, s, α) and for each field. A total of 900 3D dose files were collected. Each 3D dose file contained 10^{6} dose values calculated by PRIMO at (x,y,z) coordinates given by the following coordinate ranges:
where the z axis is parallel to the radiation field axis. To generate testing data for the machine learning framework, the simulations were run further for 25 tuples (E, σ_{E}, s, α) with primary beam parameters sampled randomly from the following sets of values:
Applying the above sampling scheme, it was assured that the primary electron beam parameters (E, σ_{E}, s, α) in the testing set never coincided with parameters used for generating the training set, and consequently, that the electron beam parameters for the testing set were well separated from the electron beam parameter selected for training.
All simulations were run using the PlGrid infrastructure (Prometheus grid, https://kdm.cyfronet.pl/portal/Main_page) and required a total real time of about 2.5 months. During the simulation period 12 Prometheus nodes run the PRIMO software, each node equipped with two Intel Xeon E52680v3 processors, 24 cores in total, and 128 GB RAM. The simulation of a single case, i.e., of three fields for a single tuple (E, σ_{E}, s, α), required about 40 CPU hours. As the operating system installed on the nodes is Linux CentOS 7, while PRIMO is a Windows application, wine software (https://www.winehq.org/) was installed and configured in order to use PRIMO in graphic mode under Linux exactly as if Windows were the operating system.
Measured input data
Dose profiles were measured in water for the 6 MV photon beam of a clinically exploited Clinac 2300C/D medical accelerator at the Krakow Branch of the National Research Institute of Oncology. A PTW MP3 Water Phantom and PTW Markus Type 23,343 and PTW Semiflex Type 31,010 ionization chambers were used for dosimetry. PTW Mephysto software was applied for data collection. Three experimental setups of dose profile measurements were arranged, as described in more detail in the Results section.
Applied models and computational framework
The task to solve is a regression problem, i.e., given dose profiles in a water phantom, the parameters (E, σ_{E}, s, α) of the primary electron beam are to be estimated. To prepare training and test data, each 3D dose spatial distribution was normalized to the dose value calculated along the photon beam axis at the depth of maximum (D_{max} = 1.4 cm), which was then set to 100% (such normalization is not essential if not implemented in a clinical measurement system). Next, from each 3D dose file six profiles were extracted: one depth profile along the axis of the radiation field, and five lateral profiles at depths: 1.4 cm, 5 cm, 10 cm, 20 cm, and 30 cm. To match the resolution of the simulated profiles and the typical spatial resolution of clinical dosimetry systems (usually 1 mm), linear interpolation was applied to the tallied simulated doses during profile extraction. Additionally, as PRIMO assumes the electron beam spot to be of circular shape, the lateral dose profiles extracted from the 3D dose files consisted of averages over two perpendicular lateral dose profiles over the x and y directions. Such averaging is not a necessary condition and may be skipped if a more complex, e.g., elliptic, electron spot shape is assumed by the accelerator simulator.
The extracted dose profiles (18 profiles for each tuple (E, σ_{E}, s, α)) represent a reasonable maximum set Prof_{MAX} of dose profiles to be used in the proposed machine learning framework. Moreover, the extracted depth dose profiles span the range of z ∈ < 0.3 cm, 49.7 cm > , while all the extracted lateral dose profiles span the range of x ∈ < 24.7 cm, 24.7 cm > , i.e., the maximum ranges for the geometry of the simulated water phantom and for the spatial resolution of the grid of tallied dose values.
The proposed framework is customizable, meaning that any subset of the dose profiles can be selected from the complete set of dose profiles to match the needs of an individual user. The ranges over which the profiles are measured can also be arbitrarily selected to match the measurement ranges of real profiles. For example, the user may decide to build her/his regression model which predicts the parameters of the model of the electron beam (E, σ_{E}, s, α) from the depth dose profiles and from lateral dose profiles at 10 cm depth, all collected for 10 × 10 cm^{2} and 30 × 30 cm^{2} fields, depth dose profiles measured up to 35 cm, and lateral dose profiles measured over the ranges between − 10 cm and + 10 cm and between 20 cm to + 20 cm, for 10 × 10 cm^{2} and 30 × 30 cm^{2} fields, respectively. Given these userdefined constraints the framework finds the optimum regression model, as described in the following sections.
PCA + SVR regression model
Let Prof = {Prof_{1}, Prof_{2},…, Prof_{n}} represent a userselected subset of Prof_{MAX}. A userselected spatial range Range_{i} is associated with each Prof_{i} (Range_{i} would typically be the userdependent spatial range over which Prof_{i} is measured under clinical settings). Each subscript i corresponds to a unique field size and a unique dose profile type (either depth or lateral, at one of the five depths: D_{max} = 1.4 cm, 5 cm, 10 cm, 20 cm, or 30 cm).
As each dose profile Prof_{i} is sampled within a given spatial resolution (usually 1 mm), it may be considered a 1D vector of some dimensionality (dependent of the sampling resolution and the sampling range Range_{i}). The regression task which is to be solved can be formulated as follows:
where Par is any element of the tuple (E, σ_{E}, s, α), f_{Par} is the regression function and ε_{Par} is the residual term. The components of each Prof_{i} are however strongly correlated as they represent dose values measured at neighbouring spatial locations. For this reason, the model given by Eq. (4) may not be very effective, as the set of explanatory variables (arguments of f_{Par}) contains a high contribution of redundant information.
To resolve this redundancy problem dimensionality reduction is applied. Typically, dose profiles are specified by applying some adhoc features, such as width at half maximum, width of penumbra regions, “wing heights” in lateral profiles, etc. Here, rather than rely on such handcrafted features, Principal Component Analysis (PCA) is applied to the analysed profiles [21]. PCA is a method for finding uncorrelated variables from correlated ones (in our case—consecutive dose values along depth and lateral profiles). Finding such new variables reduces to solving an eigenvalue/eigenvector problem, and the new variables are linear combination of the old ones. Moreover, each PCA feature is assigned a percentage of the total variance of profile shapes it explains. As demonstrated in what follows, three most important PCA features usually explain over 98% of the variability of shapes of the training dose profiles. PCA reduces the dimensionality of the space of explanatory variables by a factor of 10^{2}—the final set of features consists of 3n elements (explanatory variables)—three features for each profile Prof_{i} in Prof. The learnt PCA models were saved in respective files (a separate PCA model M_{PCA,i} file for each index i) and subsequently used at the stage of model testing.
Clearly, for each i—index there are 300 training profiles {Prof_{i,1}, Prof_{i,2},…,Prof_{i,300}} corresponding to 300 different tuples {(E, σ_{E}, s, α)_{1}, (E, σ_{E}, s, α)_{2},…, (E, σ_{E}, s, α)_{300}} and a single PCA model M_{PCA,i} which extracts three features (F_{i,1}, F_{i,2}, F_{i,3})_{k} from Prof_{i,k}. Hence, the regression problem, after dimensionality reduction, becomes:
where Par is any element of the tuple (E, σ_{E}, s, α), \({f}_{PAR}^{PCA}\) is the PCAbased regression function and ε_{PCA} is the residual term. To learn the regression functions, the following training set, Tr, was applied:
Support Vector Regression (SVR) with radial basis function (rbf) kernel was selected as the regressor [22] though other options are also available. SVR with rbf kernel first transforms the explanatory variables (PCA features in our case) to some high dimensional space and then, in this high dimensional space, replaces nonlinear relationship of Eq. (5) with a multilinear one between transformed explanatory variables and explained variables (electron beam parameters in our case). This high dimensional multilinear relationship is then used to make predictions. The best regression models were selected using a fivefold cross validation run on Tr. After training, four \({f}_{PAR}^{PCA}\) regressors were obtained, one per E, σ_{E}, s, and α. The regression models were saved to files and subsequently used in testing.
The deep learning regression model
The processing pipeline described in the previous section consists of two separate steps: feature extraction, and training of four regressors. In the current section an endtoend regression model is described which, during training, learns both dose profile data representation and regression functions simultaneously for all primary beam parameters (E, σ_{E}, s, α). The model presented here is based on deep learning. The architecture of the deep learning (DL) model is outlined in Fig. 1.
The architecture is designed to follow the same processing steps as the approach described in the previous section. In short, each Prof_{i} in Prof is a separate input for the DL model and is processed by a separate encoder block. Each encoder block consists of a few convolution blocks. Each convolution block consists of two 1D convolutions (filter size equal to 3, number of filters equal to 16, 32, 64, etc. in the consecutive convolution blocks, ReLU activation) followed by a MaxPool1D layer which reduces the size of the data by a factor of two. The number L of convolution blocks in each encoder block is selected based on the length N of the input of this block, according to the formula L = int(log_{2}N/3), i.e., the number of features learnt by any encoder block cannot be less than 3. Each encoder block ends with 1D convolution with a single filter of unit size. The outputs of the encoder blocks are then concatenated to form a 1D vector of features (in analogy to PCA features). This feature vector is then processed by two fully connected layers of size 100 and ReLU activation. The output of the last fully connected layer is next fed into the final fully connected layer with four outputs and no activation. These outputs are expected to deliver estimates of E, σ_{E}, s, and α.
The training data Tr_{DL} for the DL model is:
The model is trained for 300 epochs using the Adam optimizer and a constant learning rate equal to 0.0001. The loss function selected for this regression problem was mean square error between the model outputs and the values of primary electron beam parameters for which the dose profiles at the model input were obtained. A 20% portion of the training set was randomly selected for model validation. The best model found during training was saved to a file and used in subsequent testing.
Testing the models
At the stage of model testing, the testing profiles were fed at the input of either the PCA + SVR or DL models. The PCA + SVR model first extracts the features from the testing profiles based on PCA models learnt on the training set. These test features are then processed by SVR regressors which return the predicted values of E, σ_{E}, s, and α. In the case of the DL model the raw testing profiles are fed at the input of the DL model which returns the predicted values of E, σ_{E}, s, and α. The true and predicted values of E, σ_{E}, s, and α are then compared using correlation analysis and linear regression.
Optimizing the solution with profiles reconstructed from regression results
The regression results can be further improved by minimizing the difference between the actual profiles being fed at the input of regressors and profiles reconstructed from the regression results. In particular, the parameters of the model of the primary electron beam for the training set Tr (Eq. (6)) have the form of a regular grid S, defined in Eq. (1), embedded within a 4D hypercube H. With every node Q of S associated are PCA features corresponding to the dose profiles determined for primary electron beam model parameters (E, σ_{E}, s, α)_{Q} assigned to Q. The regressions return a point P = (E, σ_{E}, s, α)_{PRED} within H (see Fig. 2 for a 2D example). Consecutively, using interpolation, PCA features corresponding to P may be determined, and next an inverse PCA transform applied to them in order to reconstruct profiles from the results of regression models (E, σ_{E}, s, α)_{PRED}. Thus, for each Prof_{i} in Prof a reconstructed profile RecProf_{i}(P) obtains, which in general differs from Prof_{i}. This difference can then be further minimized using one of several optimization methods, with (E, σ_{E}, s, α)_{PRED} as the starting point for such minimization. Namely, beginning with P = (E, σ_{E}, s, α)_{PRED}, P_{MIN} = (E, σ_{E}, s, α)_{MIN} is sought, such that:
where w_{i} is the weight assigned to the ith profile. In the experiments all w_{i} were set to unity but in general the user may set these according to her/his actual needs. The minimization problem defined in Eq. (8) was solved using the SLSQP method [23].
Applied software
All models were implemented in Python 3.6.10. The scipy library (version 1.5.2) was used to implement regression models using PCA and SVR. The same library was used to run interpolation over 3D dose distributions to extract dose profiles, optimization of the regression results and profile reconstruction from regression results. The DL model was implemented using the keras (version 2.3.1) library. All codes, pretrained models, as well as training and testing data, are freely available at https://github.com/taborzbislaw/DeepBeam.
Results
Analysis of simulated training data
Detailed analysis of simulated training data formed the basis for model design decisions and aided in selecting the best hyperparameters (that is, parameters C and epsilon of SVR, see for example https://scikitlearn.org/stable/modules/generated/sklearn.svm.SVR.html) for the developed model. Following these decisions, a final verification of the performance of the model was performed using only test data. Thus, test data were never used in model finetuning.
The first issue considered was what number of PCA features extracted from the profiles explains what fraction of the variability in the shapes of profiles. The results shown in Selecting the number of PCA features section of Additional file 1 indicate that three PCA features suffice in explaining most of the variability of the shapes of profiles.
The second issue considered was how selection of dose profiles influences precision of estimation of the parameters of a primary electron beam model. The results shown in Selecting dose profiles section of Additional file 1 indicate that a total of six profiles—one depth profile and two lateral profile, and any two of three fields (3 × 3 cm^{2}, 10 × 10 cm^{2}, or 30 × 30 cm^{2}) would be sufficient in obtaining precise predictions of E, s, and α values.
Considering the abovediscussed results obtained using training, the final design decision was made to train regressors based on PCA features extracted from a total of six profiles, i.e., three profiles (depth, lateral at D_{max} = 1.4 cm depth and lateral at 10 cm depth) of two field sizes (10 × 10 cm^{2} and 30 × 30 cm^{2}).
The testing results for the PCA + SVR model are shown in Fig. 3. Models trained on the training data were applied to previously unseen testing data and the predicted values of primary beam parameters compared to the ground truth data, i.e., the values of primary beam parameters applied in the generation of simulated profiles. Notably, the values of the coefficient of determination for the testing data were only slightly lower than those obtained for the training data—implying that the regressors were not overfitted. Also shown in this figure are bestfitted linear regression lines to demonstrate the precision with which the model is able to predict the primary beam parameters. The slopes of these regression lines are all close to 1.0. The prediction errors were estimated as values of standard deviation of the difference between the true and predicted values of the primary electron beam parameter, and were equal to 0.03 MeV, 0.007 cm, and 0.13 degrees for E, s, and α, respectively.
The testing results for the deep learning model trained on the same set of profiles as those used for the PCA + SVR model are shown in Fig. 4. The results for the deep learning model are slightly inferior to those obtained for the PCA + SVR model which is not surprising, since the deep model was trained only on 300 sets of profiles, which may not be sufficient for a deep learning task. Further refinements would certainly be possible, however in that case more data would need to be generated. Yet, as demonstrated in the next section, both models offer good starting points for optimizationbased estimates of the parameters of a model of the primary electron beam from clinical measurements, leading to virtually the same final results. The prediction errors for the deep model were equal to 0.065 MeV, 0.023 cm, and 0.21 degrees for E, s, and α, respectively.
Discrepancies between the slopes of the best fit lines and the ideal 1.0 value are due to the noise present in the training data, which, although being relatively low (1.5%) is however higher than that in real measurements. Decreasing the noise level to 0.5% would however increase the computation time by a factor of 9 which is unrealistic in view of the computational expense. The optimization procedure which follows the regression, as described in the previous section, resolves this issue.
Analysis of clinical data
The developed framework was used to find the values of primary electron beam parameters which could best reproduce real profiles measured using the 6 MV photon beam of a Clinac 2300C/D medical accelerator in a PTW MP3 Water Phantom. The applied input fields and profiles, obtained beam parameters, and mean errors of the reconstructed dose distributions against those measured, for three cases of experimental setups discussed below, are gathered in Table 1. The measured and reconstructed profiles for the respective sets of input profiles in each of these three cases are compared in Fig. 5.
Three experimental design cases were investigated, where different sets of measured profiles (as shown in the second and third columns of Table 1) were used as input. In each of the three cases, the models were trained on a set of training profiles corresponding to those measured, after suitable adjustment of the ranges of the training profiles. Following this training, the measured profiles were then input to the trained PCA + SVR or DL models to obtain the values of the parameters of the model of the primary electron beam, E_{PRED}, s_{PRED}, and α_{PRED}, shown in the fourth and fifth columns of Table 1, respectively. Because there was no possibility to train a regressor for predicting the value of σ_{E}, σ_{E} = 0.50 MeV was consistently used throughout. These initial predictions were next fed as input to the reconstructionbased minimization procedure. After optimizing these predicted values for either model, usually identical (or very similar) sets of finally estimated parameter values: E_{FINAL}, s_{FINAL}, and α_{FINAL}, shown in column 6 of Table 1, were obtained. These finally estimated electron beam model parameters values were then used to calculate the reconstructed dose profiles. The mean values of absolute differences between the measured and reconstructed profiles, and the corresponding 1D gamma index passing rates for dose tolerance equal to 3% of dose at depth of maximum and distance to agreement tolerance equal to 3 mm, are given in the last two columns of Table 1. The measured and reconstructed profiles in each of the three experimental cases are compared in Fig. 5.
The first set of measured profiles (Case 1) consisted of five profiles: two depth profiles and three lateral profiles, one of which was measured at the depth of D_{max} = 1.4 cm, as listed in Table 1. The depth profiles were measured to a depth of 35 cm while the ranges of measurement of lateral profiles were adjusted to the field size. The ranges of training data for this set of profiles were adjusted to the ranges of real measurements prior to being applied to train the PCA + SVR or DL models. The measured and reconstructed profiles for this case are shown in Fig. 5a.
The second set of measurement profiles (Case 2) consisted of one depth profile and three lateral profiles, one of which was measured at depth D_{max}, listed in column 2 of Table 1. The depth profile was measured to a depth of 30 cm while the ranges of measurement of lateral profiles were adjusted to the field size. The measured and reconstructed profiles for this case are shown in Fig. 5b.
Finally, the third set of measurement profiles (Case 3) consisted of six profiles: a depth profile and two lateral profiles, both at D_{max} depth. The depth profile was measured to a depth of 30 cm while the ranges of measurement of lateral profiles were adjusted to the field size. The measured and reconstructed profiles for this case are shown in Fig. 5c.
Commenting generally on the results obtained, one should note the remarkably consistent estimates of beam parameters obtained using either the PCA + SVR or the deep learning models, and the excellent agreement between the reconstructed and measured profiles, especially using the set of measured profiles and fields in the Case 3 study. However, even in the Case 1 study, the somewhat higher discrepancies observed at the borders of the lateral fields are to be expected. Over such regions of high dose gradients, higher uncertainties may be due to measurement uncertainties, to averaging of input data by the phantom software or to averaging of data in the training profiles, all affecting the quality of reconstructed profiles over such regions. The excellent agreement between the reconstructed and real profiles is confirmed by the low values of mean absolute errors displayed in column 7 of Table 1 in most cases ranging around 0.5% and exceeding 1% only once, and by high gamma passing rates.
While there is a very good agreement between measured and profiles reconstructed from regression results, based on PCA models, a final check of the proposed framework and must be comparison of measured and simulated profiles. To this end, the dose delivery was simulated for 10^{9} histories, using the virtual primary electron settings determined for Case 3 measurement experiment. Real profiles and simulated profiles for this case are compared in Fig. 6.
Discussion
Development of a method to relate specific features of the inphantom measured dose distributions with values of the parameters of the model of the electron beam in a medical accelerator was the prime motivation in this work. Ideally, for the solution of such a task to be of practical utility, it should be delivered as a model which accepts at its input a welldefined assembly of measured profiles, returning estimated values of the parameters characterizing the primary electron beam of a given accelerator, together with dose profiles reconstructed based on these estimated parameters. Availability of an elsewheredeveloped complete MC model of the accelerator, i.e., of the PRIMO Monte Carlo software, and development and successful application of statistical learning technology made it possible to accomplish this task.
The solution presented in this study is flexible and readily usable. Based on Monte Carlo simulation data, a set of models was developed which extract features from a userdefined collection of dose profiles to estimate primary electron beam model parameters from such features and returns reconstructed profiles for comparison with those measured and used as input. In contrast to all the work published so far, the characteristics of the dose profile shapes and the regression functions are both machinelearned and collected in a datadependent manner. Neither handcrafted shape features nor adhoc regression functions need to be applied, these being replaced by a wellestablished background of statistical learning. The two models developed in this work—one based on PCA feature extraction and SVR regression and another, based on endtoend deeplearning which simultaneously learns to represent the shapes of the dose profiles and to apply the most suitable regression functions—are the proposed solution. Such a solution will support several different experimental arrangements, offering optimum regression models for any such arrangement. By studying a few experimental cases, the effect of the selection of the experimental setup on the accuracy of parameter estimation has been demonstrated and discussed.
Estimation of the primary electron beam model parameters involves two steps, the first of which is an initial guess made by a regression model. In principle, this initial guess could be made without any such model—merely by a brute force search over all collected profiles for a set of profiles that best fit the analysed profiles. The second stage of estimation, which is based on reconstructionbased minimization, requires that techniques be developed to effectively represent the shape of the measured profiles—as introduced in the present work. It should also be noted that a brute force search delivers no explanatory power, in contrast to regression models introduced in the present study. In particular, regression models deliver an association between explanatory and explaining variables—for example, given a regression model it can be inferred in what manner will any specific changes of primary electron beam model parameters influence the shapes of the resulting dose profiles. This is the general advantage of regression models over any brute force search strategies, which is why regression models are widely used in statistical data analysis.
The developed framework was tested using both simulated and real data. The tests based on simulated data demonstrated that the coefficient of determination of true primary beam parameters from dose profiles varies from around 92% for angular beam divergence to 97% for mean energy of the simulated electron beam. It was not possible to train the developed model to predict the FWHM of energy spectrum of primary electrons, implying that this particular beam parameter does not seriously affect the shapes of dose profiles, at least for the cases studied in this work.
The presented framework has been made freely available together with the simulation data used for training the models. Model training and testing stages do not require extensive computation resources. Using any uptodate PC with no graphic card support, the PCA + SVR models can be trained within a few seconds and prediction takes no longer than a second. The training of deep learning models usually requires about ten hours of an average CPU. However, testing the deep models takes no longer than testing the PCA + SVR model.
The presented framework can be readily adapted to individual requirements, perhaps guided by the availability of profile sets prepared for QA purposes, or by ease of measurement. Dose data could also be supplied by dose distributions measured by detectors other than ionization chambers, e.g., dye films, especially over regions of high dose gradient. Indeed, for any selection of profiles which the user intends to apply in determining values of parameters of the primary electron beam models, only a few lines of the configuration code need to be changed to indicate such userspecified selection. Then, the regression models must be retrained, which takes only a few seconds with no user intervention, except for running the code. Following this training run, the estimation of electron beam model parameters and reconstruction of profiles from estimation results can be executed—this requiring a few more seconds, provided that the measured doses are read by a script. Three examples of such procedures for reading measured doses from text files have also been provided in the freely available repository at https://github.com/taborzbislaw/DeepBeam.
Conclusion
The purpose of the present study was to develop a flexible framework with suitable regression models for estimating parameters of the model of primary electron beam in simulators of medical linear accelerators, based on real reference dose profiles measured in a water phantom. The proposed framework is a readily applicable and customizable tool which may be applied in tuning parameters of primary electron beams of Monte Carlo simulators of linear accelerators. The codes, training and test data, together with readout procedures, are freely available at the site: https://github.com/taborzbislaw/DeepBeam.
Availability of data and materials
All data and code are available at https://github.com/taborzbislaw/DeepBeam.
Change history
28 February 2022
A Correction to this paper has been published: https://doi.org/10.1186/s1301402201983x
Abbreviations
 MC:

Monte Carlo
 EBT:

External beam photon therapy
 TPS:

Therapy planning system
 QA:

Quality assurance
 FWHM:

Full width at half maximum
 PCA:

Principal component analysis
 SVR:

Support vector regression
 DL:

Deep learning
References
Storchi P, Woudstra E. Calculation of the absorbed dose distribution due to irregularly shaped photon beams using pencil beam kernels derived from basic beam data. Phys Med Biol. 1996;41(4):637–56.
Ulmer W, Pyyry J, Kaissl W. A 3D photon superposition/convolution algorithm and its foundation on results of Monte Carlo calculations. Phys Med Biol. 2005;50(8):1767–90.
Failla GA, Wareing T, Archambault Y, Thompson S. Acuros XB advanced dose calculation for the Eclipse treatment planning system. Palo Alto, CA: Varian Medical Systems; 2010
Chetty IJ, Curran B, Cygler JE, et al. Report of the AAPM Task Group 105: Issues associated with clinical implementation of Monte Carlobased photon and electron external beam treatment planning. Med Phys. 2007;34(12):4818–53.
Seco J, Verhaegen F. Monte Carlo techniques in radiation therapy. Boca Raton: CRC Press; 2013.
Björk P, Knöös K, Nilsson P. Influence of initial electron beam characteristics on Monte Carlo calculated absorbed dose distributions for linear accelerator electron beams. Phys Med Biol. 2002;47(22):4019–41.
Tzedakis A, Damilakis JE, Mazonakis M, Stratakis J, Varveris H, Gourtsoyiannis N. Influence of initial electron beam parameters on Monte Carlo calculated absorbed dose distributions for radiotherapy photon beams. Med Phys. 2004;31(4):907–13.
Maskani R, Tahmasebibirgani MJ, HoseiniGhahfarokhi M, Fatahiasl J. Determination of initial beam parameters of varian 2100 CD LINAC for various therapeutic electrons using PRIMO. Asian Pac J Cancer Prev. 2014;16(17):7795–801.
Pena J, GonzalezCastano DM, Gomez F, SánchezDoblado F, Hartmann GH. Automatic determination of primary electron beam parameters in Monte Carlo simulation. Med Phys. 2007;34(3):1076–84.
Mayles P, Nahum A, Rosenwald JC, editors. Handbook of radiotherapy physics. Boca Raton: CRC Press; 2007.
Jiang SB, Kapur A, Ma CM. Electron beam modelling and commissioning for Monte Carlo treatment planning. Med Phys. 2000;27(1):180–91.
Almberg SS, Frengen J, Kylling A, Lindmo T. Monte Carlo linear accelerator simulation of megavoltage photon beams: independent determination of initial beam parameters. Med Phys. 2012;39(1):40–7.
Park H, Choi HJ, Kim JI, Min CH. Analysis of dose distribution according to the initial electron beam of the linear accelerator: a Monte Carlo study. J Radiat Prot Res. 2018;43(1):10–9.
Mohammed M, El Bardouni T, Chakir E, Boukhal H, Saeed M, Ahmed AA. Monte Carlo simulation of Varian Linac for 6 MV photon beam with BEAMnrc code. Radiat Phys Chem. 2018;144:69–75.
Tugrul T, Erogul O. Determination of initial electron parameters by means of Monte Carlo simulations for the Siemens Artiste Linac 6 MV photon beam. Rep Pract Oncol Radiother. 2019;24(4):331–7.
Najafzadeh M, HoseiniGhafarokhi M, Mayn Bolagh RS, Haghparast M, Zarifi S, Nickfarjam A, Farhood A, Chow JCL. Benchmarking of Monte Carlo model of Siemens Oncor® linear accelerator for 18MV photon beam: determination of initial electron beam parameters. J Xray Sci Technol. 2019;27(6):1047–70.
Bacala AM. Linac photon beam finetuning in PRIMO using the gammaindex analysis toolkit. Radiat Oncol. 2020;15:8.
Rodriguez M, Sempau J, Brualla L. PRIMO: a graphical environment for the Monte Carlo simulation of Varian and Elekta linacs. Strahlenther Onkol. 2013;189(10):881–6.
Salvat F, FernándezVarea JM, Sempau J. PENELOPE 2011—a code system for Monte Carlo simulation of electron and photon transport. IssylesMoulineaux: OECD Nuclear Energy Agency; 2011.
Brualla L, Sauerwein W. On the efficiency of azimuthal and rotational splitting for Monte Carlo simulation of clinical linear accelerators. Radiat Phys Chem. 2010;79(9):929–32.
Jolliffe IT. Principal component analysis. New York: Springer; 2002.
Vapnik VN. Statistical learning theory. New York: Wiley; 1998.
Kraft D. A software package for sequential quadratic programming. Tech. Rep. DFVLRFB 88–28, Koln: DLR German Aerospace Center — Institute for Flight Mechanics; 1988.
Acknowledgements
This research was supported by PLGrid Infrastructure. The authors wish to thank Mr. Łukasz Flis of the ACC Cyfronet AGH for his invaluable help in handling computations on the Prometheus grid.
Funding
This study was supported by the POIR.04.04.000015E5/18 Project. The POIR.04.04.000015E5/18 project is carried out within the TEAMNET programme of the Foundation for Polish Science cofinanced by the European Union under the European Regional Development Fund.
Author information
Authors and Affiliations
Contributions
ZT conceived the work, conducted all the simulations, performed the subsequent analyses, wrote the manuscript and revised it. DK conducted the measurements, wrote the manuscript and revised it. MPRW wrote the manuscript and revised it. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
All authors consent to the publication.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original online version of this article was revised: the statement in the Funding information section was incorrectly given as “This study was supported by the Foundation for Polish Science under Grant POIR.04.04.000015E5/18.” and should have read "This study was supported by the POIR.04.04.000015E5/18 Project. The POIR.04.04.000015E5/18 project is carried out within the TEAMNET programme of the Foundation for Polish Science cofinanced by the European Union under the European Regional Development Fund." This has been updated.
Supplementary Information
Additional file 1.
Supplementary material.
Additional file 2. Fig. S1.
Variance of profiles’ shapes explained by PCS. (a) Fraction of explained variance in the shapes of profiles, versus number of PCA features. The error bars represent standard deviation of the explained variance values, calculated for six profiles of three squared fields (3x3 cm2, 10x10 cm2 and 30x30 cm2); (b) Fraction of profile shape variance explained by the first three features, as averaged over the three squared fields, for six profiles (depth profile: ID=1, and five lateral profiles at depths Dmax=1.4 cm, 5 cm, 10 cm, 20 cm, and 30 cm, IDs from 2 to 6, respectively).
Additional file 3. Fig. S2.
Variation in profiles’ shapes. Variation in profile shapes in relation to any one of the first three PCA features being either negative or positive (left, middle and right panels) for a 10x10 cm2, lateral profile at 30 cm depth (upper panels) or for a depth profile of a 10x10 cm2 field (lower panels). For explanation of “mean shape”, “negative feature” and “positive feature” labels, see text.
Additional file 4. Fig. S3.
Selecting dose profiles. Coefficient of determination between ground truth and predicted values of energy, spot size, and angular divergence, for regression based on all of three fields (3x3 cm2, 10x10 cm2 and 30x30 cm2) and a different number of profiles of each field. For further details, see text.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Tabor, Z., Kabat, D. & Waligórski, M.P.R. DeepBeam: a machine learning framework for tuning the primary electron beam of the PRIMO Monte Carlo software. Radiat Oncol 16, 124 (2021). https://doi.org/10.1186/s1301402101847w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s1301402101847w
Keywords
 Machine learning
 Deep learning
 Monte Carlo
 Beam simulation
 Quality assurance (QA)
 Quality control (QC)
 Principal component analysis (PCA)
 Support vector regression