Range accuracy in carbon ion treatment planning based on CT-calibration with real tissue samples
© Rietzel et al. 2007
Received: 31 October 2006
Accepted: 23 March 2007
Published: 23 March 2007
Skip to main content
© Rietzel et al. 2007
Received: 31 October 2006
Accepted: 23 March 2007
Published: 23 March 2007
The precision in carbon ion radiotherapy depends on the calibration of Hounsfield units (HU) as measured with computed tomography (CT) to water equivalence. This calibration can cause relevant differences between treatment planning and treatment delivery.
Calibration data for several soft tissues were measured repeatedly to assess the accuracy of range calibration. Samples of fresh animal tissues including fat, brain, kidney, liver, and several muscle tissues were used. First, samples were CT scanned. Then carbon ion radiographic measurements were performed at several positions. Residual ranges behind the samples were compared to ranges in water.
Based on the measured data the accuracy of the current Hounsfield look-up table for range calibration of soft tissues is 0.2 ± 1.2%. Accuracy in range calibration of 1% corresponds to ~1 mm carbon ion range control in 10 cm water equivalent depth which is comparable to typical treatment depths for head and neck tumors.
Carbon ion ranges can be controlled within ~1 mm in soft tissue for typical depths of head and neck treatments.
At the German carbon ion therapy facility Gesellschaft für Schwerionenforschung (GSI) more than 300 patients have been treated since 1997, primarily in the head and neck region [1, 2]. The inverse depth dose profile, the so called Bragg curve, as well as the small lateral scattering of carbon ions allow to achieve good conformity between target volume and treated volume. The range of charged particles in tissue is determined by their primary energy as well as the tissue density distribution along the beam path. Therefore precise knowledge of ion stopping powers within the patient anatomy is essential for precise treatment planning.
At GSI, treatment planning is performed with the in-house treatment planning system Treatment Planning for Particles (TRiP) . For optimization and dose calculations, patient CT data in Hounsfield units (HU) are transformed in a water-equivalent system. Already in 1979 Chen et al  as well as Mustafa and Jackson in 1983  published the use of such range calibration tables and their significance for charged particle therapy. At GSI the transformation of CT HUs to water equivalence is based on a Hounsfield look-up table (HLUT) that was initially measured using tissue equivalent phantom materials as well as bovine and human bony tissues [6, 7].
Methods to obtain and validate precise ratios between proton stopping powers and CT values have been systematically investigated at the Paul Scherrer Institut (PSI), Switzerland. Schneider et al.  reported a stoichiometric calibration of CT HUs to proton stopping powers. They conclude that tissue substitute calibrations should be used with caution. Their results were validated with proton radiographic measurements of a sheep head. The method of proton radiography as a tool for quality control in proton therapy had been previously published by Schneider and Pedroni . Schaffner and Pedroni then reported the experimental verification of the relation between CT HUs and proton stopping powers for proton therapy treatment planning . CT scans as well as proton radiographic measurements of several animal tissues and bone samples were performed. In conclusion, they expected that the range of protons in the human body can be controlled to better than ± 1.1% of the water equivalent range in soft tissue and ± 1.8% in bone, which translates into a range precision of about 1–3 mm in typical treatment situations. Recently Schneider et al reported the feasibility of optimizing the relation between CT-HUs and proton stopping powers patient specifically . They acquired an in vivo proton radiograph of a dog patient treated for a nasal tumor. The HLUT was then optimized patient specifically and possible dosimetric consequences were assessed. The standard deviation between measured and calculated water equivalence was reduced from 7.9 to 6.7 mm when using the patient specifically optimized HLUT. Note that these standard deviations were derived from proton radiography and therefore correspond to uncertainties for penetrating the full extent of the dog head.
The most advanced method to obtain information on proton stopping powers in 3D is probably proton cone-beam computed tomography. The development of such a system for the acquisition of volumetric information on proton stopping powers was reported by Zygmanski et al from Massachusetts General Hospital . Their feasibility study suggests that there may be some advantage in obtaining proton stopping powers directly with proton cone-beam CT.
The relation between carbon ion stopping powers and CT HUs has been extensively investigated at the National Institute of Radiological Sciences (NIRS) in Japan and at GSI. Matsufuji et al (NIRS) investigated the relationship between CT HU and electron density, scatter angle and nuclear reaction . To assess conversion accuracy, they compared the method to determine HLUTs as reported by Chen et al  to that of PSI [8, 10]. They concluded that Chen et al's method shows good agreement with real tissues in the lung to soft tissue HU region, whereas PSI's method retains good agreement over the entire HU range including bone. The difference between both methods reaches a maximum of 2.6% in the high HU region.
Kanematsu et al (NIRS) published a polybinary tissue model for radiotherapy treatment planning . Body tissues are approximated by substitutes, namely water, air, ethanol, and potassium phosphate solution. Based on standard mixtures with known stopping powers, it is then possible to calibrate the relationship between CT HUs and carbon ion stopping powers by CT scanning of the samples only. The calibration method was successfully tested with biological materials.
In this work we present a summary of data for repeated measurements in the soft tissue HU region with different CT scanners to document the precision of the HLUT calibration curve. While quality assurance of the CT scanner calibration can routinely be performed with tissue equivalent materials as well as bone tissue samples once their integral stopping powers have been measured, this is not possible for soft tissues. For soft tissue samples CT HUs and integral stopping powers have to be measured on the same day. Measurements with soft tissues were repeated mainly for quality assurance and to assess accuracy of the HLUT in the soft tissue HU region. Some of the initial results have been reported previously [16–18].
Fresh pig soft tissue samples were obtained directly from the butcher. These samples included brain, kidney, fat, liver, and various muscle tissues. Tissues were purified, for example fat was cut off muscle tissue and out of kidneys. Then each tissue was cut in blocks and wrapped in thin plastic foil (to avoid drying out) to fit into a PMMA box (inner dimensions 10 × 10 × 30 cm3, wall thickness 1 cm). The PMMA box was closed applying slight pressure. This was necessary to avoid shifting of the samples between CT scanning and carbon ion radiography. All measurements were performed within 12 hours after the pig was butchered.
Two different CT scanner models were used for the HLUT measurement series in this study. Initially, a Siemens Somatom Plus 4 scanner (1), later a Siemens Somatom Volume Zoom scanner (2) was used. Image date were acquired according to a scan protocol for carbon ion therapy to ensure consistency between patient treatments and HLUT measurements. CT data were acquired in sequence scan mode slice by slice, reconstruction filter for the adult head (AH50), tube voltage of 120 kVp, and an integrated current of 420 mAs. CT voxel sizes were 1.29 × 1.29 × 1.00 mm3 (1) and 1.38 × 1.38 × 1.00 mm3 (2).
For radiography measurements positions in homogeneous regions of the samples were selected. For example small inclusions of air within the tissue materials could not completely be excluded although special attention was paid to avoid air gaps during sample preparation. For paths in carbon ion beam direction (z-direction, orthogonal to slices), means and standard deviations of lines in the CT data were computed. These data were plotted similar to a projection to identify homogeneous tissue regions. Regions with low standard deviations per tissue sample were then selected for carbon ion radiography measurements.
with Δ shift of residual range behind the sample compared to water and d thickness of the sample. To assess the accuracy of the current HLUT, water equivalence for measured average HUs was calculated based on the current HLUT and compared to the measured water-equivalent path lengths (WEPL).
Small inclusions of air in the phantom as well as partial volume effects adjacent to the PMMA box's walls can affect the calculation of average HUs as well as residual range measurements. Voxels that clearly contained air, mainly between samples and PMMA box, were excluded for average HU calculation. Corresponding residual range measurements were consequently adjusted as well. Voxels containing air have a negligible stopping power in comparison to soft tissues and water. Therefore it is reasonable to simply subtract the distance of traversed air within the box from the residual range that was measured in the water telescope. This corresponds to virtually filling the air gaps with water.
Voxels with increased HUs adjacent to PMMA walls or obviously decreased HUs within or next to the sample tissues were excluded from average HU calculations only. This seems reasonable since radiographic measurements will not suffer from partial CT volume effects and voxels with slightly decreased HUs, e.g. from average 40 HU to local -100HU, are expected to consist of ~10% air (-1000 HU) and ~90% tissue (~40 HU).
Comparison of measured and calculated Hounsfield look-up table points
CT scanner 1
CT scanner 2
-73.9 ± 20.8
-97.9 ± 45.6
-72.7 ± 21.2
-109.3 ± 43.6
-73.9 ± 27.6
-102.6 ± 44.4
45.0 ± 17.4
47.4 ± 16.0
40.9 ± 15.0
38.7 ± 18.6
44.0 ± 16.4
53.1 ± 26.9
49.0 ± 20.7
66.0 ± 15.8
54.0 ± 15.6
57.5 ± 26.6
83.3 ± 20.5
75.5 ± 20.1
79.7 ± 19.0
74.4 ± 27.6
81.3 ± 13.7
72.9 ± 21.3
65.5 ± 18.8
66.9 ± 23.6
65.4 ± 19.6
66.0 ± 25.8
50.0 ± 47.5
73.3 ± 16.2
63.6 ± 25.5
69.5 ± 22.6
Residual ranges were measured in 200 μm steps. The data in figure 5 demonstrate that determination of the Bragg peak positions is possible with at least the same precision. Radiographic measurements were performed for 10 cm of tissue. Uncertainties introduced by carbon ion radiography directly are therefore negligible. Only positioning errors of the samples could have an impact on radiography measurements because integral stopping powers would then be measured for the wrong beam paths. The phantom was aligned according to a laser system in the treatment room with a precision that can be expected to be better than 1 mm. By selecting the radiography positions based on HU averages and standard deviations along beam paths possible impacts of small positioning errors were further decreased.
One of the most critical tasks in charged particle radiotherapy is appropriate calibration of the CT scanner, concerning both, stability as well as reproducibility of absolute HUs. For slightly heterogeneous materials like soft tissue samples, it is not possible to differentiate between partial volume effects and tissue heterogeneities based on CT HUs. HU variations as denoted by the standard deviations along the radiography beam paths in table 1 can therefore not be analyzed further. The penetrated 10 cm of tissue correspond to 100 voxels. We expect this number of voxels to be sufficient for representative HU averages.
Systematic shifts between measured HU data and the current HLUT possibly occur for measurement series (1) in the region of 60 to 80 HU and and series (2) in the region of -100 to -110 HU. For series (1), the systematic shift is within the 1% HLUT confidence interval. For series (2), the shift in the fat tissue HU region is slightly outside of the 2% confidence interval. To possibly improve the HLUT calibration it might be necessary to generate a new calibration curve for scanner (2). However, another unceartainty can result from sample selection and preparation. The standard deviation for fat tissues was ~45 HU in measurement series (2) compared to ~20 HU in series (1). In combination with the decreased average HUs in series (2) for fat tissues, this indicates that most likely differences between the two samples were present that resulted in a relative WEPL difference of -2.6%.
The slightly higher standard deviations of HUs in comparison to the data reported by Jäkel et al  are attributed to the CT slice thickness of 1 mm in this study compared to 3 mm. We simulated 3 mm slice thickness by averaging 3 slices throughout the samples. For example the average HU for one of the brain tissue HLUT points then changes from 40.9 HU to 41.0 HU only, whereas the corresponding standard deviation decreases from 15.0 HU to 9.8 HU. For real measurements with 3 mm slice thickness further decrease of the standard deviations can be expected due to improved signal-to-noise ratios.
In general our goal is to control the range of carbon ions within the patient to better than 1%. For typical patient treatments in the head and neck region water equivalent ranges to the target of approximately 10 cm can be expected. With range control of ~1% this results in a range uncertainty of ~1 mm. Schaffner et al (PSI) reported that they expect the range of protons to be controlled in soft tissue within 1.1% of the water equivalent range . Our results are comparable. By repeated measurements we showed that on average the range of carbon ions in soft tissue can be reproduced with an accuracy of 0.2 ± 1.2%.
Another aspect of HLUT measurements are beam hardening effects as initially reported by Minohara et al . They demonstrated the effect of different object sizes on the calibration of HUs to water equivalence. To date, only patients with tumors in the head and neck as well as in the pelvic region are treated at GSI [1, 2]. We selected the dimensions of the PMMA box phantom to be comparable to typical head and neck dimensions because most of the tumors treated at GSI are located in this region, many of them directly abutting the brain stem. This ensures highest precision for treatments of head and neck tumors while slightly decreased range control might be expected for targets in the pelvic region.
Calibration of CT HUs to water equivalence is critical to control the range of charged particles in the human body. With repeated measurements we found a precision for carbon ion range calibration in soft tissues of 0.2 ± 1.2%, and in soft tissues involved in typical head and neck treatment of 0.4% ± 1.4%. For soft tissues in typical patient treatments in the head and neck region this corresponds to a range uncertainty below 1 mm.
All authors were funded by GSI.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.