- Open Access
Effective radiation attenuation calibration for breast density: compression thickness influences and correction
BioMedical Engineering OnLinevolume 9, Article number: 73 (2010)
Calibrating mammograms to produce a standardized breast density measurement for breast cancer risk analysis requires an accurate spatial measure of the compressed breast thickness. Thickness inaccuracies due to the nominal system readout value and compression paddle orientation induce unacceptable errors in the calibration.
A thickness correction was developed and evaluated using a fully specified two-component surrogate breast model. A previously developed calibration approach based on effective radiation attenuation coefficient measurements was used in the analysis. Water and oil were used to construct phantoms to replicate the deformable properties of the breast. Phantoms consisting of measured proportions of water and oil were used to estimate calibration errors without correction, evaluate the thickness correction, and investigate the reproducibility of the various calibration representations under compression thickness variations.
The average thickness uncertainty due to compression paddle warp was characterized to within 0.5 mm. The relative calibration error was reduced to 7% from 48-68% with the correction. The normalized effective radiation attenuation coefficient (planar) representation was reproducible under intra-sample compression thickness variations compared with calibrated volume measures.
Incorporating this thickness correction into the rigid breast tissue equivalent calibration method should improve the calibration accuracy of mammograms for risk assessments using the reproducible planar calibration measure.
Breast density is a significant breast cancer risk factor [1–3]. When estimating breast density from mammograms, the breast is considered as a two-component model consisting of adipose and fibroglandular (abbreviated as glandular hereafter) tissue to varying degrees. One method of measuring breast density uses binary labeling resulting in areas of radiographically dense tissue (glandular tissue) or adipose (non-dense) tissue. Breast density is then estimated as the ratio of the radiographically dense area to the total breast area (dense + adipose) [4–6]. Binary labeling techniques have repeatedly produced a measure that correlates well with breast cancer  without considering inter-image acquisition technique differences.
Recent work has focused on calibration to compensate for differences in the inter-image acquisition technique [7–15]. Calibration produces various standardized data representations by adjusting for variations in the target/filter combination, x-ray tube voltage, radiation exposure, and compressed breast thickness. By reducing measurement variation, calibration should produce a breast density measure that shows a stronger association with breast cancer in comparison with measurements derived without calibration. Moreover if the calibration measures prove viable, breast density assessments can be automated. Additionally, calibration applied at the local level supports the analysis of the calibrated measure's spatial distribution across the breast field of view that is not supported by the binary measure of breast density. In contrast, recent work [16, 17] indicates that calibrated measures of breast density do not produce risk associations stronger than those produced without calibration. We hypothesize, calibration techniques will require further investigation and modification before they prove useful.
We have built our approach [8, 9, 18] upon earlier calibration work  in full field digital mammography (FFDM) to produce a normalized effective radiation attenuation coefficient representation for breast density. This work was developed under the assumption that known phantom heights corresponded with the mammography system compressed breast thickness digital readout value. Preliminary analyses showed that this assumption was not valid. Inaccurate compressed breast thickness represents an ongoing technical challenge in calibrated breast density research [19–21].
This paper addresses compressed breast thickness inaccuracies using deformable phantoms with the following objectives: (1) develop and evaluate a compressed breast thickness correction method that can be incorporated into the rigid breast tissue equivalent phantom calibration model, and (2) compare calibration representation reproducibility under compression thickness variations using known compositions. We used a surrogate breast model because the volumetric compositions were fully specified.
To address the study objectives, a method was derived from the calibration methodology to characterize the compressed sample's spatial thickness variation. This method was evaluated under controlled conditions using modified (rigid) breast tissue equivalent phantoms with known height variations before addressing deformable samples. The rigid phantoms used for this work were purchased from Computerized Imaging References Systems (CIRS, Norfolk VA) and were described in our previous report . These phantoms (non-modified) are standards in calibration research. We coupled this spatial thickness characterization with mechanical measurements of the compression paddle to construct a correction. The correction was evaluated within the calibration application using an alternative two-component deformable phantom model constructed with water and oil filled balloon phantoms that replicated patient imaging. We investigated the compression behavior similarity between mammograms and these deformable phantoms. This alternative model was then used to investigate the calibration representation reproducibility while varying the compression thickness.
Imaging was performed with a General Electric Senographe 2000 D FFDM system, which is used for routine breast cancer screening examinations. The detector specifics were described previously . All phantom images were acquired as left craniocaudal (LCC) views using a Molybdenum/Molybdenum target/filter combination and 26 kV x-ray tube voltage with 160 mAs, where mAs is the system readout value for the generated radiation. An extensive array of acquisition techniques was not required to validate and illustrate the main principles. The image data matrix is 1914 × 2294 pixels. A standard (x, y) positive coordinate system was used with the origin, (0, 0), located at the bottom left hand-corner of the displayed image, where × and y locations are integer valued pixel coordinates ranging from 0-1903 and 0-2293, respectively. The outside detector edge defined the y-axis (vertical direction). The following definitions were used below: xmax = 1903, and ymax = 2293. This system produces both raw and processed image data for display (for presentation) purposes. Raw image data, represented by r(x, y) below, was used for this work. This system is equipped with 15 × 20 cm2 rigid compression paddle. The system compression force changes in 10 N increments with a minimum system readout value of 30 N (the first system readout value above 0.0). The system digital compression thickness readout value, defined as ts below, is cited in cm.
The calibration method described previously is outlined to support the subsequent analysis. The logarithmic response (LR) is given by LR(x, y) = ln[r(x, y)/mAs], where mAs is the system readout value and r(x, y) is the raw image. Calibration curves are generated by measuring the logarithmic response for both adipose and glandular tissue equivalent phantoms as a function of phantom height above the breast support surface using a specific reference mAs [8, 9]. Two calibration points are required to standardize a given image. These points correspond to the logarithmic response for both the adipose and glandular tissue equivalent phantoms for the same image acquisition technique (same filter/target combination, kV and compressed breast thickness). These two calibration points (explained below) are generated with previously estimated calibration regression parameters
where μj is the effective radiation attenuation coefficient (cm-1) for either the glandular (μg ) or adipose (μf ) tissue equivalent phantoms, lj is the respective logarithmic intercept for either the glandular (lg ), or adipose (lf ) phantoms, and T(x, y) is the spatially dependent compressed breast thickness above the breast support surface (or deformed paddle height). We have demonstrated previously [8, 9] that calibration curves measured over a wide range of phantoms heights [T in Eq. (1)] and modeled with the Eq. (1) form as function of T were well approximated as linear for a fixed x-ray tube voltage (kV) and target/filter combination (fixed beam condition). Likewise, the spatial dependencies for μj and lj can also be dropped without introducing significant error. For each target/filter combination and kV setting, there is a unique set of four calibration parameters derived from regression analysis. These regression parameters (μj and lj ) are stored and used in the calibration application to generate the two required calibration points (discussed above). For a given kV and target/filter combination, a generalized (approximation) form of Beer's law holds by replacing the monochromatic radiation attenuation coefficient with the effective radiation coefficient as expressed in Eq. (1). The idealizations used to develop this model were also discussed previously [8–10] and are not addressed here. All calibration data was acquired with the reference exposure setting mAs = 160. For a given beam type, calibration data acquired with one reference mAs value is sufficient to generate calibration points for arbitrary mAs values. Because the calibration curves are linear, the calibration application takes a linear form for a given beam (given target/filter and kV). Dropping the spatial dependencies, the calibration mapping takes this form
where LRa is a measured arbitrary logarithmic response for the same compressed sample thickness, T, used in Eq. (1) and percent glandular (PG) defines the calibrated representation. To determine M and B, we use the adipose and glandular regression parameters with Eq. (1) to provide the two boundary conditions using the known percent glandular compositions (that is, when j = f, PG = 0, and when j = g, PG = 100), which gives
where LRa is a measured arbitrary logarithmic response for the same compressed sample thickness, T, used in Eq. (1). The choice to constrain the mapping between 0-100 was arbitrary. Thus, for a given beam-type, the mapping is a function of the four related spatially-static regression parameters and the variable T that has a spatial dependency in general. The mapping indicates that an arbitrary logarithmic response (a measured LRa ) is a linear combination of the adipose and glandular response as defined in Eq. (1) for the same beam type. A variant of Eq. (2) is found by rescaling the percent glandular form, p = PG/100, which gives the effective attenuation coefficient representation
Equations (2) and (5) are planar representations. Spatial averaging [Eq. (2) or Eq. (5)] gives either the average percent glandular, <PG >, composition or equivalently the average effective radiation attenuation coefficient, < μe >, for the sample, respectively.
Inaccurate Compression Thickness
Inaccurate calibration of mammograms stems from applying Eqs. (1-4) to an arbitrary logarithmic response with the incorrect compression thickness. Thickness inaccuracies are due to both the compression paddle deformation/tilt and inaccurate (nominal) system compression thickness digital readout value. There is a rigid/non-rigid misalignment between the non-compressible tissue equivalent phantoms used to generate the height dependent calibration data, which have precise heights and plane surfaces, and the compressed breast thickness determined by the system value. We replicated this misalignment (described below in more detail) with deformable phantoms for the calibration data generation, estimating the compression thickness variation, and evaluating the thickness correction with calibration accuracy comparisons.
Related Calibration Representations
Related calibration representations are developed for comparisons. The spatially distributed glandular height representation is given by
where p was defined above. Using an alternative approach, some researchers estimate the glandular height as the calibrated measure . Either the total glandular volume or average glandular-volume/pixel can be determined with Eq. (6). When the volume of interest projection contains n pixels with digital spatial resolution = d (the detector element spacing measured in length units), the average glandular-volume/pixel is given by
Multiplying the above equation by n gives the total glandular volume. Other researchers use a normalized volume breast density measure  that we label as VN. VN is the total glandular volume normalized by the total volume considered. Using Eq.(6) and Eq.(7) with constant compression thickness gives
The above expression is equivalent to <PG >/100, which has different dimensionality compared with Eq. (7). A similar [Eq. (8)] breast density representation  results using the total glandular height normalized by the total breast height (approximated by nT above). We compared these representations below.
Thickness Variation Characterization
We derived a method to estimate the compression paddle deformation due to applied compression force using Eq. (1). This approach leverages the linear logarithmic response characteristic. Expressing Eq. (1) at some arbitrary × location, x0 , gives
where the subscript defines previously estimated calibration parameters for an attenuating material referenced as k. We use k because the approach was evaluated first with the rigid glandular (modified) breast tissue equivalent phantoms (k = g) with known height variations before applying the technique to the deformable water phantoms (k = w) with unknown height variations. The logarithmic response at the adjacent value of × is similarly expressed
where Δt1 is the height variation of the adjacent logarithmic response along the x-direction. The above expression can be extended to the next value, LR2(x+ 2Δx) evaluated at t+Δt1+Δt2 , and so on. The relative height variation n-pixels from x0 is then estimated by subtracting the zero-order term and dividing by the known effective radiation attenuation coefficient giving
The logarithmic-intercept was eliminated because it is influenced by height uncertainty. Equation (11) gives the relative height variation from × = x0 to × = x0 +n pixels, orthogonal to the y-axis at a given y location. A one-dimensional profile is determined by letting n become an integer variable defined over a given x-range. The two-dimensional relative height surface
[H(x, y) ] was derived by applying Eq. (11) over an extended y-range. The H(x, y) surface describes the relative height (lower surface) of the compression paddle warp/deformation. We let x0 = 0, which keyed the analysis to x0 = 0 (detector/paddle front-edge). To reduce variation, operations along the × direction were performed using the average of a sliding 10 pixel window, maneuvered without overlap. The approach was applied along the y-direction by substituting y and Δy for × and Δx in the above development.
Thickness Variation Characterization Evaluation Methods
The H(x, y) method (described above) was applied to modified glandular breast tissue equivalent phantoms for evaluation purposes before applying it to unknown thickness variations. Standard 2 cm (rigid glandular breast tissue equivalent) phantoms were machined by the phantom manufacturer (CIRS) along one face forming slanted-plane phantoms (slant-phantoms) with constant height gradients rising from 1.7-1.8 cm, 1.6-1.8 cm, 1.5-1.8 cm, and 1.4-1.8 cm along the x-direction (inclined planes rising from 1 mm to 4 mm in height along the x-direction). The modification is shown in Figure 1. Modified phantoms were used in combination with the standard (uniform thickness) glandular tissue equivalent phantoms to build slant-phantoms of varying total heights as specified in Table 1 (see Figure 1). The lower side of the slant-phantom was aligned along × = 0 to simulate the upward paddle bulge caused by compression near the central region of the paddle. For evaluation, H(x, y) was compared with the known height variation of the slant-phantoms. The effective radiation attenuation coefficient for the glandular equivalent phantom, which was measured previously using regression analysis for 1-6 cm phantom heights  for the respective acquisition technique (μg = 0.833 cm-1 ), was used with Eq. (11).
Non-rigid Breast Simulating Phantoms
Phantoms were constructed to approximate the shape and deformable behavior of breast tissue. Breast tissue is assumed deformable but non-compressible . We use compress herein to imply the compression paddle operated and the phantom (or breast) deformed accordingly. Thick walled balloons were filled with either distilled water (water), vegetable oil (oil), or water/oil mixtures with known proportions to simulate breast compression.
The breast (CC orientation) is a deformable organ that is relatively pliable. Therefore, the paddle plane was treated as a plate that warps when loaded. The applied compression force is a measure of the sample's resistance (load) to deformation distributed over the contact area transmitted through the strained paddle to the compressor arm. It is the rigidity/elasticity of the paddle plane in combination with that elastic resistive force offered by the compressed sample that determines the final paddle warp (bulge) required for static equilibrium during imaging. Although the breast is a complicated mixture of tissue with varying elastic properties, as an approximation we assume the entire organ behaves as composite deformable body with its own global elastic property. This is supported by earlier work showing that breast compression and mammographic density are unrelated . To show that the deformable phantoms reasonably approximate the resistance offered by the breast undergoing compression, two requirements should hold: (1) the applied force and paddle contact area should be approximately coincident, and (2) the contact area geometry should be similar. Thus, the actual compression thickness similarity is irrelevant for this comparison. If we assume the paddle surface lies in a plane with a given surface area when not stressed, the corresponding warped (stressed) surface will have a slightly greater surface area than that calculated with its x-y dimensions due to the curvature induced by load. In the area calculations, we used the x-y planar dimensions of the paddle, which neglects the increased surface area.
To estimate the breast-paddle contact area, 110 FFDM CC view study mammograms were selected consecutively from the database. The CC view has reduced chest muscle interference and is therefore the preferable view . This is not a limitation, because we are only considering CC views in this calibration work for the same reason. The breast area was automatically segmented from the background using a simple pixel threshold method based on the acquisition technique. We estimated that eroding the breast outline by 21 percent along a radial direction located the breast-paddle contact area, which is an average approximation. This estimate comes from prior user-assisted analysis of estimating the paddle-contact area by evaluating line-profiles through the breast (100 mammograms) to determine the location of the intensity drop-off due to the breast curvature. The radial direction origin was centered at × = 0, and y = vertical direction centroid of the segmented breast calculated with pixel values = 1 (on the segmented breast area). The contact areas for the deformable phantoms shown in Figure 2 were estimated with a threshold approach based on their characteristic single valued pixel value distribution over the paddle contact area. We compared the compression force and contact areas of the mammograms with the deformable phantoms to assess similarities.
Compression Paddle Measurements
The compression paddle's orientation to the breast support surface and system thickness readout were characterized under conditions that simulated patient imaging using (compressing) water filled phantoms. The paddle was inspected manually to assess its pliability. The resting paddle is shown in Figure 3. Measurements from the breast support surface to the paddle corners were taken under various conditions and compression forces by compressing the water-filled phantoms. These measures were taken repeatedly over an 18-month period. As an approximation, we assumed the detector and paddle outlines were aligned (the paddle area dimension is less than the breast support surface dimension). A feeler gauge technique, similar to that used to gap spark plugs, was used to make these compression paddle height measurements. Combinations of materials with precise thicknesses ranging in thickness from 1 mm to 10 mm were used to measure the distance between the breast support surface and paddle at each corner. Perimeter bulge was assessed with a straightedge along the x-direction perimeter (for y = 0 and y = ymax) from the bottom side of the paddle. The paddle front-edge perimeter flex (along y for × = 0) was difficult to assess properly with mechanical measurements due to both the positioning of the deformable phantoms and the construction of the paddle near the edge from within the top side. Therefore, Eq. (11) was modified (along y) and used to better estimate the degree of flex along the paddle front-edge perimeter. Secondly, the compression paddle bulge was coarsely characterized using a similar feeler gauge technique applied within the central portion of the paddle (top side) along the y-direction. These mechanical measurements in combination with the H(x, y) analysis were used to construct a thickness correction surface.
Paddle Deformation Characterization
A set of six water filled phantoms was imaged over a range of compression forces resulting in 35 images. These were used to characterize the compression thickness spatial variation due to the compression paddle plane deformation (bending or warping) by applying the H(x, y) analysis. In this analysis, we used the estimated regression parameters for water (k = w) to estimate the H(x, y) surface [see Eqs. (9-11)] because the phantom conforms to the warped compression paddle surface (the compression thickness surface). These phantoms were filled with arbitrary volumes of water ranging from 500-1200 ml to simulate various breast sizes as shown in Figure 2. The analysis was constrained to the outlined regions to avoid the curvature regions. These regions were 500 ×500 pixels or larger. Phantoms were imaged over a range of compression forces (summarized below). Phantoms were placed on the breast support surface in the central portion of the detector in the y-direction by observation to simulate patient positioning. The effective radiation attenuation coefficient for water was estimated with methods described below using the Eq. (1) form as the regression model. The H(x, y) method was used to locate the maximum bulge heights and positions. Regression analysis was used to determine the relationships between the compression force and the paddle bending characteristics.
The thickness correction was developed using the two forms of measurements outlined above after validating the H(x, y) approach. The paddle bending summaries (from above) were joined with the paddle perimeter measurements. These measures, in combination, were used as boundary conditions to construct the polynomial thickness correction as a function of compression force.
Alternative Calibration Model and Correction Evaluation Methods
We used deformable phantoms to construct an alternative two-component model to (1) simulate calibrating mammograms, (2) duplicate the rigid/non-rigid misalignment, and (3) evaluate the thickness correction. Two additional calibration references phantoms were constructed with either water or oil with arbitrary volumes. Calibration curves were generated for each reference phantom shown in Figure 4 as a function of compressed phantom thicknesses (2.5-6.5 cm range). The outlined regions (small strips) were used because the compressed phantom thickness was known for these regions without applying the correction using ts +0.5 cm for the entire strip thickness (demonstrated below). Restricting the analysis to this strip simulated the methods used for generating the breast tissue equivalent regression parameters (using rigid phantoms with flat surfaces and precise heights). The water and oil regression parameters were used in Eqs. (1-4) as alternatives for the tissue equivalent parameters. These alternative parameters were used to initialize M and B as described above. Two additional mixture phantoms (deformable) were constructed with measured (known) volumetric proportions of water and oil to assess the thickness correction within the calibration application. These mixture phantoms were 34% and 31% water by volume (34/66 and 31/69 water/oil mixtures). The 34/66 mixture contained 300 ml water and 590 ml oil, whereas the 31/69 mixture contained 200 ml water and 441.5 ml oil. We estimated a 2% error in the water percentage for either mixture. The mixture phantoms were calibrated with and without the correction for comparison. Because these calibration parameters were generated with minimal height uncertainty within the strips (known uniform heights determined by the mechanical measurements), they were used to approximate the rigid/non-rigid misalignment. Calibrating the mixture balloon phantoms with the system readout height captures the rigid/non-rigid misalignment that occurs when calibrating mammograms using the system breast thickness readout height (nominal height); this quantity (incorrect height) is then used to generate the two required calibration points (incorrect points) with the phantom (with precise uniform heights) regression parameters. Although water and oil were used to develop an alternative two-component system, the mapping in Eq. (2) was referred to as percent glandular below rather than the percent water mapping because these two mappings are isomorphic.
Standardized Representations Analysis
We used the alternative two-component system to investigate the various calibration representations and determine their reproducibility under varying compression forces. The 34/66 mixture was calibrated for three system compression readout thicknesses: 5.0 cm, 4.4 cm, and 3.8 cm. This simulated imaging the same patient at different times with varying compression forces. The total glandular volume, average glandular-volume/pixel, and percent glandular representations were calculated using Eqs. (2-8) and compared. The similarity between the effective x-ray attenuation coefficient representation expressed in Eq. (5) and the percent glandular representation expressed in Eq. (2) was demonstrated with regression analysis by calibrating the 34/66 mixture over a range of compression thicknesses. Both the percent glandular and glandular descriptions are used below for consistency with the understanding that they apply to water content for this work only.
Thickness Variation Characterization Validation
The H(x, y) method was evaluated with modified breast tissue equivalent phantom imaging (Table 1). This analysis was constrained to large rectangular regions of 1000 × 1600 pixels centered on the detector in the y-direction aligned with the front edge of the breast support surface. The difference image
was used for comparison, where HT(x, y) is the respective theoretical relative surface generated with the known constant height gradient for a given slant-phantom. The d(x, y) pixel distributions were summarized for all examples in Table 1. The average deviation of d(x, y) was generally less than 0.5 mm and not dependent upon the total phantom height.
Compression Paddle Assessment
By physical inspection, the paddle plane has a stiff-membrane characteristic that permits constrained flexing. As shown in Figure 3, the sidewalls provide rigidity to the perimeter. Exerting spatially limited pressures at arbitrary locations within the plane induces similar bulge profiles with crests about the midlines. The plane of the paddle also has an upward curvature (about 1 mm crown) when resting with the maximum at approximately 73-75 mm from the chest wall slightly below the y-midpoint. We made mechanical measurements of the compression paddle perimeter repeatedly over an 18-month period. These measurements were consistent in both distance from the breast support surface and compression force. Surface flexing had negligible effects about the paddle perimeter in the x-direction at y = 0 and y = ymax due to the paddle sidewalls (Figure 3). The paddle perimeter tilts in the x-direction when experiencing compression resistance. The relative perimeter elevation (measured in cm) was approximated by this expression
The absolute perimeter height (cm) was given by tx + ts. Equation (13) shows an upward paddle tilt toward the front edge of the detector, which is not present without applied compression force. We approximate less than ± 1.0 mm uncertainty in all measurements due to both measuring error/resolution and torque exerted on the paddle due to the position of the phantom. The arrow in Figure 3 points to the paddle triangular slide connection. The upward deflection is due primarily to the slack in this connection. Play in the slide allows the entire paddle plane to deflect upward in accord with the above relation occurring at less than 3 dN before bending occurs. Therefore, we assumed the paddle tilt was a maximum when imaging. The coarse measurements taken inside the compression paddle with the straightedge technique indicated the paddle surface bulge ranged from (0-4) mm from small to large compression forces up to 15 dN. These coarse measurements showed there is one maximum paddle bulge height that is a function of compression force. Figure 5 shows the relevant elevations of the paddle perimeter with respect to the breast support with an arbitrary bulge profile. We used Eq. (13) with the maximum bulge height and position coordinates as boundary conditions for the thickness correction.
The modified form of Eq. (11) was used (along y) to assess the degree of bulge along the paddle front-edge perimeter. Figure 6 shows three one-dimensional profiles along the y-direction (x = 20) for a typical deformable phantom for three compression force levels. The curvature was less than 0.4 mm over an 8 cm span, which was negligible. Similar findings resulted when applying the analysis to the other water phantoms (from the 35 images). Therefore, we used Eq. (13) as an approximation for all y at × = 0 and × = xmax. The maximum upward paddle height due to tilt alone was estimated with Eq. (13) for × = 0, which gives 0.5 cm (above ts).
Paddle Deformation Characterization
We applied the H(x, y) analysis to the collection of 35 water phantom images. For each H(x, y) surface, the coordinates, (xm , ym), and the maximum bulge height, hm, were estimated as a function of compression force (Fn = system compression force readout quantity). The relevant parameters and distances are shown in Figure 5 and in Figure 7. The estimated maximum bulge height (hm ) quantities are relative to Eq. (13) evaluated at × = xm. The Hm distance is the maximum bulge height estimated with H(x, y). This is the distance above the front-edge of the compression paddle (above ts + 0.5 cm) as illustrated in Figure 7. Figure 8 shows a one dimensional profile through H(x, y) along the × direction that intersects hm for the second phantom shown in Figure 2. The hm - Fn regression plot is shown in Figure 9, and the related regression analysis is summarized in Table 2, which shows hm is well approximated as a linear function of Fn. The R-square value indicates the model validity. Figure 10 shows the Fn - An scatter plot that compares the 110 mammograms (squares) and the 35 deformable phantom (filled circles) images. Summaries of the water phantom characteristics are listed in Table 3. For reference, the average maximum distance from the × = 0 to the eroded breast border for 110 mammograms was approximately 81 mm with a 24 mm distribution standard deviation, whereas the estimated quantities for the balloon phantoms were 86 mm and 60 mm, respectively (Table 3). For this system, the average compression force was estimated with 395 mammograms: <Fn> = 57 N (distribution standard deviation ≈ 21 N). Eighty-eight percent of these images were within the 3-8 dN range. Because (1) the mammogram samples encapsulate the deformable phantom samples in Figure 10 over the 3-9 dN range, and (2) both the mammogram and phantom borders are approximately semi-circular, we conclude the two requirements specified earlier were approximately met, and their compression properties are similar. In summary, for the correction (from Table 3), we used ym = 2294/2 (approximation) and the average value of xm for bulge height positions independent of Fn. The value of hm was generated for each specific Fn in the correction construction using the parameters in Table 2.
Thickness Correction Construction
The compression paddle measurements provided four boundary conditions for constructing the thickness surface correction. Therefore, we used separable third degree polynomials for the correction. First, a one dimensional ridge-polynomial (Figure 8) along the y-direction was constructed that passes through the maximum bulge height and coordinates
The bulge coordinates, (xm , ym,), and associated height, hm , provided two interior boundary conditions. There is one ridge-profile per correction surface. Two other boundary conditions resulted from matching the ridge-profile height with the measured compression paddle perimeter height at (xm, 0) and (xm, ymax). These four boundary conditions defined the coefficients in Eq. (14): (1) R(xm, 0) = tx(xm), (2) R(xm, ymax) = tx(xm), (3) R(xm, ym ) = hm + tx(xm), and (4) dR/dy = 0 at (xm, ym). The two-dimensional correction surface was generated as a series of one-dimensional cubic polynomials in the x-direction using the ridge-profile intersection as the maximum height and position boundary conditions as shown in Figure 7 (interior boundary conditions). Two other boundary conditions were found by matching the x-direction polynomial height with the measured paddle parameter height at × = 0 and × = xmax. The two-dimension relative correction surface was generated by constructing a one-dimensional x-polynomial for each value of y expressed as
where the coefficient subscripts include the y dependency for given profile. For fixed y, the coefficients were defined with these boundary conditions: (1) tc(0, y) = tx(0), (2) tc(xmax, y) = tx(xmax), (3) tc(xm, y) = R(xm, y), and (4) ∂tc/∂x = 0 at × = xm. A 20 pixel constant (relative) height margin (x = 0-19) was set equal with tx(0) to approximate the rigidity of the paddle front-edge. This was neglected at the other three perimeter segments due to the large distances from the bulge height position. The corrected compressed sample thickness in cm was expressed as t(x, y) = ts + tc(x, y).
Thickness Correction Evaluation
We evaluated the compressed thickness correction within the calibration application.
Regression (calibration) parameters were measured from the regions (strips) outlined in the water/oil reference images shown in Figure 4 over a range of compressed sample thicknesses. These regions were divided into 50 × 50 pixel sub-regions. The average pixel value for each sub-region was used to generate the logarithmic response for each region and phantom thickness. Phantom thicknesses were derived from the mechanical measurements given by t = ts+ 0.5 cm, which is a close approximation because tx(50) ≈ 0.50 cm. Linear regression was applied to each sub-region. The summarized regression distribution quantities are provided in Table 4 with ts for reference. Spatial averages of the regression parameters were used in Eqs. (3-4) with Eq. (2) to calibrate the mixture examples.
We used two mixture phantoms, shown in Figure 11, to estimate the uncertainty caused by inaccurate compression thickness. The calibration was applied by dividing these larger regions into a grid of 10 × 10 pixel regions and averaging within each grid (analogous to the calibration data generation). The logarithmic response (LR) was formed by the average pixel value within each grid: LRa= ln(grid-average/160.0) The average corrected thickness calculated over the respective 10 × 10 grid was used for T(x, y) in Eqs. (6-8). The calibration results for the 34/66 mixture (example # 1) corresponding to Figure 11 (left) are given in Table 5, and the calibration results for the 31/69 mixture corresponding to Figure 11 (right) are given in Table 6. The percent glandular (PG) rows show the calibration with the correction. The PGs rows show the calibration using the system compression thickness readout, ts , and the PGs+5 rows show the calibration using ts + 0.5 cm, which is a static spatial correction for comparison. The region of interest in Figure 11 (left) is shown in Figure 12 in both the raw (left) and calibrated representations (right). The thickness correction precision was estimated with the 34/66 mixture (example # 1) by performing the calibration with a small perturbation, t(x, y) +0.1 cm, added to the corrected thickness. The PGΔ row (Table 5 only) gives the perturbed calibration results as reference for 1 mm thickness variation. The perturbation analysis indicates the correction was within ±1 mm (average) precision. To demonstrate reproducibility, the 34/66 mixture was rotated by approximately 90 degrees (clockwise) and imaged over a range of compression forces shown in Table 7 (same format), which gave similar results.
Calibrated Representation Comparison
We compared the percent glandular (PG) and volumetric representations using Eqs. (7-8) with the polynomial correction for three system thickness readout values: 5.0 cm 4.4 cm and 3.8 cm. We retained the usage of glandular for comparison purposes, although water content was determined in various forms. The analysis was applied to the ROI (34/66 mixture) shown in Figure 11 (left) and in Figure 12. Using Eq. (7), the respective average glandular-volume/pixel quantities were estimated as [0.196, 0.175, 0.155] mm3/pixel, whereas the respective total glandular volumes were [101.9, 91.4, 80. 8] ml. These volumetric quantities changed significantly for the selected volume, whereas the PG representation was consistent (Table 5). To emphasize this finding, total fluid volumes for these examples were also estimated as [288.0, 257.9, 227.5] ml respectively. Using Eq. (8), the respective planar spatial summaries are given by: <PG> = 101.9/288.0 × 100 = 35.3, <PG> = 91.4/257.9 × 100 = 35.4, and <PG> = 80.8/227.5 ×100 = 35.5 These examples show the validity of Eq. (8) and that the PG representation is consistent with respect to thickness variations caused by applied compression force variations (Table 5).
We used the 34/66 mixture example # 2 (because of the wider range of thickness samples) to show the relation between the PG and the Eq. (5) representation. The LR was calculated by averaging the sub-regions for each thickness (average corrected thickness over the region) in Table 7. Figure 13 shows the regression analysis findings for the 34/66 mixture (example # 2). The absolute value slope is an estimate of the average effective radiation attenuation coefficient: < μe > = 0.546 ± 0.01. Using p = 0.34 as a known quantity with the effective attenuation coefficient quantities in Table 4 gives μe = 0.34 × 0.708 +0.66 × 0.458 = 0.543, which is in agreement with the slope estimation. The findings for the other 34/66 sample (example # 1) gave < μe > = 0.513 ± 0.01, which is also in agreement and shows that both the PG and μe representations more resemble planar measures than volumetric breast density measurements.
The inaccurate compression thickness problem was addressed as two separate components (1) the paddle tilt due to play in the mechanical connection, which was not dependent upon the compressed sample, and (2) the paddle bulge (flex) due its elasticity and the compressed sample's resistance. Serial mechanical measurements of the paddle perimeter were approximately invariant and within ±1.0 mm precision. The H(x, y) analysis was evaluated under known conditions (precision ≤ 0.5 mm), and then used to estimate the paddle bulge. The paddle tilt and bulge assessments were used as boundary conditions for the cubic polynomial thickness correction.
We evaluated the thickness correction using methods that duplicated the rigid/non-rigid misalignment. Figure 10 shows agreement between the compressed behavior of patient mammograms and the deformable (water) phantoms. The overlap in the region between 3-8 dN illustrates the similarity. The relative calibration (average) error was reduced to 7% from 48-68% when applying the thickness correction (Tables 5, 6, 7. The thickness-corrected calibration results were in agreement with the known percent glandular (PG) quantities and within the margin of composition uncertainty. When comparing the static correction findings with those estimated with the surface correction, the latter produced calibration quantities that were closer to the known values. However, the static correction accounted for a greater portion of the overall deviation as gauged by comparing the PG and PGs+5 entries with the PGs entries (Table 5 and Table 6). This is expected because the static correction is embedded within the surface correction. The mechanical correction component was not heavily dependent upon the phantom - breast similarity. To emphasize these overall improvement gains, the average relative difference between the known and measured PG composition quantities is provided parenthetically in the first column for each of the three calibration examples (Tables 5, 6, 7). The accuracy improvements are due to the overall (average) corrected thickness precision, which was approximately within ±1 mm (Table 5). To evaluate the replication properties of both the phantom construction and the correction, the 34/66 mixture was repositioned, imaged, and calibrated, which resulted in similar findings (Table 7). The thickness correction was evaluated further by measuring the calibration parameters over a wide-area in the reference phantoms (Figure 4). The agreement between the wide-area parameters (with the correction) and those parameters measured from the strip regions (Table 4) shows the validity of the correction. In contrast, when there is thickness inaccuracy, the intercepts showed marked variation as demonstrated by comparing corrected quantities (generated from the same wide-area) with the non-corrected quantities (Table 4). We presented these findings in the PG representation because it was reproducible with respect to intra-sample thickness variations, in contrast with other volume measures.
Inaccurate compressed breast thickness is a known source of uncertainty in calibration research. Optical stereoscopic photogrammetry (OSP) methods [20, 21] using stereo triangulation are also under investigation to address this problem. One variation mounted the OSP device on the mammography unit to make compressed breast measurements , which may not be of practical use in the clinical setting . Another variation used OSP measurements of various breast models under compression to generate a thickness correction . Mawdsley et al  found the maximum paddle height occurs at 20 mm from the chest wall (at the y-midpoint) using a system with a specific tilt-paddle. In contrast, our findings (Table 3) show the maximum occurs approximately 57 mm from the chest wall. These findings may not be directly comparable because of the differing paddle connection and operating mechanisms. Varying tilt orthogonal to the chest wall position will impact the maximum paddle height position. If the paddle front edge is fixed while increasing the tilt angle (lowering the paddle at × = xmax), the bulge height maximum position will shift towards the chest-wall position. Moreover, the plane of the paddle used for our work has an upward curvature (about 1 mm crown) when resting with the maximum at approximately 73-75 mm from the chest wall slightly below the y-midpoint. The central portion of the paddle-plane also has slight but noticeable membrane characteristic when flexed with small forces. Our bulge height positions are consistent with the outer breast-paddle contact distance (~81 mm for breast and 86 mm for phantoms) when considering the plane of the paddle behaves as a deformed (bent) thin plate  with the load changing from a distributed load to no-load past the paddle-sample contact area. Our findings agree with Mawdsley et al  in that (1) the linear correction offers improved accuracy because (in this case) the offset with the system readout thickness and paddle tilt induce more variation than the paddle flex, and (2) in general there can be a significant deviation between the system readout value and the actual compressed breast thickness that requires correction. Other researchers investigated thickness inaccuracies using radio-opaque markers and magnification geometry,  which showed negligible deformation parallel to the chest wall and upward tilt from the breast periphery to the chest wall but to a much larger degree than indicated by Eq. (13). Our findings agree, in part, with these researchers  in that the deformation (near × = 0 only) parallel to the chest wall position was small; this related work did not address paddle bulge in the direction orthogonal to the chest wall position.
The calibration representation comparison showed the similarities and differences between percent glandular (PG) and related calibrated volume and height measures. The PG representation is a planar measure that is equivalent to both the normalized volumetric  and the normalized height  measurements in summary, suggesting the definitions used in the literature are not uniform. The total volume representations varied under the assumptions made here, whereas the PG measure was consistent under thickness variations for the same sample. Similar arguments apply to the total glandular height representation  as well.
We developed an alternative model to meet the study objectives because the phantom compositions were known. This eliminated uncertainty but its applicability relies on the similarity of the surrogate phantoms with the original model. When using mammograms to evaluate the various relationships, the compositions are unknown. Some researchers use binary labeled (breast density) mammograms [13, 20] or tissue measures derived from other imaging modalities  in the calibration developmental work, which could introduce uncertainty. In the final validation analysis, the various calibrated measures will require a known cancer/no-cancer (CNC) endpoint to show measurement association. The developmental work could use the CNC endpoint as class separation optimization criterion for making correction adjustments, but this would preclude using the same data for independent association validation. It is less-costly to develop alternative strategies to develop and assess calibration modifications because properly designed databases that include cancer patients are time consuming and expensive to construct. The best approach is still an open ended inquiry because there is little evidence at this time showing that calibrated measurements are efficacious.
The evaluation was performed with phantoms that behaved similar to that of compressed breast deformation, which is a coarse approximation. The effect of skin thickness, (if any) on the calibration accuracy was not addressed because the stretched balloon thickness was negligible compared to skin thickness, which is on the order of ~1-3 mm . The overall analysis could be improved with better phantom construction methods using manufacturing techniques. The paddle bulge assessments provided an empirical (averaged) solution to loaded thin plate problem. Alternatively, the warp of the paddle plane could be estimated using numerical methods derived from plate theory  by (1) considering the paddle plane (thin-plate) loading of each breast separately, and (2) determining the appropriate loading geometry (eroded breast silhouette) and paddle perimeter boundary conditions. Future work includes exploring these more formal techniques of modeling the loaded paddle that could eliminate the need for the deformable breast surrogate models. Nevertheless while the deformable phantoms were less than perfect, the work showed that the thickness correction improved the calibration accuracy dramatically. Our preliminary studies were performed with homogenous phantoms, which are reasonable surrogates for developmental work but are not capable of capturing either the tissue heterogeneity present in mammograms or chest wall compression interaction. The final validation of the percent glandular measure will require a cancer/no-cancer endpoint comparison.
Boyd NF, Rommens JM, Vogt K, Lee V, Hopper JL, Yaffe MJ, Paterson AD: Mammographic breast density as an intermediate phenotype for breast cancer. Lancet Oncol 2005, 6: 798–808. 10.1016/S1470-2045(05)70390-9
McCormack VA, dos Santos Silva I: Breast density and parenchymal patterns as markers of breast cancer risk: a meta-analysis. Cancer Epidemiol Biomarkers Prev 2006, 15: 1159–1169. 10.1158/1055-9965.EPI-06-0034
Boyd NF, Guo H, Martin LJ, Sun L, Stone J, Fishell E, Jong RA, Hislop G, Chiarelli A, Minkin S, Yaffe MJ: Mammographic density and the risk and detection of breast cancer. N Engl J Med 2007, 356: 227–236. 10.1056/NEJMoa062790
Byng JW, Boyd NF, Fishell E, Jong RA, Yaffe MJ: The quantitative analysis of mammographic densities. Phys Med Biol 1994, 39: 1629–1638. 10.1088/0031-9155/39/10/008
Boyd NF, Byng JW, Jong RA, Fishell EK, Little LE, Miller AB, Lockwood GA, Tritchler DL, Yaffe MJ: Quantitative classification of mammographic densities and breast cancer risk: results from the Canadian National Breast Screening Study. J Natl Cancer Inst 1995, 87: 670–675. 10.1093/jnci/87.9.670
Boyd NF, Lockwood GA, Byng JW, Tritchler DL, Yaffe MJ: Mammographic densities and breast cancer risk. Cancer Epidemiol Biomarkers Prev 1998, 7: 1133–1144.
Highnam R, Brady M: Mammographic Image Analysis. Boston, MA: Kluwer Academic Publishers; 1999.
Heine JJ, Behera M: Effective x-ray attenuation measurements with full field digital mammography. Med Phys 2006, 33: 4350–4366. 10.1118/1.2356648
Heine JJ, Thomas JA: Effective x-ray attenuation coefficient measurements from two full field digital mammography systems for data calibration applications. Biomed Eng Online 2008, 7: 13. 10.1186/1475-925X-7-13
Kaufhold J, Thomas JA, Eberhard JW, Galbo CE, Trotter DE: A calibration approach to glandular tissue composition estimation in digital mammography. Med Phys 2002, 29: 1867–1880. 10.1118/1.1493215
Pawluczyk O, Augustine BJ, Yaffe MJ, Rico D, Yang J, Mawdsley GE, Boyd NF: A volumetric method for estimation of breast density on digitized screen-film mammograms. Med Phys 2003, 30: 352–364. 10.1118/1.1539038
Shepherd JA, Herve L, Landau J, Fan B, Kerlikowske K, Cummings SR: Novel use of single X-ray absorptiometry for measuring breast density. Technol Cancer Res Treat 2005, 4: 173–182.
Malkov S, Wang J, Kerlikowske K, Cummings SR, Shepherd JA: Single x-ray absorptiometry method for the quantitative mammographic measure of fibroglandular tissue volume. Med Phys 2009, 36: 5525–5536. 10.1118/1.3253972
van Engeland S, Snoeren PR, Huisman H, Boetes C, Karssemeijer N: Volumetric breast density estimation from full-field digital mammograms. IEEE Trans Med Imaging 2006, 25: 273–282. 10.1109/TMI.2005.862741
Highnam R, Pan X, Warren R, Jeffreys M, Davey Smith G, Brady M: Breast composition measurements using retrospective standard mammogram form (SMF). Phys Med Biol 2006, 51: 2695–2713. 10.1088/0031-9155/51/11/001
Ding J, Warren R, Warsi I, Day N, Thompson D, Brady M, Tromans C, Highnam R, Easton D: Evaluating the effectiveness of using standard mammogram form to predict breast cancer risk: case-control study. Cancer Epidemiol Biomarkers Prev 2008, 17: 1074–1081. 10.1158/1055-9965.EPI-07-2634
Boyd N, Martin L, Gunasekara A, Melnichouk O, Maudsley G, Peressotti C, Yaffe M, Minkin S: Mammographic density and breast cancer risk: evaluation of a novel method of measuring breast tissue volumes. Cancer Epidemiol Biomarkers Prev 2009, 18: 1754–1762. 10.1158/1055-9965.EPI-09-0107
Heine JJ, Cao K, Beam C: Cumulative Sum Quality Control for Calibrated Breast Density Measurements. Medical Physics 2009, 36: 5380–5390. 10.1118/1.3250842
Diffey J, Hufton A, Beeston C, Smith J, Marchant T, Astley S: Quantifying Breast Thickness for Density Measurement. In Digital Mammography, 9th International Workshop, IWDM, 2008; July 20–23, 2008; Tuscon, AZ, USA. Edited by: Krupinski EA. Springer-Verlag; 2008:651–658.
Mawdsley GE, Tyson AH, Peressotti CL, Jong RA, Yaffe MJ: Accurate estimation of compressed breast thickness in mammography. Med Phys 2009, 36: 577–586. 10.1118/1.3065068
Tyson AH, Mawdsley GE, Yaffe MJ: Measurement of compressed breast thickness by optical stereoscopic photogrammetry. Med Phys 2009, 36: 569–576. 10.1118/1.3065066
Vedantham S, Karellas A, Suryanarayanan S, Albagli D, Han S, Tkaczyk EJ, Landberg CE, Opsahl-Ong B, Granfors PR, Levis I, et al.: Full breast digital mammography with an amorphous silicon-based flat panel detector: physical characteristics of a clinical prototype. Med Phys 2000, 27: 558–567. 10.1118/1.598895
Jeffreys M, Warren R, Highnam R, Smith GD: Initial experiences of using an automated volumetric measure of breast density: the standard mammogram format. Br J Radiol 2006, 79: 378–382. 10.1259/bjr/24769358
Krouskop TA, Wheeler TM, Kallel F, Garra BS, Hall T: Elastic moduli of breast and prostate tissues under compression. Ultrason Imaging 1998, 20: 260–274.
Swann CA, Kopans DB, McCarthy KA, White G, Hall DA: Mammographic density and physical assessment of the breast. AJR Am J Roentgenol 1987, 148: 525–526.
Chow CK, Venzon D, Jones EC, Premkumar A, O'Shaughnessy J, Zujewski J: Effect of tamoxifen on mammographic density. Cancer Epidemiol Biomarkers Prev 2000, 9: 917–921.
Timoshenko SP, Woinowsky-Kreiger S: Theory of Plates and Shells. Second edition. Auckland: McGraw-Hill; 1959.
Huang SY, Boone JM, Yang K, Kwan AL, Packard NJ: The effect of skin thickness determined using breast CT on mammographic dosimetry. Med Phys 2008, 35: 1199–1206. 10.1118/1.2841938
The work was supported by NCI grant #R01 CA114491. The authors would like to thank Autumn Smallwood, Christy Smallwood, Gail Tiffenberg, and all of the staff at the Lifetime Cancer Screening Center at the Moffitt Cancer Center for their support of this experimental work. The authors also thank Drs. Elsie Gross, Ambuj Kumar, and Srinivas Nagaraj for reviewing the manuscript and providing invaluable suggestions that improved the presentation.
The authors declare that they have no competing interests.
JJH is responsible for the theoretical developments. JJH and JAT designed and constructed the surrogate system. JJH and JAT performed the experimental imaging. JJH and KC performed the data analyses and developed the computer code. JJH is the primary author. KC and JAT are secondary authors. All authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.