The CMR images used in our experiment were provided by the Department of Cardiology in Stavanger University Hospital and were obtained from 44 patients with myocardial infarction. 24 patients had old MI and ICD was implanted in all patients as primary- or secondary prophylaxis. This group was defined as the high risk arrhythmic group. The remaining 20 patients had MI one year prior to CMR imaging and during this observation period, no reported incidents of serious arrhythmias occurred. This group was defined as the low risk arrhythmic group. The study was approved by the Regional Ethics Committee and informed consent was obtained from all patients.
All the CMR images were obtained from 1.5 Tesla Philips Intera machine. Images were acquired with a pixel size of 0.82 × 0.82 mm2, covering the whole ventricle with short-axis slices of 10 m thickness, without inter-slice gap. The LG enhanced CMR images were stored according to the digital imaging and communications in medicine (DICOM) format with 512 × 512 pixel resolution. The number of image slices with visible scar in each patient varies depending on the size of the scar and the scar size varies from one slice to the other. Only short-axis image slices were used in our experiments. Out of the total 444 CMR images belonging to the 44 patients, 11 CMR images of poor quality due to motion and artifacts were not included. An experienced cardiologist delineated the borders of the myocardium and the infarction tissues as shown in Figure 1. The resulting set of segmented images were further split into labeled training and test sets. All our experiments were carried out in MATLAB. The CMR training and test images were cropped from 512×512 original CMR image so that they contain only left ventricle with myocardium and infarction areas to save execution time. Preprocessing of any kind was not used on these cropped images.
In all the CMR images, we took into account only the myocardium segmented by the cardiologists. Patients with high risk of getting arrhythmia have profound scars with possibly large gray-zone areas. We therefore anticipated that images from these patients contained the information we wanted to represent in the dictionaries: information from the myocardium, the core area of scars, and the gray-zone areas. Hence, the high risk patients group was used in the training phase. From the group of 24 high risk patients, 6 patients were randomly selected to form a training set, whereas the remaining 18 high risk patients were used to form a test set. Two sets of training vectors were generated, one from the scar, and one from the healthy myocardium.
This section is organized as follows: Sections ‘Probability maps obtained using DC feature, dc(i,j)’ and ‘Probability maps obtained using texture feature, R
p
(i,j)’ explains the processes of calculating the probability maps using DC and texture feature respectively. Section ‘Examples of probability maps obtained from DC and texture feature’ discusses the examples of probability maps obtained in our experiments. Section ‘A possible application - cardiac segments’ illustrates how probability maps obtained with texture feature can be used to define cardiac segments. Section ‘The discriminatory power of the DC and texture features’ considers the discriminative power of the DC and texture features.
Probability maps obtained using DC feature, dc(i,j)
The neighborhood size 3×3 was used to form training vectors as explained in ‘DC Feature’. The same neighborhood size must be used while training and finding the DC images I
dc
. The DC images obtained from the training images were used to form the training feature set. The parameters of the class specific PDFs: the mean and the standard deviation were found using ML estimation using the training feature set. DC-values were scaled to have zero mean and unit variance before finding the ML estimates. The scaling coefficients from the training were stored to scale the test vectors. The probability maps of the scarred myocardium were calculated using Bayes rule as explained in section ‘Calculation of probability maps using Bayes rule’. An example of a probability map obtained using the DC feature with color code is shown in Figure 2(b). Note the sharp transition from the segmented scar to the myocardium.
Probability maps obtained using texture feature, R
p
(i,j)
The training vector and test vectors were generated in the same way as in the DC feature experiment using the same neighborhood size 3×3. Consider the pixels on the border zones, their neighborhood extends into other regions that are not under consideration. If we use training vectors from border regions, the dictionaries might learn the texture properties of other regions along with the texture properties they are intended to learn. So, the training vectors for the pixels whose neighborhood span other regions were not considered in our experiments. This is depicted in Figure 4.
The dictionaries were learned on the training vectors generated from scar and healthy myocardium using RLS-DLA. The dictionary size of 9 × 90 atoms was used in our experiments. The initial dictionaries were formed by randomly selecting 90 vectors of length 9 from the training sets. Dictionary size was selected based on previous experience, and it should be K > N < < L. The forgetting factor is initialized to 0.995 and slowly increased towards 1 according to the exponential method described in [17]. If the number of dictionary atoms used to represent the image patch increases, then the residual decreases, and a large number of atoms will provide a good approximation with any full rank dictionary. Therefore, higher sparsity lowers the difference between the residuals of healthy myocardium and scar which in turn decreases the differentiation between the classes. The sparsity s used in our work is two, i.e. we used two vectors from the dictionary to represent the original image patch.
After learning the dictionaries, they were used to obtain the residual images. For each pixel in the myocardium of training images, the scaled residual R
p
(i,j) was found from residual images as explained in section ‘Dictionary-based textural feature’. The training feature set was then generated from these scaled residual values. The parameters of the PDFs to be used with the classifier was estimated from the labeled training vector set using ML estimation. The test vector set in this experiment was calculated after smoothing (using low pass Gaussian filter with σ = 5 and 9 × 9 window size) the test residual images. Using the ML estimates and Bayes rule, the probability maps for texture features of the scarred myocardium were obtained. An example of such a probability map with color code is shown in Figure 2(c). The color code used is explained under Figure 2. Here, the dictionaries were learned without the removal of the DC value in each image patch.
Dictionary learning was performed with and without the removal of the DC factor from the image patch of the training set. In order to illustrate some of the texture captured by the dictionary atoms, they were plotted as image patches in Figure 5. The difference between the scar and the healthy myocardium dictionaries was visible when the DC value was not removed, and this is according to the known intensity difference between scar and healthy myocardium. To investigate the variation in the dictionary atoms used as image patches, we had calculated a measure from the first derivative of dictionary atoms in both vertical and horizontal directions. The absolute mean of the first derivative of the dictionary atoms was used to compare the scar and healthy myocardium dictionaries learned with and without DC value. The calculated absolute means are shown under Figure 5. The difference between the scar and healthy dictionaries without DC removal was observed clearly, both from the depicted dictionaries and the absolute means. The difference between the scar and healthy myocardium dictionaries trained by removing the DC value from image patches could not be observed visually. A larger block for DC removal prior to textural capturing is probably more appropriate, more similar to suppression of difference in illuminance conditions in computer vision.
Figure 6 shows probability maps of some example CMR images and all of them were calculated using the texture feature, R
p
(i,j). In the middle column, the DC value was not removed from each image patch before training dictionaries and calculating the probabilities. In the last column, the DC value was removed from every image patch both before learning dictionaries and before finding probabilities. The DC value is obviously important, so we do not expect this experiment to provide competitive results, but we see a tendency of higher values in the scar and border areas, but not necessarily in the core of the scar. We expect the scar core to be very bright and could be captured easily by some sort of intensity based measure (for example the DC feature). If the physical nature of the tissue is reflected in the image texture, we can expect the scar core to be quite homogeneous. Thus, we are not surprised that the regions that are the brightest in column II were actually quite dark in column III (core), but the intermediate regions were captured fairly. We have continued with the experiments in this work by letting the dictionaries incorporate the DC value as we did not combine the DC feature, dc and texture feature, R
p
into a 2D dimensional feature for computing the probability maps. Unless it is mentioned under figures of probability maps, all the probability maps of texture feature, R
p
were computed with dictionaries learned without removing DC from every image patch.
Examples of probability maps obtained from DC and texture feature
The second and the third column of Figure 7, shows more examples of probability maps of scarred myocardium using the DC, dc(i,j), and texture, R
p
(i,j), features, respectively. Figure 8 shows examples of probability maps of scarred myocardium for both DC and texture feature for the patients without ICD. This shows that the PDFs estimated for the DC and texture features from the patient group implanted with ICD were also able to produce probability maps for low risk arrhythmia patients (i.e. patients without ICD). Both the features were able to identify the healthy myocardium without scars, and an example of this is shown in Figure 9. In order to compare the probability maps obtained by the two methods, they are plotted with the same color map. Experiments of probability mapping combining the DC feature and texture feature gave almost identical probability maps as the DC feature because of the dominance of the DC feature over the texture feature. Improved ways of combining both the features will be explored in our future work.
A possible application - cardiac segments
A cardiac segment, CS can be defined as a region of interest in the myocardium which might have diagnostic importance. We define a candidate cardiac segment by a lower and upper probability so that the segment consists of all pixels with probability values in the range between these. Different candidate cardiac segments might be evaluated, for example, calculating one or several features from the candidate segment and correlate result to a clinical meaningful hypothesis. We demonstrate the concept by making a comprehensive search for significant cardiac segments by comparing two patient groups and testing for statistical significance. The demonstration experiment is meant to illustrate how the probability mapping can be used to localize cardiac segments containing information discriminating between the high and low risk patient groups.
For each patient, cardiac segments were determined by a lower boundary L and an upper boundary U. These boundaries were varied in the range 0–1 in steps of 0.025 with the restriction that U - L ≥ 0.1. Figure 10(a) illustrates this by visualizing each of the resulting candidate cardiac segment as a black dot with coordinate (L,U). For example, the three yellow dots in coordinates (0.15,0.3), (0.3,0.7), and (0.7,0.825) correspond to the cardiac segments CS1 = 0.15 - 0.3, CS2 = 0.3 - 0.7, and CS3 = 0.7 - 0.825, respectively. Figure 10(b) and (c) show one of the original image slices and the corresponding probability mapping (texture feature) for one of the high risk patients. The probability maps, which do not have 0 - 1 probability range, was extended using a sigmoid function for easy comparability. Figure 10(d–f) shows the cardiac segments corresponding to CS1, CS2 and CS3 for the image slice shown in Figure 10(b).
Preliminary experiments indicate that corresponding segments determined without using the probability mappings depends on the signal intensity values and seems to lack the ability to define contiguous areas in the same way as the probability mapping for some of the cardiac segments. For example, the cardiac segment determined directly from thresholding relative to the maximum 3D scar signal intensity is shown in Figure 10(g–i) for boundaries corresponding to CS1CS2 and CS3, respectively. These plots are comparable to the corresponding cardiac segments based on the probability mapping in Figure 10(d–f), and show less contiguous cardiac segment, especially CS1. Figure 11 shows similar plots of the cardiac segments corresponding to CS1, CS2 and CS3 for one of the low risk patients.
For each candidate cardiac segment, its relative size (accumulated for all slices) to the total myocardium was calculated. The computed relative sizes for high and low risk patient groups were compared using a Mann-Whithney test at significance level 0.05. The LU-plots in Figure 12 show the p-values for the test of all the candidate cardiac segments derived using the texture based (a) and DC based (b) probability map and those derived without the use of probability mapping (c). The blue colored dots indicate cardiac segments for which the high and low risk groups showed significant differences of the relative size.
Interpreting the comparison results for the texture feature probability map based cardiac segments in Figure 12 (a), the significant CS with L = 0 and U > 0.5 correspond to the cardiac segment containing all of the healthy myocardium and gradually including the scar area. More interesting is the localization in the upper right corner which makes more specific localization in the myocardial tissue in the regions close to the scar. Note that CS3 is identified as significant while CS1 and CS2 are not. Figure 12 (b) shows the LU-plot from the probability maps of DC feature and, the plot shows that cardiac segments does not give any localization. This is due to the narrow transition between high and low probabilities of the DC based probability maps. The DC probability maps will predominantly have extreme values close to 0 or 1 and few values in between 0–1. This provides little information to explore. These plots (Figure 12 (a) and Figure 12 (b)) of cardiac segments clearly shows that the probability maps obtained from the DC features reflect the way cardiologist perceive the scarred myocardium in CMR images and, the probability maps from texture features gives information about the characteristics of myocardial tissue and might emphasize information that is not easy for inspection with a human eye in the original CMR image. The difference between the narrow and broad transitions for the DC- and texture-based probability mappings might explain this. As the narrow transition gives little information besides the crisp core/scar discrimination corresponding to the cardiologist’s delineation based on the visual information. Thus, most of the candidate cardiac segments will be empty as there are virtually no intermediate probability values. This is not the case for the texture based probability mappings, which discloses the information present in this intermediate probability range.
In comparison the results for the cardiac segments derived directly without using the probability plots (Figure 12 (c)), the localization seems vaguer, not specifically identifying any coherent region in the myocardium. It is our belief that probability mapping might be used to identify cardiac segments that might be used in future studies.
The discriminatory power of the DC and texture features
The probability maps calculated for the two features, DC and textural feature, give different information. The discriminatory power of the two features were explored by comparing to the manual segmentation of scar and healthy myocardium to obtain Receiver Operating Characteristic (ROC) curves. The ROC curves calculated for the DC and texture features on the CMR images of the 18 test patients with high risk of getting arrhythmia are shown in Figure 13. The area under the average ROC curves is 0.9052 and 0.8428 for the DC and texture features, respectively. From ROC curves, it is observed that the discriminative power of the DC feature is comparatively higher than that of the texture features. The sensitivity, and specificity reported in [7] are plotted as a single point in Figure 13. This reported result is not calculated on our CMR data. This single point lies below within 95% confidence interval of DC feature. This reported segmentation result performed well compared to our average DC and texture ROC curves as Dikici et al.[7] does not include all the CMRI slices of a patient for training and testing where as in our work we include all the CMRI slices i.e. the scar volume.