# A shape-optimized framework for kidney segmentation in ultrasound images using NLTV denoising and DRLSE

- Fan Yang†
^{1, 2, 3}, - Wenjian Qin†
^{1, 2}, - Yaoqin Xie
^{1, 2}, - Tiexiang Wen†
^{1, 2, 3}and - Jia Gu†
^{1, 2}Email author

**11**:82

https://doi.org/10.1186/1475-925X-11-82

© Yang et al.; licensee BioMed Central Ltd. 2012

**Received: **20 June 2012

**Accepted: **16 October 2012

**Published: **30 October 2012

## Abstract

### Background

Computer-assisted surgical navigation aims to provide surgeons with anatomical target localization and critical structure observation, where medical image processing methods such as segmentation, registration and visualization play a critical role. Percutaneous renal intervention plays an important role in several minimally-invasive surgeries of kidney, such as Percutaneous Nephrolithotomy (PCNL) and Radio-Frequency Ablation (RFA) of kidney tumors, which refers to a surgical procedure where access to a target inside the kidney by a needle puncture of the skin. Thus, kidney segmentation is a key step in developing any ultrasound-based computer-aided diagnosis systems for percutaneous renal intervention.

### Methods

In this paper, we proposed a novel framework for kidney segmentation of ultrasound (US) images combined with nonlocal total variation (NLTV) image denoising, distance regularized level set evolution (DRLSE) and shape prior. Firstly, a denoised US image was obtained by NLTV image denoising. Secondly, DRLSE was applied in the kidney segmentation to get binary image. In this case, black and white region represented the kidney and the background respectively. The last stage is that the shape prior was applied to get a shape with the smooth boundary from the kidney shape space, which was used to optimize the segmentation result of the second step. The alignment model was used occasionally to enlarge the shape space in order to increase segmentation accuracy. Experimental results on both synthetic images and US data are given to demonstrate the effectiveness and accuracy of the proposed algorithm.

### Results

We applied our segmentation framework on synthetic and real US images to demonstrate the better segmentation results of our method. From the qualitative results, the experiment results show that the segmentation results are much closer to the manual segmentations. The sensitivity (SN), specificity (SP) and positive predictive value (PPV) of our segmentation result can reach 95%, 96% and 91% respectively; As well as we compared our results with the edge-based level set and level set with shape prior method by means of the same quantitative index, such as SN, SP, PPV, which have corresponding values of 97%, 88%, 78% and 81%, 91%, 80% respectively.

### Conclusions

We have found NLTV denosing method is a good initial process for the ultrasound segmentation. This initial process can make us use simple segmentation method to get satisfied results. Furthermore, we can get the final segmentation results with smooth boundary by using the shape prior after the segmentation process. Every step enjoy simple energy model and every step in this framework is needed to keep a good robust and convergence property.

## Keywords

## Background

US imaging is widely used in the area of diagnosis, image-guided interventions and therapy. It has the advantages of real-time capabilities and low cost compared with X-ray, computed tomography (CT) and magnetic resonance imaging (MRI). The accurate segmentation of organs or objects from US image plays a key role in its applications. However, compared with other medical imaging modalities, such as CT and MRI, US image is particularly difficult to do segmentation since the quality of the image is relatively low [1]. There are characteristic artifacts such as attenuation, speckle, shadows, and signal dropout which makes the segmentation task be complicated [2]. Moreover, kidney segmentation of ultrasound image is rarely studied in certain fields, such as [1] regarding the noise as a part of texture feature,which has to be extracted in every segmentation process due to diverse clinical scenes.

Recent years, many people use level set to conduct the segmentation. Since level set was proposed by Osher and Sethian, many models have been embedded in level set functions such as global intensity statistics [3], texture models [1], shape prior models [4] and Markov random field models [5] to improve the result. However, complicated computation in the level set evolution result in longer computing time. Furthermore, some methods have some drawbacks in themselves, such as texture models [1], shape prior models [4] have not taken into consideration of the problem when the shape prior training set of is not enough for the segmentation.

In this paper, we presented a novel framework for kidney segmentation in ultrasound (US) images to counter aforementioned limitations. This framework was combined with NLTV image denoising, DRLSE
[6] and shape prior for initial process, segmentation process and post optimization process. In the initial process, in order to reduce the bad influence of noise, we used the effective NLTV image denoising to get an almost homogenous intensity gray scale image of kidney region. In the segmentation process, we used the DRLSE method to get the coarse segmentation results. DRLSE is a simple level set formulation to reduce the segmentation cost time. The main PDE equation contains two items. The first item detects object boundaries from image gradients. The second part is used to maintain the signed distance property | ∇ *Φ*| = 1. Thus, it needs not reinitialize the level set function (LSF).

However, the noise cannot be eliminated completely by use of the NLTV denoising method. Thus, the result of DRLSE method may also be influenced by the noise. Therefore, in the final post process, we used the shape prior to get a kidney shape space on the monochrome image produced by DRLSE to optimize the segmentations result of DRLSE.

Statistical modeling is one of the most used methods to model shape priors. Cootes et al. [7, 8] proposed an active shape model (ASM) which relies on the statistics of an object’s shape gathered from a training set of manually land-marked instances of the object. They developed a parametric point distribution model for describing the segmenting curve. And the automatic placement of landmarks was presented in [9] and [10]. Reference [11] proposed a novel technique for solving the crucial correspondence problem of automatic placement of landmarks using non-rigid image registration. But this representation does not contain any explicit information about the point connectivity and landmarks are often obtained manually, which is a time-consuming, error-prone and subjective tendency.

Chen et al., [12], represented shapes using a collection of points. They applied clustering method instead of statistical method to get the shape prior model which is the average shape of given curves with similar shape, but different size, orientation and translation. However, the similarity of shapes in this method is measured by area information which makes it highly time-consuming. Rousson and Paragious [13] showed a method to recover a segmentation map in accordance with the shape prior model as well as a rigid registration between the map and the model, which is based on global-to-local registration [14] and prior region statistical properties. The advantage of this method is accounting for local degrees of variability and local shape variations, but it consists of N*N variables, and thus, is unstable. Using the Gabor filter bank to characterize the prostate boundaries in a multiscale and multiorientation pattern, Shen et al. [15] proposed a statistical shape model for the automatic prostate segmentation in transrectal US images.

In this paper, we adopted the proposed method in [4] and [16] to construct the kidney shape space. Leventon et al. [4] proposed a segmentation method including two steps: initial segmentation and its correction based on a shape prior model. The model is obtained through a principle component analysis (PCA) operation on a collection of signed distance maps of the training shapes. In this framework, the boundaries are represented as the zero level set of a 2-D scalar function [4]. This representation is intrinsic (independent of parameterizations) and is topologically flexible since different topologies of the curve are represented by the constant topology of scalar function. This characteristics is valuable for realistic image-guided diagnosis. In addition, when the sensitivity (SN), specificity (SP) and positive predictive value (PPV) of the segmentation results were all under 90%, we used the alignment model [16] to avoid the situation of lacking enough training shape sample for the segmentation. As a result, the shape space became richer and richer during the segmentation process compared with methods in [4, 7, 17, 18] without applying the alignment model after the segmentation to enrich the shape space.

## Results

### Qualitative results

Our goal was to accurately and quickly delineate the boundaries of the kidney from the synthetic and US image in row (a) of Figure 2. We used four different segmentation algorithms to do comparison analysis. The segmentation results by manual method, DRLSE method, DRLSE method on NLTV denoised image, Bression’s method, Bression’s method on NLTV denoised image and our method were shown in rows (b), (c), (d), (g), (h), (e) in Figure 2 respectively. The manual delineations were got by the surgeon using ITK-SNAP software. From the results, we can see that NLTV denoising is essential in segmentation of US image, and DRLSE with shape prior is very effective. The segmentation results by our method were very close to the manual segmentation results.

The NLTV denoising is indispensable for ultrasound image processing. The initial process of images row (c) in the Figure 2 uses Gaussian low pass filter. Moreover, the initial process of images row (d) in Figure 2 uses the NLTV denoising. From the segmentation results, we can see that the NLTV dennoising method helps the DRLSE method get a better result. Because the NLTV denoising method deal with the every paths use the different weights, which makes the smooth path with homogeneity gray scale and the paths with large gradients to keep its boundary.

When using the level set with shape prior, it’s better to use the shape prior in the end of curve evolution. The initial process of the row (f) and (g) in Figure 2 use the NLTV denoising method. The row (f) use the common level set with shape prior method. The row (f) in Figure 2 uses the shape prior in the end of the curve evolution. We can see that using the shape prior in the end of curve evolution gives an accuracy result in the same initial process. Because use the shape prior in every level set iteration leads to the energy model getting convergence in the local minimum. Once we use the shape prior in the end of the level set evolution, both the level set process and the shape prior process gets its global minimum to make the level set with the shape prior get its global minimum.

In Figure 3, the experiment results show how we deal with the situation where the shapes in the shape space may be not enough. The segmentation results by manual method, DRLSE, DRLSE on NLTV denoised image, our method without using the AM after the segmentation and our method with using the AM after the segmentation were illustrated in rows (b), (c), (d), (e), (f). From Figure 3, it demonstrates that our method may not achieve the satisfied result occasionally, which is due to lack of sufficient kidney shapes from the shape space. Therefore the alignment model was used to absorb new shape to make the kidney shape space cover more shapes. Image (f) in Figure 3 is the segmentation result by our method with the initial shape prior, we can see that the segmentation results is not good. Because the kidney shape of image (a) in Figure 3 is not existing in the shape space. So we use the alignment model to absorb the new shape to get the new shape prior. Image (h) in Figure 3 shows the segmentation result with the new shape prior. After the alignment model had been applied, the accuracy of segmentation results were improved obviously.

It is worth mentioning that the total cost time is only around 10s(32-bit desktop PC:2.93 GHz Core 2 and 2GB RAM). The cost time of initial process, segmentation process and optimization process is 0.01s, 8s and 2s respectively, while it needs 5 min cost time of the level set with shape method in [17] and 40s cost time in [1].

### Quantitative results

For each image, the set of pixels lying within the manual delineations of kidney was denoted as *M*. The set of pixels lying within any boundary resulting from any method was denoted as *A*. We used SN, SP and positive PPV as the quantitative metrics of the segmentation results. SN, SP and PPV were defined as:

*W*denoted the whole image, | · | denoted the point set of the segmentation result. We computed SN, SP and PPV for each image as well as the average and standard deviation over the experiment data set. Higher value of these metrics means that the segmentation results is closer to the manual segmentations. The statistics result is shown in Table 1. The difference of SN, SP and PPV between using and not using the alignment model in the segmentation process is shown in Table 2. From the Table 1 and Table 2, we can see that the better result we get in the Qualitative result region, the higher number we get in these two tables.

**Quantitative evaluation of segmentation results for the five methods**

Method | SN | SP | PPV |
---|---|---|---|

| 0.97±0.02 | 0.88±0.02 | 0.78±0.08 |

| 0.99±0.01 | 0.94±0.02 | 0.89±0.04 |

Bression’s method | 0.81±0.10 | 0.91±0.06 | 0.80±0.07 |

Bression’s method with NLTV | 0.87±0.04 | 0.93±0.05 | 0.87±0.07 |

Our method | 0.96±0.02 | 0.95±0.02 | 0.91±0.04 |

**Quantitative evaluation of segmentation results for four methods**

Method | SN | SP | PPV |
---|---|---|---|

| 0.95 | 0.87 | 0.81 |

| 0.94 | 0.93 | 0.88 |

Our method without AM | 0.89 | 0.87 | 0.79 |

Our method with AM | 0.92 | 0.96 | 0.95 |

Compared with the original DRLSE method, our method makes an increasement by 1%, 7% and 13% on the SN, SP and PPV value respectively. And compared with the original level set with shape priors, out method makes an increasement by 15%, 1% and 14% on the SN, SP and PPV value respectively. The experiment results show our framework is the best.

With the help of the NLTV denoising method, the original DRLSE method get an increasement by 2%, 6% and 11% on the SN, SP and PPV value respectively, the original level set with the shape prior get an increasement by 6%, 2% and 7% on the SN, SP and PPV value respectively. From experiment results, we can see that the NLTV denoising method is important for the segmentation of ultrasound images.

When using the same initial process of the NLTV denoising method, out method still obtains an increasement by 9%, 1% and 4% on the SN, SP and PPV value respectively. This shows that our way of using shape prior is better than the common method.

Although we can get a promising result by integrating the shape prior into level set,our segmentation method sometimes does not get a satisfied result due to the lack of enough shapes in the shape space. While we use the alignment model to generate all kinds of shape, we can get an increasement by 3%, 9% and 16% on the SN, SP and PPV value.

## Conclusions

In this paper, we have presented a novel framework in the kidney segmentation of US images, dedicated to kidney surgery. NLTV denosing method, DRLSE and shape prior were combined in our framework. In order to validate our framework, we have quantitatively and qualitatively compared our method with other methods in literature. Disregarded the fact of having three steps in the framework, every step celebrated light computational burden and avoided the negative impact such as speckle noise and long computational time caused by the complicated iteration in the methods such as Bression’s. Furthermore, each step has shown to be able to improve the segmentation results effectively. Many methods have tried to reduce the noise influence by introduce more priors. However, from our experiments, we can see that simple level set method with preprocessing of NLTV denosing gets a good result. In addition, better results with smooth boundary can be available using the shape prior. Moreover, with the application of alignment model, our shape space has been effectively enlarged for better result. In our experiments, we can see that it’s better to use the shape prior in the end of the level set evolution to avoid the local minimum caused by the complicated.

## Methods

### Initial process: NLTV denoising

*x*,

*y*) ∈

*Ω*×

*Ω*as:

*x*,

*y*(

*w*defines a graph), function

*w*is defined as:

Where *f*(*x*, *y*, *z*) = ∫ _{
Ω
}
*G*
_{
a
}(*z*)|*u*
_{0}(*x* + *z*) − *u*
_{0}(*y* + *z*)|^{2}
*dz* is the distance between the points *x* and *y*, *G*
_{
a
} is a Gaussian function with standard deviation *a*, and *h* is a Positive constant which acts as a scale parameter. The NL/G gradient is to give a weighted value between two points *x*, *y* to indicate whether the patches around point *x* and patches around point *y* are the same.

_{ Ω }| ∇

_{NL}

*u*|. The whole image denoising model is written as follows:

where *u*, *u*
_{0}:
$\Omega \to \mathbb{R}$, ∇ _{NL}
*u*:
$\Omega \times \Omega \to \mathbb{R}$, and *u*
_{0} is the given noisy image and *λ* is a positive constant that controls the trade-off Between the regularization process and the fidelity with respect to the original image.

In order to get the denoised image *u*
_{
opt
}, we have to minimize Equation (2). Here, Zhang-Burger-Bression- Osher’s model
[21] was used to solve the problem of NLTV minimization. More details can be referred to
[21]. This initial process helped us to acquire an image with the kidney region in almost homogenous intensity gray scale. This is very beneficial for the following segmentation.

### Segmentation process: DRLSE

In level set method, the contour concerned is embedded as the zero level set of a level set function (LSF). During the evolution of level set, LSF may not be smooth. It may become steep or flat which destroy the unique property | ∇ *ϕ*| = 1. As a result reinitialization is in need. The most popular method is proposed in
[22]. In this section, we used a level set formulation that has an intrinsic mechanism of maintaining this desirable property of the LSF. This formulation is called DRLSE proposed by Chunming Li
[6].

*I*be an image on a domain

*Ω*, edge indicator function

*g*is defined by

Where *G*
_{
σ
} is a Gaussian kernel with a standard deviation *σ*. The convolution in (3) is used to smooth the image to reduce the surplus noise. This function *g* usually takes smaller values at object boundaries than at other locations.

*ε*(

*ϕ*) is defined by

*μ*> 0

*λ*>0 and $\alpha \in \mathbb{R}$ are the coefficients of the energy functionals

*R*(

*ϕ*),

*L*(

*ϕ*) and

*A*(

*ϕ*) which are defined by

*δ*and

*H*are the Dirac delta function and the Heaviside function, respectively. This energy functional (4) can be minimized by solving the Following gradient flow:

Given an initial LSF *ϕ*(*x*, 0) = *ϕ*
_{0}(*x*), the first term on the right side in (8) is associated with the distance regularization energy *R*(*ϕ*) (5) which keeps the unique property LSF | ∇ *ϕ*| = 1. The second is associated with the energy terms *L*(*ϕ*) (6) which is minimized when the zero contour of *ϕ* is located at the object boundaries, while the third term is related to *A*(*ϕ*) (7) which is introduced to speed up the motion of the zero level contour in the level set evolution process. The segmentation process produced a binary image with the black and white region representing the kidney and background respectively.

### Post process: shape prior optimization

*ϕ*

_{ j }}:

where x_{pca} is called the coefficients of eigen vector,
$\widehat{\varphi}$ is a shape formed from the shape space,
$\overline{\varphi}$ is the mean shape of the shape space, and matrix W_{
p
} contains *p* principal components of the shape space.

*F*

_{shape}is based on the fixed contour

*C*, the vector x

_{pca}of PCA eigen coefficients and the vector x

_{ T }of geometric transformations. This functional evaluates the shape difference between the contour

*C*after the geometric transformations

*x*

_{ T }and the zero level set $\widehat{C}$ of the shape function $\widehat{\varphi}$ provided by the PCA. The function ${\widehat{\varphi}}^{2}$ at the point

*C*(

*q*) is:

where | · | stands for the Euclidean norm.

*F*

_{shape}with respect to the vector of eigen coefficients x

_{pca}is :

and *φ* is a constant representing the signed distance function of the shape in the binary image.

*F*

_{shape}with respect to the vector of geometric transformation x

_{ T }is:

For one iteration, the computation order is in the following order: (10), (9).

### Compare with other methods

*ϕ*| = 1. In the every iteration, they find the interesting shape.

*ϕ*| = 1. When the contour evolution is terminated, we find the interesting shape from the shape. Compared with the other method in [1, 3, 4, 16–18], we do not need to search the interesting shape in every iteration and reinitialize the level set function. Thus, we save a lot of computation time.

Compared with the original level set or DRLSE methods, we use the shape prior to optimize the segmentation result. Because the ultrasound image is low quality with speckle noise, the original level set or DRLSE method usually gets the segmentation result with indent boundary. In our framework, we use the shape prior to get a satisfied segmentation result with smooth boundary.

Due to the diversity in patients’ kidney shapes, the training sets used in the construction of the kidney shape space may not be enough. In many level sets with shape prior method [4, 16–18], they had only used the similar model like the alignment model in the process of shape space construction. Once the shape space was established, it remained unchanged. However, in many situations, we cannot cover all kinds of shape from the initial data set. Hence the alignment model should be used to generate a new shape in order to give more kind of the shape space after the segmentation. In our process, when the SN, SP and PPV of the segmentation results all fell under 90%, we used the alignment model (AM) proposed in [18] to reconstruct the shape space and the new procduced shape was done by manual segmentations.

## Notes

## Declarations

### Acknowledgments

This work is supported in part by grants from National Natural Science Foundation of China for the Youth NSFC:61103165), National Natural Science Foundation of China (NSFC:81171402), Shenzhen Outstanding Scholar Project grant No. JC201005260248A), NSFC Joint Research Fund for Overseas Research Chinese, Hong Kong and Macao Young Scholars (30928030), National Basic Research Program 973 (2010CB732606) from Ministry of Science and Technology, China.

## Authors’ Affiliations

## References

- Xie J, Jiang Y, Tsui HT:
**Segmentation of kidney from ultrasound images based on texture and shape priors.***IEEE Transaction on Medical Imaging*2005,**24:**1.View ArticleGoogle Scholar - Noble JA, Boukerroui D:
**Ultrasound image segmentation: a survey.***IEEE Transaction on Medical Imaging*2006,**25:**8.View ArticleGoogle Scholar - Yezzi A Jr, Tsai A, Willsky A:
*A statistical approach to snakes for bimodal and trimodal imagery*. Kerkyra: Proceedings of IEEE International Conference on Computer Vision; 1999.View ArticleGoogle Scholar - Leventon M, Grimson E, Faugeras O:
*Statistical Shape Influence in Geodesic Active Contours*. Hilton Head: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition; 2000.View ArticleGoogle Scholar - Medical Image Computing and Computer-Assisted Intervention - MICCAI:
*13th International Conference, Beijing, China, September 20–24, 2010, Proceedings, Part III. Lecture Notes in Computer Science 6363*. Beijing: Springer; 2010.Google Scholar - Li C, Xu C, Gui C, Fox MD:
**Distance regularized level set evolution and its application to image segmentation.***IEEE Transaction on Image Processing*2010,**19:**12.MathSciNetView ArticleGoogle Scholar - Cootes T, Taylor C, Cooper D, Graham J:
**Active shape models-their training and application.***Comput Vis Image Understanding*1995,**61:**38–59. 10.1006/cviu.1995.1004View ArticleGoogle Scholar - Cootes T, Beeston C, Edwards G, Taylor C:
*“A unified framework for atlas matching using active appearance models”, in Proc*. Imaging: sInt. Conf. on Image Processing in Medical; 1999:322–333.Google Scholar - Davies R, Twining C, Cootes T, Waterton J, Taylor C:
**A minimum description length approach to statistical shape modeling.***IEEE Trans Med Imag*May 2002,**21**(5):525–537. 10.1109/TMI.2002.1009388View ArticleMATHGoogle Scholar - Thodberg HH, Olafsdottir H:
*“Adding curvature to minimum description length shape models”, in Proc*. Sep: Br. Machine Vision Conf; 2003.View ArticleGoogle Scholar - Spiegel M, Hahn D, Daum V,
*et al*.:**Segmentation of kidneys using a new active shape model generation technique based on non-rigid image registration.***Comput Med Imaging Graph*2009,**33:**29–39. 10.1016/j.compmedimag.2008.10.002View ArticleGoogle Scholar - Chen Y, Thiruvenkadam H, Wilson D,
*et al*.:*“On the incorporation of shape priors into geometric active contours”, in Proc*. Vancouver, BC, Canada, Jul: IEEE Int. Workshop Variational and Level Set Methods; 2001:145–152.Google Scholar - Rousson M, Paragios N:
*“Shape prior for level set representation”, in Proc*. Kobe, Japan, Oct: IEEE Int. Conf. on Image Processing; 1999:188–192.Google Scholar - Paragios N, Rousson M:
*“Matching distance functions: a shape-to-area variational approach for global-to-local registration”, in Proc*. Copenhagen, Denmark, May: European Conf. on Computer Vision; 2002:78–92.MATHGoogle Scholar - Shen D, Zhang Y, Davatzikos C:
**Segmentation of prostate boundaries from ultrasound images using statistical shape model.***IEEE Trans Med Imag*Apr. 2003,**22**(4):539–551. 10.1109/TMI.2003.809057View ArticleGoogle Scholar - Tsai A, Yezzi A, Wells W, Tempany C, Tucker D, Fan A, Grimson WE, Willsky A:
*Model-based curve evolution technique for image segmentation*. Kauai: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition; 2001.View ArticleGoogle Scholar - Bresson X, Vandergheynst P, Thiran JP:
**A variational model for object segmentation using boundary information and shape prior driven by the Mumford-Shah functional.***International Journal of Computer Vision*2006,**68:**2.View ArticleGoogle Scholar - Tsai A, Yezzi A, Wells W, Tempany C, Tucker D, Fan A, Grimson WE, Willsky A:
**A shape-based approach to the segmentation of medical imagery using level sets.***IEEE Transaction on Medical Imaging*2003,**22:**2. 10.1109/TMI.2003.809707View ArticleGoogle Scholar - D. Zhou, B. Scholkopf, A regularization framework for learning from graph data, Proceedings of International Conference on Machine Learning Workshop on Statistical Relational Learning and Its Connections to Other Fields:
*July 4–8; Banff*. Canada: Alberta; 2004.Google Scholar - Gilboa G, Osher S:
**Nonlocal operators with applications to image processing.***Proceedings of Society for Industrial and Applied Mathematics Multiscale Modeling and Simulation*2007,**6:**2.MathSciNetMATHGoogle Scholar - Bresson X:
*A Short Note for Nonlocal TV Minimization technical report*. 2009.Google Scholar - Adalsteinsson D, Sethian JA:
**A fast level set method for propagating interfaces.***J Comput Phys*1995.**118.**Google Scholar

## Copyright

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.