Skip to main content

Automatic construction of statistical shape models using deformable simplex meshes with vector field convolution energy

Abstract

Background

In the active shape model framework, principal component analysis (PCA) based statistical shape models (SSMs) are widely employed to incorporate high-level a priori shape knowledge of the structure to be segmented to achieve robustness. A crucial component of building SSMs is to establish shape correspondence between all training shapes, which is a very challenging task, especially in three dimensions.

Methods

We propose a novel mesh-to-volume registration based shape correspondence establishment method to improve the accuracy and reduce the computational cost. Specifically, we present a greedy algorithm based deformable simplex mesh that uses vector field convolution as the external energy. Furthermore, we develop an automatic shape initialization method by using a Gaussian mixture model based registration algorithm, to derive an initial shape that has high overlap with the object of interest, such that the deformable models can then evolve more locally. We apply the proposed deformable surface model to the application of femur statistical shape model construction to illustrate its accuracy and efficiency.

Results

Extensive experiments on ten femur CT scans show that the quality of the constructed femur shape models via the proposed method is much better than that of the classical spherical harmonics (SPHARM) method. Moreover, the proposed method achieves much higher computational efficiency than the SPHARM method.

Conclusions

The experimental results suggest that our method can be employed for effective statistical shape model construction.

Introduction

Medical image segmentation is a prerequisite for various clinical applications, such as medical diagnosis, treatment planning and image-guided surgery. In the last few decades, numerous medical image segmentation methods have been proposed [1]. Among all these proposed methods, active shape model (ASM) method [2] has achieved state-of-the-art segmentation accuracy [3]. ASM uses principal component analysis (PCA) [4] based statistical shape models (SSMs) to incorporate high-level a priori shape knowledge of the structure to be segmented to achieve robustness. SSMs can capture the range of shape variability for the structure to be segmented based on a set of training samples. A crucial component of building SSMs is to establish shape correspondence between all training shapes, which is a very challenging task, especially in three dimensions. Also, this component to a great extent determines the final performance of the learned shape prior models. Various methods for establishing shape correspondence have been proposed in the literature, and these methods can be roughly divided into five major categories [3]: mesh-to-mesh registration [5, 6], mesh-to-volume registration [11], volume-to-volume registration [7], parameterization-to-parameterization registration [8], and population-based optimization [9]. Alternatively, these methods can be classified as pairwise correspondence methods [10] and groupwise methods [11]. In particular, Gollmer et al. [12] presented a thorough evaluation of different groupwise correspondence methods, including their objective functions, re-parameterization schemes, and optimization strategies. However, a major limitation of current shape correspondence establishment methods, such as the spherical harmonics (SPHARM) method [8], is that they are very computationally intensive and the structure to be modeled has to be of the same topology as the sphere (i.e., of zero genus).

Deformable models [13] have been quite popular in medical image analysis, particularly in image segmentation [14]. Deformable models are contours or meshes that evolve under the constraints of both internal and external energy to segment the object of interest. In this paper, to overcome the above-mentioned limitations of current shape correspondence establishment methods, we propose an automatic deformable surface model based shape correspondence establishment method to improve the accuracy and reduce the computational cost. Specifically, we develop a greedy algorithm based deformable simplex mesh that uses vector field convolution (VFC) as the external energy. Simplex meshes [15] are efficient and versatile surface representation for physics-based deformable models. These models are formulated based on the concept of force equilibrium, which includes internal and external forces, rather than the original energy minimization framework [13]. The greedy algorithm that minimizes a variational energy functional was introduced in [16] as an alternative to the physics-based method for deformable model evolution. It has been shown that the greedy algorithm outperformed physics-based method both in computation cost and segmentation accuracy [17]. Therefore, we adapt the greedy algorithm to perform the evolution of the deformable simplex meshes. VFC field [18] is a widely used static external force for physics-based deformable models that can guide the active contour into long and thin boundary. Furthermore, in comparison with the classical gradient vector flow (GVF) external force [1921], VFC force shows superior robustness to noise and initialization, much less computational cost (3–10 times less), and changing flexibility [18]. Also, GVF force in homogeneous regions of the image has demonstrated to be a special case of VFC force under certain condition [18]. In order to overcome the main issues of traditional external energy (i.e., sensitivity to shape initialization and poor convergence to the long and thin boundary concavities), we adapt the VFC field for greedy algorithm as external energy. Therefore, our proposed method integrates the computational simplicity of simplex meshes, speed of the greedy algorithm, and robustness of VFC in a unified system.

The deformable models are known to be sensitive to shape initialization [19]. In order to get accurate and robust results, we develop an automatic shape initialization method to derive an initial shape that has high overlap with the object of interest, such that the deformable models can then evolve more locally. We apply our proposed deformable surface model to the application of femur statistical shape model construction to illustrate its accuracy and efficiency. This paper significantly extends our preliminary conference paper on GVF-based deformable simplex meshes [21] by (1) introducing a new VFC external energy, (2) developing an automatic shape initialization method, and (3) applying our method to establish shape correspondence and construct statistical shape models.

Related work

In this section, we briefly review the related mesh-to-volume registration based methods for establishing shape correspondence.

In mesh-to-volume registration based methods, a landmarked deformable surface model is fitted to all the segmented training images. After the evolution of deformable model has converged, the final landmark locations of the deformable template determine the point correspondence [3]. Kaus et al. [22] presented a 3-D elastically deformable model for the automated establishment of shape correspondence from segmented images, where a triangulated deformable template was adopted to segment binary volumetric images of the remaining training samples. Their method can approximate and predict unseen shapes well. Shang and Dossel [20] then extended the above-mentioned method by adopting the force equilibrium concept of deformable surface model, which included the internal forces and the external GVF image forces [19]. Compared with traditional simple gradient image forces, their use of GVF forces greatly improved the accuracy of shape correspondence. Shen et al. [23] proposed an adaptive-focus deformable model (AFDM) to establish shape correspondence based on the hand-labeled 3-D brain images. They introduced an attribute vector for each landmark of the deformable template to preserve its geometric shape while evolution. However, there exit failure cases where the deformable template has to be manually pulled towards the boundary. Later, Zhao and Teoh [24] extended the AFDM method by developing a “bridge over” framework and used it to construct SSMs of brain ventricles. Compared with the original AFDM method, their new method achieved more accurate shape representation, but at the cost of more computation time. Clogenson et al. [25] proposed a model-fitting based approach to establish shape correspondence from manually segmented CT scans. An initial Procrustes alignment between the training data and the reference surface mesh is performed, followed by a non-rigid registration between the reference surface mesh and distance maps of the training label maps. The non-rigid registration is formulated as a model-fitting problem, where a Gaussian Process prior is employed to model smooth deformations of the reference surface mesh.

Background

In this section, we briefly show the main geometry of simplex meshes [21], the reader is referred to [15] for the detailed definition. These geometry information, especially the metric parameters and simplex angle, will be used to define the internal energy of our proposed deformable simplex meshes.

A k-simplex mesh is a polygonal mesh where each vertex has precisely \((k+1)\) neighbors. In this study, we will only consider 2-simplex meshes defined in \(\mathbb {R}^3\), which are also the dual of the triangular meshes (see Fig. 1). Each vertex \(P_i\) is linked to exactly 3 distinct vertices \((P_{N_1(i)},P_{N_2(i)},P_{N_3(i)})\), which form the tangent plane at \(P_i\) with its normal vector \(\mathbf {n}_i\). Local geometry around a vertex \(P_i\) involves the circumscribed circle \(S_1\) with center \(C_i\) and radius \(r_i\) of the triangle \((P_{N_1(i)},P_{N_2(i)},P_{N_3(i)})\), and the circumscribed sphere \(S_2\) with center \(O_i\) and radius \(R_i\) of the tetrahedron \((P_i,P_{N_1(i)},P_{N_2(i)},P_{N_3(i)})\) (please refer to Fig. 8 of [15]).

Fig. 1
figure 1

Diagram showing the duality between 2-simplex meshes (black lines) and triangular meshes (red lines)

The metric parameters \((\epsilon _{1i},\epsilon _{2i},\epsilon _{3i})\) at a vertex \(P_i\) are the barycentric coordinates of the orthogonal projection \(F_i\) of \(P_i\) onto the tangent plane with respect to the triangle \((P_{N_1(i)},P_{N_2(i)},P_{N_3(i)})\):

$$F_i = \epsilon _{1i}P_{N_1(i)}+\epsilon _{2i}P_{N_2(i)}+\epsilon _{3i}P_{N_3(i)},$$
(1)
$$\epsilon _{1i}+\epsilon _{2i}+\epsilon _{3i}=1.$$
(2)

The simplex angle \(\phi _i \in {[-\pi , \pi ]}\) at \(P_i\) is defined as:

$$\cos {(\phi _i)}= \frac{ \left\| C_i-O_i\right\| }{ R_i } \text { sign} \left( (C_i-O_i) \cdot \mathbf {n}_i \right) .$$
(3)

The height \(L(r_i,d_i,\phi _i)\) of \(P_i\) with respect to the tangent plane is defined as:

$$L(r_i,d_i,\phi _i) = \frac{ (r^2-d^2)\tan (\phi _i) }{ \epsilon \sqrt{ (r^2-d^2) \tan ^2(\phi _i) }+r_i },$$
(4)

with

$$\epsilon = {\left\{ \begin{array}{ll} 1, &\quad\text {if }|\phi _i| < \pi /2 \\ -1, &\quad\text {if }|\phi _i| > \pi /2, \end{array}\right. }$$

where \(d_i=\left\| F_i-C_i\right\|\). Therefore, the position of \(P_i\) can be uniquely defined in terms of its three neighbors:

$$P_i = \epsilon _{1i}P_{N_1(i)}+\epsilon _{2i}P_{N_2(i)}+\epsilon _{3i}P_{N_3(i)}+ L(r_i,d_i,\phi _i) \mathbf {n}_i.$$
(5)

While the metric parameters control the position of the orthogonal projection \(F_i\) of \(P_i\) onto the tangent plane, the local mean curvature \(H_i = \frac{ \sin {(\phi _i)} }{ r_i }\) at \(P_i\) is controlled by the simplex angle.

Methods

In this section, we describe the details of the proposed deformable surface model based statistical shape models construction method. It adopts a greedy algorithm which allows simplex meshes to converge on the object of interest under the constraints of both internal energy and VFC external energy. The whole procedure is illustrated in Fig. 2.

Fig. 2
figure 2

Flowchart of the proposed method for statistical shape model construction

Automatic initialization of deformable surface models

In order to get accurate and robust results, we develop an automatic shape initialization method to derive an initial shape that has high overlap with the object of interest, such that the deformable models can then evolve more locally. Given the training samples with ground truth segmentation, we firstly obtain their simplex mesh representations, which are defined as the dual of the triangular mesh derived via Marching Cubes algorithm [26] and mesh smoothing methods [27]. To minimize bias towards the chosen initial shape and make the deformable simplex meshes more robust to local minima, we then choose the training sample similar to an average femur shape as the template mesh for the deformable models.

In order to obtain a good initial shape for the deformable models, we employ the Gaussian mixture model (GMM) based point set registration method [28] by aligning the template mesh with other training meshes via an affine transformation. In GMM based registration method, the main idea is to represent the point sets to be registered as Gaussian mixture models, and then align the two corresponding Gaussian mixtures by minimizing their \(L_2\) distance. Specifically, we represent the point sets of the template mesh as a Gaussian mixture \(f(\mathbf {x}) = \sum _{ i=1 }^{m} {\alpha _{i} \phi (\mathbf {x} | \mu _{i},\Sigma _{i})}\), where \(\alpha _{i}\) is the weight of each component, \(\phi (\mathbf {x} | \mu _{i},\Sigma _{i})\) is a Gaussian distribution with mean \(\mu _{i}\) and covariance variance \(\Sigma _{i}\). Similarly, the point sets of other training meshes are represented by \(g(\mathbf {x}) = \sum _{ j=1 }^{n} {\beta _{j} \phi (\mathbf {x} | \nu _{j},\Gamma _{i})}\). An affine transformation can be represented by a \(3 \times 3\) matrix \(\mathbf {A}\), and a translation vector \(\mathbf {t}\). For convenience, the matrix \(\mathbf {A}\) is factorized as an orthogonal matrix \(\mathbf {Q}\) and a symmetric positive definite matrix \(\mathbf {S}\), i.e., \(\mathbf {A}=\mathbf {Q}\mathbf {S}\) [28]. Then the affine transformation (i.e., \(\mathbf {A}\) and \(\mathbf {t}\)) can be found by minimizing the following \(L_2\) distance between Gaussian mixtures \(f(\mathbf {x})\) and \(g(\mathbf {x})\) [28]:

$$d_{L_2}(f,g,\mathbf {A},\mathbf {t}) = \int {(g-f_{\mathbf {A},\mathbf {t}})^2dx} = \int {(f_{\mathbf {A},\mathbf {t}}^2-2f_{\mathbf {A},\mathbf {t}}g+g^2)dx},$$
(6)

where \(f_{\mathbf {A},\mathbf {t}}(\mathbf {x})= \sum _{ i=1 }^{m} {\alpha _{i} \phi (\mathbf {x} | \mathbf {A}\mu _{i}+\mathbf {t},\mathbf {Q}(\Sigma _{i})\mathbf {Q}^T)}\) is the transformed distribution of the template mesh. Since \(\int {g^2}\) is independent of the affine transformation, we only need to consider \(\int {f_{\mathbf {A},\mathbf {t}}^2}\) and \(\int {f_{\mathbf {A},\mathbf {t}}g}\). Fortunately, both terms has closed-form expressions, for the latter term:

$$\int {f_{\mathbf {A},\mathbf {t}}g \, dx} = \sum _{ i=1 }^{m} {\sum _{ j=1 }^{n} {\alpha _{i}\beta _{j} \phi (\mathbf {x} | \mu _{i}-\nu _{j},\mathbf {Q}(\Sigma _{i})\mathbf {Q}^T+\Gamma _{i})}}.$$
(7)

Considering that the gradients associated with affine transformation have analytical solutions [28], fast gradient-based numerical optimization techniques (e.g., Quasi-Newton methods) can be deployed to minimize the objective function.

After deriving the affine transformation (i.e., \(\mathbf {A}\) and \(\mathbf {t}\)), we used it to transform the template mesh to the space of other training meshes, resulting in the initial shape for the deformable models.

Evolution method

We adopt the greedy algorithm [16] as evolution method to guide deformable 2-simplex meshes to the object of interest (e.g., edges) [21]. Let \(V_i\) be the voxel in the volumetric image containing vertex \(P_i\). During each iteration, a \(w\times w\times w\) cubic window around \(V_i\) is searched, and the energy is computed at each voxel within the window (see Fig. 3a). The energy at vertex \(P_i\) is defined as a combination of both internal and external energy normalized within the window:

$$\begin{aligned} {\varvec{E}(P_i) }&= {\alpha \varvec{E}_{int}(P_i) + \beta \varvec{E}_{ext}(P_i)} \\&= {\alpha \left( \varvec{E}_{Tangent}(P_i) + \varvec{E}_{Normal}(P_i) \right) + \beta \varvec{E}_{VFC}(P_i)}, \end{aligned}$$
(8)

where \(\varvec{E}_{Tangent}\) and \(\varvec{E}_{Normal}\) are internal energy, \(\varvec{E}_{VFC}\) is the VFC external energy, and \(\alpha > 0\), \(\beta > 0\) are weighting parameters that control the relative influence of internal and external energy. The internal energy regularizes the rigidity and elasticity of the deformable models, while VFC external energy attracts the deformable models to the object of interest (e.g., edges).

We define \({N_i}^{t}\) as the \(w\times w\times w\) cubic window around voxel \(V_i\). The position \({Q_i}^{t}\) with minimum energy within the window \({N_i}^{t}\) is chosen as:

$${Q_i}^{t}={\mathop{\text{arg min}}\limits_{{P_j}\in {N_i}^{t}}} \,\varvec{E}(P_j).$$
(9)

In order to guarantee a stable and smooth deformation of the 2-simplex meshes, the vertex \(P_i\) is moved only along its normal direction \(\mathbf {n}_i\), rather than directly moved to \({Q_i}^{t}\) as in the classical greedy algorithm (see Fig. 3b):

$${P_i}^{t+1} = {P_i}^{t}+ \left( ({Q_i}^{t}-{P_i}^{t})\cdot \mathbf {n}_i \right) \mathbf {n}_i.$$
(10)
Fig. 3
figure 3

The greedy algorithm: a the energy function is calculated at vertex \(P_i\) and voxels in the \(w\times w\times w\) cubic window around \(V_i\), and the point with the smallest energy is selected as the target position of \(P_i\). b The vertex \(P_i\) is moved only along its normal direction \(\mathbf {n}_i\). For illustration purpose, only a 2-D window is showed

The overall pseudocode of the greedy algorithm is summarized in Algorithm 1 [21].

figure a

Internal energy computation

The internal energy maintains the geometrical regularity of the deformable 2-simplex meshes. We adopt the original internal force proposed in [15] for greedy algorithm as internal energy, which is decomposed into tangential energy and normal energy [21]:

$$\begin{aligned} {\varvec{E}_{int}(P_i) }&= {\varvec{E}_{Tangent}(P_i) + \varvec{E}_{Normal}(P_i)} \\&= {{\left\| \frac{1}{3}( P_{N_1(i)}+P_{N_2(i)}+P_{N_3(i)} )-F_i \right\| }^2 + \;{\left\| L(r_i,d_i,\tilde{\phi }_i)-L(r_i,d_i,\phi _i) \right\| }^2,} \end{aligned}$$
(11)

where \(\tilde{\phi }_i\) is the reference simplex angle defined as the mean of neighboring vertices’ simplex angles. The tangential energy measures the distance of the orthogonal projection \(F_i\) of \(P_i\) from the center of gravity of its three neighbors in the tangent plane. When the energy achieves the minimum, it ensures that the vertices exhibit uniformly on the surface. While the normal energy keeps a local smoothing simplex mesh through a \(C^2\) smoothness constraint.

VFC external energy computation

Vector field convolution (VFC) field [18] is a widely used static external force for physics-based deformable models. It largely solves the problems associated with traditional external force and can guide the active contour into long and thin boundary. Furthermore, in comparison with the classical GVF external force [1921], VFC force shows superior robustness to noise and initialization, and much less computational cost [18]. The three-dimensional VFC field \(\mathbf {v}({x, y, z})=[u({x, y, z}), v({x, y, z}), w({x, y, z})]\) is defined as the convolution of a vector field kernel \(\mathbf {k}({x, y, z})\) with the edge map f(xyz) generated from the input image I(xyz):

$$\mathbf {v}({x, y, z}) = f({x, y, z}) \otimes \mathbf {k}({x, y, z}),$$
(12)

where \(\otimes\) denotes linear convolution. The basic principle behind this formula is to propagate the gradient vectors of input image I(xyz) into homogeneous image regions via image convolution, such that the deformable models have more chances to move towards the object of interest (e.g., edges).

The edge map f(xyz) for gray-level input images is usually defined as the gradient magnitude of the blurred image:

$$f({x, y, z}) = | \nabla [G_{\sigma }({x, y, z})] \otimes I({x, y, z})|,$$
(13)

where \(G_{\sigma }({x, y, z})\) is a 3-D Gaussian function with standard deviation \(\sigma\), and \(\nabla\) is the gradient operator. The vector field kernel \(\mathbf {k}({x, y, z})=[u_k({x, y, z}), v_k({x, y, z}), w_k({x, y, z})]\) can be calculated as follows:

$$\mathbf {k}({x, y, z}) = m({x, y, z}) \mathbf {n}({x, y, z}),$$
(14)

where m(xyz) is the magnitude of the vector at (xyz) and \(\mathbf {n}({x, y, z})\) is the unit vector from (xyz) to the kernel origin (0, 0, 0) given as:

$$\mathbf{{n}}(x,y,z) = \left\{ {\begin{array}{lll} {[0,0,0],}&\quad{\mathrm{{if }}(x,y,z) = (0,0,0),}\\ {[ - \frac{x}{r}, - \frac{y}{r}, - \frac{z}{r}],}&\quad{\mathrm{{otherwise}},} \end{array}} \right.$$

where \(r=\sqrt{x^2 + y^2 + z^2}\) is the distance from the kernel origin (0, 0, 0).

The magnitude of the vector field kernel m(xyz) plays a major role in generating the VFC field. Considering that the influence of the object of interest should be diminished as the deformable models become further away, then two types of magnitude functions can be defined as decreasing functions of the distance from the kernel origin [18]:

$$m_1({x, y, z})=( r + \varepsilon )^{-\gamma },$$
(15)
$$m_2({x, y, z})= {\rm exp} \left( \frac{-r^2}{\zeta ^2}\right),$$
(16)

where \(\gamma > 0\) and \(\zeta > 0\) are parameters that control the decrease of the magnitude, and \(\varepsilon > 0\) is a small constant for the prevention of division by zero at the kernel origin. The parameters \(\gamma\) and \(\zeta\) are set by considering the signal-to-noise ratio (SNR) of the input image I(xyz) [18].

In the practical implementation, the 3-D VFC field \(\mathbf {v}({x, y, z})=[u({x, y, z}), v({x, y, z}), w({x, y, z})]\) can be calculated by convolving the edge map f(xyz) with each component of the vector field kernel \(\mathbf {k}({x, y, z})=[u_k({x, y, z}), v_k({x, y, z}), w_k({x, y, z})]\) [18]:

$$u({x, y, z})= f({x, y, z}) \otimes u_k({x, y, z}),$$
(17)
$$v({x, y, z}) = f({x, y, z}) \otimes v_k({x, y, z}),$$
(18)
$$w({x, y, z})= f({x, y, z}) \otimes w_k({x, y, z}).$$
(19)

The continuous vector field kernel \(\mathbf {k}({x, y, z})\) is approximated by a discrete matrix defined on a 3-D grid [18]:

$$\{\mathbf {k}(x,y,z)~|x,y,z=-R,\ldots ,-1,0,1,\ldots ,R\},$$
(20)

where R is the chosen kernel radius.

To the best of our knowledge, all current work on deformable models employs VFC as force fields rather than potential energy. To tackle the main issues of the traditional external energy, we extend it for greedy algorithm as external potential energy. Specifically, we make the following enhancements to the original VFC method:

  1. 1.

    The magnitude of the VFC field \(\left\| \mathbf {v}({x, y, z})\right\|\) is used as external energy instead of the force field itself.

  2. 2.

    One major drawback of the original VFC method is that the generated VFC fields are dominated by the strong edges, this will make the weak yet important edges smooth out along with the noise and cause leakage problem [18]. The GVF fields presented in [19] have successfully avoided this leakage problem and can retain the weak edges, as can be seen from Fig. 3(b) in [18]. It is mainly because in GVF, all the edges generated from different features are treated equally, which is achieved by normalizing the used edge maps to the range [0, 1] during GVF computations. After this normalization step, the weak edges will thus have similar magnitude to that of the strong edges, resulting in equal importance between them. Considering that no noise is presented in our binary input images, we propose also to normalize the edge map f(xyz) to [0, 1] before the computation of VFC energy, in order to alleviate the leakage problem and preserve the weak yet important edges.

Therefore, the VFC energy at vertex \(P_i\) can be defined as:

$$\varvec{E}_{VFC}(P_i) = {\left\| \mathbf {v}(P_i)\right\| }.$$
(21)

When vertex \(P_i\) is within the volumetric image, its external energy can be derived through a trilinear interpolation of the external energy at its neighboring image grid points; otherwise, to keep the deformable simplex meshes from moving out of the volumetric image, its VFC energy is set to be the maximal value of the external energy in the volumetric image [21]. When VFC is used as potential energy, we can pre-compute it and store its field magnitude as gray-level images, this can help greatly reduce the computation time of the VFC, while this does not hold when VFC is used as force fields.

Statistical shape model construction

Using our proposed greedy algorithm-based deformable simplex meshes, we obtain a set of K corresponding femur training shapes \(\{M_i~|~i=1,2,\ldots ,K\}\), and each shape \(M_i\) is represented by a shape vector \(\mathbf {x}_i\) with N landmark points. Firstly, all training shapes are spatially aligned into a common coordinate frame using the generalized Procrustes analysis (GPA) [29]. We define the corresponding covariance matrix as:

$$\mathbf {S} = \frac{1}{K-1} \sum _{i=1}^{K} (\mathbf {x}_i - \bar{\mathbf {x}})(\mathbf {x}_i -\bar{ \mathbf{x}})^T,$$
(22)

where \(\bar{\mathbf {x}}\) is the mean shape vector of all subjects: \(\bar{\mathbf {x}} = \frac{1}{K} { \sum _{i=1}^{K} { \mathbf {x}_i } }\). Then the principal component analysis (PCA)-based statistical shape model (SSM) can be built by an eigen-decomposition on the covariance matrix \(\mathbf {S}\):

$$\mathbf {S} = \mathbf {U}\mathbf {D}\mathbf {U}^T,$$
(23)

where columns of matrix of \(\mathbf {U}\) form the principal modes of variation \(\phi _m\), and diagonal entries of \(\mathbf {D}\) are their respective variances \(\lambda _m\). Then any valid shapes of femur structure can be approximated by a linear combination of the first c modes of variation:

$$\mathbf { x } = \bar{\mathbf {x}} + \sum _{m=1}^{c} { b_m\phi _m },$$
(24)

where \(c = \min \{ t ~ | ~ { \sum _{i=1}^{t} \lambda _t }/{ \sum _{i=1}^{K-1} \lambda _t } > 0.98 \}\), and \(b_m\) is the shape parameter constrained to the interval \(b_m \in \left[ -3\sqrt{\lambda _m}, 3\sqrt{\lambda _m}\right]\).

Experimental setup

Datasets and parameters

To evaluate the performance of our proposed deformable simplex meshes-based method for establishing shape correspondence, we applied it to construct the femur statistical shape models based on a femur database. The database consists of ten femur CT scans from 10 different patients (6 males, 4 females) with ground truth segmentation manually delineated by clinical experts. The ages of patients ranged from 30 to 71 years with an average age of 54 years. These CT scans were provided by the Weihai Municipal Hospital, and were acquired by use of a 64-row multidetector CT scanner (Brilliance 64; Philips Healthcare, Best, the Netherlands) with varied in-plane resolution between 0.58 and 0.67 mm, and a slice thickness of 1.5 mm.

In the experiments, the first magnitude function \(m_1(\mathbf {x})\) is employed for the computation of VFC energy. The parameter settings of our proposed method are as follows: the weighting parameters in Eq. 8 \(\alpha =0.4\), \(\beta =1.0\), the width of the cubic window \(w=11\); the vector field kernel radius \(R = 256\), \(\gamma =1.7\), and \(\varepsilon = 10^{-8}\) for VFC energy. All these parameters are the same for all the training data. The parameters for the deformable model were chosen empirically according to that used in our previous paper [21], where we thoroughly evaluated the effect of these parameters on the final segmentation accuracy via the metric of mean radial error (MRE); while for VFC energy, the parameters were empirically set to the values suggested in the original VFC paper [18], where the authors experimentally showed the effect of these parameters on the final segmentation accuracy via the metric of root mean square error (RMSE).

Compared method

We compared our proposed method with the classical spherical harmonics (SPHARM) shape correspondence method [8]. Although the SPHARM method is not being proposed recently, the reason we chose this method for comparison is that it is still quite efficient and popular, especially for the analysis of brain morphology structures in the neuroimaging community [30]. Moreover, the SPHARM method is still frequently cited by researchers for comparative experiments [3032]. Since the SPHARM method requires the input shape to be of spherical topology (i.e., objects without holes), all holes in the training images must be filled firstly. The SPHARM method maps all the input shapes to a common parameter domain (i.e., the unit sphere) using an area-preserving spherical parameterization. It means that every vertex on the input surface is mapped to a unique point on the unit sphere using a spherical coordinate system. By using these spherical parameterizations, a unique shape descriptor coefficient can be derived for each input shape based on SPHARM basis functions. Point correspondence in the parameter domain is then determined by rotating the spherical parameterization, such that the poles of the input shape described by the first order coefficient (an ellipsoid) are coinciding with the poles of the parameterization. Finally the corresponded training shapes are derived by uniformly sampling the rotated spherical parameterization via icosahedron subdivision.

Results

Qualitative analysis

Figure 4 shows 3D visualization of the shape variation on the surface for the constructed femur shape model. We can see that the upper extremity and lower extremity of the femur have more variation than other parts of the femur surface, while the body of the femur (middle parts) only has little variation (shown in dark blue color). Figure 5 illustrates the first two principal modes of variation for the constructed femur shape model. Each row shows the variation of a specific mode between \(-3\sigma\) and \(+3\sigma\). It can be seen that the first mode accounts for the whole volume size of the femur structure, while the second mode mainly represents the change of femoral neck and greater trochanter. These demonstrate that we have successfully constructed the femur shape model by using our proposed deformable simplex meshes-based method.

Fig. 4
figure 4

3D visualization of the shape variation on the surface for the constructed femur shape model. The variation spans from small (blue) to large (red)

Fig. 5
figure 5

The first two principal modes of variation for the constructed femur shape model. Each row shows the variation of a specific mode between \(-3\sigma\) and \(+3\sigma\)

Quantitative analysis

Compactness, generalization ability, and specificity are the three standard metrics for assessing the quality of a constructed shape model [33]. In this section, we use all these three metrics to quantitatively evaluate the quality of the constructed femur shape model by using our proposed method.

Compactness

A compact shape model should the one that has little variance and can accurately reconstruct new shape instances with few shape parameters. Thus, the compactness is defined as the cumulative variance of the Mth mode used in the shape reconstruction [33]:

$$C(M) = \sum _{m=1}^{M} {\lambda _m},$$
(25)

where \(\lambda _m\) is the mth eigenvalue. For the compactness, the smaller the value is, the better the constructed shape model.

Figure 6 shows the compactness for both SPHARM and our proposed method with varying number of modes of variation. For all the employed number of modes, our method achieves better compactness than the SPHARM method. Table 1 shows the compactness for the two compared shape prior modeling methods with nine modes. Specifically, the compactness of our method is 1.38, and SPHARM method’s compactness is more than three times that of our method. Therefore, the shape model constructed by our method is more compact than that of the SPHARM method.

Fig. 6
figure 6

Compactness for both SPHARM and our proposed method

Table 1 The compactness for the two compared shape prior modeling methods with nine modes

Generalization ability

The generalization ability quantifies the ability of the constructed shape model to represent new shape instances of the same structure class. It is measured based on the training data by performing leave-one-out tests. Specifically, a shape model is built by using all but one training shape \(\mathbf {x}_i\), and then the constructed model is employed to reconstruct the excluded shape \(\mathbf {x}_i\). The approximation error is defined as the distance between the excluded shape \(\mathbf {x}_i\) and its reconstructed shape \(\mathbf {x}_i^{'}\). The generalization ability is the average approximation error of all the performed K tests [33]:

$$G(M) = \frac{1}{K} \sum _{i=1}^{K} { \Vert \mathbf {x}_i - \mathbf {x}_i^{'}(M) \Vert }^2,$$
(26)

where the reconstructed shape \(\mathbf {x}_i^{'}(M)\) is defined as a linear combination of the first M modes of variation:

$$\mathbf {x}_i^{'}(M) = \bar{\mathbf {x}} + \sum _{m=1}^{M} { b_m\phi _m }.$$
(27)

And a smaller value of the generalization ability indicates a better constructed shape model.

The generalization ability for both SPHARM and our proposed method is shown in Fig. 7 with different number of modes of variation. And our method shows consistently better generalization ability than the SPHARM method in all the used number of modes. The mean and standard deviation of the generalization ability for the two compared shape prior modeling methods are shown in Table 2 by using nine modes. The generalization ability of our method is 0.84 mm, while the generalization ability of SPHARM method is nearly 50% higher than that of our method. These demonstrate that our method can reconstruct new shape instances more accurately than the SPHARM method.

Fig. 7
figure 7

Generalization ability for both SPHARM and our proposed method

Table 2 The mean and standard deviation of the generalization ability for the two different shape prior modeling methods with nine modes

Specificity

The specificity measures the validity of shape instances generated by the constructed shape model. It is measured by generating a large set of N shape instances using the constructed model. The approximation error is defined as the distance between the generated shape instance \(\mathbf {x}_j\) and its most similar sample in the training data \(\mathbf {x}_j^{'}\). The specificity is the average approximation error of all the generated N shape instances [33]:

$$S(M) = \frac{1}{N} \sum _{j=1}^{N} { \Vert \mathbf {x}_j(M) - \mathbf {x}_j^{'} \Vert }^2,$$
(28)

where the shape instance \(\mathbf {x}_j\) is generated by choosing random values for the first M’s shape parameter \(\mathbf {b}\) from the range \(b_m \in \left[ -3\sqrt{\lambda _m}, 3\sqrt{\lambda _m}\right]\):

$$\mathbf {x}_j(M) = \bar{\mathbf {x}} + \sum _{m=1}^{M} { b_m\phi _m },$$
(29)

and the most similar sample \(\mathbf {x}_j^{'}\) is defined as:

$$\mathbf {x}_j^{'} = {\mathop{\text{arg min}}\limits_{k \in [1, K]}} \,{ \Vert \mathbf {x}_j(M) - \mathbf {x}_k \Vert }^2.$$
(30)

A value of N =  10,000 was used to obtain the results reported in this study. For the specificity, a smaller value means a better constructed shape model.

Figure 8 illustrates the specificity for both SPHARM and our proposed method with various number of modes of variation. In all the used number of modes, our method shows better specificity than the SPHARM method. Table 3 summarizes the mean and standard deviation of the specificity for the two compared shape prior modeling methods with nine modes. In particular, the specificity of our method is 1.52 mm, and SPHARM method’s specificity is more than 50% higher than that of our method. These indicate that the shape instances generated by the model of our method is more specific to the modeled structure than that of the SPHARM method.

Fig. 8
figure 8

Specificity for both SPHARM and our proposed method

Table 3 The mean and standard deviation of the specificity for the two different shape prior modeling methods with nine modes

Through all the above quantitative comparative results of the three employed metrics, we can conclude that the quality of the constructed femur shape models by using the proposed method is much better than that of the classical SPHARM method. Therefore, our method can be employed for effective femur shape model construction.

Computational time

The proposed method was implemented in C++Footnote 1 and tested on a PC with an Intel processor and 2 GB of RAM. Our proposed method takes an average of about 15 min to establish shape correspondence for each shape. For comparison, the time taken by the SPHARM method for the same task is about 1 h in average. Thus our proposed method achieves much higher computational efficiency than the SPHARM method.

Conclusion

In this paper, we have proposed a novel mesh-to-volume registration based method for automatically establishing shape correspondence. It integrates the computational simplicity of simplex meshes, speed of the greedy algorithm, and robustness of VFC in a unified system. Through extensive experiments on ten femur CT scans from ten different patients, we demonstrate that the quality of the constructed femur shape models by using the proposed method is much better than that of the classical SPHARM method. Moreover, the proposed method achieves much higher computational efficiency than the SPHARM method. Therefore, the proposed method can be employed for effective femur shape model construction.

For this study, we mainly focus on a new method for automatically establishing shape correspondence, rather than a direct extension of an existing method, thus few similar methods are available for direct comparisons. Nevertheless, we adapted the original implementation of the classical SPHARM method with fine-tuned parameter settings, and provided an extensive quantitative comparison based on the same femur CT datasets. Considering that no public femur CT datasets with ground truth are currently available, in our future work, we will implement more state-of-the-art methods and provide more comparative results. Furthermore, our future research will go in the direction of applying the constructed femur shape models in ASM based femur segmentation method as the shape prior models.

Notes

  1. The source code is available at https://github.com/ivanshih/GA-DSM.

Abbreviations

ASM:

active shape model

SSM:

statistical shape models

PCA:

principal component analysis

VFC:

vector field convolution

GVF:

gradient vector flow

GMM:

Gaussian mixture model

SPHARM:

spherical harmonics

GPA:

generalized Procrustes analysis

AFDM:

adaptive-focus deformable model

References

  1. Pham DL, Xu C, Prince JL. Current methods in medical image segmentation. Annu Rev Biomed Eng. 2000;2:315–37.

    Article  Google Scholar 

  2. Cootes TF, Taylor CJ, Cooper DH, Graham J. Active shape models—their training and application. Comput Vis Image Underst. 1995;61(1):38–59.

    Article  Google Scholar 

  3. Heimann T, Meinzer H-P. Statistical shape models for 3D medical image segmentation: a review. Med Image Anal. 2009;13(4):543–63.

    Article  Google Scholar 

  4. Jolliffe IT. Principal component analysis. Springer series in statistics. 2nd ed. Berlin: Springer; 2002.

    MATH  Google Scholar 

  5. Chui H, Rangarajan A. A new point matching algorithm for non-rigid registration. Comput Vis Image Underst. 2003;89(2–3):114–41.

    Article  MATH  Google Scholar 

  6. François Chung, Hervé Delingette. Regional appearance modeling based on the clustering of intensity profiles. Comput Vis Image Underst. 2013;117(6):705–17.

    Article  Google Scholar 

  7. Rueckert D, Frangi AF, Schnabel JA. Automatic construction of 3-D statistical deformation models of the brain using non-rigid registration. IEEE Trans Med Imaging. 2003;22(8):1014–25.

    Article  Google Scholar 

  8. Styner M, Oguz I, Xu S, Brechbühler C, Pantazis D, Levitt JJ, Shenton ME, Gerig G. Framework for the statistical shape analysis of brain structures using SPHARM-PDM. In: Proceedings of insight journal-ISC/NA-MIC workshop on open science at MICCAI 2006, Copenhagen, Denmark (2006). http://hdl.handle.net/1926/215.

  9. Davies RH, Twining CJ, Cootes TF, Waterton JC, Taylor CJ. A minimum description length approach to statistical shape modeling. IEEE Trans Med Imaging. 2002;21(5):525–37.

    Article  MATH  Google Scholar 

  10. Munsell BC, Temlyakov A, Styner M, Wang S. Pre-organizing shape instances for landmark-based shape correspondence. Int J Comput Vis. 2012;97(2):210–28.

    Article  Google Scholar 

  11. Oguz I, Cates J, Datar M, Paniagua B, Fletcher T, Vachet C, Styner M, Ross T, Whitaker R. Entropy-based particle correspondence for shape populations. Int J Comput Assist Radiol Surg. 2016;11(7):1221–32.

    Article  Google Scholar 

  12. Gollmer S, Kirschner M, Buzug TM, Wesarg S. Using image segmentation for evaluating 3D statistical shape models built with groupwise correspondence optimization. Comput Vis Image Underst. 2014;125:283–303.

    Article  Google Scholar 

  13. Kass M, Witkin A, Terzopoulos D. Snakes: active contour models. Int J Comput Vis. 1988;1(4):321–31.

    Article  MATH  Google Scholar 

  14. McInerney T, Terzopoulos D. Deformable models in medical image analysis: a survey. Med Image Anal. 1996;1(2):91–108.

    Article  Google Scholar 

  15. Delingette H. General object reconstruction based on simplex meshes. Int J Comput Vis. 1999;32(2):111–46.

    Article  Google Scholar 

  16. Williams DJ, Shah M. A fast algorithm for active contours and curvature estimation. CVGIP Image Underst. 1992;55(1):14–26.

    Article  MATH  Google Scholar 

  17. Mille J, Boné R, Makris P, Cardot H. Greedy algorithm and physics-based method for active contours and surfaces: a comparative study. In: Proceedings of the IEEE international conference on image processing (ICIP), Atlanta, Georgia, USA; 2006. p. 1645–48

  18. Li B, Acton ST. Active contour external force using vector field convolution for image segmentation. IEEE Trans Image Process. 2007;16(8):2096–106.

    Article  MathSciNet  Google Scholar 

  19. Xu C, Prince JL. Snakes, shapes and gradient vector flow. IEEE Trans Image Process. 1998;7(3):359–69.

    Article  MathSciNet  MATH  Google Scholar 

  20. Shang Y, Dossel O. Statistical 3D shape-model guided segmentation of cardiac images. In: Proceedings of the computers in cardiology 2004, Chicago, Illinois, USA; 2004. p. 553–56

  21. Shi C, Guo C, Cheng Y, Wang J. Greedy algorithm based deformable simplex meshes using gradient vector flow as external energy. In: Proceedings of the IEEE international conference on biomedical engineering and informatics (BMEI), Dalian, China; 2014. p. 199–204

  22. Kaus MR, Pekar V, Lorenz C, Truyen R, Lobregt S, Weese J. Automated 3-D PDM construction from segmented images using deformable models. IEEE Trans Med Imaging. 2003;22(8):1005–13.

    Article  Google Scholar 

  23. Shen D, Herskovits E, Davatzikos C. An adaptive-focus statistical shape model for segmentation and shape modeling of 3-D brain structures. IEEE Trans Med Imaging. 2001;20(4):257–70.

    Article  Google Scholar 

  24. Zhao Z, Teoh EK. A novel framework for automated 3D PDM construction using deformable models. In: Proceedings of SPIE medical imaging, San Diego, CA, USA; 2005. p. 303–14

  25. Clogenson M, Duff JM, Lüthi M, Levivier M, Meuli R, Baur C, Henein S. A statistical shape model of the human second cervical vertebra. Int J Comput Assist Radiol Surg. 2015;10(7):1097–107.

    Article  Google Scholar 

  26. Lorensen WE, Cline HE. Marching cubes: a high resolution 3D surface construction algorithm. In: Proceedings of SIGGRAPH ’87, New York, NY, USA; 1987. p. 163–69

  27. Taubin G. A signal processing approach to fair surface design. In: Proceedings of the SIGGRAPH ’95, Los Angeles, CA, USA; 1995. p. 351–58

  28. Jian B, Vemuri BC. Robust point set registration using Gaussian mixture models. IEEE Trans Pattern Anal Mach Intell. 2011;33(8):1633–45.

    Article  Google Scholar 

  29. Goodall C. Procrustes methods in the statistical analysis of shape. J R Stat Soc Ser B Methodol. 1991;53(2):285–339.

    MathSciNet  MATH  Google Scholar 

  30. Hosseinbor AP, Chung MK, Koay CG, Schaefer SM, van Reekum CM, Peschke-Schmitz L, Sutterer MJ, Alexander AL, Davidson RJ. 4D hyperspherical harmonic (HyperSPHARM) representation of surface anatomy: a holistic treatment of multiple disconnected anatomical structures. Med Image Anal. 2015;22(1):89–101.

    Article  Google Scholar 

  31. Yi Gao, Sylvain Bouix. Statistical shape analysis using 3D Poisson equation—a quantitatively validated approach. Med Image Anal. 2016;30:72–84.

    Article  Google Scholar 

  32. Hong J-P, Vicory J, Schulz J, Styner M, Marron JS, Pizer SM. Non-Euclidean classification of medically imaged objects via s-reps. Med Image Anal. 2016;31:37–45.

    Article  Google Scholar 

  33. Davies RH. Learning shape: optimal models for analysing natural variability. PhD thesis, University of Manchester, Manchester, UK. 2002. http://academy.bcs.org:8080/sites/academy.bcs.org/files/rhodri-davies.pdf

Download references

Authors' contributions

JKW implemented the experiments, draft and analyzed the data, CFS suggested the proposed algorithm and the design of the study. Both authors read and approved the final manuscript.

Acknowledgements

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Availability of data and materials

The datasets analyzed during the current study are not publicly available due to the further analysis of the datasets being doing in our research but they can be provided upon reasonable request.

Consent for publications

The participants (all adults) gave their consent for publication.

Ethics approval and consent to participate

The clinical images used in the research are provided by the Weihai Municipal Hospital and we have been informed that the patients were consent to participate in the study.

Funding

This work was supported by the Nature Science Foundation of Heilongjiang Province of China (No. QC2016090).

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Changfa Shi.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, J., Shi, C. Automatic construction of statistical shape models using deformable simplex meshes with vector field convolution energy. BioMed Eng OnLine 16, 49 (2017). https://doi.org/10.1186/s12938-017-0340-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12938-017-0340-0

Keywords