 Research
 Open Access
 Published:
Automatic discrimination between safe and unsafe swallowing using a reputationbased classifier
BioMedical Engineering OnLine volume 10, Article number: 100 (2011)
Abstract
Background
Swallowing accelerometry has been suggested as a potential noninvasive tool for bedside dysphagia screening. Various vibratory signal features and complementary measurement modalities have been put forth in the literature for the potential discrimination between safe and unsafe swallowing. To date, automatic classification of swallowing accelerometry has exclusively involved a singleaxis of vibration although a second axis is known to contain additional information about the nature of the swallow. Furthermore, the only published attempt at automatic classification in adult patients has been based on a small sample of swallowing vibrations.
Methods
In this paper, a large corpus of dualaxis accelerometric signals were collected from 30 older adults (aged 65.47 ± 13.4 years, 15 male) referred to videofluoroscopic examination on the suspicion of dysphagia. We invoked a reputationbased classifier combination to automatically categorize the dualaxis accelerometric signals into safe and unsafe swallows, as labeled via videofluoroscopic review. From these participants, a total of 224 swallowing samples were obtained, 164 of which were labeled as unsafe swallows (swallows where the bolus entered the airway) and 60 as safe swallows. Three separate support vector machine (SVM) classifiers and eight different features were selected for classification.
Results
With selected time, frequency and information theoretic features, the reputationbased algorithm distinguished between safe and unsafe swallowing with promising accuracy (80.48 ± 5.0%), high sensitivity (97.1 ± 2%) and modest specificity (64 ± 8.8%). Interpretation of the most discriminatory features revealed that in general, unsafe swallows had lower mean vibration amplitude and faster autocorrelation decay, suggestive of decreased hyoid excursion and compromised coordination, respectively. Further, owing to its performancebased weighting of component classifiers, the static reputationbased algorithm outperformed the democratic majority voting algorithm on this clinical data set.
Conclusion
Given its computational efficiency and high sensitivity, reputationbased classification of dualaxis accelerometry ought to be considered in future developments of a pointofcare swallow assessment where clinical informatics are desired.
1 Introduction
Dysphagia refers to any swallowing disorder [1] and may arise secondary to stroke, multiple sclerosis, and eosinophilic esophagitis, among many other conditions [2]. If unmanaged, dysphagia may lead to aspiration pneumonia in which food and liquid enter the airway and into lungs [3]. The videofluoroscopic swallowing study (VFSS) is the gold standard method for dysphagia detection [4]. This method entails a lateral Xray video recorded during ingestion of a bariumcoated bolus. The health of a swallow is then judged by clinical experts according to criteria such as the depth of airway invasion and the degree of bolus clearance after the swallow. However, this technique requires expensive and specialized equipment, ionizing radiation and significant human resources, thereby precluding its use in the daily monitoring of dysphagia [5]. Swallowing accelerometry has been proposed as a potential adjunct to VFSS. In this method, the patient wears a dualaxis accelerometer inferoanterior to the thyroid notch. Swallowing events are automatically extracted from the recorded acceleration signals and pattern classification methods are then deployed to discriminate between healthy and unhealthy swallows. It is important to distinguish between swallowing vibrations and swallowing sounds, based on current evidence in the literature. Swallowing sounds have been largely attributed to pharyngeal reverberations arising from opening and closing of valves (oropharyngeal, laryngeal and esophageal valves), action of various pumps (pharyngeal, esophageal, and respiratory pumps) and vibrations of the vocal tract [6]. In contrast, in swallowing accelerometry, vocalizations are explicitly removed by preprocessing [7] and studies have implicated hyolaryngeal motion as the primary source of the acceleration signal [8, 9]. Fundamentally, both the method of transduction and the primary physiological source of these signals are different. Our focus here is swallowing vibrations and recent progress in swallowing accelerometry is reviewed below.
1.1 Automatic classification
Das, Reddy & Narayanan [10] deployed a fuzzy logiccommittee network to distinguish between swallows and 'artifacts' using time and frequency domain features of singleaxis accelerometry signals. Although they achieved very high accuracies, their sample of swallows and 'artifacts' was very modest. Using a radial basis classifier with statistical and energetic features, Lee et al. [11] detected aspirations from singleaxis cervical acceleration signals with approximately 80% sensitivity and specificity in a large pediatric cerebral palsy population. Both of these studies only examined accelerations in the anteriorposterior anatomical direction. However, recent research has shown that there is distinct information about swallowing that is encoded in the superiorinferior vibration [12]. Further, hyolaryngeal motion associated with swallowing is inherently twodimensional and this motion was implicated as the likely source of swallow vibrations [9].
In the first dualaxis classification study, Lee et al. [5] discriminated between no airway invasion and airway invasion past the true vocal folds in 24 adult stroke patients using a variety of classifiers (linear discriminant, neural network, probabilistic network and nearest neighbor). A genetic algorithm (GA) selected the most discriminatory feature combinations. With linear classifiers, an adjusted accuracy of 74.7% was achieved in feature spaces of up to 12 dimensions.
In the aforementioned studies, various genres of features have demonstrated discriminatory potential. These include statistical features such as dispersion ratio and normality [11], timefrequency features such as wavelet energies [12], information theoretic features such as entropy rate [13], temporal features such signal memory [14], and spectral features such as the spectral centroid [15]. Further, there is evidence to suggest that complementary measurement modalities, such as nasal air flow and submental mechanomyography [16] may enhance segmentation and classification. Given the presence of multiple feature genres and different measurement modalities, the swallow detection and classification problem lends itself to a multiclassifier approach. For example, it may be sensible to dedicate one classifier to each feature genre [17].
In this paper, we invoke a novel, computationally efficient reputationbased classifier combination to automatically categorize dualaxis accelerometric signals from adult patients into safe and unsafe swallows, as labeled via videofluoroscopic review. We consider multiple feature genres from both anteriorposterior and superiorinferior axes and examine a much larger data set than that of previous swallow accelerometry classification studies.
2 Methods
2.1 Data collection
In this paper, we reexamine data from a subset of participants originally reported in [18]. Briefly, we recruited 30 patients (aged 65.47 ± 13.4 years, 15 male) with suspicion of neurogenic dysphagia who were referred to routine videofluoroscopic examination at one of two local hospitals. Patients had dysphagia secondary to stroke, acquired brain injury, neurodegenerative disease, and spinal cord injury. Research ethics approval was obtained from both participating hospitals.
The data collection setup is shown in Figure 1.
Sagittal plane videofluoroscopic images of the cervical region were recorded to computer at a nominal 30 frames per second via an analog image acquisition card (PCI1405, National Instruments). Each frame was marked with a timestamp via a software frame counter. A dualaxis accelerometer (ADXL322, Analog Devices) was taped to the participants neck at the level of the cricoid cartilage. The axes of the accelerometer were aligned to the anatomical anteriorposterior (AP) and superiorinferior (SI) axes. Signals from both the AP and SI axes were passed through separate preamplifiers each with an internal bandpass filter (Model P55, Grass Technologies). The cutoff frequencies of the bandpass filter were set at 0.1 Hz and 3 kHz. The amplifier gain was 10. The signals were then sampled at 10 kHz using a data acquisition card (USB NI6210, National Instruments) and stored on a computer for subsequent analyses. A trigger was sent from a custom LabView virtual instrument to the image acquisition card to synchronize videofluoroscopic and accelerometric recordings. The above instrumentation settings replicate those of previous dualaxis swallowing accelerometry studies [7, 9, 13–15, 19, 20].
Each participant swallowed a minimum of two or a maximum of three 5 mL teaspoons of thin liquid barium (40%w/v suspension) while his/her head was in a neutral position. The number of sips that the participant performed was determined by the attending clinician. The recording of dualaxis accelerometry terminated after the participant finished his/her swallows. However, the participant's speechlanguage pathologist continued the videofluoroscopy protocol as per usual. In total, we obtained 224 individual swallowing samples from the 30 participants, 164 of which were labeled as unsafe swallows (as defined below) and 60 as safe swallows.
2.2 Data segmentation
To segment the data for analysis, a speechlanguage pathologist reviewed the videofluoroscopy recordings. The beginning of a swallow was defined as the frame when the liquid bolus passed the point where the shadow of the mandible intersects the tongue base. The end of the swallow was identified as the frame when the hyoid bone returned to its rest position following bolus movement through the upper esophegeal sphincter. The beginning and end frames as defined above where marked within the video recording using a custom C++ program. The cropped video file was then exported together with the associated segments of dualaxes acceleration data. An unsafe swallow was defined as any swallow without airway clearance. Typically, this would include penetration and aspiration. Residue would be considered a situation of swallowing inefficiency that is not unsafe swallowing unless the residue was subsequently aspirated. Backflow is extremely rare in the oropharynx, and would only be classified as unsafe should it lead to penetrationaspiration. This definition of unsafe swallowing is in keeping with the industry standard PenetrationAspiration Scale [21].
2.3 PreProcessing
It has been shown in [12] that the majority of signal power in swallowing vibrations of healthy adults lies below 100 Hz. However, given that we were dealing with patient data, we estimated the bandwidth of each of the 224 swallows as the spectral range from 0 Hz up to the frequency at which 95% of the signal energy was captured. We obtained average bandwidths of 175 ± 73 Hz and 226 ± 84 Hz for the AP and SI axes, respectively. Moreover, spectral centroids were < 70 Hz in both axes, suggesting that there is no appreciable signal energy beyond a few hundred Hz. Therefore, we downsampled all signals to 1 kHz. Vocalization was removed from each segmented swallow according to the normalized crosscorrelation periodicity detector proposed in [7]. Whitening of the accelerometry signals to account for instrumentation nonlinearities was achieved using inverse filtering and autoregressive modeling [15]. Finally, the signals were denoised using a Daubechies8 wavelet (8db) transform with soft thresholding. As detailed in [20], both the decomposition level and the wavelet coefficients were chosen to minimize the reconstruction error within a reduced wavelet subspace. Figures 2 and 3 exemplify preprocessed safe and unsafe swallowing signals, respectively.
2.4 Feature Extraction
Let S be a preprocessed acceleration time series, S = {s _{2}, s _{2}, ..., s _{ n }}. As in previous accelerometry studies, signal features from multiple domains were considered [13, 16]. The different genres of features are summarized below.

1.
Time Domain Features

The sample mean is an unbiased estimation of the location of a signal's amplitude distribution and is given by,
{\mu}_{s}=\frac{1}{n}\sum _{i=1}^{n}{S}_{i}.(1) 
The variance of a distribution measures its spread around the mean and reflects the signal's power. The unbiased estimation of variance can be obtained as
{\sigma}_{s}^{2}=\frac{1}{n1}\sum _{i=1}^{n}{\left({S}_{i}{\mu}_{s}\right)}^{2}.(2) 
The median is a robust location estimate of the amplitude distribution. For the sorted set S, the median can be calculated as
MED\left(s\right)=\left\{\begin{array}{cc}\hfill {S}_{v}+1,\hfill & \hfill \mathsf{\text{if}}\phantom{\rule{2.77695pt}{0ex}}n=2v+1;\hfill \\ \hfill \frac{{s}_{v}+{s}_{v+1}}{2},\hfill & \hfill \mathsf{\text{if}}\phantom{\rule{2.77695pt}{0ex}}n=2v.\hfill \end{array}\right.(3) 
Skewness is a measure of the symmetry of a distribution. This feature can be computed as follows.
{\gamma}_{1,s}=\frac{\frac{1}{n}{\sum}_{i=1}^{n}{\left({S}_{i}{\mu}_{s}\right)}^{3}}{{\left(\frac{1}{n}{\sum}_{i=1}^{n}{\left({S}_{i}{\mu}_{s}\right)}^{2}\right)}^{1.5}}.(4) 
Kurtosis reflects the peakedness of a distribution. A high kurtosis value indicates a distribution with a sharp, narrow peak and heavy tails while a low kurtosis value signifies a distribution with a flattened peak and thin tails. This feature was computed as:
{\gamma}_{2,s}=\frac{\frac{1}{n}{\sum}_{i=1}^{n}{\left({S}_{i}{\mu}_{s}\right)}^{4}}{{\left(\frac{1}{n}{\sum}_{i=1}^{n}{\left({S}_{i}{\mu}_{s}\right)}^{2}\right)}^{2}}.(5)

2.
Frequency Domain Features

The peak magnitude value of the Fast Fourier Transform (FFT) of the signal S was also used as a feature. All the FFT coefficients were normalized by the length of the signal, n.

The centroid frequency of the signal S[15] was estimated as
\widehat{f}=\frac{{\int}_{0}^{{f}_{max}}f{F}_{s}\left(f\right){}^{2}df}{{\int}_{0}^{{f}_{max}}{F}_{s}\left(f\right){}^{2}df},(6)
where F _{ s }(f) is the Fourier transform of the signal S and f _{ max }is the Nyquist frequency (effectively 500 Hz after downsampling).

The bandwidth of the spectrum was computed using the following formula
BW=\sqrt{\frac{{{\int}_{0}^{{f}_{max}}\left(f\widehat{f}\right)}^{2}{F}_{s}\left(f\right){}^{2}df}{{\int}_{0}^{{f}_{max}}{F}_{s}\left(f\right){}^{2}df}.}(7)

3.
Information TheoryBased Features

The entropy rate [22] of a signal quantifies the extent of regularity in that signal. The measure is useful for signals with some relationship among consecutive signal points. We first normalized the signal S to zeromean and unit variance. Then, we quantized the normalized signal into 10 equally spaced levels, represented by the integers 0 to 9, ranging from the minimum to maximum value. Now, the sequence of U consecutive points in the quantized signal, \u015c=\left\{{\u015d}_{1},{\u015d}_{2},...,{\u015d}_{3}\right\}, was coded using the following equation
{a}_{i}={\u015d}_{i+U1}\cdot 1{0}^{U1}+...+{\u015d}_{i}\cdot 1{0}^{0},(8)
with i = 1, 2, ..., n  U + 1. The coded integers comprised the coding set A _{ U }= {a _{1}, ..., a _{ nU+1}}. Using the Shannon entropy formula, we estimated entropy
where {p}_{{A}_{U}}\left(t\right) represents the probability of observing the value t in A _{ U }, approximated by the corresponding sample frequency. Then, the entropy rate was normalized using the following equation
where \hat{E\left(U\right)} denotes the normalized entropy, and β was the percentage of the coded integers in A _{ L }that occurred only once. Finally, the regularity index ρ ∈ [0, 1] was obtained as
where a value of ρ close to 0 signifies maximum randomness while ρ close to 1 indicates maximum regularity.

To calculate the memory of the signal [13], its autocorrelation function was computed from zero to the maximum time lag (equal to the length of the signal) and normalized such that the autocorrelation at zero lag was unity. The memory was estimated as the time required for the the autocorrelation to decay to 1/e of its zero lag value.

LempelZiv (LZ) complexity [23] measures the predictability of a signal. To compute the LZ complexity for signal S, first, the minimum and the maximum values of signal points were calculated and then, the signal was quantized into 100 equally spaced levels between its minimum and maximum values. Then, the quantized signal, {B}_{1}^{n}=\left\{{b}_{1},{b}_{2},...,{b}_{n}\right\}, was decomposed into T different blocks, {B}_{1}^{n}=\left\{{\psi}_{1},{\psi}_{2},...,{\psi}_{T}\right\}. A block ψ was defined as
\Psi ={B}_{j}^{\ell}=\left\{{b}_{j},{b}_{j+1},...,{b}_{\ell}\right\},1\le j\le \ell \le n.(12)
The values of the blocks can be calculated as follows:
where h _{ m }is the ending index for ψ _{ m }, such that ψ _{ m+1}is a unique sequence of minimal length within the sequence {B}_{1}^{{h}_{m+1}1}. Finally, the normalized LZ complexity was calculated as
2.5 ReputationBased Classification
Reputation typically refers to the quality or integrity of an individual component within a system of interacting components. The notion of reputation has been widely used to ascertain the health of nodes in wireless networks [24], identify malicious hosts in a distributed system [25] and detect freeriders in peertopeer networks [26], among many other practical applications. Here, we apply the concept of reputation to judiciously combine decisions of multiple classifiers for the purpose of differentiating between safe and unsafe swallows. The general idea is to differentially weigh classifier decisions on the basis of their past performance.
The past performance of the i^{th}classifier is captured via its reputation, {r}_{i}\in \Re ,0\le {r}_{i}\le 1, where 1 signifies a strong classifier (high accuracy) and 0 denotes a weak classifier. Briefly, the classifier is formulated as follows. Let Θ = {θ _{1}, θ _{2},..., θ _{ L }} be a set of L ≥ 2 classifiers and Ω = {ω _{1}, ω _{2}, ..., ω _{ c }} be a set of c ≥ 2 class labels, where ω _{ j }≠ ω _{ k }, ∀j ≠ k. Without loss of generality, Ω ⊂ ℕ. The input of each classifier is the feature vector x\in {R}^{{n}_{i}}, where n _{ i }is the dimension of the feature space for the i^{th}classifier θ _{ i }, whose output is a class label ω _{ j }, j = 1, ..., c. Let p(ω _{ j }) be the prior probability of class ω _{ j }.

1.
For a classification problem with c ≥ 2 classes, we invoke L ≥ 2 individual classifiers.

2.
After training the L classifiers individually, the respective accuracy of each is evaluated using a validation set and expressed as a real number in [0, 1]. This number is the reputation of the classifier.

3.
For each feature vector, x, in the test set, L decisions are obtained using the L distinct classifiers:
\Omega \left(x\right)=\left\{{\theta}_{1}\left(x\right),{\theta}_{2}\left(x\right),...,{\theta}_{L}\left(x\right)\right\}.(15) 
4.
We sort the reputation values of the classifiers in descending order,
{R}^{*}=\left\{{r}_{{1}^{*}},{r}_{{2}^{*}},...,{r}_{{L}^{*}}\right\},(16)
such that r _{1*} ≥ r _{2*} ≥ ··· ≥ r _{ L*}. Then, using this set, we rank the classifiers to obtain a reputationordered set of classifiers, Θ*.
The first element of this set corresponds to the classifier with the highest reputation.

5.
Next, we examine the votes of the first m elements of the reputationordered set of classifiers, with
m=\left\{\begin{array}{cc}\hfill \phantom{\rule{1em}{0ex}}\phantom{\rule{2.77695pt}{0ex}}\frac{L}{2},\hfill & \hfill \mathsf{\text{if}}\phantom{\rule{2.77695pt}{0ex}}L\phantom{\rule{2.77695pt}{0ex}}\mathsf{\text{is}}\phantom{\rule{2.77695pt}{0ex}}\mathsf{\text{even}},\hfill \\ \hfill \frac{L+1}{2},\hfill & \hfill \mathsf{\text{if}}\phantom{\rule{2.77695pt}{0ex}}L\phantom{\rule{2.77695pt}{0ex}}\mathsf{\text{is}}\phantom{\rule{2.77695pt}{0ex}}\mathsf{\text{odd}}.\hfill \end{array}\right.(18)
If the top m classifiers vote for the same class, ω _{ j }, we accept the majority vote and take ω _{ j }as the final decision of the system. However, if the votes of the first m classifiers are not equal, we consider the classifiers' individual reputations (Step 2) in arriving at the final decision, as detailed in step 6.

6.
The probability that the combined classifier decision is ω _{ j }given the input vector x and the individual local classifier decisions is denoted as the posterior probability,
p\left({w}_{j}{\theta}_{1}\left(x\right),{\theta}_{2}\left(x\right),...,{\theta}_{L}\left(x\right)\right)(19)
which can be estimated using Bayes rule as
when the classifiers are independent. For notational convenience, we have dropped the argument for θ above, but it is understood to be a function of x. The local likelihood functions, p(θ _{ i }ω _{ j }), are estimated by the reputation values calculated in Step 2. When the correct class is ω _{ j }and classifier θ _{ i }classifies x into the class ω _{ j }, i.e., θ _{ i }(x) = ω _{ j }, we can write
In other words, p(θ _{ i }= ω _{ j }ω _{ j }) is the probability that the classifier θ _{ i }correctly classifies x into class ω _{ j }when x actually belongs to this class. This probability is exactly equal to the reputation of the classifier. On the other hand, when the classifier categorizes x incorrectly, i.e., θ _{ i }(x) ≠ ω _{ j }given that the correct class is ω _{ j }, then
When there is no known priority among classes, we can assume equal prior probabilities. Hence,
Thus, for each class, ω _{ j }, we can estimate the a posteriori probabilities as given by (20) using (21), (22), and (23). The class with the highest posterior probability is selected as the final decision of the system and the input subject x is categorized as belonging to this class.
2.6 Classifier evaluation
We ranked the signal features introduced above using the Fisher ratio [27] for univariate separability. In the time domain, mean and variance in the AP axis and skewness in the SI axis were the topranked features. Similarly, in the frequency domain, the peak magnitude of the FFT and the spectral centroid in the AP direction and the bandwidth in the SI direction were retained. Finally, in the information theoretic domain, entropy rate for the SI signal and memory of the AP signal were the highest ranking features. Subsequently, we only examined these feature subsets for classification, i.e., in total 8 different features were selected. For comparison between single and dualaxes classifiers, we also considered classifiers that employed feature subsets (as identified above) from a single axis.
Swallows from all 30 participants were pooled together. Given the disproportion of safe and unsafe samples, we invoked a smooth bootstrapping procedure [28] to balance the classes. All features were then standardized to zero mean and unit variance. Three separate support vector machine (SVM) classifiers [29] were invoked, one for each feature genre (time, frequency and information theoretic). Hence, the feature space dimensionalities for the classifiers were 3 (SVM with time features), 3 (SVM with frequency features) and 2 (SVM with informationtheoretic features).
The use of different feature sets for each classifier increases the likelihood that the classifiers will perform independently [30].
Classifier accuracy was estimated via a 10fold cross validation with a 90%10% split. In each fold, performance on the training set was used to estimate the individual classifier reputations. Classifiers were then ranked according to their reputation values. Without loss of generality, assume r _{1} ≥ r _{2} ≥ r _{3}. If θ _{1} and θ _{2} cast the same vote about a test swallow, their common decision was accepted as the final classification. However, if they voted differently, the a posteriori probability of each class was computed using (20) and the maximum a posteriori probability rule was applied to select the final classification.
3 Results
The sensitivity, specificity and accuracy of the singleaxis and dualaxis accelerometry classifiers are summarized in Figure 4. The dualaxis classifier had significantly higher accuracy (80.48 ± 5.0%) than either singleaxis classifier (p << 0.05, twosample ttest), specificity (64 ± 8.8%) comparable to that of the SI classifier (p = 1.0) and sensitivity (97.1 ± 2%) on par with that of the AP classifier (p = 1.0). In other words, the dualaxis classifier retained the best sensitivity and specificity achievable with either singleaxis classifier.
Figure 5 is a parallel axes plot depicting the internal representation of safe and unsafe swallows acquired by the reputationbased classifier. Each feature has been normalized by its standard deviation to facilitate visualization. On each axis, the median feature value is shown. The median values of adjacent axes are joined by solid (safe swallow) or dashed (unsafe swallow) lines.
4 Discussion
4.1 Dual versus single axis
Of the two axes, the AP axis tended to carry more useful information than the SI direction for discrimination between safe and unsafe swallowing. This observation is evidenced in Figure 4, where AP accuracy is dramatically higher than SI levels, echoing the findings of [12] who suggested that the AP axis is richer in information content (i.e., higher entropy) relating to swallowing. Note that data collection conditions and experimental protocols of the present study were similiar to that of [12]. Nonetheless, the SI axis does carry information distinct from that of the AP orientation, as dualaxis classification exceeds any singleaxis counterpart. Our results thus support the inclusion of selected features from both the AP and SI axes for the automatic discrimination between safe and unsafe swallowing. Indeed, when comparing AP and SI signals, [12] reported minimal mutual information, and interaxis dissimilarities in the scalograms, pseudospectra and temporal evolution of low and highfrequency content.
In a recent videofluoroscopic study, both AP and SI accelerations were attributed to the planar motion of the hyoid and larynx during swallowing [9]. In that study, the displacement of the hyoid bone and larynx along with their interaction explained over 70% of the variance in the doubly integrated acceleration in both AP and SI axes at the level of the cricoid cartilage. This physiological basis of swallow accelerometry suggests that differences in hyolaryngeal motion between safe and unsafe swallowing are manifested in our selected features. Indeed, early singleaxis accelerometry research had implicated decreased laryngeal elevation as the reason for suppressed AP accelerations in individuals with severe dysphagia [8].
4.2 Internal representation
In Figure 5, we immediately observe some distinct patterns which characterize each type of swallow. In the AP axis, unsafe swallows tend to have lower acceleration amplitude, higher variance, higher spectral centroid and shorter memory. The lower mean vibration amplitude in unsafe swallowing resonates with previous reports of suppressed peak acceleration [8] in dysphagic patients and reduced peak anterior hyoid excursion [31] in older adults, both suggesting compromised airway protection. The observation of a higher spectral centroid in unsafe swallowing may reflect departures from the typical axial highlow frequency coupling trends of normal swallowing as detailed in [12]. Likewise, the shorter memory and hence faster decay of the autocorrelation may be indicative of compromised overall coordination in unsafe swallowing.
It is also interesting to note that unsafe swallows tend to be negatively skewed while safe swallows are evenly split between positive and negative skew. In other words, in unsafe swallowing, the upward motion of the hyolaryngeal structure appears to have weaker accelerations than during the downward motion. This is opposite of the tendency reported in [12] for healthy swallowing and may reflect inadequate urgency to protect the airway.
4.3 Reputationbased classification
The merit of a reputationbased classifier for the present problem can be appreciated by contrasting its performance against that of the classic method of combining classifiers, i.e., via the majority voting algorithm. To this end, Figure 6 summarizes the accuracies of both approaches from a 10fold crossvalidation using the data of this study. The accuracy, specificity, and sensitivity of classification using the majority voting algorithm on these data were 76.10%, 56.66%, and 94.51%, respectively. The histograms summarize the distribution of accuracies obtained from crossvalidation. To aid in the visualization of underlying differences in performance, the corresponding density estimate (solid line) was obtained using a semiparametric maximum likelihood estimator based on a finite mixture of Gaussian kernels. Clearly, the location of the density of reputationbased accuracies appears to be further to the right of the location of the majority voting density. The large spread in both densities amplifies the risk of Type II error and thus conventional testing (e.g., Wilcoxon ranksum) fails to identify any differences. However, upon more careful inspection using a twosample KolmogorovSmirnoff test of the 20% onesided trimmed densities (i.e., omitting the 2 most extreme points in each density), a statistically significant difference between the distributions (p = 0.0098) is confirmed.
The reputationbased classifier achieved higher adjusted accuracies (> 85%; average of sensitivity and specificity) than those reported in [5] (no greater than 75%). Patients were similarly aged and all had neurogenic dysphagia. Similar to the present study, the authors in [5] considered any entry into the airway as unsafe swaloowing. However, some key differences between the studies are worthy of mention. The present study had a slightly larger sample size, a better balance between males and females ([5] almost exclusively had males), and most importantly, a more significant representation of unsafe swallows (73% of total swallows compared to only 13% in [5]). Arguably, vibration patterns of pathological swallows vary more widely than those of safe swallows and hence a more comprehensive representation of the former may be welljustified.
Generally, the reputationbased classification scheme mitigates the risk of the overall classifier performance being unduly affected by a poorly performing component classifier within a multiclassifier system. Additionally, as exemplified in this study, the dimensionality of individual classifiers can be minimized, reducing the demand for voluminous training data.
4.4 Limitations
The dualaxes classifier attained very high sensitivity but modest specificity. In part, this bias towards higher sensitivity may be attributable to the preponderance of unsafe swallow examples in the original data set, despite our efforts to balance the classes via bootstrapping. In a practical system, it would mean that the classifier may overzealously flag a safe swallow as unsafe. This class imbalance issue may be a limitation of studying patients referred to videofluoroscopy, the majority of whom likely have a greater propensity for problematic swallowing. Hence, to obtain a larger number of safe swallows, a significantly expanded sample of patients may need to be recruited in the future.
The reputation classifier assumes independent features. This constrains the admissible features, but [12] has argued that many SI and AP features have low correlations. Future work may invoke independent component analysis or principal component analysis to generate additional novel independent features. The present classifier relies on static reputation values. In clinical application, the classifier may be trained and tested at different times with different patients. As a consequence, the feature distributions may change over time. In such case, dynamic reputation values may be more appropriate and future work may consider an online approach to dynamically update classifier reputations.
5 Conclusion
This study has demonstrated the potential for automatic discrimination between safe and unsafe (without airway clearance) swallows on the basis of a selected subset of time, frequency and information theoretic features derived from noninvasive, dualaxis accelerometric measurements at the level of the cricoid cartilage. Dualaxis classification was more accurate than singleaxis classification. The reputationbased classifier internally represented unsafe swallows as those with lower mean acceleration, lower range of acceleration, higher spectral centroid, slower autocorrelation decay and weaker acceleration in the superior direction. Our results suggest that reputationbased classification of dualaxis swallowing accelerometry from adult stroke patients deserves further consideration as a clinical informatic in the management of swallowing disorders.
References
 1.
Logemann J: Evaluation and treatment of swallowing disorders. ProEd, Austin, TX 1997.
 2.
Miller A: The neuroscientific principles of swallowing and dysphagia. Singular Publishing Group, San Diego 1999.
 3.
Ding R, Logemann J: Pneumonia in stroke patients: a retrospective study. Dysphagia 2010, 15: 51–57.
 4.
Tabaee A, Johnson P, Gartner C, Kalwerisky K, Desloge R, Stewart M: Patientcontrolled comparison of flexible endoscopic evaluation of swallowing with sensory testing (FEESST). The Laryngoscope 2006, 116: 821–825. 10.1097/01.mlg.0000214670.40604.45
 5.
Lee J, Steele C, Chau T: Classification of healthy and abnormal swallows based on accelerometry and nasal airflow signals. Artificial Intelligence in Medicine 2011, 52: 17–25. 10.1016/j.artmed.2011.03.002
 6.
Cichero J, Murdoch B: The physiologic cause of swallowing sounds: answers from heart sounds and vocal tract acoustics. Dysphagia 1998, 13: 39–52. 10.1007/PL00009548
 7.
Sejdić E, Falk T, Steele C, Chau T: Vocalization removal for improved automatic segmentation of dualaxis swallowing accelerometry signals. Medical Engineering & Physics 2010, 32(6):668–672. 10.1016/j.medengphy.2010.04.008
 8.
Redy N, Katakam A, Gupta V, Unnikrishnan R, Narayanan J, Canilang E: Measurements of acceleration during videofluoroscopic evaluation of dysphagic patients. Medical Engineering & Physics 2000, 22(6):405–412. 10.1016/S13504533(00)000473
 9.
Zoratto D, Chau T, Steele C: Hyolaryngeal excursion as the physiological source of swallowing accelerometry signals. Physiological Measurement 2010, 31(6):843–855. 10.1088/09673334/31/6/008
 10.
Das A, Reddy N, Narayanan J: Hybrid fuzzy logic committee neural networks for recognition of swallow acceleration signals. Computer Methods and Programs in Biomedicine 2001, 64: 87–99. 10.1016/S01692607(00)000997
 11.
Lee J, Blain S, Casas M, Berall G, Kenny D, Chau T: A radial basis classifier for the automatic detetion of aspiration in children with dysphagia. Journal of Neuroengineering and Rehabilitation 2006, 3(14):1–17.
 12.
Lee J, Steele C, Chau T: Time and timefrequency characterization of dualaxis swallowing accelerometry signals. Physiological Measurement 2008, 29(9):1105–1120. 10.1088/09673334/29/9/008
 13.
Lee J, Sejdić E, Steele C, Chau T: Effects of liquid stimuli on dualaxis swallowing accelerometry signals in a healthy population. Biomedical Engineering OnLine 2010, 9(7):10.
 14.
Hanna F, Molfenter S, Cliffe R, Chau T, Steele C: Anthropometric and demographic correlates of dualaxis swallowing accelerometry signal characteristics: a canonical correlation analysis. Dysphagia 2010, 25(2):94–103. 10.1007/s0045500992299
 15.
Sejdić E, Komisar V, Steele C, Chau T: Baseline characteristics of dualaxis cervical accelerometry signals. Annals of Biomedical Engineering 2010, 38(3):1048–1059. 10.1007/s104390099874z
 16.
Lee J, Steele C, Chau T: Swallow segmentation with artificial neural networks and multisensor fusion. Medical Engineering & Physics 2009, 31(9):1049–1055. 10.1016/j.medengphy.2009.07.001
 17.
Jain A, Duin R, Mao J: Statsitical Pattern Recognition: A Review. IEEE Transactions on Pattern Analysis and Machine Intelligence 2000, 22: 4–37. 10.1109/34.824819
 18.
Steele C, Sejdić E, Chau T: NDualaxis cervical accelerometry for aspiration and dysphagia identification. Poster presentation. In 19th Annual Dysphagia Research Society Meeting, Volume Under Review. San Antonio, TX; 2011.
 19.
Orović I, Stanković S, Chau T, Steele C, Sejdić E: Timefrequency analysis and Hermite projection method applied to swallowing accelerometry signals. EURASIP Journal of Advances in Signal Processing 2010, 2010(article ID 323125):7.
 20.
Sejdić E, Steele C, Chau T: A procedure for denoising dualaxis accelerometry signals. Physiological Measurement 2010, 31: N1N9. 10.1088/09673334/31/1/N01
 21.
Rosenbek J, Robbins J, Roecker E, Coyle J, Woods J: A penetrationaspiration scale. Dysphagia 1996, 11(2):93–98. 10.1007/BF00417897
 22.
Porta A, Guzzetti S, Montano N, Furlan R, Pagani M, Malliani A, Cerutti S: Entropy, entropy rate, and pattern classification as tools to typify complexity in short heart period variability series. IEEE Transactions on Biomedical Engineering 2001, 48(11):1282–1291. 10.1109/10.959324
 23.
Lempel A, Ziv J: On the complexity of finite sequences. IEEE Transactions on Information Theory 1976, 22: 75–81. 10.1109/TIT.1976.1055501
 24.
Ciszkowski T, Dunajewski I, Kotulski Z: Reputation as optimality measure in wireless sensor networkbased monitoring systems. Probabilistic Engineering Mechanics 2011, 26: 67–75. 10.1016/j.probengmech.2010.06.009
 25.
Tajeddine A, Kayssi A, Chehab A, Artail H: Fuzzy reputationbased trust model. Applied Soft Computing 2011, 11: 345–355. 10.1016/j.asoc.2009.11.025
 26.
Tseng Y, Chen F: A freerider aware reputation system for peertopeer filesharing networks. Expert Systems with Applications 2011, 38(3):2432–2440. 10.1016/j.eswa.2010.08.032
 27.
Lin T, Li H, Tsai K: Implementing the Fisher's Discriminant Ratio in a kMeans Clustering Algorithm for Feature Selection and Data Set Trimming. Journal of Chemical Information and Computer Science 2004, 44: 76–87. 10.1021/ci030295a
 28.
Efron B, Tibshirani R: An Introduction to the Bootstrap. Boca Raton, FL: CRC Press; 1994.
 29.
Duda R, Hart P, Stork D: Pattern Classification. 2nd edition. WileyInterscience; 2000.
 30.
Xu L, Kryzak A, Suen C: Methods of combining multiple classifiers and their applications to handwriting recognition. IEEE Transactions on Systems, Man and Cybernetics 1992, 22(3):418–435. 10.1109/21.155943
 31.
Kim Y, McCullough G: Maximal hyoid displacement in normal swallowing. Dysphagia 2008, 23(3):274–279. 10.1007/s004550079135y
Acknowledgements
This research was supported in part through funding from the Ontario Graduate Scholarship program, the Canada Research Chairs Program and the Natural Sciences and Engineering Research Council of Canada.
Author information
Affiliations
Corresponding author
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors' contributions
MSN proposed and mathemathically formulated the static reputationbased algorithm, implemented the proposed algorithm and applied it to the problem of dysphagia detection, and wrote the entire manuscript. CS designed and oversaw the data collection protocol and critically reviewed the manuscript. ES helped in data collection, carried out swallow segmentation, and programmed some of the denoising methods. TC supervised this work and revised various versions of the manuscript. All authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Nikjoo, M.S., Steele, C.M., Sejdić, E. et al. Automatic discrimination between safe and unsafe swallowing using a reputationbased classifier. BioMed Eng OnLine 10, 100 (2011). https://doi.org/10.1186/1475925X10100
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/1475925X10100
Keywords
 Support Vector Machine
 Eosinophilic Esophagitis
 Entropy Rate
 Spectral Centroid
 Neurogenic Dysphagia