 Research
 Open Access
 Published:
A Novel Segmentation, Mutual Information Network Framework for EEG Analysis of Motor Tasks
BioMedical Engineering OnLine volume 8, Article number: 9 (2009)
Abstract
Background
Monitoring the functional connectivity between brain regions is becoming increasingly important in elucidating brain functionality in normal and disease states. Current methods of detecting networks in the recorded electroencephalogram (EEG) such as correlation and coherence are limited by the fact that they assume stationarity of the relationship between channels, and rely on linear dependencies. In contrast to diseases of the brain cortex (e.g. Alzheimer's disease), with motor disorders such as Parkinson's disease (PD) the EEG abnormalities are most apparent during performance of dynamic motor tasks, but this makes the stationarity assumption untenable.
Methods
We therefore propose a novel EEG segmentation method based on the temporal dynamics of the crossspectrogram of the computed Independent Components (ICs). We then utilize mutual information (MI) as the metric for determining also nonlinear statistical dependencies between EEG channels. Graphical theoretical analysis is then applied to the derived MI networks. The method was applied to EEG data recorded from six normal subjects and seven PD subjects off medication. Oneway analysis of variance (ANOVA) tests demonstrated statistically significant difference in the connectivity patterns between groups.
Results
The results suggested that PD subjects are unable to independently recruit different areas of the brain while performing simultaneous tasks compared to individual tasks, but instead they attempt to recruit disparate clusters of synchronous activity to maintain behavioral performance.
Conclusion
The proposed segmentation/MI network method appears to be a promising approach for analyzing the EEG recorded during dynamic behaviors.
Background
Connectivity between brain regions is important for normal brain functioning, and may be impaired in many neurological diseases [1]. The electroencephalogram (EEG), with its excellent temporal resolution (~1 msec), is the most widely available technology used for inferring transient synchronization between brain regions. Both linear and nonlinear measures have been applied to assess the interdependencies between EEG channels [2]. For example, coherence and correlation methods [3, 4], which measure the dependencies between a pair of EEG signals in the frequency and time domains respectively, have been applied to the EEG to study the cortical synchrony that can be modulated as a function of task, and may systematically differ between normal and disease groups [5, 6]. Nevertheless, these measures consider only linear dependencies and may be particularly sensitive to outliers. Other methods may also be used to investigate both linear and nonlinear relationships between multivariate time series in the EEG, such as the Synchronization Likelihood (SL), but this and related methods assume a fixed phase relationship between time series [7]. However, in some diseases such as PD, transient phaselocked behavior between different parts of the motor system may be interrupted by "phase slips" [8] making the assumption of prolonged periods of phase synchrony potentially unsuitable.
An alternative to the linear methods of coherence and correlation and phase synchronization is to consider the mutual information (MI) between channels within a specified window. This enables estimation of both the linear and nonlinear statistical dependencies between time series and can be used to detect functional coupling. MI is a statistical technique that quantifies the information transmitted from one time series to another, with maximum value when two time series are the same and a value of zero if two time series are statistically independent. Previously, researchers have utilized MI as a suitable metric to investigate EEG coupling in various pathological conditions [9–11]. For example, by estimating the MI between the time series of multiple pairs of EEG channels, Jeong et al. demonstrated abnormalities in the information transmission between different cortical areas in Alzheimer's patients [9]. Similar studies have used MI as a marker for corticocortical connections in schizophrenic patients [10] and odor stimulation [11].
Another disease where altered connectivity may be important is Parkinson's disease (PD), a movement disorder that is characterized by muscle rigidity, tremor, and bradykinesia (slowing of physical movement). These symptoms do not reflect a primary failure of the cortex (making resting EEG less likely to be abnormal), but rather the effects of failure of the basal ganglia to prime the cortex for preparation and execution of movement. As a result, PD patients have a difficult time performing simultaneous movements compared to normal subjects [12, 13]. In order to assess the indirect effects of basal ganglia dysfunction on the cortex in PD, it is necessary to have subjects perform a motor task. Furthermore, stressing the motor system by having PD subjects performing simultaneous movements is more likely to induce abnormalities in the recorded EEG.
However, as soon as a subject performs a dynamic motor task, the nonstationarity nature of the EEG must be considered [14]. The nonstationarity likely reflects the switching of inherently metastable states of neural assemblies during task performance causing abrupt transitions. The nonstationary property of EEG suggests that techniques assuming stationarity may result in misleading interpretations. To address this concern, a conventional approach is to incorporate a sliding time window into the original signal models, and assume that the stationarity assumption is valid for the segment of data in the window. However, the selection of an appropriate (possibly timevarying) window length is nontrivial and could have a significant effect on the analysis results.
In order to obtain quasistationary segments in EEG signals and select taskrelated segments, we first propose a novel segmentation method of the EEG based on the temporal dynamics of the crossspectrogram of the Independent Components (ICs), and then compute the MI between channels within the temporallysegmented regions. We then apply graph theoretical analysis to the network of each group defined by edges whose MI values exceed a suitable threshold, and compute the clustering coefficient (C) and shortest path length (L) [15, 16]. In order to accommodate the magnitude of MI values above a given threshold, the intragroup (ie. task) and intergroup (ie. subject groups) network differences are further analyzed by oneway analysis of variance (ANOVA).
For motor tasks, changes in the EEG are most likely related to eventrelated synchronization/desynchronization (ERS/ERD), particularly in the beta band [17]. In selfpaced movements, ERD corresponds to changes in coherence between brain regions [18]. Thus we suggest the use of transient synchronization of between ICs as suitable markers for segmentation [19, 20]. We emphasize that the concept of "taskrelated sections" is flexible and may be dependent upon the underlying behavioral paradigm subjects are asked to perform. For example, if a subject was asked to push a button every 10 seconds, then transient synchronization of ICs occurring approximately every 10 seconds may be a suitable marker for segmentation. Here we demonstrate the proposed segmentation method in a less obvious situation: ongoing modulation of continual manual force production.
To our knowledge, this is the first application of using transient synchronization of ICs for temporal segmentation of time series. Also, it is the first application of joint MI and network analysis to assess information transmission abnormalities between different cortical areas in PD. The main contributions of this paper are as follows:

Propose a novel EEG segmentation approach to address the nonstationary nature of EEG data especially during performance of motor tasks, and to select taskrelated segments.

Present a coherent MIbased network analysis framework for modeling EEG to determine dependencies between EEG channels and infer statistically significant difference between groups.

Demonstrate how the proposed framework can be used for assessing the EEG during dynamic motor behaviors in pathological conditions, such as PD.
The paper is organized as follows: the detailed discussion of the proposed framework is presented in the Methods section. The Results and Discussion section describes the EEG experimental design and summarizes the results in a real case study of PD. Finally, we summarize and conclude the paper in Conclusions.
Methods
We recorded five minutes of EEG data while subjects performed simple hand movements in order to gain insight into the difficulty that PD subjects face when performing simultaneous movements.
Preprocessing
Fig. 1 presents the flowchart diagram of the steps of EEG preprocessing. The preprocessing steps include bandpass filtering to focus on the frequency range of clinical interest, Independent Component Analysis (ICA) for artifact removal in EEG, and EEG segmentation based on the crossspectrogram of the ICs to address the nonstationary nature of EEG data and to select taskrelated segments. At the beginning of each step, we briefly describe the motivation for selecting the respective method.
Bandpass Filtering
EEG data contain a wide range of frequency components, many of which are not of clinical or physiological interest. The data are therefore initially bandpassfiltered by a 4th order Butterworth filter between 0.5–55 Hz [21].
ICA Noise Removal
ICA has been proven capable of isolating both artifactual and neurally generated EEG sources [22]. As various contaminants of EEG recordings such as eye movements, eye blinks, cardiac signals, muscle contamination, etc., can be considered temporally independent from ongoing brain activity, ICA is a popular class of methods for EEG denoising and artifact removal in EEG. ICA decomposes mixtures of time courses into a sum of temporally statistical maximally independent components. The EEG measurements from the scalp, x = {x_{1}(t),..., x_{ N }(t)}, are mixtures of the source signals, s = {s_{1}(t),..., s_{ N }(t)}, where N is the number of EEG channels and t denotes the time. The task of ICA is to recover a version, u, of the original sources, s, by finding an unmixing matrix, W, specifying spatial filters that invert the mixing process linearly, as
Here the infomaxICA algorithm [23] is applied to decompose the EEG signals. ICA finds a coordinate frame in which the data projections have minimal temporal overlap by minimizing the mutual information among the data projections or maximizing the joint entropy of a nonlinear function of s. It is most appropriate to perform ICA decomposition on sources that are linearly mixed in the recorded signals without time delays. After the artifactual sources are identified, the corresponding columns of the mixture matrix (i.e. calculated as the pseudoinverse of W) that multiply the artifactual sources are set to zero to eliminate the artifacts and thus obtain the "corrected" EEG signal. In our study, we only remove wellknown, obvious artifacts by identifying 1 to 2 components (e.g. representing eyemovements and/or electrocardiac signals) by visual inspection. Failure to remove these artifacts may result in correspondence between EEG channels being falsely attributed to synchronized brain activity.
EEG Segmentation based on the CrossSpectrogram
Since the EEG contains much background brain activity that may be unrelated to the motor task being performed, it is necessary to segment the data into taskrelevant sections. The taskrelevant sections are those segments of the EEG data that correspond to the underlying experimental motor task being performed by the subject. The motor task here is designed to target the relative difficulties PD subjects have with performance of simultaneous movement compared. The experimental design is explained in detail in the section Results and DiscussionA. Another purpose of segmentation is to address the nonstationary nature of EEG data [14] by achieving local stationarity. Since final motor output is dependent upon cortical, subcortical, brainstem and spinal circuits yet the EEG measures only cortical activity, segmentation of the EEG based on behavioral data alone may be misleading (Fig. 2). We therefore propose segmenting the EEG based on the cross spectrum of taskrelated ICs.
We note that if the derived ICs were truly independent, then the crossspectrum would not be significant. However, in real data many of the assumptions of ICA are violated. The data are not stationary, and the time courses are not temporally white. By using infomaxICA, which does not incorporate time delays, the derived components will be maximally independent at zero lag. As such, it will deal with the problem of volume conduction – where a deep electrical source may impart common electrical activity to two or more channels. Even though ICs are maximally independent over the whole time range, they may exhibit partial synchronization within specific time/frequency window [24], through which the transient coupling of neural networks might be revealed. By examining the ICs within a short moving window, the nonstationary nature of the EEG will be explored, and significant dependencies between ICs become apparent. Recent studies such as [19, 20] have also explored the transient synchrony between ICs and suggested transient correlation between ICs.
The ICs are thus transformed into timefrequency domain and the crossspectrogram of every pair of ICs is computed. The frequency contents are computed by cross power spectral density using the Welch's averaged, modified periodogram [25] method of spectral estimation. Suppose {x(k)} and {y(k)} are real sequences with length N normalized to zero mean and unit variance, their crosscorrelation sequence is defined as:
The cross spectral density function S_{ xy }(f) is the Fourier transform of the crosscorrelation sequence {R_{ xy }(m)}, expressed as,
with f being the frequency normalized by the sampling frequency. By using a shortterm time shifting window, we are able to obtain localized frequency contents of the two signals and their relationship with respect to time and frequency. The crossspectrum is computed based on a short (3 s) time window shifted by 0.5 s to obtain the localized time information. Power in the higher frequency ranges, such as the gamma band (> 20 Hz) are more likely to distributed over a broad frequency range. To avoid any potential confounds from the AC current at 60 Hz, we look for sharp increases in activity in the range 45 Hz–55 Hz as a good marker for transient broadband artifacts that were not eliminated by the ICA Noise Removal step. In contrast, by examining the crossspectrogram of pairs of ICs within the lower frequency bands of physiologic interest, we can identify taskrelated segments.
We compared the proposed approach of examining the crossspectra between ICs to a commonly used approach, Hidden Markov Models (HMM), a probabilistically tractable and robust way of modeling the dynamic changes of state. We coupled the HMM framework with multivariate Autoregressive (mAR) models, an approach that has been previously suggested for examining nonstationary multivariate electrophysiological signals [26, 27]. Our results suggest that the proposed scheme provides a more reasonable segmentation performance (Fig. 3). In contrast, the HMMmAR technique resulted in rapidly cycling between states with no discernable relationship to the behavioral data (Fig. 3). Thus, after appropriate segmentation, the EEG sections were then concatenated and a mutual information network was derived.
Mutual Information based Network
MI measures the mutual dependence or information gained about one signal from another. The detailed derivation and background of information theory can be found in [28]. Given two random variables X and Y, the pairwise MI is defined as
where P_{ X }(x) is the probability that x is drawn from X and P_{ XY }(x, y) is the joint probability density function for the measurements of X and Y that produce the values x and y.
MI quantifies the amount of information about X that Y contains. It is a symmetric function meaning I(X, Y) = I(Y, X). A MI at zero means that Y does not contain additional information about X, because P_{ XY }(x, y) factorizes to P_{ X }(x)P_{ Y }(y) resulting in MI being zero. On the contrary, the higher the MI between two signals, the more information they contain about each other. Hence, the higher MI, the more likely that the two signals are biologically related. MI is estimated from a finite number of samples, the probability densities, P_{ X }(x) and P_{ XY }(x, y), are approximated by histogram (using bin size of 20). For a fair comparison across subjects, in our paper we have used the relative MI as
where I_{ r }(X, Y) is in the range [0, 1], and H(X) and H(Y) are the entropies. Entropy H(X), defined asΣ_{ x }P_{ X }(x)log_{2}P_{ X }(x), is regarded as a measure of uncertainty about a random variable X.
In our study, the data are separated into 4 second epochs for MI computation in order to increase the sample size as well as to enhance the stationarity and consistency of the MI estimates. Preliminary work of varying the length of the epochs suggested that 4s epochs give a more Gaussian distribution (Fig. 4(a), (b)).
Network Analysis
In order to graphically represent a large set of data, we have derived both a relevance network [29] and an MI network. A relevance network, originally devised for graphically depicting the relationship between genes [29], can be generalized to take large data sets of experimental data and graphically depict the result of pairwise MI. It is obtained by applying a threshold and only the connections that are above the threshold are displayed in the network. The relevance network can then be used in graphical theoretical analysis (discussed in Methods – Network Analysis).
In addition, we have also taken into account the magnitude of MI values and obtained an MI network from the oneway ANOVA test (with details given in Methods – Network Analysis (Statistical Analysis)). The connections are established in the MI network if their MI values exceed a specified threshold and the ANOVA tests indicate significantly different values between groups.
Graphical Theoretical Analysis on Relevance Network
Graph theoretical analysis is applied to the MI matrices of all possible pairwise combinations of EEG channels. The resulting MI matrices are converted to binary relevance networks/graphs by applying a threshold. Graphs are characterized by a cluster coefficient, C, and a characteristic shortest path length, L. A graph G = (V, E), consisting of a set of vertices V (channels) and a set of edges E (connections) between the vertices, is a basic representation of a network. An edge e_{ ij }connects vertex i with vertex j. The neighborhood N_{ i }for a vertex v_{ i }is defined as its immediately connected vertex neighbors. The graph degree k_{ i }of vertex i is the number of edges linking vertex i to its neighbors. The cluster coefficient C_{ i }for a vertex is thus defined as the ratio of the number of edges between the neighbors of vertex i and the maximum possible number of edges between k_{ i }neighbors of vertex i. It is defined as
where . means the number of edges included in {e_{ jk }}. The cluster coefficient C of a graph (the whole system) is defined as the mean cluster coefficient:
with n being the total number of vertices in the graph. Such C measures the local connectivity and ranges from 0 to 1. The higher the C, the greater the intensity of connections within a cluster.
The L of a graph is the mean of all shortest paths (shortest distance) connecting all pairs of vertices. It has a value greater than 1 and measures the global connectivity of the graph. A detail graphical explanation of a graph and graph theoretical measures can be found in [16].
We computed the C and L of a graph as a function of a threshold, T, ranging from 0.01 to 0.3 with an increment of 0.002 to determine the graph differences between the two groups. Since the results might be biased by the mean level of MI, the graph degree, K, defined as the average number of edges per vertex, may be a more suitable measure. However, since the relationship between K and T were almost identical between groups (Fig. 5) any differences in C and/or L at the same level of T would reflect the actual differences in graph organization.
Statistical Analysis
In addition to the graphical theoretic analysis on the thresholded matrices, we created an "MI network" which also incorporated the magnitude of the MI values. As before, MI values were first thresholded by zeroing values less than the 95th percentile on a null distribution. A null distribution was obtained by repeatedly (n = 100) randomly permuting the order of the second signal and computing the pairwise MI based on them [30]. The MI differences between segments are analyzed by oneway ANOVA with subject number, groups, and tasks as factors [31]. The connections between any two channels are established for the MI network if they are significantly different according to the ANOVA test and have magnitudes that are greater than the permutation threshold. The normality of the distribution of the MI values is verified by the KolmogorovSmirnov (KS) test [31].
In the oneway ANOVA test for each pairwise MI I(X, Y), the effect of a factor (e.g. Group) is tested, by comparing with the Ftest the variance of I(X, Y) explained by the factor against the variance of the residuals. Consequently, a pvalue was calculated for each possible connection in the MI network. To account for the effect of testing multiple connections simultaneously, the pvalues are corrected for multiple hypothesis testing using Storey's positivefalsediscoveryrate (pFDR) procedure [32] which computes a qvalue, the expected ratio of falsely rejected hypotheses among all those being rejected. Connections whose qvalues were smaller than 5% are considered statistically significant.
Because we are more interested in the connection with greater MI values, the permutation test is used in conjunction with the ANOVA test to select the relevant features for the MI based network. We have chosen the largest observed value of the permutation test as our threshold. A connection is thus established in the MI network if the MI values are significantly different based on the qvalue and are above the maximum observation from the permutation test.
Results and Discussion
Subjects and Experiment Design
All research was approved by the University of British Columbia Ethics Board. After giving informed consent, seven PD and six agematched control subjects volunteered to participate in the study. All patients were diagnosed with mild to moderately severe PD (Hoehn and Yahr stage 1–3) [33]. The control subjects were confirmed to be without active neurological disorders by a qualified neurologist. All patients were taken off LDopa medication after overnight withdrawal of at least 12 hours.
Subjects were asked to hold a custombuilt rubber squeeze bulb in their right hand with their arm stabilized. All subjects had their maximum voluntary contraction (MVC) tested at the start of the experiment and all subsequent forces were scaled accordingly. Subjects were instructed to control an "inflatable" ring as shown as the horizontal bar in Fig. 6 by squeezing the bulb. The ring must move through an undulating tunnel without touching the sides. The required pressure was between 5–15% MVC in order to successfully avoid the sides of the tunnel. Two fiveminute trials were performed by both normal subjects (N) and PD subjects off medication (PD). During one trial, subjects were asked to squeeze the bulb (SQ) with right hand alone. In another trial, subjects were asked to squeeze the bulb exactly as before, but in addition press a mouse button intermittently with their left hand when they observed a color change in the ring. We were particularly interested in how subjects fared when they were required to do both movements simultaneously (BO) compared to the SQ condition (Fig. 6), as clinically it is observed that PD patients have difficulty performing simultaneous movements.
EEG Data Preprocessing
Subjects wore an electrode cap (ElectroCap International, Eaton, OH) which contained 19 channels and a common mastoid reference (as shown in Fig. 7). The data were collected using a Ceegraph Netlink system from BioLogic Systems (Illinois), sampled at 128 Hz and bandpassfiltered between 0.5–55 Hz. After decomposition into temporally independent components with infomax ICA [23], the obviously artifactual components (typically 1 to 2 for each dataset) were screened and removed from the data by visual inspection. Subsequently, two operations are done on the crossspectrogram of the ICA components: noisy EEG segment removal and taskrelated EEG segmentation.
Noisy EEG Segment Removal
As can be seen in Fig. 8, the broadband artifact right around 23 s of the crossspectrogram of the ICs reflects a segment in the actual EEG data that is corrupted by noise, and was eliminated by examining the cross spectra of ICs in the 45–55 Hz range.
TaskRelated EEG Segmentation
The amount that subjects were asked to squeeze was based on two sinusoids with period of 10 and 18 seconds. For the BO condition, the color change occurred every 20 seconds. Therefore, autocorrelations of the crossspectrogram of the ICs over the three physiologicallyrelevant frequency bands [21] 5–8 Hz (Theta), 8–12 Hz (Alpha), 12–30 Hz (Beta) that have a peak at 10 seconds or 18–20 seconds are selected for sections of EEG segmentation. Depending on the features of each dataset, approximately five pairs were chosen for each task. Only segments that are above the mean plus the mean absolute deviation are considered as taskrelated and obtained for further analysis.
To demonstrate the effectiveness of the segmentation method, the power spectral densities (PSDs) of the taskrelated EEG segments and the nontaskrelated EEG segments (ie. the ones that are not selected by segmentation for further analysis) were determined. We chose one channel (ie. channel 19O2) that contained the most number of significant connections (from the network analysis results in Statistical Analysis on MI Network) and investigated its PSD as shown in Fig. 9. The red solid lines are the PSDs of the taskrelated EEG segments and the grey dashed lines are the PSDs of the nontaskrelated segments. Fig. 9 clearly illustrates that the variance of the nontaskrelated segment PSDs is much greater than the one of the taskrelated segment PSDs. This reassures us that the proposed segmentation procedure has segmented out similar sections of the EEG, and since the segmentation was based on taskrelated ICS, the segmented sections correspond to taskrelated parts of the EEG.
Mutual Information based Network Analysis
To assess the importance of nonlinear dependencies, which are captured by the proposed MI method, we first linearly decorrelated our data and examined for residual MI values. As one example shown in Fig. 10, we note that there remains dependencies between EEG channels 16T6 and channel 19O2 after linear decorrelation, suggesting that MI is a suitable metric to incorporate both linear and nonlinear interactions between EEG channels to derive a more accurate network. Here to make a fair comparison, for each case we derived a nullhypothesis distribution with mean μ and standard deviation σ from permutations, and then the normalized MI is calculated as for the scatter plot in Fig. 10.
Graphical Theoretical Analysis Applied to the Relevance Network
The MI matrix for each subject is converted to a graph separately, and the means of the cluster coefficient C and the shortest path length L of the graph within the group (ie. NSQ, PDSQ, NBO, PDBO) were computed as a function of the threshold T.
As T is varied from 0.01 to 0.3, the graphs started to break into subgraphs. In addition, at higher T, some subjects start to have empty graphs meaning the graphs contain no connection at all. Therefore, when we interpret the results, we need to make note of where those points are and they are summarized in Table 1. Again, we see that the points between N and PD do not differ much because their means are very close. The overall mean C of the graph for each group as a function of T was computed and compared. Because the means of the four groups (ie. NSQ, PDSQ, NBO, PDBO) as a function of T follow the same pattern and we are more interested in the differences between groups, the deviation from the overall mean of the four groups as a function of T is illustrated at the top panel of Fig. 11. The overall mean of the four group as a function of T is displayed at the bottom left corner of the top panel. The bottom panel shows the region that is significantly different between groups (at 2: N vs. PD during SQ; at 3: N vs. PD during BO; at 4: SQ vs. BO for N; at 5: SQ vs. BO for PD). The significance of the C between groups is tested with the pairwise ttest (p < 0.05). In general, the intensity of connections within the cluster does not differ significantly between tasks (SQ and BO) within a group. The intensity of connections within the cluster is higher for PD compared to N for SQ task and BO task, especially at higher threshold T (T > 0.15). The consistently lower C seen in N compared to PD across all frequency bands probably reflects the widespread, excessive synchronization seen in PD [34, 35]. Unlike prior studies that have emphasized synchronization in the beta band, we have observed excessive synchronization in all bands, especially the theta band.
The top panel of Fig. 12 shows the deviation of L from the overall mean of the four groups as a function of the threshold T. The overall mean of the four groups is presented at the top left corner of the top panel. The region that is significantly different between groups (denoted at 2: N vs. PD during SQ; at 3: N vs. PD during BO; at 4: SQ vs. BO for N; at 5: SQ vs. BO for PD) is shown at the bottom panel. The L of the N group is quite a bit larger than that of the PD group for a threshold between 0.1 to 0.25, but L of PD becomes quite a bit larger than that of N after that. However, the magnitude of L must be interpreted with caution as the region of significance lies in where subgraphs emerge. Overall, the observation that PD subjects are with larger C and shorter L compared to N subjects suggests that the graphs for the PD group are more broken up into small, tightly connected clusters.
In order to gain insight into the location distribution of the clusters, the graphical representations of the overall group mean clustering coefficient C for each vertex (i.e. EEG node) are illustrated in Fig. 13 and Fig. 14 for SQ and BO task, respectively. As mentioned earlier, we study three physiologicallyrelevant frequency bands: 5–8 Hz (Theta), 8–12 Hz (Alpha), 12–30 Hz (Beta). For both the N and PD groups, the images of the group mean clustering coefficient values of 19nodes are displayed using the jet colormap. In the figures, the node that is crossed out indicates that the corresponding mean C for the PD group is significantly greater than that of the N group. The node that is circled denotes that the corresponding mean C of that vertex for the N group is significantly greater than the one for the PD group. The significance between the mean C per vertex for the N and the PD groups is tested by pairwise Ttest (p < 0.05) and the details are shown in Table 2 and 3. Only the channels that have any significant difference between groups are shown in the tables. It can be seen that the PD group has more heavily connected clusters at the frontal and motor cortex of the brain over all frequencies. On the other hand, the N group has more heavily connected clusters at the posterior region of the brain over all frequencies.
Statistical Analysis on MI Network
To further demonstrate the importance of segmentation, we calculated MI networks for both segmented and unsegmented data. The majority of the connections indicates that the MI values based on the taskrelated segments are significantly greater than the ones based on the nontaskrelated segments for all networks. One example of the MI distribution of a significant connection is illustrated in Fig. 15. As implied from the networks, there is a shift in the mean of the MI values between the taskrelated segments and the nontaskrelated segments.
To investigate the effect of the disease, the MI networks were computed for the intragroup analysis (SQ vs. BO) and the intergroup analysis (N vs. PD). The graphical results at three different frequency bands for Normals and PD subjects are presented in Fig. 16. The solid lines denote that the MI values of the BO task are significantly greater than the ones for the SQ task and the dotted lines represent the converse condition. The results suggest that PD subjects are unable to independently recruit different areas of the brain while performing simultaneous tasks, but instead simultaneously recruit focal islands of increased synchrony.
We also investigated the results for intergroup analysis at three different frequencies (Fig. 17). We observe higher MI values in the frontal region at lower and medium frequency bands and motor cortex at higher frequency band in PD which coincide with the finding in the previous graphical theoretical analysis.
Conclusion
This paper proposed a novel segmentation, mutual information network framework for EEG connectivity analysis for subjects performing a motor task. The greatest EEG changes during motor performance are typically eventrelated synchronization/desynchronization: EEG responses are not phaselocked to motor performance, but rather tend to be associated with augmentation or attenuation of specific frequency bands. This means that standard methods of averaging the EEG timelocked to the motor performance will tend to be inaccurate, necessitating the use of alternate methods. In addition, ERS/ERD is typically investigated in univariate fashion, where each EEG channel is examined independently for altered localized neuronal synchrony resulting in changes in the frequency spectra at that channel. Here we have used the crossspectrum of ICs as a marker for segmentation. This has multiple benefits: first, consistent with ERS/ERD, it examines the data in the frequency domain, second, it allows the examination of multiple channels simultaneously, as each IC will consist of a linear weighting of all channels, and lastly, it will allow unmixing of the raw data so that task related activity can be extracted from ongoing background brain rhythms. After segmentation, we used mutual information to measure both linear and nonlinear dependencies, without assuming strict phase locking of signals. This allowed us to create a relevance network suitable for graphtheoretic analysis methods, and an MI network which further incorporated the magnitude of the MI at each channel pair to allow statistical analysis with ANOVA.
The proposed method provided several novel insights into abnormalities in PD subjects. The well known clinical observation of difficulty in performing simultaneous movements [12] appears to be related to an inability to recruit different brain areas independently. When normal subjects were asked to perform tasks with two hands compared to a task with one hand, there was no significant difference in the theta bands, and only mild changes in connectivity in the alpha and beta bands (Fig. 16). In contrast, when PD subjects attempted to perform simultaneous movements, they appeared unable to recruit different brain areas independently, resulting multiple areas of synchronization in the theta range, and also in the beta range. These results appear novel, as previous research has emphasized excessive synchronization in the beta range in PD [36].
Additionally, the widespread synchrony that is normally already seen during regular unimanual or bimanual performance was of a different form in PD (Figs. 11, 12, and 17). PD subjects tended to have higher cluster coefficients, C, and lower shortest path lengths, L suggesting focal clusters of synchronous activity. Taken together, these results suggest that normal subjects can synchronously activate broad areas or cortex independently in response to differing task demands. In contrast, PD subjects appear to have islands of hypersynchronicity that cannot be recruited independently. This is consistent with known biology of PD, where bradykinesia is most likely the result of "noisy" basal ganglia input to the frontal cortex and appears to critically depend upon dopamine depletion [37] as would be seen in the PD subjects off medication in this study. The higher MI values in the frontal region at lower and medium frequency bands and motor cortex at higher frequency in PD (Fig. 17) are also consistent with the cortical regions that receive output from the basal ganglia. Since the EEG is normally assumed to reflect cortical activity, isolation of clear abnormalities from PD subjects who have predominantly basal ganglia dysfunction and preserved cortical function is notable.
Although we suggest that our results demonstrate strong evidence to support the proposed framework as a tool to study EEG signals, there are nevertheless limitations of the proposed method. For example, currently only pairwise MI, one particular case of calculating the MI between M random variables, is investigated. In a more general presentation, the corresponding M–dimensional MI can be defined. Since pairwise independence does not necessarily imply global independence, M–dimensional MI may reveal additional information from that of pairwise MI, and thus may be a fruitful avenue to explore in the future for EEG analysis. Similarly, pairwise MI may suffer from a high false discovery rate, i.e. nodes are erroneously associated while in truth they only indirectly interact through one or more other nodes. Therefore, to prune the reconstructed network of such false positives, we can extend the current work by exploring the concept of conditional mutual information (CMI) instead. Also, in the current approach, we did not consider the temporal information embedded in the timeseries EEG data. As one future work, we intend to introduce temporality into the proposed MI network construction.
References
 1.
Rowe J, Stephan KE, Friston K, Frackowiak R, Lees A, Passingham R: Attention to action in Parkinson's disease: Impaired effective connectivity among frontal cortical regions. Brain 2002,125(2):276–289. 10.1093/brain/awf036
 2.
Perda E, Quiroga RQ, Bhattacharya J: Nonlinear multivariate analysis of neurophysiological signsl. Progress in Neurobiology 2005, 77: 1–37. 10.1016/j.pneurobio.2005.10.003
 3.
Shen B, Nadkarni M, Zappulla RA: Spectral modulation of cortical connections measured by EEG coherence in human. Clinical Neurophysiology 1999, 110: 115–125. 10.1016/S00134694(98)001047
 4.
Weiss S, Rappelsberger P: Longrange EEG synchronisation during word encoding correlates with successful memory performance. Cognitive Brain Ressearch 2000, 9: 299–312. 10.1016/S09266410(00)000112
 5.
Blandini F, Nappi G, Tassorelli C, Martignoni E: Functional changes of the basal ganglia circuitry in Parkinson's disease. Progress in Neurobiology 2000, 62: 63–88. 10.1016/S03010082(99)000672
 6.
Samuel M, CeballosBaumann AO, Blin J, Uema T, Boecker H, Passingham RE, Brooks DJ: Evidence for lateral premotor and parietal overactivity in Parkinson's disease during sequential and bimanual movements. A PET study. Brain 1997,120(6):963–976. 10.1093/brain/120.6.963
 7.
Stam CJ, van Dijk BW: Synchronization likelihood: an unbiased measure of generalized synchronization in multivariate data sets. Phys D 2002, 163: 236–251. 10.1016/S01672789(01)003864
 8.
Hurtado JM, Rubchinsky LL, Sigvardt KA, Wheelock VL, T PC: Temporal Evolution of Oscillations and Synchrony in GPi/Muscle Pairs in Parkinson's Disease. Journal of Neurophysiology 2005, 93: 1569–1584. 10.1152/jn.00829.2004
 9.
Jeong J, Gore JC, Peterson BS: Mutual information analysis of the EEG in patients with Alzheimer's disease. Clinical Neurophysiology 2001, 112: 827–835. 10.1016/S13882457(01)005132
 10.
Na SH, Jin SH, Kim SY, J HB: EEG in schizophrenic patients: mutual information analysis. Clinical Neurophysiology 2002, 113: 1954–1960. 10.1016/S13882457(02)001979
 11.
Min BC, Jin SH, Kang IH, Lee DH, Kang JK, Lee ST, Sakamoto K: Analysis of Mutual Information Content for EEG Responses to Odor Stimulation for Subjects Classified by Occupation. Chem Senses 2003, 28: 741–749. 10.1093/chemse/bjg066
 12.
Benecke R, Rothwell JC, Dick JP, Day BL, Marsden CD: Performance of simultaneous movements in patients with Parkinson's disease. Brain 1986, 109: 793–757. 10.1093/brain/109.4.739
 13.
Berardelli A, Rothwell JC, Thompson PD, Hallett M: Pathophysiology of bradykinesia in Parkinson's disease. Brain 2001, 124: 2131–2146. 10.1093/brain/124.11.2131
 14.
Kaplan AY, Fingelkurts AA, Fingelkurts AA, Borisov SV, Darkhovsky BS: Nonstationary nature of the brain activity as revealed by EEG/MEG: Metholdological, practical and conceptual challenges. Signal Processing 2005, 85: 2190–2212. 10.1016/j.sigpro.2005.07.010
 15.
Sporns O, Zwi JD: The small world of the cerebral cortex. Neuroinformatics 2004, 2: 145–162. 10.1385/NI:2:2:145
 16.
Stam CJ, Jones BF, Nolte G, Brekspear M, Scheltens p: Smallworld Networks and functional connectivity in Alzheimer's Disease. Cerebral Cortex 2007, 17: 92–99. 10.1093/cercor/bhj127
 17.
Rappelsberger P, Pfurtscheller G, Filz O: Calculation of eventrelated coherence A new method to study short lasting coupling between brain areas. Brain Topography 1994,7(2):121–127. 10.1007/BF01186770
 18.
Leocani L, Toro C, Manganotti P, Zhuang P, Hallett M: Eventrelated coherence and eventrelated desynchronization/synchronization in the 10 Hz and 20 Hz EEG during selfpaced movements. Electroencephalography and Clinical Neurophysiology/Evoked Potentials Section 2007,104(3):199–206. 10.1016/S01685597(96)960517
 19.
Gupta D, James CJ: Narrowband vs. Broadband Phase Synchronization Analysis Applied to Independent Components of Ictal and Interictal EEG. IEEEEMBS Proceedings 2007.
 20.
Hong B, Acharya S, Gao S, Thakor NV: Transient phase synchrony of independent cognitive components underlying scalp EEG. IEEEEMBS Proceedings 2005.
 21.
Pfurtscheller G, da Silva FHL: Eventrelated EEG/MEG synchronization and desynchronization: basic principles. Clinical Neurophysiology 1999, 110: 1842–1857. 10.1016/S13882457(99)001418
 22.
Jung TP, Makeig S, Humphries C, Lee TW, McKeown ML, Iragui V, Sejnowski TJ: Removing electroencephalographic artifacts by blind source separation. Psychophysiology 2000, 37: 163–178. 10.1017/S0048577200980259
 23.
Bell A, Sejnowski T: An informationmaximization approach to blind separation and blind deconvolution. Neural Computation 1995, 7: 1129–1159. 10.1162/neco.1995.7.6.1129
 24.
Delorme A, Makeig S: EEGLAB: an open source toolbox for analysis of singletrial EEG dynamics including independent component analysis. Journal of Neuroscience Methods 2004, 134: 9–21. 10.1016/j.jneumeth.2003.10.009
 25.
Oppenheim AV, Schafer RW: DiscreteTime Signal Processing. New Jersey: PrenticeHall; 1999:737.
 26.
Cassidy MJ, Brown P: Hidden Markov based autoregressive analysis of stationary and nonstationary electrophysiological signals for functional coupling studies. Journal of Neuroscience Methods 2002, 116: 35–53. 10.1016/S01650270(02)000262
 27.
Chiang J, Wang Z, McKeown MJ: A Hidden Markov, Multivariate Autoregressive (HMMmAR) Network Framework for Analysis of Surface EMG (sEMG) Data. IEEE Trans on Signal Processing 2008,56(8):4069–4081. 10.1109/TSP.2008.925246
 28.
Cover TM, Thomas JA: Elements of information theory. New York: John Willey & Sons; 1991.
 29.
Butte AJ, Kohane IS: Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements. Pacific Symposium on Biocomputing 2000, 5: 415–42.
 30.
Francois D, Wertz V, Verleysen M: The permutation test for feature selection by mutual information. ESANN Proceedings 2006, 26–28.
 31.
Hogg RV, Ledolter J: Engineering statistics. New York: MacMillan; 1987.
 32.
D SJ: A direct approach to false discovery rates. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2002,64(3):479–498. 10.1111/14679868.00346
 33.
Hoehn M, Yahr M: Parkinsonism: onset, progression and mortality. Neurology 1967, 17: 427–442.
 34.
Salenius S, Avikainen S, Kaakkola S, Hari R, Brown P: Defective cortical drive to muscle in Parkinson's disease and its improvement with levodopa. Brain 2002, 125: 491–500. 10.1093/brain/awf042
 35.
Timmermann L, Gross J, Dirks M, Volkmann J, Freund HJ, Schnitzler A: The cerebral oscillatory network of parkinsonian resting tremor. Brain 2003, 126: 199–212. 10.1093/brain/awg022
 36.
Chen C, Litvak V, Gilbertson Te: Excessive synchronization of basal ganglia neurons at 20 Hz slows movement in Parkinson's disease. Experimental Neurology 2007, 205: 214–221. 10.1016/j.expneurol.2007.01.027
 37.
RivlinEtzion M, Marmor O, Heimer G, Raz A, Nini A, Bergman H: Basal ganglia oscillations and pathophysiology of movement disorders. Current Opinion in Neurobiology 2006,16(6):629–637. 10.1016/j.conb.2006.10.002
Acknowledgements
This work was supported by the Canadian Natural Sciences and Engineering Research Council (NSERC) under grant STPGP 36520808, and by the CHRP grant.
Author information
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors' contributions
ZW proposed the main ideas, checked the method procedure, and participated in the data analysis and manuscript writing. PL participated in the data acquisition and algorithm implementation, analyzed the data, and drafted the manuscript. MM organized the study, provided the EEG data, and helped to interpret the results and draft the manuscript. All authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Wang, Z.J., Lee, P.W. & McKeown, M.J. A Novel Segmentation, Mutual Information Network Framework for EEG Analysis of Motor Tasks. BioMed Eng OnLine 8, 9 (2009). https://doi.org/10.1186/1475925X89
Received:
Accepted:
Published:
Keywords
 Mutual Information
 Independent Component Analysis
 Cluster Coefficient
 Short Path Length
 Simultaneous Movement