Decoding hand movement velocity from electroencephalogram signals during a drawing task

Background Decoding neural activities associated with limb movements is the key of motor prosthesis control. So far, most of these studies have been based on invasive approaches. Nevertheless, a few researchers have decoded kinematic parameters of single hand in non-invasive ways such as magnetoencephalogram (MEG) and electroencephalogram (EEG). Regarding these EEG studies, center-out reaching tasks have been employed. Yet whether hand velocity can be decoded using EEG recorded during a self-routed drawing task is unclear. Methods Here we collected whole-scalp EEG data of five subjects during a sequential 4-directional drawing task, and employed spatial filtering algorithms to extract the amplitude and power features of EEG in multiple frequency bands. From these features, we reconstructed hand movement velocity by Kalman filtering and a smoothing algorithm. Results The average Pearson correlation coefficients between the measured and the decoded velocities are 0.37 for the horizontal dimension and 0.24 for the vertical dimension. The channels on motor, posterior parietal and occipital areas are most involved for the decoding of hand velocity. By comparing the decoding performance of the features from different frequency bands, we found that not only slow potentials in 0.1-4 Hz band but also oscillatory rhythms in 24-28 Hz band may carry the information of hand velocity. Conclusions These results provide another support to neural control of motor prosthesis based on EEG signals and proper decoding methods.


Background
Brain-computer interface (BCI) is a system that translates brain signals reflecting user intentions into commands and drives external devices [1,2]. In the past decades, various BCI systems have been developed for the purpose of rehabilitation and medical care for the disabled patients [3][4][5][6][7]. Among them, researchers have particular interest in neuromotor prosthesis that moves an artificial limb by the brain signals which control the equivalent movement of a corresponding body part such as an arm or a hand [2]. To date, most progresses of these BCI systems have been based on invasive approaches using neuronal firing patterns [4,8,9], local field potentials (LFPs) [10,11] or electrocorticogram (ECoG) [12][13][14]. These signals inside head possess the advantages of little noise, high topographical resolution and broad bandwidth.
However, for applications on human being, invasive ways are seriously limited by questions about the safety and durability of implanted channels [15]. Some recent studies have demonstrated that brain signals recorded by non-invasive approaches also carry significant information of detailed limb movements. For instance, from magnetoencephalogram (MEG) signals, hand movement directions have been decoded in the discrete center-out reaching task [16]; hand positions have been decoded during the continuous joystick movements [17]; and hand velocities have been decoded during the discrete center-out drawing task [18], the target-to-target joystick movements [19] and the continuous trackball movements [20]. It has been reported that low frequency band (≤3 Hz or 2-5 Hz) MEG on motor-related areas is critically involved in representing limb movement direction and speed [16,20]. Moreover, long-distance coupling between primary motor cortex and multiple brain areas in the low frequency band has been found during a continuous visuomotor task [20]. And the neural mechanisms of speed and tau in pointing hand movement from MEG have been revealed (tau is defined as the ratio of the current distance-to-goal gap over the current instantaneous speed towards the goal) [19].
Compared with MEG, electroencephalogram (EEG) has lower signal-to-noise ratio and spatial resolution. It was generally thought that EEG could not extract sufficient information to reconstruct limb movements. However, EEG is easily available and more suitable for ambulatory prosthetic system [17,21]. Therefore, a few ambitious researchers have extended the exploration to EEG signals. For example, hand directions have been inferred from EEG recorded in a center-out joystick operation [16]. The subjects were constrained to small finger and wrist movements. Another study has been presented about the prediction of reaching target from EEG recorded in multijoint center-out movements [22]. Later, a movement delay paradigm was designed to investigate brain activities in the human posterior parietal cortex (PPC) during the planning of intended movements [23]. Newly, the positions, the velocities and the accelerations of hand movement were modestly decoded during a 3-D center-out reaching task [24,25]. As far as we know, most of these EEG studies employed a center-out movement task which contained pre-specified point-to-point movements. Specifically, the starting and end points were fixed, and the length of each movement was well constrained.
In our study, we designed a 2-D drawing task in which the subjects were required to move a pen at their own pace along a zigzag route in each trial (refer to Figure 1). This zigzag route was determined online by the subjects themselves. Specifically, this task can be regarded as sequential point-to-point movements. At each point the subjects selected one of the four directions, i.e., up, down, left and right. Moreover, the numbers and the positions of these points, and the distance between two sequential points were up to the subjects (not pre-specified). Thus the starting point, the end point and the length of each point-to-point movement were less restricted compared to the center-out task. During the experiment, multi-channel EEG activities from whole scalp were recorded. Then, independent component analysis (ICA) [26] was used to remove the effects of electrooculogram (EOG) and electromyogram (EMG) activities. After that, discriminative spatial pattern (DSP) filtering [27] and common spatial pattern (CSP) filtering [28] were employed to extract the amplitude features and the power features from the retained independent components (ICs) in multiple frequency bands. Then Kalman filtering and a smoothing algorithm [29] were applied to decode the hand movement velocity with these features. Furthermore, we investigated the scalp areas most involved for the decoding and evaluated the decoding performance of each frequency band.

Subjects and Recording System
Five right-handed healthy male subjects participated voluntarily in this study. Among them, subject 1 had been well trained in the BCI experiments of hand motor imageries, while the other subjects had less or never participated in any kind of BCI experiment before. These five subjects were instructed to move a pen (using their right wrist only and relaxing left hand on the lap) on the touch screen of a laptop in front of them. Meanwhile, the pen tracks denoting the trajectories of hand movements were recorded with a sampling rate of 64 Hz by the laptop. At the same time, a 40-channel EEG cap LT37 from Compumetics was used to collect EEG signals from the subjects. And a portable amplifier (NeuroScan NuAmps) amplified the analog EEG signals, digitalized them with a sampling rate of 250 Hz. The laptop received the EEG data from the amplifier through a USB port and sent synchronous stimulus code through parallel port to the amplifier.

Experimental Paradigm
Our experiment contained 60 trials. Each trial started with a fixation cross shown on the touch screen for 2 seconds. After that, a graphical user interface (GUI) was displayed. It was a 7 cm × 7 cm square in which a green ball denoting the starting point was randomly initialized. Then, in the next 40-50 seconds, subjects were asked to touch the green ball by a pen and move it to arbitrary points at their own pace in 4 directions (up, down, left and right). An example of the task is shown in Figure1. Actually, the pen track of each trial corresponded to a sequence of directional hand movements. In this experiment, the subjects self-chose the number of point-to-point movements during the drawing task. After the drawing time slot, this GUI disappeared and the trial was ended. The time for rest between the trials was randomized in a range from 8 s to 10 s to prevent subjects getting used to the timing of rest state to drawing task. During the time for rest, subjects were periodically told which directions were under-represented by the laptop for data balance. More detailed parameters of this experiment are listed in Table 1.

EOG and EMG removal
During our drawing tasks, the recorded EEG signals were contaminated with various artifacts such as EOG and EMG [30]. These artifacts may confound the EEG decoding of hand movements [18]. To show an example, we collected the EOG of Subject 3 and provided an off-line analysis in Appendix A1. The off-line analysis of EOG and the decoding of hand velocity of Subject 3 were based on the same dataset. To remove EOG and EMG, we employed ICA. It is a process that detects and isolates independent components (ICs) of signals consisting of mixed sources. For each subject, 30 ICs were decomposed from EEG signals by using the EEGLAB software [31], and about 12 ICs regarded as EOG/EMG were removed by the following heuristics: (i) Eye movements should project mainly to frontal sites with a low-pass time course; (ii) Eye blinks should project to frontal sites and have large punctate activations; (iii) Temporal muscle activities should project to temporal sites with a spectral peak in the band above 20 Hz [32]. An example of EOG and EMG removal is also given in Appendix A1.

Feature extraction
Since the direction was approximately fixed (up, down, left or right) in each point-topoint movement in our study, the values of hand velocities have close relationship with the directions. For example, when a subject performed a movement to the right, the absolute value of hand velocity in y-dimension is small and the hand velocity in x- The mean ± standard deviation of the experiment parameters are shown for each subject. The abbreviations of these parameters are listed below: dimension is large. It may suggest that the brain components discriminative for different directional movements were helpful for reconstructing the profiles of hand velocities. Therefore, supervised spatial filtering methods CSP and DSP were employed here to extract the discriminative brain components. Specifically, after EOG and EMG were removed, a filter bank was applied to filter the retained ICs into multiple bands (0.1-4 Hz, 4-8 Hz, 8-12 Hz, ..., [36][37][38][39][40]. Then DSP was used to extract the amplitude features of slow potentials within 0.1-4 Hz band of the ICs. And CSP was applied to extract the power features of oscillatory rhythms from the other bands of the ICs. The details of DSP and CSP methods can be found in Appendix A2. In DSP and CSP training procedure, we cut hand movement trajectories into segments with a sliding window (1s wide and 0.5s overlap) to obtain the directions in the drawing task. It was expected that the trajectory in each segment only exhibits one movement direction. However, in practice, the trajectories of some segments may not be straight lines or not extend enough in a direction. The ICs of these segments were not used into DSP or CSP training. Note that DSP and CSP were originally proposed to deal with binary classification problems. As far as our 4-direction hand movements are concerned, DSP and CSP need to be extended to multiclass paradigms. In this study, they were computed between each pair of directions [33], and the number of the pairs was C 4 2 6 = .
After CSP/DSP filter training was completed, regarding each pair of directions, 2 most discriminating filters of DSP and 4 most discriminating filters of CSP were obtained (see Appendix A2). Then they were used to filter the multi-band ICs into time series. In each frequency band, the combination of ICA and DSP/CSP can be formulated as: where X i R C×T is the recorded EEG signal in the ith frequency band, i = 1,2,...,10, C is the number of channels, T is the number of sample points covering the entire time period of an experiment, U R m×C is the 'unmixing' matrix of ICA, m is the number of retained ICs, W i m l R i ∈ × is the filtering matrix of DSP or CSP in the ith frequency band, l i is the number of the selected filters (l 1 = 12, l 2 = l 3 =...= l 10 = 24), At last, we extracted the features from the filtered data ξ i every 200 ms without over-

Decoding Algorithms
The decoding algorithm presented in this paper consists of a standard Kalman filter and a smoother. The Kalman filter is a real-time processing algorithm in which the state estimate is updated immediately after a new observation is available. On the other hand, the smoother optimally combines the Kalman filter with a reverse-time information filter. The result is a minimum variance estimate based on past, present and future information [34].
(1) Kalman filter Kalman filter considers a discrete filtering model [29], of which the system and observation models are: In this paper, the state vector is denoted by v j = [v x, j , v y, j ] T with v x, j and v y, j representing the horizontal and the vertical velocities respectively at time step j; A j R 2×2 is the state transition matrix, and n j~N (0,N j ) is the noise term, where N j R 2×2 . The observation vector z j R D is made up of the extracted features, H j R D×2 is the observation matrix, and q j~N (0,Q j ) is the noise term of observation, where Q j R D×D , D = 228, j = 1,2,..., M k , and M k is the number of time steps in the kth trial. Here A j , H j , N j and Q j are simplified as constant matrices. The matrices A and H can be where Tr is the set of training trials. For the estimated A and H, the noise covariance matrices N and Q can be obtained by equation (2) and (3). The prediction and update equations of Kalman filter for test can be written as follows [29]: wherev j − and P j − are the predicted mean and covariance of the state before seeing z j ; S j is the prediction covariance of the observation;v j and P j are the estimated mean and covariance of the state after seeing z j , K j is the filter gain. (

1) Smoother
The smoother is calculated from the results of Kalman filter by recursions [34]: where C j is the smoother gain;v j and P j are the filter estimates for the state mean and state covariance;v j s and P j s are the smoother estimates for the state mean and state covariance. The recursions start from the last time step.

Results
To study the fidelity of the drawing movement decoding and the characteristics of the associated EEG signals, we will show the accuracy of the hand velocity decoding, demonstrate the scalp areas most involved for the decoding and present the frequency bands that carried information of hand velocity. 5-fold cross-validation was employed in the evaluation, i.e., each subject's data were divided into 5 parts, among them 4 parts were used for training, and the retained part was adopted for test. This procedure was repeated 5 times. In each time, a different part was used as the test set. The results of these evaluations are described below.
Decoding accuracy of drawing movement Table 2 shows three performance indexes to assess the decoding accuracy, including (i) Pearson correlation coefficient (r-value), abbreviated as CC, between the measured and the decoded hand velocities; (ii) p-value for testing the null hypothesis that the measured and the decoded hand velocities are uncorrelated by Student's t-test; (iii) signal- , v denotes the measured hand velocity,v represents the decoded hand velocity.
From Table 2, we can find that, except the result of Subject 1 in y-dimension, the small p-values indicate that the CCs are significant. On average, the modest CCs and Pearson correlation coefficients (CCs), p-values and signal-to-noise ratios (SNRs) between measured and decoded hand velocities in x-dimension and y-dimension are listed. Top group: the mean ± stand error of the mean (SEM) of CCs is given for each subject and dimension across all 5 folds. The average of CCs across subjects is also given. Middle group: the mean of p-values is provided for each subject and dimension across all 5 folds. Button group: the SNRs are recorded for each subject and dimension across all 5 folds. The average of SNRs across subjects is also given. Before computing CC, p-value and SNR, the measured and decoded hand velocities were smoothed with a zero-phase, fourth-order, lowpass Butterworth filter with a cut-off frequency of 1 Hz.
SNRs demonstrate that it is possible to infer information about hand velocities in drawing task by EEG. For most subjects, the hand velocities in horizontal dimension, x, were better decoded than those in vertical dimension, y. Similar disparity in the MEG decoding between dimensions of hand movement has been discussed in [35]. Because the subjects were asked to draw on the vertical touch screen, gravitational force may impact the drawing action of subjects and degrade the decoding in y-dimension [35]. Although we only presented the results for one parameter setting (1s segment length for CSP/DSP filter training and 200 ms step size for Kalman smoother decoding), it was also found that these parameters could be chosen in a wide range.
For instance, we also tried other parameter settings (segment length for CSP/DSP filter training: 0.5s and 2s; decoding step size: 100 ms and 300 ms), and obtained comparable results. These results are not included in this paper due to limited page space. Some examples of measured and decoded hand velocities in x-dimension and ydimension are displayed in Figure 2. It can be seen that, in y-dimension, the decoded velocities hardly reflect the trends of the measured ones, while in x-dimension, generally, the decoded velocities match the measured ones better. Meanwhile, the measured velocities roughly consist of sequential bell shapes. Each bell shape indicates a relative straight trajectory made by a subject in a certain direction. Note that most bell shapes are irregular, which may be caused by two facts (i) the variable friction exists between the pen and the touch screen; (ii) visual guided point-to-point movements are not implemented in a purely feed-forward manner [19].

Scalp areas most involved for hand velocity decoding
Note that the brain components were generated by applying ICA and CSP/DSP to EEG signals. We rewrite equation (1) as l i is the number of selected filters in the ith frequency band, i = 1,2,...,10, C is the number of channels. Each row of B i gives a weight vector for channels to construct a brain component. Regarding velocity decoding by Kalman model, the observation is consisted of the features extracted from these brain components. Thus B i partly reflects the importance of the channels for velocity decoding. To investigate which channels were more involved for the velocity decoding in the ith frequency band, we average the rows of B i as follows: , |·| is an element-wise absolute operator.   [24]; And Vaillancourt DE et al. presented that the parietal and premotor cortex are associated with visuomotor processes [36]. Figure 3(B) displays the scalp topographies separately for each subject. On the whole, the channels on motor, posterior parietal and occipital areas get greater weights both in 0.1-4 Hz band and in 4-40 Hz band for all the subjects, although the weights of these areas are subject-dependent. As an exception, for Subject 4, the channels on prefrontal area also get greater weights. It may have been caused by some artifacts. Examples of smoothed and standardized measured (blue) and decoded (red) hand velocities. The left column is for x-dimension, and the right column is for simultaneous ydimension. Each row contains data for one subject. The Pearson correlation coefficient (CC) between measured and decoded velocities is listed for each subplot.

Decoding performance of different frequency bands
In order to explore which frequency bands carry information about hand velocity, we studied the decoding performance of each band, and show them in Figure 4. It can be seen that the frequency distribution for decoding is highly subject-dependent. For example, for Subject 1, the CC value of low frequency band (0.1-4 Hz) is significantly inferior to those of the other frequency bands in x-dimension (p < 0.05, paired lefttailed Student's t-test). However, for Subject 3, the CC value of low frequency band (0.1-4 Hz) is significantly superior to those of the other frequency bands in x-dimension (p < 0.05, paired right-tailed Student's t-test). Moreover, the CC values for Subject 1 are essentially zero in y-dimension for all the frequency bands and about 0.5 in x-dimension above 8 Hz. This may be due to the following fact. Subject 1 has been well trained for cursor control in a BCI system through left and right hand movement imageries. His voluntary power modulation of 8-40 Hz rhythms has been reinforced. The drawing task performed by right hand may have activated this power modulation in x-dimension which masks the information about hand movement in y-dimension. For Subject 2, the poor CC values of the frequency bands beyond 4 Hz indicated that, for certain people, the information about limb kinematics may not be inferred from the EEG above 4 Hz. The study of Waldert et al. provided similar results [16]. Regarding the average across all the subjects, there is no significant difference between the CC values of the low frequency band (0.1-4 Hz) and those of the other frequency bands in x-dimension (p > 0.40, paired two-tailed Student's t-test); however, the CC values of the frequency band from 24 Hz to 28 Hz are significantly higher than those of the low frequency band (0.1-4 Hz) in y-dimension (p < 0.05, paired right-tailed Student's t-test). These findings imply that, besides the slow potentials from 0.1 Hz to 4 Hz, the oscillatory rhythms from 24 Hz to 28 Hz may also carry notable information about hand movement velocity.
Comparison on decoding performance with ICA-cleaned data and non-ICA-cleaned data Here we list the decoding performance (CC) with non-ICA-cleaned data in Table 3. Comparing the CCs in Table 2 and Table 3, we can find that non-ICA-cleaned data Figure 4 Decoding performance of different bands. By using the features from different frequency bands respectively, we show the mean and SEM of the Pearson correlation coefficients (CCs) between measured and decoded hand velocities across cross-validation folds for each subject in x-dimension (blue) and y-dimension (red). Stars indicate the bars with significant CCs (p < 0.05 for no correlation hypothesis, Student's t-test). The average CCs across subjects for each band feature are also given.
Lv et al. BioMedical Engineering OnLine 2010, 9:64 http://www.biomedical-engineering-online.com/content/9/1/64 result in remarkably higher decoding accuracies in x-dimension and in y-dimension (p < 0.05, paired right-tailed Student's t-test). It indicates that the components removed by ICA could offer considerable contribution to hand velocity decoding. Although most of these components are EOG and EMG (see Appendix A1), these removed components may contain EEG signals which carry the information of hand velocity to some degree.
Comparison on decoding performance of linear filter, Kalman filter and Kalman smoother Until now, many decoding algorithms have been used to reconstruct hand velocities, such as linear filter in the study of Bradberry et al. [24] and Kalman filter in the research of Wu et al. [37]. As discussed in [37], compared with linear filter, Kalman filter possesses the advantages of a clear probabilistic foundation and a model of the temporal hand kinematics. Based on the work of Wu et al. [37], here we employed the smoothing method to integrate not only past and present information but also future information of hand velocities into Kalman model. With different lag time, the average decoding performance across five subjects for linear filter, Kalman filter and Kalman smoother are shown in Figure 5. Paired Student's t-test is employed to compare the decoding performance of the three methods. The results are listed in Table 4. From Figure 5 and Table 4, we find that with different lag times, the CCs and SNRs of Kalman smoother are significantly better than those of the linear filter and Kalman filter (p < 0.05, right-tailed), except in y-dimension where the SNRs of Kalman smoother are not significantly superior to those of Kalman filter (p > 0.05, right-tailed). Considering the Kalman smoother in this paper being an off-line algorithm, we plan to modify it and extrapolate this work to an online system in the future.

Comparison with other related studies
In this paper, the average CC across the five subjects over x-dimension and y-dimension is 0.30. As the most related work, hand velocity was reconstructed from EEG during a 3-D center-out reaching task, and a very close CC (0.29) was obtained [24]. In addition, MEG signals also reflect the activities of large neuronal populations. From MEG, hand velocities were predicted during a 2-D center-out drawing task, and a higher CC (0.4) was gained without EOG or EMG removal [18]. Therefore, the decoding accuracy of our work is within the range of those achieved in the studies mentioned above. Moreover, we would like to compare the experimental paradigms in this paper and that in [24] as below: (i) In [24], the center-out task is a 3D reaching movement, in which the subject moved his hand from a fixed starting point (center) to one of the 8 stationary targets, and then moved his hand back to the center. In this paper, the task is a 2D self-routed drawing movement, in which the subject was required to move a pen at his own pace along a zigzag route in each trial. This task can be regarded as sequential point-topoint movements. At each point the subject selected one of the four directions. Moreover, the numbers and positions of these points, and the distance between two sequential points were up to the subject. Therefore, compared to [24], the starting point, the end point and the length of each point-to-point movement in our experiments were less constrained. The subjects can perform the movements with higher variability. It has been reported in [24] that the variabilities of movement time and movement length are negatively correlated with the accuracy of hand velocity decoding. From this viewpoint, the hand velocity of our drawing movement could be harder to decode than that of the center-out movement task.
(ii) In [24], subjects were asked to perform multi-joint movements of the upper limb. In our work, the subjects were instructed to make movements only with their hands and wrists, while keeping their shoulders and arms at rest. We studied hand movements not only because of the interesting work on hand movement direction decoding [16], but also because hand is relatively far from the EEG cap, therefore reduces EMG contamination to the EEG signals. Since our drawing task needs the coordination of eye and hand, EOG and EMG may confound the EEG decoding. Thus we employed ICA to remove EOG and EMG artifacts.

Decoding hand kinematics in different frequency bands
Which frequency band of neural signal carries most information about limb kinematics is an important issue discussed in the existing studies. For example, Ball et al. summarized the decoding accuracies of arm movement direction with different band ECoG, and indicated that highest decoding accuracy can be obtained from slow movementrelated potentials (MRPs) (<2 Hz) [38]. Jerbi et al. reported the notable phase locking between 2-5 Hz MEG oscillatory activity in the contralateral primary motor cortex and time-varying hand speed [20]. Regarding EEG recording, Waldert et al. discovered that low frequency band (≤3 Hz) EEG of the sensors located in the motor-related area have close relationship with movement directions [16]. In addition, it is well known that the planning and execution of movement leads to significant power modulation in 8-30 Hz EEG, i.e., event-related synchronization/desynchroniza-tion (ERS/ERD) [39,40]. Such characteristic changes in EEG rhythms have been used to classify brain states related to the planning/imagery of different types of limb movement [41]. Newly, Han et al. reported that EEG activities in the alpha (8)(9)(10)(11)(12) and beta (18)(19)(20)(21)(22)(23)(24)(25)(26)(27)(28) frequency bands were correlated with the speed of imagery clenching [42]. In our study, we have shown that displacement velocity can be represented by the MRP in 0.1-4 Hz band and the ERD/ERS in 24-28 Hz band. Further more, we analyzed the relevance of decoding results from different frequency bands (see Appendix A3), and found that the decoding results of MRPs from low frequency band (0.1-4 Hz) are little correlated with those of oscillation rhythms from higher frequency bands . It indicates that  the potential shifts in the low frequency band and the power modulations in the higher frequency bands reflect different aspects of brain activities related to hand movement velocity. Furthermore, from the scalp map in Figure 3 (A), we find that in the low frequency band, the channels in the motor, posterior parietal and occipital areas get greater weights. This demonstrates that the features in the low frequency bands capture the neural signature. The finding is in accordance with the ECoG study of Schalk et al. which also focused on decoding kinematic parameters of hand movement [14].

Conclusions
Decoding limb kinematics from brain signals in non-invasive ways may realize safe and convenient control of motor prosthesis. In this paper, we demonstrated that EEG signals can be used to decode hand velocity during a sequential drawing task. The scalp areas over motor cortex, posterior parietal cortex and occipital areas were most involved for the decoding. Furthermore, we show that not only slow potentials in 0.1-4 Hz band, but also oscillatory rhythms in 24-28 Hz band may carry information about hand velocity.

A1. EOG and EMG removal based on ICA
In our study, we recorded Subject 3's EOG activity with a bipolar sensor montage with sensors attached superior and inferior to the orbital fossa of the right eye for vertical eye movements and to the external canthi for horizontal eye movements. Firstly, we computed Pearson correlation coefficient (CC) and p-value (for no correlation hypothesis, Student's t-test) between the EOG signal and the measured hand velocity. The results are listed in Table 5. It is found that the correlation between the horizontal EOG activity and the horizontal hand velocity is significant (p < 0.001). Next, we removed EOG and EMG artifacts using ICA method. ICA removes artifacts from EEG records by eliminating the contributions of artifact sources to the scalp sensors. Using the data from Subject 3, we provided the regularized scalp maps of all the ICs in Figure 6.
From Figure 6, we can find that the projection strengths of IC5, IC6 and IC14 were concentrated on Fp1 or Fp2. These ICs were removed as the eye movement artifacts [31]. To demonstrate the validity of ICA for EOG removal in our study, we have computed the CCs between the independent components (ICs) and the recorded EOG activities. The results are shown in Figure 7, where we can observe that, except IC5, IC6 and IC14, all the components are not obviously correlated with EOG activities.
On the other hand, from Figure 6, we can find the projection strengths of IC10 and IC29 are concentrated on the temporal sites. Their power spectrums are shown in Figure 8, which demonstrates high power at frequencies above 20 Hz. Here, IC10, IC29 were removed as the EMG artifacts [31]. In our study, some ICs partially exhibit the characters of EOG/EMG, such as IC1, IC7, IC13, IC15, IC21, IC22, IC25, IC26 and IC27. They were also removed.

A2. Details of DSP and CSP algorithms
Both DSP and CSP are linear projection methods [27,28]. They have the same data model as Y = W T X , where Y R C×T denotes the source component, W R C×C is the projection matrix and X R C×T represents the EEG segment, with C denoting the number of channels, and T denoting the number of samples in the time interval of interest. However, the goals of DSP and CSP are different. For DSP, W is sought for the purpose of extracting the amplitude of slow non-oscillatory source. It projects EEG segments to the linear subspace where the between-class separation is maximized while the within-class separation is minimized. The projection vector achieving the largest ratio of between-class separation and within-class separation is defined as the most   Lv et al. BioMedical Engineering OnLine 2010, 9:64 http://www.biomedical-engineering-online.com/content/9/1/64 discriminative filter. Let S b , S w denote the between-class and the within-class scatter matrices of EEG segments, respectively.
where X j (i) represents the ith EEG segment of class j, K is the number of classes, n j is the number of EEG segments for class j, M j is the average of EEG segments for class j, M is the average of all the EEG segments. Then the objective function of DSP can be written as [27]: where q = 1,2,..., C, g q is an eigenvalue and w q is the corresponding eigenvector. Assuming these eigenvalues are sorted in a descending order, only a few eigenvectors W* = [w 1 ,...,w d ] associated with the largest eigenvalues are chosen as the most discriminative spatial filters, where d <<C. Then each EEG segment is projected as Y* = W* T X, Y* R d×T . To obtain the amplitude features of slow potential shifts, we calculate the mean of Y* as f mean DSP r r = ( ) * y , where r = 1, ..., d, y r * is the rth row of Y*.
In our work, d = 2. For CSP, W is optimized to obtain the band power of oscillatory source. It maps EEG segments to the linear subspace where the variance of one class is maximized while the variance of the other class is minimized. The projection vectors achieving the largest and smallest ratios of the variances of the two classes are defined as the most discriminative filters. Assuming R denotes the normalized covariance matrix of EEG segment, i.e., R = XX T /trace (XX T ), then the objective function of CSP can be formulated as [28]: where R 1 and R 2 represents the average of the covariance matrices from EEG segments within class 1 and class 2 respectively. Similar to (A3), (A5) is also in the form of Rayleigh quotient. The solution can be obtained by solving the generalized eigenvalue problem: where q = 1,2,..., C, b q is an eigenvalue and w q is the corresponding eigenvector. Suppose these eigenvalues are sorted in a descending order, the eigenvectors associated with the largest and smallest m eigenvalues are chosen as the most discriminative spatial filters, i.e., W* = [w 1 ,...,w m , w C-m+1 ,...,w C ], where m <<C. Then each EEG segment is projected as Y* = W* T X, Y* R 2m×T . To extract the power features, we calculated the logarithm transformation, normalized the variance of Y* by rows . In this paper, m = 2. The logarithm transformation is performed to normalize the distribution of the elements in f CSP r .

A3. Relevance of decoding results from different frequency bands
The absolute correlation coefficient matrices of the decoded hand velocities from different frequency bands are shown in Figure 9. Figure 9(A) illustrates the average of the matrices of the 5 subjects. The decoding result from low frequency band (0.1-4 Hz) is little correlated with those from the frequency bands above 4 Hz in x-dimension and in y-dimension (|cc|<0.05). When we consider the patterns for individual subjects, we obtain similar results as above. Figure 9(B)-(F) show the matrices for the five subjects respectively. For all the 5 subjects, the decoding result from low frequency band (0.1-4 Hz) is not significantly correlated with those from the frequency bands above 4 Hz in x-dimension and in y-dimension (|cc|<0.07, p > 0.05 for testing the hypothesis of no correlation).

Figure 9
The absolute correlation coefficient matrices of decoded hand velocities from different frequency bands.