The fundamental assumptions for various kinds of fetal electrocardiogram (fECG) extraction methods are not consistent with each other, which is a very important problem needed to be ascertained.
Based on two public databases, the regularity on ECG wave durations for normal sinus rhythm is investigated statistically. Taking the ascertained regularity as an assumption, a new fECG extraction algorithm is proposed, called Partial R-R interval Resampling (PRR).
Both synthetic and real abdominal ECG signals are used to test the algorithm. The results indicate that the PRR algorithm has better performance over the whole R-R interval resampling based comb filtering method (RR) and linear template method (LP), which takes advantages of both LP and RR.
The final drawn conclusion is: (1) the proposition should be true that the individual’s heart beat span is invariable for normal sinus rhythm; (2) the proposed PRR fetal ECG extraction algorithm can estimate the maternal ECG (mECG) more accurately and stably even in the condition of large HRV, finally resulting in better fetal ECG extraction.
Non-invasive fetal electrocardiogram (fECG) extraction is a classic dilemma in biomedical signal processing. Researchers put forward various kinds of methods to solve this problem. These methods can be classified into: (1) multi-channel methods, like principal component analysis and its variants [1–3], independent component analysis [4, 5], periodic component analysis (piCA) [6–9], some wavelet based methods [10, 11], adaptive method [12–14] and so on; (2) single-channel methods, such as singular value decomposition (SVD) , R-R interval resampling based comb filter (RR) , linear template methods (LP) [17–21] and so on; (3) and fusion methods [22–25]. A detailed description about these methods can be found in literature .
Among the above three kinds of methods, the single-channel methods are always used to subtract the maternal electrocardiogram (mECG) component from the mixed abdominal ECG (aECG) recording. Since mECG component usually takes up the highest proportion in aECG, the accurate estimation of mECG component is a key procedure to obtain final high-quality fECG.
When using SVD  and RR  methods to estimate mECG component, R-R interval resampling is required because of the periodicity and the heart rate variability (HRV) of mECG signal. In essence, an underlying hypothesis is taken that the lengths of various waves in each cardiac cycle are proportional to the length of the R-R interval (heart beat interval).
Linear template (LP) [17–21] methods are simple, effective and widely used mECG estimation methods. In LP methods, the wave complexes in current cardiac cycle are estimated by summing the weighted wave complexes from neighbour cardiac cycles, depending on at least the peak detection of each QRS complex for different heart cycles. In essence, it is assumed that when heart beat interval is time-variant, the heart beat span (i.e. the length from the beginning of P wave to the end of T wave) is stable. So no resampling step is taken in current LP methods.
For the above two assumptions, there is no clear conclusion on which one is correct. In [27, 28], it is said that the length from the beginning of P wave to the peak of R wave (Ps-R) is weakly dependent on heart rate, while in literatures [21, 29] the authors think it depends on the instantaneous heart rate. As for the length from the peak of R wave to the end of the T wave (R-Te), literatures [27, 28] have the opinion that the T wave is strongly dependent on heart rate, becoming narrower and closer to the QRS complex at rapid heart rates, while the opposite view is given in . In , it is considered that the length from the end of T wave to the beginning of next P wave (Te-Ps) is strongly dependent on the heartbeat duration. So far, we have not seen literatures to ascertain such disagreement problem for normal sinus rhythm.
Obviously, the above disagreement problem is related to whether SVD, RR and LP methods are used correctly. It is a very crucial thing for designing a high-quality fECG extraction method, especially when the mECG has large HRV.
Based on MIT Normal Sinus Rhythm Database (NSRDB as Set A)  and the MIT-BIH Arrhythmia Database (ADB as Set B) [30, 31], the problem is investigated in this paper and a regularity comes out that heart beat span is invariant for variable heart beat intervals. Then we propose a new corresponding single channel fECG extraction algorithm based on the regularity. Both HRV and periodicity are considered in the method. And its effectiveness is verified by synthetic and real recordings.
Investigation on regularity of ECG waves duration
A typical complete cardiac cycle is showed in Figure 1. Ps is the beginning point of P wave, and Te is the end point of T wave. Each cardiac cycle begins with a Ps point, including P wave, QRS complexes, T wave and then ends at the next Ps point. Let’s define the period from Ps to Te as heart beat span, and the period from Te to the next Ps as diastolic period.
The databases selected as research resources are Set A  and Set B [30, 31]. Set A is a normal sinus rhythm database, including 18 long-time records, with two channels for each record, and the sampling frequency is 128 Hz. And Set B is a database containing 48 two-channel recordings with different degree of arrhythmia; the sampling frequency is 360 Hz.
Take the record “19093” from Set A as an example. A period from its first channel is given in Figure 2(a) and the first 5 cardiac cycles are depicted in Figure 2(b). It is quite obvious that the heart beat spans of the five cardiac cycles are almost the same, while the diastolic periods are quite different in length. The length of Ps-Ps (tPs-Ps), the length of Ps-R (tPs-R), the length of R-Te (tR-Te) and the length of Te-Ps (tTe-Ps) for the first 500 seconds-length data of record “19093” are illustrated in Figure 3. Details about how these characteristic waves are detected can be seen in Figure 4; an eye inspection is a must to ensure the validation of the detected results. And the final results are exported after manually correcting the inappropriate detections. Gradients (k s) for regression lines of tPs-R vs. tPs-Ps, tR-Te vs. tPs-Ps and tTe-Ps vs. tPs-Ps are also calculated and listed in Figure 3. Since k for tTe-Ps vs. tPs-Ps is nearly one and k s for the other two are almost zero, we can say that the length differences of Ps-Ps intervals are reflected totally by the change of tTe-Ps.
The above result on record “19093” presents that the tPs-R and tR-Te are quite stable compared with tPs-Ps. That is, there exists so called invariant heart beat span with the variant heart beat intervals. A further validation with the databases is given below.
First, because of the difficulty in detecting Ps and Te points accurately, for convenience we replace Ps-R by P-R interval, R-Te by R-T interval, Te-Ps by T-P interval and Ps-Ps by R-R interval for statistics’ validation. If the above conclusion from data “19093” is correct, the both substitute intervals P-R and R-T also should be stable, and T-P interval and R-R interval should always vary accordingly. Then, twelve records from Set A and seven from Set B with obvious P and T waves are selected. The first 500 seconds (s) data for each selected record are cut out for statistical analysis. Note that pieces that P or T wave is not clear are thrown away.
As in Table 1 shown, for the nineteen records the lengths of T-P (tT-P) and R-R interval (tR-R) change all the time, their standard deviation (std) are about 10-2; the lengths of P-R (tP-R) and R-T interval (tR-T) are comparatively stable, their std are 10-3. For tP-R vs. tR-R and tR-T vs. tR-R their gradients are in the order of 10-2; for tT-P vs. tR-R the gradients are bigger than 0.85. Considering the std and gradients, we think the results are consistent with that on record “19093”. The conclusion agrees with the underlying hypothesis taken by LP. It also illustrates the irrationality lying in SVD  and RR  methods.
Partial R-R interval resampling based fECG extraction Algorithm (PRR)
According to the conclusion earlier mentioned, combining the advantages of LP and RR method, a modified fECG extraction algorithm is proposed. The core algorithm is shown in Figure 5. Unlike RR method, the proposed algorithm only resamples part of each R-R interval, so it is called Partially R-R Resampling based comb filter i.e. PRR.
The proposed core steps of PRR are further described as follows:
Step 1 in Figure 5: segmentation. Detect the maternal R peaks in aECG. Take the Ps point as the point TPs-R s before each R peak and Te point as the point TR-Te s after each R peak. Then a complete cardiac cycle can be divided as Ps-Te-Ps. Ps-Te period is the previously mentioned heart beat span and Te-Ps period is the diastolic period.
Step 2 in Figure 5: Te-Ps interval up-resample. Denote the lengths of diastolic period as l1p, l2p,…, lnp-1, lnp (points). Each diastolic period is resampled to have the same number of samples lvp. lvp can be the maximum one of l1p, l2p,…, lnp-1, lnp. Thus every diastolic period is up resampled. And every heart beat span keeps unchanged. The total length of a cardiac cycle after resampling is L = (TPs-R + TR-Te)*fs + lvp, here fs is the sampling rate .
Step 3 in Figure 5: comb filtering. The up-resampled periodic signal convolutes with a comb filter. The designing detail for a comb filter can be found in literature . The result is an estimation of up-resampled mECG.Step 4 in Figure 5: Te-Ps interval down-resample. Resample diastolic periods of the estimated up-resampled mECG to recover their original lengths. It turns out to be the estimation to mECG. The residual ECG (rECG) is obtained after subtracting it from the original aECG. The rECG can be seen as the estimation to fECG; of course, the noise can be weakened in a further step to get a clearer fECG which is not discussed in the paper.
In above Step 1, the TPs-R and TR-Te values have slight difference for different individuals. TPs-R lies between 0.15 and 0.20 s, while TR-Te lies 0.20 to 0.35 s . In reality, it is difficult to detect the points Ps and Te steadily online, since P wave and T wave may be invisible in aECG. Therefore, we set TPs-R =0.2 s and TR-Te = 0.4 s fixedly in the proposed PRR. A further illustration about this can be found in the Discussion Section.
The performance of the PRR algorithm is tested on synthetic mECG and aECG data and real antenatal abdominal recordings. The RR and LP are also implemented for comparison. In LP, the template window is chosen as the window between R peak - 5/12 T and R peak + 7/12 T, here T is the average length of several R-R intervals [17, 18].
Tests on synthetic data
The simulated synthetic data is created as follows: (1) generate the ECG without HRV as in literature ; (2) change the length of each Te-Ps interval, to make it has HRV and constant heart beat span. The new length of a changed Te-Ps interval N = (1 + r × rand) × n, here n is the original length of a Te-Ps interval, rand is a random number between -1 and 1, and r is a set number between 0 and 1, called HRV variation coefficient. The bigger r, the larger signal’s HRV. Figure 6(a) gives an example of the synthetic ECG with r = 0.4.
For the simulated synthetic data showed in Figure 6(a), its ECG component is estimated by PRR, RR and LP respectively. The estimation errors are calculated and showed in Figure 6(b-d) respectively, in which error = eECG-ECG, with eECG representing the estimation of ECG. The root mean squares (RMS) of the estimation errors are also calculated, which are 0.0159, 0.0234 and 0.0419 respectively.
Further take r as 0, 0.1, …, 0.8, 0.9 respectively to generate synthetic data to test. Create 20 synthetic ECG data for each r. The tested results are depicted with an errorbar plot as Figure 7, in which the Y-axis represents RMS value of the estimation errors.
We also generate 20 synthetic aECGs to test. For each synthetic aECG, we fixedly set r = 0.4 to create its mECG component and r = 0 to create its fECG component. In the test, the mECG components are estimated and subtracted from aECGs, resulting in rECGs. Then the added fECG components are excluded from rECGs, obtaining the resultant errors, whose wave power ratios (WPR s) are listed in Table 2. The WPR is defined as
here s and e are respectively the point 0.2 s before and 0.4 s after i-th R peak. N is the number of R peaks. The numerator measures the residual mECG component power in rECG, while the denominator mainly measures the original mECG power in aECG. The WPR indicates how much mECG component is kept in rECG. Figure 8 has illustrated estimation errors for a specific aECG when using three methods, and their WPR calculated are 0.0034, 0.0166 and 0.0048 respectively.
From Figure 6(b-d) and Figure 8(b-d), it’s quite clear that: (1) RR method always has quite serious erroneous estimations on P and T waves; (2) LP method may have some transient errors at the edges of a diastolic period; (3) for PRR and LP, their estimations to the QRS complexes are almost the same. From Figure 7 it’s showed that: when r = 0, i.e. ECG has no HRV, the RMSs of errors for synthetic ECGs are the same for the three method; for others, RMSs for RR and LP increase with r but RMSs for PRR stay the same. Generally speaking, the tests on synthetic ECG show that PRR can estimate the ECG signal more accurately especially when it has large HRV.
Experiments on real data
Tests are taken on data from MIT ADFED [30, 31] which contains five real-life records. Each record has five channels - one direct fECG and four abdominal ECG. The sampling rate is 1 KHz and their sampling durations are five minutes.
First, take record “r01” to test. 12 pieces of 5 second-length signals from 2.5 s to 62.5 s of the whole record are chosen. According to the steps described in Figure 5, mECGs are estimated and rECGs are obtained for these signals. rECGs and their spectrums by PRR, RR and LP respectively are depicted in Figure 9(b-e, g-j). From Figure 9, the three methods have almost same residual maternal QRS complexes; however, RR method has larger residual maternal T wave than the other two methods. The WPRs for rECGs of the 12 pieces are listed in Table 3. We can see that the WPR s for PRR are always the smallest among the three methods.
The errorbar of WPRs for all the five records are depicted in Figure 10. Generally speaking, the WPR s for PRR are the smallest.
On invariant heart beat span
Based on the lengths of ECG waves for recordings in Set A and Set B, we get a conclusion that for normal sinus rhythm when the instant heart beat interval changes the heart beat span keeps almost unchanged, while the length of diastolic period fluctuates with heart beat interval.
HRV is always explained by the autonomic nerve’s regulation of heart and circulation system . The activity of parasympathetic or sympathetic nervous system affects the sinus node’s beating rhythm [26, 27], and further affect the heart beat to have HRV. However, autonomic nerve may affect HRV just by determining the beat beginning point, i.e., Ps point on ECG, don't control the duration of heart being excited, i.e., heart beat span. The heart beat span for a person should only be determined by his or her own heart's physiology and healthy condition itself. Therefore, if there is no conduction block disease, the HRV for normal sinus rhythm just affects the variation of the length of Te-Ps interval. The lengths of P wave, P-R interval, QRS wave and S-T interval are independent of the heart beat rate , which constituting the stable heart beat span.
Individual difference’s effect on PRR performance
In Method Section we set TPs-R and TR-Te to be fixed. But actually different individuals may have slight difference on them. Now we discuss the effect below.
Generating 11 synthetic ECG signals as Result Section described. Their r is 0.4. The real lengths of Ps-R tPs-R and R-Te tR-Te intervals for these signals and their offsets to TPs-R and TR-Te are listed in Table 4. The RMS for estimation errors by using PRR and RR are given in Figure 11. For PRR when TPs-R and TR-Te get closer to their real values, the RMS becomes smaller; when they are equal the RMS-curve gets its minimum. The offset between them indeed affects PRR’s performance. In addition, no matter how large the offset is, the proposed method outperforms RR method. Considering the difficulty in P wave detection and its lower amplitude compared with T wave, in PRR TPs-R can be set as a fixed value while TR-Te can be decided online according to the T peak detection.
With statistical analysis to the lengths of ECG waves from two databases, the conclusion comes out: heart beat span is invariant while the heart beat interval changes all the time. According to this prior information, a new modified fECG extraction algorithm is proposed called PRR. It copes with the mECG estimation inaccuracy caused by HRV and enhances the signal to noise ratio of the remaining fECG signal. Experiments show that the larger the HRV, the more obvious the advantage of PRR over other methods. So the proposed PRR method is outstanding on robustness, and it is believed it can be used into the practical fECG extraction. The computation complexity of this method can be studied in further work.
Langley P, Bowers EJ, Murray A: Principal component analysis as a tool for analyzing beat-to-beat changes in ECG features: application to ECG-derived respiration.IEEE Trans Biomed Eng 2010, 57: 821–829.
Liu C, Li P, Di Maria C, Zhao L, Zhang H, Chen Z: A multi-step method with signal quality assessment and fine-tuning procedure to locate maternal and fetal QRS complexes from abdominal ECG recordings.Physiol Meas 2014, 35: 1665–1683. 10.1088/0967-3334/35/8/1665
Zhang H, Shi Z, Guo C, Feng E: Semi-blind source extraction algorithm for fetal electrocardiogram based on generalized autocorrelations and reference signals.J Comput Appl Math 2009, 223: 409–420. 10.1016/j.cam.2008.01.031
Shuicai W, Yanni S, Zhuhuang Z, Lan L, Yanjun Z, Xiaofeng G: Research of fetal ECG extraction using wavelet analysis and adaptive filtering.Comput Biol Med 2013, 43: 1622–1627. 10.1016/j.compbiomed.2013.07.028
Zeng XP, Li SH, GuoJun Li Y, Zhou DHM: Fetal ECG extraction by combining single-channel SVD and cyclostationarity-based blind source separation.Int J Signal Process, Image Process Pattern Recognit 2013, 6: 367–376. 10.14257/ijsip.2013.6.5.32
Wei Z, Hongxing L, Aijun H, Xinbao N, Jianchun C: Single-lead fetal electrocardiogram estimation by means of combining R-peak detection, resampling and comb filter.Med Eng Phys 2010, 32: 708–719. 10.1016/j.medengphy.2010.04.012
Comani S, Mantini D, Lagatta A, Esposito F, Di Luzio S, Romani GL: Time course reconstruction of fetal cardiac signals from fMCG: independent component analysis versus adaptive maternal beat subtraction.Physio Meas 2004, 25: 1305–1321. 10.1088/0967-3334/25/5/019
Cerutti S, Baselli G, Civardi S, Ferrazzi E, Anna Maria M, Massimo P, Giorgio P: Variability analysis of fetal heart rate signals as obtained from abdominal electrocardiographic recordings.J Perinat Med 1986, 14: 445–452. 10.1515/jpme.19220.127.116.115
Haghpanahi M, Borkholder DA: Fetal QRS extraction from abdominal recordings via model-based signal processing and intelligent signal merging.Physiol Meas 2014, 35: 1591–1605. 10.1088/0967-3334/35/8/1591
Goldberger AL, Amaral LAN, Glass L, Hausdorff JM, Ivanov PC, Mark RG, Mietus JE, Moody GB, Peng C-K, Stanley HE: PhysioBank, PhysioToolkit, and PhysioNet: Components of a New Research Resource for Complex Physiologic Signals.Circulation 2000, 101: e215-e220. Circulation Electronic Pages [http://circ.ahajournals.org/content/101/23/e215.full] 10.1161/01.CIR.101.23.e215
Jezewski J, Matonia A, Kupka T, Roj D, Czabanski R: Determination of the fetal heart rate from abdominal signals: evaluation of beat-to-beat accuracy in relation to the direct fetal electrocardiogram.Biomed Tech 2012, 57: 383–394.
The authors declare that they have no competing interests.
YH was responsible for writing the codes and carried out the experiments. LH conducted the whole experiments and the design of the algorithm. All authors contributed in general concept and writing of the paper. All authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.
The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
Yan, H., Liu, H., Huang, X. et al. Invariant heart beat span versus variant heart beat intervals and its application to fetal ECG extraction.
BioMed Eng OnLine13, 163 (2014). https://doi.org/10.1186/1475-925X-13-163