

ORIGINAL ARTICLE 

Year : 2018  Volume
: 8
 Issue : 4  Page : 225230 

The empirical mode decompositiondecision tree method to recognize the steadystate visual evoked potentials with wide frequency range
Sahar Sadeghi, Ali Maleki
Department of Biomedical Engineering, Semnan University, Semnan, Iran
Date of Submission  01May2018 
Date of Acceptance  11Aug2018 
Date of Web Publication  13Sep2019 
Correspondence Address: Dr. Ali Maleki Department of Biomedical Engineering, Semnan University, Semnan Iran
Source of Support: None, Conflict of Interest: None
DOI: 10.4103/jmss.JMSS_20_18
Background: The empirical mode decomposition (EMD) is a technique to analyze the steadystate visual evoked potential (SSVEP) which decomposes the signal into its intrinsic mode functions (IMFs). Although for the limited stimulation frequency range, choosing the effective IMF leads to good results, but extending this range will seriously challenge the method so that even the combination of IMFs is associated with error. Methods: Stimulation frequencies ranged from 6 to 16 Hz with an interval of 0.5 Hz were generated using Psychophysics toolbox of MATLAB. SSVEP signal was recorded from six subjects. The EMD was used to extract the effective IMFs. Two features, including the frequency related to the peak of spectrum and normalized local energy in this frequency, were extracted for each of six conditions (each IMF, the combination of two consecutive IMFs and the combination of all three IMFs). Results: The instantaneous frequency histogram and the recognition accuracy diagram indicate that for wide stimulation frequency range, not only one IMF, but also the combination of IMFs does not have desirable efficiency. Total recognition accuracy of the proposed method was 79.75%, while the highest results obtained from the EMDfast Fourier transform (FFT) and the CCA were 72.05% and 77.31%, respectively. Conclusion: The proposed method has improved the recognition rate more than 2.4% and 7.7% compared to the CCA and EMDFFT, respectively, by providing the solution for situations with wide stimulation frequency range.
Keywords: Braincomputer interface, decision tree classifier, empirical mode decomposition, frequency recognition, harmonic frequency, steadystate visual evoked potential
How to cite this article: Sadeghi S, Maleki A. The empirical mode decompositiondecision tree method to recognize the steadystate visual evoked potentials with wide frequency range. J Med Signals Sens 2018;8:22530 
How to cite this URL: Sadeghi S, Maleki A. The empirical mode decompositiondecision tree method to recognize the steadystate visual evoked potentials with wide frequency range. J Med Signals Sens [serial online] 2018 [cited 2020 Oct 22];8:22530. Available from: https://www.jmssjournal.net/text.asp?2018/8/4/225/246736 
Introduction   
Steadystate visual evoked potential (SSVEP) is the brain response to repetition of visual stimulation with a certain frequency. It is certain type of evoked potential that refers to stimulation frequencies above 6 Hz. The dominant frequency of SSVEPs depends on the frequency of flickering source. Due to advantages of SSVEP, such as highdata transfer rate and lowtraining time, this control signal is widely used in braincomputer interface (BCI) systems.^{[1],[2],[3]} Several methodologies had been proposed for frequency recognition in SSVEPbased BCI systems. More specifically, the fast Fourier transform (FFT), power spectral density analysis, canonical correlation analysis (CCA), stability coefficient (SC), minimum energy combination, multivariate synchronization index, and least absolute shrinkage and selection operator are used.^{[4],[5],[6]}
The EMD is a preprocessing step for SSVEP feature extraction methods. The method of EMD is an adaptive approach to extract timefrequency information of the signal. In this method, since the signal is decomposed into intrinsic mode functions (IMFs) based on local characteristics of it in timescale, EMD is useful for processing nonstationary signals such as electroencephalogram (EEG).^{[7],[8]} The EMD is a datadriven scheme, and any change in input signal (for example, the existence of noise in the signal) will affect signal decomposition and frequency characteristics of its IMFs. Thus, selecting suitable IMF is one of the important issues after the decomposition of the signal. In studies that use limited frequency range, the effective IMF corresponding to that range has usually been analyzed. For example, the transition from attentiontorest response has been done by analyzing the IMF_{6} corresponding to very low frequencies (0.5–2 Hz).^{[9]} Ruan et al. have been applied IMF_{3} to analyze two frequencies including 11 and 12 Hz.^{[10]} In other studies, by assigning a mean instantaneous frequency to each IMF, the IMF with the mean instantaneous frequency closer to stimulation frequencies has applied as effective IMF. In this regard, Wu et al. have focused on IMF_{6} for the frequency range of 30–35 Hz by calculating mean instantaneous frequencies.^{[11]}
When using wider range of stimulation frequencies, the use of just one IMF will not be sufficient and more IMFs should be analyzed. Huang et al. have focused on IMF_{2} and IMF_{3} to apply the frequency range of 34–48 Hz since the highfrequency SSVEP centralize at these IMFs.^{[12]} In the study done by Wang et al. with SSVEP stimulation frequencies ranged from 9 Hz to 13 Hz, they have focused on the first four IMFs as their spectra overlapped with stimulation frequencies while discarded the lowfrequency components (IMF_{514}).^{[13]} Then, since the IMF_{3} has contained main stimulus frequencies involved, they have combined IMF_{3} with IMF_{1}, IMF_{2}, and IMF_{4} to reconstruct four versions of EMD preprocessed signal: the IMF_{3}; the sum of IMF_{3} and IMF_{4}; the sum of IMF_{24}, and the sum of IMF_{14}. They have finally found that the sum of IMF_{14} had the best recognition rate. In our previous study with SSVEP stimulation frequency range of 6–16 Hz, we had used the neural network to classify the recognition results of IMFs.^{[14]}
Decision tree (DT) is a popular and powerful method used for classification and prediction purposes. Many previous SSVEP studies have also used this method to classify targets.^{[15],[16]} In this paper, we have proposed the DT to classify frequency recognition results of different IMFs. The goal of this paper is to provide an approach for applications with wide range of stimulation frequencies.
Materials and Methods   
Electroencephalogram recording
EEG signals were recorded using the Bayamed system (EEG V.16.24) at a sampling rate of 250 Hz. Signals were acquired from the surface of the scalp through one electrode placed at Oz, referenced to AFZ, and grounded to the right earlobe. The impedance was kept below 5 KΩ. All stimuli were presented on a 16” LCD monitor with a refresh rate of 60 Hz (Samsung NW 1633) using MATLAB and psychophysics toolbox extensions. This toolbox is used to create stimulus with precise frequency and correct timing. The visual stimulus was a white flashing circle with a diameter of 10 cm rendered at the center of the black background screen.
Six healthy young adults with the age between 21 and 30 years without any known history of BCI experience participated in this study. All subjects were sitting on an armchair in a fixed location, watching a flat screen monitor placed approximately at a distance of 40 cm from the eye. The nature of the study was fully explained for them before the start of the study and all signed informed consent. Each subject participated in 10 sessions. In each session, stimulus frequencies were in the range from 6 to 16 Hz with an interval of 0.5 Hz. Subjects were asked to gaze at the flickering stimulus for 4 s and then take a 5 s rest after each trial to avoid visual fatigue caused by flickering. There was 5min break between two consecutive sessions.
The empirical mode decomposition
The EMD is the core algorithm of HilbertHuang transform that decomposes nonlinear and nonstationary signal to its constructive components (IMFs) with different frequency contents. An IMF must satisfy two conditions: (1) in the whole data of IMF, the number of extrema (maxima and minima) and the number of zero crossings must either equal or differ at most by one and (2) at any point, the mean value of the envelope defined by the local maxima and the envelope defined by the local minima must be zero. By virtue of IMF definition, the decomposition method can simply use envelopes defined by the local maxima and minima separately. Once the extrema are identified, all the local maxima are connected by cubic spline line as the upper envelope. This procedure is repeated for the local minima to produce the lower envelope. Then, the mean value of upper and lower envelop is calculated. The difference between the original data and this mean value is the first component. If this component does not satisfy the definition of IMF, it is substituted with the main signal and enters the cycle as the primary input. Otherwise, this component is as separated IMF and the difference between the main signal and this IMF entered the cycle.^{[14],[17]} Due to the overlapping of the stimulation frequency range used in this study with three first IMFs, only these IMFs are extracted as effective IMFs of the signal.
The generalized zero crossing (GZC) method are used to determine the mean instantaneous frequency of an IMF. After defining the local extrema and zerocrossings as critical points, the GZC approach separates each IMF into a series of period based on these points. It estimates the instantaneous frequency of a period by measuring local full and partial wave periods. The instantaneous frequency of each period along the time series can be computed as Eq. 1. To determine the mean instantaneous frequency of an IMF, the ensemble mean of instantaneous frequencies over all time periods in that IMF is calculated as Eq. 2.^{[11]}
Where f (i) is the instantaneous frequency of the i^{th} time period, is the mean instantaneous frequency of the IMF and L is the number of critical points denoted as P (i), i = 1,…, L. Assuming that the time period for the i^{th} critical point is defined as T (i) = P (i + 1) − P (i), are full periods (k = 1, 2, 3 and 4) from P (I – 4 + k) to P (i + k) covering the i^{th} time period, are half periods (m = 1 and 2) from P (i – 2 + m) to P (i + m), and T_{quarter} (i) is a quarter period from P (i) to P (i + 1).
Decision tree
In a sequential decision problem, a decisionmaker faces a sequence of decisions, nd each decision may impact future decisions. A convenient language to introduce sequential decision problems is through DTs. The most important feature of DT is their capability to break down a complex decisionmaking process into a collection of simpler decisions, thus providing a solution, which is often easier to interpret. A DT is a connected graph without cycles for illustrating and analyzing a decision, where expected values are calculated alternately. It is composed of the complex of decision nodes which are connected to each other by branches, and branches go down from root nodes until they come to leaf nodes. A DT is started from the top point of it which is called tree root. Indicators are examined in decision nodes, and each result is evaluated in branches. Each branch leads to another decision node or the final leaf node. DTs are used successfully in many diverse areas such as signal classification, character recognition, remote sensing, medical diagnosis, expert systems, and speech recognition, to name only a few.^{[15],[18],[19]}
The proposed method
[Figure 1] shows the block diagram of the proposed method. The length of the processing window has been considered 4 s, according to the recorded signal length during one trial. First, the signal was bandpass filtered from 1 to 40 Hz. Then, effective IMFs (first three IMFs) are extracted from the signal by applying EMD on the filtered signal. In the following, each separate IMF, the combination of two consecutive IMFs and combination of all three IMFs are considered as six conditions. For each condition, two features are extracted. These features include the frequency related to the peak of spectrum and the normalized local energy in this frequency. Eq. 3 defines the normalized local energy where f_{max} is the frequency related to the peak of the spectrum, P (f) is the power value, and △f indicates frequency resolution.  Figure 1: Explanation of the empirical mode decompositiondecision tree model for recognizing steadystate visual evoked potential frequency components, f_{i}denotes the frequency related to the peak of spectrum, and NLE_{i}is the normalized local energy in this frequency
Click here to view 
Finally, the DT classifier designed based on harmonic detection will be used for data classification.
In the DT design process, at the root node, the maximum frequency ratio of the first two IMFs (IMF1 and IMF2) is investigated. If there is half relation between frequencies, the smaller frequency is accepted as the fundamental stimulation frequency. However, if frequencies do not have harmonic relation, this ratio is calculated for the second consecutive IMFs (IMF2 and IMF3). In this step, if the condition is true, the normalized local energy ratio of these IMFs must also be calculated, and the final decision is made from the response of the energy ratio condition. Otherwise, the maximum frequency ratio of the effective IMFs and the combination of IMFs is evaluated stepbystep. In general, in these nodes, if there is a halfrelation between frequencies, the smaller frequency is accepted as the fundamental stimulation frequency; otherwise, they go to the next comparison node. [Figure 2] shows the proposed classification algorithm.
Results   
[Figure 3] illustrates the instantaneous frequency histogram of six conditions (each separate IMF, the combination of two consecutive IMFs and combination of all three IMFs) for six subjects, 10 sessions for each subject and 21 stimulation frequencies for each session. Results of recognition accuracy^{[14]} stimulation frequency are illustrated in [Figure 4] for two combinations of two consecutive IMFs. As shown from the figure, the recognition rate of IMF_{23} has a high value for low stimulation frequencies; however, this rate will gradually decline with increasing frequency. On the other hand, IMF_{12} associated with good recognition accuracy, only for the middle and upperfrequency ranges.  Figure 3: The instantaneous frequency histogram for sixintrinsic mode functioncombination conditions
Click here to view 
 Figure 4: The comparison of recognition accuracy regarding stimulation frequency for two combinations of two consecutive intrinsic mode functions
Click here to view 
The recognition accuracy values of the proposed method for each subject were calculated according to the [Table 1]. To compare the proposed approach with EMDbased methods used in previous studies, accuracy values of the EMDFFT (in different conditions) have been computed. To further evaluate the validity of the proposed method, results of the benchmark CCA method (for two cases, choosing one harmonic (n = 1) and two harmonics (n = 2) in constructing reference signals) have also been reported.  Table 1: Recognition accuracy of the proposed approach in comparison with other conventional methods
Click here to view 
The highest recognition accuracy of the proposed method was obtained by subject 3 with 89.04% while the lowest value was achieved by subject 4 with 63.80%. The total mean recognition rate was obtained 79.75% for the proposed method while the maximum corresponding value obtained by six EMDFFT conditions was 72.05%. The accuracy value for the CCA achieved by choosing one and two harmonics were 76.47% and 77.31%, respectively.
Discussion   
In the EMD, decomposition method, selecting an IMF based on the proximity with the stimulation frequency range, has optimum performance only for limited stimulation frequency ranges. If we use wider range of frequencies, this selecting method and even the method of IMF combination are not satisfactory. The instantaneous frequency histogram for six IMF combination conditions reveals that for wider range of stimulation frequencies, not only one IMF, but also the combination of IMFs does not have desirable efficiency. Moreover, the results of recognition accuracy also indicate inefficient use of IMF for the selected stimulation frequency range.
Conclusion   
The idea of selecting the desired IMF adaptively based on Decision tree, has improved the recognition rate more than 2.4% and 7.7% compared to the CCA and EMD FFT, respectively. Using the proposed method, the utilization of the wide stimulation frequency range has been provided in SSVEP based BCI systems.
Financial support and sponsorship
None.
Conflicts of interest
There are no conflicts of interest.
References   
1.  Waytowich NR, Krusienski DJ. Multiclass steadystate visual evoked potential frequency evaluation using chirpmodulated stimuli. IEEE Trans Hum Mach Syst 2016;46:593600. 
2.  Movahedi MM, Mehdizadeh A, Alipour A. Development of a brain computer interface (BCI) speller system based on SSVEP signals. J Biomed Phys Eng 2013;3:816. 
3.  Singla R, Khosla A, Jha R. Influence of stimuli color on steadystate visual evoked potentials based BCI wheelchair control. J Biomed Sci Eng 2013;6:1050. 
4.  AcevesFernandez MA, FernandezFraga SM, PedrazaOrtega JC, TovarArriaga S. EEG Signal analysis methods based on steady state visual evoked potential stimuli for the development of brain computer interfaces: A review. Am J Comput Sci Eng Surv 2016;4:118. 
5.  Liu Q, Chen K, Ai Q, Xie SQ. Review: Recent development of signal processing algorithms for SSVEPbased brain computer interfaces. J Med Biol Eng 2013;34:299309. 
6.  Oikonomou VP, Liaros G, Georgiadis K, Chatzilari E, Adam K, Nikolopoulos S, et al. Comparative Evaluation of StateoftheArt Algorithms for SSVEPBased BCIs. ArXiv Preprint arXiv: 1602.00904; 2016. 
7.  Huang NE, Shen Z, Long SR, Wu MC, Shih HH, Zheng Q, et al. The empirical mode decomposition and the Hilbert spectrum for nonlinear and nonstationary time series analysis. Proc R Soc Lond A Math Phys Eng Sci 1998;454:90395. 
8.  Williams N, Nasuto SJ, Saddy JD. Evaluation of empirical mode decomposition for event related potential analysis. EURASIP J Adv Signal Proc 2011;1:111. 
9.  Ng CM, Vai MI. Detection of Attention to Rest Transition from EEG Signals with the Help of Empirical Mode Decomposition. Intell Comput Inf Sci 2011;135:6671. 
10.  Ruan X, Xue K, Li M. Feature Extraction of SSVEPBased BrainComputer Interface with ICA and HHT Method. IEEE 11 ^{th} World Congress on Intelligent Control and Automation (WCICA); 2014. p. 241823. 
11.  Wu CH, Chang HC, Lee PL, Li KS, Sie JJ, Sun CW, et al. Frequency recognition in an SSVEPbased brain computer interface using empirical mode decomposition and refined generalized zerocrossing. J Neurosci Methods 2011;196:17081. 
12.  Huang M, Wu P, Liu Y, Bi L, Chen H. Application and Contrast in BrainComputer Interface Between HilbertHuang Transform and Wavelet Transform. IEEE 9 ^{th} International Conference In Young Computer Scientists (ICYCS); 2008. p. 170610. 
13.  Huang L, Huang X, Wang YT, Wang Y, Jung TP, Cheng CK, et al. Empirical mode decomposition improves detection of SSVEP. Conf Proc IEEE Eng Med Biol Soc 2013;2013:39014. 
14.  Sadeghi S, Maleki A. The EMDCCA with Neural Network classifier to recognize the SSVEP frequency. J Biomed Eng 2017;11:9148. 
15.  Heidari H, Einalou Z. SSVEP extraction applying wavelet transform and decision tree with bays classification. Int Clin Neurosci J 2017;4:917. 
16.  Bastos TF, Muller SM, Benevides AB, SarcinelliFilho M. Robotic wheelchair commanded by SSVEP, motor imagery and word generation. Conf Proc IEEE Eng Med Biol Soc 2011;2011:47536. 
17.  Wang YH, Yeh CH, Young HW, Hu K, Lo MT. On the computational complexity of the empirical mode decomposition algorithm. Phys A Stat Mech Appl 2014;400:15967. 
18.  Safavian SR, Landgrebe D. A survey of decision tree classifier methodology. IEEE Trans Syst Man Cybern 1991;21:66074. 
19.  Kikuti D, Cozman FG, Shirota Filho R. Sequential decision making with partially ordered preferences. Artif Intell 2011;175:134665. 
[Figure 1], [Figure 2], [Figure 3], [Figure 4]
[Table 1]
