Escolar Documentos
Profissional Documentos
Cultura Documentos
www.elsevier.com/locate/clinph
Associazione Fatebenefratelli per la ricerca (AFaR), S. Giovanni Calibita-Isola Tiberina, 00185 Rome, Italy
b
Casa di Cura SAN RAFFAELE Cassino and IRCCS SAN RAFFAELE PISANA, Rome, Italy
c
Neurology, Campus Biomedico University, Via Emilio Longoni, 83 00155 Rome, Italy
d
Semeion Research Centre, Via Sersale, 117 00128, Rome, Italy
e
Bracco SpA Medical Department, Via E. Folli, 50 20134, Milan, Italy
f
Clinical Neurophysiology Service (DISEM), University of Genova, Italy
g
Associazione Fatebenefratelli per la ricerca, Istituto di ricovero e cura a carattere scientico S. Giovanni di Dio-Fatebenefratelli,
14 Via Piastroni, 4 2512 Brescia, Italy
h
Dipartimento di Scienze Biomediche, Universita` di Foggia, Foggia, Italy
Accepted 19 March 2008
Abstract
Objective: It has been shown that a new procedure (implicit function as squashing time, IFAST) based on articial neural networks
(ANNs) is able to compress eyes-closed resting electroencephalographic (EEG) data into spatial invariants of the instant voltage distributions for an automatic classication of mild cognitive impairment (MCI) and Alzheimers disease (AD) subjects with classication
accuracy of individual subjects higher than 92%.
Methods: Here we tested the hypothesis that this is the case also for the classication of individual normal elderly (Nold) vs. MCI subjects, an important issue for the screening of large populations at high risk of AD. Eyes-closed resting EEG data (1020 electrode montage) were recorded in 171 Nold and in 115 amnesic MCI subjects. The data inputs for the classication by IFAST were the weights of
the connections within a nonlinear auto-associative ANN trained to generate the instant voltage distributions of 60-s artifact-free EEG
data.
Results: The most relevant features were selected and coincidently the dataset was split into two halves for the nal binary classication
(training and testing) performed by a supervised ANN. The classication of the individual Nold and MCI subjects reached 95.87% of
sensitivity and 91.06% of specicity (93.46% of accuracy).
Conclusions: These results indicate that IFAST can reliably distinguish eyes-closed resting EEG in individual Nold and MCI subjects.
Signicance: IFAST may be used for large-scale periodic screening of large populations at risk of AD and personalized care.
2008 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.
Keywords: Mild cognitive impairment (MCI); Alzheimers disease (AD); Electroencephalography (EEG); Articial neural networks (ANNs)
1. Introduction
*
1388-2457/$34.00 2008 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.
doi:10.1016/j.clinph.2008.03.026
1535
1536
ical evaluation, as dened by performances P1.5 standard deviation below the mean value of age and education-matched controls for a test battery including
BusckheFuld and Memory Rey tests; (ii) normal activities of daily living as documented by the history and evidence of independent living; and (iii) clinical dementia
rating score of 0.5. The exclusion criteria for MCI
included: (i) mild AD; (ii) evidence of concomitant
dementia such as frontotemporal, vascular dementia,
reversible dementias (including pseudo-depressive dementia), uctuations in cognitive performance, and/or features of mixed dementias; (iii) evidence of concomitant
extra-pyramidal symptoms; (iv) clinical and indirect evidence of depression as revealed by Geriatric Depression
Scale scores higher than 13; (v) other psychiatric diseases,
epilepsy, drug addiction, alcohol dependence, and use of
psychoactive drugs including acetylcholinesterase inhibitors or other drugs enhancing brain cognitive functions;
and (vi) current or previous uncontrolled or complicated
systemic diseases (including diabetes mellitus) or traumatic brain injuries.
The Nold subjects were recruited mostly among nonconsanguineous patients relatives. All Nold subjects
underwent physical and neurological examinations as well
as cognitive screening. Subjects aected by chronic systemic illnesses, subjects receiving psychoactive drugs, and
subjects with a history of present or previous neurological
or psychiatric disease were excluded. All Nold subjects had
a GDS score lower than 14 (no depression).
Table 1 summarizes the demographic and clinical data
of Nold and MCI subjects.
2.2. EEG recordings
EEG data were recorded in the late morning in a fully
awake, resting state (eyes-closed) from 19 electrodes positioned according to the International 1020 System (i.e.
Fp1, Fp2, F7, F3, Fz, F4, F8, T3, C3, Cz, C4, T4, T5,
P3, Pz, P4, T6, O1, O2; 0.370 Hz bandpass; cephalic reference; see Fig. 1). To monitor eye movements, the horizontal and vertical electrooculogram (0.370 Hz bandpass)
was also collected. All data were digitized in continuous
recording mode (5 min of EEG; 128256 Hz sampling
rate). In order to keep constant the level of vigilance, an
experimenter controlled on-line the subject and the EEG
Table 1
Demographic and neuropsychological data of healthy elderly (Nold) and
mild cognitive impairment (MCI) subjects
N
MMSE
Age
Education
IAF
Female/male
MCI
Nold
115
25.2 (0.2SE)
76.3 (0.6SE)
8.2 (0.4SE)
8.9 (0.1SE)
49M/66F
171
27.7 (0.2SE)
64 (1.4SE)
8.5 (0.3SE)
9 (0.1SE)
79M/92F
traces and alerted the subject any time there were signs of
behavioral and/or EEG drowsiness.
Continuous EEG data were down-sampled to 128 Hz
(EEG track). The EEG epochs with ocular, muscular,
and other types of artifact were preliminarily identied
by a computerized automatic procedure. EEG epochs with
sporadic blinking artifacts (less than 10% of the total) were
corrected by an autoregressive method (Moretti et al.,
2003). Two independent experimenters blind to the diagnosis manually conrmed the EEG data accepted for further
analysis.
2.3. Control spectral analysis of the EEG data
The statistical analysis of the EEG power density spectra between Nold and MCI subjects allowed the evaluation
of the quality of the EEG data as an input for the IFAST
analysis, with reference to the expected slowing of EEG
rhythms in pathological aging, namely, lower alpha and a
trend towards higher delta in the MCI than Nold subjects.
A digital FFT-based power spectrum analysis (Welch technique, Hanning windowing function, no phase shift) computed power density of the EEG rhythms with 1 Hz
frequency resolution. The following standard band frequencies were studied: delta (24 Hz), theta (48 Hz), alpha
1 (810 Hz), alpha 2 (1012 Hz), beta 1 (1320 Hz), beta 2
(2030 Hz) and gamma (30-40 Hz). The spectral power
density at each electrode was normalized to the spectral
power density averaged across all frequencies (0.5-40 Hz)
and across all electrodes.
The normalized spectral power density values were used
as dependent variables for the ANOVA. The ANOVA factors (levels) were Group (Nold and MCI; independent var-
1537
1538
urate the hidden nodes when in the presence of non-stationary functions, and the vector codication (allocated)
of the same hidden nodes (Chauvin and Rumelhart,
1995).
4. Elmans hidden recurrent (Elman, 1990) can be used for
auto-associating purposes, again using the back-propagation algorithm (auto-associative hidden recurrent,
AHR). It was used in our experimentation as a variation
for MLP with memory set to one step. It is not possible
to call it a proper recurring ANN in this form, because
the memory would have been limited to one record
before. We used this variation only to give the ANN
an input vector modulated at any cycle by the values
of the previous input vector. Our purpose was not to
codify the temporal dependence of the entrance signals,
but rather to give the ANN a smoother and more
mediated input sequence. The number of connections
in the AHR BP is the same as an MLP with extended
input, whose cardinality is equal to the number of hidden units.
The entire sample of 286 subjects was recorded at
128 Hz for 1 min. IFAST classied Nold and MCI subjects
based on 1-min of continuous artifact-free EEG data (EEG
track). The EEG track for each subject was represented by
a matrix of 7680 sequential rows (i.e. 128 Hz sampling rate
60 s) and 19 columns (i.e. 19 EEG electrodes of 1020
montage system). Summarizing, the IFAST procedure
was as follows:
Table 2
Auto-associative articial neural networks (ANNs) types and parameters
used during the processing
ANN parameters/type
ABp
NRC
AMLP
AHR
Number of inputs
Number of outputs
Number of state units
Number of hidden units
Number of weights
Number of epochs
Learning coecient
Projection coecient
19
19
0
0
361
200
0,1
Null
19
19
0
19
399
200
0,1
0,5
19
19
0
10
409
200
0,1
Null
19
19
10
10
509
200
0,1
Null
ABp, A back propagation without a hidden unit layer and without connections on the main diagonal; NRC, new recirculation network; AMLP,
auto associative multi-layer perceptron; AHR, auto-associative hidden
recurrent.
1539
mum power density values at alpha 1 band in the posterior regions; minimum values of EEG power density were
detected at high frequency bands (beta 1, beta 2, and
gamma). Compared to the Nold subjects, the amnesic
MCI subjects showed a decrease of the posterior alpha
power and an increase of frontal delta, theta and alpha
power.
Statistical ANOVA of the EEG spectral power density
values showed a signicant interaction (F(60, 17040) =
17.4; p < 0.00001) among the factors Group (Nold and
MCI; independent variable), Band (delta, theta, alpha 1,
alpha 2, beta 1, beta 2, gamma), and Electrode (F3, Fz,
F4, C3, Cz, C4, P3, Pz, P4, O1, O2). Tukey HSD posthoc testing showed that the normalized EEG spectral
power density of alpha 1 and alpha 2 was stronger in
amplitude in the Nold compared to the MCI subjects at
P3, Pz, P4, O1 and O2 electrodes (p < 0.00007). On the
contrary, the amplitude of normalized EEG spectral power
density was lower in the Nold compared to MCI subjects at
F3 (delta, theta and alpha 1 bands), Fz (alpha1 band), and
F4 (delta and alpha1 bands) electrodes (p < 0.002).
Fig. 3. Grand average of the EEG spectral power density values computed in the normal elderly (Nold) and mild cognitive impairment (MCI) subjects for
the electrodes of interest (F3, Fz, F4, C3, Cz, C4, P3, Pz, P4, O1, O2) and for all frequency bands considered (delta, theta, alpha 1, alpha 2, beta 1, beta 2,
gamma). The rectangles indicate the cortical regions and frequency bands in which the EEG spectral power density presented statistically signicant
dierent values with respect to the following patterns: Nold > MCI > , and Nold < MCI (p < 0.05, planned Duncan post hoc testing).
1540
Table 3
Classication results of the four dierent auto-associated articial neural networks (ANNs) in IFAST
Auto associated network type
SE (%)
SP (%)
Errors
AHR
NRC
AutoBp
AMLP
96.16
96.30
93.09
95.87
85.82
92.17
91.39
91.06
90.99
92.73
92.24
93.46
89.98
92.87
92.07
92.82
14
10
11
10
ABp, A back propagation without a hidden unit layer and without connections on the main diagonal; NRC, new recirculation network; AMLP, auto
associative multi-layer perceptron; AHR, auto associative hidden recurrent; SE, sensibility; SP, specicity.
Table 4
Details of the two best articial neural networks (ANNs) used for the present study results
MCI
SE
SP
A.M.Acc.
W.M.Acc.
VP
VN
FP
ANN
Nold
85
78
94.6
100
97.3
90.6
89.7
90.2
92.59
94.87
93.73
92.45
93.28
92.87
70
41
56
77
70
74
8
8
8
98
65
98.1
93.7
95.9
89.8
92.3
91.1
93.94
92.98
93.46
92.67
92.97
92.82
51
59
55
88
60
74
10
5
8
FN
VP+
VP
LR+
LR
AUC
4
0
2
89.7
83.7
86.7
95.1
100
97.5
10.1
9.75
9.9
0.06
0
0.03
0.929
0.939
0.94
1
4
2.5
83.6
92.2
87.9
98.9
93.8
96.3
9.61
12.2
10.9
0.02
0.07
0.05
0.928
0.916
0.92
NRC, new recirculation network; AMLP, auto-associative multi-layer perceptron; MCI N., number of subject in the MCI sample of testing le; Nold N.,
number of subjects in the Nold sample of testing le; SE, sensibility; SP, specicity; A.M.Acc., average mean accuracy; W.M.Acc., weighted mean
accuracy; VP, positive value; VN, negative value; FP, false positive; FN, false negative; VP+, positive predictive value; VP , negative predictive value;
LR+, likelihood ratio for a positive test results (benchmark value P2); LR , likelihood ratio for a negative test results (benchmark value 60.2); AUC,
area under ROC curve (average ROC curve calculated by the threshold method).
Fig. 4. (a) Average ROC curve illustrating the results of NRC network
used for the present study (AUC 0.94); (b) Average ROC curve
illustrating the results of the AMLP network used for the present study
(AUC 0.92). NRC, new recirculation network; AUC, area under ROC
curve (Average ROC curve calculated by the threshold method); AMLP,
auto-associative multi-layer perceptron; SE, sensibility; SP, specicity.
1541
1542
Table 5
The ten dierent ANNs used for the binary classication (Nold vs. MCI) of each subject of the present database in a control analysis
n
ANN
FF_Bm(ab)
FF_Bm(ba)
SE
SP
A.M.Acc.
W.M.Acc.
VP
VN
FP
FN
VP+
98.1
93.7
87
92
92.41
92.98
90.67
92.97
51
59
85
60
13
5
1
4
79.7
92.2
FF_Bp(ab)
FF_Bp(ba)
98.1
93.7
87
92
92.41
92.98
90.67
92.97
51
59
85
60
13
5
1
4
SelfDABm(ab)
SelfDABm(ba)
98.1
95.2
86
91
91.9
93
90
92.97
51
60
84
59
14
6
SelfDABp(ab)
SelfDABp(ba)
100
93.7
86
92
92.86
92.98
90.67
92.97
52
59
84
60
SelfSABm(ab)
SelfSABm(ba)
94.2
90.5
91
95
92.52
92.93
92
92.97
49
57
SelfSABp(ab)
SelfSABp(ba)
98.1
93.7
88
92
92.92
92.98
91.33
92.97
TasmDABm(ab)
TasmDABm(ba)
100
95.2
85
91
92.35
93
TasmDABp(ab)
TasmDABp(ba)
98.1
93.7
87
92
TasmSABm(ab)
TasmSABm(ba)
98.1
95.2
10
TasmSABp(ab)
TasmSABp(ba)
Mean
RV+
RV
AUC
98.84
93.75
7.39
12.2
0.02
0.07
0.941
0.953
79.7
92.2
98.84
93.75
7.39
12.2
0.02
0.07
0.938
0.913
1
3
78.5
90.9
98.82
95.16
6.87
10.3
0.02
0.05
0.934
0.921
14
5
0
4
78.8
92.2
100
93.75
7
12.2
0
0.07
0.92
0.932
89
62
9
3
3
6
84.5
95
96.74
91.18
10.3
19.6
0.06
0.1
0.908
0.932
51
59
86
60
12
5
1
4
81
92.2
98.85
93.75
8.01
12.2
0.02
0.07
0.93
0.934
90
92.97
52
60
83
59
15
6
0
3
77.6
90.9
100
95.16
6.53
10.3
0
0.05
0.923
0.926
92.41
92.98
90.67
92.97
51
59
85
60
13
5
1
4
79.7
92.2
98.84
93.75
7.39
12.2
0.02
0.07
0.936
0.935
87
91
92.41
93
90.67
92.97
51
60
85
59
13
6
1
3
79.7
90.9
98.84
95.16
7.39
10.3
0.02
0.05
0.925
0.95
98.1
93.7
90
92
93.94
92.98
92.67
92.97
51
59
88
60
10
5
1
4
83.6
92.2
98.88
93.75
9.61
12.2
0.02
0.07
0.928
0.916
93.8
92
92.98
92.97
59.1
59.9
3.9
92.1
93.92
12.4
0.07
0.93
5.1
VP
FF, Feed Forward networks; FF_Bm, feed forward network with BiModal learning law (BiModal network was developed at Semeion Center, Buscema
2003); FF_Bp, Feed Forward network with back-propagation learning law; Self, self recurrent networks (created at Semeion Center; Buscema, 1995,
Buscema et al., 2006), which are composed in a feedback loop, by a normal Feed Forward network, an auto-associative Feed Forward network with two
layers and an additional layer of (PEs), called extended input. The auto-associative network receives in input, the feedback signal in input from PEs of the
feed forward of the hidden layer, through mono-dedicated connections. Outputs from PEs of the output layer of the auto-associative are connected,
through mono-dedicated connections, to correspondent PEs of the extended input layer. Finally, outputs from PEs of the extended input layer are
distributed in input to PEs of the Feed Forward hidden layer, through complete net connections. The patterns processed by the auto-associative coincide
with the outputs of the hidden layer of the Feed Forward; SelfSA, self recurrent network, static version; the SelfDA is the dynamic version, introducing a
dependence of the activation functions of the Feed Forward hidden layer on the weights of the connections of the auto-associative. This dependence
should increase feedback dynamism of this architecture; SelfSABp, self recurrent network, static version, with back-propagation learning algorithm;
SelfSABm, self recurrent network, static version, with BiModal learning algorithm; SelfDABp, self recurrent network, dynamic version, with backpropagation learning algorithm; SelfDABm, self recurrent network, dynamic version, with BiModal learning algorithm; TASM, temporal associative
subjective memory, which is constituted, topologically, by a composition, in a feedback loop, of a normal Feed Forward network, SelfReexive (SR)
network with free hidden units, which is a drift model with an additional layer of PEs, called extended input. SR network receives in input the feedback
signal from PEs of the hidden layer of the Feed Forward, through mono-dedicated connections. Outputs from PEs of the free hidden layer of the
SelfReexive are connected, through mono-dedicated connections, to correspondent PEs of the extended input layer. Finally, outputs from PEs of the
extended input layer are distributed in input, to PEs of the Feed Forward hidden layer, through full grid connections. Inner connections of Feed Forward
and SelfReexive networks, as well as those connecting the extended input layer to the Feed Forward hidden layer, have an adaptive weight. Connections
linking the Feed Forward hidden layer to the SelfReexive input layer, and those linking the SelfReexive free hidden layer to the extended input layer
have a xed weight, equal to 1, since they have their only function of is to copy or clone the activation value. In a TASM network (n+1)-th, input model
is not accompanied with simple traces that the network had carried out on the previous n-th model, but rather with a synthetic elaboration (clustering)
of the memory of the previous model. SelfReexive free units supply the values for what we could dene a metamemory; TASMDa = dynamic Tasm
network, which is a modication of the static version of TASM networks. It consists in introducing in the activation functions of the Feed Forward hidden
layer a dependence of the weights of the input-hidden connection on the activation values of the extended input. This dependence corresponds to the
introduction of a node-connection meta-arch, namely, a connection to each input-hidden weight so that the j-th PE of the hidden layer is the product of the
activation values of all PEs of the extended input; TasmSABp, TASM recurrent network, static version, with back-propagation learning algorithm;
TasmSABm, TASM recurrent network, static version, with BiModal learning algorithm; TasmDABp, TASM recurrent network, dynamic version, with
back-propagation learning algorithm; TasmDABm, TASM recurrent network, dynamic version, with BiModal learning algorithm.
Number of subjects
69.57
15.65
9.57
3.48
2.61
80
18
11
4
3
79.75
1.84
3.68
5.52
9.20
130
3
6
9
15
Each ANN gave a classication value from 0 to 1, where the values from 0
to 0.5 corresponded to the classication as Nold and the values from 0.51
to 1 to the classication as MCI. The mean classication value can be used
to estimate the probability of correct classication for that subject. The
distribution of the present Nold and MCI subjects was provided for several
ranges of the classication values.
1543
A crucial issue of the present study is May IFAST contribute to an early instrumental diagnosis of MCI in large
elderly populations? Non-reversible forms of dementias
represent a major health problem in all those countries
where the average life-span is progressively increasing.
An early clinical and diagnostic evidence of AD is prevented by the fact that for a long-lasting pre-clinical stage
brain neural networks rearrange via plasticity mechanisms
their connections and synapses to compensate neural loss
due to neuro-degeneration (Babiloni et al., 2000; Ferreri
et al., 2003; Stern, 2006). This process of plasticity maintains cognitive functions for years before clinical symptoms
of dementia appear, especially in the case that high education and large brain size (i.e. brain reserve) mask underlying disease pathology (Mortimer et al., 2005). Optimum
instrumental approaches for early diagnosis of pre-clinical
AD stages like MCI therefore need to target the severity of
underlying brain pathology independently of brain reserve.
This criterion seems to be satised by IFAST using just 60s eyes-closed resting EEG data as an input. Indeed, IFAST
was able to classify not only MCI vs. AD (Buscema et al.,
2007) but also Nold vs. MCI with accuracy higher than
90%. However, even if conrmed in control longitudinal
studies, this rate is clearly insucient for the use of IFAST
procedure alone for a denite diagnosis of MCI. In the case
of IFAST positive alarm, further neuropsychological and
instrumental approaches should be used to ascertain the
presence of MCI status.
The present results prompt prospective studies on the
predictive value of IFAST in the prediction of the conversion to MCI to AD or back to normalcy. However, apart
of clinical perspectives, the present ndings have an intrinsic value for clinical neurophysiology. They provided further functional data from a large aged population to
support the idea that spatial features of eyes-closed resting
EEG, as a reection of the cortical neural synchronization,
convey information content able to discriminate pre-clinical stage of dementia (amnesic MCI) from Nold when a
sophisticated mathematical approach based on ANNs is
available.
5. Conclusions
We tested the hypothesis that a new procedure (implicit
function as squashing time, IFAST) based on ANNs is able
to compress eyes-closed resting EEG data into spatial
invariants of the instant voltage distributions for a reliable
automatic classication of Nold and amnesic MCI subjects, an important issue for the screening of large populations at risk of AD. The data inputs for the classication by
IFAST were the weights of the connections within a nonlinear auto-associative ANN trained to generate the
recorded EEG data. The classication of the Nold and
MCI subjects reached 95.87% of sensitivity, 91.06% of
specicity, and 93.46% of accuracy. The results indicate
that IFAST can reliably distinguish eyes-closed resting
EEG in Nold and amnesic MCI subjects, and may be used
1544
1545