Walter J. Freeman Journal Article
e–Reprint
Methods of Analysis of Brain
Electrical & Magnetic Signals:
Analytic Techniques Used in the Search for the
Physiological Basis of the EEG
WALTER J. FREEMAN
Department of Molecular & Cell
Biology, University of California, Berkeley, CA 94720, USA
Reprinted
from Chapt 18 of EEG Handbook (revised series, Vol. 1), A.S. Gevins and A.
Rémond (Eds.) 9 1987 Elsevier Science Publishers B.V. (Biomedical
Division)
In the course of
generating behavior, the brain also generates time-varying electric and
magnetic fields. The aim of electroencephalography is to record and measure
samples of these fields during certain states and sequences of behavior, in
order to explain some of the mechanisms by which behavior is generated.
In this chapter the
brain is treated as a hierarchically organized network of neurons whose
dynamics determine behavior and vice versa. The dynamics is largely determined
by the strengths of interaction among the neurons. The tasks of physiologists
are:
(a) to measure
accurately the time courses and spatial patterns of potential fields,
(b) to simulate the
wave shapes of these events with the solutions to differential equations or
their equivalent that represent the dynamics of neural networks,
(c) to estimate
strengths of action ('gains') from rates of change of potential, and
(d) to interpret
changes in strengths of neural action in relation to changes in behavioral
states.
Procedures are outlined
for acquiring and measuring suitable data bases, and for constructing and
evaluating explanatory equations. Although the brain is a distributed and
highly non-linear system, the treatment begins with the development of lumped,
linear systems. This is because the construction of a neural dynamic model
requires specification of the network of elements and their connections,
definition of state variables and observables, types and locations of
non-linear operators, measurement of open loop time and space constants, and
evaluation of state-dependent coefficients. Analysis of non-linear properties
presupposes an understanding in detail of these and other features of a system.
Fortunately, linear domains of complex systems are ubiquitous in the brain when
they are sought; they are the best preparatory path of access to the
distributed, non-linear, multiply looped feedback systems that generate
behavior and the EEG. Emphasis is given to techniques for fine-grain spatial
analysis of the EEG, for use of behavioral correlation to validate and optimize
procedures of measurement, and for empirical detection of chaos, limit cycles,
and random components of EEG recordings. Examples are drawn from the mammalian
olfactory bulb, in which the neural non-linear dynamics manifested in the EEG
sustain the basic operations in olfactory perception.
18.1. The need for models
The neuron releases
electromotive forces (EMFs) at synapses and trigger zones that drive electric
current along dendrites and axons, so that activity is transmitted from one
part of the neuron to another. The current must flow in a closed path; it
crosses the membrane in one direction at the site of its EMF, crosses in the
other direction elsewhere including its sites of action, and passes through the
extraneuronal tissue space in its return path. As the current flows across the
resistance of the tissue, it causes a low amplitude field of potential. That
field constitutes a sign of the activity, but this sign is not itself the
activity. The activity is the space-time pattern of depolarization and
hyperpolarization of the neuron that is established by the flow of current
across the membrane resistance, it is not the potential difference established
by the flow across the extraneuronal tissue resistance. The manifestation is
like the sound of a single part of an engine; it mixes with the sounds of all
other parts to create a roar. Our task of analysis is to localize and measure
the pings and rumbles of particular parts, so as to infer what the parts
contribute to the function of the whole engine. While it is conceivable that
extracellular EEG currents might contribute directly to brain function, just as
sound vibrations might partly determine the performance of an engine, that
possibility is disallowed here.
The key to
understanding brain function lies in determining the strength of action
(represented by a coefficient called the 'gain') of one group of neurons on
another, and the changes in gain with altered behavior. When we can isolate two
groups and show that the action is unidirectional, then the problem is mainly
to demonstrate for each group the relation between its activity and its
manifestation of activity (its sign to the observer, not to the other group).
The gain is given by the ratio of the amplitudes of the two activities.
Groups in the intact
brain are not isolated. They function in networks with parallel forward and
feedback loops. The problem of determining activities from the signs of
activities is the same as in the isolated case, but the procedures for
estimating the forward and feedback gains are more complex than taking ratios
of amplitudes.
The basic procedures
in dynamics are to measure the rates of change and the frequencies in signals,
and to construct a model so as to infer strengths of action as gains from the
measurements. The model typically is a differential equation that allows us to
infer strengths of synaptic interaction among groups of neurons from recordings
of evoked potentials and the EEG.
As an assembly of
neurons the brain is structured by their interconnections into a hierarchy of
parts. Locally dense interconnections, synaptic and non-synaptic, form local
modules such as columns in cortex and clusters in nuclei of the basal ganglia.
Axonal tracts, usually in bidirectional pairs, serve to interconnect these
local parts into subsystems comprising areas of cortex and their related parts
of the basal ganglia. The subsystems are integrated to form major entities such
as the motor, visual and limbic systems; the interactive total comprises the
functioning brain.
In this review the
description of these parts for computational purposes is by a hierarchy of sets
(Freeman 1975, pp. 25-34). A KO set represents a spatially distributed
collection (KO mass) of neurons having a common input (e.g., a set of sensory
receptors, or the first order neurons to which they project) and a common sign
of output (excitatory, +, or inhibitory, -) but no interaction between them. A
KI set models a collection (KI mass) of neurons that has the first two
properties but that is densely interactive by interconnections, either mutually
excitatory (KIe) or mutually inhibitory (KIi). A KII set describes the
interaction of a collection (KII mass) of neurons consisting of a Kle mass and a KII mass. KI and KII sets
represent the neuronal assemblies commonly comprised by areas of cortex of
nuclei; KI and KII subsets suffice for local collections such as cortical
columns and nuclear modules. A network comprising a subsystem is modeled by the
conjoining of KI and KII sets into a KIII set. A network of KIII sets may serve
to represent a neural system at the KIV level, although this step remains to be
demonstrated.
The dynamic action of neurons in the
brain is regulated by the input from sensory receptors (interoceptor and
exteroceptor) and by the strengths of the interconnections determined by genes,
by past experience incorporated by learning, and by the chemical environment in
the brain. The play of the active states is observed in goal-oriented behavior
and in the extraneuronal electric and magnetic fields generated by the neurons.
The aim of electroencephalography is to describe regular relationships between
these two observables, such that underlying neural active states can be
discerned and the dynamics of the generators can be understood.
To achieve this aim 3
levels of inference using models are required. First, a volume conductor model
is used to establish relations between neural active states and the fields of
potential that signify them. Given an electric or magnetic event and a
behavioral event, the neurons generating the electric currents must be
identified. Generally the neural elements for behavioral correlation are KO and
KI masses rather than single neurons. Estimates of their activity are derived
by appropriate ensemble averaging over the activities of neurons comprising
them. By use of neuroanatomy, neurophysiology, neuropharmacology, and volume
conductor theory an electrostatic or electric current model is devised for each
postulated generator of a component in the event, and the spatial distribution
of that component is tested for compatibility with the prediction of the model.
Upon completion of the analysis we should be able to infer that a pattern of
potential difference of a certain kind over points near or in the brain
manifests excitation or inhibition of a certain KO or KI mass or a portion
thereof.
Second, a model is
required to relate given neural active states to neural operations. When a
neural mass is given a certain space-time pattern of input, it generates
another pattern of output. In effect it operates on the input to change it, add
to it, replace it or delete it. The nature of the operation is inferred by
comparing the input and output active states. The preferred language of
description is by an integrodifferential equation, in which an operation is
defined by the ratio of output to input. By constructing and solving a set of
equations for K sets representing the dynamics of a neural mass, and by further
transforming the solution in accordance with the volume conductor model, we can
expect to derive curves that can be closely fitted to observed neuropotential
patterns.
Third, a model is
required to relate neural operations to behavioral operations. This is the turn
of a coin by which a neural event is identified with a psychological event.
This, the most difficult of the 3 steps, is embodied in the experimental
paradigm in which the electric fields are measured. Unless behavior is properly
structured and controlled, the most precise measurements of potential are
likely to be meaningless.
This chapter is mainly
about the second stage, in which data are collected with properly positioned
electrodes in an appropriate behavioral context, and in which the task, is to
construct a set of differential or difference equations to describe the
space-time dynamics of neural systems. The choice of equations is a matter of
convenience, but the physiologist must choose on the basis of prior knowledge
of the cellular anatomy and physiology of the system of interest, including
data from unit and evoked potential studies that are independent of EEG
recordings. In order to lay out the underlying principles of analysis, this
discussion begins with lumped systems and ordinary differential equations. The
extension to distributed non-linear systems is straightforward for those who
have or will acquire the necessary mathematical tools. For the same reason the
examples begin with ERPs and poststimulus time histograms (PSTHs) from
electrical stimulation of brain pathways; the approach is the same for
treatment of events related to sensory stimulation and the EEG.
Readers who are
interested in mathematical and systems analysis of large scale neural activity
and the EEG will find related and alternative approaches in the references at
the end of this chapter. Particularly noteworthy are books by Griffith (1971),
Simon (1972), Lin and Segal (1974), Basar (1976), MacGregor and Lewis (1977),
Thatcher and John (1977), Childers (1978), and Nuñez (1981), and reports
by Wilson and Cowan (1972, 1973), Lopes da Silva et al. (1976), and Ingber
(1982).
18.2. Establishing the data base
The study of neural
dynamics is based on input-output pairs: a stimulus is measured and given, a
response is recorded and digitized, and the numbers representing input, output
and context are stored. The procedure is repeated 102-105 times until there are no further surprises. The pairs are
grouped in accordance with input and context parameters; they might also be
grouped tentatively in relation to output, if there are salient features such
as the apparent presence or absence of major components or distinctive wave
shapes with no apparent intermediates to suggest gradation.
Four aspects of the
nervous system must be kept constantly in mind during these explorations:
background activity (often mislabeled 'noise'), instability, time variance, and
non-linearity. First, all normal nervous systems are continually active whether
or not stimuli are given. The data base should include a null class with no
known stimuli as a basis for characterizing intrinsic activity. The use of
randomized stimuli with respect to time windows of observation is
unsatisfactory, because the distributions of amplitudes may be warped by evoked
potentials. These records should be processed in the same way as all others,
including ensemble averaging, spectral analysis, amplitude histograms,
calculation of moments, covariance matrices, etc.
Second, in collecting
pairs the assumptions are made that the neural system is in a certain state
prior to stimulation, that it is perturbed by the stimulus away from that
state, and that it returns to the same state. More specifically it is essential
to include in each output member a segment of prestimulus baseline, in order to
verify that the presumed state does hold, and that the preceding response has
terminated. More generally we should remain aware that the stimulus may elicit
an instability, such that the neural system may change from a basal state to
another state and perhaps back again some time later. Some well-known examples
include seizures, spreading depression, kindling, long-term potentiation, and
learning, but subtler shifts may hold the key to analysis. The point is
important because the solutions to the neural dynamics in each state can be described
by a set of time-dependent equations, and a new set may be required for each
new state. This is most likely to be the case if each state is described by a
linear approximation.
Third, it is a truism
that the nervous system is never twice the same, and that even in a certain
state, defined by both behavioral and neural variables, there is time variance
such as by fluctuation, drift or learning. These variations are entered into a
set of differential equations by use of coefficients that vary randomly or with
time or with contextual parameters such as satiety or adaptation, on the
assumption that the time scale of this kind of variance is very long or very
short in comparison to the duration of an output event. The coefficients are
fixed for the duration of the simulation of each event, and they are changed to
new values with each simulation, if necessary.
Fourth, linearity
denotes the property of a limited domain of dynamics in which superposition
holds. In this kind of state outputs from multiple inputs are additive, and
their magnitudes are proportional to the input magnitudes. Experimentally this
domain is demonstrated by paired-shock testing, in which two stimuli are given
first separately and then together. The algebraic sum of the separate digitized
responses equals the compound digitized response if the neural system is
perturbed within one of its linear domains. If the sum of single responses
differs from the compound response, the system is in a non-linear domain.
Neurons and neural
systems have multiple linear and non-linear domains. The demarcation requires
extensive parametric testing using systematic variation of stimulus intensity,
duration, rate and interval. A non-linear domain may be linearized by
re-definition of variables, or it may be divided into smaller domains that are
described by piece-wise linear approximations. For example, the action
potential manifests a non-linear dynamic process, but trains of action
potentials can be described by pulse frequency or density, which in itself may
be a linear variable with respect to transmembrane potential.
However tedious
parametric analysis might be, it is essential as the basis for constructing a
first dynamic model of a neural system. We will do well to recall that 20 years
of subthreshold testing of axons in their linear domain was a necessary basis
for the construction of the explicitly non-linear Hodgkin-Huxley equations for
the nerve action potential. The dynamics of a system in one of its linear
domains is described by a set of linear differential equations. Once we
establish linear domains we can gently push the system outside and begin to ask
questions of the location and nature of non-linearities, whether static (as in
much evoked potential research) or dynamic (as with the action potential), and
of the nature of the thresholds and stability properties that characterize one
or more non-linear domains. Indeed, it is in these non-linear domains where
answers to questions on the neural bases of behavior are to be found. The
doorway to access is provided by the linear differential equation or its
equivalent.
18.3. The use of basis functions
The search for linear
domains requires use of low intensity stimuli in order to keep the neural
system in a low amplitude range. This does not guarantee linearity, but
conversely it is nearly certain that if the evoked potential amplitude exceeds
2 or 3 standard deviations of background activity, the neural mechanism will be
driven into a non-linear domain. Owing to the background activity, the low-amplitude
evoked potential can be observed and measured only with the aid of averaging
over ensembles to give an event related potential (ERP). The variability about
the average decreases with the square root of the number of events in the
average (see Ch. 5). However, to the extent that the evoked potential varies
over the time required to collect the ensemble, the information in the ERP
undergoes destructive averaging; the true shape may be lost, together with
information about the manner of variation.
Description of the
particular wave shapes of ERPs is often considered of secondary importance, the
principal aim being to determine the variation of shape within and between sets
of ERPs in relation to associated patterns of behavior. However, the
quantitative description of that variability by use of the variance in sets of
numbers requires that the shapes be clearly characterized and precisely
measured. Further, the information cannot be used to construct explanations of
neural dynamics until the wave shapes are fitted with curves.
Most ERPs have
irregular shapes that are difficult to describe and comprehend, and their
shapes vary erratically and often hugely from trial to trial. The two main
sources of the variation are the background EEG activity and the changes from
trial to trial in the neural mechanisms generating the ERPs. When the
experimental conditions relating to behavioral stimulus and response are fixed
to the extent possible, the variation in wave-form is reflected in the numbers
by what we will call 'uncorrelated variance.' When an experimental condition is
deliberately changed across trials, the ERPs can be expected to change as the
neural mechanisms change, and this introduces 'correlated variance.' A good
system of measurement should produce matrices of numbers that preserve the
correlated variance and minimize the uncorrelated variance. The aim is not to
suppress the variance but to partition it most effectively.
The primary step in
measuring the form and variance of an ERP is to decompose it into components.
Each has 3 aspects. First, each component is an elementary curve. These
elementary curves are called 'basis functions.' When a selected set of such
curves is added together, the sum should closely conform to the shape of the
ERP. Second, each component has a set of numbers or coefficients that denote
its height, width, rate of change, and other features. Third, each component
defines one or more coordinates, and the set of numbers denotes distances along
the axes. The number of coefficients in the set of basis functions specifies
the dimensions of the measurement space. In this point of view the measurement
of each ERP gives a vector in that space.
The basis functions
must be chosen so that each one represents unique aspects of the ERP. This
implies that the basis functions are independent of each other, and the
coordinate axis of each of the coefficients is not parallel to any other axis,
although it need not be orthogonal to any other. If they are not orthogonal,
then pairs of coefficients have non-zero correlation across sets of ERPs. For
example, the phase and frequency of a damped cosine function fitted to ERPs
will vary inversely, if the frequency varies from one ERP to the next, but the
duration of the first peak of the ERP will be fixed. In general, the shapes of
ERPs tend to vary in certain ways and not others owing to constraints in the
underlying neural mechanisms. These constraints become evident on correlation
and factor analysis of sets of coefficients (Freeman 1975, pp. 408-414). They
are exceedingly valuable in model-building.
Each behavioral
measure is also represented by an axis. The variation in wave-forms of the ERPs
and in behavior is represented by the variance of the numbers on each axis. If
there is covariance between the ERPs and behavior, the axes of the ERP
measurement space should be positioned (by the proper choice of the basis
functions and by varimax rotation in factor analysis - see Ch. 5 and 16) to
project strongly onto the behavioral axes in order to display that covariance.
From an alternative
perspective the ERP is conceived as a signal embedded in noise. Here the term
'noise' includes background EEG activity that is uncorrelated with the imposed
stimulus train as well as artifacts and electrode thermal noise. The signal is
asserted to have certain measurable attributes such as amplitude, latency,
frequency, etc. and the aim is to measure these in the presence of noise. With
the use of a digital computer we build a template or filter that is expected to
match the signal in order to identify it in the noise. The form of the signal
is expected to vary with changes in the behavioral conditions. That is, the
filter must be adaptive to the varying form of the signal. Furthermore, the
adaptive filter consists of the sum of a set of elementary curves (basis
functions) that is fitted to the ERP. Hence in both views the measurement
process consists in curve-fitting, and the requisite step is to choose the
basis functions, which may represent the parts of the ERP.
18.4. Decomposition of ERPs with analytic basis functions from the AERP
An ERP is commonly
interpreted as consisting of a sequence of peaks labeled N1, P1, N2, etc., each
of which is measured by height and latency. This basis function is an empirical
upright or inverted 'U.' Although useful in preliminary work this approach has
several drawbacks. Much of the information in the ERP is lost to the extent
that the amplitude and latency numbers extracted fail to allow reconstruction
of the ERP. The numbers give an inadequate basis on which to evaluate the rates
of change in the ERP, which are more valuable than the amplitudes of the peaks.
This is because interactive brain systems are more likely to reveal changes in
their sensitivities by changes in the rates and durations of their activity
patterns than by fluctuations in the amplitudes of activity. Uncertainties
arise when adjacent peaks merge or overlap. There is no way to partition the
uncorrelated variance from the variance that is related to stimulus and response
variables.
The deficiencies of
this approach can be remedied by using sets of elementary curves that are
called 'analytic basis functions.' Each is a continuous curve that extends over
the entire duration of each ERP and is defined by an algebraic term or
expression containing one or more coefficients.
The procedure for
using them has two stages. In the first stage an ensemble average ERP (AERP) is
made over a delimited set of ERPs. The individual ERP contains both signal and
noise; grand averaging minimizes the noise, although it does not entirely
eliminate it. Then a curve consisting of the sum of a set of analytic basis
functions is fitted to the AERP by adjusting the values of all the
coefficients. This curve is called a 'matched filter.' The fitted curve is
treated as an approximation for the signal, and the difference between the AERP
and the curve is treated as residual noise.
We assume that the
noise differs on successive trials, and that the shape of the signal also
differs due to the correlated variance. The shape of the average for the signal
must be unlike any of the signals; it is only an estimate for the shape of each
one. For example, if we take the ensemble average of a set of exponential
curves with a rate coefficient that varies randomly on successive trials, the
mean curve is not exponential but can be fitted with one to an approximation.
In addition to the distortion of the signal, the AERP of course contains none
of the variance of the ERPs.
In the second stage
the matched filter is used to measure the signal embedded in noise in each of
the ERPs. A curve from the same equation is fitted to each of the ERPs in the
set by readjusting the coefficients. That is, the filter is adapted to each
ERP. The variance in the set of ERPs is incorporated into a matrix of
coefficients (usually 4-10) of the basis functions, and the distortions from
the exponential form introduced into the AERP may be largely corrected.
The standard deviation
(S.D.) of the AERP may serve to evaluate the proportions of correlated and
uncorrelated variance in a preliminary assay. If the variance is determined by
the noise and not by changes in the shape of the ERPs, then the S.D. is
relatively constant over the time of the ensemble average. If there are changes
in the ERPs, then the S.D. is not uniform. An example is shown in Fig. 1 of the
AERP over ERPs from 20 trials at each of 3 stages in the training of a cat to
respond to an electrical shock to the lateral olfactory tract (LOT) as a
conditioned stimulus, while the evoked potential is recorded from the olfactory
cortex. Early in training with no significant behavioral responses the S.D. is
uniform. During acquisition the wave shape changes, and the S.D. increases. In
the later stage of consolidation the wave shape reverts near to its original
form, but the S.D. remains high in the neighborhood of the high amplitude of
the AERP, reflecting the variation in shape of the component ERPs. Higher
moments (the cubic and quartic) serve mainly to detect outliers and artifacts,
which can be deleted to improve the quality of the AERP.

Fig. 1. The relations are shown between the standard deviation (solid curves) and mean (dotted curves) of the AERPs of 3 sets of 9 event related potentials (ERPs) from implanted electrodes in a cat being trained to press a bar to the electrical stimulus (olfactory tract) used to evoke the potentials (prepyriform cortex). Above: response on <10% of trials; middle: >50% below: >90% (Freeman 1975).
This procedure
overcomes the deficiencies in raw digitized data. The number of axes in the
measurement space is reduced by an order of magnitude. Each coefficient is
evaluated on the basis of what is happening over the entire duration of an ERP,
so that information on both amplitudes and their rates of change is preserved.
Most importantly, curve-fitting introduces a structured hypothesis into the
measurement process. It asserts on the basis of the grand average that
particular parts or components such as exponentials or damped sine waves may be
used to describe the ERPs. When bursts of background EEG waves are superimposed
on the potential related to a behavioral event, and the signal in the ERP is
obscured by noise, the basis functions impose a form that is adaptive but
cannot be twisted arbitrarily.
More explicitly, if a neural mechanism
has certain stable properties, such that the generic character of its neural
dynamics manifested in a set of ERPs is invariant, then those ERPs from single
trials can be approximated by a set of curves all from the same equation over
sets of trials. The shape of ERPs varies from condition to condition due to
variations in the mechanism, and these variations are expressed in a matrix of
the coefficients in the equations of the fitted curves. Also each ERP contains
noise contributed by the background EEG. The noise can be expected to distort
the values of the coefficients from their putative true values (in the absence
of noise) on each trial.
However, a difference
will persist between the fitted curve and the ERP. The residuals may be
rejected after inspection, because the set of basis functions is not
established in respect to individual ERPs but to the AERP. This rejection is
the main basis for partitioning the variance, with retention only of that part
that is related to the coordinate space for measurement, i.e., the set of basis
functions, because the AERP serves to impose a structured hypothesis onto the
data. The matrix of coefficients will contain less information than the
digitized raw data, but that content should be more richly related to the
neural events underlying the ERPs and therefore to behavior. However, this
hypothesis should not go untested; we should always inspect the residuals and
test them for correlation with behavior before discarding them.
These procedures are
applied to the EEG in Section 18.10, where an ensemble average of a spatially
distributed set of EEG segments simultaneously recorded from an electrode array
(an AEEG) plays the role assigned here to the AERP.
18.5. Spectral analysis of ERPs
At this stage we may
use the Fourier transform and spectral analysis to decompose the wave shape of
the ERPs into sets of sines and cosines over a range of frequencies (see Ch.
4). There are several reasons for this. First, the ERPs may contain oscillatory
components that will require use of oscillatory basis functions. Second, the
results may suggest improvements in recording procedures to enhance or diminish
one or more parts of the spectrum by pre-filtering the ERPs; or, if the main
frequencies are separated by at least a factor of two, the ERPs may be
decomposed into two or more parts by pre-filtering. Third, spectral analysis
may be applied to the background EEG, which is a prime source of the variance of
the ERP. If there is a strong periodicity in the EEG, the stimulus repetition
rate should be set at an odd multiple of the half-length of the period. For
example, if a set of cortical evoked potentials is recorded in the presence of
strong gamma activity at 40 Hz, the stimulus rate should not be 5/sec (which
might lock on to a burst of gamma at any phase) but might be 4.5 or 5.5/sec; or
the interstimulus interval might be randomized.

Fig. 2. The characteristic effect on ERPs of increasing the stimulus intensity is a spectral shift to the left. The lowest curve is the amplitude spectrum of the prepyriform EEG in a cat. The remaining curves in ascending order are the amplitude spectra of the Fourier transforms of ERPs (normalized to equal maximal amplitude) over a range of increasing stimulus intensity. The asterisk shows the level of the threshold for behavioral responses to the electrical stimulation, at which the single evoked potentials are close in amplitude to the background EEG. At this level the ERP spectrum most closely conforms to the EEG spectrum. (From Freeman 1975.)
Yet another use of the
EEG spectrum is to compare it with the spectrum of the ERP (Fig. 2). Do the
spectral peaks occur at the same frequencies in the EEG and the ERP, and is it
desirable or not that they do so? From the standpoint of observability of the
single ERP it is not desirable, because the EEG and ERP cannot be separated by
bandpass filters. Moreover, such correspondence of spectral peaks happens most
often when the amplitude of the single-trial ERP is the same or less than the
amplitude of the EEG. Because the only distinguishing characteristic of the ERP
is the time of its occurrence, averaging in this case is a necessity.
From an alternative
standpoint, the stimulus is seen as superimposed on background input to a
functional group of neurons, and the characteristics of the response to the
stimulus should closely resemble those of the response to the background input.
With a strong stimulus the neural system may respond abnormally, usually
involving saturation or partial blockage of the neural mechanism, so that the
normal range of ERP variance is diminished. Then strong stimulation, which is
commonly used to improve signal to, noise ratios, should be avoided. After all,
the aim of analysis is not the suppression of variance but decomposition and
selective removal. Parametric testing should be used, in which the stimulus
intensity or some other experimental variable is changed in small steps over a
suitable range. For example, a series of ERPs and standard deviations can be
obtained over a range of stimulus intensities or output amplitudes (as
appropriate), the dependence of wave shape on intensity noted, and a range with
optimal variance chosen for further study such that there is an optimal
covariance with behavior (Fig. 3). It may also prove useful to adjust the
stimulus intensity to maximize the non-uniformity of the second moment with
respect to ERP time (Fig. 1).

Fig. 3. Two sets of ERPs (dotted
curves) are shown from the olfactory bulb on stimulation of (right) the lateral
olfactory tract (LOT) and (left) the primary olfactory nerve (PON). From above
to below the stimulus intensity is raised in steps, which changes the amplitude
and shape of the ERPs. Solid curves: sum of fitted basis functions. Dashed
curves: dominant and subsidiary components (respectively ‘signal' and
'clutter' - see Section 18.6). (From Freeman 1975.)
Typically we have some
control over the correspondence between the spectral peaks of the ERP and EEG
because we can change the stimulus rate or intensity which will change the
spectrum of the ERP. The most likely result of increasing the stimulus
intensity is an increase in amplitude of the ERP above the EEG and a decrease
in the frequencies of the ERP (Fig. 3), which is reflected in a prominent
spectral shift (Fig. 2). This makes an ERP much easier to retrieve from the
background activity, but it also decreases its variance. This may be an
instance of throwing out the baby with the bath water, because in attempting to
observe the signal, we may excessively reduce the correlated variance as well.
The Fourier transform
of an ERP or AERP also provides a basis for estimating the amount of
information that can be extracted. The phase spectrum should be displayed along
with the gain spectrum preferably in the form of a polar plot (Fig. 4). Each
peak or trough in the gain spectrum may represent a characteristic frequency
(called a 'pole') or a null (called a 'zero') in the system (Appendix B). We can
deduce the order of the system from the goodness of fit of curves to the data
in both the time and frequency domains, i.e., the number of simultaneous first
order differential equations to be evaluated. Each 90° of phase lead or lag
will require a first order differential equation in the set needed to describe
the dynamics, or equivalently yet another term in an n-th order equation. In
the example in Fig. 4 the polar plot can be viewed as the track of the tip of a
vector that rotates clockwise around its base at the origin from low to high
frequency. It sweeps through 6 quadrants comprising 540°; the neural system
is minimally sixth order. The goal of curve-fitting at this stage is not
parsimony (the smallest number) but evaluation of as many rate coefficients as
are indicated by the system. If the neural system is operating in a linear
domain, each term or equation will represent a real dynamic transformation. If
it is not, then some of the terms or equations will reflect the dynamics
spuriously. For example, a saturation non-linearity may give rise to harmonics.

Fig. 4. Above left: ERP from a cat with implanted electrodes (dotted curve) fitted with sum of basis functions (solid curve). Above right: amplitude and phase of the two curves at left after Fourier transform by numerical integration. Below left: polar plot of the Fourier transform. Lower right: dominant and subsidiary components, which are subsets of basis functions comprising the curve fitted to the ERP. (From Freeman 1975.)
Linearization is principally desired by the fact that the curves comprising the solutions to linear differential equations include pulses, steps, ramps, exponentials, sines, cosines, damped sine waves, and several other elementary wave-forms. This may be called the 'family of linear basis functions.' There are numerous electrophysiological systems at both the single neuron level and the level of masses of neurons for which the dynamics in certain ranges can be described with linear differential equations. On driving by electrical impulses each output must then consist of the sum of a set of linear basis functions. The Fourier transform is an example of linear decomposition of a signal into sinusoidal basis functions; it is most effective when the system that generates the signal is operating in a linear range.
At this stage we
return to the experiment and again carry out careful parametric testing (which
presumably has already been done), i.e., systematic variation of stimulus
interval and intensity (as appropriate) to determine where linear ranges of
function exist (Freeman 1975, Ch. 2) in which additivity and proportionality
hold. An example is shown in Fig. 5 of the ERPs from single-shock responses
recorded in the hippocampus on electrical stimulation of the fornix (Horowitz
1969). Over a range of low stimulus intensities (a-c) the ERP increases in
amplitude without change in its form (proportionality). Over a middle range
(d-h) the generic character of the ERP is invariant (approximating a damped
sine wave), but the frequency and decay rate of the oscillation change as the
initial amplitude increases. At the highest stimulus level shown (i) the ERP
changes to a new generic character. The 3 ranges are linear, piece-wise linear,
and non-linear.

Fig. 5. ERPs are shown from the
ventral hippocampus at increasing amplitudes of single shocks given to the
dorsal fornix. (a) 2.5 V, N = 6600; (b) 3.0 V, N = 3600; (c) 4.0 V, N = 1800;
(d) 5.2 V, N = 850; (e) 7.0 V, N = 650; (f) 9.0 V, N = 510; (g) 11.0 V, N =
417; (h) 15.0 V, N = 357; (i) 20.0 V, N = 290. Duration of each ERP is 125
msec. (From Horowitz 1972.)
Alternatively we may
systematically explore the background conditions for recording. An illustration
of this procedure is shown in Fig. 6. The set of ERPs is taken from a cat with
stimulating electrodes in the optic tract and recording electrodes in the
superior colliculus. The cat is placed in a light-tight box and the uppermost
ERP is taken. Thereafter the ERP is taken at the designated times during
presumptive dark adaptation. Control sequences are taken in the same box with
illumination to control for the effects on the ERP of habituation, changes in
motivation, etc. (Pickering and Freeman 1967). The sets of ERPs from the
hippocampus and superior colliculus resemble damped sine waves. The degree of
conformance is tested by fitting to each one a curve generated from the
equation for a damped sine wave. The fit is rather close over the linear (Fig.
5, a-c) and near-linear (Fig. 5, d-h) ranges for the hippocampal ERPs (for
example, lower frame in Fig. 7) but not for the non-linear range (Fig. 5, i).
The collicular responses conform only when the initial positive peak (Fig. 6)
is excluded from the curve-fitting procedure (Fig. 7, upper frame). In both
sets the frequency, decay rate and phase of the damped sine wave change in
relation to the antecedent variable. Readers may profitably compare these
illustrations of ERPs from the olfactory bulb and cortex, the hippocampus, and
the superior colliculus with examples commonly seen in the literature and with
the common varieties of human ERPs (Ch. 5) in order to see how much can be
learned by the systematic search for linear domains.

Fig. 6. A set of ERPs is shown from a cat with electrodes chronically implanted to stimulate the optic tract and record from the superior colliculus. At the time of the upper ERP the cat is placed in a dark box, and thereafter the ERP is taken at the times indicated. The frequency of the ERP increases slightly and the decay rate decreases during dark adaptation (Pickering and Freeman 1967).
18.6. Fitting curves and testing
the results
Each basis function is
specified by an algebraic term containing one or more coefficients, and the sum
of basis functions is generated by the sum of the terms. The next step is to
fit the sum of basis functions to an AERP. This entails finding the proper
values of the coefficients by curve-fitting. A number of standard computer
procedures are available for this. The most useful procedure is non-linear regression
(Golub and Pereyra 1973; Kaufman 1975; Freeman 1975, pp. 103-106), in which all
of the coefficients (usually 4-10) are simultaneously adjusted. There are 5
tests that must be passed for provisional acceptance of a fitted curve.

Fig. 7. Damped sine wave curves are fitted to the ERPs from the superior colliculus (above) and hippocampus (below) to measure frequency, decay rate, phase and amplitude. (From Freeman 1980.)
The first and most
basic criterion for determining the optimal values for the coefficients is
minimization of the sum of squares of the differences between the AERP and the
sum of basis functions. The ratio of mean square differences to mean square
data should be well under 5%. Typically a first attempt yields an
unsatisfactory match, and repeated trials must be made while adding new terms
or changing existing terms. A successful outcome of the first test is when the
residue on subtracting the fitted curve from the AERP resembles white noise or,
better still, the appearance of records of averages over trials on which no
event occurred.
A second and essential
test of such a fitted curve is to fit the same curve to each of the ERPs that
comprise the AERP. Here the basis functions are adapted, in that the values of
their coefficients are optimized to fit each subaverage. If this cannot be
done, the most likely reasons are that the basis functions were not properly
chosen to reflect the variation, or that averaging was sufficiently destructive
to remove certain components, or that one or more components occurred on some
trials and not on others. Either a new matching set of basis functions must be
sought, or the ensemble must be divided into subsets on the basis of the
results on hand.
The third test of
whether the decomposition is effective lies in study of the relations between
sets of coefficients and concomitant measurements of behavioral variables. When
the experimental conditions have been fixed, we should observe
‘spontaneous' variation of the coefficients from trial to trial. When a behavioral
factor is varied, some further variation of coefficients should appear. In both
cases the values for each coefficient should be 'well-behaved' in the sense
that they fall into a unimodal distribution (not necessarily normal), and that
we can see patterns of covariation within the matrix of coefficients.
The patterns of
covariation between coefficients are best explored with a graph of each pair to
determine the ranges and distributions, and the form of the relation.
Three examples are
shown in Fig. 8. The upper frames show the AERPs from 3 cats (triangles), the
fitted curves (solid), and the dominant basis function and clutter (dashed
curves). The lower frames show the values for the frequency and decay rate of
the ERPs plotted each as a triangle. The regression line for each set is
flanked by the 95% confidence intervals (dashed curves). Also, negative
correlations hold between amplitude and both frequency and decay rate; the
arrows indicate the direction of increasing ERP amplitude. Whenever they are
needed, suitable transformations should be sought to normalize the
distributions and to linearize as many of the relations among the coefficients
as is feasible (see Ch. 16). Then factor analysis with Varimax rotation is used
to characterize the dimensionality of the patterns (Emery and Freeman 1969)
(see Ch. 5 and 16).

Fig. 8. Three examples are shown in
the upper frames of AERPs (triangles), the sum of basis functions (solid
curves), and the dominant component and clutter (dashed curves). The recording
sites are monopolar on the surface of the olfactory bulb; the stimulus sites
are the primary olfactory nerve (PON). The frequencies in radians/sec are
plotted against the decay rates in I/sec of the component ERPs in the 3 lower
frames (see Appendix B). The ERPs in the left frame are collected concomitantly
from 64 electrodes in an 8 X 8 array; the decay rate decreases in proportion to
the increase in response amplitude (arrow) at fixed stimulus intensity. The
stimulus is increased stepwise for the other two sets of ERPs (increasing
amplitude in the direction of the arrow). Each solid line shows the regression
line; the dashed curves show the 95% confidence limits. The variation in the
slope of the relation between frequency and decay rate is analyzed elsewhere
(Freeman 1975, Ch. 6); the product moment correlation between frequency and
decay rate does not by itself adequately describe the relationship. The small
circle on the abscissa shows the average decay rate of the bulbar ERP under
deep anesthesia, when the EEG is suppressed and the bulbar evoked potential
becomes non-oscillatory; this 'open loop' response is discussed in Section 18.7
and is illustrated in Appendix A.
The fourth test is to
determine whether the same set of basis functions can be applied to the ERPs
from several subjects, and whether the same factors emerge from the matrix of
coefficients. Here we must come to terms with between-subject variability. It
should now be possible to fit the ERPs of considerable variety of shape with the
same sum of basis functions, though the mean values of the coefficients differ
significantly. Alternatively it may be necessary to add, delete, or change
terms within the sum. It is reasonable to postulate that the same neural
mechanisms operate in all subjects under study, but that their manifestations
in the form of ERPs may undergo distortions in differing ways or be overlapped
by subsidiary events in unpredicted ways. If we can identify one or more
components that are always present (though variable in shape), we may
characterize this subset of basis functions as a signal and treat the other
sporadic basis functions as background 'clutter.'
There is another way
of looking at the problem of extracting behaviorally relevant information. The
procedure of curve-fitting is essentially the design of a matched filter to
detect a desired signal against a white noise background. Any one basis
function or any subset of them within a set comprising a fitted curve can be
specified as a filter that is matched to a signal. Then the remaining basis
functions serve as a filter that is matched to events in the ERP other than the
desired signal.
For example, the ERP
in Fig. 4 has 2 damped sine wave components. The larger one is a manifestation
of the response of an intracortical negative feedback loop in the olfactory
cortex to single-shock stimulation, that is manifested by a reverberation
between sets of excitatory and inhibitory neurons. This is the signal. The
smaller component is the manifestation of a number of complicated feedback
loops that include or project onto the intracortical loop, including neurons in
the olfactory bulb, nucleus and tubercle, and possibly recurrent projections
from the cortical efferent systems. The contributions to the dynamics of those
longer pathways are not easily distinguishable from each other, but the sum of
their effects can be approximated in form by a damped sine wave. This is the
'clutter.' When the cortical dynamics change, both the signal and the clutter
vary, and the two filters are adapted (by curve-fitting) simultaneously to
reflect the change. To a certain extent the variations in the two filters are
correlated, as they should be, because they both involve the intracortical
feedback loop.
We can expect that for
differing stimulus sites or different subjects the category of the signal will
be invariant. That is, the equation for the dominant component is always a
damped sine wave, though the values for the coefficient are variable. However,
the form of the clutter may vary, meaning that the equation for the sum of the
other subsidiary basis functions may vary as well as the values of the
coefficients (Fig. 3, left and right frames).
The use of subsidiary
basis functions can be regarded as a pre-whitening transformation of the ERP,
before the AERP filter is adapted to ERPs in order to recover the desired
signal. In the interest of economy, if the ranges of the frequencies of the
filters for the signal and the clutter are sufficiently far apart, the coefficients
of the filter matched to the clutter in the AERP may be fixed, so that only the
coefficients of the filter for the signal are varied during adaptation to the
signal by non-linear regression. The variance of the dominant component in
relation to behavior may still be optimally preserved.
The fifth and
ultimately most important test is correlation with behavior. The underlying
hypothesis is that the electrical signs in each part of the brain depend on
neural activity of that part, and that some albeit undefined aspects of
on-going behavior also depend on that part, because the intact brain functions
as 'an organic whole.' Our own choice for a 'behavioral touchstone' is to
measure the rate of work done by hungry cats for food in an ergometer over a
period of 10 sec that is required to derive each ERP (Freeman 1962b). The ERPs
are fitted with curves consisting of 2 damped cosines, yielding values for 8
coefficients. From a total of 30 trials, 30 sets are collected from each cat.
The sets of 8 X 30 values are introduced into a set of 30 simultaneous linear
equations with 8 ERP coefficients as the independent variables and the rate of
work as the dependent variable. Then we calculate the multiple correlation
coefficient R2 to evaluate the fraction of the
total variance of rate of work that can be predicted from the ERPs. The
criterion for the optimization of the process of measurement is the
maximization of R2 (Freeman 1964). Upon completion of
the development the fraction ranges from 0.19 to 0.55 among 7 cats; on the
average 36% of the variance in rate of work for food is predictable from the
ERPs of one prepyriform cortex. We conclude that the ERP data are optimally
cast in forms suitable for neural modeling.
From this example, the
desired properties of the behavioral touchstone are that it be an event which
extends through the time course of an evoked potential (or a set of them for an
ERP), which is continuously variable in magnitude, and for which the magnitude
is determined by brain function, whatever might be the antecedent conditions
(e.g., food deprivation). Further, the real numbers in which events are
expressed should be normally distributed on an ordinate scale. The development
of such a criterion may tax the ingenuity, but the reward is reliable data.
18.7. Models of the dynamics of
neural masses from ERPs
In this section we
consider procedures used to explain the shapes of ERPs in terms of the
anatomical and physiological properties of neurons. The properties we rely on
are well known to hold for almost all types of neurons; no special or unusual
characteristics are needed. In the spirit of an introductory review we restrict
consideration to ERPs that accrue from electrical stimulation, that conform to
superposition under paired-shock testing, and that are fitted with sums of
linear basis functions. In this circumstance each basis function is the
solution to a differential equation that can be found in a table of
mathematical functions.
The differential
equation is not in itself a model; it must be structured so as to represent
some aspect of the dynamics of the neural system generating the ERPs. For
example, let us suppose that one of the basis functions for potential v is a
damped cosine as a function of time t (Fig. 8)
![]()
where v0 is the amplitude in microvolts, w is
the frequency in radians/sec (2π times the frequency in Hz), f is phase in radians, and a is decay
rate in 1/sec (the time constant is 1/a msec). The generic differential
equation for the damped cosine is
![]()
where v (with dot above) = dv/dt, I is the magnitude of an input impulse ∂(t)
including a scaling parameter, and b1 and b2
are coefficients
that are interpreted in a worked example below.
It is not sufficient to
state the obvious that an 'oscillator' exists in the neural system; we would
like to determine whether it is within or between neurons. In order to learn
this we must set up two hypotheses in the form of models, one to describe an
oscillation of non-interactive neurons, the other to describe the oscillation among
sets of interactive neurons. We must use each model to make predictions about
ERPs and PSTHs in order to validate or reject the hypotheses. We know on the
one hand that under restricted, controlled conditions an isolated nerve can
generate a damped cosine fluctuation of membrane potential in response to a
brief shock (as distinct from a periodic pulse train). The Hodgkin-Huxley
equations admit to corresponding solutions, but they are far from robust. On
the other hand we know that cortical excitatory neurons that are excited by an
afferent volley, in turn excite inhibitory interneurons, and thereafter receive
synaptic inhibition. A typical diphasic cortical evoked potential manifests an
extracellular compound EPSP followed by a compound IPSP, both generated by the
excitatory neurons, the IPSP mediated by the interneurons. This phenomenon is
quite robust. Our aim is to devise a model to simulate a hypothetical process
in which the diphasic ERP manifests the impulse response of a neural negative
feedback loop. That is, the feedback interaction within masses of excitatory
and inhibitory neurons is described by a linear differential equation, and the
solution of the equation with impulse input must give us a curve that closely
fits the ERP.
The best way to analyze
the neural mass is to open the loop and measure separately the passive membrane
time constant ae of the excitatory cell mass and that
of the inhibitory cell mass ai.
The loop is opened by inducing very deep anesthesia to suppress the EEG and
render the ERP non-oscillatory. What we observe is the compound EPSP of the
excitatory cells without the following IPSP. With further search we then find a
means to deliver an electrical stimulus to excitatory axons ending synaptically
on the inhibitory interneurons to elicit the interneuronal compound EPSP (not
the IPSP). We fit curves to these ERPs in the same manner as for other ERPs
(Appendix A).
For illustrative
purposes we begin with a first order approximation; we ignore the rise time and
consider only the decay. The potential Ve as a function of time for the EPSP of the excitatory mass is
![]()
and that for the inhibitory mass vi is
![]()
where as before I is the magnitude
of an input pulse, ae and ai are rate coefficients measured by regression, and ve and vi represent the time courses of the active states of the two
neural masses. These are known as the state variables of the two neural masses
making up the hypothetical neural system being considered (Freeman 1975, pp.
34-36).
Experimentally we find
that the rate constants of the two masses of cells are equal, so we set a = ae = ai, and the differential equations in the open loop state
become

For the waking 'closed loop' state
the inhibitory mass receives its input not from a stimulus directly but from
the output of the excitatory mass.
![]()
where ke is a forward gain coefficient
introduced to express the strength of action of the excitatory mass onto the
inhibitory mass. The excitatory mass receives both the initiating pulse input
and the output of the inhibitory mass.
![]()
where ki represents the strength of action of
the inhibitory mass onto the excitatory mass, and the sign (-) denotes the fact
that the action is inhibitory. We combine Eqs. 7 and 8 to find an expression
for each of the two state variables. We do this by taking the derivative with
respect to time of both sides of Eq. 7, solving Eq. 8 for ve, and using Eq. 7 to define Ve
![]()
and similarly for v, where Kn = kekI is defined as the negative feedback
gain. Eq. 9 should be compared with Eq. 2 in order to see the way in which the
coefficients b1 and b2 have been evaluated.

The two roots of Eq. 9 by the
quadratic equation are

so the solution for the inhibitory
state variable is
![]()
By using Euler's formula,
![]()