I've been learning about generalized linear models and Bayesian approaches for doing statistics on spike train data, in the Truccolo lab. Here are some notes on the subject.
In neuroscience, we are interested in the problem of how neurons encode, process, and communicate information. Neurons communicate over long distances using brief all-or-nothing events called spikes. We are often interested in how the spiking rate of a neuron depends on other variables, such as stimuli, motor output, or other ongoing signals in the brain.
To model this, we consider spikes as events that occur at a point in time with an underlying variable rate, or conditional intensity,
Point-process generalized linear models
In point-process Generalized Linear Models (GLM) we estimate
For computational tractability we work with a discretized version of the point process. A point process can be converted into a series of non-negative integers by counting the number of events
If we choose
For an alternative derivation, consider the Poisson distribution, which defines the probability of observing
In the limit of
Since
Since
which we will denote, for convenience, as
For now, restrict analysis to a single covariate
Connection between GLM and Bayesian approach
We can also derive a model for conditional intensity using Bayes rule. We are interested in learning how
For computational efficiency we work with the natural logarithm of probability. This yields an expression that allows us to directly estimate conditional intensitiy in terms of the log probability density functions (PDF) of
The performance of this approach depends on accurate modeling of
While we can always approximate the above using a series expansion of
We show in the next section that distributions from the exponential family meet these conditions, allowing direct solution of the parameters to a log-linear model from data statistics. Choosing a parametric distribution for
Selecting nonlinear features for a GLM point process model
In the previous section we discussed a direct solution for the conditional intensity using Bayes rule, and how this solution might relate to log-linear models. Specific examples are given in this section. The general steps are as follows :
- Look at the marginal distribution of the covariate
, and the distribution conditioned on the presence of a spike . - Pick a parametric probability distribution family that fits the observed distributions well.
- Look up the log probability-density function.
- Write down the log-likelihood ratio in terms of the log PDF as in
. - Expand this function until it is of the form
where is a real valued parameter and is a function of the covariate. - The functions
are the nonlinear features of that you should include in a log-linear model, and the coefficients provide an approximate fit of that model.
Exponential data imply a log-linear model
If a covariate follows an exponential distribution, the Bayesian method provides parameters for a log-linear point process model. The probability density for an exponential distribution has one scale parameter
Collecting terms yields the the linear and constant terms in a log linear inhomogenous poisson process model:
Normally distributed data imply a log-quadratic model
If a covariate follows a Gaussian distribution, the Bayesian classifier provides parameters for a log-quadratic point process model
which gives the log-density
Substituting this into equation
Expanding the quadratic expressions and collecting terms yields the constant, linear, and quadratic terms for a log-quadratic inhomogeneous Poisson model:
If the covariate
Gamma distributed data imply nonlinear features
The Gamma PDF in terms of a shape
which gives the log PDF
Substituting this into equation
Collecting terms yields a fit for a model that, in addition to linear and constant terms, includes a
Von Mises imply sine and cosine nonlinear terms
There is also a GLM formulation for von Mises distributed data. This is left left as an exercize to reader. To derive it, move the preferred phase parameter out of the cosine using trigonometric identities. The nonlinearities implied for the GLM are
Generalization to the exponential family
The exponential family of distributions inclues all the distributions discussed so far in this section, and follows the canonical form
Where
This form is already suitable for mapping onto a log linear point process model. The term
From this, the coditional intensity in the general case where
If
The
Collecting terms
If
Relationships determined by fitting a GLM can be simpler than those implied by a Bayesian approach
Although solving directly for conditional intensity using Bayes rule can imply a log-linear model and its coefficients, the converse is not in general true. For example, the quadratic terms from a Gaussian model vanish when
In general, a purely log-linear GLM will be able to fit the data if the distributions
For a particular choice of features
Additionally, the GLM directly optimizes parameters that summarize the difference between
Nevertheless, inspecting the distributions
Kullback-Leibler divergence is an easily computed predictor model performance
Mutual information is a statistic used to summarize how related two variables are. Consider the problem of measuring the mutual information between a variable
Estimating the mutual information between a point process and an external variable reduces to estimating the Kullback-Leibler divergence of the conditional
Spikes are rare in a point process, so
The mutual information between
Let's explore this in the case that
If
If we assume
From this we see that the fraction of information about
Incidentally, this also suggests that fancy Bayesian and GLM models might not tell you that much more than the Spike Triggered Average (STA), in some cases.
Another curious observation which holds empirically, at least for low information variables examined so far, is the relationship
Where AUC is the area under the receiver operating characteristic ( ROC ) curve, and is used to summarize the accuracy of a point-process decoder. (I'm not sure whether this approximation is really valid or under what conditions it might hold)
Overall,
Models that fit the log-intensity of an inhomogeneous Poisson point process are related to likelihood-ratio based Bayesian classifiers. Solving for the conditional intensity using Bayes' rule, and distributions from the exponential family, is one way to find parameters for a point process model. The distribution family of
No comments:
Post a Comment