Estimating rate of occurrence of an event with exponential smoothing and irregular events

Question 1

First, if you assume that the occurrence rate of the events itself is constant (or that you're only interested in its long-term average), then you can simply estimate it as:

λ* = N / (t − t₀)

where t is the current time, t₀ is the start of observations, N is the number of events observed since t₀ and λ* is the estimate of the true frequency λ.

At this point, it's useful to note that the estimation formula given above may be reformulated as the integral:

λ* = integral( δ_event(τ) dτ ) / integral( 1 dτ )

where the variable of integration τ ranges from t₀ to t, and δ_event(τ) = sum( δ(τ − t_i), i = 1 .. N ) is a sum N of Dirac delta functions, with a single delta-peak at the occurrence time t_i of each event i.

Of course, this would be a completely useless way to calculate λ*, but it turns out to be a conceptually useful formulation. Basically, the way to view this formula is that the function δ_event(τ) measures the instantaneous rate at which the number of events increases at time τ, while the second integrand, which is just the constant 1, measures the rate at which time increases over time (which, of course, is simply one second per second).

OK, but what if the frequency λ itself may change over time, and you want to estimate its current value, or at least its average over a recent period?

Using the ratio-of-integrals formulation given above, we can obtain such an estimate simply by weighing both integrands by some weighing function w(τ) which is biased towards recent times:

λ*_recent = integral( δ_event(τ) w(τ) dτ ) / integral( w(τ) dτ )

Now, all that remains is to pick a reasonable w(τ) such that these integrals simplify to something easy to calculate. As it turns out, if we choose an exponentially decaying weighing function of the form w(τ) = exp(k(τ − t)) for some decay rate k, the integrals simplify to:

λ*_recent = sum( exp(k(t_i − t)), i = 0 .. N ) k / ( 1 − exp(k(t₀ − t)) )

In the limit as t₀ → −∞ (i.e., in practice, when the total observation time (t − t₀) is much larger than the weight decay timescale 1/k), this further simplifies to just:

λ*_recent = k sum( exp(k(t_i − t)), i = 0 .. N )

Alas, naïvely applying this formula would still require us to remember all the event times t_i. However, we can use the same trick as for calculating usual exponentially weighted averages — given the weighted average event rate λ*_recent(t') at some earlier time t', and assuming that no new events have occurred between t' and t, we can calculate the current weighted average event rate λ*_recent(t) simply as:

λ*_recent(t) = exp( k(t' − t) ) λ*_recent(t')

Further, if we now observe a new event occurring at exactly time t, the weighted average event rate just after the event becomes:

λ*_recent(t) = k + exp( k(t' − t) ) λ*_recent(t')

Thus, we get a very simple rule: all we need to store is the time t_last of the previous observed event, and the estimated recent event rate λ*_last just after said event. (We may initialize these e.g. to t_last = t₀ and λ*_last = 0; in fact, with λ*_last = 0, the value of t_last makes no difference, although for non-zero λ*_last it does.)

Whenever a new event occurs (at time t_new), we update these values as:

λ*_last ← k + exp( k(t_last − t_new) ) λ*_last
t_last ← t_new

and whenever we wish to know the recent event rate average at the current time t, we simply calculate it as:

λ*(t) = exp( k(t_last − t) ) λ*_last

Ps. To correct for the initial bias towards the (arbitrary) initial value of t_last, we can add back the 1 / ( 1 − exp(k(t₀ − t)) ) correction term that we simplified out earlier when we assumed that t ≫ t₀. To do that, simply start from t_last = 0 at t = t₀, update t_last as above, but calculate the estimated recent event rate average at time t as:

λ*_corr(t) = exp( k(t_last − t) ) λ*_last / ( 1 − exp(k(t₀ − t)) )

(Here, t₀ denotes the time at which you start measuring events, not the occurrence of the first event.)

This will eliminate the initial bias towards zero, at the cost of increasing the early variance. Here's an example plot showing the effects of the correction, for k = 0.1 and a true mean event rate of 2:

Plot of λ* over time, with or without initial bias correction
The red line shows λ*(t) without the initial bias correction (starting from λ*(t₀) = 0), while the green line shows the bias-corrected estimate λ*_corr(t).

Pps. As the plot above shows, λ*, as calculated above, will not a be continuous function of time: it jumps up by k whenever an event occurs, and decays exponentially towards zero when events do not occur.

If you'd prefer a smoother estimate, you can calculate an exponentially decaying average of λ* itself:

λ**(t) = integral( λ*(τ) exp(k₂(τ − t)) dτ ) / integral( exp(k₂(τ − t)) dτ )

where λ* is the exponentially decaying average event rate as calculated above, k₂ is the decay rate for the second average, and the integrals are over −∞ < τ ≤ t.

This integral can also be calculated by a step-wise update rule as above:

λ**_last ← W(Δt) λ*_last + exp( −k₂ Δt ) λ**_last
λ*_last ← k₁ + exp( −k₁ Δt ) λ*_last
t_last ← t_new

where k₁ and k₂ are the decay rates for the first and second averages, Δt = t_new − t_last is the elapsed time between the events, and:

W(Δt) = k₂ ( exp( −k₂ Δt ) − exp( −k₁ Δt ) ) / (k₁ − k₂)

if k₁ ≠ k₂, or

W(Δt) = k Δt exp( −k Δt )

if k₁ = k₂ = k (the latter expression arising from the former as the limit when (k₁ − k₂) → 0).

To calculate the second average for an arbitrary point in time t, use the same formula:

λ**(t) = W(Δt) λ*_last + exp( −k₂ Δt ) λ**_last

except with Δt = t − t_last.

As above, this estimate can also be bias-corrected by applying a suitable time-dependent scaling factor:

λ**_corr(t) = λ**(t) / (1 - S(t − t₀))

where:

S(Δt) = ( k₁ exp( −k₂ Δt ) − k₂ exp( −k₁ Δt ) ) / (k₁ − k₂)

if k₁ ≠ k₂, or

S(Δt) = (1 + k Δt) exp( −k Δt )

if k₁ = k₂ = k.

The plot below shows the effects of this smoothing. The red and green lines show λ*(t) and λ*_corr(t) as above, while the yellow and blue lines show λ**(t) and λ**_corr(t), as calculated with k₁ = 0.1 (as above) and k₂ = 0.2:

Plot of λ* and λ** over time, with or without initial bias correction

Question 2

You could try this:

Keep an estimator zn so that at each event:

z_n = (z_n-1+κ).e^{-κ.(t_n-t_n-1)}

This will converge towards the event rate in s^-1. A sligtly better estimator is then (as there is still an error/noise related if you compute the estimate just before or just after an event) :

w_n = z_n.e^-κ/(2.z_n)

In your example it will converge to 2s^-1 (the inverse of 500ms)

The constant κ is responsible for the smoothing and is in s^-1. Small values will smooth more. If your event rate is roughly of seconds, a value of 0.01s-1 for κ is a good start.

This method has a starting bias, and z₀ could be set to an estimate of the value for faster convergence. Small values of κ will keep the bias longer.

There are much more powerful ways of analyzing poisson-like distributions, but they often require large buffers. Frequency analysis such as Fourier transform is one.