Converting a list of events into a series of the number of events every two minutes

Question 1

How about

as.data.frame(table(cut(x, breaks=c(y, Inf))))

                 Var1 Freq
1 2013-06-20 01:00:00    3
2 2013-06-20 01:02:00    0
3 2013-06-20 01:04:00    2
4 2013-06-20 01:06:00    0

Question 2

Here is a function that solves the problem, and runs much faster than table(cut(...)):

get.bin.counts = function(x, name.x = "x", start.pt, end.pt, bin.width){
  br.pts = seq(start.pt, end.pt, bin.width)
  x = x[(x >= start.pt)&(x <= end.pt)]
  counts = hist(x, breaks = br.pts, plot = FALSE)$counts
  dfm = data.frame(br.pts[-length(br.pts)], counts)
  names(dfm) = c(name.x, "freq")
  return(dfm)
}

The key line here is in the middle -- counts = hist(.... The hist function with the plotting option set to FALSE does the crucial thing.

To test the speed performance of this function, I ran it as follows:

# First define x, a large vector of times:    
start.time = as.POSIXct("2012-11-01 00:00:00")
x = start.time + runif(50000, min = 0, max = 365*24*3600)
x = x[order(x)]
# Apply the function, keeping track of running time:
t1 = Sys.time()
dfm = get.bin.counts(x, name.x = "time", 
                     start.pt = as.POSIXct("2012-11-01 00:00:00"),
                     end.pt = as.POSIXct("2013-07-01 00:00:00"), 
                     bin.width = 60)
as.numeric(Sys.time()-t1) #prints elapsed time

With this example, my function ran faster than table(cut(...)) by a little more than a factor of 10. Credit is due to the cut help page, which states, "Instead of table(cut(x, br)), hist(x, br, plot = FALSE) is more efficient and less memory hungry."