Using sapply to aggregate columns comma separated values

Question 1

Or a little more 'by hand', if your data frame is xx then a split endPoints into separate elements, figure out the lengths of each row

endPoints = strsplit(as.character(xx$endPoints), ",", fixed=TRUE)
startPoints = strsplit(as.character(xx$startPoints), ",", fixed=TRUE)
len = sapply(endPoints, length)

Use the lengths to expand the original data frame, unlisting previously compressed elements

yy = cbind(xx[rep(seq_len(nrow(xx)), len), c("id", "group")], 
              startPoints=as.integer(unlist(startPoints)), 
              endPoints=as.integer(unlist(endPoints)))

After that aggregate is your friend.

aggregate(endPoints - startPoints ~ group, yy, sum)

Question 2

This is not at all to do with sapply as you requested, but here's one approach using concat.split.multiple from my "splitstackshape" package.

First, split the data into a semi-long format:

library(splitstackshape)
mydf2 <- concat.split.multiple(mydf, split.cols = c("startPoints", "endPoints"), 
                               seps = ",", direction = "long")

Calculate the difference between your "endPoints" and "startPoints":

mydf2$diffs <- mydf2$endPoints - mydf2$startPoints
head(mydf2)
#   id group .id time startPoints endPoints diffs
# 1  1     A   1    1           4         8     4
# 2  1     A   2    1         120       231   111
# 3  1     B   1    1         500       550    50
# 4  1     B   2    1         650       700    50
# 5  1     C   1    1         830       850    20
# 6  1     A   1    2          20        25     5

Use aggregate (or data.table, or tapply, or your favorite aggregation function) to calculate whatever you want to.

aggregate(diffs ~ group, mydf2, sum)
#   group diffs
# 1     A   177
# 2     B   120
# 3     C    60