Recombining artificially split sessions in sales data based on multiple conditions

Question 1

If your available.spend is always zero in these cases, you can use that to group the rows (I'm assuming you sometimes have more than one of these 0's in a row, otherwise you could trivially just take actual.spend, shift it by 1 and sum back to compare):

dt[, list(session = session[1],
          available.spend = sum(available.spend),
          actual.spend = sum(actual.spend)),
     by = cumsum(available.spend != 0)]
#   cumsum session available.spend actual.spend
#1:      1       1              20           20
#2:      2       2              25           25
#3:      3       4              15           15
#4:      4       5              14           14
#5:      5       7              59           59
#6:      6       9              15           15
#7:      7      10              21           21

From this point on you should have all the info you need to proceed.

Perhaps, more generally, it would be better to group by cumsum(available.spend >= actual.spend).

Question 2

So not quite the same result data.frame you want. I am using cumsum (Cumulative Sum) on available spend and actual spend. Then I check which ones are match up, and only for those that matchup I put "1" in the new.session column.

mydt$spend.sum <-cumsum(mydt$actual.spend) #Cumulative sum of actual 
mydt$avail.sum <-cumsum(mydt$available.spend) #Cumulative sum of actual

now make a new column and make it all NA's

mydt$new.session <-NA

Check which cumulative sums match up and replace NA's with 1's

mydt$new.session[with(mydt, which(spend.sum == avail.sum))]<-1

If you only want data.frame with the 1's in the new.session column

do this

mydt[complete.cases(my.dt),]

Question 3

This is kind of a clunky solution, but given the narrow parameters and desired outcome, I can't think of a better way to do this except piece-by-piece.

mismatches <- mydt[available.spend != actual.spend, which=TRUE]
zeros <- mydt[available.spend == 0, which=TRUE]
x <- setdiff(mismatches, zeros)
followcheck <- mydt[x+1, session == mydt[zeros, session] & actual.spend > 0]
following.zeros <- zeros[followcheck]
sumthing <- mydt[x, available.spend==actual.spend + mydt[following.zeros, actual.spend]]
x <- x[sumthing]
y <- x + 1
mydt[x, actual.spend:=actual.spend + mydt[y, actual.spend]]
# Caution here, data.table gave a warning about needing to copy the table in memory to do this next line.
mydt[, newsess:=0]
mydt[x, newsess:=1]
mydt <- mydt[-y,]