Extract treatment means from an lmer object and calculate error bars

Question 1

An alternative that uses functions in package languageR. I call your data set df.

library(lme4)
library(languageR)
library(ggplot2)

# fit model
# n.b. I don't claim that this is a sensible model
# It is just used to demonstrate the plot
mod <- lmer(DV ~ TMT1 * TMT2 + (1|Block), data = df)

# create MCMC matrix
mcmc <- pvals.fnc(mod, nsim = 1000, withMCMC = TRUE)
# pval.fnc also calculates MCMC-based p-values and HPD confidence intervals,
# and plot the posterior distributions of the parameters

# plot using plotLMER.fnc 
# in addition, set withList = TRUE to create a list of data frames with plot data
# which can be used for a (possibly prettier) plot in ggplot
ll <- plotLMER.fnc(mod, withList = TRUE, pred = "TMT1", 
               intr = list(
                 "TMT2",
                 c("C", "D"),
                 "end",
                 list(c("red",  "blue"), rep(1, 2))),
               addlines = TRUE,
               mcmcMat = mcmc$mcmc)

 # here follows additional steps to plot using ggplot 

 # convert list to data frame
 df <- do.call(rbind, ll$TMT1)

 # rename 
 names(df)[names(df) == "Levels"] <- "TMT1"

 # add TMT2
 df$TMT2 <- rep(c("C", "D"), each = 2)

# plot using ggplot
dodge <- position_dodge(width = 0.1)
ggplot(data = df, aes(x = TMT1, y = Y, col = TMT2, group = TMT2)) +
   geom_point(position = dodge, size = 3) +
   geom_errorbar(aes(ymax = upper, ymin = lower, width = 0.1), position = dodge) +
   geom_line(position = dodge) +
   ylab("DV") +
   theme_classic()

Question 2

I think what you are asking for is some form of predict(), for which there is no default method for class mer in lme4 (at least the version on CRAN). You can, however, use ez::ezPredict.

library(ez)
library(ggplot2)
to_predict <- expand.grid(TMT1=c("A","B"), TMT2=c("C","D"))
t_means <- rbind(ezPredict(m2011, to_predict=to_predict, boot=F), ezPredict(m2012, to_predict=to_predict, boot=F), ezPredict(m2013, to_predict=to_predict, boot=F) )
t_means$YEAR = rep(2011:2013, each = 4)
ggplot(t_means, aes(x=YEAR, y=value, color=TMT1:TMT2)) + geom_point() + geom_line()

This function has some additional features which may prove useful, like providing bootstrapped values.

If all you want are point estimates for the treatment means, it's just as easy to do the calculations by hand, especially as all three models have the same design matrix:

mm = unique(model.matrix(m2011))
Y_bar <- c(mm%*%fixef(m2011), mm%*%fixef(m2012), mm%*%fixef(m2013))
ggplot(t_means, aes(x=YEAR, y=Y_bar, color=TMT1:TMT2)) + geom_point() + geom_line()

I'm not exactly sure what you mean by "calculate the treatment means ... accounting for the levels of nesting in my experiment". The random effects in a mixed model are structured, normally-distributed deviations from the population-level effects (the fixed effects.) It may be instructive to look at the random effect estimates ranef(m2011) and the associated design matrix m2011@Zt.

So if all you want to plot are the population-level treatment means, you can simply work with the estimates of the fixed effects fixef(m2011) and the fixed-effect design matrix model.matrix(m2011) as above. If you would like to include some measure of uncertainty around the population-level predictions, or would like predictions for each block/plot/subplot, you will need to work with both the random and fixed effects. I suggest you begin by looking at http://glmm.wikidot.com/faq under the heading "Predictions and/or confidence (or prediction) intervals on predictions".

EDIT 8/26/2013:

You might consider bootMer() in the development version of lme4 for (parametric bootstrapped) confidence intervals around predictions, which should incorporate uncertainty in the variances of the random effects and will work with GLMMs (see for example this thread).

The idea is to simulate from the model of interest, refit with the simulated values, and calculate the statistic of interest from the refit model. You can go through the steps yourself, with simulate() and refit():

t_sim <- apply(simulate(model, 999), 2, function(x) combn(unique(model.matrix(model))%*%fixef(refit(model, x)), 2, diff) )

which generates 999 bootstrap reps of the pairwise differences between treatment means, that you could use quantile() on (or whatever flavor of bootstrap confidence interval you wish):

apply(t_sim, 1, function(.) quantile(., c(0.975, 0.025)))

Question 3

With lme4 now I think bootMer() is probably the best way to go because it accounts for uncertainty of all kinds in the model. However, for certain classes of problems bootMer() is not able to work because of how long each model fit may take. For these larger problems, there is an R package called merTools which provides a predictInterval method to use arm::sim to account for uncertainty in the fixed and random effects as well as the residual error of the model. It is fairly easy to use and much faster for providing predictions in cases where the model takes a long time to fit. It has good coverage of the prediction interval produced by bootMer() for problems where the variance in the relationship among the random effects is fairly well defined.

To use it you would simply go:

library(merTools)
preds <- predictInterval(m2011, newdata = myData, level = 0.95, n.sims = 1000)

There are several other user configurable options, but the result is a prediction object similar to that produced when requesting prediction intervals from lm -- a three column data.frame with columns fit, lwr, and upr.