Slice dataframe by all rows corresponding to a country, then sample that vector

Question 1

All the acrobatics with vectors of indices are unnecessary.

Logical indexing and subsetting are really all you need, applied to a new 'country' field (a factor) that you add to your data. (Maybe also plyr::ddply if you want to get really fancy.)

All you want to do is allow the user to:

Choose a country from a list (by selecting its number, 2-letter abbrev, whatever)...
... then sample in your dataset from within that country. That's all!

dat$country <- NA  # insert a new column, initialize to NA for pessimism, to catch omissions
dat$country[1:1043]    <- 'Belgium'
dat$country[2044:3061] <- 'Bulgaria'
dat$country[8423:8922] <- 'Czech Rep'
...
# Now make country a factor instead of character
dat$country <- as.factor(dat$country)

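# Now you can pick out one country's rows using either logical indexing...
bg <- dat[dat$country=='Bulgaria',]
# ...or subsetting
bg <- subset(dat,country=='Bulgaria')
# ...and then sample() row numbers within that country
# (calling sample() on the data frame itself would sample its columns, not its rows)
bg[sample(nrow(bg), ...),]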
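
As a self-contained sketch of the idea, the toy example below labels rows by range and then draws random rows from one country. The data frame, its `id` column, the row ranges, and the name `bgSample` are all made up for illustration; the sampling is done on row numbers, which are then used to index the data frame.

```r
set.seed(42)  # only so the illustration is reproducible

# Hypothetical toy data: 10 rows, the first 4 Belgian, the rest Bulgarian
dat <- data.frame(id = 1:10)
dat$country <- NA
dat$country[1:4]  <- "Belgium"
dat$country[5:10] <- "Bulgaria"
dat$country <- as.factor(dat$country)

# Keep only the rows for one country...
bg <- subset(dat, country == "Bulgaria")

# ...then sample 3 row numbers and use them to pick rows
bgSample <- bg[sample(nrow(bg), 3), ]
print(bgSample)
```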

Question 2

I would summarize your code as:

If sampleType is TRUE, then draw a sample of size sampleSize from the indices corresponding to each country in sampleCountries, and return all these sampled indices together.
If sampleType is FALSE, then group the indices corresponding to all the countries in sampleCountries together and draw a single sample of size sampleSize.

Let's set up some sample parameters:

sampleCountries <- c("BE", "WG")
sampleSize <- 20
sampleType <- FALSE

The first step is to build a vector of the country for each index:

countries <- c(rep("BE", 1043), rep("DM", 1000), rep("WG", 1018), rep("GR", 1003),
              rep("IT", 1021), rep("SP", 1021), rep("FR", 1008), rep("IR", 1000),
              rep("NI", 308), rep("LX", 500), rep("NL", 1022), rep("PT", 1000),
              rep("GB", 1066), rep("EG", 1014))
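
The same vector can also be built with a single `rep()` call by passing the codes and the per-country counts separately. This is only an alternative sketch: the names `countryCodes`, `countryCounts`, and `countries2` are introduced here for illustration, and the counts are the ones listed above.

```r
# Country codes and the number of rows belonging to each, in file order
countryCodes  <- c("BE", "DM", "WG", "GR", "IT", "SP", "FR",
                   "IR", "NI", "LX", "NL", "PT", "GB", "EG")
countryCounts <- c(1043, 1000, 1018, 1003, 1021, 1021, 1008,
                   1000, 308, 500, 1022, 1000, 1066, 1014)

# rep() with a vector `times` repeats each code by its own count
countries2 <- rep(countryCodes, times = countryCounts)

length(countries2)               # total number of rows
range(which(countries2 == "LX")) # first and last row index for one country
```

Keeping the counts in their own vector makes it easier to check them against the data, for example by comparing `sum(countryCounts)` with the number of rows.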

Next, when "ALL" is in sampleCountries, you want to behave as if every country were selected:

if ("ALL" %in% sampleCountries) {
  sampleCountries <- unique(countries)
}

Finally, draw your samples:

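if (sampleType) {
  # One sample of size sampleSize per country, concatenated into one vector
  personIndices <- unlist(lapply(sampleCountries, function(x) {
    sample(which(countries == x), sampleSize, replace=FALSE)
  }))
} else {
  # A single sample of size sampleSize from the pooled indices
  personIndices <- sample(which(countries %in% sampleCountries), sampleSize,
                          replace=FALSE)
}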

In the first part of the if statement, which(countries == x) gets the indices belonging to country x, sample draws sampleSize of them, and lapply repeats this for each country in your vector sampleCountries. Finally, unlist flattens the list returned by lapply into a single vector.

In the second part of the if statement, which(countries %in% sampleCountries) gets the indices of every country in sampleCountries, and a single sample of size sampleSize is drawn from that pooled set.
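
If you want to reuse this logic, the steps can be wrapped in a function. The following is a minimal sketch rather than a definitive implementation: the function name `sampleIndices` and the small `toyCountries` vector are hypothetical, chosen only to keep the example self-contained.

```r
# Hypothetical helper wrapping the steps described above
sampleIndices <- function(countries, sampleCountries, sampleSize, sampleType) {
  # "ALL" means every country present in the data
  if ("ALL" %in% sampleCountries) {
    sampleCountries <- unique(countries)
  }
  if (sampleType) {
    # One sample of size sampleSize per country
    unlist(lapply(sampleCountries, function(x) {
      sample(which(countries == x), sampleSize, replace = FALSE)
    }))
  } else {
    # One sample of size sampleSize from the pooled indices
    sample(which(countries %in% sampleCountries), sampleSize, replace = FALSE)
  }
}

# Toy data (made up): 50 "BE" rows, 40 "DM" rows, 60 "WG" rows
toyCountries <- c(rep("BE", 50), rep("DM", 40), rep("WG", 60))

perCountry <- sampleIndices(toyCountries, c("BE", "WG"), 20, TRUE)
pooled     <- sampleIndices(toyCountries, c("BE", "WG"), 20, FALSE)

table(toyCountries[perCountry])  # 20 indices from each selected country
length(pooled)                   # 20 indices in total
```

One caveat: sample() treats a single number n as 1:n, so a country with exactly one row would need special handling before being passed to it.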