Extracting everything between two symbols in a string

Question 1

1) sub With sub

> sub(".*, ([^.]*)\\..*", "\\1", Name)
[1] "Mr"   "Mrs"  "Miss" "Mrs"  "Mr"   "Mr"

1a) sub variation This approach with gsub also works:

> sub(".*, |\\..*", "", Name)
[1] "Mr"   "Mrs"  "Miss" "Mrs"  "Mr"   "Mr"

2) strapplyc or using strapplyc in the gusbfn package it can be done with a simpler regular expression:

> library(gsubfn)
>
> strapplyc(Name, ", ([^.]*)\\.", simplify = TRUE)
[1] "Mr"   "Mrs"  "Miss" "Mrs"  "Mr"   "Mr"

2a) strapplyc variation This one seems to have the simplest regular expression of them all.

> library(gsubfn)
>
> sapply(strapplyc(Name, "\\w+"), "[", 2)
[1] "Mr"   "Mrs"  "Miss" "Mrs"  "Mr"   "Mr"

3) strsplit A third way is using strsplit

> sapply(strsplit(Name, ", |\\."), "[", 2)
[1] "Mr"   "Mrs"  "Miss" "Mrs"  "Mr"   "Mr"

Added additional solutions. Changed gsub to sub (although gsub works too).

Question 2

Not to note that there's anything lacking from G. Grothendieck's answer. I just want to add a solution using sub and non-greedy repetition:

vec <- c("Moran, Mr. James",
         "Rothschild, Mrs. Martin (Elizabeth L. Barrett)")

sub(".*, (.+?)\\..*", "\\1", vec)
# [1] "Mr"  "Mrs"

Another alternative with regexpr, regmatches, and lookbehind/lookahead:

regmatches(vec, regexpr("(?<=, ).+?(?=\\.)", vec, perl = TRUE))
# [1] "Mr"  "Mrs"