In [1]: import re
In [2]: s=""" LAST_NAME_1, Firs_name_1 Home Phone: 333-336-6514
Status: Member Marital: Married Adult: Y M/F: M Env.No.: 5""" ...: 192 generic St.
...: Newton MA 02471
...: Status: Attender Marital: Married Adult: M/F: Env.No.:
...:
...:
...: LAST_NAME_2, Firs_name_2 Home Phone: 777-777-2205 Cell Phone: 888-888-8888
...: 10 generic St.
...: Newton MA 02471
...:
...: E-mail : email@gmail.com
...: Status: Member Marital: Married Adult: Y M/F: M Env.No.: 5"""
In [3]:
In [4]: re.findall('[0-9]{3}-[0-9]{3}-[0-9]{4}\n(.*)', s, re.MULTILINE)
Out[4]: ['192 generic St. ', '10 generic St. ']
NODE EXPLANATION
-----------------------------------------------------
[0-9]{3} any character of: '0' to '9' (3 times)
-----------------------------------------------------
- '-'
-----------------------------------------------------
[0-9]{3} any character of: '0' to '9' (3 times)
-----------------------------------------------------
- '-'
-----------------------------------------------------
[0-9]{4} any character of: '0' to '9' (4 times)
-----------------------------------------------------
\n '\n' (newline)
-----------------------------------------------------
( group and capture to \1:
-----------------------------------------------------
.* any character except \n (0 or more times
(matching the most amount possible))
------------------------------------------------------
) end of \1