Python trimming a non-standard segment in a string

Question 1

As it is part of fasta file, so you are going to slice it like this:

>>> import re
>>> a = "TCGATCATCGATCG>IonTorrenttrimmedcontig1$CCGTAGGTGAACCTGCGGAAG"
>>> re.split(">[^$]*\$", a)
['TCGATCATCGATCG', 'CCGTAGGTGAACCTGCGGAAG']

Also, some people are answering with slicing with '>ion1'. That's totally wrong!

I believe your problem is solved! I am also editing a tag with bioinformatics for this question!

Question 2

I would use the re module for that:

>>> s = "blablabla>ion1$foobar>ion2$etc>ion3$..."
>>> import re
>>> re.split(">[^$]*\$",s)
['blablabla', 'foobar', 'etc', '...']

And if you have 1 string on each line:

>>> with open("foo.txt", "r") as f:
...   for line in f:
...     re.split(">[^$]*\$",line[:-1])
... 
['blablabla', 'foobar', 'etc', '...']
['fofofofofo', 'barbarbar', 'blablabla']

Question 3

If you are reading over every line there a few ways to do this. You could use partition (partition returns a list containing 3 elements: [the text before the specified string, the specified string, and the text after]):

for line in file:
    stripped_header = line.partition(">")[2].partition("$")[0]

You could use split:

for line in file:
    stripped_header = line.spilt(">")[1].split("$")[0]

You could loop over all the elements in the string and only append after you pass ">" but before "$" (however this will not be nearly as efficient):

for line in file:
    bool = False
    stripped_header = ""
    for char in line:
        if char == ">":
            bool = True
        elif bool:
            if char != "$":
                stripped_header += char
            else:
                bool = False

Or alternatively use a regular expression, but it seems like my peers have already beat me to it!