How to iterate over every [:2] overlapping characters in a string of DNA code?

Question 1

Just leave out the ,2 in your range, and make sure to stop one character short of the end of your string:

dna = 'GAAGG'  # example sequence; avoids shadowing the built-in input()
for i in range(0, len(dna) - 1):
    print(dna[i:i+2])

The ,2 tells Python to step forward two on every iteration. By leaving it out, you default to stepping forward one.
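
To see the difference the step makes, here is a small sketch. The sequence and the helper name slices are only illustrative and are not taken from the question:

```python
dna = 'GAAGG'  # example sequence (assumed for illustration)

def slices(seq, step):
    """Return the two-character slices of seq taken every `step` positions."""
    return [seq[i:i+2] for i in range(0, len(seq) - 1, step)]

print(slices(dna, 2))  # step of two, no overlap: ['GA', 'AG']
print(slices(dna, 1))  # step of one, overlapping: ['GA', 'AA', 'AG', 'GG']
```

Passing 1 explicitly is the same as leaving the step out.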

Question 2

Forget playing with range and index arithmetic; iterating over pairs is exactly what zip is for:

>>> dna = 'GAAGG'
>>> for bigram in zip(dna, dna[1:]):
...    print(bigram)
... 
('G', 'A')
('A', 'A')
('A', 'G')
('G', 'G')

If you have the corresponding likelihoods stored in a dictionary, like so:

likelihood = {
   'GA': 1, 
   'AA': 2,
   'AG': .7, 
   'GG': .5
}

then you can sum them quite easily with the unsurprisingly named sum:

>>> sum(likelihood[''.join(bigram)] for bigram in zip(dna,dna[1:]))
4.2

Question 3

I'd use the pairwise function from more_itertools.
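
If installing a package is not an option, the same idea can be sketched with the standard library alone. The function below follows the well-known pairwise recipe from the itertools documentation (Python 3.10 and later also provide itertools.pairwise directly). The example sequence is made up:

```python
from itertools import tee

def pairwise(iterable):
    """Yield overlapping pairs: s -> (s0, s1), (s1, s2), (s2, s3), ..."""
    a, b = tee(iterable)
    next(b, None)  # advance the second copy by one item
    return zip(a, b)

dna = 'GAAGG'  # example sequence (assumed for illustration)
bigrams = [''.join(pair) for pair in pairwise(dna)]
print(bigrams)  # ['GA', 'AA', 'AG', 'GG']
```

Unlike slicing with dna[1:], this also works on iterables that cannot be sliced, such as a file read character by character.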

Question 4

The other answer should do it.

If you really want an iterator:

# define the generator function (range is already lazy in Python 3)
def dnaiter(seq):
    for i in range(0, len(seq) - 1):
        yield seq[i:i+2]

# then loop over the iterator it returns
dna = 'GAAGG'  # example sequence
for s in dnaiter(dna):
    print(s)

You'll only ever need this if you have a really long sequence that you're iterating over, though.
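
The reason it matters for long sequences is that a generator produces one pair at a time instead of building the whole list first. A rough sketch, assuming Python 3 and a made-up sequence named long_dna:

```python
from itertools import islice

def dnaiter(seq):
    """Lazily yield each overlapping two-character slice of seq."""
    for i in range(len(seq) - 1):
        yield seq[i:i+2]

long_dna = 'GATTACA' * 100000  # stand-in for a very long sequence
# Only the first three pairs are ever computed here.
first_three = list(islice(dnaiter(long_dna), 3))
print(first_three)  # ['GA', 'AT', 'TT']
```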

Question 5

I wrote a small utility library with a function named paired, which does almost exactly what you want. The library is available here.

import iterlib

sequence = 'GAAGG'
bigrams = [''.join(bigram_tuple) for bigram_tuple in iterlib.paired(sequence)]

print(bigrams)