Python file i/o

Question 1

The following code will do what you want:

import re
a = {}
with open('input.txt', 'rb') as f:
    for line in f:
        x = re.search(r'<([^,]+),\s?([^>]+)>', line)
        x,y = float(x.group(1)), float(x.group(2))
        if x in a:
            a[x].append(y)
        else:
            a[x] = [y]

for key in a:
    a[key] = sum(a[key])/len(a[key])

print a

with open('output.txt', 'wb') as f:
    for i,j in a.items():
        f.write('<'+str(i)+', '+str(j)+'>\n')

[input.txt]
<122, 5>
<185, 5>
<122,4.5>

[output.txt]
<122, 4.75>
<185, 5>

Question 2

The line "if x not in movielist" will be true for the first and second line. For the first line, when you read all the lines in the second loop, "if n_obj.group(1)==x" will be true for the first and third lines (if 122 == 122). So the line "fo.write(final)" will be executed twice. In the entire run of the program, "fo.write(final)" will be executed three times, so you will get three lines of output.

At least that explains why you get three lines instead of the expected two lines.

Question 3

Thanks to Mark Lutton I edited the "subline" loop with the following condition

for subline in lines:
        n_obj = re.search(r"<(\S+), (\S+)>", subline)
        if subline == ln:
            ratinglist.append(float(n_obj.group(2)))
        elif n_obj.group(1)==x:
            ratinglist.append(float(n_obj.group(2)))
            av= (float(sum(ratinglist))/float(len(ratinglist)))
            final= "<%s, %.2f>\n" %(n_obj.group(1), av)                
            fo.write(final)

Question 4

Your code is indented so that 'if n_obj.group(1)==x:' and associated write to 'fo' gets executed for each line in lines so that there would be a record in output file corresponding to each input record which is not what its supposed to do.

The 'if' block should be changed so that the "average is written outside the loop" but check for movie_id is still within the loop. Currently, you are writing average for each subline in lines.

Just change the code and indentation accordingly.