Formatting list data by table

Question 1

Pandas is ideal for this kind of tasks:

Read your csv:

>>> import pandas as pd

>>> df = pd.read_csv('data.csv', sep=',', header=None, names=['datatable', 'col'])
>>> df.head()
     datatable  col
0    DatatableA  1
1    DatatableA  2
2    DatatableA  3
3    DatatableA  4
4    DatatableA  5

Group, select and replace max:

def replace_letter(group):
    letters = group.isin(['T', 'Q'])              # select letters
    group[letters] = int(group[~letters].max()) + 1  # replace by next max
    return group


>>> df['col'] = df.groupby('datatable').transform(replace_letter)
>>> df

     datatable   col
0    DatatableA  1
1    DatatableA  2
2    DatatableA  3
3    DatatableA  4
4    DatatableA  5
5    DatatableB  1
6    DatatableB  6
7    DatatableB  7
8    DatatableB  3
9    DatatableB  4
10   DatatableB  5
11   DatatableB  2
12   DatatableC  3
13   DatatableC  4
14   DatatableC  2
15   DatatableC  1
16   DatatableC  6
17   DatatableC  5
18   DatatableC  6

Write to csv:

df.to_csv('result.csv', index=None, header=None)

Question 2

I suppose I have to answer the question asked my by own alter-ego. Seriously, does StackExchange not sanitize usernames?

Here's a solution, not guaranteeing that it's efficient or simple, but the logic is pretty simple. First you iterate your dataset and check for anything that's not an integer string and record the largest value. Then you iterate again and replace non-integer strings.

I am using StringIO as a replacement for a file just for convenience sake.

import csv
import string
from StringIO import StringIO


raw = """DatatableA,1
DatatableA,2
DatatableA,3
DatatableA,4
DatatableA,5
DatatableB,1
DatatableB,6
DatatableB,T
DatatableB,3
DatatableB,4
DatatableB,5
DatatableB,2
DatatableC,3
DatatableC,4
DatatableC,2
DatatableC,1
DatatableC,Q
DatatableC,5
DatatableC,T"""

fp = StringIO()
fp.write(raw)
fp.seek(0)

reader = csv.reader(fp)

data = []
mapping = {}
for row in reader:
    if row[0] not in mapping:
        mapping[row[0]] = float("-inf")
    if row[1] in string.digits:
        x = int(row[1])
        if x > mapping[row[0]]:
            mapping[row[0]] = x
    data.append(row)

for i, row in enumerate(data):
    if row[1] not in string.digits:
        mapping[row[0]] += 1
        row[1] = str(mapping[row[0]])

fp.close()
fp = StringIO()
writer = csv.writer(fp)
writer.writerows(data)

print fp.getvalue()

Formatting list data by table

Edit in reply to elyase

Read your csv:

Group, select and replace max:

Write to csv: