Implementing 2D slicing in Python

Question

You pretty much have to do something like this… but at least you can remove some duplication.

First, it's probably reasonable to consider [1,] to mean "row 1", just like [1]. (numpy does this.) That means you don't need the tuple-vs.-int thing; just treat an int as a 1-element tuple. In other words:

def __getitem__(self, idx):
    if isinstance(idx, numbers.Integral):
        idx = (idx, slice(None, None, None))
    # now the rest of your code only needs to handle tuples

Second, although your sample code only handles the case of two slices, your real code has to handle two slices, or a slice and an int, or an int and a slice, or two ints, or a slice, or an int. If you can factor out the slice-handling code, you don't need to duplicate it over and over again.

One trick for handling int-vs.-slice is to treat [n] as a wrapper that does, in essence, [n:n+1][0], which lets you reduce everything even more. (It's a tiny bit trickier than this, because you have to special-case either negative numbers in general, or just -1, because obviously n[-1] != n[-1:0][0].) For 1-D arrays this may not be worth it, but for 2D arrays it probably is, because it means while you're dealing with the column, you've always got a list of rows rather than just a row.

On the other hand, you may want to share some code between __getitem__ and __setitem__… which makes some of these tricks either impossible or a lot harder. So, there's a tradeoff.

At any rate, here's an example that does all the simplification and pre/postprocessing I could think of (possibly more than you want) so that ultimately you're always looking up a pair of slices:

class Matrix(object):
    def __init__(self):
        self.m = [[row + col/10. for col in range(4)] for row in range(4)]
    def __getitem__(self, idx):
        if isinstance(idx, (numbers.Integral, slice)):
            idx = (idx, slice(None, None, None))
        elif len(idx) == 1:
            idx = (idx[0], slice(None, None, None))
        rowidx, colidx = idx
        rowslice, colslice = True, True
        if isinstance(rowidx, numbers.Integral):
            rowidx, rowslice = slice(rowidx, rowidx+1), False
        if isinstance(colidx, numbers.Integral):
            colidx, colslice = slice(colidx, colidx+1), False
        ret = self.m[rowidx][colidx]
        if not colslice:
            ret = [row[0] for row in ret]
        if not rowslice:
            ret = ret[0]
        return ret

Or it might be nicer if you refactored things along the other axis: Get the row(s), and then get the column(s) within it/them:

def _getrow(self, idx):
    return self.m[idx]

def __getitem__(self, idx):
    if isinstance(idx, (numbers.Integral, slice)):
        return self._getrow(idx)
    rowidx, colidx = idx
    if isinstance(rowidx, numbers.Integral):
        return self._getrow(rowidx)[colidx]
    else:
        return [row[colidx] for row in self._getrow(rowidx)]

This looks a whole lot simpler, but I'm cheating here by forwarding the second index to the normal list, which only works because my underlying storage is a list of lists. But if you have any kind of indexable row object to defer to (and it doesn't waste unacceptable time/space to create those objects unnecessarily), you can use the same cheat.

If you're objecting to the need to type-switch on the index parameter, yes, that does seem generally unpythonic, but unfortunately it's how __getitem__ generally works. If you want to use the usual EAFTP try logic, you can, but I don't think it's more readable when you have to try two different APIs (e.g., [0] for tuples, and .start for slices) in multiple places. You end up doing "duck-type-switching" up at the top, like this:

try:
    idx[0]
except AttributeError:
    idx = (idx, slice(None, None, None))

… and so on, and this is just twice as much code as normal type-switching without any of the usual benefits.