Using sed to copy data between two numerical patterns to a new file

Question 1

How about using awk

awk '$1=="1"&&$2=="1"{t=1};t;$1=="33"&&$2=="33"{t=0}' file

Recommand by @mklement0, if there is only one block, to avoid processing the remainder of the file you can update the command to:

awk '$1=="1"&&$2=="1"{t=1};t;$1=="33"&&$2=="33"{exit}' file

Question 2

Your problem is twofold. First, there are two blanks between the ones, but your regex only allows for one (judging from the now indented code). Second, you are probably not precise enough; the /1 1/ pattern matches 11 11, for example, and 111 111 and so on.

So, you should consider:

sed -n -e '/^ *1  *1 /,/^33  *33 /p' -e '/^33 33 /q' input.file > output.file

The patterns are anchored to the start of line by the ^ (caret). The numbers are separated by one or more blanks (there are other, longer-winded ways of writing that in standard sed; the + option is not standard sed but is widely available). And the numbers are terminated by a blank. The chances are that the first expression alone will give you what you want. The second expression terminates the search early when it recognizes the 33 33 input line, which can save a significant amount of file I/O and hence processing time if the input file is big enough.

If the lines with ID numbers in the hundreds have some different format, then it should be fairly straight-forward to tweak the regexes to match what is used. If the data contains tabs instead of (or as well as) blanks, you can tweak the regexes to manage that, too.

Question 3

If you data is all formatted exactly the same as this file, then you can use sed to just read the 3rd through the 35th line (rows 1 1 - 33 33). This is a lot easier than parsing the values, but does require that the files have a standard format:

sed -n 3,35p data.txt

Another cheap way would be to grep for only numeric lines, and take only the first 33:

grep "^[0-9 ][0-9 .-]*$" data.txt | head -n 33