iSeries query changes selected RRN of subquery result rows

Question 1

I find it hard to believe that querying a table of a mere 3 million rows, even when joined with something else, should cause an out-of-memory condition, so in my view you should address this issue first (or have it addressed).

As for your question of why the RRNs end up different, I'll take the liberty of quoting the manual:

If the argument identifies a view, common table expression, or nested table expression derived from more than one base table, the function returns the relative record number of the first table in the outer subselect of the view, common table expression, or nested table expression.

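A construct of the type ...where something in (select somethingelse...) typically translates into a join, so the rule quoted above applies: RRN() returns the record number from the first table of that join, which need not be the table you had in mind.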

Question 2

Trying to use RRN as a primary key is asking for trouble.

I find it hard to believe there isn't a key available.

Granted, there may be no explicit primary key defined in the table itself. But is there a unique key defined in the table?

It's possible there are no keys defined in the table itself (a practice that is 20 years out of date), but in that case there's usually a logical file with a unique key defined that is used by the application as the de-facto primary key to the table.

Try looking for related objects via green screen (DSPDBR) or GUI (via "Show related"). Keyed logical files show in the GUI as views, so you'd need to look at the properties to determine whether they are uniquely keyed DDS logicals rather than non-keyed SQL views.

A few times I've run into tables with no existing de-facto primary key. Usually, it was possible to figure out what could be defined as one from the existing columns.

When there truly is no PK, I simply add one, usually a generated identity column. There's a technique you can use to easily add columns without having to recompile or test any heritage RPG/COBOL programs. (And note that LVLCHK(*NO) is NOT it!)

The technique is laid out in Chapter 4 of the modernizing Redbook http://www.redbooks.ibm.com/abstracts/sg246393.html

1) Move the data to a new PF (or SQL table).
2) Create a new LF using the name of the existing PF.
3) Repoint existing LFs to the new PF (or SQL table).

Done properly, the record format identifiers of the existing objects don't change and thus you don't have to recompile any RPG/COBOL programs.
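The shape of the three steps above can be sketched in a few lines. This is only an analogy, not the Redbook procedure: it uses Python's built-in sqlite3 module because it runs anywhere, a SQLite view stands in for the logical file, and AUTOINCREMENT stands in for a generated identity column. The names orders, orders_base and order_id are made up for illustration. It does not model DDS record format identifiers or level checks, which are the part that actually matters on the IBM i side.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Legacy table with no key at all (hypothetical name and columns).
cur.execute("CREATE TABLE orders (customer TEXT, amount INTEGER)")
cur.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("ACME", 10), ("ACME", 10), ("BOLT", 25)],
)

# Step 1: move the data to a new table that carries a generated key.
cur.execute(
    """CREATE TABLE orders_base (
           order_id INTEGER PRIMARY KEY AUTOINCREMENT,
           customer TEXT,
           amount   INTEGER)"""
)
cur.execute(
    "INSERT INTO orders_base (customer, amount) "
    "SELECT customer, amount FROM orders ORDER BY rowid"
)
cur.execute("DROP TABLE orders")

# Step 2: recreate the old name as a view (standing in for the new LF)
# that exposes only the original columns, in the original order.
cur.execute("CREATE VIEW orders AS SELECT customer, amount FROM orders_base")

# Old consumers keep reading "orders" and see the same layout as before.
legacy_rows = cur.execute(
    "SELECT * FROM orders ORDER BY customer, amount"
).fetchall()

# New code can address each row, including exact duplicates, by its key.
keyed_rows = cur.execute(
    "SELECT order_id, customer, amount FROM orders_base ORDER BY order_id"
).fetchall()

print(legacy_rows)
print(keyed_rows)
```

Note that the two identical ACME rows, which nothing could tell apart before, now have distinct order_id values, while anything that reads through the old name is unaffected. Step 3 (repointing existing logical files) has no counterpart here because the sketch has no other views over the old table.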

Question 3

Unless you can specifically control it, e.g., via ALWCPYDTA(*NO) for STRSQL, SQL may make copies of result rows for any intermediate set of rows. The RRN() function always accesses the physical record number, in contrast to the ROW_NUMBER() function, which returns a logical row number indicating the relative position in an ordered (or unordered) set of rows. If a copy is generated, there is no way to guarantee that RRN() will remain consistent.

Other considerations apply over time; but in this case it's as likely to be simple copying of intermediate result rows as anything.
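To make the copy effect concrete, here is a small sketch using Python's built-in sqlite3 module. It is an analogy only: SQLite's rowid is used as a rough stand-in for RRN(), an explicitly created temporary table plays the role of the intermediate copy that the query engine might make on its own, and the table names items and work are invented. ROW_NUMBER() needs SQLite 3.25 or later, which is assumed here.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

cur.execute("CREATE TABLE items (name TEXT)")
cur.executemany("INSERT INTO items VALUES (?)", [("a",), ("b",), ("c",), ("d",)])
# Delete a row so the physical ids in the base table have a gap.
cur.execute("DELETE FROM items WHERE name = 'b'")

# Physical ids read directly from the base table.
base_ids = cur.execute(
    "SELECT rowid, name FROM items WHERE name IN ('c', 'd') ORDER BY rowid"
).fetchall()

# Simulate an intermediate copy of the same result rows.
cur.execute(
    "CREATE TEMP TABLE work AS "
    "SELECT name FROM items WHERE name IN ('c', 'd') ORDER BY rowid"
)
# The same rows now carry the physical ids of the copy, not of the base table.
copy_ids = cur.execute("SELECT rowid, name FROM work ORDER BY rowid").fetchall()

# ROW_NUMBER() is purely positional within the ordered result set.
positions = cur.execute(
    "SELECT ROW_NUMBER() OVER (ORDER BY name) AS pos, name "
    "FROM items ORDER BY name"
).fetchall()

print(base_ids)   # [(3, 'c'), (4, 'd')]
print(copy_ids)   # [(1, 'c'), (2, 'd')]
print(positions)  # [(1, 'a'), (2, 'c'), (3, 'd')]
```

Rows c and d sit at physical ids 3 and 4 in the base table but at 1 and 2 in the copy, so a physical record number captured from the copy would point at the wrong rows (or at nothing) if it were used against the base table.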