Microsoft SQL Server 2014 Nested From Query in Cross-Apply
-
22-02-2021 - |
Question
When selecting from a nested query in an OUTER APPLY statement the nested query seems to be evaluated only once in certain circumstances.
Bug reported to Azure Feedback Forum: https://feedback.azure.com/forums/908035-sql-server/suggestions/39428632-microsoft-sql-server-2014-incorrect-result-when-s
Is this the expected behavior or am I missing something in the documentation or is this a bug in SQL Server?
Also, is there any possibility to force evaluation of the nested query for every row?
Test Case 1
Evaluates nested FROM query for every row in VALUES (imho expected behaviour)
SELECT
v,
v2
FROM
(VALUES (1), (2), (3), (4)) AS inner_query(v)
OUTER APPLY (
SELECT
MAX(inner_v2) AS v2
FROM (
SELECT
15 AS id,
v AS inner_v2
) AS outer_query
GROUP BY id
) AS outer_apply
Result:
| v | v2|
|---|---|
| 1 | 1 |
| 2 | 2 |
| 3 | 3 |
| 4 | 4 |
Test Case 2
It also evaluates nested FROM query for every row in VALUES (imho expected behaviour)
SELECT
v,
v2
FROM
(VALUES (1), (2), (3), (4)) AS inner_query(v)
OUTER APPLY (
SELECT
MAX(inner_v2) AS v2
FROM (
SELECT
15 AS id,
v AS inner_v2
UNION ALL
SELECT
id AS id,
TestCaseTemp2.v AS inner_v2
FROM
(VALUES (1337, 0)) AS TestCaseTemp2(id, v)
WHERE TestCaseTemp2.v != 0
) AS outer_query
GROUP BY id
) AS outer_apply;
Result:
| v | v2|
|---|---|
| 1 | 1 |
| 2 | 2 |
| 3 | 3 |
| 4 | 4 |
Testcase 3
Evaluates nested FROM query only once
CREATE TABLE TestCaseTemp
(
id int,
v int
);
INSERT INTO TestCaseTemp VALUES (1337, 0);
SELECT
v,
v2
FROM
(VALUES (1), (2), (3), (4)) AS inner_query(v)
OUTER APPLY (
SELECT
MAX(inner_v2) AS v2
FROM (
SELECT
15 AS id,
v AS inner_v2
UNION ALL
SELECT
id AS id,
TestCaseTemp.v AS inner_v2
FROM
TestCaseTemp
WHERE TestCaseTemp.v != 0
) AS outer_query
GROUP BY id
) AS outer_apply;
DROP TABLE TestCaseTemp;
Result:
| v | v2|
|---|---|
| 1 | 1 |
| 2 | 1 |
| 3 | 1 |
| 4 | 1 |
Solution
is this a bug in SQL Server?
Yes, certainly, the 1
that is returned in all rows in your final result only exists in the first row of the outer input so shouldn't even be in scope for the subsequent rows. It looks like the same basic issue as looked at in detail by Paul White here.
I executed your final query in dbfiddle (SQL Server 2019) and pasted the plan here https://www.brentozar.com/pastetheplan/?id=Sy4sBB5lI It looks like the sort executes 4 times (once for each outer row) but for some reason it rewinds rather than rebinds so doesn't call the child operators of the sort more than once. This is a bug as the reference to the correlated parameter of the outer join (Union1004
) should cause a rebind when this has changed value. As a result the reference to Union1004
in Node 5 of the plan is never re-evaluated.
I see you have now reported it here https://feedback.azure.com/forums/908035-sql-server/suggestions/39428632-microsoft-sql-server-2014-incorrect-result-when-s
Is there any possibility to force evaluation of the nested query for every row?
Adding the query hint OPTION (MERGE UNION)
works for your example, I don't know if this will necessarily be sufficient to avoid the bug in all cases but from the linked Paul White answer it appears it should work. In the case of your example it works as the sort is pushed down lower in the plan so it only rewinds the TestCaseTemp
rows, not the entire unioned result. You could also add appropriate indexing to remove the sort entirely.
OTHER TIPS
This is a bug in the way SQL Server decides whether a rebind is required with certain plan shapes. It has been in the product since SQL Server 2005.
Minimal repro
This uses a trace flag to force a spool:
CREATE TABLE #T (v integer NOT NULL);
INSERT #T VALUES (1), (2), (3), (4);
SELECT
#T.v,
A.*
FROM #T
CROSS APPLY
(
SELECT #T.v
UNION ALL
SELECT #T.v WHERE @@SPID < 0
) AS A
OPTION (QUERYTRACEON 8691);
Plan:
Wrong results:
Expanded repro
This does not require a trace flag to produce a spool:
DROP TABLE #T;
CREATE TABLE #T (v integer NOT NULL);
INSERT #T
(v)
VALUES
(1), (1), (1), (1),
(1), (2), (3), (4);
SELECT
#T.v,
A.v
FROM #T
CROSS APPLY
(
SELECT v = MAX(U.v)
FROM
(
SELECT #T.v
UNION ALL
SELECT #T.v
WHERE @@SPID < 0
) AS U
GROUP BY U.v % 2
) AS A
ORDER BY #T.v
OPTION (USE HINT ('FORCE_DEFAULT_CARDINALITY_ESTIMATION'), HASH GROUP);
Bug status
Fixed for Azure SQL Database. Also fixed from SQL Server 2019 CU9, and 2016 SP2 CU16.