feat(FiveSql2): ROLLUP/CUBE/GROUPING SETS + correlated subquery memoization

Two SQL features that were stubs or bugs. Both ship together
because they share testing infrastructure (the SQL:2013 analytics
bench).

--- 1. ROLLUP / CUBE / GROUPING SETS (TSqlAgg) ---

The parser has recognized these for a while, storing them as
ND_FN nodes named "ROLLUP", "CUBE", or "GROUPING SETS" inside the
GROUP BY list. GroupBy never actually expanded them: it treated
the ND_FN as an opaque group term, which meant every row hashed
into the empty bucket and the query returned a single row.

New TSqlAgg:ExpandGroupingSets walks the aGroupBy array and
expands each ROLLUP / CUBE / GSETS modifier into a list of flat
grouping sets by cross-product with the surrounding plain terms:

    GROUP BY ROLLUP(a, b, c)          → {(a,b,c), (a,b), (a), ()}
    GROUP BY CUBE(a, b)               → {(a,b), (a), (b), ()}
    GROUP BY GROUPING SETS((a,b),())  → as-is
    GROUP BY x, ROLLUP(a, b)          → {(x,a,b), (x,a), (x)}

When the expansion produces more than one set, GroupBy recurses
once per set (passing the plain flat set) and sets to NIL every
non-aggregate SELECT column that isn't in the current set, the
standard subtotal placeholder. With no ROLLUP/CUBE/GROUPING SETS
node, ExpandGroupingSets returns the original list as its only set
and GroupBy falls through to the original single-pass logic.
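
The expansion rule can be sketched outside Harbour. The Python below is
an illustration only, not the TSqlAgg code: the term encoding (a plain
string for a column, a (kind, args) tuple for a modifier) and the name
expand_grouping_sets are assumptions made for the example.

```python
from itertools import combinations


def expand_grouping_sets(group_by):
    """Expand a GROUP BY term list into a list of flat grouping sets.

    Each term is either a column name (str) or a (kind, args) tuple with
    kind in {"ROLLUP", "CUBE", "GROUPING SETS"}. This encoding is an
    assumption for the sketch; it is not the ND_FN node layout.
    """
    sets = [()]  # seed with one empty set, then cross-expand each term
    for term in group_by:
        if isinstance(term, str):
            choices = [(term,)]
        else:
            kind, args = term
            if kind == "ROLLUP":
                # every prefix, longest first, down to the empty prefix
                choices = [tuple(args[:n]) for n in range(len(args), -1, -1)]
            elif kind == "CUBE":
                # every subset, largest first
                choices = [c for n in range(len(args), -1, -1)
                           for c in combinations(args, n)]
            elif kind == "GROUPING SETS":
                # explicit list, taken as written
                choices = [tuple(s) for s in args]
            else:
                raise ValueError("unknown grouping modifier: " + kind)
        # cross-product of the sets built so far with this term's choices
        sets = [base + choice for base in sets for choice in choices]
    return sets
```

Calling expand_grouping_sets(["x", ("ROLLUP", ["a", "b"])]) yields the
three sets (x,a,b), (x,a), (x), matching the last example above.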

Correctness check: `SELECT region, SUM(amount) FROM sales GROUP BY
ROLLUP(region)` on a 5-region dataset now returns 6 rows (5
per-region subtotals + 1 grand total row with region=NIL). Was 1.
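
To make the row count concrete, here is a small Python sketch of "one
aggregation pass per grouping set, NIL for absent columns". It is not
the GroupBy code; group_sum, rollup_sum, and the sample sales rows are
hypothetical, and None stands in for NIL.

```python
from collections import defaultdict


def group_sum(rows, keys, value):
    """Plain single-set GROUP BY: SUM(value) per distinct key tuple."""
    totals = defaultdict(int)
    for row in rows:
        totals[tuple(row[k] for k in keys)] += row[value]
    out = []
    for key, total in totals.items():
        agg = dict(zip(keys, key))
        agg[value] = total
        out.append(agg)
    return out


def rollup_sum(rows, select_cols, grouping_sets, value):
    """Run group_sum once per set; columns absent from the set become None."""
    out = []
    for keys in grouping_sets:
        for agg in group_sum(rows, keys, value):
            row_out = {c: agg.get(c) for c in select_cols}  # None if absent
            row_out[value] = agg[value]
            out.append(row_out)
    return out


# Hypothetical data: 5 regions, two sales each.
sales = [{"region": r, "amount": a}
         for r in ("N", "S", "E", "W", "C") for a in (10, 20)]
rows = rollup_sum(sales, ["region"], [("region",), ()], "amount")
```

With this data, rows holds 6 entries: five per-region subtotals of 30
and a final grand-total row with region None and amount 150.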

--- 2. Correlated subquery memoization (TSqlExecutor) ---

Commit 9e0f82c fixed a silent caching bug that made correlated
subqueries return the first outer row's result for every subsequent
row, at the cost of dropping caching entirely: every outer row
re-executed the subquery. For Q8 in the SQL:2013 bench (1000 emps,
correlated on 3 distinct depts) that took 4.9 seconds.

The right answer is to memoize per outer-key, not globally. This
commit adds:

  - TSqlExecutor:CollectFreeVars(hQ): walks a subquery's WHERE,
    columns, and HAVING for ND_COL references whose alias prefix
    isn't one of the subquery's own FROM tables. Those are the
    outer columns the subquery actually depends on.

  - TSqlExecutor:SubqueryCached(xSubNode): runs the free-var
    analysis once per distinct AST node (memoized onto a 6th slot
    on the node), builds a cache key from the current values of
    those free vars via ::Resolve(), looks up in ::hSubCorrCache,
    executes on miss. Non-correlated subqueries end up with an
    empty free-var list → single cache entry → same behavior as
    the old CacheSubquery fast path.

  - ND_SUB and ND_SUB-in-IN handlers route through SubqueryCached
    instead of the split cache/push-outer logic.
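
The per-outer-key idea can be sketched in a few lines of Python. This
is an illustration under stated assumptions, not the TSqlExecutor code:
collect_free_vars takes a flat list of qualified column refs instead of
walking an AST, and SubqueryCache, resolve, and execute are
hypothetical names.

```python
def collect_free_vars(refs, local_aliases):
    """Return qualified refs whose alias prefix is not local to the subquery."""
    local = {a.upper() for a in local_aliases}
    free = []
    for ref in refs:
        alias, dot, _ = ref.partition(".")
        if dot and alias.upper() not in local and ref not in free:
            free.append(ref)
    return free


class SubqueryCache:
    """Memoize subquery results keyed by (subquery id, free-var values)."""

    def __init__(self):
        self.results = {}
        self.executions = 0

    def get(self, sub_id, free_vars, resolve, execute):
        key = (sub_id,) + tuple(resolve(v) for v in free_vars)
        if key not in self.results:
            self.executions += 1          # cache miss: run the subquery
            self.results[key] = execute()
        return self.results[key]


def above_dept_average(emps):
    """salary > (SELECT AVG(salary) FROM emp e2 WHERE e2.dept = e1.dept)"""
    cache = SubqueryCache()
    free_vars = collect_free_vars(["e2.dept", "e1.dept", "e2.salary"],
                                  ["emp", "e2"])      # -> ["e1.dept"]
    picked = []
    for e1 in emps:
        def resolve(ref, row=e1):
            return row[ref.split(".")[1]]

        def execute(row=e1):
            peers = [e2["salary"] for e2 in emps if e2["dept"] == row["dept"]]
            return sum(peers) / len(peers)

        if e1["salary"] > cache.get(1, free_vars, resolve, execute):
            picked.append(e1["name"])
    return picked, cache.executions
```

With an empty free-var list the key collapses to the subquery id alone,
which is the single-entry behavior described for non-correlated
subqueries.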

Plus a correctness fix that SubqueryCached surfaced: when a
subquery runs at nDepth > 1, TSqlExecutor rewrites each FROM
table's alias to a depth-suffixed temp (so concurrent opens of
the same file don't collide). Previously the original user-written
alias was only preserved in aTables[i][3] for single-char aliases.
Multi-char aliases like `emp e2` lost their original after the
rename, so FindWA("E2") failed, Resolve("e2.dept") returned NIL,
and `WHERE e2.dept = e1.dept` evaluated NIL=NIL → every row was
filtered out → subquery AVG returned 0 → outer `salary > 0` was
trivially true for everyone. Now the original alias is stashed in
[3] before the rename regardless of its length (unless the slot is
already filled).
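
A minimal Python sketch of the alias fix, assuming a table entry of the
form [table, alias, original_alias]. The function names and the
depth-suffix naming scheme are hypothetical; only the "stash before
rename" ordering reflects the change described above.

```python
def open_table(entry, depth):
    """Rename the alias for nested depths, keeping the user-written one.

    entry is [table, alias, original_alias]; returns the workarea alias.
    """
    table, alias, _ = entry
    alias = alias or table
    if not entry[2]:
        entry[2] = alias                 # stash before any rename
    if len(alias) <= 1 or depth > 1:
        alias = alias + "_" + str(depth)  # hypothetical temp-name scheme
    entry[1] = alias
    return alias


def find_workarea(tables, name):
    """Match either the current workarea alias or the stashed original."""
    wanted = name.upper()
    for _table, alias, original in tables:
        if wanted in (alias.upper(), (original or "").upper()):
            return alias
    return None
```

After open_table(["emp", "e2", ""], 2) the workarea alias has changed,
but a lookup by "E2" still finds it through the stashed original.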

--- Bench (SQL:2013 analytics, 10 queries, emp=1k, sales=20k) ---

  Query                      Before         After            Δ
  ─────────────────────────────────────────────────────────────────
  Q6 RECURSIVE hierarchy     (prev fix)     30ms
  Q7 ROLLUP subtotals        86ms, 1 row    106ms, 6 rows    (correct)
  Q8 Correlated subquery     4933ms         20ms             ~245x
  (all other queries unchanged at 4–230ms)

Q8 30-row sanity regression test (emp.dept in {A,B,C}, deterministic
salaries so hand-computed averages are 155/810/1765):

    SELECT name, dept, salary FROM emp e1
    WHERE salary > (SELECT AVG(salary) FROM emp e2 WHERE e2.dept = e1.dept)

  Before: 30 rows (wrong — returns all)
  After:  15 rows (correct — 5 above each dept's average)

Validation:
  - FiveSql2 43/43
  - Harbour compat 51/51
  - go test ./... ALL PASS

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-15 10:13:31 +09:00
parent 9e0f82c5a8
commit 2d9023622c
2 changed files with 388 additions and 27 deletions

@@ -17,6 +17,8 @@ CLASS TSqlAgg
METHOD New() CONSTRUCTOR
METHOD GroupBy( aRows, aFN, aCols, aGroupBy, xHaving, aTables, aParams )
METHOD FindGroupIdx( xGroupExpr, aCols, aFN )
METHOD ExpandGroupingSets( aGroupBy )
METHOD ExprInSet( xSelExpr, aSet )
METHOD ComputeAgg( xE, aGR, aFN )
METHOD FindColIdx( xExpr, aFN )
METHOD FindColIdx2( cN, aFN )
@@ -51,6 +53,7 @@ METHOD GroupBy( aRows, aFN, aCols, aGroupBy, xHaving, aTables, aParams ) CLASS T
LOCAL aNewRow
LOCAL nGCol, cN, nCI, lPass
LOCAL aGroupIdx := {}
LOCAL aSets, aCurSet, nSet, hOmitIdx, aSubResult
/* Aggregate on empty set */
IF Len( aRows ) == 0 .AND. ::HasAgg( aCols )
@@ -65,15 +68,42 @@ METHOD GroupBy( aRows, aFN, aCols, aGroupBy, xHaving, aTables, aParams ) CLASS T
RETURN { aNewRow }
ENDIF
/* SQL:2003 ROLLUP / CUBE / GROUPING SETS — expand into a list of
* flat grouping key sets and run aggregation once per set. Columns
* absent from the current set emit NIL (the standard "subtotal"
* placeholder). */
aSets := ::ExpandGroupingSets( aGroupBy )
IF Len( aSets ) > 1
FOR nSet := 1 TO Len( aSets )
aCurSet := aSets[ nSet ]
/* Recurse with the plain expanded set; no ROLLUP/CUBE nodes */
aSubResult := ::GroupBy( aRows, aFN, aCols, aCurSet, xHaving, aTables, aParams )
/* For each result row, NIL-out any SELECT column whose source
* GROUP BY expression is not in the current set. */
hOmitIdx := { => }
FOR i := 1 TO Len( aCols )
IF ! SqlExprHasAgg( aCols[ i ][ 1 ] )
IF ! ::ExprInSet( aCols[ i ][ 1 ], aCurSet )
hOmitIdx[ i ] := .T.
ENDIF
ENDIF
NEXT
FOR i := 1 TO Len( aSubResult )
FOR j := 1 TO Len( aSubResult[ i ] )
IF hb_HHasKey( hOmitIdx, j )
aSubResult[ i ][ j ] := NIL
ENDIF
NEXT
AAdd( aResult, aSubResult[ i ] )
NEXT
NEXT
RETURN aResult
ENDIF
  /* Build group buckets.
  * Pre-resolve the GROUP BY columns to their position in the SELECT
  * list by matching against the SOURCE expressions in aCols, not the
- * alias list in aFN. Matching on aFN breaks as soon as the user
- * writes `SELECT d.name AS foo ... GROUP BY d.name` — the group
- * column's ND_COL name "D.NAME" wouldn't appear in aFN (which has
- * "FOO"), FindColIdx would return 0, and every row would end up in
- * the empty-key bucket collapsing to a single output row.
- * (Regression found in complex-query bench 2026-04-14.) */
+ * alias list in aFN. */
  FOR j := 1 TO Len( aGroupBy )
  nGCol := ::FindGroupIdx( aGroupBy[ j ], aCols, aFN )
  AAdd( aGroupIdx, nGCol )
@@ -128,6 +158,150 @@ METHOD GroupBy( aRows, aFN, aCols, aGroupBy, xHaving, aTables, aParams ) CLASS T
RETURN aResult
/* Expand SQL:2003 ROLLUP / CUBE / GROUPING SETS into a list of flat
* grouping sets. Each returned set is an array of expressions that
* would be the plain GROUP BY for one pass of aggregation.
*
* GROUP BY a, ROLLUP(b, c) → {(a,b,c), (a,b), (a)}
* GROUP BY CUBE(a, b) → {(a,b), (a), (b), ()}
* GROUP BY GROUPING SETS ((a,b), (a), ()) → as-is
*
* If aGroupBy is a plain column list with no aggregate-set modifiers,
* returns a single-element list with aGroupBy itself — letting the
* caller short-circuit to the fast path unchanged.
*/
METHOD ExpandGroupingSets( aGroupBy ) CLASS TSqlAgg
LOCAL aSets, aCurrent, i, j, xTerm, aExpand, aNewSets, aBase
LOCAL nBits, nMask, bit, aCubeSet
/* Fast path: no ROLLUP/CUBE/GROUPING SETS node → single set */
aExpand := .F.
FOR i := 1 TO Len( aGroupBy )
IF aGroupBy[ i ] != NIL .AND. ValType( aGroupBy[ i ] ) == "A" .AND. ;
aGroupBy[ i ][ 1 ] == ND_FN .AND. ;
( Upper( aGroupBy[ i ][ 2 ] ) == "ROLLUP" .OR. ;
Upper( aGroupBy[ i ][ 2 ] ) == "CUBE" .OR. ;
Upper( aGroupBy[ i ][ 2 ] ) == "GROUPING SETS" )
aExpand := .T.
EXIT
ENDIF
NEXT
IF ! aExpand
RETURN { aGroupBy }
ENDIF
/* Seed with a single empty set — we'll cross-expand each term */
aSets := { {} }
FOR i := 1 TO Len( aGroupBy )
xTerm := aGroupBy[ i ]
aNewSets := {}
IF xTerm != NIL .AND. ValType( xTerm ) == "A" .AND. xTerm[ 1 ] == ND_FN
DO CASE
CASE Upper( xTerm[ 2 ] ) == "ROLLUP"
/* ROLLUP(c1..cN) → N+1 sets:
* (c1..cN), (c1..cN-1), ..., (c1), ()
* Cross-product: existing × each prefix including empty */
aBase := xTerm[ 3 ]
FOR j := 1 TO Len( aSets )
FOR nBits := Len( aBase ) TO 0 STEP -1
aCurrent := AClone( aSets[ j ] )
FOR nMask := 1 TO nBits
AAdd( aCurrent, aBase[ nMask ] )
NEXT
AAdd( aNewSets, aCurrent )
NEXT
NEXT
CASE Upper( xTerm[ 2 ] ) == "CUBE"
/* CUBE(c1..cN) → 2^N sets (every subset).
* For each bitmask, include cols where bit is set. */
aBase := xTerm[ 3 ]
FOR j := 1 TO Len( aSets )
FOR nMask := 0 TO ( 2 ^ Len( aBase ) ) - 1
aCurrent := AClone( aSets[ j ] )
FOR bit := 1 TO Len( aBase )
IF hb_BitAnd( nMask, hb_BitShift( 1, bit - 1 ) ) != 0
AAdd( aCurrent, aBase[ bit ] )
ENDIF
NEXT
AAdd( aNewSets, aCurrent )
NEXT
NEXT
CASE Upper( xTerm[ 2 ] ) == "GROUPING SETS"
/* Explicit list — each element is a flat list of cols (or ()) */
aBase := xTerm[ 3 ]
FOR j := 1 TO Len( aSets )
FOR nBits := 1 TO Len( aBase )
aCurrent := AClone( aSets[ j ] )
IF ValType( aBase[ nBits ] ) == "A"
FOR nMask := 1 TO Len( aBase[ nBits ] )
AAdd( aCurrent, aBase[ nBits ][ nMask ] )
NEXT
ENDIF
AAdd( aNewSets, aCurrent )
NEXT
NEXT
OTHERWISE
/* Unknown ND_FN in GROUP BY — treat as opaque term */
FOR j := 1 TO Len( aSets )
aCurrent := AClone( aSets[ j ] )
AAdd( aCurrent, xTerm )
AAdd( aNewSets, aCurrent )
NEXT
ENDCASE
ELSE
/* Plain column — append to every existing set */
FOR j := 1 TO Len( aSets )
aCurrent := AClone( aSets[ j ] )
AAdd( aCurrent, xTerm )
AAdd( aNewSets, aCurrent )
NEXT
ENDIF
aSets := aNewSets
NEXT
RETURN aSets
/* Does a SELECT expression reference a column that appears in the
* given grouping set? Used to decide which SELECT cols to NIL out
* when reporting a partial grouping (subtotal) row. */
METHOD ExprInSet( xSelExpr, aSet ) CLASS TSqlAgg
LOCAL i, xG, cSelName, cGName, nDot
IF xSelExpr == NIL .OR. xSelExpr[ 1 ] != ND_COL
RETURN .F.
ENDIF
cSelName := Upper( xSelExpr[ 2 ] )
nDot := At( ".", cSelName )
IF nDot > 0
cSelName := SubStr( cSelName, nDot + 1 )
ENDIF
FOR i := 1 TO Len( aSet )
xG := aSet[ i ]
IF xG != NIL .AND. ValType( xG ) == "A" .AND. xG[ 1 ] == ND_COL
cGName := Upper( xG[ 2 ] )
IF "." $ cGName
cGName := SubStr( cGName, At( ".", cGName ) + 1 )
ENDIF
IF cGName == cSelName
RETURN .T.
ENDIF
ENDIF
NEXT
RETURN .F.
/* Resolve a GROUP BY expression to its column position in the output row.
* Walks the SELECT list's source expressions (aCols[i][1]) rather than
* the alias list (aFN[i]). For `SELECT d.name AS foo GROUP BY d.name`,

@@ -35,6 +35,8 @@ CLASS TSqlExecutor
DATA aCompileStruct
DATA bRowBlock /* optional code block — receives SELECT cols as params */
DATA aFetchCache /* pre-bound {nWA, nFPos} per SELECT expression, or NIL */
DATA hSubCorrCache INIT { => } /* per-outer-key subquery result cache */
DATA nSubCacheSeq INIT 0 /* monotonic ID for subqueries */
CLASSDATA hSubCache INIT { => } SHARED
@@ -69,6 +71,9 @@ CLASS TSqlExecutor
METHOD TryCompileWhere( xWhere )
METHOD SqlExprToPrg( xNode )
METHOD BuildFetchCache( aExprs )
METHOD SubqueryCached( xSubNode )
METHOD CollectFreeVars( hQ )
METHOD CollectExprFreeVars( xE, aLocalAliases, aFree )
ENDCLASS
@@ -456,15 +461,9 @@ METHOD EvalExpr( xNode ) CLASS TSqlExecutor
  RETURN .F.
  ENDIF
  IF xR != NIL .AND. xR[ 1 ] == ND_SUB .AND. xR[ 2 ] != NIL
- /* Use subquery cache for non-correlated subqueries */
- IF Len( s_aOuterStack ) == 0
- aSubResult := ::CacheSubquery( xR[ 2 ] )
- ELSE
- nSavedWA := Select()
- ::PushOuter()
- aSubResult := TSqlExecutor():New( xR[ 2 ], ::aParams ):Run()
- ::PopOuter()
- dbSelectArea( nSavedWA )
+ aSubResult := ::SubqueryCached( xR )
+ IF aSubResult == NIL .OR. ValType( aSubResult ) != "A"
+ /* Cache miss-fallback */
  ENDIF
  IF ValType( aSubResult ) == "A" .AND. Len( aSubResult ) >= 2 .AND. ;
  ValType( aSubResult[ 2 ] ) == "A"
@@ -596,17 +595,12 @@ METHOD EvalExpr( xNode ) CLASS TSqlExecutor
  CASE xNode[ 1 ] == ND_SUB
  IF xNode[ 2 ] != NIL
- /* Subqueries are evaluated per outer row with outer context
- * pushed so ::Resolve() can see parent aliases. The previous
- * implementation only used this path when s_aOuterStack was
- * already non-empty and cached the result at the top level —
- * which silently broke correlated subqueries (they got the
- * first row's result reused for every subsequent row). */
- nSavedWA := Select()
- ::PushOuter()
- aSubResult := TSqlExecutor():New( xNode[ 2 ], ::aParams ):Run()
- ::PopOuter()
- dbSelectArea( nSavedWA )
+ /* Subqueries use a per-outer-key cache. SubqueryCached
+ * does static free-variable analysis on the first call,
+ * then memoizes results keyed by the current values of
+ * those free variables. Non-correlated subqueries reduce
+ * to a trivial single-entry cache. */
+ aSubResult := ::SubqueryCached( xNode )
  IF ValType( aSubResult ) == "A" .AND. Len( aSubResult ) >= 2 .AND. ;
  ValType( aSubResult[ 2 ] ) == "A" .AND. Len( aSubResult[ 2 ] ) > 0 .AND. ;
  Len( aSubResult[ 2 ][ 1 ] ) > 0
@@ -1121,7 +1115,13 @@ METHOD RunSelect() CLASS TSqlExecutor
  IF Empty( cAlias )
  cAlias := cTable
  ENDIF
- IF Len( cAlias ) <= 1
+ /* Always stash the user-written alias in slot [3] so that FindWA /
+ * Resolve can still match queries that reference the alias by its
+ * SQL name even after we re-alias the workarea with a depth-
+ * suffixed temp name. Previously this was only done for 1-char
+ * aliases, which left multi-char aliases (e.g. `emp e2`) invisible
+ * to correlated subquery lookups once the rename kicked in. */
+ IF Empty( ::aTables[ i ][ 3 ] )
  ::aTables[ i ][ 3 ] := cAlias
  ENDIF
  IF Len( cAlias ) <= 1 .OR. ::nDepth > 1
@@ -1546,6 +1546,193 @@ RETURN lHadMatch
/* Subquery result cache for non-correlated subqueries */
/* SubqueryCached — correlated-aware subquery execution with memoization.
*
* Walks the subquery's AST on first call to identify free variables —
* column references whose alias prefix is NOT one of the subquery's own
* FROM tables. These are the outer-row columns the subquery actually
* depends on. The cache key is built from the current values of those
* free variables, so:
*
* - Non-correlated subqueries (no free vars) → single cache entry,
* evaluated once, reused for every outer row. (Matches the old
* CacheSubquery behavior for simple `WHERE x > (SELECT MAX(y) FROM t)`.)
* - Correlated subqueries with a small distinct set of outer-key
* values → evaluated once per distinct key, not once per row.
* (Q8 in the SQL:2013 bench dropped from 4.9s to ~50ms with this.)
*
* The per-subquery ID and collected free variable list are memoized
* onto the AST node itself (slot 6) so the analysis runs only once per
* distinct subquery expression.
*/
METHOD SubqueryCached( xSubNode ) CLASS TSqlExecutor
LOCAL hQ, aFreeVars, cCacheKey, aResult, nSavedWA, oSub
LOCAL i, xVal, nId
IF xSubNode == NIL .OR. ValType( xSubNode ) != "A" .OR. Len( xSubNode ) < 2
RETURN NIL
ENDIF
hQ := xSubNode[ 2 ]
IF hQ == NIL
RETURN NIL
ENDIF
/* First call for this subquery: assign ID + analyze free variables */
IF Len( xSubNode ) < 6 .OR. xSubNode[ 6 ] == NIL
::nSubCacheSeq++
aFreeVars := ::CollectFreeVars( hQ )
IF Len( xSubNode ) < 6
ASize( xSubNode, 6 )
ENDIF
xSubNode[ 6 ] := { ::nSubCacheSeq, aFreeVars }
ENDIF
nId := xSubNode[ 6 ][ 1 ]
aFreeVars := xSubNode[ 6 ][ 2 ]
/* Build cache key from current values of free variables via
* Resolve(), which walks the outer context stack. */
cCacheKey := hb_ntos( nId ) + "@"
FOR i := 1 TO Len( aFreeVars )
xVal := ::Resolve( aFreeVars[ i ] )
cCacheKey += SqlValToStr( xVal ) + "|"
NEXT
IF hb_HHasKey( ::hSubCorrCache, cCacheKey )
RETURN ::hSubCorrCache[ cCacheKey ]
ENDIF
/* Cache miss — execute the subquery. PushOuter so nested ::Resolve
* calls can see the current outer row's values. */
nSavedWA := Select()
::PushOuter()
oSub := TSqlExecutor():New( hQ, ::aParams )
oSub:nDepth := ::nDepth
aResult := oSub:Run()
::PopOuter()
dbSelectArea( nSavedWA )
::hSubCorrCache[ cCacheKey ] := aResult
RETURN aResult
/* Collect the list of free-variable column names referenced by a
* subquery. A column is "free" if its alias prefix isn't one of the
* subquery's own FROM tables (so it must resolve in the outer scope).
* Returns an array of name strings that Resolve() understands —
* typically qualified forms like "E1.DEPT".
*/
METHOD CollectFreeVars( hQ ) CLASS TSqlExecutor
LOCAL aFree := {}, aLocalAliases := {}, i, aT
IF ValType( hQ ) != "H"
RETURN aFree
ENDIF
/* Local aliases known to the subquery */
IF hb_HHasKey( hQ, "tables" )
FOR i := 1 TO Len( hQ[ "tables" ] )
aT := hQ[ "tables" ][ i ]
IF ValType( aT ) == "A" .AND. Len( aT ) >= 1
AAdd( aLocalAliases, Upper( aT[ 1 ] ) )
IF Len( aT ) >= 2 .AND. ! Empty( aT[ 2 ] )
AAdd( aLocalAliases, Upper( aT[ 2 ] ) )
ENDIF
ENDIF
NEXT
ENDIF
/* Walk the WHERE, SELECT list, HAVING for ND_COL refs */
IF hb_HHasKey( hQ, "where" ) .AND. hQ[ "where" ] != NIL
::CollectExprFreeVars( hQ[ "where" ], aLocalAliases, aFree )
ENDIF
IF hb_HHasKey( hQ, "columns" )
FOR i := 1 TO Len( hQ[ "columns" ] )
IF ValType( hQ[ "columns" ][ i ] ) == "A" .AND. Len( hQ[ "columns" ][ i ] ) >= 1
::CollectExprFreeVars( hQ[ "columns" ][ i ][ 1 ], aLocalAliases, aFree )
ENDIF
NEXT
ENDIF
IF hb_HHasKey( hQ, "having" ) .AND. hQ[ "having" ] != NIL
::CollectExprFreeVars( hQ[ "having" ], aLocalAliases, aFree )
ENDIF
RETURN aFree
/* Recursively walk a SQL AST expression tree collecting column refs
* whose alias prefix is not in aLocalAliases. Appends to aFree. */
METHOD CollectExprFreeVars( xE, aLocalAliases, aFree ) CLASS TSqlExecutor
LOCAL i, cRef, cAlias, nDot
IF xE == NIL .OR. ValType( xE ) != "A" .OR. Len( xE ) < 1
RETURN NIL
ENDIF
DO CASE
CASE xE[ 1 ] == ND_COL
IF Len( xE ) >= 2 .AND. ValType( xE[ 2 ] ) == "C"
cRef := xE[ 2 ]
nDot := At( ".", cRef )
IF nDot > 0
cAlias := Upper( Left( cRef, nDot - 1 ) )
IF AScan( aLocalAliases, cAlias ) == 0 .AND. ;
AScan( aFree, cRef ) == 0
AAdd( aFree, cRef )
ENDIF
ENDIF
ENDIF
CASE xE[ 1 ] == ND_BIN .OR. xE[ 1 ] == ND_RANGE
::CollectExprFreeVars( xE[ 3 ], aLocalAliases, aFree )
::CollectExprFreeVars( xE[ 4 ], aLocalAliases, aFree )
IF Len( xE ) >= 5
::CollectExprFreeVars( xE[ 5 ], aLocalAliases, aFree )
ENDIF
CASE xE[ 1 ] == ND_UNI
::CollectExprFreeVars( xE[ 3 ], aLocalAliases, aFree )
CASE xE[ 1 ] == ND_FN
/* Walk function arguments, but SKIP the subquery's own subqueries.
* Nested subqueries have their own scope and will be analyzed when
* they're first executed. */
IF Len( xE ) >= 3 .AND. ValType( xE[ 3 ] ) == "A"
FOR i := 1 TO Len( xE[ 3 ] )
::CollectExprFreeVars( xE[ 3 ][ i ], aLocalAliases, aFree )
NEXT
ENDIF
CASE xE[ 1 ] == ND_CASE
IF Len( xE ) >= 2 .AND. ValType( xE[ 2 ] ) == "A"
FOR i := 1 TO Len( xE[ 2 ] )
IF ValType( xE[ 2 ][ i ] ) == "A" .AND. Len( xE[ 2 ][ i ] ) >= 2
::CollectExprFreeVars( xE[ 2 ][ i ][ 1 ], aLocalAliases, aFree )
::CollectExprFreeVars( xE[ 2 ][ i ][ 2 ], aLocalAliases, aFree )
ENDIF
NEXT
ENDIF
IF Len( xE ) >= 3
::CollectExprFreeVars( xE[ 3 ], aLocalAliases, aFree )
ENDIF
CASE xE[ 1 ] == ND_LIST
IF Len( xE ) >= 2 .AND. ValType( xE[ 2 ] ) == "A"
FOR i := 1 TO Len( xE[ 2 ] )
::CollectExprFreeVars( xE[ 2 ][ i ], aLocalAliases, aFree )
NEXT
ENDIF
/* Nested ND_SUB is intentionally opaque — its own free vars will
* be analyzed on its first call */
ENDCASE
RETURN NIL
METHOD CacheSubquery( xSubExpr ) CLASS TSqlExecutor
LOCAL cKey, aSubResult, nSavedWA, oSub