[BUG] Literals of nested type are ignored when estimating the output size in PreProjectSplitIterator
#11903
Describe the bug
When running the following toy query, the estimated output size (167.688 KB) returned by "PreProjectSplitIterator.calcMinOutputSize" is far smaller than the actual size (1269.768 KB) after the projection.
Projection list:
Based on the projection list in the above query, the output batch should in theory be about 12 times larger than the input batch, not counting offset and validity buffers. We also saw a batch of ~4.4G in some custom queries on a T4.
Looking at the code here, GPU literals of nested type (an array of integers in this query) are not included in the estimated output size. As a result, the estimate stays small enough that no real splitting happens, even though splitting is expected, and a very large batch is produced in the production environment.
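To illustrate the effect, here is a minimal Python sketch of a size estimator that skips nested literals, next to one that counts them. It is a simplified model and not the plugin's actual code: the class names, `estimate_ignoring_nested`, `estimate_with_nested`, `num_splits`, and all the numbers are hypothetical, and only data bytes are counted (no offset or validity buffers).

```python
from dataclasses import dataclass

INT_BYTES = 4  # assumed size of one 32-bit integer element


@dataclass
class FixedWidthExpr:
    """Hypothetical stand-in for a column reference or scalar literal."""
    bytes_per_row: int


@dataclass
class ArrayLiteral:
    """Hypothetical stand-in for a literal of nested type (array of int)."""
    num_elements: int
    element_bytes: int = INT_BYTES


def estimate_ignoring_nested(exprs, num_rows):
    """Models the reported behavior: nested literals contribute nothing."""
    total = 0
    for e in exprs:
        if isinstance(e, FixedWidthExpr):
            total += e.bytes_per_row * num_rows
        # ArrayLiteral is skipped here, which is the underestimate.
    return total


def estimate_with_nested(exprs, num_rows):
    """Counts the array literal's data once per output row."""
    total = 0
    for e in exprs:
        if isinstance(e, ArrayLiteral):
            total += e.num_elements * e.element_bytes * num_rows
        else:
            total += e.bytes_per_row * num_rows
    return total


def num_splits(estimated_bytes, target_bytes):
    """Number of pieces needed to keep each piece within target_bytes."""
    return max(1, -(-estimated_bytes // target_bytes))  # ceiling division


# One int column plus a 10-element int array literal, over 1000 rows.
projection = [FixedWidthExpr(INT_BYTES), ArrayLiteral(num_elements=10)]
rows = 1000
target = 16000  # hypothetical target batch size in bytes

low = estimate_ignoring_nested(projection, rows)
full = estimate_with_nested(projection, rows)
print(low, num_splits(low, target))    # estimate and split count without the literal
print(full, num_splits(full, target))  # estimate and split count with the literal
```

In this toy setup the estimate that skips the array literal fits within the target, so no split is requested, while the estimate that includes it asks for several splits. This mirrors the behavior described above, where the underestimate lets an oversized batch through.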