Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve AS OF Join Performance through Max Lookback parameter #246

Open
rportilla-databricks opened this issue Aug 19, 2022 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@rportilla-databricks
Copy link
Contributor

Currently, AS OF joins require a window to be retained in memory per partition key (key for the Spark window). Improve may benefit from only looking back a certain number of records (or using a range if appropriate). This ticket should have:

  1. Tests showing perf improvement
  2. Prototype with a lookback parameter option (records lookback or range)
  3. Feedback from a user to see whether this type of approximation is acceptable.
@bendoan-db bendoan-db self-assigned this Nov 1, 2022
@tnixon tnixon added the enhancement New feature or request label Apr 24, 2023
@tnixon
Copy link
Contributor

tnixon commented Apr 24, 2023

Would this be covered by the new tolerance parameter added by @nina-hu in #304?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants