General questions about snappydata #1543

Open
singhals opened this issue Apr 16, 2020 · 0 comments

Comments

@singhals

Hi

First, SnappyData is amazing! We were having trouble joining large datasets, and the min/max column statistics gave us an incredible boost. I have two follow-up questions:

  1. Is it worthwhile to pre-sort my Parquet data on the columns I'm joining on? My hope is that this would organize the column buffers into a smaller number of min/max ranges, making queries and joins more efficient.

  2. Is it possible to partition two different tables by the same two columns AND co-locate them? Both tables have an id column and a type column, and we want to distribute rows evenly across nodes by those two columns, but we get an error when trying to colocate.
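
To make question 1 concrete, here is a toy sketch of the effect I'm hoping for. It is plain Python, not SnappyData code: the batch size and the `batch_ranges` / `batches_to_scan` helpers are made up for illustration, and I'm assuming a batch can be skipped whenever the lookup key falls outside its min/max range.

```python
import random

BATCH_SIZE = 1000  # hypothetical number of rows per column batch


def batch_ranges(values, batch_size=BATCH_SIZE):
    """Split values into consecutive batches and return (min, max) per batch."""
    return [
        (min(values[i:i + batch_size]), max(values[i:i + batch_size]))
        for i in range(0, len(values), batch_size)
    ]


def batches_to_scan(ranges, key):
    """Count the batches whose [min, max] range could contain key."""
    return sum(1 for lo, hi in ranges if lo <= key <= hi)


random.seed(0)
shuffled = list(range(100_000))
random.shuffle(shuffled)  # stands in for data loaded in arbitrary order

key = 42_424
unsorted_hits = batches_to_scan(batch_ranges(shuffled), key)
sorted_hits = batches_to_scan(batch_ranges(sorted(shuffled)), key)

print("batches scanned without pre-sorting:", unsorted_hits)
print("batches scanned with pre-sorting:   ", sorted_hits)
```

With unique, sorted ids the batch ranges do not overlap, so only one batch can match the key, while in the shuffled case nearly every batch's range spans the key. What I don't know is whether SnappyData keeps the loaded order well enough for this to hold in practice.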

Thanks!
