-
Notifications
You must be signed in to change notification settings - Fork 3.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GH-45048: [C++][Parquet] Deprecate unused chunk_size
parameter in parquet::arrow::FileWriter::NewRowGroup()
#45088
base: main
Are you sure you want to change the base?
Conversation
@@ -89,10 +89,10 @@ def test_write_table | |||
def test_write_chunked_array | |||
schema = build_schema("enabled" => :boolean) | |||
writer = Parquet::ArrowFileWriter.new(schema, @file.path) | |||
writer.new_row_group(2) | |||
writer.new_row_group() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@kou can we break here as well?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes.
Could you omit ()
for no argument method call?
writer.new_row_group() | |
writer.new_row_group |
@@ -557,6 +557,7 @@ cdef extern from "parquet/arrow/writer.h" namespace "parquet::arrow" nogil: | |||
|
|||
CStatus WriteTable(const CTable& table, int64_t chunk_size) | |||
CStatus NewRowGroup(int64_t chunk_size) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@pitrou can we remove the previous signature without deprecation? I assume the pxd files are not widely used.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1
@@ -89,10 +89,10 @@ def test_write_table | |||
def test_write_chunked_array | |||
schema = build_schema("enabled" => :boolean) | |||
writer = Parquet::ArrowFileWriter.new(schema, @file.path) | |||
writer.new_row_group(2) | |||
writer.new_row_group() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes.
Could you omit ()
for no argument method call?
writer.new_row_group() | |
writer.new_row_group |
chunked_array = Arrow::ChunkedArray.new([build_boolean_array([true, nil])]) | ||
writer.write_chunked_array(chunked_array) | ||
writer.new_row_group(1) | ||
writer.new_row_group() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
writer.new_row_group() | |
writer.new_row_group |
Rationale for this change
Just noticed that the implementation doesn't use the parameter.
What changes are included in this PR?
Remove the parameter from
NewRowGroup()
Are these changes tested?
Are there any user-facing changes?
The
chunk_size
parameter is now deprecated.chunk_size
parameter inparquet::arrow::FileWriter::NewRowGroup()
#45048