Reduce __sequence
field size in parquet files
#5010
Labels
C-enhancement
Category Enhancements
Milestone
What type of enhancement is this?
Refactor
What does the enhancement do?
In our Parquet file analysis, the
__sequence
field occupies a disproportionate amount of file size, accounting for approximately 67% of the total size. This results in inefficient storage usage and potential performance bottlenecks.File:
9bc23ce8-7046-4ff8-a209-1245827a7a89.parquet
__op_type
greptime_value
__sequence
__primary_key
greptime_timestamp
The
__sequence
field clearly dominates the file size, overshadowing other important columns such asgreptime_value
andgreptime_timestamp
.Implementation challenges
No response
The text was updated successfully, but these errors were encountered: