* Allow setting `write.parquet.row-group-limit`
And update the docs
* Add test
* Make ruff happy
---------
Co-authored-by: Sung Yun <107272191+sungwy@users.noreply.github.com>
|`write.parquet.compression-codec`|`{uncompressed,zstd,gzip,snappy}`| zstd | Sets the Parquet compression coddec. |
34
-
|`write.parquet.compression-level`| Integer | null | Parquet compression level for the codec. If not set, it is up to PyIceberg |
35
-
|`write.parquet.page-size-bytes`| Size in bytes | 1MB | Set a target threshold for the approximate encoded size of data pages within a column chunk |
36
-
|`write.parquet.page-row-limit`| Number of rows | 20000 | Set a target threshold for the approximate encoded size of data pages within a column chunk |
37
-
|`write.parquet.dict-size-bytes`| Size in bytes | 2MB | Set the dictionary page size limit per row group |
38
-
|`write.parquet.row-group-limit`| Number of rows | 122880 | The Parquet row group limit |
|`write.parquet.compression-codec`|`{uncompressed,zstd,gzip,snappy}`| zstd | Sets the Parquet compression coddec. |
34
+
|`write.parquet.compression-level`| Integer | null | Parquet compression level for the codec. If not set, it is up to PyIceberg |
35
+
|`write.parquet.row-group-limit`| Number of rows | 1048576 | The upper bound of the number of entries within a single row group |
36
+
|`write.parquet.page-size-bytes`| Size in bytes | 1MB | Set a target threshold for the approximate encoded size of data pages within a column chunk |
37
+
|`write.parquet.page-row-limit`| Number of rows | 20000 | Set a target threshold for the approximate encoded size of data pages within a column chunk |
38
+
|`write.parquet.dict-size-bytes`| Size in bytes | 2MB | Set the dictionary page size limit per row group |
39
+
|`write.parquet.row-group-limit`| Number of rows | 122880 | The Parquet row group limit |
40
+
|`write.metadata.previous-versions-max`| Integer | 100 | The max number of previous version metadata files to keep before deleting after commit. |
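
The new description of `write.parquet.row-group-limit` ("the upper bound of the number of entries within a single row group") can be illustrated with a small standalone sketch. It does not call PyIceberg or PyArrow. `row_group_sizes` and `table_properties` are hypothetical names used only for illustration, and a real Parquet writer may split rows differently as long as no row group exceeds the limit. The two constants are the values shown in the table rows above.

```python
# Values taken from the documentation table rows in this diff.
OLD_DOCUMENTED_LIMIT = 122_880
NEW_DOCUMENTED_LIMIT = 1_048_576


def row_group_sizes(total_rows: int, row_group_limit: int) -> list[int]:
    """Split total_rows into consecutive groups of at most row_group_limit rows.

    Hypothetical helper: it only models the "upper bound" semantics and is
    not how any particular Parquet writer is implemented.
    """
    if total_rows < 0:
        raise ValueError("total_rows must be non-negative")
    if row_group_limit <= 0:
        raise ValueError("row_group_limit must be positive")
    full_groups, remainder = divmod(total_rows, row_group_limit)
    return [row_group_limit] * full_groups + ([remainder] if remainder else [])


# Hypothetical table properties mapping; property values are kept as strings here.
table_properties = {"write.parquet.row-group-limit": "500000"}

# Fall back to the documented default when the property is not set.
limit = int(
    table_properties.get("write.parquet.row-group-limit", NEW_DOCUMENTED_LIMIT)
)
sizes = row_group_sizes(2_000_000, limit)
print(len(sizes), max(sizes))
```

Under this model, a lower limit produces more, smaller row groups for the same number of rows, and a higher limit produces fewer, larger ones.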