
Spark: Add session configs for adaptive split sizing and read parallelism #16088

Open
karuppayya wants to merge 2 commits into apache:main from karuppayya:ICEBERG-15988

Conversation

@karuppayya Contributor

Summary

  • Add session configs to control adaptive split sizing and allow configurable read parallelism
  • Resolves ICEBERG-15988

Changes

  • Added configs to enable/disable adaptive split sizing at session level
  • Added a config to set the read split parallelism count (see the usage sketch below)
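
For illustration, a minimal sketch of setting the new configs on a SparkSession in Java (the adaptive-split-size key appears later in this thread; the string value behind READ_SPLIT_PARALLELISM is not shown, so it is referenced via its constant):

// Sketch only: both keys are session-level, so they can be toggled at runtime.
spark.conf().set(SparkSQLProperties.READ_ADAPTIVE_SPLIT_SIZE_ENABLED, "false");
spark.conf().set(SparkSQLProperties.READ_SPLIT_PARALLELISM, "64");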

Test plan

  • Unit tests added

@github-actions bot added the spark label Apr 23, 2026
@karuppayya Contributor Author

Tagging reviewers from the PR that introduced the change: @rdblue @ConeyLiu @aokolnychyi
cc: @RussellSpitzer


public Integer splitParallelism() {
  Integer parallelism =
      confParser.intConf().sessionConf(SparkSQLProperties.READ_SPLIT_PARALLELISM).parseOptional();
Member

Why only session conf? ADAPTIVE_SPLIT_SIZE gets table versions?

Contributor Author

I think this is a runtime property (for Spark it depends on the cores and memory of the application).
Do we want to make it part of the table properties?
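
For context, a sketch of what a table-property variant could look like, assuming the chaining style of Iceberg's SparkConfParser (the table property name below is invented for illustration):

// Hypothetical sketch: adding a table-property fallback to the parser chain,
// in the style of other Iceberg read configs. The property name
// "read.split.parallelism" is not a real Iceberg table property.
public Integer splitParallelism() {
  return confParser
      .intConf()
      .sessionConf(SparkSQLProperties.READ_SPLIT_PARALLELISM)
      .tableProperty("read.split.parallelism") // hypothetical name
      .parseOptional();
}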

Comment thread spark/v4.1/spark/src/main/java/org/apache/iceberg/spark/source/SparkScan.java Outdated
Comment thread spark/v4.1/spark/src/main/java/org/apache/iceberg/spark/SparkSQLProperties.java Outdated
public static final String READ_ADAPTIVE_SPLIT_SIZE_ENABLED =
    "spark.sql.iceberg.read.adaptive-split-size.enabled";

// Controls the parallelism used for adaptive split sizing
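The declaration this comment introduces is cut off in the excerpt; a hypothetical completion (the constant name appears earlier in this thread, but its string value is an assumption, not confirmed by the PR):

public static final String READ_SPLIT_PARALLELISM =
    "spark.sql.iceberg.read.split-parallelism"; // value assumed, not confirmed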
Member

Currently this masks the fact that this parameter overrides parallelism(); keeping the default here would, I think, let you have clearer docs too.
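
A minimal sketch of that suggestion, assuming the defaultValue(...) step on Iceberg's conf parser chain (the fallback expression is illustrative):

// Sketch: keep the fallback in the parser chain so the override of
// parallelism() is visible at the call site. defaultValue(...) is assumed
// from the style of other Iceberg read configs.
public int splitParallelism() {
  return confParser
      .intConf()
      .sessionConf(SparkSQLProperties.READ_SPLIT_PARALLELISM)
      .defaultValue(parallelism()) // falls back to the existing parallelism()
      .parse();
}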

assertThat(description).contains("endSnapshotId=" + endSnapshotId);
});
}

Member @RussellSpitzer May 12, 2026

A few notes on the tests here:

  1. These don't seem to be in the right place. This class is for testing the scan object, and neither of these tests actually touches the Spark scan; they are both essentially just parsing checks. It may be time to start a TestSparkReadConf file for these sorts of tests, or actually invoke the scan here and show how the properties change its behavior.

  2. We should break out the assertions; currently multiple parse cases are handled in the same test. There should probably be something like (see the sketch after this list):

     Test adaptive enabled
     Test invalid parallelism
     Test ...

  3. Make sure we cover both the positive and negative cases. If we say a value is invalid, we should have a test throwing an error when that value is passed.
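
A hedged sketch of such a TestSparkReadConf layout (the class, the table field, the option map, and the expected exception type are illustrative assumptions, not the PR's actual code):

import static org.assertj.core.api.Assertions.assertThat;
import static org.assertj.core.api.Assertions.assertThatThrownBy;

import org.apache.iceberg.Table;
import org.apache.iceberg.relocated.com.google.common.collect.ImmutableMap;
import org.junit.jupiter.api.Test;

// Illustrative only: one parse case per test, with positive and negative
// coverage. SparkReadConf(spark, table, readOptions) matches Iceberg's
// constructor shape; adaptiveSplitSizeEnabled() and NumberFormatException
// are assumptions about this PR's code.
public class TestSparkReadConf extends SparkTestBase {

  private Table table; // assume setup code creates this table

  @Test
  public void testAdaptiveSplitSizeEnabled() {
    spark.conf().set(SparkSQLProperties.READ_ADAPTIVE_SPLIT_SIZE_ENABLED, "true");
    SparkReadConf readConf = new SparkReadConf(spark, table, ImmutableMap.of());
    assertThat(readConf.adaptiveSplitSizeEnabled()).isTrue();
  }

  @Test
  public void testInvalidSplitParallelism() {
    spark.conf().set(SparkSQLProperties.READ_SPLIT_PARALLELISM, "not-a-number");
    SparkReadConf readConf = new SparkReadConf(spark, table, ImmutableMap.of());
    assertThatThrownBy(readConf::splitParallelism)
        .isInstanceOf(NumberFormatException.class);
  }
}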

Contributor Author

  1. Created a separate test file
  2. and 3 handled


Labels

spark

Development

Successfully merging this pull request may close these issues.

Adaptive split sizing using shuffle partitions for parallelism causes aggressive scaling

2 participants