Skip to content

feat: adding s3fileio vector reader#14352

Closed
stubz151 wants to merge 1 commit into
apache:mainfrom
stubz151:s3fileio_read_vector
Closed

feat: adding s3fileio vector reader#14352
stubz151 wants to merge 1 commit into
apache:mainfrom
stubz151:s3fileio_read_vector

Conversation

@stubz151
Copy link
Copy Markdown
Contributor

What am I doing

Implementing vectored reading capabilities for S3FileIO, enabling efficient batch reading of multiple file ranges with automatic coalescing to minimize S3 requests.

How am I doing it

Using the Iceberg task manager and a executor thread pool to send requests to s3 concurrently.
Range coalescing works by using a hashmap to link ranges that, it then sends one request and uses one stream to populate the buffers for the linked ranges.

Testing:

• Added some integration tests, will try add a few more, or will try read some more data.
• Ran a benchmark and was happy with the result, will see what I can share later.

@github-actions
Copy link
Copy Markdown

This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the PR at any time and @mention a reviewer or discuss it on the [email protected] list. Thank you for your contributions.

@github-actions github-actions Bot added the stale label Nov 16, 2025
@github-actions
Copy link
Copy Markdown

This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If you think that is incorrect, or the pull request requires review, you can revive the PR at any time.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant