Proposed Change
We have developed an opensource library that aims to accelerate read access to S3. (will be referred to as AAL)
https://github.com/awslabs/analytics-accelerator-s3
It is currently merged in behind a feature flag that can be toggled with:
--conf "spark.sql.catalog.<CATALOG_NAME>.s3.analytics-accelerator.enabled=true"
This epic is to track the work needed to turn it on by default for all Iceberg and S3 customers.
If you are a Iceberg User and would like to test please let us know and we will be happy to schedule some time with you.
work needed
Proposal document
https://docs.google.com/document/d/13shy0RWotwfWC_qQksb95PXdi-vSUCKQyDzjoExQEN0/edit?usp=sharing
Specifications
Proposed Change
We have developed an opensource library that aims to accelerate read access to S3. (will be referred to as AAL)
https://github.com/awslabs/analytics-accelerator-s3
It is currently merged in behind a feature flag that can be toggled with:
--conf "spark.sql.catalog.<CATALOG_NAME>.s3.analytics-accelerator.enabled=true"This epic is to track the work needed to turn it on by default for all Iceberg and S3 customers.
If you are a Iceberg User and would like to test please let us know and we will be happy to schedule some time with you.
work needed
- Support SSE-C
- All retry logic in default stream is compatible with AAL
- All integration tests pass with AAL
Proposal document
https://docs.google.com/document/d/13shy0RWotwfWC_qQksb95PXdi-vSUCKQyDzjoExQEN0/edit?usp=sharing
Specifications