FileIO storage support for ADLSv2 #1360

@DerGut

Is your feature request related to a problem or challenge?

This issue tracks support for the Azure Data Lake Storage FileIO. An existing issue, #408, tracks storage support completeness compared to Iceberg Java.

@mrcnc recently brought up the topic of ADLS in the Iceberg Slack workspace. We've discussed it and might work on this together.

Describe the solution you'd like

Feature

Similar to the integrations already implemented, I imagine a new Cargo feature storage-azdls. OpenDAL refers to ADLS as azdls and uses a similar feature name, service-azdls. I think this makes sense because it clearly identifies Azure within a growing list of storage integrations.

Configuration

Other implementations like pyiceberg or Iceberg Java already provide us with the configuration parameters we can expect. I've started work on the OpenDAL side to add support for Azure Storage connection strings. All other configurations will be released under OpenDAL v0.53.3.
Apart from the backing OpenDAL config, there are only parsing functions (from an Iceberg property map) to be added. This isn't different from the other storage integrations.
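
To make the parsing step concrete, here is a minimal sketch of what such a function could look like. It uses only the standard library. `AzdlsConfig` is a hypothetical stand-in for the backing OpenDAL config, `azdls_config_parse` is a hypothetical name, and the property keys are assumptions modeled on the names pyiceberg documents. The final set of keys is still to be decided.

```rust
use std::collections::HashMap;

// Assumed property keys, modeled on pyiceberg's ADLS options.
// The keys this integration will actually accept are not settled yet.
pub const ADLS_ACCOUNT_NAME: &str = "adls.account-name";
pub const ADLS_ACCOUNT_KEY: &str = "adls.account-key";
pub const ADLS_SAS_TOKEN: &str = "adls.sas-token";

/// Hypothetical stand-in for the backing OpenDAL config.
#[derive(Debug, Default, PartialEq)]
pub struct AzdlsConfig {
    pub account_name: Option<String>,
    pub account_key: Option<String>,
    pub sas_token: Option<String>,
}

/// Moves the recognized ADLS properties out of an Iceberg property map
/// into the config. Unrecognized keys are ignored.
pub fn azdls_config_parse(mut props: HashMap<String, String>) -> AzdlsConfig {
    AzdlsConfig {
        account_name: props.remove(ADLS_ACCOUNT_NAME),
        account_key: props.remove(ADLS_ACCOUNT_KEY),
        sas_token: props.remove(ADLS_SAS_TOKEN),
    }
}
```

Keys belonging to other integrations (for example an S3 region) are simply left out of the result, so the same property map can be handed to every storage backend.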


I believe all other questions have already been answered by existing implementations. We will add a new variant to the Storage enum and plug it into the existing storage framework.
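
As a rough illustration of that last step, the sketch below adds an `Azdls` variant to a simplified, hypothetical `Storage` enum and dispatches on the URI scheme. The enum shape, the `storage_for_scheme` helper, and the `configured_scheme` field are assumptions for illustration only, not the crate's actual definitions. `abfs` and `abfss` are the URI schemes commonly used for ADLS Gen2.

```rust
/// Simplified, hypothetical version of the storage enum.
/// In the real crate the new variant would presumably be gated behind
/// the proposed `storage-azdls` feature.
#[derive(Debug, PartialEq)]
pub enum Storage {
    LocalFs,
    S3 { configured_scheme: String },
    /// New variant for Azure Data Lake Storage.
    Azdls { configured_scheme: String },
}

/// Picks a storage backend from a URI scheme (hypothetical helper).
pub fn storage_for_scheme(scheme: &str) -> Result<Storage, String> {
    match scheme {
        "file" | "" => Ok(Storage::LocalFs),
        "s3" | "s3a" => Ok(Storage::S3 {
            configured_scheme: scheme.to_string(),
        }),
        // ADLS Gen2 paths are commonly written with these two schemes.
        "abfs" | "abfss" => Ok(Storage::Azdls {
            configured_scheme: scheme.to_string(),
        }),
        other => Err(format!("unsupported scheme: {other}")),
    }
}
```

Keeping the configured scheme on the variant lets later code rebuild paths with the same scheme the user supplied.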

Willingness to contribute

I can contribute to this feature independently

Labels: enhancement (New feature or request)