Anodot can access the files on your S3 / S3 Parquet Collector / CUR bucket only after you explicitly allow it. To provide read access to Anodot and to create a cross-account AWS role to provide Anodot with access to the AWS files, refer to the sections below (note that when creating the cross-account AWS role, you should replace the references where relevant to S3 / S3 Parquet Collector / CUR.
IMPORTANT: New S3 / S3 Parquet Collector / CUR data sources should use the role-based access method described below. The previous method of updating the bucket policy (see here for CUR, here for S3 Parquet Collector, and here for S3) is currently supported for existing data sources, but is no longer available for new sources. Contact us to ensure safe migration of existing sources without losing data.
Creating the read access policy
The following policy example shows how to provide read access to the cross-account AWS role.
Creating a cross-account AWS role and data source in Anodot
The following procedure describes how to create the S3 / S3 Parquet Collector / CUR source in Anodot and the AWS role. The process requires you to switch between Anodot and AWS, as indicated below (steps in AWS are italicized).
- In the Navigation Panel, click Catalog.
- Use the Search box to locate the S3 / S3 Parquet Collector / CUR data source.
- Click Start on the S3 / S3 Parquet Collector / CUR tile.
- In the displayed AWS dialog, enter the bucket name and folder path (optional).
- Copy the unique Anodot ID to the clipboard by clicking the copy icon.
- In AWS, create a cross-account role.
- Use the policy you previously created (see Creating the read access policy above) as the role’s policy.
- Click Another AWS account and use the Anodot account ID as the third party account ID: 340481513670
- Paste the Anodot ID you copied in Step 3 to the “External ID” field in the role definition.
- Save the cross-account role, and copy the role ARN to the clipboard.
- Paste the AWS role ARN into the Role ARN field in the AWS dialog.
- Click CONTINUE to connect to the bucket and save the S3 / S3 Parquet Collector / CUR data source.