Amazon S3

Last updated 1 year ago

Amazon S3

This guide describes how to connect Aporia to an S3 data source in order to monitor your ML Model in production.

We will assume that your model inputs, outputs and optionally delayed actuals are stored in a file in S3. Currently, the following file formats are supported:

parquet
json
csv
delta

This data source may also be used to connect to your model's training dataset to be used as a baseline for model monitoring.

Update the Aporia IAM role for S3 access

In order to provide access to S3, you'll need to update your Aporia IAM role with the necessary API permissions.

Step 1: Obtain your aporia IAM role

Use the same role used for the Aporia deployment. If someone else on your team has deployed Aporia, please reach out to them to obtain the role ARN (it should be in the following format: arn:aws:iam::<account>:role/<role-name-with-path>).

Step 2: Create an access policy

In the list of roles, click the role you obtained.
Add an inline policy.
On the Permissions tab, click Add permissions then click Create inline policy.
In the policy editor, click the JSON tab.

Copy the following access policy, and make sure to fill your correct bucket name.

{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Action": [
                "s3:Get*",
		"s3:List*"
            ],
            "Resource": [
                "arn:aws:s3:::<BUCKET_NAME>",
                "arn:aws:s3:::<BUCKET_NAME>/*"
            ]
        }
    ]
}

Click Review Policy.
In the Name field, enter a policy name.
Click Create policy.

Now Aporia has the read permission it needs to connect to the S3 buckets you have specified in the policy.

Create an s3 data source in Aporia

Go to Aporia platform and login to your account.
Go to Integrations page and click on the Data Connectors tab
Scroll to Connect New Data Source section
Click Connect on the S3 card and follow the instructions

PreviousOverview NextAthena

Last updated 1 year ago