BigQuery

This guide describes how to connect Aporia to a BigQuery data source in order to monitor your ML Model in production.

We will assume that your model inputs, outputs and optionally delayed actuals can be queried with SQL. This data source may also be used to connect to your model's training set to be used as a baseline for model monitoring.

Create a materialization dataset for Aporia queries

Create a materialization dataset for Aporia to use to perform queries, see instructions here.

A separate materialization dataset location, to which query results will be written, must be designated for each project from which you want to query.

Update the Aporia Service Account for BigQuery access

In order to provide access to BigQuery, you'll need to update your Aporia service account with the necessary API permissions.

Step 1: Obtain your aporia service account

Use the same service account used for the Aporia deployment. If someone else on your team has deployed Aporia, please reach out to them to obtain it.

Step 2: Grant read access to the relevant project

Go to the IAM console and login.
Find the Aporia service account you obtain in the previous step and click on 🖋 Edit Principle
In the "Edit access" window click on ADD ANOTHER ROLE
Add the BigQuery Data Viewer and BigQuery Job User roles and click Save

Step 3: Grant access to the materialization dataset

Go to the BigQuery console and login.
In the left-hand panel, expand the relevant project and find the materialization dataset you created in the previous steps.
Click on "..." by the dataset name, then click on Share
In the "Share permissions" window click on Add Principal
In the "New principal" box, enter the email of the Aporia service account you have obtained. Choose the BigQuery Data Editor role and click Save.

Now Aporia has the permission it needs to connect to the BigQuery datasets and tables you have specified in the policy.

Create a BigQuery data source in Aporia

Go to Aporia platform and login to your account.
Go to Integrations page and click on the Data Connectors tab
Scroll to Connect New Data Source section
Click Connect on the BigQuery card and follow the instructions

Bravo! 👏 now you can use the data source you've created across all your models in Aporia.

PreviousAthena NextDatabricks

Last updated 1 year ago