BigQuery
Last updated
Last updated
This guide describes how to connect Aporia to a BigQuery data source in order to monitor your ML Model in production.
We will assume that your model inputs, outputs and optionally delayed actuals can be queried with SQL. This data source may also be used to connect to your model's training set to be used as a baseline for model monitoring.
Create a materialization dataset for Aporia to use to perform queries, see instructions here.
A separate materialization dataset location, to which query results will be written, must be designated for each project from which you want to query.
In order to provide access to BigQuery, you'll need to update your Aporia service account with the necessary API permissions.
Use the same service account used for the Aporia deployment. If someone else on your team has deployed Aporia, please reach out to them to obtain it.
Go to the IAM console and login.
Find the Aporia service account you obtain in the previous step and click on 🖋 Edit Principle
In the "Edit access" window click on ADD ANOTHER ROLE
Add the BigQuery Data Viewer
and BigQuery Job User
roles and click Save
Go to the BigQuery console and login.
In the left-hand panel, expand the relevant project and find the materialization dataset you created in the previous steps.
Click on "..." by the dataset name, then click on Share
In the "Share permissions" window click on Add Principal
In the "New principal" box, enter the email of the Aporia service account you have obtained. Choose the BigQuery Data Editor
role and click Save.
Now Aporia has the permission it needs to connect to the BigQuery datasets and tables you have specified in the policy.
Go to Aporia platform and login to your account.
Go to Integrations page and click on the Data Connectors tab
Scroll to Connect New Data Source section
Click Connect on the BigQuery card and follow the instructions
Bravo! 👏 now you can use the data source you've created across all your models in Aporia.