Data Warehouse

TruePrivacy + Google BigQuery

Scan BigQuery datasets for personal data and manage deletion workflows.

Auth: Service Account
Setup time: 15 minutes

Overview

Google BigQuery is a fully managed data warehouse used for analytics at scale. Organizations often store large volumes of customer and user data in BigQuery for analysis. TruePrivacy connects to BigQuery using a Google Cloud Service Account to scan datasets and tables for personal data, classify fields by category, and execute deletion operations for data subject request (DSR) compliance.

TruePrivacy's BigQuery integration supports both standard tables and partitioned tables, with deletion queries optimized to use partition filters for efficient data removal at scale.
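As a rough illustration of what a partition-filtered deletion could look like, the Python sketch below builds a parameterized DELETE for a table partitioned on a DATE column. The function name, table ID, and column names are hypothetical, and TruePrivacy's actual queries are not documented here. The sketch only assembles the SQL text and parameter values; it does not connect to BigQuery.

```python
import re
from datetime import date

# Conservative patterns for illustration; real BigQuery names allow more characters.
_TABLE = re.compile(r"^[A-Za-z0-9_-]+\.[A-Za-z0-9_]+\.[A-Za-z0-9_]+$")
_COLUMN = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*$")


def build_partitioned_delete(table, subject_column, partition_column,
                             subject_id, start, end):
    """Return (sql, params) for a DELETE limited to a range of date partitions.

    Identifiers cannot be bound as query parameters, so they are validated
    and interpolated; the values are left as named parameters (@name).
    """
    if not _TABLE.match(table):
        raise ValueError(f"unexpected table id: {table!r}")
    for name in (subject_column, partition_column):
        if not _COLUMN.match(name):
            raise ValueError(f"unexpected column name: {name!r}")
    if start > end:
        raise ValueError("start must not be after end")

    sql = (
        f"DELETE FROM `{table}` "
        # The filter on the partitioning column lets BigQuery skip other partitions.
        f"WHERE {partition_column} BETWEEN @start_date AND @end_date "
        f"AND {subject_column} = @subject_id"
    )
    params = {"start_date": start, "end_date": end, "subject_id": subject_id}
    return sql, params


if __name__ == "__main__":
    sql, params = build_partitioned_delete(
        "my-project.analytics.events",  # hypothetical table
        "user_id", "event_date", "u-123",
        date(2024, 1, 1), date(2024, 1, 31),
    )
    print(sql)
    print(params)
```

Keeping the subject ID and date bounds as named parameters, rather than formatting them into the string, avoids SQL injection through request data and still allows the date range to restrict which partitions are touched.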

What TruePrivacy can do

Data Discovery
Data Classification
DSR Automation

Data types accessed

  • Customer records
  • User event data
  • Email addresses
  • Device identifiers
  • Transaction records
  • Analytics aggregations containing PII

DSR capabilities

  • Execute DELETE statements across BigQuery tables
  • Export data subject records for access requests
  • Identify all tables containing a specific user's data
  • Anonymise fields via UPDATE statements
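To make the anonymisation capability more concrete, here is a minimal Python sketch of an UPDATE statement that nulls selected fields for one data subject. It assumes the target columns are NULLABLE and that a single column identifies the subject; the function name and all table and column names are hypothetical, and the statement TruePrivacy actually issues may differ.

```python
import re

_COLUMN = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*$")


def build_anonymise_update(table, pii_columns, subject_column):
    """Return an UPDATE that sets the given columns to NULL for one subject.

    The subject ID is left as the named query parameter @subject_id so the
    caller can bind it instead of formatting it into the SQL string.
    """
    if not pii_columns:
        raise ValueError("at least one column is required")
    if "`" in table:
        raise ValueError("table id must not contain backticks")
    for name in [*pii_columns, subject_column]:
        if not _COLUMN.match(name):
            raise ValueError(f"unexpected column name: {name!r}")

    assignments = ", ".join(f"{col} = NULL" for col in pii_columns)
    # BigQuery requires a WHERE clause on UPDATE; here it scopes the change
    # to the rows belonging to the data subject.
    return f"UPDATE `{table}` SET {assignments} WHERE {subject_column} = @subject_id"


if __name__ == "__main__":
    print(build_anonymise_update("shop.customers", ["email", "device_id"], "customer_id"))
```

Nulling is only one option. Replacing values with a hash keeps rows joinable but is generally treated as pseudonymisation rather than anonymisation, so the right choice depends on the request being fulfilled.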

How it works

  1. Create a Google Cloud Service Account for TruePrivacy with BigQuery Data Viewer and optionally BigQuery Data Editor roles.

  2. TruePrivacy scans all datasets and tables in your project, identifying columns containing personal data through schema analysis and sampling.

  3. Discovered personal data is classified and added to your TruePrivacy data inventory with table and column-level granularity.

  4. For DSR deletion, TruePrivacy executes BigQuery DML DELETE statements scoped to the data subject, logging all queries in the audit trail.
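The scanning and classification steps above can be sketched in Python as a simple heuristic: match column names against known patterns first, then fall back to checking sampled values. The patterns, category labels, and input shape below are assumptions made for illustration; this is not TruePrivacy's classifier.

```python
import re

# Hypothetical name patterns, keyed by an illustrative category label.
NAME_PATTERNS = {
    "email_address": re.compile(r"e[-_]?mail", re.IGNORECASE),
    "device_identifier": re.compile(r"device[-_]?id", re.IGNORECASE),
}
# Deliberately loose check for sampled values that look like email addresses.
EMAIL_VALUE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def classify_column(name, samples=()):
    """Return a category for the column, or None if nothing matches."""
    # Schema analysis: the column name alone is often enough.
    for category, pattern in NAME_PATTERNS.items():
        if pattern.search(name):
            return category
    # Sampling: classify only if every non-empty sampled string looks like an email.
    values = [s for s in samples if isinstance(s, str) and s]
    if values and all(EMAIL_VALUE.match(v) for v in values):
        return "email_address"
    return None


def scan(schema):
    """Build an inventory of (table, column, category) rows.

    `schema` maps a table name to a dict of column name -> sampled values.
    """
    inventory = []
    for table, columns in schema.items():
        for column, samples in columns.items():
            category = classify_column(column, samples)
            if category is not None:
                inventory.append((table, column, category))
    return inventory


if __name__ == "__main__":
    example = {
        "shop.customers": {
            "contact": ["a@example.com", "b@example.org"],
            "email_address": [],
            "total": ["12.50"],
        },
        "shop.events": {"device_id": [], "event_name": ["click"]},
    }
    for row in scan(example):
        print(row)
```

The value check matters for columns such as `contact`, whose name reveals nothing but whose contents are clearly personal data; a name-only scan would miss them.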

Frequently asked questions

For discovery, TruePrivacy needs roles/bigquery.dataViewer on your datasets. For DSR automation, it additionally needs roles/bigquery.dataEditor. We recommend creating a dedicated service account with the minimum necessary roles and restricting it to specific datasets.
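One way to keep a service account at the minimum necessary roles is to compare what it has been granted against what the enabled features call for. The Python sketch below does that as a plain set comparison using the two role names mentioned above; the function names are hypothetical, and the list of granted roles is assumed to come from your own IAM tooling rather than from any API call shown here.

```python
DISCOVERY_ROLES = frozenset({"roles/bigquery.dataViewer"})
DSR_AUTOMATION_ROLES = frozenset({"roles/bigquery.dataEditor"})


def required_roles(dsr_automation):
    """Return the roles needed for discovery, plus DSR automation if enabled."""
    if dsr_automation:
        return set(DISCOVERY_ROLES | DSR_AUTOMATION_ROLES)
    return set(DISCOVERY_ROLES)


def audit_roles(granted, dsr_automation):
    """Report roles that are missing and roles granted beyond the minimum."""
    needed = required_roles(dsr_automation)
    granted = set(granted)
    return {
        "missing": sorted(needed - granted),
        "excess": sorted(granted - needed),
    }


if __name__ == "__main__":
    report = audit_roles(
        ["roles/bigquery.dataViewer", "roles/bigquery.admin"],
        dsr_automation=True,
    )
    print(report)
```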

Partitioned tables are supported. TruePrivacy's deletion queries include partition filters where applicable to minimize query cost and improve deletion performance on large partitioned tables.

TruePrivacy scans views and identifies the underlying tables that contain personal data. Deletion is executed on the base tables; views are not modified directly. Materialized views are flagged for manual review as they may need refreshing after base table deletion.
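The view handling described above amounts to walking view definitions down to their base tables. The Python sketch below shows one way to do that, assuming the view-to-source mapping has already been extracted (for example, from view definitions); the function names, input shapes, and object names are hypothetical.

```python
def resolve_base_tables(name, view_sources, _seen=None):
    """Return the set of base tables that `name` ultimately reads from.

    `view_sources` maps each view to the tables or views it selects from.
    Anything that is not a key in `view_sources` is treated as a base table.
    """
    seen = set() if _seen is None else _seen
    if name in seen:  # already visited; also guards against cyclic input
        return set()
    seen.add(name)
    if name not in view_sources:
        return {name}
    bases = set()
    for source in view_sources[name]:
        bases |= resolve_base_tables(source, view_sources, seen)
    return bases


def plan_deletion(flagged_objects, view_sources, materialized_views):
    """Return (base tables to delete from, materialized views to review)."""
    delete_from = set()
    for obj in flagged_objects:
        delete_from |= resolve_base_tables(obj, view_sources)
    # A materialized view is flagged when it depends on an affected base table.
    review = sorted(
        mv for mv in materialized_views
        if resolve_base_tables(mv, view_sources) & delete_from
    )
    return sorted(delete_from), review


if __name__ == "__main__":
    sources = {
        "ds.v_customers": ["ds.customers"],
        "ds.v_orders": ["ds.v_customers", "ds.orders"],
        "ds.mv_daily": ["ds.orders"],
    }
    tables, review = plan_deletion(["ds.v_orders"], sources, {"ds.mv_daily"})
    print(tables)
    print(review)
```

Deleting only from base tables keeps the plan free of duplicates when several views share a source, and the review list surfaces materialized views that may need a refresh afterwards.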

Connect TruePrivacy to Google BigQuery today

Start your free trial and connect Google BigQuery in 15 minutes.