Skip to main content

Rebuild Tables

Rebuild tables

One of the core features of GA4Dataform is to build tables incrementally. This means that every day, new rows of GA4 data are appended to existing tables instead of rebuilding everything from scratch. However, there are situations where you may need to reprocess some tables due to bug fixes, metric definition changes, or updates in attribution logic that should be retroactively applied to your dataset. Rebuilding tables is typically a one-time action to address a specific need.

note

Outside of advanced use cases, you may occasionally need to rebuild specific tables after a GA4Dataform update. If this is necessary, we will mention it in our update communications. If you're unsure, feel free to reach out to our support team at support@ga4dataform.com

There are three ways to manage full rebuild (or full refresh) from Dataform UI:

  • Within a workspace using Execute side panel.
  • From the Release configurations section via Execute manual workflow side panel
  • By creating a new workflow with the Create workflow configuration side panel

Overview

Using Dataform UI, the process of managing a full rebuild follows a common logic:

Model(s) selection

Select model(s) you want to rebuild by choosing individual model with Selection of Actions, selecting Model(s) with specific tag(s) via Selection of tags or selecting all models with All actions

screenshot

Full refresh execution: Ensure that Run with full refresh is selected, regardless of where you are managing the rebuild.

screenshot

Detailed Review of Side Panels

Rebuild tables from developement workspaces

You can open the Execute side panel from a development Workspace by clicking the Start execution drop-down menu.

screenshot

Any selection from this menu will open the same Execute side panel.

screenshot

Once the side panel is open as seen in the overview:

  • Select model(s) you want to rebuild using one of the three options: Select of Actions, Selection of tags or All actions
  • Ensure that Run with full refresh is checked.

Run the workflow by clicking Start execution. Selected model(s) will be rebuild. Check workflow execution to track progress and verify completion.

Rebuild tables from Release configurations section

From the Release and Scheduling section, click Start execution:

screenshot

This opens the Execute manual workflow side panel:

screenshot

Follow these steps:

  • Click Release configuration drop-down menu and choose production.
  • Select model(s) you want to rebuild using one of the three options: Select of Actions, Selection of tags or All actions
  • Ensure that Run with full refresh is checked.

Run the workflow by clicking Start execution. Selected model(s) will be rebuild. Check workflow execution to track progress and verify completion.

Rebuild tables from Workflow configurations

In the Release and Scheduling section, in Workflow configurations sub-category click Create:

screenshot

This opens the Edit workflow configuration panel:

screenshot

warning

Be careful not to edit an existing workflow configuration. The GA4Dataform installer automatically creates a default workflow configuration called 'daily.' Do not modify this workflow unless you are certain of what you're doing, as changes may disrupt the daily data refresh.

The configuration process is similar to the Execute and Execute manual workflow side panels but includes additional options.

Similarities:
  • Release configuration drop-down menu where you have to select production as in the Execute manual workflow
  • Same way of selecting model(s) you want to rebuild - Select of Actions, Selection of tags, All actions
  • Checking Run with full refresh if we want to execute a rebuild
Differences:

Schedule frequency:

screenshot

You can choose to repeat the workflow automaticaly on specific schedule or run it "On-demand." For most rebuild scenarios as explain in the introduction of this chapter, select "On-demand."

Configuration ID:

screenshot

The configuration ID serves as the name of the workflow. For example, if you're creating a workflow to rebuild session tables on demand, you could name it rebuild-sessions-on-demand.

Running an On-Demand Workflow

After saving your workflow configuration, since it is not scheduled, you will need to run it manually. Click the three dots on the right of the workflow configuration and select Run now.

screenshot

Selected model(s) will be rebuild. Check workflow execution to track progress and verify completion.

Which one should I chose?

Now that you understand how to manage table rebuilds in Dataform, here is a summary of when to use each method:

  • Execute Side Panel (Workspace Development): Best for advanced users modifying models. If you are not working in custom folders, you likely won’t need to use this option frequently.

  • Execute Manual Workflow (Release Configurations): Ideal for one-time operations. However, configurations are not saved, meaning you must repeat the steps each time you need to rebuild tables.

  • Create Workflow Configuration: Similar to Execute manual workflow but allows you to save configurations. This is useful for users with custom queries or advanced GA4Dataform usage who need to perform rebuilds frequently.