How to centralise Google Analytics 4 data to gain valuable insights

Get more out of your web data with Analytics Engineering

  • Article
  • Analytics Engineering
  • Data Analytics
Jelmer van Leeuwen
Jelmer van Leeuwen
Data Analyst
5 min
09 Apr 2024

Is your web data not fully usable for your Data Analysts? Discover how Analytics Engineers, with their background in web analysis and technical expertise, bridge the gap between technology and business. Centralise your Google Analytics 4 data and enable your entire organisation to quickly and efficiently extract valuable insights.

GA4 data = unstructured data

GA4 data lacks a clear structure. The data is stored in one large table containing all your web data. Extracting the correct data from this massive table requires a lot of technical knowledge. In practice, many Data Analysts struggle with the technical complexity and are unable to extract all the necessary insights from GA4. Data Engineers may need to build a data model, but they are not always available. Additionally, they prefer to focus on tasks such as setting up data infrastructure or building data pipelines.

How to make GA4 data usable for Advanced Analyses with Analytics Engineering

Do you want to enable your Data Analysts to quickly and easily use the correct, structured GA4 data for advanced analyses? And do you want to combine online data with offline data? Then you need to send your GA4 data to BigQuery first. Subsequently, an Analytics Engineer can transform the raw data into various tables containing specific information, such as your marketing campaigns or conversions. By adding this data to your data warehouse, you make it suitable for combining with other (offline) data sources.

The required tools and knowledge for Analytics Engineering

Centralise Google Analytics 4 data with dbt

At Digital Power, we primarily utilise the open-source tool dbt (data build tool). This isn't a data warehouse, but rather a platform where you can write and store your code. Dbt runs on your data warehouse and assists with everything related to the data transformation process.

One of dbt's strengths is its ease of documentation. There's ample space for clear descriptions per column, you can collaborate with multiple people simultaneously, and you have full control over version control. Additionally, we observe that alongside dbt, there are increasingly more tools emerging in the market that could be interesting for Analytics Engineering in the future.

centralise Google Analytics 4 data with dbt

One advantage of dbt is that it becomes easier to improve data quality through automated testing. Additionally, you quickly gain insight into dependencies in the data and models, setting up documentation is simple, and it integrates with Git for version control. The latter is beneficial because you often work with multiple people simultaneously on a data model.

Dbt is open source and therefore free. The data remodelling process with the tool primarily consumes time, but this investment is quickly recouped. With dbt's assistance, the data lineage (the 'history' of your data) becomes clear swiftly, allowing you to test whether the data behaves as expected. Dependencies of data models are also built-in immediately. If model C depends on models A and B, it will only run automatically once models A and B are completed.

Analytics Engineers must master Git and advanced SQL. Additionally, they need to have knowledge of your data warehouse, such as BigQuery or Snowflake. Modelling skills are also essential, allowing them to structure tables effectively for the entire organisation to utilise. While Analytics Engineers don't set up the technical landscape of your organisation themselves, they must understand how the data is structured.

The necessary steps to centralise Google Analytics 4 data

In practice, an Analytics Engineering project may look like this:

  1. Set up the connection between Google Analytics and BigQuery.
  2. Integrate dbt with BigQuery and Git.
  3. Write the code in dbt with version control.
  4. Return the output of dbt to your data warehouse via BigQuery.
  5. Based on the data in the data warehouse, Analysts create dashboards and reports.

Need help to centralise Google Analytics 4 data with Analytics Engineering?

You can rely on us for expertise in areas such as Data Engineering, Analytics Engineering, and Data Analytics. This enables us to effectively bridge the gap between the technical and business aspects of your organisation.

All our Analytics Engineers have a background in web analysis. They possess extensive knowledge of web data but are also accustomed, as consultants, to engaging with the business. Get in touch to discuss the possibilities.

This is an article by Jelmer van Leeuwen

Jelmer has worked at Digital Power since 2020 as a Data Analyst and developed himself towards an Analytics Engineer. In the past years, he focused more on the technical side of analytics, data transformations, and data modelling.

Jelmer van Leeuwen

Data Analyst

Receive data insights, use cases and behind-the-scenes peeks once a month?

Sign up for our email list and stay 'up to data':

You may also like:

Securing historical data of Universal Analytics using the Google Reporting API

As of 1 July 2023, Google Universal Analytics (UA or GA3) will stop processing data. More and more companies are therefore transitioning to GA4. Unfortunately, historical data from GA3 is not visible in GA4, and if you don't want to lose the data, you must extract everything from UA before 1 July 2024. After that, it will no longer be possible.

Read more
business managers having a conversation

Insight into the complete sales funnel thanks to a data warehouse with dbt

Our consultants log the assignments they take on for our clients in our ERP system AFAS. In our CRM system HubSpot, we can see all the information relevant before signing a collaboration agreement. When we close a deal, all the information from HubSpot automatically transfers to AFAS. So, HubSpot is mainly used for the process before entering a collaboration, while AFAS is used for the subsequent phase. To tighten our people's planning and improve our financial forecasts, we decided to set up a data warehouse to integrate data from both data sources.

Read more
potatoes

Valuable insights from Microsoft Dynamics 365

Agrico is a cooperative of potato growers. They cultivate potatoes for various purposes such as consumption and planting future crops. These potatoes are exported worldwide through various subsidiaries. All logistical and operational data is stored in their ERP system, Microsoft Dynamics 365. Due to the complexity of this system with its many features, the data is not suitable for direct use in reporting. Agrico asked us to help make their ERP data understandable and develop clear reports.

Read more
colleagues talking to each other

Bring structure to your data

There are many different forms of data storage. In practice, a (relational) database, a data warehouse, and a data lake are the most commonly used and often confused with each other. In this article, you will read about what they entail and how to use them.

Read more
woman shopping online

A standardised way of processing data using dbt

One of the largest online shops in the Netherlands wanted to develop a standardised way of data processing within one of its data teams. All data was stored in the scalable cloud data warehouse Google BigQuery. Large amounts of data were available within this platform regarding orders, products, marketing, returns, customer cases and partners.

Read more
dutch highway

Reliable reporting using robust Python code

The National Road Traffic Data Portal (NDW) is a valuable resource for municipalities, provinces, and the national government to gain insight into traffic flows and improve infrastructure efficiency.

Read more
valk exclusief

Setting up a future-proof data infrastructure

Valk Exclusief is a chain of 4-star+ hotels with 43 hotels in the Netherlands. The hotel chain wants to offer guests a personal experience, both in the hotel and online.

Read more
kadaster header

Working more efficiently thanks to migration to Databricks

The Kadaster manages complex (geo)data, including all real estate in the Netherlands. All data is stored and processed using an on-premise data warehouse in Postgres. They rely on an IT partner for maintaining this warehouse. The Kadaster aims to save costs and work more efficiently by migrating to a Databricks environment. They asked us to assist in implementing this data lakehouse in the Microsoft Azure Cloud.

Read more
iphone with spotify music

Converting billions of streams into actionable insights with a new data & analytics platform

Merlin is the largest digital music licensing partner for independent labels, distributors, and other rightsholders. Merlin’s members represent 15% of the global recorded music market. The company has deals in place with Apple, Facebook, Spotify, YouTube, and 40 other innovative digital platforms around the world for its’ member’s recordings. The Merlin team tracks payments and usage reports from digital partners while ensuring that their members are paid and reported to accurately, efficiently, and consistently.

Read more
GA4

Implementation of e-commerce tracking for Google Analytics 4

MS Mode & America Today used Universal Analytics (UA) for analysing their online stores. They had fully implemented e-commerce tracking, with KPIs such as transactions, average order value, and abandoned shopping carts visualised in Looker Studio.

Read more