How to Build and Activate Datasets in Hubspot Data Studio
Using Hubspot datasets in Data Studio lets your team create reliable, reusable data foundations for reporting, so everyone analyzes the same trusted numbers instead of building reports from scratch.
This guide explains, step by step, how to create, manage, and activate datasets, along with practical tips to keep your reporting clean, consistent, and easy to scale.
What Are Hubspot Datasets in Data Studio?
In Hubspot, a dataset is a curated collection of fields, calculations, and filters that simplifies access to complex CRM data. Instead of building every report directly from raw objects, you define a dataset once and reuse it in multiple reports and dashboards.
Datasets are especially useful when you need:
- Standardized metrics and definitions across teams.
- Reusable logic, such as attribution models or MRR calculations.
- Safer access to sensitive data without exposing full records.
- Cleaner reporting models for non-technical users.
You can then build reports powered by these datasets inside custom reporting tools.
Where to Access Datasets in Hubspot Data Studio
To work with datasets in Hubspot, you use the Data Studio area in your account. From here you can create, edit, and manage datasets that feed into reports.
- In your Hubspot account, navigate to Reporting & Data (or the equivalent data tools section).
- Open Data Studio.
- Select the Datasets tab to view all existing datasets.
From this view you can search, filter, and open datasets, as well as create new ones.
How to Build a New Hubspot Dataset
Creating a new dataset in Hubspot Data Studio follows a guided workflow. You define the source objects, choose fields, add filters, and configure calculations.
Step 1: Start a Dataset in Hubspot
- In Data Studio > Datasets, click Create dataset.
- Give the dataset a clear, descriptive name, for example Sales Pipeline Performance or Marketing Email Engagement.
- Optionally, add a description so other users understand its purpose and scope.
Using a consistent naming convention in Hubspot helps teams quickly identify which dataset to use for specific reporting needs.
Step 2: Choose Primary Objects and Joins
Next, select the primary data object that your dataset will be based on. Common primary objects in Hubspot include:
- Contacts
- Companies
- Deals
- Tickets
- Custom objects
You can then configure relationships and joins to connect related objects. For example, you might join:
- Deals to Contacts and Companies.
- Tickets to Contacts and Conversations.
- Marketing events to Contacts.
Choosing the right joins ensures your dataset can answer the questions stakeholders care about, such as revenue by lifecycle stage or support volume by company segment.
Step 3: Select Fields for the Hubspot Dataset
After defining the objects, select which fields to expose in the dataset. These can include:
- Standard CRM properties, like Deal amount or Lifecycle stage.
- Custom properties unique to your business.
- Calculated fields defined in the dataset.
To keep reporting clean and user-friendly:
- Include only fields that are truly needed.
- Use clear display names if the default property names are technical or confusing.
- Group related fields logically, for example, revenue metrics together or engagement metrics together.
Step 4: Add Filters to Control Data Scope
You can apply filters inside the dataset to control which records are included before they ever reach a report. This is important to standardize business logic in Hubspot.
Examples of dataset-level filters include:
- Only Closed won deals for revenue reporting.
- Only Open or In progress tickets for support workload reports.
- Contacts with Marketing contact = true for engagement analytics.
These filters ensure reports using the dataset always follow the same rules, instead of relying on each report builder to recreate them.
Step 5: Create Calculated Fields in Hubspot
Datasets support calculated fields, letting you build reusable metrics and transformations.
Common calculations include:
- Revenue metrics, such as ARR or MRR.
- Conversion rates between pipeline stages.
- Time-based metrics, like days to close or first response time.
- Conditional flags, such as High-value customer or At-risk account.
By defining these calculations once in Hubspot Data Studio, every report that uses the dataset will share identical formulas and results.
How to Activate a Hubspot Dataset for Reporting
Once your dataset is configured, you can activate it by using it as the data source for reports and dashboards.
Create a Report from a Hubspot Dataset
- Go to the reporting area of your Hubspot account.
- Click Create report.
- Choose to build a custom report based on a dataset.
- Select the dataset you just created from the list.
- Drag and drop fields, metrics, and dimensions to design your report.
Any field or calculation available in the dataset can now be used to build charts, tables, and visualizations.
Use Hubspot Datasets Across Dashboards
After creating reports powered by datasets, add them to dashboards for better visibility:
- Open or create a dashboard.
- Click Add report and choose your dataset-based report.
- Arrange, resize, and configure filters and date ranges as needed.
Because these reports use shared datasets, updates to the dataset structure or calculations will roll out consistently across every dashboard that depends on them.
Best Practices for Managing Hubspot Datasets
To keep your analytics environment clean and scalable, apply these management practices in Hubspot Data Studio.
Standardize Naming and Documentation
- Use clear, descriptive names for datasets and calculated fields.
- Add descriptions summarizing purpose, data sources, and key filters.
- Adopt a naming pattern like [Team] – [Domain] – [Metric], for example, Sales – Pipeline – Forecast.
Control Access and Governance in Hubspot
Governance ensures only the right people can modify critical datasets.
- Limit edit access to data owners or analytics leads.
- Allow wider view access so teams can use datasets for reporting.
- Review who can create or change datasets to avoid duplication.
Maintain and Update Datasets Over Time
- Schedule periodic reviews of key datasets to remove unused fields.
- Deprecate or rename datasets that are outdated.
- Communicate changes to stakeholders when calculations or filters are updated.
This ongoing maintenance keeps Hubspot reporting clean and prevents confusion caused by multiple similar datasets.
When to Use Hubspot Datasets vs. Direct Object Reporting
You can build reports directly on contacts, deals, or other objects, or you can build them on a dataset. Use a dataset when:
- Multiple teams need to share the same KPI definitions.
- Reports require complex calculations or joins.
- You want a governed, curated layer between raw objects and end-user reports.
Direct object reporting in Hubspot is better suited for quick, ad-hoc questions, while datasets are ideal for standardized, long-term analytics assets.
Additional Resources
For the most detailed, up-to-date product instructions, review the official documentation on how to build and activate datasets in Data Studio in Hubspot here: Hubspot Data Studio dataset guide.
If you need strategic help designing a scalable reporting architecture or optimizing your CRM setup, you can also consult a dedicated RevOps and analytics partner such as Consultevo.
By designing well-structured datasets in Hubspot Data Studio and using them consistently across your reports, you give every team a reliable, single source of truth for performance, forecasting, and decision-making.
Need Help With Hubspot?
If you want expert help building, automating, or scaling your Hubspot , work with ConsultEvo, a team who has a decade of Hubspot experience.
“`
