dbt Mesh FAQs

What are the main benefits of implementing dbt Mesh?

What are model contracts?

What are model versions?

What are model access modifiers?

What are model groups?

What are some potential challenges when using dbt Mesh?

How does this relate to the concept of data mesh?

Can dbt Mesh handle cyclic dependencies between projects?

Is it possible for multiple projects to directly reference a shared source?

What if a model I've already built on from another project later becomes protected?

If I run `dbt build --select +model`, will this trigger a run of upstream models in other projects?

If each project/domain has its own data warehouse, is it still possible to build models across them?

Can I run tests that involve tables from multiple different projects?

Which team's data schema would dbt Mesh create?

Is it possible to apply model contracts to source data?

Can contracts be partially enforced?

Can I have multiple owners in a group?

Can contracts be assigned individual owners?

Can I make a model “public” only for specific team(s) to use?

Is it possible to orchestrate job runs across multiple different projects?

Integrations available between the dbt Cloud Discovery API and other tools for cross-project lineage?

How does data restatement work in dbt Mesh, particularly when fixing a data set bug?

How does dbt handle job run logs and can it feed them to standard monitoring tools, reports, etc.?

Can dbt Mesh reference models in other accounts within the same data platform?

How do user access permissions work in dbt Mesh?

How do all the different types of “access” interact?

There’s model-level access within dbt, role-based access for users and groups in dbt Cloud, and access to the underlying data within the data platform.

First things first: access to underlying data is always defined and enforced by the underlying data platform (for example, BigQuery, Databricks, Redshift, Snowflake, Starburst, etc.) This access is managed by executing “DCL statements” (namely grant). dbt makes it easy to configure grants on models, which provision data access for other roles/users/groups in the data warehouse. However, dbt does not automatically define or coordinate those grants unless they are configured explicitly. Refer to your organization's system for managing data warehouse permissions.

dbt Cloud Enterprise plans support role-based access control (RBAC) that manages granular permissions for users and user groups. You can control which users can see or edit all aspects of a dbt Cloud project. A user’s access to dbt Cloud projects also determines whether they can “explore” that project in detail. Roles, users, and groups are defined within the dbt Cloud application via the UI or by integrating with an identity provider.

Model access defines where models can be referenced. It also informs the discoverability of those projects within dbt Explorer. Model access is defined in code, just like any other model configuration (materialized, tags, etc).

Public: Models with public access can be referenced everywhere. These are the “data products” of your organization.
Protected: Models with protected access can only be referenced within the same project. This is the default level of model access. We are discussing a future extension to protected models to allow for their reference in specific downstream projects. Please read the GitHub issue, and upvote/comment if you’re interested in this use case.
Private: Model groups enable more-granular control over where private models can be referenced. By defining a group, and configuring models to belong to that group, you can restrict other models (not in the same group) from referencing any private models the group contains. Groups also provide a standard mechanism for defining the owner of all resources it contains.

Within dbt Explorer, public models are discoverable for every user in the dbt Cloud account — every public model is listed in the “multi-project” view. By contrast, protected and private models in a project are visible only to users who have access to that project (including read-only access).

Because dbt does not implicitly coordinate data warehouse grants with model-level access, it is possible for there to be a mismatch between them. For example, a public model’s metadata is viewable to all dbt Cloud users, anyone can write a ref to that model, but when they actually run or preview, they realize they do not have access to the underlying data in the data warehouse. This is intentional. In this way, your organization can retain least-privileged access to underlying data, while providing visibility and discoverability for the wider organization. Armed with the knowledge of which other “data products” (public models) exist — their descriptions, their ownership, which columns they contain — an analyst on another team can prepare a well-informed request for access to the underlying data.

Is it possible to request access permissions from other teams within dbt Cloud?

As a central data team member, can I still maintain visibility on the entire organizational DAG?

How can I limit my developers from accessing sensitive production data when referencing from other projects?

Does dbt Mesh work if projects are 'duplicated' (dev project <> prod project)?

How does the dbt Semantic Layer relate to and work with dbt Mesh?

The dbt Semantic Layer and dbt Mesh are complementary mechanisms enabled by dbt Cloud that work together to enhance the management, usability, and governance of data in large-scale data environments.

The Semantic Layer in dbt Cloud allows teams to centrally define business metrics and dimensions. It ensures consistent and reliable metric definitions across various analytics tools and platforms.

dbt Mesh enables organizations to split their data architecture into multiple domain-specific projects, while retaining the ability to reference “public” models across projects. It is also possible to reference a “public” model from another project for the purpose of defining semantic models and metrics. Your organization can have multiple dbt projects feed into a unified semantic layer, ensuring that metrics and dimensions are consistently defined and understood across these domains.

When using the dbt Semantic Layer in a dbt Mesh setting, we recommend the following:

You have one standalone project that contains your semantic models and metrics.
Then as you build your Semantic Layer, you can cross-reference dbt models across your various projects or packages to create your semantic models using the two-argument ref function( ref('project_name', 'model_name')).
Your dbt Semantic Layer project serves as a global source of truth across the rest of your projects.

Usage example

For example, let's say you have a public model (fct_orders) that lives in the jaffle_finance project. As you build your semantic model, use the following syntax to ref the model:

models/metrics/semantic_model_name.yml

semantic_models:
  - name: customer_orders
    defaults:
      agg_time_dimension: first_ordered_at
    description: |
      Customer grain mart that aggregates customer orders.
    model: ref('jaffle_finance', 'fct_orders') # ref('project_name', 'model_name')
    entities:
      ...rest of configuration...
    dimensions:
      ...rest of configuration...
    measures:
      ...rest of configuration...

Notice that in the model parameter, we're using the ref function with two arguments to reference the public model fct_orders defined in the jaffle_finance project.

How does dbt Explorer relate to and work with dbt Mesh?

How does the dbt Cloud CLI relate to and work with dbt Mesh?

Does dbt Mesh require me to be on a specific version of dbt?

Is there a way to leverage dbt Mesh capabilities in dbt Core?

Does dbt Mesh require a specific dbt Cloud plan?

Is there a recommended migration or implementation process?

Are there tools available to help me migrate to a dbt Mesh?

My team isn’t structured to require multiple projects today. What aspects of dbt Mesh are relevant to me?

Overview of Mesh

How dbt Mesh works

Permissions and access

Compatibility with other features

Usage example

Availability

Tips on implementing dbt Mesh

Overview of Mesh​

How dbt Mesh works​

Permissions and access​

Compatibility with other features​

Usage example​

Availability​

Tips on implementing dbt Mesh​

Overview of Mesh

How dbt Mesh works

Permissions and access

Compatibility with other features

Usage example

Availability

Tips on implementing dbt Mesh