Semantic Layer vs. Data Pipeline: Why the dbt–Fivetran Merger Leaves AI Metrics Unresolved
Quick Answer: Moving and transforming data is not the same as governing its meaning. The June 2026 dbt Labs–Fivetran merger consolidates data ingestion and transformation under one roof, but trusted AI agents still need a layer that defines business metrics consistently across every tool that reads them. Strategy Mosaic is a universal semantic layer that defines business logic once and serves it to every BI tool, AI agent, and application, on top of any pipeline, including dbt and Fivetran.
On June 1, 2026, Fivetran and dbt Labs completed their merger and described the combined company as the data infrastructure for trusted AI agents. Strategy Software has spent the past year making a related argument: that AI is only as trustworthy as the data foundation underneath it.
The merger shows the industry is waking up to the importance. It also raises a question worth answering directly for any team rebuilding its stack around AI. Does owning the pipeline that moves and shapes your data mean you own the meaning of the data that AI agents report? It doesn't, and the gap between the two is where trusted AI succeeds or fails. This post explains what the merger consolidates, what it leaves unsolved, and where a universal semantic layer fits.
What the dbt + Fivetran merger actually consolidates
The merger brings two complementary functions into one company. Fivetran handles data movement: extracting from sources and loading complete, synchronized data into a warehouse or lakehouse. dbt handles transformation: applying tested SQL logic to turn raw tables into modeled datasets. Together they cover ingestion and transformation. That is a genuine consolidation of the bottom of the modern data stack, and for teams already standardized on both tools it removes integration friction.
A pipeline that reliably lands trustworthy, well-modeled data is valuable for everything above it. The question is not whether ingestion and transformation matter. The question is whether they are sufficient on their own to give an AI agent the business context it needs to answer a question correctly.
Does owning the data pipeline mean you own the metrics?
A pipeline governs where data lives and how it is shaped. It does not govern what a metric means when four different teams and three different AI agents each ask for "net revenue." Business meaning lives in a separate layer, and if that layer does not exist as a single source of truth, every consuming tool reinvents the definition on its own.
As an example, dbt models can define a revenue metric in one project, but a Power BI report, a Tableau workbook, a spreadsheet, and an AI agent querying through a different path can each apply their own filters, time logic, and exclusions. The warehouse is synchronized and the transformation is tested, yet the number still drifts the moment it leaves the modeled tables. Metric definitions that are correct inside the transformation layer become inconsistent the moment multiple downstream tools interpret them independently.
That inconsistency is precisely what an AI agent surfaces, and amplifies, because it answers in confident natural language without showing its work.
dbt does offer its own version of a semantic layer. But there is a fundamental difference between a transformation-bound semantic tool and a universal execution engine.
A warehouse-bound semantic layer requires metrics to be defined in developer code, compiled through specific pipelines, and restricted to the boundaries of that platform. Strategy Mosaic runs above that layer. It doesn't replace your transformation logic; it extends it, governing definitions across 200+ live sources, federating logic outside the warehouse, and instantly translating those metrics for autonomous AI agents via open standards like the Mosaic MCP.
What the data infrastructure for trusted AI agents is missing
The missing layer is the one that defines and governs business meaning independently of where data is stored or how it was moved. Strategy Mosaic is that layer, it holds metric definitions, hierarchies, and security rules centrally, then serves the same governed result to every tool and agent that asks. Two of the recurring failure modes for enterprise AI make the gap concrete.
The first is the context gap. Large language models do not know your business definitions. An LLM can write fluent SQL against a warehouse, but it has no inherent knowledge of whether "active customer" excludes trials or how your fiscal calendar is structured. A semantic layer injects that business logic so the agent reasons with your definitions rather than guessing at them. Then there are accuracy issues, models are unreliable at arithmetic and aggregation. Strategy Mosaic resolves the math in a dedicated calculation engine and lets the language model handle the language, so the figure an agent reports is computed by governed logic rather than improvised by a model. Mosaic enforces row-level and column-level security at this same layer, which means an AI query passes through the same access controls as a dashboard instead of bypassing them.
Pipeline vs. semantic layer: what each layer is responsible for
The two layers solve different problems and sit at different heights in the stack. The table below shows where each responsibility lives.
Capability | Data movement & transformation (e.g. Fivetran + dbt) | Universal semantic layer (Strategy Mosaic) |
|---|---|---|
Primary job | Sync data into the warehouse; transform raw tables into modeled datasets | Define what business metrics mean and serve them consistently |
Where business logic is consumed | Inside the warehouse and modeled tables | By every BI tool, AI agent, and application that connects |
What an AI agent reads | SQL against modeled tables, with definitions re-applied per query | One governed definition, identical across every consumer |
Governance enforcement | At the transformation and warehouse layer | Row-level and column-level security enforced centrally via Mosaic |
Who resolves the metric calculation | The consuming tool or the LLM | A dedicated calculation engine, not the language model |
Portability across BI tools | Logic is re-expressed per tool | Define once, apply everywhere, with no vendor lock-in |
The point of the table is not that one layer is better than the other. The point is that they are different layers. A consolidated pipeline does excellent work at the bottom rows. The top rows, where AI agents actually get their context, are the semantic layer's job.
Do I need a semantic layer if I already use dbt and Fivetran?
Yes, and the relationship is complementary rather than competitive. A consolidated pipeline gets clean, synchronized data into your warehouse. A universal semantic layer sits above that warehouse and turns modeled data into governed business meaning that any tool or agent can consume identically. Mosaic connects to more than 200 native data sources and resolves metric definitions centrally, so a dbt model, a Fivetran-synced warehouse, and an AI agent all read the same governed number. Because the semantic layer is tool-agnostic and warehouse-agnostic, it works on top of whatever ingestion and transformation stack you run, and you can change tools underneath without rewriting your metrics. Teams like Fannie Mae and Tapestry run their governed business logic in Strategy for exactly this reason, the definitions stay consistent no matter which tool or agent reaches for them.
What this means for teams rebuilding around AI
The merger confirms that the market now treats a trustworthy data foundation as the precondition for enterprise AI, not an afterthought. The refinement worth carrying into your own architecture is that the foundation has two floors. Ingestion and transformation move and shape the data. The semantic layer governs what it means. AI agents draw their context from the second floor, so that is the layer to get right if you want answers you can defend. See how Strategy Mosaic centralizes your semantic layer on top of any stack in a guided demo.
Frequently Asked Questions
Do I need a semantic layer if I already use dbt and Fivetran?
Yes. dbt and Fivetran move and transform data into your warehouse, while a semantic layer defines what the resulting metrics mean and serves that definition consistently to every BI tool and AI agent. Strategy Mosaic provides that governed definition layer on top of any pipeline, so the same number reaches every consumer without drift.
What is the difference between a data pipeline and a semantic layer?
A data pipeline handles where data lives and how it is shaped, covering ingestion and transformation. A semantic layer handles what the data means, holding metric definitions, hierarchies, and security rules centrally. Strategy Mosaic is a universal semantic layer that sits above the pipeline and serves one governed definition to every tool and agent.
What is the data infrastructure for trusted AI agents?
Trusted AI data infrastructure has two parts: a pipeline that delivers clean, synchronized data, and a semantic layer that gives AI agents governed business context so their answers are consistent and auditable. Strategy Mosaic supplies the semantic layer, injecting business definitions and resolving metric math in a calculation engine rather than leaving arithmetic to the language model.
Can Strategy Mosaic work alongside dbt and Fivetran?
Yes. Strategy Mosaic is warehouse-agnostic and tool-agnostic, so it runs on top of a dbt and Fivetran stack without replacing it. The pipeline lands modeled data in the warehouse, and Mosaic governs the business meaning that BI tools and AI agents consume from it.
Content:
- What the dbt + Fivetran merger actually consolidates
- Does owning the data pipeline mean you own the metrics?
- What the data infrastructure for trusted AI agents is missing
- Pipeline vs. semantic layer: what each layer is responsible for
- Do I need a semantic layer if I already use dbt and Fivetran?
- What this means for teams rebuilding around AI
- Frequently Asked Questions




.png&w=750&q=75)

_(1).png&w=750&q=75)

_(2).png&w=750&q=75)