Bigeye Staff
bigeye-staff
-
May 20, 2026

Data governance for Snowflake Intelligence agents

10 min read

TL;DR: Snowflake Intelligence is Snowflake's GA agentic AI product: a conversational interface powered by Cortex Agents that lets business users query their data in natural language. Agents run under the invoking Snowflake role's privileges, which means access scope is determined by role configuration, not by the specific task the agent is performing. Two governance requirements follow from this. First, organizations need visibility into what agents are doing: which tables they're querying, on whose behalf, and what they're returning. Second, the data those agents query needs to be reliable before the agent acts on it: a Cortex Analyst query against a stale or anomalous table produces a confident wrong answer with no error signal. This article covers how Snowflake Intelligence works, what governance native tooling handles, what it doesn't, and how Bigeye's integration addresses both the data trust and access enforcement layers.

Bigeye Staff
Get Data Insights Delivered
Join hundreds of data professionals who subscribe to the Data Leaders Digest for actionable insights and expert advice.
Join The AI Trust Summit on April 16
A one-day virtual summit on the controls enterprise leaders need to scale AI where it counts.
Get the Best of Data Leadership
Subscribe to the Data Leaders Digest for exclusive content on data reliability, observability, and leadership from top industry experts.

Get the Best of Data Leadership

Subscribe to the Data Leaders Digest for exclusive content on data reliability, observability, and leadership from top industry experts.

Stay Informed

Sign up for the Data Leaders Digest and get the latest trends, insights, and strategies in data management delivered straight to your inbox.

Get Data Insights Delivered

Join hundreds of data professionals who subscribe to the Data Leaders Digest for actionable insights and expert advice.

Snowflake Intelligence is Snowflake's ready-to-use agentic application. Business users submit natural language questions, and the system queries structured data, searches documents, and synthesizes a cited response, without requiring SQL or data engineering involvement. It went GA in November 2025 and expanded significantly in April 2026 with multi-step reasoning, personalization, and support for external data sources via MCP connectors.

Data governance for Snowflake Intelligence agents starts with understanding the architecture behind the product, because the governance questions flow directly from how agents access data. The product is one layer in a stack, and each layer introduces different governance considerations.

How Snowflake Intelligence works: the product stack

Snowflake Intelligence is the end-user-facing product. Behind it is a stack of three distinct components.

Cortex Agents is the orchestration layer. When a user submits a question, the agent plans which tools to call, executes them in sequence, and synthesizes a response. Cortex Agents is the intelligence layer that decides what to do and in what order.

Cortex Analyst is the text-to-SQL engine. It translates a natural language question into SQL using a semantic model, runs the query against Snowflake tables and views, and returns structured data. Cortex Analyst handles queries against structured data.

Cortex Search is the retrieval layer for unstructured content: documents, PDFs, emails, and other text sources. It uses hybrid semantic and keyword search to surface relevant context.

Cortex Agents routes between these tools based on the question. A question about a revenue figure goes to Cortex Analyst for a SQL query. A question referencing a policy document goes to Cortex Search. A question that requires both gets both. Snowflake Intelligence exposes this as a single conversational interface.

The system can also connect to external enterprise data sources via MCP connectors: Gmail, Google Calendar, Jira, Salesforce, Slack, and Google Docs are supported in the current release. Snowflake's pending acquisition of Natoma, announced May 27, 2026, is specifically designed to bring centralized identity, policy enforcement, and audit controls to this MCP connectivity layer.

How Snowflake Intelligence agents access your data

This is the governance-critical detail. Cortex Agents inherit the privileges of the Snowflake role that invokes them. If a business user's role has SELECT access on a broad set of tables, the agent running on their behalf has the same access. The agent doesn't scope down to the minimum required for the specific task it's performing.

In practice, this means the effective access model for Snowflake Intelligence agents is a function of how carefully your Snowflake RBAC is designed. An over-scoped role becomes an over-scoped agent. The agent's query can reach any table the role can reach, including tables with sensitive data that the user wasn't explicitly intended to access through a conversational interface.

This is a known issue in the Snowflake practitioner community. The recommended approach is to create dedicated service accounts for agent workloads with tightly scoped roles. But most organizations haven't yet operationalized that pattern across every agent deployment, and Snowflake Intelligence makes it easy for business users to spin up agents without IT involvement, which means new agents regularly appear with whatever role the user already has.

What Snowflake provides natively

Snowflake has built meaningful governance tooling into its platform, and it's worth understanding what each piece covers.

AI Observability. Snowflake's native AI observability capability traces every step of agent execution: input prompts, retrieved context, tool calls, LLM inference. Trace data is stored in the SNOWFLAKE.LOCAL.AI_OBSERVABILITY_EVENTS table, which records tool names called, token usage, execution duration, model used, evaluation scores, user feedback, and agent planning steps. Organizations and third-party tools can query this table to understand what agents are doing.

One important change: a breaking change in April 2026 redacted prompts and completions from this table by default. To read unredacted conversation content (what the user asked and what the agent returned), a role must be explicitly granted the READ UNREDACTED AI OBSERVABILITY EVENTS TABLE account-level privilege. This is a deliberate governance control: even Snowflake administrators can't see full conversation content without it.

Horizon Catalog. Snowflake's universal AI catalog supports RBAC, ABAC, sensitive data classification, tagging, monitoring, lineage, and AI agent support within the Snowflake environment. The AI_REDACT function can identify and mask PII in unstructured text before it enters agent context. Coverage is Snowflake-native; data and agents outside the platform are out of scope.

Cortex AI function cost monitoring. GA'd in March 2026, this gives organizations visibility into Cortex AI function spending by function, model, user, role, and warehouse. This addresses one of the top practitioner pain points: Cortex AI queries can be expensive, and without monitoring, costs accumulate without warning before a month-end bill arrives.

Budget controls. Snowflake provides resource monitors and budget controls for Cortex AI function consumption, allowing teams to set spend limits at the account, warehouse, or team level.

What native tooling doesn't cover

Three things fall outside what Snowflake's native tools address.

Data quality signals. Snowflake Intelligence has no native mechanism to validate that the tables Cortex Analyst is querying are fresh, complete, or anomaly-free before the agent uses them as the basis for its answers. If a data pipeline writes stale or incomplete data to a table and a Snowflake Intelligence agent queries that table, the agent answers confidently using bad data. There's no error signal. The answer looks like a correct answer. The reliability of Snowflake Intelligence answers is a direct function of the reliability of the underlying data, and that reliability has to be tracked upstream of the agent, not by the agent itself.

Cross-agent registry. Native Snowflake tooling doesn't provide a registry-level view of all Snowflake Intelligence agents connected across an organization, who owns each, what scope they have, and what each has done. The AI Observability table records events, but transforming that event stream into a live agent registry with ownership, authorization scope, and activity history requires additional tooling.

Enforcement at query time based on data sensitivity. Snowflake's classification and tagging capabilities can mark sensitive columns, but they don't intercept agent queries before execution to check whether the agent is authorized to access a classified field. Enforcement that prevents an agent from reading a Restricted column requires a control layer that operates between the agent's request and the query execution, not a policy applied after the fact.

How Bigeye integrates with Snowflake Intelligence

Bigeye's integration with Snowflake Intelligence connects through the SNOWFLAKE.LOCAL.AI_OBSERVABILITY_EVENTS table and the broader Snowflake data environment. The integration surfaces two layers of trust signal that the native tooling doesn't provide: data quality as a precondition for reliable agent answers, and enforcement controls over what agents can reach at query time.

Agent visibility. When a Snowflake Intelligence agent interacts with your data through the Agent Trust Hub, it appears in the agent registry with its identity, the tables it has accessed, the queries it has run, and its token consumption. Activity is attributed to the specific agent and user, giving you the audit trail that the raw event table provides in raw form but requires interpretation to use.

Data quality signals. Bigeye monitors the tables that Snowflake Intelligence agents query, tracking freshness, volume, and anomaly status using automated lineage to know which tables are in scope. When an agent queries a table, the trust layer knows whether that table's data was fresh and anomaly-free at query time. That signal appears alongside agent activity in the Trust Dashboard, so teams can see not just what agents did, but whether the data they acted on was trustworthy when they acted on it.

Classification and enforcement. AI Guardian intercepts Snowflake Intelligence agent queries before they execute and checks them against the current sensitivity classification of the requested data. An agent requesting access to a column tagged as Restricted gets blocked at the query layer, before the field is read. The enforcement runs on live signals from Data Classification, with full lineage context attached, so the check accounts for not just what a column is classified as but what it connects to upstream and downstream.

The result is an enforcement model that doesn't require redesigning Snowflake role configurations from scratch. When the role-based access model is broader than it should be for a given agent workload, AI Guardian provides the compensating control at the data layer.

share with a colleague
Resource
Monthly cost ($)
Number of resources
Time (months)
Total cost ($)
Software/Data engineer
$15,000
3
12
$540,000
Data analyst
$12,000
2
6
$144,000
Business analyst
$10,000
1
3
$30,000
Data/product manager
$20,000
2
6
$240,000
Total cost
$954,000
Role
Goals
Common needs
Data engineers
Overall data flow. Data is fresh and operating at full volume. Jobs are always running, so data outages don't impact downstream systems.
Freshness + volume
Monitoring
Schema change detection
Lineage monitoring
Data scientists
Specific datasets in great detail. Looking for outliers, duplication, and other—sometimes subtle—issues that could affect their analysis or machine learning models.
Freshness monitoringCompleteness monitoringDuplicate detectionOutlier detectionDistribution shift detectionDimensional slicing and dicing
Analytics engineers
Rapidly testing the changes they’re making within the data model. Move fast and not break things—without spending hours writing tons of pipeline tests.
Lineage monitoringETL blue/green testing
Business intelligence analysts
The business impact of data. Understand where they should spend their time digging in, and when they have a red herring caused by a data pipeline problem.
Integration with analytics toolsAnomaly detectionCustom business metricsDimensional slicing and dicing
Other stakeholders
Data reliability. Customers and stakeholders don’t want data issues to bog them down, delay deadlines, or provide inaccurate information.
Integration with analytics toolsReporting and insights

What is the difference between Snowflake Intelligence and Cortex Analyst?

Cortex Analyst is the text-to-SQL component that translates natural language questions into SQL queries using a semantic model and returns structured data from Snowflake tables. Snowflake Intelligence is the end-user-facing product built on top of Cortex Agents, which orchestrates Cortex Analyst, Cortex Search, and other tools to answer multi-step questions and synthesize responses with citations. Cortex Analyst handles structured data queries. Snowflake Intelligence handles full conversational workflows across structured and unstructured data, including external sources via MCP connectors.

What is the inherited privilege problem for Cortex Agents?

Cortex Agents run under the Snowflake role of the user or service account that invokes them. That means the agent's data access scope is the same as the invoking role's access scope. A business user whose role has broad SELECT access across a warehouse gives that same access to any Cortex Agent running on their behalf. The agent can query any table the role can query, including tables containing sensitive data, without any scoping down to the minimum required for the specific task. The recommended mitigation is dedicated service accounts with tightly scoped roles for each agent workload. AI Guardian provides a compensating enforcement layer for organizations where role scoping hasn't been fully operationalized.

How does Bigeye's Agent Trust Hub differ from Monte Carlo's Snowflake Intelligence integration?

Monte Carlo is built only for cloud-native stacks: Snowflake, Databricks, BigQuery, Redshift. Organizations running mixed environments with on-premises Oracle or SQL Server alongside cloud platforms find lineage coverage degrades in non-Snowflake/Databricks engines. Monte Carlo's AWS-hosted OTel collector for agent monitoring is AWS-specific; organizations running AI workloads on Azure, GCP, or on-premises need to manage their own collectors. Bigeye's platform covers hybrid and legacy-plus-cloud environments natively, including Oracle, SQL Server, Informatica, and Talend, which matters for enterprises where Snowflake Intelligence sits on top of a mixed data estate rather than a fully cloud-native stack.

about the author

Bigeye Staff

Bigeye Staff represents the collective voice of the Bigeye team. Each article is informed by the expertise of individual contributors and strengthened through collaboration across our engineers, data experts, and product leaders, reflecting our shared mission to help teams build trust in their data.

about the author

about the author

Bigeye Staff represents the collective voice of the Bigeye team. Each article is informed by the expertise of individual contributors and strengthened through collaboration across our engineers, data experts, and product leaders, reflecting our shared mission to help teams build trust in their data.

Get the Best of Data Leadership

Subscribe to the Data Leaders Digest for exclusive content on data reliability, observability, and leadership from top industry experts.

Want the practical playbook?

Join us on April 16 for The AI Trust Summit, a one-day virtual summit focused on the production blockers that keep enterprise AI from scaling: reliability, permissions, auditability, data readiness, and governance.

Get Data Insights Delivered

Join hundreds of data professionals who subscribe to the Data Leaders Digest for actionable insights and expert advice.

Join the Bigeye Newsletter

1x per month. Get the latest in data observability right in your inbox.