Snowflake launches Agent GPA, aims to grade your AI agents

Published November 4, 2025

Snowflake wants to give your AI agents a GPA. The name suggests a grade, but the acronym stands for what the framework evaluates: goals, plans and actions (GPA). Snowflake says the open-source framework approaches human-level error detection and error localization accuracy.

The framework, called Agent GPA, was outlined at Snowflake's Build conference. For enterprises deploying agentic AI, Snowflake's efforts are worth a look.

In a blog post, Snowflake's AI Research team said evaluating AI agents comes down to trust. Snowflake said:

"An agent's answer may appear successful, but the path it took to get there may not be. Was the goal achieved efficiently? Did the plan make sense? Were the right tools used? Did the agent follow through? Without visibility into these steps, teams risk deploying agents that look reliable but create hidden costs in production. Inaccuracies can waste compute, inflate latency and lead to the wrong business decisions, all of which erode trust at scale."

Snowflake argued that current evaluation frameworks fall short because they focus on the final answer, not the process behind it. Agent GPA, outlined in a paper, is available in TruLens.
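To make the difference between final-answer and process-level evaluation concrete, here is a minimal Python sketch. It is not the Agent GPA implementation or the TruLens API. The `Step` and `Trace` types, the `score_trace` function and the three scores are hypothetical, chosen only to mirror the questions Snowflake raises: was the goal met, were the planned tools used, and did the actions follow through.

```python
from dataclasses import dataclass, field


@dataclass
class Step:
    planned_tool: str  # tool the agent's plan said it would use (hypothetical field)
    used_tool: str     # tool the agent actually called
    succeeded: bool    # whether the call completed without error


@dataclass
class Trace:
    goal_met: bool                              # did the final answer satisfy the request?
    steps: list = field(default_factory=list)   # ordered list of Step records


def score_trace(trace: Trace) -> dict:
    """Return toy goal/plan/action scores in [0, 1] for one agent run."""
    n = len(trace.steps)
    # Fraction of steps where the executed tool matched the planned tool.
    plan = sum(s.planned_tool == s.used_tool for s in trace.steps) / n if n else 0.0
    # Fraction of steps whose tool call succeeded.
    action = sum(s.succeeded for s in trace.steps) / n if n else 0.0
    return {"goal": 1.0 if trace.goal_met else 0.0, "plan": plan, "action": action}


# A run whose final answer looks fine but whose process was shaky.
example = Trace(
    goal_met=True,
    steps=[
        Step(planned_tool="search", used_tool="search", succeeded=True),
        Step(planned_tool="sql", used_tool="search", succeeded=False),
    ],
)
scores = score_trace(example)
```

An evaluator that looked only at `goal_met` would pass this run. The plan and action scores of 0.5 each expose the step where the agent deviated from its plan and a tool call failed.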

Snowflake's Agent GPA was the headliner among a set of items released by the company's research team.

Other items include:

Text-to-SQL V1.5, a specialized model that fuels Snowflake Intelligence, Snowflake's enterprise agent, by tackling the slowness, cost, and dialect issues of general LLMs. The specialized model makes text-to-SQL queries up to 3 times faster while maintaining accuracy.
New optimizations will be introduced for Cortex AISQL, a tool that integrates AI directly into SQL queries, enabling teams to analyze all data types and build flexible AI pipelines using familiar SQL syntax.
The Cortex AISQL enhancements improve AI operator efficiency and cost, featuring 2-8x more performant execution plans, 2-6x faster inference (at 90-95% accuracy), and a 15-70x reduction in execution costs and time through techniques like cost-aware optimization, adaptive model cascading, and query enhancements.
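Model cascading, one of the techniques named above, generally means trying a cheaper model first and escalating to a costlier one only when the cheap model is not confident. The Python sketch below illustrates that general idea under stated assumptions. It is not Snowflake's implementation: the `cascade` function, the stub models and the 0.8 threshold are all hypothetical.

```python
def cascade(prompt, models, threshold=0.8):
    """Run models cheapest-first; return (answer, index of the model that answered).

    Each model is a callable returning (answer, confidence in [0, 1]).
    The last model's answer is accepted regardless of its confidence.
    """
    if not models:
        raise ValueError("at least one model is required")
    for i, model in enumerate(models):
        answer, confidence = model(prompt)
        if confidence >= threshold or i == len(models) - 1:
            return answer, i


# Stub models standing in for real inference calls (hypothetical behavior).
def small_model(prompt):
    # Confident only on inputs it finds easy.
    return ("positive", 0.95) if "great" in prompt else ("unknown", 0.4)


def large_model(prompt):
    return ("negative", 0.99)


easy = cascade("great product", [small_model, large_model])
hard = cascade("it was fine, I guess", [small_model, large_model])
```

Here the easy input is answered by the small model (index 0), while the harder one falls through to the large model (index 1), so the expensive call is paid for only when needed.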

Larry Dignan

Editor in Chief of Constellation Insights
Constellation Research

Larry Dignan is Editor in Chief of Constellation Insights at Constellation Research, where he leads editorial coverage focused on enterprise technology, digital transformation, and emerging trends shaping the future of business. He oversees research-driven news, analysis, interviews, and event coverage designed to help technology buyers and vendors navigate complex markets with clarity and context. ...
