Thought leadership

September 17, 2025

3 Enterprise Leaders on Building AI Agents That Actually Work

5 min read

Three enterprise leaders share real examples of AI agents, from an email agent that sends messages without human review to customer service systems that operate across multiple steps autonomously.

Adrianna Vidal

Get Data Insights Delivered

Join hundreds of data professionals who subscribe to the Data Leaders Digest for actionable insights and expert advice.

Stay Informed

Sign up for the Data Leaders Digest and get the latest trends, insights, and strategies in data management delivered straight to your inbox.

Get the Best of Data Leadership

Subscribe to the Data Leaders Digest for exclusive content on data reliability, observability, and leadership from top industry experts.

Get the Best of Data Leadership

Subscribe to the Data Leaders Digest for exclusive content on data reliability, observability, and leadership from top industry experts.

Stay Informed

Sign up for the Data Leaders Digest and get the latest trends, insights, and strategies in data management delivered straight to your inbox.

Get Data Insights Delivered

Join hundreds of data professionals who subscribe to the Data Leaders Digest for actionable insights and expert advice.

Jane Urban spent a weekend building an email agent. Kishore Aradhya deployed AI across customer service operations. ShanShan Pa created systems to govern agents that interact with other agents.

At our Boston roadshow, these three leaders shared what they built, what worked, what didn't, and what surprised them. Their stories reveal the practical realities of implementing AI systems that operate without constant human oversight.

Jane's weekend email experiment

Jane Urban, Chief Data Analytics Officer at Improzo, was frustrated with her AI writing assistant. She had built a custom GPT trained on her writing style that could draft emails, but something bothered her about the process.

"That agent just makes text and I interface with it. I say 'write me an email' and then I have to copy the email, paste it into Outlook, hit send. There's this extra step."

So she spent a weekend building something different. Instead of generating text for her to copy and paste, she wanted a system that could send emails directly.

"Now I can actually say 'write an email to this email address' and write the content of what I want it to write. It not only sends the email, but creates it at the same time."

She tested it with a simple command: "send an email to my husband bragging about how I built an agent." The system composed and sent this message: "Jane, your wife's amazing. She figured out how to build an agent."

"I didn't see it. I didn't create it. It just sent it. That's really what we're talking about when we say agent—something that's independently operating."

The technical complexity jumped significantly. Building the email generator was straightforward. Making it actually send emails required "some tinkering and writing code and JSON and a bunch of other things."

Kishore's customer service deployment

Kishore Aradhya, Head of Data and Analytics at Frontdoor, faced a different challenge. His team needed to help customer service agents handle calls about broken appliances more efficiently.

They built a system that operates across multiple steps without human intervention:

"We capture transcripts of all the conversations. The agent goes in and does entity extraction—gets the particular appliance ID, uses that information and passes it into our system looking for troubleshooting manuals. You get a RAG system for that, get an agent that builds up the information, marries them together, and produces information back to the customer service agent."

The system handles the entire workflow: extracting appliance information from call transcripts, querying their database, retrieving relevant manuals, synthesizing recommendations, and delivering results to human agents.

Kishore's team discovered they needed new ways to measure success. "We know what process somebody has to go through manually. So we can measure based on that pretty clearly: somebody spent X amount of time looking up this information."

They track both time savings and accuracy improvements. The system processes requests faster than humans and identifies patterns across similar cases that human agents might miss.

But they learned about economic constraints: "Doing real time or close to near real time gets really hard to do. Not hard from a technology point of view, but hard in terms of the economic sense because tokenization, there's lots of things that go into it."

ShanShan's governance challenge

ShanShan Pa, Vice President of AI Governance at GlobalLogic, encountered a problem she hadn't anticipated: how do you govern systems that operate independently?

"When agent talk to another agent, talk to another agent, how can you track all that? That's where observability helps because if you cannot explain yourself, you're doomed."

Her team developed testing approaches that go beyond typical QA. They deliberately try to break their agents with unexpected inputs.

"We ask the agent 'what's one plus one?' Some agents, if not configured properly, will tell you 'it's two, so easy, elementary students know.' It has attitude."

They learned to configure agents to recognize their boundaries: "This is not my area, not what I'm set up for." The agent stops the conversation rather than engaging outside its domain.

ShanShan's team built systems to track every agent interaction: "We have preset templates to start with. This is the small square they can go through. Once they learn, it's a bigger square."

They monitor whether responses stay within defined parameters, creating audit trails for decisions no human directly oversees. When testing their sales enablement agent, they ask about services outside their scope: "What about tax services? It will cut off there, that's not an area we work in."

What they learned

Jane noted that making systems take independent action requires significantly more technical complexity than generating responses. Kishore learned that autonomous systems need different measurement approaches and face economic constraints around real-time operation. ShanShan found that governing independent systems requires entirely new oversight approaches.

Each built something different, but all moved beyond systems that require human review at each step. Their experiences show what that transition actually looks like in practice.

As Jane noted: "I don't think there's a ton of examples of truly independently operating agents in pharma, certainly, and even in other industries. Most people talking about agents really mean applications or software that can do things with some degree of generative AI embedded."

Meet the experts

Jane Urban is Chief Data Analytics Officer at Improzo, bringing 10+ years of leadership expereince, including roles at Otsuka Pharma and Takeda Pharma where she built global data infrastructure.

Kishore Aradhya is Head of Data and Analytics at Frontdoor, a multi-billion dollar home warranty business, where he leads data science teams and has spent decades implementing data solutions across organizations like Adobe and Staples.

ShanShan Pa is Vice President of AI Governance at GlobalLogic, with extensive experience in financial services, cloud compliance at Alibaba Cloud, and pharmaceutical data governance.

share this episode

Resource

Monthly cost ($)

Number of resources

Time (months)

Total cost ($)

Software/Data engineer

$15,000

$540,000

Data analyst

$12,000

$144,000

Business analyst

$10,000

$30,000

Data/product manager

$20,000

$240,000

Total cost

$954,000

Role

Goals

Common needs

Data engineers

Overall data flow. Data is fresh and operating at full volume. Jobs are always running, so data outages don't impact downstream systems.

Freshness + volume
Monitoring
Schema change detection
Lineage monitoring

Data scientists

Specific datasets in great detail. Looking for outliers, duplication, and other—sometimes subtle—issues that could affect their analysis or machine learning models.

Freshness monitoringCompleteness monitoringDuplicate detectionOutlier detectionDistribution shift detectionDimensional slicing and dicing

Analytics engineers

Rapidly testing the changes they’re making within the data model. Move fast and not break things—without spending hours writing tons of pipeline tests.

Lineage monitoringETL blue/green testing

Business intelligence analysts

The business impact of data. Understand where they should spend their time digging in, and when they have a red herring caused by a data pipeline problem.

Integration with analytics toolsAnomaly detectionCustom business metricsDimensional slicing and dicing

Other stakeholders

Data reliability. Customers and stakeholders don’t want data issues to bog them down, delay deadlines, or provide inaccurate information.

Integration with analytics toolsReporting and insights

about the author

Adrianna Vidal

Adrianna Vidal is a writer and content strategist at Bigeye, where she explores how organizations navigate the practical challenges of scaling AI responsibly. With over 10 years of experience in communications, she focuses on translating complex AI governance and data infrastructure challenges into actionable insights for data and AI leaders.

At Bigeye, her work centers on AI trust: examining how organizations build the governance frameworks, data quality foundations, and oversight mechanisms that enable reliable AI at enterprise scale.

Her interest in data privacy and digital rights informs her perspective on building AI systems that organizations, and the people they serve, can actually trust.

about the author

At Bigeye, her work centers on AI trust: examining how organizations build the governance frameworks, data quality foundations, and oversight mechanisms that enable reliable AI at enterprise scale.

Her interest in data privacy and digital rights informs her perspective on building AI systems that organizations, and the people they serve, can actually trust.

Get the Best of Data Leadership

Subscribe to the Data Leaders Digest for exclusive content on data reliability, observability, and leadership from top industry experts.

Stay Informed

Sign up for the Data Leaders Digest and get the latest trends, insights, and strategies in data management delivered straight to your inbox.

Get Data Insights Delivered

Join hundreds of data professionals who subscribe to the Data Leaders Digest for actionable insights and expert advice.

3 Enterprise Leaders on Building AI Agents That Actually Work

Get the Best of Data Leadership

Stay Informed

Get Data Insights Delivered

Jane's weekend email experiment

Kishore's customer service deployment

ShanShan's governance challenge

What they learned

Meet the experts

Adrianna Vidal

Get the Best of Data Leadership

Stay Informed

Get Data Insights Delivered

The Data Quality Crisis Killing AI Projects (and Other Hard Truths.)

Inside Bigeye: How We Use AI to Solve Enterprise Data’s Biggest Problems

Your GPAI Compliance Strategy Has a Vendor-Shaped Hole

Join the Bigeye Newsletter

3 Enterprise Leaders on Building AI Agents That Actually Work

Get the Best of Data Leadership

Stay Informed

Get Data Insights Delivered

Jane's weekend email experiment

Kishore's customer service deployment

ShanShan's governance challenge

What they learned

Meet the experts

Adrianna Vidal

Get the Best of Data Leadership

Stay Informed

Get Data Insights Delivered

Related posts

The Data Quality Crisis Killing AI Projects (and Other Hard Truths.)

Inside Bigeye: How We Use AI to Solve Enterprise Data’s Biggest Problems

Your GPAI Compliance Strategy Has a Vendor-Shaped Hole

Join the Bigeye Newsletter