Agentic AI Is Here—Are Your QA and QC Practices Ready?

The rise of artificial intelligence (AI), particularly large language models (LLMs) and agentic AI systems, is reshaping the software landscape. Far beyond a tool for productivity gains—such as automating repetitive tasks or generating code snippets—AI is fundamentally altering how software is designed, developed, and validated. This transformation brings profound implications for Quality Assurance (QA) and Quality Control (QC), the twin pillars of software quality. While traditional software relied on deterministic processes with predictable outcomes, AI introduces non-determinism, complexity, and interconnectedness that challenge long-standing assumptions. In domains like Customer Relationship Management (CRM), where systems like Salesforce orchestrate critical business processes, these shifts demand a reevaluation of how we ensure software is both the right solution and built correctly.

This article explores the paradigm shift AI imposes on software design and its ripple effects on QA and QC. Drawing on the nuances of non-deterministic systems and the unique challenges of agentic AI, we’ll examine why traditional approaches fall short and propose new strategies to navigate this uncharted territory. Through a CRM-focused example, we’ll ground these concepts in a practical context, illustrating how businesses can adapt to harness AI’s potential while safeguarding quality.

The Evolution of Software Design: From Deterministic to Probabilistic

Traditional software design assumes predictability. A function in a Java program, given the same inputs, produces the same outputs—every time. Developers and testers rely on this determinism to define requirements, write code, and verify behavior. In contrast, AI systems, particularly LLMs like those powering ChatGPT, Gemini, Claude or Grok, operate probabilistically. The same prompt can yield different responses depending on factors like temperature settings, training data drift, or even the whims of random sampling. This non-determinism isn’t a flaw; it’s a feature that enables creativity, adaptability, and human-like reasoning.

Agentic AI—systems that autonomously make decisions, interact with tools, and orchestrate workflows—takes this further. Imagine a Salesforce-based agentic AI designed to manage customer interactions: it might query a CRM database, generate personalized emails using an LLM, and schedule follow-ups via an external calendar API, all without human intervention. Each component introduces variability, and their interactions create emergent behaviors that defy simple prediction. Software design in this context shifts from crafting rigid logic to defining flexible frameworks that guide AI toward desired outcomes while tolerating uncertainty.

This evolution redefines the role of developers. Instead of writing explicit instructions, they design guardrails—constraints and objectives that steer AI behavior. For example, in a Salesforce environment, developers might configure an agent to prioritize high-value leads based on CRM data, but the exact phrasing of its outreach emails depends on the LLM’s interpretation. This shift demands new skills: prompt engineering, probabilistic modeling, and systems thinking to anticipate how components interact. The result is software that’s less a monolith and more a living ecosystem, constantly adapting to new inputs and contexts.

QA and QC: A Tale of Two Disciplines

To understand AI’s impact, we must first revisit the distinction between QA and QC, as articulated by software engineering pioneers like Ian Sommerville. QA asks, “Are we building the right thing?” It’s a strategic, process-oriented discipline focused on aligning software with user needs, business goals, and ethical standards. QC, conversely, asks, “Are we building it right?” It’s tactical, centered on testing and inspecting deliverables to ensure they meet specifications.

In traditional software, QA might involve validating that a CRM feature, like Salesforce’s Opportunity Management, addresses sales teams’ needs for pipeline tracking. QC would then test the feature—checking that calculations for deal stages are accurate and the UI renders correctly across browsers. Both rely on determinism: QA assumes requirements can be clearly defined, and QC expects consistent outputs to verify against.

AI upends these assumptions. A Salesforce agentic AI, for instance, might autonomously score leads, draft emails, and log interactions. QA must grapple with questions like: Does the system prioritize the right leads? Are its communications culturally appropriate? QC faces even thornier issues: How do you test a system where identical inputs produce varied outputs? These challenges require rethinking both disciplines, particularly in the CRM domain, where trust and precision are paramount.

The QA Challenge: Building the Right Thing in a Non-Deterministic World

QA’s mission—ensuring software solves the right problem—becomes exponentially harder with AI. Non-determinism blurs the line between “right” and “wrong.” Consider a Salesforce agentic AI tasked with automating customer support ticket resolution. The “right” solution might mean resolving 90% of tickets without human intervention while maintaining customer satisfaction. But how do you define “satisfaction” when the AI’s responses vary? QA must shift from rigid requirements to flexible objectives, balancing measurable outcomes (e.g., resolution rate) with qualitative goals (e.g., tone consistency).

Use Case: Salesforce Agentic AI for Lead Nurturing

Let’s ground this in a CRM example. A company uses Salesforce to manage leads, deploying an agentic AI to nurture prospects. The system:

·     Analyzes CRM data to identify high-potential leads.

·     Generates personalized email drafts using an LLM.

·     Schedules follow-ups based on prospect responses, integrating with an external calendar tool.

QA’s Role: QA starts by defining the system’s purpose. The “right thing” here is maximizing conversions while preserving brand voice and avoiding spam-like behavior. Key questions include:

·      Does the AI correctly identify high-potential leads based on data like purchase history or engagement metrics?

·      Are emails personalized without being overly informal or off-brand?

·      Does the system respect opt-out requests to comply with regulations like GDPR?

To answer these, QA must:

·      Engage Stakeholders: Collaborate with sales teams, marketing, and compliance officers to align on goals. For instance, sales might prioritize lead volume, while marketing emphasizes tone.

·      Define Success Metrics: Combine quantitative measures (e.g., conversion rate increase of 15%) with qualitative checks (e.g., emails score ≥8/10 for relevance in human reviews).

·      Simulate Workflows: Model real-world scenarios, like a lead responding negatively, to ensure the AI adapts appropriately (e.g., pausing outreach).

The challenge lies in variability. The LLM might generate emails that are brilliant one day and awkward the next. QA must anticipate this, setting boundaries—like requiring emails to avoid certain phrases—while accepting that exact outputs can’t be prescribed. This is a departure from traditional QA, where requirements like “the button turns blue on hover” left no room for ambiguity.

Ethical Considerations

AI also raises ethical stakes for QA. In our Salesforce example, what if the AI prioritizes leads based on biased data, favoring certain demographics? QA must proactively design processes to detect and mitigate bias, such as auditing lead-scoring algorithms or sampling outputs for fairness. This isn’t just about building the right thing for the business—it’s about ensuring the system aligns with societal values.

The QC Challenge: Building It Right When “Right” Is a Range

QC, focused on verifying correctness, faces a more immediate hurdle: how do you test a system that doesn’t produce consistent results? Traditional QC relies on deterministic test cases—input X should yield output Y. With LLMs, output Y might be a range of responses, some excellent, others flawed. Agentic AI compounds this, as interactions between components create unpredictable outcomes.

Revisiting the Salesforce Example

For our lead-nurturing AI, QC must verify that:

·      Lead-scoring logic correctly ranks prospects based on CRM data.

·      Email drafts meet quality standards (e.g., grammatically correct, on-brand).

·      Follow-up scheduling respects constraints (e.g., no emails sent at 3 AM).

QC’s Role: Traditional tests won’t suffice. QC must adopt probabilistic approaches:

·      Statistical Testing: Evaluate outputs against distributions, not single cases. For instance, test that 95% of emails score above a certain quality threshold (e.g., using metrics like readability or sentiment analysis).

·      Robustness Checks: Probe edge cases, like incomplete CRM data or ambiguous prospect replies, to ensure the system doesn’t break. For example, if a lead’s email bounces, does the AI retry appropriately?

·      End-to-End Validation: Test the entire pipeline, from lead scoring to scheduling, to catch errors in component interactions. A perfectly scored lead is useless if the follow-up email is scheduled for the wrong time zone.

The Versioning Problem

A unique complication arises from how LLMs are updated. Unlike traditional APIs, where versions are clearly labeled (e.g., Salesforce API v60.0), public LLMs often receive opaque updates. If the LLM powering our email drafts changes overnight—say, becoming more formal—QC tests might fail unexpectedly. Without the ability to pin to a specific model version, teams must continuously revalidate outputs, increasing testing overhead. In our example, QC might need to retest email tone weekly to ensure it aligns with brand guidelines, a burden traditional software rarely imposes.

Agentic Complexity

Agentic AI’s interconnectedness amplifies QC challenges. If our Salesforce AI misinterprets a calendar API’s response (e.g., scheduling conflicts), the error might cascade, flooding a prospect with emails. QC must test not just individual components but their interactions, using techniques like:

·      Scenario-Based Testing: Simulate real-world workflows, like a prospect replying with a question, to verify system resilience.

·      Chaos Engineering: Introduce failures (e.g., API downtime) to ensure the AI recovers gracefully.

·      Tracing: Log intermediate outputs to pinpoint where errors occur, like a misparsed CRM field leading to an irrelevant email.

The Multiplier Effect of Agentic AI

Agentic AI, common in CRM systems like Salesforce, introduces a multiplier effect on both QA and QC. Each component—CRM data analysis, LLM-generated content, external tool integration—adds variability. When chained, small deviations can snowball. In our lead-nurturing example:

·      A slight bias in lead scoring (e.g., overemphasizing recent activity) might prioritize low-value prospects.

·      This could trigger off-tone emails, eroding trust.

·      Misaligned scheduling might then spam prospects, violating compliance.

QA must anticipate these cascades during design, defining clear handoffs between components (e.g., validating lead scores before email generation). QC must test the system holistically, ensuring errors don’t propagate. The complexity grows exponentially with more components, as each interaction creates new failure modes. Traditional software rarely faces this scale of interdependence, making agentic AI a frontier for quality practices.

Rethinking QA and QC for the AI Era

To navigate these challenges, businesses must evolve their approaches:

For QA:

·      Flexible Requirements: Define goals as ranges (e.g., “emails should convert 10-15% of leads”) rather than fixed outputs.

·      Stakeholder Collaboration: Involve diverse teams—sales, legal, ethics—to ensure the system aligns with multifaceted needs.

·      Ethical Audits: Regularly assess AI for bias, fairness, and compliance, especially in regulated domains like CRM.

For QC:

·      Probabilistic Metrics: Use statistical tools to evaluate output distributions, like sampling 1,000 emails to check quality.

·      Automated Monitoring: Deploy real-time checks to flag anomalies, such as a sudden drop in email relevance.

·      Hybrid Validation: Combine automated tests with human reviews to catch nuanced errors, like tone mismatches.

In our Salesforce example, QA might simulate a year of lead nurturing to refine the system’s objectives, while QC could run Monte Carlo simulations to map possible outcomes, identifying rare but costly failures. These methods, while resource-intensive, reflect the reality of AI-driven software.

Beyond Productivity: AI as a Paradigm Shift

AI’s impact transcends productivity. Automating email drafts or lead scoring saves time, but the true revolution lies in how it redefines software itself. Systems are no longer static artifacts but dynamic entities that learn, adapt, and occasionally surprise. This shift demands a corresponding evolution in QA and QC, moving from deterministic checklists to probabilistic frameworks that embrace uncertainty.

In CRM, where customer trust is the currency, getting this right is critical. Our Salesforce agentic AI must nurture leads effectively (QA’s domain) and execute flawlessly (QC’s responsibility). Missteps—whether prioritizing the wrong prospects or sending tone-deaf emails—can erode relationships built over years. By rethinking quality practices, businesses can harness AI’s potential while safeguarding what matters most.

Conclusion

AI, particularly agentic AI, is not just a tool but a new way of building software. Its non-determinism and interconnectedness challenge traditional QA and QC, demanding approaches that balance flexibility with rigor. In the CRM world, where Salesforce powers mission-critical processes, these changes are both a risk and an opportunity. By designing systems that anticipate variability, testing them probabilistically, and aligning them with user needs, we can build AI-driven software that’s not only innovative but trustworthy. The journey is complex, but the destination—a world where software evolves alongside its users—is worth the effort.

Salesforce’s Potential Informatica Acquisition: A Strategic Play for Data Dominance or a Costly Misstep?

Salesforce’s rumored acquisition of Informatica, a leader in data integration and master data management (MDM), has the tech world buzzing. With talks reignited in May 2025 after last year’s pricing fumble, this move could reshape Salesforce’s Data Cloud ambitions and redefine its role in the AI-driven enterprise landscape. But is it a masterstroke to close Salesforce’s MDM gap and turbocharge its Customer 360 vision, or a risky bet that could cannibalize its own MuleSoft and alienate cost-conscious customers? As a strategist, I see this as a high-stakes play that could either solidify Salesforce’s data dominance or expose cracks in its acquisition playbook. Let’s unpack the strategic rationale, the lessons from Salesforce’s past, and how mid-market companies and large enterprises will weigh this decision—backed by data that cuts through the hype.

The Strategic Rationale: Filling the MDM Void and Fueling AI

Salesforce has built a CRM empire, but its MDM capabilities are a glaring weakness. Informatica’s Intelligent Data Management Cloud (IDMC), with its robust MDM and ETL/ELT data orchestration, could plug this gap, enabling clean, unified data to power Salesforce’s Data Cloud and AI-driven Agentforce platform. This isn’t just about data plumbing—it’s about making enterprises AI-ready. As I’ve argued, poor data quality and fragmented systems are the Achilles’ heel of companies chasing the Customer 360 dream: accurate churn models, propensity-to-buy predictions, and next-best-action recommendations. Informatica’s extensive connectors to diverse data sources—think on-premises databases to cloud apps—could make Data Cloud a one-stop shop for enterprises drowning in data chaos.

But here’s the rub: Salesforce already owns MuleSoft, a $6.5 billion integration powerhouse that’s powerful but pricey and complex. MuleSoft’s API-led process orchestration is a beast for enterprise integrations, but its steep learning curve and high cost have driven some customers to cheaper alternatives like Boomi or even n8n. Informatica’s data orchestration, by contrast, is simpler, focusing on ETL/ELT pipelines that align with the market’s craving for accessibility. This could be a lifeline for customers frustrated by MuleSoft’s complexity, but it risks cannibalizing Salesforce’s own portfolio if not positioned carefully. And with AI evolving to automate integrations, the long-term value of middleware itself is under scrutiny. Is Salesforce doubling down on yesterday’s tech, or is Informatica the bridge to an AI-driven future?

Lessons from the Past: Slack’s Overreach and Vlocity’s Niche

Salesforce’s acquisition history is a rollercoaster, and Informatica’s fate hinges on learning from both triumphs and missteps. Take Vlocity, acquired for $1.33 billion in 2020 to conquer industry verticals like telecom and insurance. Rebranded as Salesforce Industries, it powers solutions like Financial Services Cloud, but let’s be blunt: it’s a niche player at best. High implementation costs and complexity have kept mid-market firms at bay, leaving enterprise heavyweights like T-Mobile and AXA as its main fans. I called it—adoption is lackluster outside big players, and Salesforce can’t afford to repeat this with Informatica.

Then there’s Slack, the $27.7 billion blunder championed by former co-CEO Bret Taylor. Taylor, who joined Salesforce via the $750 million Quip acquisition in 2016, sold Slack as the future of collaboration, but the price was astronomical, and integration has been a slog. Slack’s $1 billion revenue contribution is dwarfed by its cost, earning it a measly 3.6% revenue-to-cost ratio and activist investor scorn. Quip, Taylor’s brainchild, fares even worse—barely a blip in Salesforce’s portfolio, overshadowed by Google Docs and Microsoft 365. Taylor’s vision for Customer 360 and leadership in MuleSoft ($6.5B) and Tableau ($15.7B) deals deserve credit, but his track record with Slack and Quip raises red flags. Without him—Taylor left in 2022—Salesforce’s current leadership, like David Schmaier of Salesforce Industries, must avoid overpaying or mismanaging Informatica’s integration. Successes like ExactTarget ($2.5B, now Marketing Cloud) and Demandware ($2.8B, now Commerce Cloud) show Salesforce can nail acquisitions when it prioritizes speed and clarity, a blueprint for Informatica.

The Numbers Tell a Story: Revenue Contributions and ROI

To ground this in data, let’s look at two charts that reveal Salesforce’s acquisition performance and what Informatica might bring.

Chart 1: Absolute Revenue Contributions (FY24)

Article content

Salesforce Acquisition Revenue Chart

This bar chart shows estimated FY24 revenue contributions from key acquisitions: – Slack: ~$1.0B (pre-acquisition baseline) – ExactTarget: ~$0.7B (Marketing Cloud) – Tableau: $0.64B (Analytics Cloud, FY23 actual) – MuleSoft: $0.62B (Integration Cloud, FY23 actual) – Demandware: ~$0.5B (Commerce Cloud) – Vlocity: ~$0.3B (Salesforce Industries) – Own: ~$0.2B (data backup)

Color Commentary: Slack’s $1 billion lead looks impressive, but it’s a mirage—its $27.7 billion price tag makes it a cautionary tale of overreach, thanks to Taylor’s rosy pitch. ExactTarget and Demandware shine, proving Salesforce can turn acquisitions into revenue engines when integration clicks. Vlocity’s $0.3 billion confirms my skepticism: it’s a niche win for enterprises but a no-show for mid-market firms. MuleSoft and Tableau hold steady, but their complexity echoes my concerns about middleware bloat. Informatica, with $1.6 billion in 2024 revenue, could rival Slack’s contribution, but only if Salesforce avoids another integration quagmire.

Chart 2: Revenue-to-Cost Ratio (FY24)

Article content

Salesforce Revenue-to-Cost Ratio Chart

This chart normalizes revenue by acquisition cost, showing ROI efficiency: – ExactTarget: 28.0% ($0.7B ÷ $2.5B) – Vlocity: 22.6% ($0.3B ÷ $1.33B) – Demandware: 17.9% ($0.5B ÷ $2.8B) – Informatica (Est.): 13.3% ($1.0B ÷ $7.5B) – Own: 10.5% ($0.2B ÷ $1.9B) – MuleSoft: 9.6% ($0.62B ÷ $6.5B) – Tableau: 4.0% ($0.64B ÷ $15.7B) – Slack: 3.6% ($1.0B ÷ $27.7B)

Color Commentary: ExactTarget and Vlocity top the list, proving smaller deals can pack a punch. Vlocity’s 22.6% ratio challenges my doubts about its adoption—its low cost makes it a quiet winner. Informatica’s estimated 13.3% ratio positions it as a smart bet, outpacing MuleSoft and Tableau, but it’s no ExactTarget. Slack’s 3.6% ratio is a glaring reminder of Taylor’s overpayment, fueling investor skepticism. Activist investors, still licking wounds from Slack, will demand Informatica delivers fast to justify its $7-8 billion price tag.

Mid-Market vs. Large Enterprises: A Tale of Two Decision Processes

The Informatica acquisition’s success hinges on how different market segments perceive it. Mid-market companies and large enterprises have distinct priorities, and Salesforce must navigate these carefully.

· Mid-Market Companies:

o Priorities: Cost, simplicity, and speed of implementation are king. Mid-market firms, often resource-constrained, shy away from MuleSoft’s complexity and high price tag, as I’ve noted. They’re drawn to solutions like Boomi for their affordability and ease. Informatica’s SaaS-based IDMC could appeal here, offering simpler data orchestration to prep for AI without breaking the bank. However, implementation hurdles (my point about change management) could deter adoption if Salesforce doesn’t streamline onboarding.

o Decision Factors: Mid-market buyers will ask: Can Informatica integrate with our existing systems without a PhD in data engineering? Is it cost-competitive with Boomi or n8n? Does it deliver quick wins for AI-driven insights? Salesforce must position Informatica as a plug-and-play solution, leveraging lessons from Vlocity’s mid-market struggles.

o Risk: If Informatica feels like another complex, costly add-on, mid-market firms will balk, echoing Vlocity’s limited traction outside enterprises.

· Large Enterprises:

o Priorities: Enterprises prioritize scale, compliance, and strategic alignment with AI and Customer 360 goals. Informatica’s MDM and industry-specific strengths (healthcare, finance) align with my point about new verticals, appealing to firms like Humana or AXA. They’re less price-sensitive but demand robust governance and integration with legacy systems, where MuleSoft and Informatica could complement each other—if Salesforce clarifies their roles (process vs. data orchestration).

o Decision Factors: Enterprises will evaluate: Can Informatica unify our sprawling data for AI-driven churn models and recommendations? Does it integrate seamlessly with Data Cloud and Agentforce? Will it reduce our reliance on multiple vendors? Salesforce’s success with ExactTarget and Demandware shows it can win enterprises with clear value propositions.

o Risk: Overlap with MuleSoft could confuse enterprises, as I’ve warned, and slow integration (à la Slack) could erode trust.

The AI Wildcard and the Middleware Question

As AI evolves, middleware’s future is shaky. Agentic AI, like Salesforce’s Agentforce, could automate integrations, reducing the need for traditional tools like MuleSoft or Informatica. But for now, clean, unified data is the fuel for AI success, and Informatica’s CLAIRE AI and MDM capabilities make it a linchpin for enterprises chasing AI readiness. This isn’t about old tech—it’s about enabling the next wave of innovation. Salesforce must market Informatica as the first step in an AI adoption journey, not a legacy band-aid. AI’s Looming Disruption: As AI slashes the cost of building custom enterprise apps, the traditional SaaS model—build once, configure—faces an existential threat. Informatica’s SaaS strengths could keep it ahead, but Salesforce must innovate beyond middleware to outrun AI-native upstarts. The real question: is enterprise software’s “context” era nearing its end?

The Verdict: Bold Move, But Execution is Everything

Salesforce’s Informatica play is a calculated bet to dominate the data cloud space, filling its MDM gap and supercharging Data Cloud for AI-driven outcomes. The charts show Salesforce can extract value from acquisitions, but ROI varies wildly—ExactTarget’s 28% return dwarfs Slack’s 3.6%. Informatica’s projected 13.3% ratio is promising, but only if Salesforce avoids Vlocity’s adoption pitfalls and Slack’s integration delays. Mid-market firms will demand simplicity and affordability, while enterprises will seek scale and strategic fit. The risk of cannibalizing MuleSoft looms large, and activist investors will pounce if costs balloon.

My take? Salesforce is playing chess, not checkers, but it’s a high-stakes game. Informatica could be the key to unlocking Customer 360’s holy grail—accurate, actionable insights—but only if Salesforce learns from previous missteps and executes with precision. The market’s watching, and so am I.