Sep 18, 2025

AI in Salesforce (2025): Beyond LLM + RAG

Sep 18, 2025


AI inside Salesforce is moving from copilots to agentic systems that can plan and act - but real‑world results show big gaps if you simply wire an LLM to your knowledge base with RAG. Multi‑turn tasks, confidentiality, governance, and workflow execution demand a complete system: unified data, the right models, automation, and guardrails - not an LLM in isolation.

“Can’t we just connect an LLM to our Salesforce with RAG and call it a day?”

Short answer: No and recent evidence shows why.

A 2025 benchmark designed for CRM‑style work (CRMArena‑Pro) tested LLM agents on realistic sales, service, and CPQ scenarios (B2B and B2C). Headline results: ~58% success on single‑turn tasks, dropping to ~35% in multi‑turn interactions. Agents showed near‑zero inherent confidentiality awareness unless heavily prompted—at which point task performance often suffered. In other words, real enterprise tasks need more than retrieval and a good prompt. arXiv

What RAG does: gives the model access to up‑to‑date facts.
What RAG doesn’t guarantee: multi‑step reasoning, policy compliance, safe refusals, or action inside your Salesforce workflows (Flows, Apex, MuleSoft).

If you’re curious about the original idea behind RAG, it was formalized in 2020 to blend parametric model knowledge with retrieved documents—great for factual grounding, but not a substitute for business logic or governance. arXiv

“So what is working for enterprise Salesforce orgs in 2025?”

Short answer: Agentic platforms tied to your CRM data, with governance and automation, are gaining traction.

  • Agentforce (Salesforce’s agent platform) went from announcement at Dreamforce ’24 to a growing feature set (Summer ’25 brought hundreds of new capabilities), underscoring the market’s pivot from chat to agents that plan and act across channels and apps.

  • Adoption momentum: daily AI use among desk workers jumped 233% in six months (Slack Workforce Index, June 26, 2025), with 60% now using AI and 40% using AI agents, a cultural shift that makes agent rollouts stick.

  • Business impact: Salesforce’s 2024 State of Sales reported 83% of sales teams using AI saw revenue growth vs. 66% without AI, directional evidence that, when applied with the right data and workflows, AI correlates with better outcomes.

  • Ecosystem scale: IDC projects the “Salesforce economy” will generate $2.02T in net new business revenues (2022–2028) as AI permeates sales, service, and beyond—context for why enterprises are investing.

“Why exactly isn’t ‘LLM + RAG’ enough for Salesforce work?”

Short answer: Because enterprise work is interactive, governed, and operational—not just informational.

  • Multi‑turn clarity: Real cases require back‑and‑forth to resolve ambiguity; CRMArena‑Pro shows performance drops sharply when you leave the single‑shot comfort zone.

  • Confidentiality & refusal: Agents must recognize sensitive data and refuse or mask, most LLMs don’t do this “out of the box.”

  • Action & auditability: The value is in doing (updating records, triggering flows, handing off to humans) with logs and rollback, not just answering. Salesforce’s own guidance argues that enterprises need data + AI + automation + hand‑off, not standalone LLMs.

“What does a complete Salesforce AI system look like?”

Short answer: A stack that integrates trusted data, adaptable models, automation, and governance with humans in the loop.

  • Data: Unified, policy‑aware data (e.g. zero‑copy federation) for context and permissions.

  • AI: Fit‑for‑purpose models (yes, with RAG), plus reasoning and instruction adherence; pick models per region/use case as needed.

  • Automation & Actions: Salesforce Flow, Apex for real execution; agents that can plan, act, and hand off.

  • Governance: Evaluation against benchmarks like CRMArena‑Pro before going live.