Salesforce AI Research announces a framework to optimize agent capability and consistency through synthetic data, realistic testing, and reinforcement learning.
Even as AI models grow more sophisticated, a curious challenge persists: systems that solve PhD-level mathematics struggle with surprisingly simple tasks. Ask a leading language model the famous riddle “Where does Christmas come before Thanksgiving?” and it correctly answers “in the dictionary,” because alphabetically, ‘C’ precedes ‘T.’
But swap the words and ask “Where does Thanksgiving come before Christmas?” and the same model confidently explains that “in the dictionary, Thanksgiving comes before Christmas alphabetically.” This phenomenon, which we call “jagged intelligence,” reveals sharp peaks of brilliance alongside unexpected valleys of weakness.
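The swapped riddle has a false premise, which is easy to confirm mechanically. The following is a minimal sketch, assuming that case-insensitive string comparison is an adequate stand-in for dictionary order; the helper name `comes_first_in_dictionary` is hypothetical and chosen only for illustration.

```python
def comes_first_in_dictionary(a: str, b: str) -> bool:
    """Return True if word `a` sorts before word `b` alphabetically.

    Assumption: case-insensitive lexicographic comparison approximates
    dictionary order well enough for these two words.
    """
    return a.casefold() < b.casefold()


# Original riddle: the premise holds, so "in the dictionary" is a valid answer.
original_premise = comes_first_in_dictionary("Christmas", "Thanksgiving")

# Swapped riddle: the premise fails, so "in the dictionary" is the wrong answer.
swapped_premise = comes_first_in_dictionary("Thanksgiving", "Christmas")

print(original_premise, swapped_premise)
```

A model that gives the same answer to both versions is pattern-matching on the riddle's familiar shape and never checks the premise.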
For enterprise businesses, this inconsistency isn’t academic; it’s operational. When AI agents handle customer service calls, process sales workflows, or manage healthcare billing, jagged intelligence creates real business risk. An agent might flawlessly handle complex multi-step tasks one moment, then stumble on straightforward requests the next. This unpredictability is a dealbreaker for enterprises where reliability matters as much as capability.
At Salesforce AI Research, we’ve developed a new method to mitigate these risks. Today, we’re announcing eVerse: an enterprise simulation framework that trains AI agents like elite athletes, optimizing them for both capability and consistency through three interconnected steps: Synthesize, Measure, and Train.
eVerse: Synthesize – Building the Enterprise “Digital Twin”
Training best-in-class AI agents requires best-in-class training environments. Just as Formula 1 drivers spend thousands of hours in sophisticated simulators before competing at Monaco, enterprise AI agents need realistic practice grounds that mirror the complexity of actual business operations.
Because trust is Salesforce’s #1 value, we’ve designed a training approach that never puts your real data at risk. Our recent research work with CRMArena-Pro is a good example. It creates completely synthetic training grounds with realistic customer data, multi-step workflows, and the edge cases that make business operations unpredictable. Agents learn in environments that mirror real enterprise systems, while your data and your customers’ data stay private, secure, and completely untouched. Learn more about our work in simulation environments in my recent blog, The New AI Agent Training Ground: Simulating Enterprise Environments.
The validation speaks for itself: 90% of domain experts rate our synthetic data generation as realistic or very realistic. Even more telling: the majority of the demos you’re seeing at Dreamforce this week use synthetic data generated by CRMArena-Pro.
eVerse: Measure – Stress-Testing in Realistic Scenarios
Synthesis alone isn’t enough. We must rigorously measure agent performance across the scenarios that matter most to enterprises. This includes one of the most critical, and most challenging, modalities: voice interactions.
Voice conversations introduce layers of complexity that text-based testing misses: background noise, diverse accents, translation errors, poor connections, and multiple speakers. eVerse simulates these realistic voice interactions, generating synthetic phone conversations that sound remarkably human while testing agents against comprehensive enterprise scenarios.
This measurement infrastructure operates behind the scenes throughout Salesforce. It’s how we validated Agentforce voice capabilities before launch, running thousands of synthetic conversations to ensure agents could handle real-world complexity with both high capability and unwavering consistency.
eVerse: Train – Closing Performance Gaps with Human Expertise
After measurement reveals performance gaps, eVerse’s training engine closes them through reinforcement learning guided by human expertise. Our research has demonstrated remarkable improvements using this method: a 69-percentage-point gain on enterprise tasks, with success rates rising from 19% to 88%. We’re currently piloting eVerse with customers. One example is UCSF Health, where we’re partnering with human experts to train and refine AI that helps simplify and improve the healthcare billing experience.
“When used responsibly, we believe AI can help our teams simplify one of the most complex parts of healthcare, creating a billing experience that feels more seamless and truly patient-centered.”
Sara Murray, MD, MAS, VP & Chief Health AI Officer at UCSF Health
This continuous loop (synthesize environments, measure performance, train on gaps) transforms agents from generic language models into enterprise-specialized systems ready for production deployment.
The Path to Enterprise General Intelligence
This work advances our vision for what we call Enterprise General Intelligence (EGI): AI optimized for business applications that excels in both capability and consistency. While consumer AI prioritizes broad general-purpose capabilities, enterprise AI demands reliable performance across specific, complex, multi-step workflows where inconsistency carries real business risk.
eVerse addresses this by moving agents along both dimensions simultaneously. Generic LLM agents underperform in enterprise settings: high capability with low consistency creates the “prodigy” problem, brilliant when it works and unreliable when it matters. eVerse-trained agents reach the “champion” quadrant: high capability combined with high consistency, exactly what enterprises require.
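One way to make the two dimensions concrete is to score an agent over repeated runs of the same tasks. The sketch below is illustrative only and is not eVerse’s actual scoring. It assumes capability is the mean per-task success rate, consistency is the fraction of tasks whose repeated runs all produce the same outcome, and that 0.8 is the cutoff on both axes. The function names, the threshold, and the sample data are hypothetical. The text names only the “prodigy” and “champion” quadrants, so the other two get neutral labels.

```python
def capability(trials: dict[str, list[bool]]) -> float:
    """Mean per-task success rate (assumed definition of capability)."""
    rates = [sum(runs) / len(runs) for runs in trials.values()]
    return sum(rates) / len(rates)


def consistency(trials: dict[str, list[bool]]) -> float:
    """Fraction of tasks whose repeated runs all agree (assumed definition)."""
    stable = [len(set(runs)) == 1 for runs in trials.values()]
    return sum(stable) / len(stable)


def quadrant(trials: dict[str, list[bool]], threshold: float = 0.8) -> str:
    """Place an agent in a capability/consistency quadrant."""
    high_cap = capability(trials) >= threshold
    high_con = consistency(trials) >= threshold
    if high_cap and high_con:
        return "champion"
    if high_cap:
        return "prodigy"
    if high_con:
        return "low capability, high consistency"
    return "low capability, low consistency"


# Hypothetical results: five repeated runs per task, True means success.
prodigy_runs = {
    "refund": [True, True, True, True, False],
    "billing": [True, True, True, False, True],
    "routing": [True, True, True, True, True],
}
champion_runs = {task: [True] * 5 for task in prodigy_runs}

print(quadrant(prodigy_runs))
print(quadrant(champion_runs))
```

In this made-up data the first agent succeeds on most runs but fails unpredictably on two of the three tasks, so it has high capability and low consistency. The second succeeds on every run.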
The Competitive Imperative
The organizations that will lead in the agentic AI era won’t necessarily be those with the most advanced models. They’ll be those that recognized early that enterprise AI excellence requires sophisticated training environments bridging the gap between simulation and reality.
This body of research, from eVerse to voice simulation to reinforcement learning from human feedback, represents Salesforce’s commitment to making AI agents genuinely enterprise-ready: trustworthy, reliable, and grounded in enterprise business intelligence. The future belongs to agents trained in environments that simulate millions of realistic business scenarios, validated by domain experts, and continuously refined through real-world feedback loops.
We’re sharing eVerse at Dreamforce because our research advances through continuous customer engagement. The human feedback that trains agents in eVerse comes from our customers’ domain experts, the same organizations that will deploy these systems. This partnership between research and practice is how enterprise AI becomes genuinely reliable.
Join us as we shape what enterprise-ready AI agents can become.

