Elon Musk’s AI company, xAI, late on Wednesday launched its newest flagship AI model, Grok 4, and unveiled a new $300-per-month AI subscription plan, SuperGrok Heavy.
Grok is xAI’s answer to models like OpenAI’s ChatGPT and Google’s Gemini, and can analyze images and respond to questions. In recent months, Grok has become more deeply integrated into Musk’s social network, X, which was recently acquired by xAI. However, that has also put Grok’s misbehavior front and center for millions of users.
Expectations are high for Grok 4. The latest AI model from xAI will be stacked up against OpenAI’s forthcoming AI model, GPT-5, which is expected to launch later this summer.
“With respect to academic questions, Grok 4 is better than PhD level in every subject, no exceptions,” said Elon Musk during a livestream Wednesday night. “At times, it may lack common sense, and it has not yet invented new technologies or discovered new physics, but that’s just a matter of time.”
Musk wore a leather jacket and sat beside xAI leaders to launch Grok 4 (Credit: xAI)
The launch of Grok 4 comes amid a tumultuous week for Elon Musk’s companies. Earlier on Wednesday, Linda Yaccarino stepped down from her role as the CEO of X after roughly two years with the company. X has yet to announce her successor.
Yaccarino’s departure comes just days after Grok’s official, automated X account responded to users with antisemitic comments criticizing Hollywood’s “Jewish executives” and praising Hitler. xAI had to briefly limit Grok’s account and delete the offensive posts. In response to the incident, xAI appeared to have removed a recently added section from Grok’s public system prompt, a list of instructions for the AI chatbot to follow, that told it not to shy away from making “politically incorrect” claims.
Musk and xAI’s leaders largely avoided discussing the incident, instead focusing on Grok 4’s performance and capabilities.
xAI launched two models on Wednesday: Grok 4 and Grok 4 Heavy, the latter being the company’s “multi-agent version” that offers increased performance. Musk claimed that Grok 4 Heavy spawns multiple agents to work on a problem simultaneously, and then they all compare their work “like a study group” to find the best answer.
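xAI has not published how Grok 4 Heavy coordinates its agents, so the following is only a minimal sketch of the general pattern Musk describes: several agents answer the same problem in parallel, and the answers are then compared to pick one. The agents here are hypothetical stand-in functions, and the majority vote is an assumed comparison rule, not xAI's actual method.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List

# An "agent" is modeled as any function from a problem to an answer.
# This is an illustrative assumption, not xAI's API.
Agent = Callable[[str], str]


def solve_with_agents(problem: str, agents: List[Agent]) -> str:
    """Run every agent on the same problem in parallel, then return
    the most common answer (assumed rule: simple majority vote)."""
    with ThreadPoolExecutor(max_workers=len(agents)) as pool:
        answers = list(pool.map(lambda agent: agent(problem), agents))
    # most_common(1) yields the answer with the highest count;
    # ties go to the answer that appeared first.
    return Counter(answers).most_common(1)[0][0]


# Hypothetical agents that stand in for real model calls.
def careful_agent(problem: str) -> str:
    return "4"


def sloppy_agent(problem: str) -> str:
    return "5"


answer = solve_with_agents(
    "What is 2 + 2?", [careful_agent, sloppy_agent, careful_agent]
)
print(answer)
```

In this toy run, two of the three agents agree, so their shared answer wins. A real system would likely compare reasoning as well as final answers, which a plain vote does not capture.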
xAI claims that Grok 4 shows frontier-level performance on several benchmarks, including Humanity’s Last Exam, a challenging test measuring AI’s ability to answer thousands of crowdsourced questions on subjects like math, humanities, and natural science. According to xAI, Grok 4 scored 25.4% on Humanity’s Last Exam without “tools,” outperforming Google’s Gemini 2.5 Pro, which scored 21.6%, and OpenAI’s o3 (high), which scored 21%.
xAI claims that Grok 4 Heavy, with “tools,” was able to achieve a score of 44.4%, outperforming Gemini 2.5 Pro with tools, which scored 26.9%.
The nonprofit Arc Prize says that Grok 4 achieves a new state-of-the-art score of 16.2% on its ARC-AGI-2 test, another difficult benchmark that consists of puzzle-like problems where an AI has to identify visual patterns. That’s nearly twice the score of the next best commercial AI model, Claude Opus 4.
Grok 4’s performance on several academic benchmarks (Credit: xAI)
Alongside Grok 4 and Grok 4 Heavy, xAI launched its most expensive AI subscription plan yet, a $300-per-month subscription called SuperGrok Heavy. Subscribers to the plan will get an early preview of Grok 4 Heavy, as well as early access to new features. The plan is similar to ultra-premium tiers offered by OpenAI, Google, and Anthropic, but xAI now offers the most expensive subscription among major AI providers.
SuperGrok Heavy subscribers may get early access to some new products xAI plans to launch in the coming months. The company said Wednesday that an AI coding model is coming in August, a multi-modal agent in September, and a video generation model in October.
xAI is releasing Grok 4 through its API in an effort to get developers to build applications with the model. The company notes that xAI’s enterprise sector is only two months old; however, it plans to work with hyperscalers to make Grok available through their cloud platforms.
Despite Grok’s frontier-level performance on benchmarks, it may prove difficult for xAI to move past its recent mishaps as it tries to pitch Grok to businesses as a real contender to ChatGPT, Claude, and Gemini. Whether businesses are ready to adopt Grok, flaws and all, remains to be seen.