On July 9, 2025, xAI hosted the much awaited Grok 4 Demo Livestream, unveiling its next-generation artificial intelligence model to a global audience. Though delayed by nearly an hour, the unscripted event delivered a comprehensive overview of Grok 4’s innovative features, performance benchmarks, and potential applications.
Two powerful variants: Grok 4 and Grok 4 Code
xAI introduced two distinct versions of Grok 4:
General-purpose Grok 4: Designed for tasks like writing, research, and general productivity, this model aims to support a wide range of use cases from education to content creation.
Grok 4 Code: Tailored specifically for developers, it offers advanced coding support including real-time code generation, debugging, and integration with tools such as Cursor.
Performance benchmarks
Despite being unofficial, leaked performance data places Grok 4 ahead of the competition:
Humanity’s Last Exam: 45% score, more than double that of Google’s Gemini 2.5 Pro (21%).
AIME’25 (mathematics): 95% accuracy.
GPQA (graduate-level question answering): 88% accuracy.
These scores suggest that Grok 4 not only rivals but may surpass top-tier models like GPT-4o in reasoning and precision.
Multimodal ambitions
While the current model focuses on text-based tasks, xAI hinted at upcoming capabilities in:
Vision image generation and video/audio processing
This would bring Grok 4 into the realm of fully multimodal AI systems, enabling richer user experiences across different media formats.
Real-world applications
Grok 4 demonstrated fluency across several high-stakes domains, including:
finance, healthcare, legal research and scientific analysis.
Grok 4 being integrated into Tesla vehicles
Enterprise features include data extraction, summarization, and knowledge synthesis. Notably, there are rumors of Grok 4 being integrated into Tesla vehicles and even aiding in AI-powered game development.
The launch comes in the wake of earlier backlash surrounding Grok’s political bias. In response, xAI has updated its alignment strategies, aiming for balanced and less media-influenced outputs.
Initially, Grok 4 will be accessible to subscribers of X Premium+ and SuperGrok, with broader access likely to follow. Viewers of the livestream described it as “stream-of-consciousness,” reflecting xAI’s more relaxed, impromptu approach compared to traditional tech demos.
With its fast processing, improved reasoning, and expansive capabilities, Grok 4 signals xAI’s serious intent to compete with industry leaders in the AI space. As its multimodal features roll out and enterprise integrations deepen, Grok 4 could prove to be a pivotal force in the evolving AI landscape.