Grok 4

Technology

An advanced large language model from xAI that has shown significant performance improvements, positioning it as a top competitor to OpenAI's models.

First Mentioned

7/12/2025, 4:40:57 AM

Last Updated

8/20/2025, 4:55:28 AM

Research Retrieved

7/12/2025, 5:00:15 AM

Summary

Grok 4 is an advanced artificial intelligence model developed by xAI, Elon Musk's AI company. It is reportedly trained on the Colossus supercomputer and has demonstrated benchmark performance that surpasses competitors like OpenAI and Google's Gemini. This achievement aligns with "The Bitter Lesson," a principle suggesting that scalable computation is more effective than relying on human-labeled data, a strategy also employed by Tesla's FSD and in the development of LLMs using synthetic data. Grok 4 originates from the Grok chatbot, which was launched in November 2023 and is integrated with the social media platform X, known for providing unfiltered answers. Grok 4 includes specialized variants such as "Grok 4 Code" for software development and "Grok 4 Heavy" which utilizes collaborative multi-agent systems to solve complex problems, and has achieved high scores in various academic and abstract reasoning benchmarks.

Referenced in 3 Documents

Research Data

Extracted Attributes

Developer
xAI
Model Type
Generative Artificial Intelligence chatbot (Large Language Model)
Integration
X (social media platform)
Key Feature
Provides unfiltered answers
Availability
Via API
Core Principle
The Bitter Lesson (scalable computation over human-labeled data)
Parent Company CEO
Elon Musk
Claimed Performance
Capable of perfect SAT scores and near-perfect GRE results across subjects
Specialized Variant
Grok 4 Heavy (uses collaborative multi-agent systems)
Training Supercomputer
Colossus
Benchmark Performance (AIME)
100% (compared to Grok 3's 52.2%)
Benchmark Performance (GPQA)
87% (compared to Grok 3's 75.4%)
Benchmark Performance (Humanity's Last Exam)
25.4% (outperforming Google's Gemini 2.5 Pro at 21.6% and OpenAI's o3 at 21%)
Benchmark Performance (ARC-AGI-1 & ARC-AGI-2)
Top-performing publicly available model
Benchmark Performance (Artificial Analysis Intelligence Index)
Highest ranked, slightly ahead of Gemini 2.5 Pro and OpenAI's o4-mini-high

Timeline

Grok, the chatbot from which Grok 4 originates, was launched by xAI. (Source: user summary, Wikipedia)
2023-11-01
Elon Musk unveiled Grok 4, calling it 'the smartest AI in the world' and touting its advanced capabilities. (Source: web search results)
2025-07-09

Wikipedia

View on Wikipedia

Grok (chatbot)

Grok is a generative artificial intelligence chatbot developed by xAI. It was launched in November 2023 by Elon Musk as an initiative based on the large language model (LLM) of the same name. Grok is integrated with the social media platform X, formerly known as Twitter, and has apps for iOS and Android. The bot is named after the verb grok, coined by American author Robert A. Heinlein in his 1961 science fiction novel Stranger in a Strange Land to describe a form of understanding. The bot, which is marketed as providing "unfiltered answers", has generated various controversial responses in its history.

Web Search Results

The Emergence of Grok 4: A Deep Dive into xAI's Flagship AI Model
Educational Institutions: Grok 4 is envisioned as an “advanced tutoring system” capable of explaining complex concepts across multiple disciplines. Its ability to provide step-by-step logical progressions makes it particularly valuable for STEM education applications. [...] For software development, Grok 4 introduces a specialized variant known as “Grok 4 Code”. This version is designed to integrate with development tools like the Cursor editor, offering sophisticated code generation, debugging assistance, and programming support. Its capabilities extend beyond basic syntax completion to include architectural design recommendations, performance optimization suggestions, and automated testing strategies. The system can also analyze existing codebases and suggest [...] AIME (American Invitational Mathematics Examination): Grok 4 achieved a perfect 100% score, a significant improvement over Grok 3’s 52.2%. This near-perfect result is highlighted as surpassing human experts. GPQA (Graduate-Level Physics Question Answering): Grok 4 scored 87%, compared to Grok 3’s 75.4%. This demonstrates a deep scientific understanding and cross-disciplinary knowledge.
10 Grok 4 Key Insights : Future of AI or Just Another Overhyped Tech?
What if the tools shaping tomorrow’s world weren’t just evolving—they were rewriting the rules entirely? Enter Grok 4, the latest AI model from XAI, which has already set a new benchmark in artificial intelligence. Whether it’s outperforming competitors in academic problem-solving or introducing new collaborative multi-agent systems, Grok 4 promises to redefine what’s possible in computational tasks. Yet, this innovation doesn’t come without its share of challenges. From ethical dilemmas to [...] Grok 4 has redefined performance standards by excelling in academic benchmarks, surpassing competitors such as OpenAI and Google in areas like high school-level math, science, and coding tasks. Its notable achievement in the ARK AGI2 test—a rigorous evaluation of abstract problem-solving—demonstrates its ability to tackle complex intellectual challenges. Additionally, Grok 4’s proficiency in text-based processing makes it an invaluable tool for tasks such as document analysis, content [...] One of Grok 4’s most innovative features is its use of collaborative multi-agent systems, a design that enables multiple AI agents to work together to solve complex problems. This approach enhances the model’s performance in tasks requiring coordination and advanced problem-solving. For instance, Grok 4 Heavy, a specialized version of the model, uses this system to tackle intricate challenges more effectively.
New Grok 4 Takes on 'Humanity's Last Exam' as the AI Race Heats Up
Artificial Analysis, an independent benchmarking platform that ranks AI models, now lists Grok 4 as highest on its Artificial Analysis Intelligence Index, slightly ahead of Gemini 2.5 Pro and OpenAI’s o4-mini-high. And Grok 4 appears as the top-performing publicly available model on the leaderboards for the Abstraction and Reasoning Corpus, or ARC-AGI-1, and its second edition, ARC-AGI-2—benchmarks that measure progress toward “humanlike” general intelligence. Greg Kamradt, president of ARC [...] Elon Musk released the newest artificial intelligence model from his company xAI on Wednesday night. In an hour-long public reveal session, he called the model, Grok 4, “the smartest AI in the world” and claimed it was capable of getting perfect SAT scores and near-perfect GRE results in every subject, from the humanities to the sciences. [...] Elon Musk has launched xAI’s Grok 4—calling it the “world’s smartest AI” and claiming it can ace Ph.D.-level exams and outpace rivals such as Google’s Gemini and OpenAI’s o3 on tough benchmarks By Deni Ellis Béchardedited by Dean Visser Image 1: Digital illustration, structure made of cubes evolves from simple (on the left) to gradually a more complex shape of a thinking or contemplating person seated on a rock Floriana/Getty Images Artificial Intelligence
Musk unveils Grok 4 update a day after xAI chatbot made antisemitic ...
Anne Marie D. Lee Updated on: July 10, 2025 / 10:05 AM EDT / CBS News Details on Grok's antisemitic posts Image 2 What to know about antisemitic comments posted by Grok, Elon Musk's AI chatbot 04:34 Elon Musk on Wednesday unveiled Grok 4, a new version of his X platform's AI chatbot. The update comes a day after the bot posted antisemitic content on the social media network. [...] Musk introduced the new model in a livestream on X late Wednesday, calling Grok 4 "the smartest AI in the world." "It really is remarkable to see the advancement of artificial intelligence and how quickly it is evolving," Musk said, adding that "AI is advancing vastly faster than any human." He touted the model's virtues, claiming that if it were to take the SATs, it would achieve perfect scores every time, and also outsmart nearly every graduate student across disciplines.
Elon Musk's xAI launches Grok 4 alongside a $300 monthly ...
xAI claims that Grok 4 shows frontier level performance on several benchmarks, including Humanity’s Last Exam— a challenging test measuring AI’s ability to answer thousands of crowdsourced questions on subjects like math, humanities, and natural science. According to xAI, Grok 4 scored 25.4% on Humanity’s Last Exam without “tools,” outperforming Google’s Gemini 2.5 Pro, which scored 21.6%, and OpenAI’s o3 (high), which scored 21%. [...] xAI launched two models on Wednesday: Grok 4 and Grok 4 Heavy — the latter being the company’s “multi-agent version” that offers increased performance. Musk claimed that Grok 4 Heavy spawns multiple agents to work on a problem simultaneously, and then they all compare their work “like a study group” to find the best answer. [...] xAI is releasing Grok 4 through its API in an effort to get developers to build applications with the model. The company notes that xAI’s enterprise sector is only two months old, however, it plans to work with hyperscalers to make Grok available through their cloud platforms.

Location Data

GROK, 下堀川飴屋町線, 上前津一丁目, 中区, 名古屋市, 愛知県, 460-0018, 日本

pub

Coordinates: 35.1552196, 136.9026549

Open Map

Grok 4

First Mentioned

Last Updated

Research Retrieved

Summary

Referenced in 3 Documents

Research Data

Extracted Attributes

Developer

Model Type

Integration

Key Feature

Availability

Core Principle

Parent Company CEO

Claimed Performance

Specialized Variant

Training Supercomputer

Benchmark Performance (AIME)

Benchmark Performance (GPQA)

Benchmark Performance (Humanity's Last Exam)

Benchmark Performance (ARC-AGI-1 & ARC-AGI-2)

Benchmark Performance (Artificial Analysis Intelligence Index)

Timeline

Wikipedia

Grok (chatbot)

Web Search Results

Location Data