Grok 4: The World's Most Advanced AI

The latest in the Grok series, Grok 4 offers unprecedented logic, multimodal understanding, coding support, and benchmark-breaking performance.

Trusted by experts in AI, ML, and development

Grok 4 AI Terminal

> How can Grok 4 AI assist in solving complex problems?

Grok 4 combines advanced logical reasoning with multimodal capabilities to tackle your most challenging problems:

• Near-perfect scores on SAT, GRE, and professional exams
• Sophisticated code generation with debugging capabilities
• Real-time web data analysis for up-to-date insights
• Multimodal understanding across text, image, and audio

Would you like me to demonstrate any specific capability?

Powerful Features of Grok 4

Explore the major capabilities of Grok 4 AI, from deep reasoning to multimodal input, network-aware search, and advanced coding support.

Advanced Logical Reasoning

Grok 4 features world-class logic and reasoning capabilities. In benchmark tests and real-world puzzles, including previously unseen SAT and GRE questions, Grok 4 achieves near-perfect accuracy on complex, multi-step problems.

Grok 4 Code: Intelligent Programming Assistant

Grok 4 code mode is built for developers. It offers smart code completion, syntax suggestions, debugging tips, and even full code generation. Need help writing a Python function or fixing a bug? Grok 4 AI has your back.

Multimodal Input & Output

Grok 4 supports rich multimodal interaction: upload an image and ask questions ("What's in this chart?"), or give audio prompts and get thoughtful spoken responses. It can even pick out subjective features, such as "weird face filters" on social media, which shows how deeply Grok 4 handles visual and audio data.

Real‑Time Network Search

Grok 4 AI actively browses real-time web data. For example, it can combine Polymarket odds with FanGraphs baseball statistics to estimate win probabilities and automatically flag market inefficiencies ('edges'), where its estimate differs from the market's implied probability.
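
The edge idea in the example above can be sketched in a few lines of Python. This is only an illustration, not Grok 4's actual method: it assumes a market price between 0 and 1 can be read as the implied probability of a "yes" outcome, and the market names, prices, model probabilities, and the 0.05 threshold are all hypothetical values.

```python
from dataclasses import dataclass


@dataclass
class Market:
    name: str
    market_price: float  # price of a "yes" share in dollars, 0..1 (hypothetical)
    model_prob: float    # win probability from a separate stats model (hypothetical)


def edge(market: Market) -> float:
    """Expected profit per $1 "yes" share: model probability minus market price."""
    return market.model_prob - market.market_price


def find_edges(markets, threshold=0.05):
    """Return (name, edge) pairs whose absolute edge meets the threshold, largest first."""
    flagged = [(m.name, round(edge(m), 4)) for m in markets if abs(edge(m)) >= threshold]
    return sorted(flagged, key=lambda pair: abs(pair[1]), reverse=True)


# Hypothetical inputs; real values would come from live market and stats data.
markets = [
    Market("Team A wins", 0.55, 0.62),
    Market("Team B wins", 0.40, 0.41),
    Market("Team C wins", 0.70, 0.58),
]
print(find_edges(markets))
```

A positive edge suggests the market underprices the outcome relative to the model, and a negative edge suggests it overprices it. How reliable the flag is depends entirely on the quality of the model probability.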

New Feature

Introducing Grok Imagine

Create stunning images and videos from text prompts with xAI's revolutionary AI generation tool. Built on the Aurora model for lightning-fast creation with minimal restrictions.

AI Image Generation

Create photorealistic, animated, and anime-style images from simple text descriptions in under 20 seconds.

Video Creation

Transform static images into dynamic 15-second videos with ambient sound effects and background music.

Creative Freedom

Choose between Normal mode for family-friendly content or Spicy mode for unrestricted creative expression.

Ready to Create?

Join thousands of creators using Grok Imagine to bring their ideas to life with AI-powered visual generation.

Grok 4 Benchmark Performance

Grok 4 sets new records across elite testing regimes and outperforms major AI competitors.

GPQA

In the GPQA (Graduate-Level Google-Proof Q&A) benchmark, Grok 4 Heavy achieved 88.9% accuracy, the highest score among the models listed.

o3 (no tool) 83.3%

Gemini 2.5 Pro 86.4%

Claude 4 79.6%

Grok 4 (no tool) 87.5%

Grok 4 Heavy 88.9%

AIME25

In the American Invitational Mathematics Examination benchmark, Grok 4 Heavy achieved a perfect 100% score.

o3 (no tool) 88.9%

Gemini (no tool) 88%

Claude 4 (no tool) 75.5%

Grok 4 (no tool) 91.7%

Grok 4 Heavy 100%

LCB (Jan-May)

In the LiveCodeBench (LCB) coding benchmark, Grok models took the top three places, with Grok 4 Heavy achieving the highest score of 79.4%.

o3 (no tool) 72%

Gemini 2.5 Pro 74.2%

Grok 4 (no tool) 79%

Grok 4 79.3%

Grok 4 Heavy 79.4%

HMMT25

In the Harvard-MIT Mathematics Tournament benchmark, Grok 4 Heavy achieved an outstanding 96.7% score.

o3 (no tool) 77.5%

Gemini 2.5 Pro 82.5%

Claude 4 Opus 58.3%

Grok 4 (no tool) 93.9%

Grok 4 Heavy 96.7%

USAMO25

In the USA Mathematical Olympiad benchmark, Grok 4 Heavy achieved the highest score of 61.9%.

o3 21.7%

Gemini 2.5 Pro 34.5%

Claude Opus 4 49.4%

Grok 4 37.5%

Grok 4 Heavy 61.9%

Performance Summary

Across all benchmarks, Grok 4 models demonstrate exceptional performance, with the Grok 4 Heavy version achieving top scores in multiple tests.

AIME25: Grok 4 Heavy 100% perfect score
HMMT25: Grok 4 Heavy 96.7% top performance
GPQA: Grok 4 Heavy 88.9% leads competitors
Grok 4 Heavy posts the highest score in every benchmark listed above, ahead of o3, Claude, and Gemini
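
To make the summary concrete, the following Python sketch computes how far Grok 4 Heavy leads the best non-Grok model in each benchmark. The scores are copied from the listings above. The dictionary layout and the function name `lead_over_best_rival` are illustrative choices, not part of any published tooling.

```python
# Scores in percent, copied from the benchmark listings above.
SCORES = {
    "GPQA": {"o3 (no tool)": 83.3, "Gemini 2.5 Pro": 86.4, "Claude 4": 79.6,
             "Grok 4 (no tool)": 87.5, "Grok 4 Heavy": 88.9},
    "AIME25": {"o3 (no tool)": 88.9, "Gemini (no tool)": 88.0,
               "Claude 4 (no tool)": 75.5, "Grok 4 (no tool)": 91.7,
               "Grok 4 Heavy": 100.0},
    "LCB (Jan-May)": {"o3 (no tool)": 72.0, "Gemini 2.5 Pro": 74.2,
                      "Grok 4 (no tool)": 79.0, "Grok 4": 79.3,
                      "Grok 4 Heavy": 79.4},
    "HMMT25": {"o3 (no tool)": 77.5, "Gemini 2.5 Pro": 82.5, "Claude 4 Opus": 58.3,
               "Grok 4 (no tool)": 93.9, "Grok 4 Heavy": 96.7},
    "USAMO25": {"o3": 21.7, "Gemini 2.5 Pro": 34.5, "Claude Opus 4": 49.4,
                "Grok 4": 37.5, "Grok 4 Heavy": 61.9},
}


def lead_over_best_rival(results, model="Grok 4 Heavy"):
    """Return (best non-Grok model, lead in percentage points) for one benchmark."""
    rivals = {name: s for name, s in results.items() if not name.startswith("Grok")}
    best = max(rivals, key=rivals.get)
    return best, round(results[model] - rivals[best], 1)


leads = {bench: lead_over_best_rival(results) for bench, results in SCORES.items()}
for bench, (rival, lead) in leads.items():
    print(f"{bench}: +{lead} points over {rival}")
```

The leads range from 2.5 points on GPQA to 14.2 points on HMMT25. Note that on USAMO25 the standard Grok 4 model (37.5%) trails Claude Opus 4 (49.4%); only Grok 4 Heavy leads there.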

Grok Evolution: From Version 1 to 4

See the major improvements from Grok 1 → Grok 2 → Grok 3 → Grok 3.5 → Grok 4 AI.

Grok 1 → Grok 2

The initial reasoning and language generation of Grok 1 matured in Grok 2, which used ten times more training compute and delivered sharper reasoning and broader domain coverage.

Key improvement: 10x training compute

Grok 2 → Grok 3

Grok 3 improved context understanding, multimodal awareness, and introduced early coding assistance. Reasoning accuracy climbed significantly.

Key improvement: Multimodal awareness

Grok 3 → Grok 3.5

Grok 3.5 added advanced prompt understanding, better code fixes, and real-time web lookup. The foundations of Grok 4 began to appear.

Key improvement: Real-time web lookup

Grok 3.5 → Grok 4

Grok 4 AI boosts training compute by roughly 10× again (about 100× in total relative to Grok 2) and adds reinforcement-learning (RL) compute for reasoning. Capabilities expand in logic, coding, multimodal processing, web awareness, and voice.

Key improvement: 10x compute & RL-reasoning

Grok 4 → Grok 4 Heavy

Grok 4 Heavy adds parallel expert agents that work together on a problem. It is available through the $300/month SuperGrok Heavy plan and is aimed at the hardest problems, where several agents collaborating can outperform a single model.

Key improvement: Parallel expert agents

Real‑World Use Cases

Discover how developers and businesses use Grok 4 AI and Grok 4 Heavy.

Developer Code Assistance

Grok 4 code integration in IDEs speeds development with on‑the‑fly completion, test generation, debugging hints, and documentation. Users report drastic reductions in coding time.

Code Completion, Testing, Debugging

Enterprise Automation & Customer Support

Businesses deploy Grok 4 AI in chatbots to handle multimodal customer inquiries—image, voice, and text—resolving issues with expert reasoning and accuracy, reducing human agent load.

Chatbots, Customer Support, Automation

Multimodal Analytics & Creative Tasks

From image analysis and summarization to generating HTML animations (e.g., black hole collisions), Grok 4 helps researchers visualize data and bring scientific or creative ideas to life.

Data Analysis, Visualization, Research

Frequently Asked Questions

Common questions about Grok 4, Grok 4 Heavy, Grok 4 code, and benchmark performance.

What is Grok 4 and when did it launch?

Grok 4 is the fourth‑generation AI model from xAI, launched on July 10, 2025. It sets new standards in logical reasoning, multimodal understanding, coding support, and benchmark performance.

What is Grok 4 Heavy?

Grok 4 Heavy is a premium version that uses a multi-agent system to solve complex problems. It launches several AI agents that work together, compare results, and deliver a superior, well-vetted final answer. It's like having a dedicated team of experts for your most difficult tasks.
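
The "several agents, compare results" pattern can be illustrated with a short Python sketch. This is not xAI's implementation, and the text does not say how Grok 4 Heavy combines answers. The sketch assumes a simple majority vote, and the three agents are hypothetical stand-in functions rather than real model calls.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def run_agents(agents, question):
    """Run every agent on the same question in parallel; answers keep agent order."""
    with ThreadPoolExecutor(max_workers=len(agents)) as pool:
        return list(pool.map(lambda agent: agent(question), agents))


def aggregate(answers):
    """Pick the most common answer and report the share of agents that agreed."""
    winner, votes = Counter(answers).most_common(1)[0]
    return winner, votes / len(answers)


# Hypothetical stand-in agents: plain functions instead of real model calls.
def agent_a(question):
    return "42"


def agent_b(question):
    return "42"


def agent_c(question):
    return "41"


answers = run_agents([agent_a, agent_b, agent_c], "What is 6 * 7?")
final, agreement = aggregate(answers)
print(final, round(agreement, 2))
```

Majority voting is only one possible way to compare results. A production system might instead have agents critique each other's work or use a separate model to judge the candidates.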

What are the new Grok 4 Code capabilities?

Grok 4 features powerful coding abilities and can generate functional code and visualizations from natural language prompts. A dedicated, more powerful 'Grok 4 Code' model is scheduled for release in August 2025.

Is Grok 4 AI multimodal?

Yes, Grok 4 AI is highly multimodal. It can already process and understand text and images, including subjective concepts. It also features a new, highly responsive voice mode with emotionally expressive voices. A video generation model is planned for release in October 2025.

What makes Grok 4 different from other AI models?

Grok 4 combines advanced logical reasoning, multimodal capabilities, real-time web access, and specialized coding assistance in a single platform, achieving benchmark-breaking performance across multiple domains.

How does Grok 4 Code assist developers?

Grok 4 Code provides intelligent code completion, syntax suggestions, debugging assistance, and full code generation across multiple programming languages, making it an invaluable tool for developers of all skill levels.

Can Grok 4 access real-time information?

Yes, Grok 4 can browse real-time web data to provide up-to-date information and analysis, making it particularly useful for tasks requiring current market data, news, or other time-sensitive information.

What's coming next after Grok 4?

The roadmap includes: a dedicated coding model in August 2025, a multimodal intelligent agent in September, and a video‑generation model in October.
