Archive - Epoch AI

GPT-5.1 is about as capable as GPT-5

With “high” reasoning, both GPT-5.1 and GPT-5 score 151 on the Epoch Capabilities Index, our tool for combining results across multiple benchmarks

11 hrs ago •

The software intelligence explosion debate needs experiments

The existing debate rests on data and assumptions that are shakier than most people realize. To make progress, we need better evidence, and experiments…

Nov 15 •

and

Parker Whitfill

Introducing the Frontier Data Centers Hub

We announce our new Frontier Data Centers Hub, a database tracking large AI data centers using satellite and permit data to show compute, power use, and…

Nov 4 •

OSWorld — AI computer use capabilities

Tasks are simple, many don't require GUIs, and success often hinges on interpreting ambiguous instructions. The benchmark is also not stable over time.

Nov 4 •

October 2025

The Epoch AI Brief - October 2025

Report on decentralized training, new Epoch Capabilities Index for tracking AI progress, FrontierMath evaluations of leading models, revenue insights on…

Oct 31 •

AI + Math Chat #2 - Formalizing Proofs & Breaking Codes

“Can LLMs go beyond pattern-matching to produce fully formalized mathematical proofs and novel cryptographic attacks?” w. Kevin Buzzard, Alex…

Oct 30 •

AI + Math Chat #1 - Thinking smarter not harder

“How close are today’s large-language models to genuine mathematical creativity?” with mathematicians Richard Borcherds, Sergei Gukov, Daniel Litt, and…

Oct 30 •

Solving AI’s power problem with decentralized training

Conventional wisdom in AI is that large-scale pretraining needs to happen in massive contiguous datacenter campuses. But is this true?

Oct 28 •

and

Anton Troynikov

AI can learn logic. But can it learn folklore knowledge? - Svetlana Jitomirskaya

Large language models can imitate reasoning steps and even verify formal proofs. But Svetlana Jitomirskaya argues they lack folklore knowledge: the…

Oct 27 •

Waiting for AI's phase change in mathematics - Ravi Vakil

Stanford mathematician Ravi Vakil, president of the American Mathematical Society, expects AI’s impact on mathematics to come as a phase change, not a…

Oct 23 •

Less than 70% of FrontierMath is within reach for today’s models

57% of problems have been solved at least once

Oct 17 •

OpenAI is projecting unprecedented revenue growth

No company has gone from $10B to $100B as fast as OpenAI projects to do

Oct 15 •

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts