<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Epoch AI: Updates]]></title><description><![CDATA[All other announcements, as well as website or organizational updates.]]></description><link>https://epochai.substack.com/s/announcements</link><image><url>https://substackcdn.com/image/fetch/$s_!ZsOK!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fca617831-3128-496f-8aac-33d1fadda48f_176x176.png</url><title>Epoch AI: Updates</title><link>https://epochai.substack.com/s/announcements</link></image><generator>Substack</generator><lastBuildDate>Sat, 02 May 2026 00:58:33 GMT</lastBuildDate><atom:link href="https://epochai.substack.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Epoch Artificial Intelligence, Inc]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[epochai@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[epochai@substack.com]]></itunes:email><itunes:name><![CDATA[Epoch AI]]></itunes:name></itunes:owner><itunes:author><![CDATA[Epoch AI]]></itunes:author><googleplay:owner><![CDATA[epochai@substack.com]]></googleplay:owner><googleplay:email><![CDATA[epochai@substack.com]]></googleplay:email><googleplay:author><![CDATA[Epoch AI]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[GPT-5.5 Pro achieves a new high score on the ECI]]></title><description><![CDATA[GPT-5.5 Pro achieves a new high score of 159 on the Epoch Capabilities Index &#8212; our statistical tool that combines multiple benchmarks into a unified 
scale.]]></description><link>https://epochai.substack.com/p/gpt-55-pro-achieves-a-new-high-score</link><guid isPermaLink="false">https://epochai.substack.com/p/gpt-55-pro-achieves-a-new-high-score</guid><dc:creator><![CDATA[Epoch AI]]></dc:creator><pubDate>Wed, 29 Apr 2026 09:53:27 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!24I8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0bcea65-73be-4ad4-bd90-e40535e22142_1027x1284.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!24I8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0bcea65-73be-4ad4-bd90-e40535e22142_1027x1284.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!24I8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0bcea65-73be-4ad4-bd90-e40535e22142_1027x1284.jpeg 424w, https://substackcdn.com/image/fetch/$s_!24I8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0bcea65-73be-4ad4-bd90-e40535e22142_1027x1284.jpeg 848w, https://substackcdn.com/image/fetch/$s_!24I8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0bcea65-73be-4ad4-bd90-e40535e22142_1027x1284.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!24I8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0bcea65-73be-4ad4-bd90-e40535e22142_1027x1284.jpeg 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!24I8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0bcea65-73be-4ad4-bd90-e40535e22142_1027x1284.jpeg" width="1027" height="1284" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e0bcea65-73be-4ad4-bd90-e40535e22142_1027x1284.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1284,&quot;width&quot;:1027,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!24I8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0bcea65-73be-4ad4-bd90-e40535e22142_1027x1284.jpeg 424w, https://substackcdn.com/image/fetch/$s_!24I8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0bcea65-73be-4ad4-bd90-e40535e22142_1027x1284.jpeg 848w, https://substackcdn.com/image/fetch/$s_!24I8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0bcea65-73be-4ad4-bd90-e40535e22142_1027x1284.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!24I8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0bcea65-73be-4ad4-bd90-e40535e22142_1027x1284.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" 
type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>GPT-5.5 Pro also set new records on FrontierMath, scoring 52% on Tiers 1-3 (up from 50%) and 40% on Tier 4 (up from 38%). Across runs, it and GPT-5.5 solved two Tier 4 problems that no model had solved before, one by Hailong Dao and one by Ahsan Khan (<a href="https://x.com/ahsanxrr">@ahsanxrr</a>). 
They had this to say:</p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3cbeecc9-2607-4b9a-abb3-22636281aa89_1200x1200.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/71b8de7b-6274-4ffd-abd1-64764f42bec6_1200x1200.png&quot;}],&quot;caption&quot;:&quot;&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/155640ed-8e99-4709-8b7d-8804d7b8376b_1456x720.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p>For all these and more, check out <a href="https://epoch.ai/benchmarks?view=graph&amp;tab=eci">our website</a>!</p>]]></content:encoded></item><item><title><![CDATA[Opus 4.7 scores near frontier on ECI]]></title><description><![CDATA[Its score of 156 puts it behind only GPT-5.4, Gemini 3.1 Pro, and GPT-5.4 Pro.]]></description><link>https://epochai.substack.com/p/opus-47-scores-near-frontier-on-eci</link><guid isPermaLink="false">https://epochai.substack.com/p/opus-47-scores-near-frontier-on-eci</guid><dc:creator><![CDATA[Epoch AI]]></dc:creator><pubDate>Tue, 21 Apr 2026 17:30:18 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!UBqZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9ca8e87-9598-4890-9121-f5c8f0db137e_1024x1280.jpeg" length="0"
type="image/jpeg"/><content:encoded><![CDATA[<p>Opus 4.7 scores 156 on ECI, our tool for combining multiple benchmarks onto a single scale. This puts it a bit ahead of Opus 4.6 and a bit behind only GPT-5.4, Gemini 3.1 Pro, and GPT-5.4 Pro. Individual scores and commentary follow.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UBqZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9ca8e87-9598-4890-9121-f5c8f0db137e_1024x1280.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UBqZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9ca8e87-9598-4890-9121-f5c8f0db137e_1024x1280.jpeg 424w, https://substackcdn.com/image/fetch/$s_!UBqZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9ca8e87-9598-4890-9121-f5c8f0db137e_1024x1280.jpeg 848w, https://substackcdn.com/image/fetch/$s_!UBqZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9ca8e87-9598-4890-9121-f5c8f0db137e_1024x1280.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!UBqZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9ca8e87-9598-4890-9121-f5c8f0db137e_1024x1280.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UBqZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9ca8e87-9598-4890-9121-f5c8f0db137e_1024x1280.jpeg" width="1024" height="1280"
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d9ca8e87-9598-4890-9121-f5c8f0db137e_1024x1280.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1280,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!UBqZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9ca8e87-9598-4890-9121-f5c8f0db137e_1024x1280.jpeg 424w, https://substackcdn.com/image/fetch/$s_!UBqZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9ca8e87-9598-4890-9121-f5c8f0db137e_1024x1280.jpeg 848w, https://substackcdn.com/image/fetch/$s_!UBqZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9ca8e87-9598-4890-9121-f5c8f0db137e_1024x1280.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!UBqZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9ca8e87-9598-4890-9121-f5c8f0db137e_1024x1280.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>On FrontierMath, Opus 4.7 scored 44% on Tiers 1-3 and 23% on Tier 4, ahead of every model except GPT-5.4 and GPT-5.4 Pro.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5zQD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0ce5e33-09bd-4bac-abe6-249f3730db0b_1024x1280.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5zQD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0ce5e33-09bd-4bac-abe6-249f3730db0b_1024x1280.jpeg 424w, 
https://substackcdn.com/image/fetch/$s_!5zQD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0ce5e33-09bd-4bac-abe6-249f3730db0b_1024x1280.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5zQD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0ce5e33-09bd-4bac-abe6-249f3730db0b_1024x1280.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5zQD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0ce5e33-09bd-4bac-abe6-249f3730db0b_1024x1280.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5zQD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0ce5e33-09bd-4bac-abe6-249f3730db0b_1024x1280.jpeg" width="1024" height="1280" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d0ce5e33-09bd-4bac-abe6-249f3730db0b_1024x1280.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1280,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!5zQD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0ce5e33-09bd-4bac-abe6-249f3730db0b_1024x1280.jpeg 424w, 
https://substackcdn.com/image/fetch/$s_!5zQD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0ce5e33-09bd-4bac-abe6-249f3730db0b_1024x1280.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5zQD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0ce5e33-09bd-4bac-abe6-249f3730db0b_1024x1280.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5zQD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0ce5e33-09bd-4bac-abe6-249f3730db0b_1024x1280.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" 
x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>On SWE-bench Verified, Opus 4.7 scored 83%, a new record for our evaluations on this benchmark. SWE-bench Verified may be increasingly contaminated, and we hope to evaluate Opus 4.7 on other coding benchmarks soon.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!LPMZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfc84487-8eec-403e-ab22-c10cf4e2ef03_1024x1280.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!LPMZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfc84487-8eec-403e-ab22-c10cf4e2ef03_1024x1280.jpeg 424w, https://substackcdn.com/image/fetch/$s_!LPMZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfc84487-8eec-403e-ab22-c10cf4e2ef03_1024x1280.jpeg 848w, https://substackcdn.com/image/fetch/$s_!LPMZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfc84487-8eec-403e-ab22-c10cf4e2ef03_1024x1280.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!LPMZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfc84487-8eec-403e-ab22-c10cf4e2ef03_1024x1280.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!LPMZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfc84487-8eec-403e-ab22-c10cf4e2ef03_1024x1280.jpeg" width="1024" height="1280" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bfc84487-8eec-403e-ab22-c10cf4e2ef03_1024x1280.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1280,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!LPMZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfc84487-8eec-403e-ab22-c10cf4e2ef03_1024x1280.jpeg 424w, https://substackcdn.com/image/fetch/$s_!LPMZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfc84487-8eec-403e-ab22-c10cf4e2ef03_1024x1280.jpeg 848w, https://substackcdn.com/image/fetch/$s_!LPMZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfc84487-8eec-403e-ab22-c10cf4e2ef03_1024x1280.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!LPMZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfc84487-8eec-403e-ab22-c10cf4e2ef03_1024x1280.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Opus 4.7 scored 30% on our Chess Puzzles benchmark. This is a substantial jump over previous Anthropic models, though it remains far from the frontier. Opus 4.7 hit its max output limit before answering for 31% of problems. 
We plan to improve elicitation here.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!agjH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd255c299-31e4-4e06-87f3-a459f29fdaea_1024x1280.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!agjH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd255c299-31e4-4e06-87f3-a459f29fdaea_1024x1280.jpeg 424w, https://substackcdn.com/image/fetch/$s_!agjH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd255c299-31e4-4e06-87f3-a459f29fdaea_1024x1280.jpeg 848w, https://substackcdn.com/image/fetch/$s_!agjH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd255c299-31e4-4e06-87f3-a459f29fdaea_1024x1280.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!agjH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd255c299-31e4-4e06-87f3-a459f29fdaea_1024x1280.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!agjH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd255c299-31e4-4e06-87f3-a459f29fdaea_1024x1280.jpeg" width="1024" height="1280" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d255c299-31e4-4e06-87f3-a459f29fdaea_1024x1280.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1280,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!agjH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd255c299-31e4-4e06-87f3-a459f29fdaea_1024x1280.jpeg 424w, https://substackcdn.com/image/fetch/$s_!agjH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd255c299-31e4-4e06-87f3-a459f29fdaea_1024x1280.jpeg 848w, https://substackcdn.com/image/fetch/$s_!agjH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd255c299-31e4-4e06-87f3-a459f29fdaea_1024x1280.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!agjH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd255c299-31e4-4e06-87f3-a459f29fdaea_1024x1280.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Opus 4.7 also solved the last problem on our older math benchmark, Mock AIME, that had not been solved by any model before. 
This problem involves interpreting the diagram shown below when given as Asymptote vector graphics code.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9C0s!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7230584-883a-43dd-97e0-22e8905b6bf2_1046x660.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9C0s!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7230584-883a-43dd-97e0-22e8905b6bf2_1046x660.jpeg 424w, https://substackcdn.com/image/fetch/$s_!9C0s!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7230584-883a-43dd-97e0-22e8905b6bf2_1046x660.jpeg 848w, https://substackcdn.com/image/fetch/$s_!9C0s!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7230584-883a-43dd-97e0-22e8905b6bf2_1046x660.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!9C0s!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7230584-883a-43dd-97e0-22e8905b6bf2_1046x660.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9C0s!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7230584-883a-43dd-97e0-22e8905b6bf2_1046x660.jpeg" width="1046" height="660" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a7230584-883a-43dd-97e0-22e8905b6bf2_1046x660.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:660,&quot;width&quot;:1046,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!9C0s!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7230584-883a-43dd-97e0-22e8905b6bf2_1046x660.jpeg 424w, https://substackcdn.com/image/fetch/$s_!9C0s!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7230584-883a-43dd-97e0-22e8905b6bf2_1046x660.jpeg 848w, https://substackcdn.com/image/fetch/$s_!9C0s!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7230584-883a-43dd-97e0-22e8905b6bf2_1046x660.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!9C0s!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7230584-883a-43dd-97e0-22e8905b6bf2_1046x660.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Opus 4.7 was a bit behind the top scores on GPQA Diamond (90% vs. 95%). Anecdotally, this may be due to refusing to answer some questions for safety reasons, even though GPQA isn&#8217;t meant to cover dangerous topics. 
See, for instance, the cut-off sample below.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Fbkp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae65667-f1ed-40d4-95f6-630db992f62b_1206x400.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Fbkp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae65667-f1ed-40d4-95f6-630db992f62b_1206x400.png 424w, https://substackcdn.com/image/fetch/$s_!Fbkp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae65667-f1ed-40d4-95f6-630db992f62b_1206x400.png 848w, https://substackcdn.com/image/fetch/$s_!Fbkp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae65667-f1ed-40d4-95f6-630db992f62b_1206x400.png 1272w, https://substackcdn.com/image/fetch/$s_!Fbkp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae65667-f1ed-40d4-95f6-630db992f62b_1206x400.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Fbkp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae65667-f1ed-40d4-95f6-630db992f62b_1206x400.png" width="1206" height="400" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fae65667-f1ed-40d4-95f6-630db992f62b_1206x400.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:400,&quot;width&quot;:1206,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!Fbkp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae65667-f1ed-40d4-95f6-630db992f62b_1206x400.png 424w, https://substackcdn.com/image/fetch/$s_!Fbkp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae65667-f1ed-40d4-95f6-630db992f62b_1206x400.png 848w, https://substackcdn.com/image/fetch/$s_!Fbkp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae65667-f1ed-40d4-95f6-630db992f62b_1206x400.png 1272w, https://substackcdn.com/image/fetch/$s_!Fbkp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae65667-f1ed-40d4-95f6-630db992f62b_1206x400.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>ECI also incorporates scores from third-party benchmarks. So far, for Opus 4.7, we have ARC-AGI-1, ARC-AGI-2, WeirdML v2, and SimpleBench. 
Its ECI score will evolve as we collect additional benchmark scores.<br><br>For all these and more, check out <a href="https://epoch.ai/benchmarks/eci?view=graph&amp;tab=release-date">our website</a>!</p><div><hr></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://epochai.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://epochai.substack.com/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[Introducing the AI Chip Owners Explorer]]></title><description><![CDATA[We announce our new AI Chip Owners explorer, showing which companies own the world&#8217;s leading AI chips.]]></description><link>https://epochai.substack.com/p/introducing-the-ai-chip-owners-explorer</link><guid isPermaLink="false">https://epochai.substack.com/p/introducing-the-ai-chip-owners-explorer</guid><dc:creator><![CDATA[Josh You]]></dc:creator><pubDate>Mon, 06 Apr 2026 21:24:37 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!c6vP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b2b46b5-db38-4f7f-ba44-5ab01a1c1c34_1600x900.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Computing capacity (&#8220;compute&#8221;) is a critical input to the development, training, and deployment of AI systems. How much AI-optimized compute exists in the world, and who owns it? Earlier this year, we <a href="https://epoch.ai/blog/introducing-the-ai-chip-sales-data-explorer/">launched</a> the <a href="https://epoch.ai/data/ai-chip-sales">AI Chip Sales</a> explorer to track the first question. 
Today, we&#8217;re launching our <a href="https://epoch.ai/data/ai-chip-owners/">AI Chip Owners</a> explorer to track the second.</p><p>Our AI Chip Owners explorer contains interactive visualizations of our analysis of the number of leading AI chips owned by the largest US hyperscalers and cloud companies, one frontier AI developer (xAI), and Chinese customers &#8212; with breakdowns by chip family, chip model, and shifts in ownership over time. We build upon our estimates of the total volumes of Nvidia, Google TPU, Amazon Trainium, AMD, and Huawei chips from the <a href="https://epoch.ai/data/ai-chip-sales">AI Chip Sales</a> explorer, and distribute these chips among major owners using estimates from analysts and industry researchers, company financial disclosures, capital spending, and our analysis of <a href="https://epoch.ai/data/data-centers/">frontier-scale AI data centers</a>.</p><p>The AI Chip Owners explorer is intended as a resource for researchers, policymakers, and anyone tracking the strategic landscape of AI compute. You can visit the explorer <a href="https://epoch.ai/data/ai-chip-owners/">here</a>, and read on for highlights!</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://epochai.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! 
Subscribe for free to receive new posts and support our work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2>Hyperscalers own the majority of global AI compute</h2><p>Most of the world&#8217;s AI computing power is owned by hyperscalers and large cloud companies. &#8220;Hyperscalers&#8221; are the leading companies in data center deployments; among US companies, this refers to Amazon, Google (Alphabet), Meta, Microsoft, and Oracle. We estimate that over 60% of global AI compute (in terms of total computing power) is owned by the five US hyperscalers, led by Google.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!c6vP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b2b46b5-db38-4f7f-ba44-5ab01a1c1c34_1600x900.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!c6vP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b2b46b5-db38-4f7f-ba44-5ab01a1c1c34_1600x900.png 424w, https://substackcdn.com/image/fetch/$s_!c6vP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b2b46b5-db38-4f7f-ba44-5ab01a1c1c34_1600x900.png 848w, https://substackcdn.com/image/fetch/$s_!c6vP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b2b46b5-db38-4f7f-ba44-5ab01a1c1c34_1600x900.png 1272w, 
https://substackcdn.com/image/fetch/$s_!c6vP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b2b46b5-db38-4f7f-ba44-5ab01a1c1c34_1600x900.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!c6vP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b2b46b5-db38-4f7f-ba44-5ab01a1c1c34_1600x900.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5b2b46b5-db38-4f7f-ba44-5ab01a1c1c34_1600x900.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!c6vP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b2b46b5-db38-4f7f-ba44-5ab01a1c1c34_1600x900.png 424w, https://substackcdn.com/image/fetch/$s_!c6vP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b2b46b5-db38-4f7f-ba44-5ab01a1c1c34_1600x900.png 848w, https://substackcdn.com/image/fetch/$s_!c6vP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b2b46b5-db38-4f7f-ba44-5ab01a1c1c34_1600x900.png 1272w, 
https://substackcdn.com/image/fetch/$s_!c6vP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b2b46b5-db38-4f7f-ba44-5ab01a1c1c34_1600x900.png 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><p>Google holds the equivalent of around 5 million Nvidia H100 GPUs in compute capacity, roughly 25% of the world&#8217;s total! Most of this capacity comes from its custom TPU chips, which we estimate at a total of almost 4 million H100-equivalents [confidence interval: 3.1M to 4.5M]. 
For the other hyperscalers, Nvidia is responsible for the majority of their AI compute acquired to date.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hnjn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17d4f495-03cc-4edf-8eff-b9ffd1fc93d5_1920x1080.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hnjn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17d4f495-03cc-4edf-8eff-b9ffd1fc93d5_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!hnjn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17d4f495-03cc-4edf-8eff-b9ffd1fc93d5_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!hnjn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17d4f495-03cc-4edf-8eff-b9ffd1fc93d5_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!hnjn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17d4f495-03cc-4edf-8eff-b9ffd1fc93d5_1920x1080.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hnjn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17d4f495-03cc-4edf-8eff-b9ffd1fc93d5_1920x1080.png" width="1456" height="819" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/17d4f495-03cc-4edf-8eff-b9ffd1fc93d5_1920x1080.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!hnjn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17d4f495-03cc-4edf-8eff-b9ffd1fc93d5_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!hnjn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17d4f495-03cc-4edf-8eff-b9ffd1fc93d5_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!hnjn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17d4f495-03cc-4edf-8eff-b9ffd1fc93d5_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!hnjn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17d4f495-03cc-4edf-8eff-b9ffd1fc93d5_1920x1080.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path 
d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Apart from Meta, the hyperscalers are all major cloud companies, meaning that they rent out part of their compute to other companies. <strong>Many frontier AI developers, including Anthropic and OpenAI, acquire almost all of their compute from hyperscalers and other cloud providers.</strong> OpenAI&#8217;s compute primarily comes from Microsoft, Oracle, and CoreWeave, and Anthropic&#8217;s from Google and Amazon, though neither should be assumed to be renting the entire capacity of these clouds. We will follow up soon with an analysis of how much compute is used by frontier AI developers like OpenAI and Anthropic, as well as the frontier model divisions housed within Google and Meta.</p><h2>Chinese customers own just 5% of global AI compute</h2><p>We estimate that as of the end of 2025, Chinese companies collectively own just over <strong>5%</strong> of the cumulative computing power of the leading AI chips sold in recent years &#8212; less than any single top US hyperscaler, and a share that has decreased over time. 
We bucket Nvidia, AMD, and Huawei chips purchased by mainland Chinese customers into a single &#8220;China&#8221; category, with specific customer breakdowns left for future work.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_NVI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec416393-4c5a-46cd-9add-b0e6e4751c93_1920x1080.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_NVI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec416393-4c5a-46cd-9add-b0e6e4751c93_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!_NVI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec416393-4c5a-46cd-9add-b0e6e4751c93_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!_NVI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec416393-4c5a-46cd-9add-b0e6e4751c93_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!_NVI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec416393-4c5a-46cd-9add-b0e6e4751c93_1920x1080.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_NVI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec416393-4c5a-46cd-9add-b0e6e4751c93_1920x1080.png" width="1456" height="819" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ec416393-4c5a-46cd-9add-b0e6e4751c93_1920x1080.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_NVI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec416393-4c5a-46cd-9add-b0e6e4751c93_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!_NVI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec416393-4c5a-46cd-9add-b0e6e4751c93_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!_NVI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec416393-4c5a-46cd-9add-b0e6e4751c93_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!_NVI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec416393-4c5a-46cd-9add-b0e6e4751c93_1920x1080.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Our estimates of Chinese compute do not include chips smuggled in contravention of US export controls. Prior research and recent reporting suggest these volumes may be significant. <a href="https://www.cnas.org/publications/reports/countering-ai-chip-smuggling-has-become-a-national-security-priority">Grunewald and Fist</a> estimate that over 100,000 Nvidia A100s and H100s were shipped to China in 2024, though with significant uncertainty. These imports continued through 2025, according to <a href="https://www.ft.com/content/6f806f6e-61c1-4b8d-9694-90d7328a7b54?syn-25a6b1a6=1">multiple</a> <a href="https://www.justice.gov/opa/pr/three-charged-conspiring-unlawfully-divert-cutting-edge-us-artificial-intelligence">reports</a> of Nvidia shipments upwards of several billion dollars, or at least tens of thousands of chips. 
However, chip smuggling <a href="https://www.the-substrate.net/p/where-will-china-get-its-compute">does not seem likely</a> to add up to the millions of chips required to significantly narrow the compute gap between Chinese companies and US hyperscalers.</p><p>Like our other ownership figures, these China totals do not include any offshore <a href="https://www.reuters.com/world/asia-pacific/chinas-bytedance-gets-access-top-nvidia-ai-chips-wsj-reports-2026-03-13/">cloud compute</a> rented by Chinese companies.</p><p>Notably, in the past year Huawei has overtaken Nvidia as the leading source of AI computing power in China, at least in terms of rated FLOP/s, which may not reflect real-world performance. This is due to a pause in official Nvidia exports to China following tightened controls on the Nvidia H20 chip. Nvidia is now <a href="https://www.cnbc.com/2026/03/17/nvidia-ceo-jensen-huang-says-chipmaker-has-received-orders-from-china.html">preparing</a> to export the more advanced H200 to China.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9Fl0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cda4538-9cc2-4d1d-acb6-9055fc0d16d1_1920x1080.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9Fl0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cda4538-9cc2-4d1d-acb6-9055fc0d16d1_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!9Fl0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cda4538-9cc2-4d1d-acb6-9055fc0d16d1_1920x1080.png 848w, 
https://substackcdn.com/image/fetch/$s_!9Fl0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cda4538-9cc2-4d1d-acb6-9055fc0d16d1_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!9Fl0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cda4538-9cc2-4d1d-acb6-9055fc0d16d1_1920x1080.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9Fl0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cda4538-9cc2-4d1d-acb6-9055fc0d16d1_1920x1080.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7cda4538-9cc2-4d1d-acb6-9055fc0d16d1_1920x1080.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!9Fl0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cda4538-9cc2-4d1d-acb6-9055fc0d16d1_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!9Fl0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cda4538-9cc2-4d1d-acb6-9055fc0d16d1_1920x1080.png 848w, 
https://substackcdn.com/image/fetch/$s_!9Fl0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cda4538-9cc2-4d1d-acb6-9055fc0d16d1_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!9Fl0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cda4538-9cc2-4d1d-acb6-9055fc0d16d1_1920x1080.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>We plan to follow up this work with more coverage and analysis of how AI compute is allocated and used by major players, 
complementing our coverage of overall AI chip sales and <a href="https://epoch.ai/data/data-centers/">frontier AI data centers</a> to give a detailed overview of the world&#8217;s AI compute capacity.</p><p>To learn more, visit the <a href="https://epoch.ai/data/ai-chip-owners/">AI Chip Owners explorer</a> to find the methodology, full dataset, interactive visualizations, and more analysis!</p><p><em>Thanks to Brendan Halstead, Erich Grunewald, Konstantin Pilz, and Theo Bearman for their feedback.</em></p>]]></content:encoded></item><item><title><![CDATA[First AI solution on FrontierMath: Open Problems]]></title><description><![CDATA[AI has solved one of the problems in FrontierMath: Open Problems, our benchmark of real research problems that mathematicians have tried and failed to solve]]></description><link>https://epochai.substack.com/p/first-ai-solution-on-frontiermath</link><guid isPermaLink="false">https://epochai.substack.com/p/first-ai-solution-on-frontiermath</guid><dc:creator><![CDATA[Epoch AI]]></dc:creator><pubDate>Mon, 23 Mar 2026 17:58:04 GMT</pubDate><enclosure 
url="https://substackcdn.com/image/fetch/$s_!TWAl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9b67df0-ce2b-431a-9c82-1d5b9c3590ab_1024x1280.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!TWAl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9b67df0-ce2b-431a-9c82-1d5b9c3590ab_1024x1280.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!TWAl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9b67df0-ce2b-431a-9c82-1d5b9c3590ab_1024x1280.jpeg 424w, https://substackcdn.com/image/fetch/$s_!TWAl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9b67df0-ce2b-431a-9c82-1d5b9c3590ab_1024x1280.jpeg 848w, https://substackcdn.com/image/fetch/$s_!TWAl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9b67df0-ce2b-431a-9c82-1d5b9c3590ab_1024x1280.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!TWAl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9b67df0-ce2b-431a-9c82-1d5b9c3590ab_1024x1280.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!TWAl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9b67df0-ce2b-431a-9c82-1d5b9c3590ab_1024x1280.jpeg" width="450" height="562.5" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c9b67df0-ce2b-431a-9c82-1d5b9c3590ab_1024x1280.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1280,&quot;width&quot;:1024,&quot;resizeWidth&quot;:450,&quot;bytes&quot;:87616,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://epochai.substack.com/i/191886927?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9b67df0-ce2b-431a-9c82-1d5b9c3590ab_1024x1280.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!TWAl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9b67df0-ce2b-431a-9c82-1d5b9c3590ab_1024x1280.jpeg 424w, https://substackcdn.com/image/fetch/$s_!TWAl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9b67df0-ce2b-431a-9c82-1d5b9c3590ab_1024x1280.jpeg 848w, https://substackcdn.com/image/fetch/$s_!TWAl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9b67df0-ce2b-431a-9c82-1d5b9c3590ab_1024x1280.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!TWAl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9b67df0-ce2b-431a-9c82-1d5b9c3590ab_1024x1280.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" 
width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The newly solved problem came from Will Brian, who had placed it in the Moderately Interesting category. It is a conjecture from a paper he wrote with Paul Larson in 2019. They were unable to solve it at the time or in several attempts since. 
Brian had this to say.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7e-D!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5828c2cc-954f-4616-b1bd-a5ea7b06f17f_1200x1200.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7e-D!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5828c2cc-954f-4616-b1bd-a5ea7b06f17f_1200x1200.png 424w, https://substackcdn.com/image/fetch/$s_!7e-D!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5828c2cc-954f-4616-b1bd-a5ea7b06f17f_1200x1200.png 848w, https://substackcdn.com/image/fetch/$s_!7e-D!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5828c2cc-954f-4616-b1bd-a5ea7b06f17f_1200x1200.png 1272w, https://substackcdn.com/image/fetch/$s_!7e-D!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5828c2cc-954f-4616-b1bd-a5ea7b06f17f_1200x1200.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7e-D!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5828c2cc-954f-4616-b1bd-a5ea7b06f17f_1200x1200.png" width="451" height="451" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5828c2cc-954f-4616-b1bd-a5ea7b06f17f_1200x1200.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1200,&quot;width&quot;:1200,&quot;resizeWidth&quot;:451,&quot;bytes&quot;:61119,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://epochai.substack.com/i/191886927?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5828c2cc-954f-4616-b1bd-a5ea7b06f17f_1200x1200.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7e-D!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5828c2cc-954f-4616-b1bd-a5ea7b06f17f_1200x1200.png 424w, https://substackcdn.com/image/fetch/$s_!7e-D!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5828c2cc-954f-4616-b1bd-a5ea7b06f17f_1200x1200.png 848w, https://substackcdn.com/image/fetch/$s_!7e-D!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5828c2cc-954f-4616-b1bd-a5ea7b06f17f_1200x1200.png 1272w, https://substackcdn.com/image/fetch/$s_!7e-D!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5828c2cc-954f-4616-b1bd-a5ea7b06f17f_1200x1200.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Brian plans to write up the solution for publication, possibly including follow-on work spurred by the AI&#8217;s ideas. This matches his prospective assessment: a solution would be publishable in a standard speciality journal and would be fairly likely to generate new questions.</p><p>Congratulations to Kevin Barreto and Liam Price, who first elicited a solution from GPT-5.4 Pro! They have the option to be coauthors, with Brian, on any resulting paper. Congratulations also to Geby Jaff, who elicited a solution shortly thereafter.</p><p>We have replicated this elicitation in our scaffold for testing models on the problems. In this scaffold, Gemini 3.1 Pro, GPT-5.4 (xhigh), and Opus 4.6 (max) are all capable of solving the problem at least some of the time. 
For more about the problem, including a full chat transcript showing GPT-5.4 Pro&#8217;s original solution, and other models&#8217; solutions in our harness, see the <a href="https://epoch.ai/frontiermath/open-problems/ramsey-hypergraphs">problem page on our website</a>.</p><p>And check out the main <a href="https://epoch.ai/frontiermath/open-problems">FrontierMath: Open Problems page</a> to learn more about the benchmark. So far one Moderately Interesting problem has been solved. Which problem will be solved next, and when?</p><div><hr></div>]]></content:encoded></item><item><title><![CDATA[GPT-5.4 set a new record on FrontierMath]]></title><description><![CDATA[Solved one Tier 4 problem that no model had solved before]]></description><link>https://epochai.substack.com/p/gpt-54-set-a-new-record-on-frontiermath</link><guid isPermaLink="false">https://epochai.substack.com/p/gpt-54-set-a-new-record-on-frontiermath</guid><dc:creator><![CDATA[Epoch AI]]></dc:creator><pubDate>Thu, 05 Mar 2026 18:57:39 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!HSol!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63741d46-1c8f-46b4-a6b2-bdf60061ecd9_800x1000.jpeg" 
length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>GPT-5.4 set a new record on FrontierMath, our benchmark of extremely challenging math problems! </p><p>We had pre-release access to evaluate the model. On Tiers 1&#8211;3, GPT-5.4 Pro scored 50%. On Tier 4 it scored 38%.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!HSol!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63741d46-1c8f-46b4-a6b2-bdf60061ecd9_800x1000.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!HSol!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63741d46-1c8f-46b4-a6b2-bdf60061ecd9_800x1000.jpeg 424w, https://substackcdn.com/image/fetch/$s_!HSol!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63741d46-1c8f-46b4-a6b2-bdf60061ecd9_800x1000.jpeg 848w, https://substackcdn.com/image/fetch/$s_!HSol!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63741d46-1c8f-46b4-a6b2-bdf60061ecd9_800x1000.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!HSol!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63741d46-1c8f-46b4-a6b2-bdf60061ecd9_800x1000.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HSol!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63741d46-1c8f-46b4-a6b2-bdf60061ecd9_800x1000.jpeg" width="800" height="1000" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/63741d46-1c8f-46b4-a6b2-bdf60061ecd9_800x1000.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1000,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:56172,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://epochai.substack.com/i/190026020?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63741d46-1c8f-46b4-a6b2-bdf60061ecd9_800x1000.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!HSol!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63741d46-1c8f-46b4-a6b2-bdf60061ecd9_800x1000.jpeg 424w, https://substackcdn.com/image/fetch/$s_!HSol!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63741d46-1c8f-46b4-a6b2-bdf60061ecd9_800x1000.jpeg 848w, https://substackcdn.com/image/fetch/$s_!HSol!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63741d46-1c8f-46b4-a6b2-bdf60061ecd9_800x1000.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!HSol!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F63741d46-1c8f-46b4-a6b2-bdf60061ecd9_800x1000.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" 
height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>See below for commentary and additional experiments.<br><br>FrontierMath was funded by OpenAI, which has exclusive access to all 290 problems in Tiers 1&#8211;3, solutions to 237 of these problems, 28 of the 48 problems in Tier 4, and solutions to those 28 problems. Epoch holds out the rest.<br><br>On Tiers 1&#8211;3, GPT-5.4 Pro solved 52% of the non-held-out set and 42% of the held-out set. On Tier 4, GPT-5.4 Pro solved 25% of the non-held-out set and 55% of the held-out set. Neither of these differences is statistically significant.<br><br>GPT-5.4 Pro solved one Tier 4 problem that no model had solved before. In a preliminary analysis, it appeared to have found a preprint from 2011 that let it shortcut much of the intended work. 
The problem author was unaware of this preprint.<br><br>We ran GPT-5.4 (xhigh) an additional ten times on Tier 4 to get a pass@10 score. This was 38%. In one of these runs, it solved another problem no model had solved before. This problem was by Bartosz Naskr&#281;cki, who responded as follows:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!U5b_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe45822b6-45a8-45f5-b49e-5b9789480cc5_800x800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!U5b_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe45822b6-45a8-45f5-b49e-5b9789480cc5_800x800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!U5b_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe45822b6-45a8-45f5-b49e-5b9789480cc5_800x800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!U5b_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe45822b6-45a8-45f5-b49e-5b9789480cc5_800x800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!U5b_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe45822b6-45a8-45f5-b49e-5b9789480cc5_800x800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!U5b_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe45822b6-45a8-45f5-b49e-5b9789480cc5_800x800.jpeg" width="800" height="800" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e45822b6-45a8-45f5-b49e-5b9789480cc5_800x800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:82439,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://epochai.substack.com/i/190026020?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe45822b6-45a8-45f5-b49e-5b9789480cc5_800x800.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!U5b_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe45822b6-45a8-45f5-b49e-5b9789480cc5_800x800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!U5b_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe45822b6-45a8-45f5-b49e-5b9789480cc5_800x800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!U5b_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe45822b6-45a8-45f5-b49e-5b9789480cc5_800x800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!U5b_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe45822b6-45a8-45f5-b49e-5b9789480cc5_800x800.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 
20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Across all runs to date, 42% (20/48) of the problems in Tier 4 have now been solved at least once.<br><br>We also evaluated GPT-5.4 Pro on FrontierMath: Open Problems. It did not solve any of them. It made some novel observations on one problem, but of a kind that the author had anticipated and characterized as relatively uninteresting. 
More <a href="https://epoch.ai/frontiermath/open-problems/small-diophantine">here</a>.<br><br>Check out our website for more results and commentary about <a href="https://epoch.ai/frontiermath">FrontierMath</a> overall!</p><div><hr></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://epochai.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://epochai.substack.com/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[FrontierMath: Open Problems. Aletheia. First Proof.]]></title><description><![CDATA[New piece featuring FrontierMath in IEEE Spectrum]]></description><link>https://epochai.substack.com/p/frontiermath-open-problems-aletheia</link><guid isPermaLink="false">https://epochai.substack.com/p/frontiermath-open-problems-aletheia</guid><dc:creator><![CDATA[Epoch AI]]></dc:creator><pubDate>Thu, 26 Feb 2026 01:27:35 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!eo_6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50fa7e67-4386-4d0a-8685-d66a596d46e3_1024x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>AI is getting better at math almost as fast as we can write new benchmarks to test it.</p><p>IEEE Spectrum just <a href="https://spectrum.ieee.org/ai-math-benchmarks">published a piece</a> featuring our past &amp; present work on FrontierMath &#8212; as well as Aletheia and First Proof. 
</p><p>Epoch researcher Greg Burnham called this &#8220;a more-the-merrier situation.&#8221;</p><p>Our own new contribution is <a href="https://epoch.ai/frontiermath/open-problems">FrontierMath: Open Problems</a>, which consists of open problems from research mathematics that professional mathematicians have tried and failed to solve.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!eo_6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50fa7e67-4386-4d0a-8685-d66a596d46e3_1024x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!eo_6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50fa7e67-4386-4d0a-8685-d66a596d46e3_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!eo_6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50fa7e67-4386-4d0a-8685-d66a596d46e3_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!eo_6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50fa7e67-4386-4d0a-8685-d66a596d46e3_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!eo_6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50fa7e67-4386-4d0a-8685-d66a596d46e3_1024x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!eo_6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50fa7e67-4386-4d0a-8685-d66a596d46e3_1024x1024.png" width="1024" height="1024" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/50fa7e67-4386-4d0a-8685-d66a596d46e3_1024x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:82319,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://epochai.substack.com/i/189203864?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50fa7e67-4386-4d0a-8685-d66a596d46e3_1024x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!eo_6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50fa7e67-4386-4d0a-8685-d66a596d46e3_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!eo_6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50fa7e67-4386-4d0a-8685-d66a596d46e3_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!eo_6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50fa7e67-4386-4d0a-8685-d66a596d46e3_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!eo_6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50fa7e67-4386-4d0a-8685-d66a596d46e3_1024x1024.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" 
height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://epochai.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://epochai.substack.com/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[Gemini 3.1 Pro comparable to Gemini 3 Pro on FrontierMath]]></title><description><![CDATA[First solution to one Tier 4 problem, not like a human solution]]></description><link>https://epochai.substack.com/p/gemini-31-pro-comparable-to-gemini</link><guid isPermaLink="false">https://epochai.substack.com/p/gemini-31-pro-comparable-to-gemini</guid><dc:creator><![CDATA[Epoch AI]]></dc:creator><pubDate>Sun, 22 Feb 2026 00:52:48 
GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!9S2M!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcdf640c-2fe9-4b48-aa12-d055f0f269e9_1026x1283.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Gemini 3.1 Pro scored comparably to Gemini 3 Pro on FrontierMath. <br><br>It also solved a Tier 4 problem that no model has solved before, though not how a human would. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9S2M!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcdf640c-2fe9-4b48-aa12-d055f0f269e9_1026x1283.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9S2M!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcdf640c-2fe9-4b48-aa12-d055f0f269e9_1026x1283.png 424w, https://substackcdn.com/image/fetch/$s_!9S2M!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcdf640c-2fe9-4b48-aa12-d055f0f269e9_1026x1283.png 848w, https://substackcdn.com/image/fetch/$s_!9S2M!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcdf640c-2fe9-4b48-aa12-d055f0f269e9_1026x1283.png 1272w, https://substackcdn.com/image/fetch/$s_!9S2M!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcdf640c-2fe9-4b48-aa12-d055f0f269e9_1026x1283.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!9S2M!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcdf640c-2fe9-4b48-aa12-d055f0f269e9_1026x1283.png" width="1026" height="1283" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fcdf640c-2fe9-4b48-aa12-d055f0f269e9_1026x1283.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1283,&quot;width&quot;:1026,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:38421,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://epochai.substack.com/i/188757701?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcdf640c-2fe9-4b48-aa12-d055f0f269e9_1026x1283.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!9S2M!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcdf640c-2fe9-4b48-aa12-d055f0f269e9_1026x1283.png 424w, https://substackcdn.com/image/fetch/$s_!9S2M!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcdf640c-2fe9-4b48-aa12-d055f0f269e9_1026x1283.png 848w, https://substackcdn.com/image/fetch/$s_!9S2M!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcdf640c-2fe9-4b48-aa12-d055f0f269e9_1026x1283.png 1272w, https://substackcdn.com/image/fetch/$s_!9S2M!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcdf640c-2fe9-4b48-aa12-d055f0f269e9_1026x1283.png 1456w" 
sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>We accidentally ran Gemini 3.1 Pro on Tier 4 a second time. The score above reflects the first, official run. But we noticed in the second run that it had solved a problem no model had solved before. The newly-solved problem is by Emmanuel Breuillard. 
See his commentary below.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!75Sf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa55fdcc3-b50f-4d2d-8561-6f2e9aaa262e_1026x1026.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!75Sf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa55fdcc3-b50f-4d2d-8561-6f2e9aaa262e_1026x1026.jpeg 424w, https://substackcdn.com/image/fetch/$s_!75Sf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa55fdcc3-b50f-4d2d-8561-6f2e9aaa262e_1026x1026.jpeg 848w, https://substackcdn.com/image/fetch/$s_!75Sf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa55fdcc3-b50f-4d2d-8561-6f2e9aaa262e_1026x1026.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!75Sf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa55fdcc3-b50f-4d2d-8561-6f2e9aaa262e_1026x1026.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!75Sf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa55fdcc3-b50f-4d2d-8561-6f2e9aaa262e_1026x1026.jpeg" width="1026" height="1026" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a55fdcc3-b50f-4d2d-8561-6f2e9aaa262e_1026x1026.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1026,&quot;width&quot;:1026,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:122742,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://epochai.substack.com/i/188757701?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa55fdcc3-b50f-4d2d-8561-6f2e9aaa262e_1026x1026.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!75Sf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa55fdcc3-b50f-4d2d-8561-6f2e9aaa262e_1026x1026.jpeg 424w, https://substackcdn.com/image/fetch/$s_!75Sf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa55fdcc3-b50f-4d2d-8561-6f2e9aaa262e_1026x1026.jpeg 848w, https://substackcdn.com/image/fetch/$s_!75Sf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa55fdcc3-b50f-4d2d-8561-6f2e9aaa262e_1026x1026.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!75Sf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa55fdcc3-b50f-4d2d-8561-6f2e9aaa262e_1026x1026.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>We are waiting for API access to evaluate Gemini 3 Deep Think.</p><p>Check out our <a href="https://epoch.ai/benchmarks">benchmarking hub</a> for more, including Gemini 3.1 Pro scores on other benchmarks!</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://epochai.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://epochai.substack.com/subscribe?"><span>Subscribe now</span></a></p><p><br><br></p>]]></content:encoded></item><item><title><![CDATA[Watch or listen — Do AI models cover their costs?]]></title><description><![CDATA[Fri Feb 6th recording now available]]></description><link>https://epochai.substack.com/p/watch-or-listen-do-ai-models-cover</link><guid 
isPermaLink="false">https://epochai.substack.com/p/watch-or-listen-do-ai-models-cover</guid><dc:creator><![CDATA[Epoch AI]]></dc:creator><pubDate>Sat, 07 Feb 2026 01:01:06 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!ZsOK!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fca617831-3128-496f-8aac-33d1fadda48f_176x176.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>RECORDING: <a href="https://www.exponentialview.co/p/do-ai-models-actually-make-enough">Do AI models actually make enough money to cover their costs?</a> Epoch AI x Exponential View</h2><p>Catch up on an event recorded Feb 6th exploring recently published research: <a href="https://epochai.substack.com/p/can-ai-companies-become-profitable">Can AI companies become profitable?</a></p><p>In particular, explore whether OpenAI is profitable. Azeem Azhar sets up the event, then Matt Robinson of <a href="https://www.ai-street.co/">AI Street</a> interviews both <a href="https://epochai.substack.com/">Epoch AI</a>&#8217;s Jaime Sevilla and <a href="https://www.exponentialview.co/">Exponential View</a>&#8217;s Hannah Petrovic &amp; Azeem Azhar to unpack their collaborative analysis.</p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://epochai.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Epoch AI! 
Subscribe for free to receive new posts and support our work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Join this Friday — Do AI models cover their costs?]]></title><description><![CDATA[Fri Feb 6th @9am PT (noon ET / 5pm GMT / 6pm CET)]]></description><link>https://epochai.substack.com/p/join-this-friday-do-ai-models-actually</link><guid isPermaLink="false">https://epochai.substack.com/p/join-this-friday-do-ai-models-actually</guid><dc:creator><![CDATA[Epoch AI]]></dc:creator><pubDate>Thu, 05 Feb 2026 19:30:33 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!ZsOK!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fca617831-3128-496f-8aac-33d1fadda48f_176x176.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Join a live event on Substack this Friday exploring the newly published research <a href="https://epochai.substack.com/p/can-ai-companies-become-profitable">Can AI companies become profitable?</a>, where Matt Robinson of <a href="https://www.ai-street.co/">AI Street</a> interviews both <a href="https://epochai.substack.com/">Epoch AI</a>&#8217;s Jaime Sevilla and <a href="https://www.exponentialview.co/">Exponential View</a>&#8217;s Hannah Petrovic &amp; Azeem Azhar to unpack their collaborative analysis of OpenAI&#8217;s profitability.</p><ul><li><p>You can sign up via this <a href="https://luma.com/sq16cggw">Luma</a> page, which will display the live event link (and send an email when the link is live).</p></li><li><p>Or you can follow <a href="https://www.exponentialview.co/">Exponential View</a> or <a 
href="https://substack.com/@exponentialview">Azeem Azhar</a> on Substack to be notified within Substack when the event is live.</p></li></ul><p>Note that Substack live events require a (free) Substack account.</p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://epochai.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Epoch AI! Subscribe for free to receive new posts and support our work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Kimi K2.5 has highest ECI score among open weight models]]></title><description><![CDATA[Its score of 147 is about on par with o3, Grok 4, and Sonnet 4.5. 
It still lags the overall frontier.]]></description><link>https://epochai.substack.com/p/kimi-k25-has-highest-eci-score-among</link><guid isPermaLink="false">https://epochai.substack.com/p/kimi-k25-has-highest-eci-score-among</guid><pubDate>Wed, 04 Feb 2026 19:01:04 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!f374!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e5f9bc6-3d91-40ff-9dd1-f4eff62c18bf_1024x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!f374!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e5f9bc6-3d91-40ff-9dd1-f4eff62c18bf_1024x1280.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!f374!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e5f9bc6-3d91-40ff-9dd1-f4eff62c18bf_1024x1280.png 424w, https://substackcdn.com/image/fetch/$s_!f374!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e5f9bc6-3d91-40ff-9dd1-f4eff62c18bf_1024x1280.png 848w, https://substackcdn.com/image/fetch/$s_!f374!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e5f9bc6-3d91-40ff-9dd1-f4eff62c18bf_1024x1280.png 1272w, https://substackcdn.com/image/fetch/$s_!f374!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e5f9bc6-3d91-40ff-9dd1-f4eff62c18bf_1024x1280.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!f374!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e5f9bc6-3d91-40ff-9dd1-f4eff62c18bf_1024x1280.png" width="450" height="562.5" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1e5f9bc6-3d91-40ff-9dd1-f4eff62c18bf_1024x1280.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1280,&quot;width&quot;:1024,&quot;resizeWidth&quot;:450,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!f374!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e5f9bc6-3d91-40ff-9dd1-f4eff62c18bf_1024x1280.png 424w, https://substackcdn.com/image/fetch/$s_!f374!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e5f9bc6-3d91-40ff-9dd1-f4eff62c18bf_1024x1280.png 848w, https://substackcdn.com/image/fetch/$s_!f374!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e5f9bc6-3d91-40ff-9dd1-f4eff62c18bf_1024x1280.png 1272w, https://substackcdn.com/image/fetch/$s_!f374!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e5f9bc6-3d91-40ff-9dd1-f4eff62c18bf_1024x1280.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" 
type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Kimi K2.5 set a new record among open-weight models on the Epoch Capabilities Index (ECI), which combines multiple benchmarks onto a single scale.</p><p>Kimi K2.5 also took the lead among open models that we have benchmarked on FrontierMath. Its score of 28% on Tiers 1&#8211;3 is on par with GPT-5 (medium) and Gemini 2.5 Deep Think. 
On Tier 4 it scored only 4% (2/48 problems solved).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kibJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59ac2a5-30d2-44be-b79d-da1566c1ac7e_1024x1280.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kibJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59ac2a5-30d2-44be-b79d-da1566c1ac7e_1024x1280.png 424w, https://substackcdn.com/image/fetch/$s_!kibJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59ac2a5-30d2-44be-b79d-da1566c1ac7e_1024x1280.png 848w, https://substackcdn.com/image/fetch/$s_!kibJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59ac2a5-30d2-44be-b79d-da1566c1ac7e_1024x1280.png 1272w, https://substackcdn.com/image/fetch/$s_!kibJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59ac2a5-30d2-44be-b79d-da1566c1ac7e_1024x1280.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kibJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59ac2a5-30d2-44be-b79d-da1566c1ac7e_1024x1280.png" width="451" height="563.75" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d59ac2a5-30d2-44be-b79d-da1566c1ac7e_1024x1280.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1280,&quot;width&quot;:1024,&quot;resizeWidth&quot;:451,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!kibJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59ac2a5-30d2-44be-b79d-da1566c1ac7e_1024x1280.png 424w, https://substackcdn.com/image/fetch/$s_!kibJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59ac2a5-30d2-44be-b79d-da1566c1ac7e_1024x1280.png 848w, https://substackcdn.com/image/fetch/$s_!kibJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59ac2a5-30d2-44be-b79d-da1566c1ac7e_1024x1280.png 1272w, https://substackcdn.com/image/fetch/$s_!kibJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59ac2a5-30d2-44be-b79d-da1566c1ac7e_1024x1280.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>We evaluated Kimi K2.5 on Fireworks, the developer&#8217;s launch partner. When using third-party providers, performance differences are always a concern. We can note, at least, that our score for GPQA Diamond matches the score in the developer's release materials (87.6%).</p><p>Check out <a href="https://epoch.ai/benchmarks">our website</a> for more benchmark scores, on Kimi K2.5 as well as other models!</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://epochai.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! 
Subscribe to receive the latest from Epoch AI.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Introducing FrontierMath: Open Problems]]></title><description><![CDATA[Benchmarking AI on unsolved research problems that have eluded mathematicians]]></description><link>https://epochai.substack.com/p/introducing-frontiermath-open-problems</link><guid isPermaLink="false">https://epochai.substack.com/p/introducing-frontiermath-open-problems</guid><pubDate>Tue, 27 Jan 2026 18:45:24 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!dORH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F533ad223-63a2-4ed7-894b-c86959cc43f1_1600x900.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dORH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F533ad223-63a2-4ed7-894b-c86959cc43f1_1600x900.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dORH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F533ad223-63a2-4ed7-894b-c86959cc43f1_1600x900.png 424w, https://substackcdn.com/image/fetch/$s_!dORH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F533ad223-63a2-4ed7-894b-c86959cc43f1_1600x900.png 848w, 
https://substackcdn.com/image/fetch/$s_!dORH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F533ad223-63a2-4ed7-894b-c86959cc43f1_1600x900.png 1272w, https://substackcdn.com/image/fetch/$s_!dORH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F533ad223-63a2-4ed7-894b-c86959cc43f1_1600x900.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dORH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F533ad223-63a2-4ed7-894b-c86959cc43f1_1600x900.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/533ad223-63a2-4ed7-894b-c86959cc43f1_1600x900.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dORH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F533ad223-63a2-4ed7-894b-c86959cc43f1_1600x900.png 424w, https://substackcdn.com/image/fetch/$s_!dORH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F533ad223-63a2-4ed7-894b-c86959cc43f1_1600x900.png 848w, 
https://substackcdn.com/image/fetch/$s_!dORH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F533ad223-63a2-4ed7-894b-c86959cc43f1_1600x900.png 1272w, https://substackcdn.com/image/fetch/$s_!dORH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F533ad223-63a2-4ed7-894b-c86959cc43f1_1600x900.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>This work was supported by a grant from Schmidt Sciences.</em></p><div><hr></div><p>AI math capabilities have come far and 
fast. In mid-2024, <a href="https://epoch.ai/benchmarks/math-level-5">high school math</a> was still a challenge. By the end of 2025, AI systems were solving <a href="https://epoch.ai/frontiermath/about">extremely hard problems</a> designed to be solvable only by top human experts. As we write this, it seems likely that AI systems will soon be able to solve problems that <em>no</em> human has solved before.</p><p>Indeed, there are already glimmers of this, e.g. AI systems solving several <a href="https://github.com/teorth/erdosproblems/wiki/AI-contributions-to-Erd%C5%91s-problems">previously unsolved Erd&#337;s problems</a>. However, these results have been difficult to contextualize. Are these problems mathematically significant? How hard had humans previously tried to solve them? Does this say anything new about AI capabilities?</p><p>Today we are releasing a pilot version of a new benchmark, FrontierMath: Open Problems, which we hope will shed light on this topic. The benchmark consists of open problems from research mathematics that professional mathematicians have tried and failed to solve. To facilitate evaluation at scale, we include only problems for which proposed solutions can be verified programmatically.</p><p>At the time of release, to the best of our knowledge, none of these problems have been solved by humans or AI systems. If an AI system solves any of them, it will be a meaningful advance in the frontier of human knowledge. 
Moreover, we will be able to say something about just <em>how</em> meaningful: contributing mathematicians have characterized the significance of the problems, ranging from moderately interesting results to major breakthroughs.</p><p>The pilot release consists of 14 problems with:</p><ul><li><p>Write-ups explaining each problem&#8217;s significance and difficulty</p></li><li><p>Precise prompts that can be given to AI systems &#8211; try them yourself!</p></li><li><p>Initial attempts by AI systems to solve the problems</p></li></ul><p>We will add more problems in the coming months and are actively commissioning contributions. See our <a href="https://docs.google.com/forms/d/e/1FAIpQLSckGHMY4ofgKfvf39Ue8fDZAbXJqN9pTcf5oLP3f3y-chE0Bg/viewform">problem proposal form</a> if you are interested in contributing.</p><p>The verifiers, the programs that evaluate candidate solutions, are available for a fee. We structure access this way to help defray the cost of expanding the benchmark. Problems are labor-intensive to create, and each solution effectively reduces the number of problems in the benchmark, so we ask those who wish to use the verifiers to partner with us to help fund further expansion. We commit to granting access uniformly, and not exclusively to any entity. 
Inquire at <a href="mailto:math@epoch.ai">math@epoch.ai</a>.</p><h4>Example problems</h4><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f29b1fb5-468e-4554-a975-d03e8e67ccbf_1200x1200.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8829287c-3944-4465-ae00-05afdecc2e5f_1200x1200.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/17116dc2-e310-48d0-af8f-2189ce59cbfd_1200x1200.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5649478e-2ed4-446e-bd07-e4b5378d241a_1200x1200.png&quot;}],&quot;caption&quot;:&quot;&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f2fea703-c956-484c-839d-f5a69ac741f0_1456x1456.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p></p><h4>Problems are mathematically meaningful, diverse, and hard</h4><p>Problems are contributed by professional mathematicians. Contributors suggest problems they are familiar with from their own research, and which they would be interested to see solved. Contributors rate the meaningfulness (notability) of solutions, ranging from results of moderate interest within a subfield all the way up to major breakthroughs. We aim for problems to be evenly distributed across these tiers.</p><p>Our goal is to find problems that are meaningful to mathematicians on their own terms. We do not select problems to be difficult for AI per se. 
Unlike problems contrived for a test, these problems are core to doing research math.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> We want to know whether AI systems can solve them: if they can, so be it.</p><p>That said, at least for humans, the problems are hard. Contributors rate problems with estimates of how many mathematicians have made serious attempts to solve them, with responses ranging from 2&#8211;4 mathematicians to 50&#8211;100.</p><p>Contributors also take a guess at human time-to-solve, specifically how long it would take the mathematician most capable of solving the problem to have a 50% chance of solving it, if they worked full time. Responses range from 1&#8211;4 weeks to 3&#8211;10 years.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> In other words, the human baseline is high.</p><p>Problems cover a range of mathematical topic areas. The pilot set has a tilt toward combinatorics and number theory, where we happen to have found the most problems amenable to automatic verification. 
We aim to keep up the diversity of topic areas as we expand the benchmark.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ygQ0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef9a7d8-a9ea-412d-a142-e645e6f74a54_800x799.svg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ygQ0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef9a7d8-a9ea-412d-a142-e645e6f74a54_800x799.svg 424w, https://substackcdn.com/image/fetch/$s_!ygQ0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef9a7d8-a9ea-412d-a142-e645e6f74a54_800x799.svg 848w, https://substackcdn.com/image/fetch/$s_!ygQ0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef9a7d8-a9ea-412d-a142-e645e6f74a54_800x799.svg 1272w, https://substackcdn.com/image/fetch/$s_!ygQ0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef9a7d8-a9ea-412d-a142-e645e6f74a54_800x799.svg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ygQ0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef9a7d8-a9ea-412d-a142-e645e6f74a54_800x799.svg" width="800" height="799" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eef9a7d8-a9ea-412d-a142-e645e6f74a54_800x799.svg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:799,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ygQ0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef9a7d8-a9ea-412d-a142-e645e6f74a54_800x799.svg 424w, https://substackcdn.com/image/fetch/$s_!ygQ0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef9a7d8-a9ea-412d-a142-e645e6f74a54_800x799.svg 848w, https://substackcdn.com/image/fetch/$s_!ygQ0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef9a7d8-a9ea-412d-a142-e645e6f74a54_800x799.svg 1272w, https://substackcdn.com/image/fetch/$s_!ygQ0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef9a7d8-a9ea-412d-a142-e645e6f74a54_800x799.svg 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div>
<p>See the <a href="https://docs.google.com/forms/d/e/1FAIpQLSckGHMY4ofgKfvf39Ue8fDZAbXJqN9pTcf5oLP3f3y-chE0Bg/viewform">problem proposal form</a> for more details on the problem sourcing process.</p><h4>Solutions can be verified automatically</h4><p>Evaluating AI solutions to unsolved math problems is a major logistical challenge. Math research typically proceeds via natural-language papers. Evaluating such papers is labor intensive and error-prone even for humans. While AI systems have made progress at evaluating natural-language mathematics, we cannot rely on the accuracy of their evaluations for advanced material.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p><p>Our approach is to find problems where, even though no solution is currently known, a proposed solution can be checked by a relatively straightforward computer program running on a typical computer. 
It is not obvious that such verifiable problems exist, but they do.</p><p>For example, some problems ask for a very concrete mathematical object. <a href="https://epoch.ai/frontiermath/open-problems/inverse-galois">One such case</a> asks for a polynomial with a certain property.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> It is quick to check if a given polynomial has the desired property, but finding one is beyond the reach of any known technique, including highly optimized, large-scale search. The problem&#8217;s meaningfulness stems from the fact that a conceptual approach appears to be required to construct the desired object.</p><p>In <a href="https://epoch.ai/frontiermath/open-problems/ramsey-book-graphs">other cases</a>, we want a construction that works for all positive integers. We can&#8217;t verify this in general, but we can ask for an <em>algorithm</em> that takes an integer and returns a construction for that integer. We can verify the algorithm on a challenge set of integers where no constructions are currently known and where the integers are large enough that search is intractable. Success here gives strong evidence that the algorithm implements a general solution.</p><p>The downside is that this approach limits what we can ask about. Our ideal would be to take a random sample of all unsolved math problems, but this constraint introduces a bias. The benchmark problems tend to be relatively concrete, and may not require fuzzier mathematical pursuits like &#8220;theory building&#8221;. 
Still, we have been pleasantly surprised by how readily mathematicians have been able to come up with a diversity of mathematically meaningful problems that satisfy this constraint.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a></p><h4>Some benchmark problems may not be solvable</h4><p>One risk inherent to this benchmark is that problems might not have solutions as stated. This issue takes two forms: either the desired mathematical object does not exist, or it does exist but is too large for the verifier to certify it as valid.</p><p>We don&#8217;t think such cases undermine interpretation of the benchmark overall. Successful solves are clearly meaningful. Failure to solve <em>all</em> problems at a given difficulty level is also probably meaningful, and will become more so as we grow the problem set beyond the current pilot size. We thus encourage interpretations of the form, &#8220;We have seen several examples of AI solving moderately interesting open math problems, but not yet any examples of major advances.&#8221;</p><p>That said, we strove to include only problems where a solution is at least <em>likely</em> to exist. 
For some problems there are heuristic reasons to believe that a solution of the desired form exists, and, in all cases, there is at least an absence of any reason to believe that no solution exists.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a></p><p>Our target was an assessment from the problem contributor of at least an 80% probability that the problem was solvable as stated.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a> However, mathematicians often expressed high uncertainty when assigning a probability in the 50&#8211;80% range.</p><h4>Solved problems will be removed from the benchmark</h4><p>If a problem is solved &#8212; whether by humans or AI &#8212; the result will be published.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a><sup> </sup>Thus, any future AI system attempting the same problem would simply be able to look up the solution. For this reason, we will remove solved problems from the benchmark.</p><p>While this &#8220;first-to-solve&#8221; set-up is atypical, we don&#8217;t think it undermines the value of the Open Problems benchmark as a whole. The benchmark won&#8217;t give a score that can be used to compare models&#8217; ability to solve open problems, but it will tell us whether AI systems are capable of solving problems of a given difficulty and significance.</p><h4>The benchmark helps track AI &#8220;research taste&#8221;</h4><p>Most immediately, this benchmark asks whether AI can solve unsolved math problems. 
We think this also helps us track fuzzier concepts like &#8220;research taste&#8221; &#8212; how good AI systems are at choosing the right direction to pursue, noticing the right patterns, etc.</p><p>Such capabilities seem likely to be relevant for theoretical math, where the right ideas can be especially difficult to find. If AI systems are able to solve math problems that have resisted significant human effort, then they might be moving toward superhuman research taste in general.</p><p>It&#8217;s by no means a guarantee. Perhaps, like chess or Go, it will turn out that math&#8217;s formal nature makes it unusually &#8220;easy&#8221; for AI systems. Or perhaps AI systems will solve these problems in ways that we regard as relatively inelegant.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a><sup> </sup>Still, we are glad to add this benchmark to our toolbox for tracking harder-to-quantify capabilities.</p><h4>We hope to see strong attempts to get AI systems to solve these problems</h4><p>Our primary goal is to understand the frontier of AI math capabilities. However, it is not clear how best to elicit AI capabilities on the benchmark problems.</p><p>So far we have tried simply prompting GPT-5.2 Pro and Gemini 3 Deep Think in their web apps.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-10" href="#footnote-10" target="_self">10</a> The results of this are shown on the individual problem pages. In this setting the models are generally capable of solving &#8220;warm-up&#8221; problems: variants of the open problems where solutions are known. This tells us that they understand the instructions and are familiar with the subject area. It also helps test the verifiers.</p><p>But, when given the actual open problems, the models don&#8217;t show much promise. 
Sometimes they seem set on trying optimization approaches instead of the more conceptual approaches that are likely necessary. Other times they recognize the problem as open and simply give up.</p><p>It seems likely that more &#8220;thinking&#8221; will be an important ingredient. Models already plan, execute, revise, and iterate &#8212; but they may need a lot more time and resources to do so if they are going to have a chance at these problems. Exactly how to enable them in this way is an open research question.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-11" href="#footnote-11" target="_self">11</a> </p><p>We&#8217;re working on a scaffold that can facilitate this sort of extended attempt, and we hope others will try as well. Contact us at <a href="mailto:math@epoch.ai">math@epoch.ai</a> with any questions.</p><h4>Appendix: caveats on future AI solutions</h4><p>This benchmark is essentially a preregistration of the interestingness of a set of math problems. That said, if these problems end up being solved in certain ways, some caveats may be warranted. Here we try to preregister these caveats as well, to help mitigate concerns about moving the goalposts later.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-12" href="#footnote-12" target="_self">12</a></p><p><strong>Collaborations.</strong> There have already been productive mathematical collaborations between humans and AI systems. A <a href="https://arxiv.org/abs/2601.07222">typical pattern</a> is for the AI system to work out some examples, and for the human to generalize to a complete solution. Indeed, using computers to search for useful examples long predates LLM-based AI systems. We will have to assess the division of labor behind any such solution. 
The more an AI system is responsible for the <em>conceptual</em> part, the more that will be indicative of advancing capabilities.</p><p><strong>Prior art.</strong> AI systems&#8217; breadth of mathematical knowledge may already outstrip that of top humans. It is possible that an existing result has already done most of the work to solve a problem and the mathematicians who attempted the problem were simply unaware of it. This is unlikely for better-known problems, but, in any case, an AI solution relying on such a result would be significantly less indicative of advancing capabilities. Of course, if an AI system adapts known results in new ways, that needs no caveat: much of math research has this character.</p><p><strong>Conventional compute.</strong> If an AI system suggests an optimized, parallelizable search algorithm and an AI company devotes a supercomputer-month to executing the search, then a problem could be solved with less mathematical insight than expected. While we selected problems where brute force alone is unlikely to succeed, it is hard to be certain. Most problems have not had industrial-scale resources brought to bear on them.</p><p><strong>Verifier misspecification.</strong> A verifier may accept an AI solution even though the solution does not represent the broader conceptual achievement that the contributing mathematician intended the verifier to identify. In simple cases these misspecifications will be no more than bugs. In more complex cases, it may be that a solution was harder to verify than the mathematician initially believed. We will report on any such cases and fix the verifiers when possible.</p><p><strong>Sample bias.</strong> We must contend with the bias introduced by the verifiability criterion. Previous caveats aside, progress on this benchmark really should amount to solving meaningful open math problems. 
But it may turn out that AI systems are uniquely suited to solving the sort of meaningful open math problems that are amenable to automatic verification. If so, progress on the benchmark may not generalize well to progress in other areas of math.</p><p></p><p><em>Originally published on <a href="https://epoch.ai/frontiermath/open-problems/about">Epoch AI&#8217;s website</a>.</em></p><div><hr></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>In other words, this benchmark has high construct validity.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Mathematicians generally emphasized that these time-to-solve guesses were, quite possibly, no better than noise. 
However, we believe that the large range gives us at least a bit of information.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>See, e.g., <a href="https://proofcorpus.ai/">here</a> for work on AI systems grading prose proofs.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Namely, a polynomial whose Galois group is the Mathieu group M<sub>23</sub>. Mathematicians have tried and failed to find such a polynomial, but still believe that one exists.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>A different approach would have been to pursue full formalization, most likely asking AI systems to implement solutions in Lean. We decided not to take this approach for three reasons, each having to do with the fact that Lean is still maturing as a platform. First, many subfields&#8217; foundations are not yet formalized in Lean. Second, even if a problem statement can be formalized, it is possible that the concepts required to solve it cannot be. Third, Lean is far less battle-tested than other programming paradigms. In particular, there are likely still a fair number of inscrutable bugs which models could exploit. 
For now we prefer the simplicity of one-off verification programs, even if in the long run Lean (or another formal system) may become a more practical and scalable solution.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>In particular, we don&#8217;t include problems asking for counterexamples to conjectures where mathematicians generally believe that the conjecture is true. For example, exhibiting an even number that cannot be written as the sum of two prime numbers would disprove <a href="https://en.wikipedia.org/wiki/Goldbach%27s_conjecture">Goldbach&#8217;s conjecture</a>. Such a counterexample would be easy to verify, at least if not astronomically large. But if mathematicians are correct in believing that Goldbach&#8217;s conjecture is true, then no such counterexample exists. Failure of an AI system to find such a counterexample would tell us nothing.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>This included consideration of any restrictions on solution size necessitated for the sake of making the problem automatically verifiable. For example, not just that such-and-such a mathematical object exists, but that one exists that is small enough for the verifier to handle.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>Indeed, a condition for any entity purchasing access to the verifiers is that they must notify Epoch and the problem contributor about any success from the verifier. 
They are given joint publication rights of such a solution with the problem contributor and Epoch. Note that problem contributors are not restricted in any way from pursuing their research, including on the problems they have contributed to the benchmark.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p>Not that this would invalidate the solutions. And isn&#8217;t beauty truth, truth beauty?</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-10" href="#footnote-anchor-10" class="footnote-number" contenteditable="false" target="_self">10</a><div class="footnote-content"><p>Simple prompting in web apps is often sufficient to elicit state-of-the-art performance, e.g. from <a href="https://epoch.ai/blog/deep-think-math">Gemini 2.5 Deep Think</a> and <a href="https://x.com/EpochAIResearch/status/2014769359747744200?s=20">GPT-5.2 Pro</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-11" href="#footnote-anchor-11" class="footnote-number" contenteditable="false" target="_self">11</a><div class="footnote-content"><p>One whimsical prompt used in AlphaEvolve tells the model to believe in itself. Who knows!</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-12" href="#footnote-anchor-12" class="footnote-number" contenteditable="false" target="_self">12</a><div class="footnote-content"><p>To paraphrase Douglas Adams: We love goalposts. 
We love the whooshing noise they make as they go by.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Epoch's Latest AI Trends]]></title><description><![CDATA[We&#8217;ve added new trends & figures categories to our Trends page!]]></description><link>https://epochai.substack.com/p/epochs-latest-ai-trends</link><guid isPermaLink="false">https://epochai.substack.com/p/epochs-latest-ai-trends</guid><dc:creator><![CDATA[Luke Emberson]]></dc:creator><pubDate>Fri, 23 Jan 2026 22:04:00 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!duMG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08e4687b-4375-4e42-82a8-a26fb3289b67_1889x2787.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>We&#8217;ve added new trends &amp; figures categories to our Trends page! </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!duMG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08e4687b-4375-4e42-82a8-a26fb3289b67_1889x2787.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!duMG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08e4687b-4375-4e42-82a8-a26fb3289b67_1889x2787.png 424w, https://substackcdn.com/image/fetch/$s_!duMG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08e4687b-4375-4e42-82a8-a26fb3289b67_1889x2787.png 848w, https://substackcdn.com/image/fetch/$s_!duMG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08e4687b-4375-4e42-82a8-a26fb3289b67_1889x2787.png 1272w, 
https://substackcdn.com/image/fetch/$s_!duMG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08e4687b-4375-4e42-82a8-a26fb3289b67_1889x2787.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!duMG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08e4687b-4375-4e42-82a8-a26fb3289b67_1889x2787.png" width="1456" height="2148" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/08e4687b-4375-4e42-82a8-a26fb3289b67_1889x2787.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:2148,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:530502,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://epochai.substack.com/i/185581811?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08e4687b-4375-4e42-82a8-a26fb3289b67_1889x2787.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!duMG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08e4687b-4375-4e42-82a8-a26fb3289b67_1889x2787.png 424w, https://substackcdn.com/image/fetch/$s_!duMG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08e4687b-4375-4e42-82a8-a26fb3289b67_1889x2787.png 848w, 
https://substackcdn.com/image/fetch/$s_!duMG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08e4687b-4375-4e42-82a8-a26fb3289b67_1889x2787.png 1272w, https://substackcdn.com/image/fetch/$s_!duMG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08e4687b-4375-4e42-82a8-a26fb3289b67_1889x2787.png 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><p>Do you know:</p><ul><li><p>How fast LLM inference prices are falling?</p></li><li><p>How fast compute stocks are 
growing?</p></li><li><p>How long it takes to build a GW scale data center?</p></li></ul><p>Find out on our <a href="https://epoch.ai/trends">Trends</a> page!</p>]]></content:encoded></item><item><title><![CDATA[New record on FrontierMath Tier 4]]></title><description><![CDATA[GPT-5.2 Pro (manual run) scored 31%, a substantial jump over the previous high score of 19%.]]></description><link>https://epochai.substack.com/p/new-record-on-frontiermath-tier-4</link><guid isPermaLink="false">https://epochai.substack.com/p/new-record-on-frontiermath-tier-4</guid><pubDate>Fri, 23 Jan 2026 19:14:14 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!T4-V!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c5eac-97a7-40a2-81df-0cea722c3a16_2048x2560.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!T4-V!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c5eac-97a7-40a2-81df-0cea722c3a16_2048x2560.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!T4-V!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c5eac-97a7-40a2-81df-0cea722c3a16_2048x2560.png 424w, https://substackcdn.com/image/fetch/$s_!T4-V!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c5eac-97a7-40a2-81df-0cea722c3a16_2048x2560.png 848w, https://substackcdn.com/image/fetch/$s_!T4-V!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c5eac-97a7-40a2-81df-0cea722c3a16_2048x2560.png 
1272w, https://substackcdn.com/image/fetch/$s_!T4-V!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c5eac-97a7-40a2-81df-0cea722c3a16_2048x2560.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!T4-V!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c5eac-97a7-40a2-81df-0cea722c3a16_2048x2560.png" width="434" height="542.5" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/549c5eac-97a7-40a2-81df-0cea722c3a16_2048x2560.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1820,&quot;width&quot;:1456,&quot;resizeWidth&quot;:434,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!T4-V!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c5eac-97a7-40a2-81df-0cea722c3a16_2048x2560.png 424w, https://substackcdn.com/image/fetch/$s_!T4-V!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c5eac-97a7-40a2-81df-0cea722c3a16_2048x2560.png 848w, https://substackcdn.com/image/fetch/$s_!T4-V!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c5eac-97a7-40a2-81df-0cea722c3a16_2048x2560.png 1272w, 
https://substackcdn.com/image/fetch/$s_!T4-V!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c5eac-97a7-40a2-81df-0cea722c3a16_2048x2560.png 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><p>We evaluated GPT-5.2 Pro manually on the ChatGPT website. We did this after encountering timeout issues with the API in our scaffold. We&#8217;re working to resolve these issues, but a manual evaluation seemed worthwhile in the meantime.</p><p>Prior to this run, 13 problems from Tier 4 had been solved by any model ever. 
GPT-5.2 Pro solved 11 of those, and 4 more besides. Its total for this run is thus 15/48 (31%), and the pass@the-kitchen-sink for Tier 4 (all problems solved ever) is now 17/48 (35%).</p><p>OpenAI has exclusive access to 28 Tier 4 problems and their solutions, with Epoch holding out the other 20 problems. GPT-5.2 Pro solved 5 (18%) of the non-held-out set and 10 (50%) of the held-out set. In other words: no evidence of over-fitting.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5CgA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11ed0ad4-558e-47dc-ace1-8ba31f5bb4a4_2048x2560.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5CgA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11ed0ad4-558e-47dc-ace1-8ba31f5bb4a4_2048x2560.png 424w, https://substackcdn.com/image/fetch/$s_!5CgA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11ed0ad4-558e-47dc-ace1-8ba31f5bb4a4_2048x2560.png 848w, https://substackcdn.com/image/fetch/$s_!5CgA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11ed0ad4-558e-47dc-ace1-8ba31f5bb4a4_2048x2560.png 1272w, https://substackcdn.com/image/fetch/$s_!5CgA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11ed0ad4-558e-47dc-ace1-8ba31f5bb4a4_2048x2560.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!5CgA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11ed0ad4-558e-47dc-ace1-8ba31f5bb4a4_2048x2560.png" width="475" height="593.75" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/11ed0ad4-558e-47dc-ace1-8ba31f5bb4a4_2048x2560.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1820,&quot;width&quot;:1456,&quot;resizeWidth&quot;:475,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!5CgA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11ed0ad4-558e-47dc-ace1-8ba31f5bb4a4_2048x2560.png 424w, https://substackcdn.com/image/fetch/$s_!5CgA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11ed0ad4-558e-47dc-ace1-8ba31f5bb4a4_2048x2560.png 848w, https://substackcdn.com/image/fetch/$s_!5CgA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11ed0ad4-558e-47dc-ace1-8ba31f5bb4a4_2048x2560.png 1272w, https://substackcdn.com/image/fetch/$s_!5CgA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11ed0ad4-558e-47dc-ace1-8ba31f5bb4a4_2048x2560.png 1456w" sizes="100vw"></picture></div></a></figure></div><p>During evaluation we found issues with two problems. 
GPT-5.2 Pro and GPT-5.2 (high) should have been credited with solving both, and GPT-5.2 (xhigh), GPT-5.2 (medium), and GPT-5 Pro should each have been credited with solving one. We&#8217;ve fixed the issues and updated the scores on our hub.</p><p>One of the newly solved problems was from Joel Hass, whose research is in low-dimensional topology and geometry. Afterward, he suggested we try a different, more challenging formulation of the same problem. 
GPT-5.2 Pro solved that too.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JeES!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffddc43ca-e784-4f2e-bb8d-3078f51fee5b_2048x2048.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JeES!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffddc43ca-e784-4f2e-bb8d-3078f51fee5b_2048x2048.png 424w, https://substackcdn.com/image/fetch/$s_!JeES!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffddc43ca-e784-4f2e-bb8d-3078f51fee5b_2048x2048.png 848w, https://substackcdn.com/image/fetch/$s_!JeES!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffddc43ca-e784-4f2e-bb8d-3078f51fee5b_2048x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!JeES!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffddc43ca-e784-4f2e-bb8d-3078f51fee5b_2048x2048.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JeES!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffddc43ca-e784-4f2e-bb8d-3078f51fee5b_2048x2048.png" width="443" height="443" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fddc43ca-e784-4f2e-bb8d-3078f51fee5b_2048x2048.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:443,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!JeES!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffddc43ca-e784-4f2e-bb8d-3078f51fee5b_2048x2048.png 424w, https://substackcdn.com/image/fetch/$s_!JeES!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffddc43ca-e784-4f2e-bb8d-3078f51fee5b_2048x2048.png 848w, https://substackcdn.com/image/fetch/$s_!JeES!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffddc43ca-e784-4f2e-bb8d-3078f51fee5b_2048x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!JeES!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffddc43ca-e784-4f2e-bb8d-3078f51fee5b_2048x2048.png 1456w" sizes="100vw"></picture></div></a></figure></div><p>Another problem, by number theorist Ken Ono, was initially solved by GPT-5.2 (xhigh), and also by GPT-5.2 Pro. 
Ken gave the solution a generally favorable review, though noted that the rigor of its prose explanation was somewhat lacking.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tqaz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0ec5f76-a4d1-411f-b596-126f15a5defd_2048x2048.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tqaz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0ec5f76-a4d1-411f-b596-126f15a5defd_2048x2048.png 424w, https://substackcdn.com/image/fetch/$s_!tqaz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0ec5f76-a4d1-411f-b596-126f15a5defd_2048x2048.png 848w, https://substackcdn.com/image/fetch/$s_!tqaz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0ec5f76-a4d1-411f-b596-126f15a5defd_2048x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!tqaz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0ec5f76-a4d1-411f-b596-126f15a5defd_2048x2048.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tqaz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0ec5f76-a4d1-411f-b596-126f15a5defd_2048x2048.png" width="441" height="441" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c0ec5f76-a4d1-411f-b596-126f15a5defd_2048x2048.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:441,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!tqaz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0ec5f76-a4d1-411f-b596-126f15a5defd_2048x2048.png 424w, https://substackcdn.com/image/fetch/$s_!tqaz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0ec5f76-a4d1-411f-b596-126f15a5defd_2048x2048.png 848w, https://substackcdn.com/image/fetch/$s_!tqaz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0ec5f76-a4d1-411f-b596-126f15a5defd_2048x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!tqaz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0ec5f76-a4d1-411f-b596-126f15a5defd_2048x2048.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>Another newly solved problem was by number theorist Dan Romik. 
He was impressed.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UWlJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57857dde-f439-4ccb-90a0-222264cfcf87_2048x2048.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UWlJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57857dde-f439-4ccb-90a0-222264cfcf87_2048x2048.png 424w, https://substackcdn.com/image/fetch/$s_!UWlJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57857dde-f439-4ccb-90a0-222264cfcf87_2048x2048.png 848w, https://substackcdn.com/image/fetch/$s_!UWlJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57857dde-f439-4ccb-90a0-222264cfcf87_2048x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!UWlJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57857dde-f439-4ccb-90a0-222264cfcf87_2048x2048.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UWlJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57857dde-f439-4ccb-90a0-222264cfcf87_2048x2048.png" width="447" height="447" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/57857dde-f439-4ccb-90a0-222264cfcf87_2048x2048.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:447,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!UWlJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57857dde-f439-4ccb-90a0-222264cfcf87_2048x2048.png 424w, https://substackcdn.com/image/fetch/$s_!UWlJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57857dde-f439-4ccb-90a0-222264cfcf87_2048x2048.png 848w, https://substackcdn.com/image/fetch/$s_!UWlJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57857dde-f439-4ccb-90a0-222264cfcf87_2048x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!UWlJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F57857dde-f439-4ccb-90a0-222264cfcf87_2048x2048.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>A pair of problems by Jay Pantone, who works in analytic combinatorics, were solved earlier: one by GPT-5 and one by GPT-5.1. GPT-5.2 Pro solved these as well. 
The solutions were both valid, but Jay noted that both used numerical shortcuts that he didn&#8217;t intend.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!W9hF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed927e09-e4b4-448c-999f-2a013a900f32_2048x2048.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!W9hF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed927e09-e4b4-448c-999f-2a013a900f32_2048x2048.png 424w, https://substackcdn.com/image/fetch/$s_!W9hF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed927e09-e4b4-448c-999f-2a013a900f32_2048x2048.png 848w, https://substackcdn.com/image/fetch/$s_!W9hF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed927e09-e4b4-448c-999f-2a013a900f32_2048x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!W9hF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed927e09-e4b4-448c-999f-2a013a900f32_2048x2048.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!W9hF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed927e09-e4b4-448c-999f-2a013a900f32_2048x2048.png" width="443" height="443" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ed927e09-e4b4-448c-999f-2a013a900f32_2048x2048.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:443,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!W9hF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed927e09-e4b4-448c-999f-2a013a900f32_2048x2048.png 424w, https://substackcdn.com/image/fetch/$s_!W9hF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed927e09-e4b4-448c-999f-2a013a900f32_2048x2048.png 848w, https://substackcdn.com/image/fetch/$s_!W9hF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed927e09-e4b4-448c-999f-2a013a900f32_2048x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!W9hF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed927e09-e4b4-448c-999f-2a013a900f32_2048x2048.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>What remains unsolved? 
One author thinks models get his problem wrong because they make a plausible assumption without trying to prove it. If they tried to prove it &#8212; as he had to, when he encountered the problem in his own research &#8212; they might realize the truth is more subtle.</p><p>Check <a href="https://epoch.ai/frontiermath">our website</a> for more info on FrontierMath and analysis of AI math capabilities!</p>]]></content:encoded></item><item><title><![CDATA[Polling on AI Usage]]></title><description><![CDATA[We collaborated with the polling firm Blue Rose Research to survey 5,660 Americans about their AI usage habits in late 2025.]]></description><link>https://epochai.substack.com/p/polling-on-ai-usage</link><guid isPermaLink="false">https://epochai.substack.com/p/polling-on-ai-usage</guid><pubDate>Wed, 21 Jan 2026 10:40:48 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!KfCo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba9bbe0c-f619-407e-a77d-980612d86f43_1600x1678.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>A key takeaway: A majority of Americans use AI on a weekly basis, with 35% using ChatGPT, 24% Gemini, and 13% Meta AI. 
However, less than ten percent of Americans paid for a subscription, with OpenAI again leading at 4.6%.</p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/jpeg&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ba9bbe0c-f619-407e-a77d-980612d86f43_1600x1678.jpeg&quot;},{&quot;type&quot;:&quot;image/jpeg&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4a2e3fa1-9e4f-43c0-8fd2-f90aa9be8f03_1600x1534.jpeg&quot;}],&quot;caption&quot;:&quot;&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fb5f01f6-c73c-4ac5-848f-825fee0dfa6c_1456x720.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p>We'll be posting more of our results over the next few weeks, but you can also view everything on <a href="https://epoch.ai/data/polling">our website</a> right now. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://epochai.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! 
Subscribe to receive the latest from Epoch AI.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[The Epoch AI 2025 Impact Report is out!]]></title><description><![CDATA[2025 was full of achievements]]></description><link>https://epochai.substack.com/p/epoch-ai-2025-impact-report-is-out</link><guid isPermaLink="false">https://epochai.substack.com/p/epoch-ai-2025-impact-report-is-out</guid><dc:creator><![CDATA[Epoch AI]]></dc:creator><pubDate>Fri, 16 Jan 2026 19:44:13 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!2IDG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc7e86070-47e1-482e-b53d-dcef4fa3c3ed_1024x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>We published our <a href="https://epoch.ai/blog/epoch-impact-report-2025">2025 Impact Report</a> today. </p><p>The AI industry is scaling exponentially &#8212; investment, compute, data center buildouts. 
So, it turns out, is demand for making sense of it all.</p><p>See how we&#8217;ve kept up!</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2IDG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc7e86070-47e1-482e-b53d-dcef4fa3c3ed_1024x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2IDG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc7e86070-47e1-482e-b53d-dcef4fa3c3ed_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!2IDG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc7e86070-47e1-482e-b53d-dcef4fa3c3ed_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!2IDG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc7e86070-47e1-482e-b53d-dcef4fa3c3ed_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!2IDG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc7e86070-47e1-482e-b53d-dcef4fa3c3ed_1024x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2IDG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc7e86070-47e1-482e-b53d-dcef4fa3c3ed_1024x1024.png" width="1024" height="1024" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c7e86070-47e1-482e-b53d-dcef4fa3c3ed_1024x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:70133,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://epochai.substack.com/i/184799071?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc7e86070-47e1-482e-b53d-dcef4fa3c3ed_1024x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2IDG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc7e86070-47e1-482e-b53d-dcef4fa3c3ed_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!2IDG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc7e86070-47e1-482e-b53d-dcef4fa3c3ed_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!2IDG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc7e86070-47e1-482e-b53d-dcef4fa3c3ed_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!2IDG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc7e86070-47e1-482e-b53d-dcef4fa3c3ed_1024x1024.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" 
height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Have a look at our <a href="https://epoch.ai/blog/epoch-impact-report-2025">2025 Impact Report</a>, which includes our plans for 2026! </p><div><hr></div><p>We are raising $3M to execute a more ambitious version of our plans. Interested in supporting us? Donations of any size are appreciated. </p><p>If you have questions, our DMs are open. 
You can learn more and donate through our <a href="https://epoch.ai/donate">Donate</a> page or email donate@epoch.ai.</p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://epochai.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free &amp; choose which updates to receive.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Introducing the AI Chip Sales Data Explorer]]></title><description><![CDATA[We announce our new AI Chip Sales data explorer, which uses financial reports, company disclosures, and more to estimate compute, power usage, and spending over time for a wide variety of AI chips.]]></description><link>https://epochai.substack.com/p/introducing-the-ai-chip-sales-data</link><guid isPermaLink="false">https://epochai.substack.com/p/introducing-the-ai-chip-sales-data</guid><dc:creator><![CDATA[Epoch AI]]></dc:creator><pubDate>Wed, 14 Jan 2026 11:49:16 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!YSWq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0cba8de-00cc-4f41-80a3-80168f2d792b_1600x900.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" 
href="https://substackcdn.com/image/fetch/$s_!YSWq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0cba8de-00cc-4f41-80a3-80168f2d792b_1600x900.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YSWq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0cba8de-00cc-4f41-80a3-80168f2d792b_1600x900.png 424w, https://substackcdn.com/image/fetch/$s_!YSWq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0cba8de-00cc-4f41-80a3-80168f2d792b_1600x900.png 848w, https://substackcdn.com/image/fetch/$s_!YSWq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0cba8de-00cc-4f41-80a3-80168f2d792b_1600x900.png 1272w, https://substackcdn.com/image/fetch/$s_!YSWq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0cba8de-00cc-4f41-80a3-80168f2d792b_1600x900.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YSWq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0cba8de-00cc-4f41-80a3-80168f2d792b_1600x900.png" width="1456" height="819" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e0cba8de-00cc-4f41-80a3-80168f2d792b_1600x900.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:34371,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://epochai.substack.com/i/184534937?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0cba8de-00cc-4f41-80a3-80168f2d792b_1600x900.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YSWq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0cba8de-00cc-4f41-80a3-80168f2d792b_1600x900.png 424w, https://substackcdn.com/image/fetch/$s_!YSWq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0cba8de-00cc-4f41-80a3-80168f2d792b_1600x900.png 848w, https://substackcdn.com/image/fetch/$s_!YSWq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0cba8de-00cc-4f41-80a3-80168f2d792b_1600x900.png 1272w, https://substackcdn.com/image/fetch/$s_!YSWq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0cba8de-00cc-4f41-80a3-80168f2d792b_1600x900.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" 
height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Discussions about AI progress increasingly hinge on computing capacity &#8211; aka compute &#8211; which is essential in order to develop, train, and deploy AI systems. But public data on the total capacity of AI computing hardware can be fragmented and incomplete.</p><p>To address this, we are releasing a new <a href="https://epoch.ai/data/ai-chip-sales">AI Chip Sales data explorer</a>, estimating and visualizing both the number and capacity of AI accelerators that have been sold or delivered in recent years. 
We leverage data and evidence from earnings reports, company disclosures, analyst coverage, and media reporting to produce estimates of AI chip counts across major vendors: Nvidia, Google, Amazon, AMD, and Huawei, broken down by AI chip model.</p><p>We believe this release provides the most complete publicly available picture to date on the global stock of AI compute.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://epochai.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe for free to receive new posts and support our work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2>Compute</h2><p>We find that cumulative global AI compute capacity has reached the equivalent of more than 15 million Nvidia H100 GPUs, measured using each chip&#8217;s respective peak specifications in 8-bit operations per second.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!axL0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfea30c8-c20f-43c4-9aa8-8861fe918f29_1920x1080.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" 
srcset="https://substackcdn.com/image/fetch/$s_!axL0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfea30c8-c20f-43c4-9aa8-8861fe918f29_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!axL0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfea30c8-c20f-43c4-9aa8-8861fe918f29_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!axL0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfea30c8-c20f-43c4-9aa8-8861fe918f29_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!axL0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfea30c8-c20f-43c4-9aa8-8861fe918f29_1920x1080.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!axL0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfea30c8-c20f-43c4-9aa8-8861fe918f29_1920x1080.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cfea30c8-c20f-43c4-9aa8-8861fe918f29_1920x1080.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:151406,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://epochai.substack.com/i/184534937?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfea30c8-c20f-43c4-9aa8-8861fe918f29_1920x1080.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" 
class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!axL0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfea30c8-c20f-43c4-9aa8-8861fe918f29_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!axL0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfea30c8-c20f-43c4-9aa8-8861fe918f29_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!axL0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfea30c8-c20f-43c4-9aa8-8861fe918f29_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!axL0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfea30c8-c20f-43c4-9aa8-8861fe918f29_1920x1080.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" 
stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Last year also saw major transitions due to new chip generations. Notably, Nvidia&#8217;s new Blackwell generation has largely displaced the H100 and H200, with the new B300 alone now accounting for the majority of AI compute capacity sold by Nvidia.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!E-No!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50748a62-14b6-4aaf-8337-1beec147893c_1200x675.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!E-No!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50748a62-14b6-4aaf-8337-1beec147893c_1200x675.png 424w, https://substackcdn.com/image/fetch/$s_!E-No!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50748a62-14b6-4aaf-8337-1beec147893c_1200x675.png 848w, https://substackcdn.com/image/fetch/$s_!E-No!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50748a62-14b6-4aaf-8337-1beec147893c_1200x675.png 1272w, https://substackcdn.com/image/fetch/$s_!E-No!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50748a62-14b6-4aaf-8337-1beec147893c_1200x675.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!E-No!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50748a62-14b6-4aaf-8337-1beec147893c_1200x675.png" width="1200" height="675" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/50748a62-14b6-4aaf-8337-1beec147893c_1200x675.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:675,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:106441,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://epochai.substack.com/i/184534937?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50748a62-14b6-4aaf-8337-1beec147893c_1200x675.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!E-No!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50748a62-14b6-4aaf-8337-1beec147893c_1200x675.png 424w, https://substackcdn.com/image/fetch/$s_!E-No!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50748a62-14b6-4aaf-8337-1beec147893c_1200x675.png 848w, https://substackcdn.com/image/fetch/$s_!E-No!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50748a62-14b6-4aaf-8337-1beec147893c_1200x675.png 1272w, https://substackcdn.com/image/fetch/$s_!E-No!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50748a62-14b6-4aaf-8337-1beec147893c_1200x675.png 1456w" 
sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>Costs and power</h2><p>Acquiring and deploying these chips requires massive resources.</p><p>Overall, the cost to purchase these chips, even before auxiliary capital costs such as networking and data center construction, has rapidly escalated to tens of billions of dollars per quarter.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" 
href="https://substackcdn.com/image/fetch/$s_!YpTf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a54282b-8c14-4e91-a7aa-e815d7c30a60_1920x1080.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YpTf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a54282b-8c14-4e91-a7aa-e815d7c30a60_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!YpTf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a54282b-8c14-4e91-a7aa-e815d7c30a60_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!YpTf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a54282b-8c14-4e91-a7aa-e815d7c30a60_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!YpTf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a54282b-8c14-4e91-a7aa-e815d7c30a60_1920x1080.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YpTf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a54282b-8c14-4e91-a7aa-e815d7c30a60_1920x1080.png" width="1456" height="819" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3a54282b-8c14-4e91-a7aa-e815d7c30a60_1920x1080.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:104263,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://epochai.substack.com/i/184534937?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a54282b-8c14-4e91-a7aa-e815d7c30a60_1920x1080.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YpTf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a54282b-8c14-4e91-a7aa-e815d7c30a60_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!YpTf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a54282b-8c14-4e91-a7aa-e815d7c30a60_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!YpTf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a54282b-8c14-4e91-a7aa-e815d7c30a60_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!YpTf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a54282b-8c14-4e91-a7aa-e815d7c30a60_1920x1080.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" 
height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>AI chips also demand a lot of electrical power. Even before accounting for the power overheads of servers and data centers, the total quantity of chips we track would draw over 10 GW of power. 
This is around twice the average power consumption of New York City.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3JuH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01090aa4-2fc2-470b-b6e4-35b67a4114e2_1920x1080.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3JuH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01090aa4-2fc2-470b-b6e4-35b67a4114e2_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!3JuH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01090aa4-2fc2-470b-b6e4-35b67a4114e2_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!3JuH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01090aa4-2fc2-470b-b6e4-35b67a4114e2_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!3JuH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01090aa4-2fc2-470b-b6e4-35b67a4114e2_1920x1080.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3JuH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01090aa4-2fc2-470b-b6e4-35b67a4114e2_1920x1080.png" width="1456" height="819" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/01090aa4-2fc2-470b-b6e4-35b67a4114e2_1920x1080.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:79681,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://epochai.substack.com/i/184534937?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01090aa4-2fc2-470b-b6e4-35b67a4114e2_1920x1080.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3JuH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01090aa4-2fc2-470b-b6e4-35b67a4114e2_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!3JuH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01090aa4-2fc2-470b-b6e4-35b67a4114e2_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!3JuH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01090aa4-2fc2-470b-b6e4-35b67a4114e2_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!3JuH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01090aa4-2fc2-470b-b6e4-35b67a4114e2_1920x1080.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" 
height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>We hope the AI Chip Sales data explorer serves as a useful tool to understand trends in global compute. To see our full datasets and interactive visualizations, visit the explorer <a href="https://epoch.ai/data/ai-chip-sales">here</a>!</p>]]></content:encoded></item><item><title><![CDATA[Our most popular Data Insights & Gradient Updates of 2025]]></title><description><![CDATA[Looking back on 2025, we published 36 Data Insights and 37 Gradient Updates. What were our most popular short-form research posts?]]></description><link>https://epochai.substack.com/p/our-most-popular-data-insights-and</link><guid isPermaLink="false">https://epochai.substack.com/p/our-most-popular-data-insights-and</guid><dc:creator><![CDATA[Epoch AI]]></dc:creator><pubDate>Tue, 23 Dec 2025 17:00:20 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!qN6x!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b2f10c-9139-4b78-b272-f5440e05e8e7_1600x900.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In 2025, we ramped up our public communications to keep pace with rapid developments in AI. 
</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qN6x!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b2f10c-9139-4b78-b272-f5440e05e8e7_1600x900.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qN6x!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b2f10c-9139-4b78-b272-f5440e05e8e7_1600x900.png 424w, https://substackcdn.com/image/fetch/$s_!qN6x!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b2f10c-9139-4b78-b272-f5440e05e8e7_1600x900.png 848w, https://substackcdn.com/image/fetch/$s_!qN6x!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b2f10c-9139-4b78-b272-f5440e05e8e7_1600x900.png 1272w, https://substackcdn.com/image/fetch/$s_!qN6x!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b2f10c-9139-4b78-b272-f5440e05e8e7_1600x900.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qN6x!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b2f10c-9139-4b78-b272-f5440e05e8e7_1600x900.png" width="1456" height="819" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/22b2f10c-9139-4b78-b272-f5440e05e8e7_1600x900.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:340883,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://epochai.substack.com/i/182405496?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b2f10c-9139-4b78-b272-f5440e05e8e7_1600x900.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!qN6x!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b2f10c-9139-4b78-b272-f5440e05e8e7_1600x900.png 424w, https://substackcdn.com/image/fetch/$s_!qN6x!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b2f10c-9139-4b78-b272-f5440e05e8e7_1600x900.png 848w, https://substackcdn.com/image/fetch/$s_!qN6x!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b2f10c-9139-4b78-b272-f5440e05e8e7_1600x900.png 1272w, https://substackcdn.com/image/fetch/$s_!qN6x!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b2f10c-9139-4b78-b272-f5440e05e8e7_1600x900.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" 
height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Our <a href="https://epoch.ai/data-insights">Data Insights</a> offer short, visual, self-contained investigations of key trends and metrics in AI.</p><p><a href="https://epoch.ai/gradient-updates">Gradient Updates</a> is our outlet for leading-edge commentary by specific authors (also offered as a <a href="https://epochai.substack.com/s/gradient-updates">newsletter on Substack</a>), without necessarily representing the views of Epoch AI as a whole.</p><p>Below we bring you our top 10 most popular Data Insights and Gradient Updates in 2025.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><div><hr></div><h2>Most popular Data Insights</h2><p></p><h4>LLM inference prices have fallen rapidly but unequally across tasks</h4><p>&#129034; <strong>In short: 
</strong>Between April 2023 and March 2025, we saw the price per token at an equivalent performance level drop by more than 10x.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zBqG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70ab85f9-ff1c-4e95-82d1-2341ad81af43_1600x1080.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zBqG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70ab85f9-ff1c-4e95-82d1-2341ad81af43_1600x1080.png 424w, https://substackcdn.com/image/fetch/$s_!zBqG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70ab85f9-ff1c-4e95-82d1-2341ad81af43_1600x1080.png 848w, https://substackcdn.com/image/fetch/$s_!zBqG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70ab85f9-ff1c-4e95-82d1-2341ad81af43_1600x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!zBqG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70ab85f9-ff1c-4e95-82d1-2341ad81af43_1600x1080.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zBqG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70ab85f9-ff1c-4e95-82d1-2341ad81af43_1600x1080.png" width="1456" height="983" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/70ab85f9-ff1c-4e95-82d1-2341ad81af43_1600x1080.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:983,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zBqG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70ab85f9-ff1c-4e95-82d1-2341ad81af43_1600x1080.png 424w, https://substackcdn.com/image/fetch/$s_!zBqG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70ab85f9-ff1c-4e95-82d1-2341ad81af43_1600x1080.png 848w, https://substackcdn.com/image/fetch/$s_!zBqG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70ab85f9-ff1c-4e95-82d1-2341ad81af43_1600x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!zBqG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70ab85f9-ff1c-4e95-82d1-2341ad81af43_1600x1080.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>&#129034; <strong>Why this matters: </strong>API cost reductions indicate a more competitive market and large gains in efficiency, making AI more affordable to customers. 
If these trends hold up, any AI capability that exists today will soon be available for very cheap!</p><p><a href="https://epoch.ai/data-insights/llm-inference-price-trends">Learn more</a></p><div><hr></div><h4>Frontier AI performance becomes accessible on consumer hardware within a year</h4><p>&#129034; <strong>In short: </strong>The best open models that can run on a consumer GPU lag the frontier of AI by only a year or less, as measured by several metrics of performance including GPQA, MMLU, AA Intelligence, and LMArena.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1vMn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32a9b651-f85b-4f98-a189-f44618c5a224_1600x1027.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1vMn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32a9b651-f85b-4f98-a189-f44618c5a224_1600x1027.png 424w, https://substackcdn.com/image/fetch/$s_!1vMn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32a9b651-f85b-4f98-a189-f44618c5a224_1600x1027.png 848w, https://substackcdn.com/image/fetch/$s_!1vMn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32a9b651-f85b-4f98-a189-f44618c5a224_1600x1027.png 1272w, https://substackcdn.com/image/fetch/$s_!1vMn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32a9b651-f85b-4f98-a189-f44618c5a224_1600x1027.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!1vMn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32a9b651-f85b-4f98-a189-f44618c5a224_1600x1027.png" width="1456" height="935" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/32a9b651-f85b-4f98-a189-f44618c5a224_1600x1027.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:935,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!1vMn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32a9b651-f85b-4f98-a189-f44618c5a224_1600x1027.png 424w, https://substackcdn.com/image/fetch/$s_!1vMn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32a9b651-f85b-4f98-a189-f44618c5a224_1600x1027.png 848w, https://substackcdn.com/image/fetch/$s_!1vMn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32a9b651-f85b-4f98-a189-f44618c5a224_1600x1027.png 1272w, https://substackcdn.com/image/fetch/$s_!1vMn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32a9b651-f85b-4f98-a189-f44618c5a224_1600x1027.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft 
icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>&#129034; <strong>Why this matters: </strong>Open models that can run on a personal computer are accessible to billions of people. The relatively small lead held by frontier models suggests it would be hard to maintain a market advantage with a fixed level of model capabilities. Instead, companies face pressure to keep developing better models or to excel at other services, such as better integrations. This trend has direct implications for AI policy. 
Any capabilities appearing at the frontier are likely to be widely available and unrestricted in less than a year, complicating regulatory options.</p><p><a href="https://epoch.ai/data-insights/consumer-gpu-model-gap">Learn more</a></p><div><hr></div><h4>Most of OpenAI&#8217;s 2024 compute went to experiments</h4><p>&#129034; <strong>In short: </strong>Media reporting indicates that most of OpenAI&#8217;s compute fleet in 2024 wasn&#8217;t used for inference or training runs, but rather to run experiments that enable further development.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!V-1G!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb512d418-6064-4203-8c8a-e5c9f105c25b_1600x1423.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!V-1G!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb512d418-6064-4203-8c8a-e5c9f105c25b_1600x1423.png 424w, https://substackcdn.com/image/fetch/$s_!V-1G!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb512d418-6064-4203-8c8a-e5c9f105c25b_1600x1423.png 848w, https://substackcdn.com/image/fetch/$s_!V-1G!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb512d418-6064-4203-8c8a-e5c9f105c25b_1600x1423.png 1272w, https://substackcdn.com/image/fetch/$s_!V-1G!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb512d418-6064-4203-8c8a-e5c9f105c25b_1600x1423.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!V-1G!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb512d418-6064-4203-8c8a-e5c9f105c25b_1600x1423.png" width="1456" height="1295" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b512d418-6064-4203-8c8a-e5c9f105c25b_1600x1423.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1295,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!V-1G!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb512d418-6064-4203-8c8a-e5c9f105c25b_1600x1423.png 424w, https://substackcdn.com/image/fetch/$s_!V-1G!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb512d418-6064-4203-8c8a-e5c9f105c25b_1600x1423.png 848w, https://substackcdn.com/image/fetch/$s_!V-1G!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb512d418-6064-4203-8c8a-e5c9f105c25b_1600x1423.png 1272w, https://substackcdn.com/image/fetch/$s_!V-1G!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb512d418-6064-4203-8c8a-e5c9f105c25b_1600x1423.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset 
pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>&#129034; <strong>Why this matters: </strong>AI development is capital-intensive, so we should expect leaders in the field to have access to large amounts of compute that they use for experiments. This finding also suggests that most of the costs of AI currently come from experiments, rather than from the training and deployment of models themselves.</p><p><a href="https://epoch.ai/data-insights/openai-compute-spend">Learn more</a></p><div><hr></div><h4>The stock of computing power from NVIDIA chips is doubling every 10 months</h4><p>&#129034; <strong>In short: </strong>The amount of installed AI compute from NVIDIA chips has more than doubled annually since 2020. 
New flagship chips account for most of the existing compute within three years of their release.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!n_9O!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F078ac9c8-be5e-4f05-8fa8-6d51d28339c6_1600x980.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!n_9O!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F078ac9c8-be5e-4f05-8fa8-6d51d28339c6_1600x980.png 424w, https://substackcdn.com/image/fetch/$s_!n_9O!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F078ac9c8-be5e-4f05-8fa8-6d51d28339c6_1600x980.png 848w, https://substackcdn.com/image/fetch/$s_!n_9O!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F078ac9c8-be5e-4f05-8fa8-6d51d28339c6_1600x980.png 1272w, https://substackcdn.com/image/fetch/$s_!n_9O!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F078ac9c8-be5e-4f05-8fa8-6d51d28339c6_1600x980.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!n_9O!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F078ac9c8-be5e-4f05-8fa8-6d51d28339c6_1600x980.png" width="1456" height="892" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/078ac9c8-be5e-4f05-8fa8-6d51d28339c6_1600x980.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:892,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!n_9O!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F078ac9c8-be5e-4f05-8fa8-6d51d28339c6_1600x980.png 424w, https://substackcdn.com/image/fetch/$s_!n_9O!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F078ac9c8-be5e-4f05-8fa8-6d51d28339c6_1600x980.png 848w, https://substackcdn.com/image/fetch/$s_!n_9O!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F078ac9c8-be5e-4f05-8fa8-6d51d28339c6_1600x980.png 1272w, https://substackcdn.com/image/fetch/$s_!n_9O!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F078ac9c8-be5e-4f05-8fa8-6d51d28339c6_1600x980.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path 
d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>&#129034; <strong>Why this matters: </strong>Compute is a fundamental input for AI development and deployment. 
Exponentially more computational resources are needed to maintain the current pace of AI development, which has driven sustained demand for chips from NVIDIA and other manufacturers.</p><p><a href="https://epoch.ai/data-insights/nvidia-chip-production">Learn more</a></p><div><hr></div><h4>GPT-5 and GPT-4 were both major leaps in benchmarks from the previous generation</h4><p>&#129034; <strong>In short: </strong>Both GPT-4 and GPT-5 greatly exceeded the performance of their direct predecessors.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vKOj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a852024-65be-4ca1-8548-c2d92696cc35_1600x1170.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vKOj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a852024-65be-4ca1-8548-c2d92696cc35_1600x1170.png 424w, https://substackcdn.com/image/fetch/$s_!vKOj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a852024-65be-4ca1-8548-c2d92696cc35_1600x1170.png 848w, https://substackcdn.com/image/fetch/$s_!vKOj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a852024-65be-4ca1-8548-c2d92696cc35_1600x1170.png 1272w, https://substackcdn.com/image/fetch/$s_!vKOj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a852024-65be-4ca1-8548-c2d92696cc35_1600x1170.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!vKOj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a852024-65be-4ca1-8548-c2d92696cc35_1600x1170.png" width="1456" height="1065" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5a852024-65be-4ca1-8548-c2d92696cc35_1600x1170.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1065,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vKOj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a852024-65be-4ca1-8548-c2d92696cc35_1600x1170.png 424w, https://substackcdn.com/image/fetch/$s_!vKOj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a852024-65be-4ca1-8548-c2d92696cc35_1600x1170.png 848w, https://substackcdn.com/image/fetch/$s_!vKOj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a852024-65be-4ca1-8548-c2d92696cc35_1600x1170.png 1272w, https://substackcdn.com/image/fetch/$s_!vKOj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a852024-65be-4ca1-8548-c2d92696cc35_1600x1170.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset 
pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>&#129034; <strong>Why this matters: </strong>When GPT-5 was released, some were disappointed that its performance improvements over existing models seemed only marginal. But this is better explained by the more frequent cadence of releases over the last two years than by a slowdown in capabilities.</p><p><a href="https://epoch.ai/data-insights/gpt-capabilities-progress">Learn more</a></p><div><hr></div><h2>Most popular Gradient Updates</h2><h4>How much energy does ChatGPT use?</h4><p>&#129034; <strong>In short: </strong>Josh estimated the average energy cost of a GPT-4o query, finding that it was less than the energy needed to run a lightbulb for five minutes. 
His estimate was later <a href="https://blog.samaltman.com/the-gentle-singularity#:~:text=People%20are%20often%20curious%20about%20how%20much%20energy%20a%20ChatGPT%20query%20uses%3B%20the%20average%20query%20uses%20about%200.34%20watt%2Dhours%2C%20about%20what%20an%20oven%20would%20use%20in%20a%20little%20over%20one%20second%2C%20or%20a%20high%2Defficiency%20lightbulb%20would%20use%20in%20a%20couple%20of%20minutes.">corroborated by Sam Altman</a>, and it is also similar to the <a href="https://cloud.google.com/blog/products/infrastructure/measuring-the-environmental-impact-of-ai-inference/">energy cost per prompt for Gemini</a> reported by Google.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1xAT!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb378a147-b971-4fd6-bd5e-d349234cf8ff_1600x1120.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1xAT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb378a147-b971-4fd6-bd5e-d349234cf8ff_1600x1120.png 424w, https://substackcdn.com/image/fetch/$s_!1xAT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb378a147-b971-4fd6-bd5e-d349234cf8ff_1600x1120.png 848w, https://substackcdn.com/image/fetch/$s_!1xAT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb378a147-b971-4fd6-bd5e-d349234cf8ff_1600x1120.png 1272w, https://substackcdn.com/image/fetch/$s_!1xAT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb378a147-b971-4fd6-bd5e-d349234cf8ff_1600x1120.png 
1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1xAT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb378a147-b971-4fd6-bd5e-d349234cf8ff_1600x1120.png" width="1456" height="1019" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b378a147-b971-4fd6-bd5e-d349234cf8ff_1600x1120.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1019,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!1xAT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb378a147-b971-4fd6-bd5e-d349234cf8ff_1600x1120.png 424w, https://substackcdn.com/image/fetch/$s_!1xAT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb378a147-b971-4fd6-bd5e-d349234cf8ff_1600x1120.png 848w, https://substackcdn.com/image/fetch/$s_!1xAT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb378a147-b971-4fd6-bd5e-d349234cf8ff_1600x1120.png 1272w, https://substackcdn.com/image/fetch/$s_!1xAT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb378a147-b971-4fd6-bd5e-d349234cf8ff_1600x1120.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" 
class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>&#129034; <strong>Why this matters: </strong>Many are concerned about AI&#8217;s energy use. This piece helped quantify the costs and put these concerns in context, showing that AI energy use at the time was not very significant compared to other household activities. 
AI energy use has continued to grow exponentially since then, though, and it might become a larger issue in the future.</p><p><a href="https://epoch.ai/gradient-updates/how-much-energy-does-chatgpt-use">Read the post</a></p><div><hr></div><h4>How has DeepSeek improved the Transformer architecture?</h4><p>&#129034; <strong>In short: </strong>This post covered three techniques introduced by the team behind the <a href="https://arxiv.org/html/2412.19437v1">DeepSeek v3 paper</a> that allowed them to release the best open-source pretrained model at the time, while using 10&#215; less compute than the next-best open model, Llama 3. The techniques are multi-head latent attention (MLA), innovations on the mixture-of-experts (MoE) architecture, and multi-token prediction.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!j_jt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff017e0a8-7de8-4903-8b84-ff4eacdefa35_1600x1285.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!j_jt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff017e0a8-7de8-4903-8b84-ff4eacdefa35_1600x1285.png 424w, https://substackcdn.com/image/fetch/$s_!j_jt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff017e0a8-7de8-4903-8b84-ff4eacdefa35_1600x1285.png 848w, https://substackcdn.com/image/fetch/$s_!j_jt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff017e0a8-7de8-4903-8b84-ff4eacdefa35_1600x1285.png 1272w, 
https://substackcdn.com/image/fetch/$s_!j_jt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff017e0a8-7de8-4903-8b84-ff4eacdefa35_1600x1285.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!j_jt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff017e0a8-7de8-4903-8b84-ff4eacdefa35_1600x1285.png" width="1456" height="1169" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f017e0a8-7de8-4903-8b84-ff4eacdefa35_1600x1285.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1169,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!j_jt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff017e0a8-7de8-4903-8b84-ff4eacdefa35_1600x1285.png 424w, https://substackcdn.com/image/fetch/$s_!j_jt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff017e0a8-7de8-4903-8b84-ff4eacdefa35_1600x1285.png 848w, https://substackcdn.com/image/fetch/$s_!j_jt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff017e0a8-7de8-4903-8b84-ff4eacdefa35_1600x1285.png 1272w, 
https://substackcdn.com/image/fetch/$s_!j_jt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff017e0a8-7de8-4903-8b84-ff4eacdefa35_1600x1285.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>&#129034; <strong>Why this matters: </strong>Three days after we published this post, DeepSeek attracted far wider attention by releasing a reasoning model, R1. This model matched the performance of OpenAI&#8217;s o1 at what we presume was a fraction of the development cost. 
The innovations they introduced illustrate the broader pattern of improving <a href="https://epoch.ai/blog/algorithmic-progress-in-language-models">training compute efficiency</a>, whereby models become 3&#215; cheaper to develop each year because of new training techniques and data improvements.</p><p><a href="https://epoch.ai/gradient-updates/how-has-deepseek-improved-the-transformer-architecture">Read the post</a></p><div><hr></div><h4>How far can reasoning models scale?</h4><p>&#129034; <strong>In short: </strong>Josh discussed the growth in compute for RL reasoning training. Labs like OpenAI and Anthropic claimed in early 2025 that their rate of RL scaling couldn&#8217;t be sustained for more than one to two years, since it would quickly run into the limits of their compute infrastructure.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!130P!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd571a597-07d2-4b97-879e-ebfdd232a0c1_1600x1170.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!130P!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd571a597-07d2-4b97-879e-ebfdd232a0c1_1600x1170.png 424w, https://substackcdn.com/image/fetch/$s_!130P!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd571a597-07d2-4b97-879e-ebfdd232a0c1_1600x1170.png 848w, https://substackcdn.com/image/fetch/$s_!130P!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd571a597-07d2-4b97-879e-ebfdd232a0c1_1600x1170.png 1272w, 
https://substackcdn.com/image/fetch/$s_!130P!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd571a597-07d2-4b97-879e-ebfdd232a0c1_1600x1170.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!130P!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd571a597-07d2-4b97-879e-ebfdd232a0c1_1600x1170.png" width="1456" height="1065" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d571a597-07d2-4b97-879e-ebfdd232a0c1_1600x1170.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1065,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!130P!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd571a597-07d2-4b97-879e-ebfdd232a0c1_1600x1170.png 424w, https://substackcdn.com/image/fetch/$s_!130P!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd571a597-07d2-4b97-879e-ebfdd232a0c1_1600x1170.png 848w, https://substackcdn.com/image/fetch/$s_!130P!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd571a597-07d2-4b97-879e-ebfdd232a0c1_1600x1170.png 1272w, 
https://substackcdn.com/image/fetch/$s_!130P!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd571a597-07d2-4b97-879e-ebfdd232a0c1_1600x1170.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>&#129034; <strong>Why this matters: </strong>Reasoning has become a highly important axis for scaling model training, leading to excellent results in math, software engineering, and elsewhere. 
The limits to this scaling suggest that the exceptional growth in capabilities during 2024 and 2025 could soon slow down.</p><p><a href="https://epoch.ai/gradient-updates/how-far-can-reasoning-models-scale">Read the post</a></p><div><hr></div><h4>How big could an &#8220;AI Manhattan Project&#8221; get?</h4><p>&#129034; <strong>In short: </strong>Arden and Anson estimated how large a national US AI project could get, comparing it to the relative spending during the Manhattan Project and the Apollo program. They concluded that it could result in a training run 10,000&#215; larger than GPT-4.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FaRE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1efe08bd-0036-4173-a9b7-25381988d638_1600x1182.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FaRE!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1efe08bd-0036-4173-a9b7-25381988d638_1600x1182.png 424w, https://substackcdn.com/image/fetch/$s_!FaRE!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1efe08bd-0036-4173-a9b7-25381988d638_1600x1182.png 848w, https://substackcdn.com/image/fetch/$s_!FaRE!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1efe08bd-0036-4173-a9b7-25381988d638_1600x1182.png 1272w, https://substackcdn.com/image/fetch/$s_!FaRE!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1efe08bd-0036-4173-a9b7-25381988d638_1600x1182.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!FaRE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1efe08bd-0036-4173-a9b7-25381988d638_1600x1182.png" width="1456" height="1076" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1efe08bd-0036-4173-a9b7-25381988d638_1600x1182.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1076,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FaRE!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1efe08bd-0036-4173-a9b7-25381988d638_1600x1182.png 424w, https://substackcdn.com/image/fetch/$s_!FaRE!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1efe08bd-0036-4173-a9b7-25381988d638_1600x1182.png 848w, https://substackcdn.com/image/fetch/$s_!FaRE!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1efe08bd-0036-4173-a9b7-25381988d638_1600x1182.png 1272w, https://substackcdn.com/image/fetch/$s_!FaRE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1efe08bd-0036-4173-a9b7-25381988d638_1600x1182.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset 
pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>&#129034; <strong>Why this matters: </strong>In November 2024, the US-China Economic and Security Review Commission&#8217;s <a href="https://www.uscc.gov/sites/default/files/2024-11/2024_Annual_Report_to_Congress.pdf">top recommendation to Congress</a> was to &#8220;establish and fund a Manhattan Project-like program dedicated to racing to and acquiring an Artificial General Intelligence capability.&#8221; This exercise puts into context the potential size of such a national project.</p><p><a href="https://epoch.ai/gradient-updates/how-big-could-an-ai-manhattan-project-get">Read the post</a></p><div><hr></div><h4>Most AI value will come from broad automation, not from R&amp;D</h4><p>&#129034; <strong>In short: </strong>Ege and Matthew argued that most of the value AI 
will create will be mediated by its ability to automate many tasks across the economy, not its ability to speed up R&amp;D. Relatedly, R&amp;D activities have arguably contributed only a modest amount to productivity growth in the 1988-2020 period.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8DZp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4486c535-f715-48e8-9811-1085dcf32e63_1600x1252.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8DZp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4486c535-f715-48e8-9811-1085dcf32e63_1600x1252.png 424w, https://substackcdn.com/image/fetch/$s_!8DZp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4486c535-f715-48e8-9811-1085dcf32e63_1600x1252.png 848w, https://substackcdn.com/image/fetch/$s_!8DZp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4486c535-f715-48e8-9811-1085dcf32e63_1600x1252.png 1272w, https://substackcdn.com/image/fetch/$s_!8DZp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4486c535-f715-48e8-9811-1085dcf32e63_1600x1252.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8DZp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4486c535-f715-48e8-9811-1085dcf32e63_1600x1252.png" width="1456" height="1139" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4486c535-f715-48e8-9811-1085dcf32e63_1600x1252.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1139,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8DZp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4486c535-f715-48e8-9811-1085dcf32e63_1600x1252.png 424w, https://substackcdn.com/image/fetch/$s_!8DZp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4486c535-f715-48e8-9811-1085dcf32e63_1600x1252.png 848w, https://substackcdn.com/image/fetch/$s_!8DZp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4486c535-f715-48e8-9811-1085dcf32e63_1600x1252.png 1272w, https://substackcdn.com/image/fetch/$s_!8DZp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4486c535-f715-48e8-9811-1085dcf32e63_1600x1252.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>&#129034; <strong>Why this matters: </strong>Many stories about explosive growth from AI, such as those put forward by <a href="https://blog.samaltman.com/three-observations">Sam Altman</a>, <a href="https://www.thetimes.com/life-style/celebrity/article/demis-hassabis-ai-could-cure-all-diseases-in-10-years-09pcqh7cb">Demis Hassabis</a> and <a href="https://www.darioamodei.com/essay/machines-of-loving-grace">Dario Amodei</a>, posit automation of R&amp;D as a key lever of growth. This suggests that the impact of AI might be rapid, salient and localized. AI might suddenly clear the last hurdles to R&amp;D automation and quickly make great advances within AI companies. 
But AI might instead primarily affect society through a diffuse and gradual process that lasts several years or decades as different orgs adopt AI to improve efficiency.</p><p><a href="https://epoch.ai/gradient-updates/most-ai-value-will-come-from-broad-automation-not-from-r-d">Read the post</a></p><div><hr></div><p>We started our Data Insights and Gradient Updates programs so we could offer timely input related to ongoing developments.</p><p>The rapid uptake has been gratifying. On our website alone, these new formats amassed nearly half a million views and a total engagement time of nearly 5,000 hours while related posts on Twitter had over 6 million impressions with over a quarter million engagements. They have been used widely to inform the discourse on AI.</p><p>If you found any of our outputs helpful this past year, please take our <a href="https://forms.gle/xb3QyB5jfbT4JQAT9">2025 Epoch AI Impact Survey</a>! Next year we will bring the same passion to improving how we pursue our mission of informing the world about AI trends.</p><p>Happy holidays!</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://epochai.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! 
Subscribe to receive the latest from Epoch AI.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>We went by website page views to determine the ranking. Arguably, social media impressions would have been a better metric, since we see more engagement via these channels. We chose to go for website views for convenience and as a proxy for settled value and discoverability.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Two new benchmarks added to our suite]]></title><description><![CDATA[We've added SimpleQA Verified and a new benchmark of chess puzzles!]]></description><link>https://epochai.substack.com/p/two-new-benchmarks-added-to-our-suite</link><guid isPermaLink="false">https://epochai.substack.com/p/two-new-benchmarks-added-to-our-suite</guid><dc:creator><![CDATA[Greg Burnham]]></dc:creator><pubDate>Fri, 12 Dec 2025 16:46:30 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!FnrO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F97273179-a261-4edb-a82f-82b607155282_1024x1280.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>We will use these easy-to-run benchmarks to get quick estimates of our Epoch Capabilities Index (ECI) for newly released models.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" 
href="https://substackcdn.com/image/fetch/$s_!FnrO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F97273179-a261-4edb-a82f-82b607155282_1024x1280.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FnrO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F97273179-a261-4edb-a82f-82b607155282_1024x1280.jpeg 424w, https://substackcdn.com/image/fetch/$s_!FnrO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F97273179-a261-4edb-a82f-82b607155282_1024x1280.jpeg 848w, https://substackcdn.com/image/fetch/$s_!FnrO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F97273179-a261-4edb-a82f-82b607155282_1024x1280.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!FnrO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F97273179-a261-4edb-a82f-82b607155282_1024x1280.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FnrO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F97273179-a261-4edb-a82f-82b607155282_1024x1280.jpeg" width="465" height="581.25" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/97273179-a261-4edb-a82f-82b607155282_1024x1280.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1280,&quot;width&quot;:1024,&quot;resizeWidth&quot;:465,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!FnrO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F97273179-a261-4edb-a82f-82b607155282_1024x1280.jpeg 424w, https://substackcdn.com/image/fetch/$s_!FnrO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F97273179-a261-4edb-a82f-82b607155282_1024x1280.jpeg 848w, https://substackcdn.com/image/fetch/$s_!FnrO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F97273179-a261-4edb-a82f-82b607155282_1024x1280.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!FnrO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F97273179-a261-4edb-a82f-82b607155282_1024x1280.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>SimpleQA Verified</h3><p>SimpleQA Verified consists of 1,000 fact-seeking questions, developed by Google and built on prior work from OpenAI. Here&#8217;s a question that all models we benchmarked got wrong.</p><blockquote><p>&#8220;<em>What was the name of the man who served as Reeve of North Red Deer, Alberta, between 1924 and 1925?</em>&#8221;</p></blockquote><p>Google maintains a nice leaderboard, but with ours we can expand the set of models covered. For instance, we find that Qwen3-Max-Instruct held the high score (67%) until Gemini 3 Pro recently surpassed it (73%). 
Also notable: GPT-5/5.1 didn&#8217;t improve on o3&#8217;s score.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iLsv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041bf3f2-0191-4d6a-8d10-8945851a4ecb_1024x1280.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iLsv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041bf3f2-0191-4d6a-8d10-8945851a4ecb_1024x1280.jpeg 424w, https://substackcdn.com/image/fetch/$s_!iLsv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041bf3f2-0191-4d6a-8d10-8945851a4ecb_1024x1280.jpeg 848w, https://substackcdn.com/image/fetch/$s_!iLsv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041bf3f2-0191-4d6a-8d10-8945851a4ecb_1024x1280.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!iLsv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041bf3f2-0191-4d6a-8d10-8945851a4ecb_1024x1280.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iLsv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041bf3f2-0191-4d6a-8d10-8945851a4ecb_1024x1280.jpeg" width="465" height="581.25" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/041bf3f2-0191-4d6a-8d10-8945851a4ecb_1024x1280.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1280,&quot;width&quot;:1024,&quot;resizeWidth&quot;:465,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!iLsv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041bf3f2-0191-4d6a-8d10-8945851a4ecb_1024x1280.jpeg 424w, https://substackcdn.com/image/fetch/$s_!iLsv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041bf3f2-0191-4d6a-8d10-8945851a4ecb_1024x1280.jpeg 848w, https://substackcdn.com/image/fetch/$s_!iLsv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041bf3f2-0191-4d6a-8d10-8945851a4ecb_1024x1280.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!iLsv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F041bf3f2-0191-4d6a-8d10-8945851a4ecb_1024x1280.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Qwen3-Max-Instruct&#8217;s score may be due in part to data contamination. 
We consider this possible because Qwen3-Max-Instruct ranks much lower on AA-Omniscience, a closed benchmark broadly similar to SimpleQA Verified.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NuxG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5991c-5b6b-4fd0-b485-e22912432b64_1024x1280.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NuxG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5991c-5b6b-4fd0-b485-e22912432b64_1024x1280.jpeg 424w, https://substackcdn.com/image/fetch/$s_!NuxG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5991c-5b6b-4fd0-b485-e22912432b64_1024x1280.jpeg 848w, https://substackcdn.com/image/fetch/$s_!NuxG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5991c-5b6b-4fd0-b485-e22912432b64_1024x1280.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!NuxG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5991c-5b6b-4fd0-b485-e22912432b64_1024x1280.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NuxG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5991c-5b6b-4fd0-b485-e22912432b64_1024x1280.jpeg" width="463" height="578.75" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/45a5991c-5b6b-4fd0-b485-e22912432b64_1024x1280.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1280,&quot;width&quot;:1024,&quot;resizeWidth&quot;:463,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!NuxG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5991c-5b6b-4fd0-b485-e22912432b64_1024x1280.jpeg 424w, https://substackcdn.com/image/fetch/$s_!NuxG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5991c-5b6b-4fd0-b485-e22912432b64_1024x1280.jpeg 848w, https://substackcdn.com/image/fetch/$s_!NuxG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5991c-5b6b-4fd0-b485-e22912432b64_1024x1280.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!NuxG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5991c-5b6b-4fd0-b485-e22912432b64_1024x1280.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Chess Puzzles</h3><p>The Chess Puzzles benchmark consists of 100 novel chess puzzles we generated programmatically using a chess engine. We hope it will function as a &#8220;lite&#8221; version of more involved gaming benchmarks, measuring aspects of spatial reasoning and planning.</p><p>All puzzles have a single best next move which models must find. Our subjective assessment is that the puzzles range in difficulty from somewhat straightforward to somewhat difficult for a strong amateur. 
Here is one example of each: easier left, harder right; white to move.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!veKu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0412b2f1-2d76-4ff2-9e96-d9dd390bec7b_1838x914.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!veKu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0412b2f1-2d76-4ff2-9e96-d9dd390bec7b_1838x914.jpeg 424w, https://substackcdn.com/image/fetch/$s_!veKu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0412b2f1-2d76-4ff2-9e96-d9dd390bec7b_1838x914.jpeg 848w, https://substackcdn.com/image/fetch/$s_!veKu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0412b2f1-2d76-4ff2-9e96-d9dd390bec7b_1838x914.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!veKu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0412b2f1-2d76-4ff2-9e96-d9dd390bec7b_1838x914.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!veKu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0412b2f1-2d76-4ff2-9e96-d9dd390bec7b_1838x914.jpeg" width="1456" height="724" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0412b2f1-2d76-4ff2-9e96-d9dd390bec7b_1838x914.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:724,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!veKu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0412b2f1-2d76-4ff2-9e96-d9dd390bec7b_1838x914.jpeg 424w, https://substackcdn.com/image/fetch/$s_!veKu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0412b2f1-2d76-4ff2-9e96-d9dd390bec7b_1838x914.jpeg 848w, https://substackcdn.com/image/fetch/$s_!veKu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0412b2f1-2d76-4ff2-9e96-d9dd390bec7b_1838x914.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!veKu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0412b2f1-2d76-4ff2-9e96-d9dd390bec7b_1838x914.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Models are far from saturating the benchmark: the current high score is GPT-5 (high) with 37%, with other frontier models scoring similarly.</p><p>Check out our <a href="https://epoch.ai/benchmarks/search">benchmarking hub</a> for all of this and more!</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://epochai.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! 
Subscribe to receive the latest from Epoch AI.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Epoch AI webinar: Inside the Frontier Data Centers hub]]></title><description><![CDATA[Showcasing the new Epoch AI Frontier Data Centers hub.]]></description><link>https://epochai.substack.com/p/epoch-ai-webinar-inside-the-frontier</link><guid isPermaLink="false">https://epochai.substack.com/p/epoch-ai-webinar-inside-the-frontier</guid><dc:creator><![CDATA[Epoch AI]]></dc:creator><pubDate>Wed, 26 Nov 2025 05:45:31 GMT</pubDate><enclosure url="https://substackcdn.com/image/youtube/w_728,c_limit/v-1X0nEcxH8" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Recorded on November 20, 2025, in this webinar we showcase the new Epoch AI Frontier Data Centers hub. <br><br>We walk through how we track the construction, power, compute, and cost of the largest AI data centers using satellite imagery and open data. <br><br>Explore the full Frontier Data Centers hub <a href="https://epoch.ai/data/data-centers">here</a>.<br>View construction timelines in the satellite explorer (mobile friendly) <a href="https://epoch.ai/data/data-centers/satellite-explorer">here</a>. 
</p><p>Watch the webinar here:</p><div id="youtube2-v-1X0nEcxH8" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;v-1X0nEcxH8&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/v-1X0nEcxH8?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><h1><strong>Transcript</strong></h1><h4><strong>Introduction [00:00:00]</strong></h4><p><strong>Maria</strong></p><p>All right. Thank you everyone for joining this webinar. We&#8217;re going to be speaking about the Frontier Data Centers hub that we released recently at Epoch. My name is Maria and I want to introduce our speakers today. Yafah Edelman is head of data at Epoch. She led this project, and she&#8217;ll share a little bit about what motivated it and her plans for it. We&#8217;ll also hear from Ben Cottier, who&#8217;s a researcher at Epoch. He will be going over some insights from the research and doing a walkthrough of the hub and our methodology. I&#8217;ll hand it over to Yafah. You can get started.</p><p><strong>Yafah</strong></p><p>Thanks, everyone, for coming. We&#8217;ll be talking a bit about our Frontier Data Centers hub. I&#8217;m excited to share this with you. I think this is a very exciting product that Epoch has been working on, and we&#8217;re very happy to be able to explain everything about it to the public, including all of our methodology, because we think that&#8217;s important for people to understand.</p><p>Our Frontier Data Centers hub is tracking these frontier data centers. These are the very largest data centers that are being constructed right now. Many of them cost $10 billion or more. 
We think that these are incredibly important to track, because these infrastructure projects are some of the largest in history, with the largest similar in size to the Manhattan Project or other very large outlays of R&amp;D expenditure. We expect the largest one we&#8217;re tracking right now to cost $100 billion when it comes online in 2028.</p><p>Tracking these data centers lets us understand the distribution of compute among companies and nations, as well as the trajectory of AI, how much investment there is in it, the extent to which the investment is increasing or decreasing, and the extent to which companies are able to provide the compute that they plan on and continue scaling at the historic rates.</p><p>Our database is entirely free and open, as I mentioned earlier. There are a few data center trackers out there. Ours being free and open means that the public can actually understand it, and hopefully all of you can understand a bit about how we do this. We&#8217;re committed to transparency. We also share our methodology in even more detail on our website.</p><p>In addition to this commitment to transparency, we also plan to expand our coverage beyond the US to China, the Middle East, and Europe, something that we believe many other databases aren&#8217;t doing. This will allow people to get a sense of the global politics and status of AI, rather than just understanding what&#8217;s happening in the US. For more details on this, I&#8217;ll hand the presentation over to my colleague Ben, who will tell you a bit more about our results and methodology.</p><h4><strong>Satellite explorer demo [00:02:59]</strong></h4><p><strong>Ben</strong></p><p>Thank you. So I&#8217;m going to jump into the data hub on our website and show you some of that. I&#8217;m going to follow this link. Feel free to hop on to our website yourself and play around. I&#8217;m just going to switch what I&#8217;m presenting here.</p><p>All right. 
So here we are on the website. First thing I&#8217;m going to do, which you can see at the top here, is go to our satellite explorer. So I&#8217;ll follow this link. You can see this opens up a map where there are some points marked all over the map of the US. I can click on them and I can see different data centers like OpenAI Stargate, Abilene. These data centers are also listed on the left-hand side. You can see some key details about them, like the current power capacity, who owns the data center, who&#8217;s likely using the data centers, and so on.</p><p>I can navigate like Google Maps here and zoom in and out. I&#8217;m going to pick one example here. Let&#8217;s say I&#8217;m interested in this Google Omaha data center. I&#8217;m going to click here. This opens up a satellite view where we have a high-resolution satellite image of this data center. You can see that this image is annotated. So we have buildings which are numbered 1, 2, 3 and outlined in green. We&#8217;ve got grid power annotated here. These are substations. We also have backup power in purple. I can zoom in and see there&#8217;s a row of backup generators here next to the building.</p><p>Finally, we have cooling equipment which is outlined in blue. If I zoom in there, I can see a row of cooling units with some large fans on them. I&#8217;ll come back to that in a moment. But I want to explore this a bit more. So let&#8217;s look at the timeline at the top here. This gives you a timeline of satellite imagery so I can go back in time. Here we have another image. You can see that building three is a little less constructed than it was in the other image. I can go further back, all the way to when there wasn&#8217;t any building there. So this gives you a little insight into how we figure out the timeline of data center construction and when different parts of it go online.</p><p>You can also see this quantitatively. If I open up the graph view on the left here. 
This shows how much of this data center&#8217;s power capacity is operational at each point in time. We also show the compute capacity and the capital cost of the construction and GPUs and everything involved in this data center project.</p><h4><strong>Methodology: estimating power capacity [00:06:30]</strong></h4><p><strong>Ben</strong></p><p>So how do we do this? How do we estimate things like power capacity? Well, in this case we rely a lot on the cooling equipment. Coming back to that, if I zoom in here I have a nice clear image of these cooling towers. We developed a model where we can take some visual feature, like the diameter of these fans, and actually predict quite accurately how much cooling capacity there is, that is, how much heat is being taken out of the building by these cooling towers.</p><p>In this case these fans are roughly five meters in diameter. We can plug that into a formula that we&#8217;ve developed based on empirical data about cooling towers. We can estimate that each one of these fan units is cooling about eight megawatts worth of heat. For comparison, an average US household draws about one kilowatt. So eight megawatts is enough power for thousands of US homes, just in this one cooling tower.</p><p>That&#8217;s how we get the capacity of a cooling tower, and then we can just add up the capacity across all the fans here to estimate a total cooling capacity for this building. Then we need to make a couple of adjustments. The first is for redundancy: for the sake of reliability, they don&#8217;t necessarily run all of these cooling towers at once. They keep a couple spare. The second is that there&#8217;s some overhead involved in cooling the GPUs or TPUs inside the building. After accounting for that, we can get a pretty good estimate of the amount of power in this building which is powering the GPUs or TPUs.</p><p>In this case, we actually have a ground truth for this. We found a permit. This is one of our key sources which is shown here on the left in the notes. 
In this permit it actually stated the power capacity of these cooling towers and how many cooling towers there were. This capacity that they stated actually lined up almost perfectly with what we estimated using our model. It was within 2%.</p><p>We&#8217;re not always quite this accurate, but we&#8217;re fairly confident that our estimates are within plus or minus 50% of the actual value at any point in time.</p><h4><strong>Graph view and key insights [00:09:25]</strong></h4><p><strong>Ben</strong></p><p>That&#8217;s a look at the satellite explorer and a bit of our methodology. At this point, I might want to not just look at one data center but compare multiple data centers on one graph. So that&#8217;s where the main page comes in. I&#8217;m going to head back there. If I scroll down we have this graph view which is showing the timeline of power capacity that&#8217;s operational for a few different data centers. I can play around with this. I can select Google Omaha, which we were just looking at on the satellite view, and other data centers like Microsoft Fairwater. I can also change the y-axis to look at not just power but compute capacity and cost. So that&#8217;s the graph view. We use that to do some of our analysis.</p><p>That&#8217;s a quick demo of the data hub. Now I&#8217;m going to head back to the slides and talk about some of the key insights that we&#8217;ve gained from this research.</p><p>The first thing I want to give you an intuition for is just the sheer scale of the data centers that are getting built right now. The largest one that we found is Meta Hyperion in Louisiana, and this is due to be operational in 2028, but it&#8217;s already under construction and we&#8217;ve seen it on the satellite images. The total amount of land for this data center campus would be enough to cover a large fraction of Manhattan, about one fifth. 
It&#8217;s several times bigger than Central Park, which you can see in green in this satellite image here.</p><p>We also found that several gigawatt-scale data centers are due to come online next year, in 2026, and several of these are on track to be built in two years or less, going from starting construction to reaching one gigawatt operational. Particularly notable here is xAI Colossus 2, which is expected to go from construction to one gigawatt in just one year. So xAI is particularly fast-paced here.</p><p>Then looking further out at future plans, we expect this growth to continue, going from one gigawatt in 2026 up to around three gigawatts towards the end of 2027 and 2028. You can see here Microsoft Fairwater, Wisconsin. That one is reaching around three gigawatts in 2027. We estimate that this would give it a capacity of around 5 million H100-equivalent GPUs in terms of computing power.</p><p>Those are a few key insights. Now I&#8217;m going to hand it back to Yafah to talk about our future plans a little bit.</p><h4><strong>Future plans [00:12:57]</strong></h4><p><strong>Yafah</strong></p><p>We have a lot to do. I&#8217;m excited to talk a little bit about what we&#8217;re going to be adding and what additional things the hub is going to cover in the future. One of the most exciting first things we&#8217;re going to be doing is increasing coverage. Currently, we cover about 15% of AI compute by our estimates. We want to get that up quite a lot, as high as 90%, by covering more of the largest data centers in the US and also expanding around the world. This means covering data centers in the US and China, in the Middle East and in Europe. Here we have a picture of a data center in the UAE. We&#8217;re working on this and should hopefully have some updates for everyone pretty soon. 
I&#8217;m really optimistic about how we&#8217;re going to expand our coverage.</p><p>Another major update that&#8217;s coming is we&#8217;re working on tracking more metrics about data centers. An example here is the number of construction workers, which is often reported in permits. We also might be able to track similar numbers based on the number of cars in parking lots, which we have in our satellite photographs. We are also hoping to track which data centers are networked together. This will let us understand the full scope of the potential clusters that might exist beyond just individual campuses.</p><p>These steps and more are coming in the next few months. There are other things we&#8217;ll also be excited to share with everyone beyond these, but for now, I&#8217;ll pass it over to Maria to set up our Q&amp;A.</p><h4><strong>Q&amp;A session [00:14:44]</strong></h4><p><strong>Maria</strong></p><p>Sure. So now we&#8217;ll be moving on to the Q&amp;A. Thank you, Ben and Yafah. All right, so we have this one by Marco DiDonato. Speaking about power generation, it seems that many large data centers you are mapping still rely on the local grid to power themselves. Others like Colossus and Stargate 1 also rely on onsite gas turbines. What trend can we expect to see in the coming years? Can we realistically think that grid operators will be able to keep up with multi-gigawatt scale projects? Or will hyperscalers focus on local power generation going forward? That&#8217;s a lot. Maybe let&#8217;s start with what trend can we expect to see in the coming years.</p><p><strong>Yafah</strong></p><p>What we&#8217;ve seen a lot of is data centers providing onsite power as a bridge before they get grid power. I believe this happened at Stargate Abilene. This also happened with Colossus. I expect similar techniques to be used. My current impression is that all data centers plan to eventually add grid power. 
I&#8217;ve been seeing grids keep up with adding capacity better than I think most people are under the impression they do. That&#8217;s my answer there.</p><p><strong>Maria</strong></p><p>Now a question from Anton Trubnikov. Maybe I missed this, but how are you identifying sites in the first place? Many data centers and campuses are not publicly announced.</p><p><strong>Ben</strong></p><p>I can take that. We identify sites through multiple methods. One thing is we just look at news or rumors about data centers. There&#8217;s often local news, which will talk about some new construction of a data center which may not be widely publicized, but you can find it in local news. But there are also websites that are quite good at tracking lots of news about data centers, which we follow, and then we go and investigate further. So we might have just a city location for a data center. We don&#8217;t know the exact location, but then we might go into Google Earth and look around. Maybe we happen to find some new construction, and then we end up determining that that&#8217;s actually the data center in question.</p><p>We also look at permit documents. We use ChatGPT, which is actually quite helpful as a tool to search quite deeply through permit databases and find permits, which often mention the address of the site. So we can narrow down the location that way.</p><p>Going forward, we&#8217;re interested in doing more automated methods of discovery using satellite imagery. But so far we&#8217;ve just been using a combination of news, permit documents, and manual searches of satellite imagery.</p><p><strong>Attendee</strong></p><p>I saw that it looks like there&#8217;s going to be more data on China, which is exciting. What are the challenges to gathering that data and how are you going to get around them?</p><p><strong>Yafah</strong></p><p>There are a few challenges here. 
One is that we are not nearly as optimistic about the potential for permits and similar filings to cover data centers as well. I also expect there to be a lot more noise with people claiming buildouts that don&#8217;t exist. However, for our first steps in this, we&#8217;ve looked through some sources, located a data center, and it turned out to be half a gigawatt, which is very large. It is an older data center, but there is some amount of new buildout. So to some extent, I think some of the similar sources, looking for new sources, people talking online, is at least promising for first steps.</p><p>For additional steps, we are excited to hopefully use low-resolution satellite imagery to automatically search for very large construction sites, which we can then do more research on to discover whether there are data centers. We&#8217;re also excited to investigate other avenues for similar plans. Additionally, our satellite methodology we expect to largely work in China, allowing us to, once we have found a potential site for a data center, do an analysis very similar to our current one.</p><p><strong>Attendee</strong></p><p>First of all, thank you for doing this, all of you. How will you go about determining which data centers communicate for decentralized computing?</p><p><strong>Yafah</strong></p><p>We&#8217;re still at early stages of this. One thing I&#8217;m optimistic about is seeing the infrastructure being built for fibers, looking at permits and looking at prior coverage of this sort of thing. This is something that we are eager to do but have not currently done.</p><p><strong>Ben</strong></p><p>Just to add to that, sometimes the companies do just say that the data centers are connected. For instance, the recent announcement from Microsoft of the second Fairwater Data Center. One&#8217;s in Atlanta, one is in Mount Pleasant, Wisconsin. They say that these data centers are connected by fiber. I think some claims may be more vague than others. 
We&#8217;d want to verify it as best we can. But in the absence of other information, we would just take a company&#8217;s word for it that data centers are in fact connected or planned to be connected. We indicate our confidence level in these&#8212;whether or not they are connected or other things, like who is using the data center. We have a confidence level that we indicate just to give people a sense of, are we confident in this? Or is this more like a rumor?</p><p><strong>Maria</strong></p><p>Yes. I see a question here from Yan Riviere. Earlier you talked about the gas turbines used before going to the grid. Do you think the turbines may be moved from one data center to another while waiting for the grid to be linked? I&#8217;m not sure if that was what you were implying.</p><p><strong>Yafah</strong></p><p>There are two instances of this I specifically refer to, one of which is xAI Colossus, which I believe rented portable gas turbines to power their data center before it was connected to the grid. Those would have been moved somewhere else afterward. Those are, I believe, typically used for disaster areas and providing backup power for them.</p><p>The other example is Abilene. Abilene is near a large amount of wind power, which is intermittent and thus needs a lot of backup power regardless. So I believe their plan, as stated, is to connect to the grid, which I believe they have already done. Ben can correct me if I&#8217;m wrong on that. And use the gas turbines as backup for the grid. The gas turbines there are permanent.</p><p><strong>Ben</strong></p><p>There&#8217;s I believe 200 megawatts worth of grid power already supplying Stargate in Abilene, and they&#8217;re working on a one gigawatt additional substation to fully power the final eight buildings.</p><p><strong>Maria</strong></p><p>Great. We have another question here from Ashby Field. They say I missed most of the presentation, so I apologize if it was covered. 
How do you derive the speculative users of the center based on map data? Will you track policy, tax incentivization, any location-based grid intensity datasets?</p><p><strong>Ben</strong></p><p>In terms of the speculative users, this isn&#8217;t just based on map data. We look at announcements or permit documents for mentions of this. Sometimes permit documents will be filed under a shell company, but we&#8217;re able to track that back to the actual company. If a user is marked speculative, that usually means we have some reason to believe that a certain company is using the data center, but this hasn&#8217;t been confirmed in any way.</p><p>We actually updated our estimate in one case. There&#8217;s a data center called Goodnight, where we initially thought it was speculatively for OpenAI because the buildings had the same design as Stargate. But we&#8217;ve since heard that Google is actually investing in a data center in that location. So we switched to thinking that it&#8217;s more likely to be used by Google. So we&#8217;re updating our database based on the best evidence we have.</p><p><strong>Yafah</strong></p><p>We use tax abatements in some cases for learning about data centers. We expect to continue using this as one of many data sources. Some of those documents will be linked in the notes section of the satellite view of our data hub.</p><p>As for location-based grid intensity datasets, we do not have any that we are releasing. We sometimes use grid information that&#8217;s similar to that, but nothing substantial that we&#8217;ve created on our own beyond the amount of power data centers use, which is obviously location-based and which we do track.</p><p><strong>Maria</strong></p><p>Okay. Here&#8217;s another question by Edward Kant. Are there any legal challenges in collecting and publishing such information? 
Do you want to take this or should I?</p><p><strong>Yafah</strong></p><p>I&#8217;ll take this. The answer is largely no. There is one particular thing which we had to deal with, which was getting proper licensing for some of the images. We did this, but there are no other legal challenges involved. Satellite data is&#8212;I know it surprises some people&#8212;satellite data is publishable.</p><p><strong>Maria</strong></p><p>There&#8217;s this new one by Marco DiDonato. I think it could be interesting to track the overall supply, global supply of gas turbines that could be used on data centers. There are actually only three major gas turbine suppliers in the world. It could help to understand what is the actual hard cap on the amount of power that can be deployed.</p><p><strong>Yafah</strong></p><p>This is definitely an interesting methodology. There&#8217;s one question you can ask, which is, to what extent are there enough traditional gas turbines to power these data centers? However, there are other alternative sources which we believe will provide power should these data centers not be able to get gas turbines. Gas turbines are often the most cost-effective power source, especially combined-cycle ones, but there are several alternatives.</p><p>In particular, a data center can have a flexible load where they go down during peak times, which makes it much easier for grids to supply them. A data center can also use solar and batteries, which we believe is very scalable with fairly small lead times, although it is more expensive than traditional power, but still not nearly as expensive as the GPUs which will go into that data center. So we expect that if necessary, AI companies will start using solar plus batteries.</p><p>Additionally, there are alternative types of gas turbines which you might not expect. For instance, some places are taking gas turbines out of jets, out of planes, and using them to power data centers. Something which I think is very cool. 
I&#8217;m not exactly sure how common that is yet. It&#8217;s probably more expensive because those types of gas turbines are simple-cycle turbines, which are less efficient.</p><h4><strong>Additional discussion on power [00:27:41]</strong></h4><p><strong>Yafah</strong></p><p>There&#8217;s one thing I want to repeat, because I think a lot of people are wrong about it. There are challenges with power. Infrastructure in the US is designed for demand for power which has largely plateaued, which happens in developed countries as they start using power more efficiently rather than adding additional power. This means that it is designed for a situation where you can plan very far in advance. Loads aren&#8217;t increasing by a lot, and you thus care a lot about keeping costs down as much as possible. This sort of slow strategy of planning years in advance and providing a large amount to keep costs down is&#8212;</p><p>I think a lot of the worry that people have about data centers not being able to get enough power is really not going to pan out, because data centers can absolutely afford to pay more and will pay more if necessary, because GPUs are still going to be a much larger cost than power. At the end of the day, if they have to pay twice as much to get the power quicker, they just will.</p><p><strong>Maria</strong></p><p>I see a new question. You mentioned jobs, which is obviously super important to local jurisdictions and governments considering data centers as economic stimulus. Are you collecting data for buildouts and perpetual operations separately? Nevertheless, this is great work. Do you want to take this?</p><p><strong>Yafah</strong></p><p>Right now we are not publishing job data, I believe. But when we add jobs to the hub, I expect to separate out these two sources. 
If anyone&#8217;s interested in my current impression, somewhere in the vicinity of perpetual operations requiring maybe a tenth to a fifth of the amount of people as the actual construction. Although this is a very preliminary, I&#8217;m sort of eyeballing it number, and I hope to have a lot more detail to share in the future.</p><h4><strong>Identifying sites and construction companies [00:30:46]</strong></h4><p><strong>Yafah</strong></p><p>While we&#8217;re waiting for anyone else to offer questions, I&#8217;m going to return to one. I think there&#8217;s some interesting additional information. Anton asked, how are we identifying sites? One thing I&#8217;m excited to dig more into is construction companies will often discuss the sites they are working on in their earnings reports or on their website. We&#8217;ve found some pretty interesting sources using this, and I&#8217;m excited to continue using this to discover data centers which have not been formally announced.</p><p>Very often, &#8220;formally announced&#8221; is not a very clear definition. You will often have a data center where the local cities all know about it and it&#8217;s in their newspapers. But the company, like Microsoft, has not announced it in a big PR campaign. So there are these intermediate things that are also relevant.</p><p><strong>Maria</strong></p><p>I think one question that is on the website and that I found interesting is whether an entire data center can be used to train one AI model. Could you explain that?</p><p><strong>Yafah</strong></p><p>Typically, we are under the impression that most of the new data centers could, in theory, be used to train one AI model. 
However, contrary to what some of these companies imply, we believe that this happens pretty rarely, and that in fact, the largest AI models that are currently being trained are being trained on a fraction of the data center.</p><p>This is also part of why, when people talk about being able to connect these data centers and do distributed training, I&#8217;m interested in this and I think it&#8217;s important to track, but I don&#8217;t think it is actually necessary for scaling up in the present. There&#8217;s a lot of room just to use a larger portion of the data centers that already exist.</p><p>So when Google or when Microsoft talks about how we have a data center in this state and this state and they&#8217;re connected into a super whatever, I&#8217;m like, well, you&#8217;ve never used even a fifth of one of those data centers to train a model. So you&#8217;re getting ahead of yourselves. People should maybe on the margin pay a little bit less attention to this, although it&#8217;s still a thing we&#8217;ll be tracking because we think it could be important. It&#8217;s interesting and important to keep track of, if they wanted to, how big can they get? But this might not, in practice, be what they&#8217;re actually using.</p><p><strong>Maria</strong></p><p>Thank you. Okay, I think we&#8217;ve run out of questions so we can probably leave it here. Thanks so much, everyone, for joining and for asking questions, and Ben for presenting. You can email us if you have more questions, which we&#8217;ll be answering, or feedback, for example, things you want to see in the future. You can explore the data center hub at epoch.ai. You&#8217;ll see several databases. This is one of them. Thank you so much. 
Have a great rest of your day.</p><p><strong>Ben</strong></p><p>Thanks everyone.</p>]]></content:encoded></item><item><title><![CDATA[Frontier Data Centers hub on mobile]]></title><description><![CDATA[AI infrastructure, now in your pocket.]]></description><link>https://epochai.substack.com/p/frontier-data-centers-hub-on-mobile</link><guid isPermaLink="false">https://epochai.substack.com/p/frontier-data-centers-hub-on-mobile</guid><dc:creator><![CDATA[Epoch AI]]></dc:creator><pubDate>Tue, 25 Nov 2025 02:33:01 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/3b98a6be-c02a-4b57-b43b-8f0ff0ecd133_490x872.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>We&#8217;ve optimized our Frontier Data Centers hub for mobile.</p><p>You can now examine annotated, recent, high-resolution satellite imagery of the world&#8217;s largest compute clusters directly from your phone at <a href="https://epoch.ai/data/data-centers">https://epoch.ai/data/data-centers</a>.</p><p>Here&#8217;s a look at the updated Satellite Viewer:</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;df9b7541-12c8-4656-941d-a1d8ac047f63&quot;,&quot;duration&quot;:null}"></div><p></p>]]></content:encoded></item></channel></rss>