Subscribe
Sign in
Home
Gradient Updates by Epoch AI
The Epoch AI Brief
Podcast
Archive
About
The Epoch AI Brief - February 2025
Epoch AI's 2024 Impact Report covers AI research, benchmarking, biology AI models, training compute, economic impacts of AI, and more. Read our latest…
39 mins ago
•
Epoch AI
Algorithmic progress likely spurs more spending on compute, not less
Algorithmic progress in AI may not reduce compute spending—instead, it could drive higher investment as efficiency unlocks new opportunities.
Feb 14
•
Matthew Barnett
How much energy does ChatGPT use?
This Gradient Updates issue explores how much energy ChatGPT uses per query, revealing it's 10x less than common estimates.
Feb 7
•
Josh You
January 2025
What went into training DeepSeek-R1?
On January 20th, 2025, DeepSeek released their latest open-weights reasoning model, DeepSeek-R1, which is on par with OpenAI’s o1 in benchmark…
Jan 31
•
Ege Erdil
AGI could drive wages below subsistence level
Historically, many have feared that automation would lead to mass unemployment and lower wages.
Jan 25
•
Matthew Barnett
How has DeepSeek improved the Transformer architecture?
DeepSeek has recently released DeepSeek v3, which is currently state-of-the-art in benchmark performance among open-weight models, alongside a technical…
Jan 18
•
Ege Erdil
The economic consequences of automating remote work
Recent AI progress has shown great promise in automating cognitive tasks, like those in natural language processing and vision.
Jan 10
•
Epoch AI
December 2024
Moravec’s paradox and its implications
Since the birth of the field of artificial intelligence in the 20th century, researchers have observed that the difficulty of a task for humans at best…
Dec 27, 2024
•
Ege Erdil
How do mixture-of-experts models compare to dense models in inference?
In last week’s Gradient Updates issue, I discussed how we can guess that GPT-4o and Claude 3.5 Sonnet have significantly fewer parameters than GPT-4.
Dec 20, 2024
•
Ege Erdil
Frontier language models have become much smaller
Between the release of the original Transformer in 2017 and the release of GPT-4, language models at the frontier of capabilities became much larger.
Dec 13, 2024
•
Ege Erdil
What did US export controls mean for China’s AI capabilities?
Four days ago, the US government announced new rules around the export of powerful chips and semiconductor manufacturing equipment to China.
Dec 6, 2024
•
Ege Erdil
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts