Published: Jul 10, 2023
Revised: Aug 8, 2023
  • Working Paper #1
Working paper
  • Working paper
  • Working Paper #1

Forecasting Existential Risks: Evidence from a Long-Run Forecasting Tournament

Forecasting Existential Risks: Evidence from a Long-Run Forecasting Tournament
What are the gravest risks facing humanity over the next century and beyond? In 2022, the Existential Risk Persuasion Tournament (XPT) brought together 80 experts and 89 superforecasters to predict the risks to humanity posed by artificial intelligence, nuclear war, pandemics, and other causes.
Ezra Karger†,*, Josh Rosenberg*, Zachary Jacobs*, Molly Hickman*, Rose Hadshar*, Kayla Gamin*, Taylor Smith*, Bridget Williams*, Tegan McCaslin*, Stephen Thomas*, Philip E. Tetlock‡,* ,
† Federal Reserve Bank of Chicago
* Forecasting Research Institute
‡ Wharton School of the University of Pennsylvania
Published: Jul 10, 2023
Revised: Aug 8, 2023
Ezra Karger†,*, Josh Rosenberg*, Zachary Jacobs*, Molly Hickman*, Rose Hadshar*, Kayla Gamin*, Taylor Smith*, Bridget Williams*, Tegan McCaslin*, Stephen Thomas*, Philip E. Tetlock‡,*

Abstract

The Existential Risk Persuasion Tournament (XPT) aimed to produce high-quality forecasts of the risks facing humanity over the next century by incentivizing thoughtful forecasts, explanations, persuasion, and updating from 169 forecasters over a multi-stage tournament. In this first iteration of the XPT, we discover points where historically accurate forecasters on short-run questions (superforecasters) and domain experts agree and disagree in their probability estimates of short-, medium-, and long-run threats to humanity from artificial intelligence, nuclear war, biological pathogens, and other causes. We document large-scale disagreement and minimal convergence of beliefs over the course of the XPT, with the largest disagreement about risks from artificial intelligence. The most pressing practical question for future work is: why were superforecasters so unmoved by experts’ much higher estimates of AI extinction risk, and why were experts so unmoved by the superforecasters’ lower estimates? The most puzzling scientific question is: why did rational forecasters, incentivized by the XPT to persuade each other, not converge after months of debate and the exchange of millions of words and thousands of forecasts?

Acknowledgments

This research would not have been possible without the generous support of the Musk Foundation and the Long-Term Future Fund. We thank Walter Frick, Michael Page, Terry Murray, David Budescu, Barb Mellers, Elie Hassenfeld, Philipp Schoenegger, and Pavel Atanasov for their thoughtful feedback and comments. We are also grateful for the many people who reviewed earlier drafts of this report. We greatly appreciate the assistance of Amory Bennett, Kaitlyn Coffee, Adam Kuzee, Avital Morris, Fiona Pollack, Coralie Consigny, Arunim Agarwal, and Rumtin Sepasspour throughout the project. Lastly, we extend our gratitude to our research participants for their invaluable contributions.

† Federal Reserve Bank of Chicago
* Forecasting Research Institute
‡ Wharton School of the University of Pennsylvania
    Related Research
    Working paper
    Assessing Near-Term Accuracy in the Existential Risk Persuasion Tournament
    Sep 2, 2025
    Academic article
    Belief updating in AI-risk debates: Exploring the limits of adversarial collaboration
    Apr 3, 2025
    Academic article
    Subjective-probability forecasts of existential risk: Initial results from a hybrid persuasion-forecasting tournament
    Jan 17, 2025
    Project
    The Longitudinal Expert AI Panel (LEAP)
    Ongoing