{"id":1680,"date":"2024-03-11T12:00:00","date_gmt":"2024-03-11T12:00:00","guid":{"rendered":"https:\/\/forecastingresearch.org\/?post_type=research&#038;p=1680"},"modified":"2026-05-05T14:35:56","modified_gmt":"2026-05-05T14:35:56","slug":"roots-of-disagreement-on-ai-risk","status":"publish","type":"research","link":"https:\/\/forecastingresearch.org\/research\/roots-of-disagreement-on-ai-risk","title":{"rendered":"Roots of Disagreement on AI Risk"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\" id=\"abstract\">Abstract<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">We brought together generalist forecasters and domain experts (n=22) who disagreed about the risk AI poses to humanity in the next century. The \u201cconcerned\u201d participants (all of whom were domain experts) predicted a 20% chance of an AI-caused existential catastrophe by 2100, while the \u201cskeptical\u201d group (mainly \u201csuperforecasters\u201d) predicted a 0.12% chance. Participants worked together to find the strongest near-term cruxes: forecasting questions resolving by 2030 that would lead to the largest change in their beliefs (in expectation) about the risk of existential catastrophe by 2100. Neither the concerned nor the skeptics substantially updated toward the other\u2019s views during our study, though one of the top short-term cruxes we identified is expected to close the gap in beliefs about AI existential catastrophe by about 5%: approximately 1 percentage point out of the roughly 20 percentage point gap in existential catastrophe forecasts. We find greater agreement about a broader set of risks from AI over the next thousand years: the two groups gave median forecasts of 30% (skeptics) and 40% (concerned) that AI will have severe negative effects on humanity by causing major declines in population, very low self-reported well-being, or extinction.<\/p>\n\n\n\n<div class=\"wp-block-buttons is-layout-flex wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\"><a class=\"btn orange\" href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf\" target=\"_blank\" rel=\"noreferrer noopener\">View the full PDF report <svg width=\"7\" height=\"9\" viewBox=\"0 0 7 9\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\">\n  <path d=\"M0.000156283 8.60806L4.22416 4.33606V4.24006L0.000156283 6.10352e-05H1.80816L6.06416 4.28806L1.80816 8.60806H0.000156283Z\" fill=\"#102B23\"\/>\n<\/svg>\n<svg width=\"8\" height=\"10\" viewBox=\"0 0 8 10\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\">\n  <path d=\"M0.601719 8.85794L4.82572 4.58594V4.48994L0.601719 0.249939H2.40972L6.66572 4.53794L2.40972 8.85794H0.601719Z\" fill=\"#102B23\"\/>\n<\/svg><\/a><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-constrained wp-block-group-is-layout-constrained\">\n<details class=\"wp-block-details is-layout-flow wp-block-details-is-layout-flow\"><summary>Acknowledgments<\/summary>\n<p class=\"wp-block-paragraph\">This research would not have been possible without the support of Open Philanthropy. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We thank the research participants for their invaluable contributions. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We greatly appreciate the assistance of Page Hedley for data analysis and editing on the report, Taylor Smith and Bridget Williams as adversarial collaboration moderators, and Kayla Gamin, Coralie Consigny, and Harrison Durland for their careful editing. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We thank Elie Hassenfeld, Eli Lifland, Nick Beckstead, Bob Sawyer, Kjirste Morrell, Adam Jarvis, Dan Mayland, Jeremiah Stanghini, Jonathan Hosgood, Dwight Smith, Ted Sanders, Scott Eastman, John Croxton, Raimondas Lencevicius, Alexandru Marcoci, Kevin Dorst, Jaime Sevilla, Rose Hadshar, Holden Karnofsky, Benjamin Tereick, Isabel Juniewicz, Walter Frick, Alex Lawsen, Matt Clancy, Tegan McCaslin, and Lyle Ungar for comments on the report.<\/p>\n<\/details>\n<\/div><\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"executive-summary\">Executive summary<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">In the summer of 2022, researchers affiliated with the Forecasting Research Institute (FRI) (<a href=\"https:\/\/web.archive.org\/web\/20240215203117\/https:\/\/forecastingresearch.org\/\"><u>a<\/u><\/a>)<sup data-fn=\"c6f06bd9-95f5-4c75-8acd-9fc2b3d16cc0\" class=\"fn\"><a href=\"#c6f06bd9-95f5-4c75-8acd-9fc2b3d16cc0\" id=\"c6f06bd9-95f5-4c75-8acd-9fc2b3d16cc0-link\">1<\/a><\/sup> ran the <a href=\"https:\/\/forecastingresearch.org\/research\/xpt\" id=\"876\" target=\"_blank\" rel=\"noreferrer noopener\"><u>Existential Risk Persuasion Tournament<\/u><\/a> (XPT) (<a href=\"https:\/\/web.archive.org\/web\/20240216152727\/https:\/\/forecastingresearch.org\/news\/results-from-the-2022-existential-risk-persuasion-tournament\"><u>a<\/u><\/a>), which identified large disagreements between domain experts and generalist forecasters about key risks to humanity (Karger et al. 2023). This new project\u2014a structured adversarial collaboration run in April and May 2023\u2014is a follow-up to the XPT focused on better understanding the drivers of disagreement about AI risk.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"methods\">Methods<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">We recruited participants to join \u201cAI skeptic\u201d (n=11) and \u201cAI concerned\u201d (n=11) groups that disagree strongly about the probability that AI will cause an existential catastrophe by 2100.<sup data-fn=\"c3aa60f6-dde7-495c-9f16-322669455d51\" class=\"fn\"><a href=\"#c3aa60f6-dde7-495c-9f16-322669455d51\" id=\"c3aa60f6-dde7-495c-9f16-322669455d51-link\">2<\/a><\/sup> The skeptic group included nine superforecasters and two domain experts. The concerned group consisted of domain experts referred to us by staff members at Open Philanthropy (the funder of this project) and the broader Effective Altruism community.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Participants spent 8 weeks (skeptic median: 80 hours of work on the\nproject; concerned median: 31 hours) reading background materials,\ndeveloping forecasts, and engaging in online discussion and video calls.\nWe asked participants to work toward a better understanding of their\nsources of agreement and disagreement, and to propose and investigate\n\u201ccruxes\u201d: short-term indicators, usually resolving by 2030, that would\ncause the largest updates in expectation to each group\u2019s view on the\nprobability of existential catastrophe due to AI by 2100.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"results-what-drives-and-doesnt-drive-disagreement-over-ai-risk\">Results:\nWhat drives (and doesn\u2019t drive) disagreement over AI risk<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">At the beginning of the project, the median \u201cskeptic\u201d forecasted a 0.10% chance of existential catastrophe due to AI by 2100, and the median \u201cconcerned\u201d participant forecasted a 25% chance. By the end, these numbers were 0.12% and 20% respectively, though many participants did not attribute their updates to arguments made during the project.<sup data-fn=\"80700b24-a53d-4a9f-8298-d7ce0b6478db\" class=\"fn\"><a href=\"#80700b24-a53d-4a9f-8298-d7ce0b6478db\" id=\"80700b24-a53d-4a9f-8298-d7ce0b6478db-link\">3<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We organize our findings as responses to four hypotheses about what\ndrives disagreement:<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"hypothesis-1---disagreements-about-ai-risk-persist-due-to-lack-of-engagement-among-participants-low-quality-of-participants-or-because-the-skeptic-and-concerned-groups-did-not-understand-each-others-arguments\">Hypothesis #1 &#8211; Disagreements about AI risk persist due to lack of engagement among participants, low quality of participants, or because the skeptic and concerned groups did not understand each other&#8217;s arguments<sup data-fn=\"d8ae5388-8049-4af4-8698-f129f31b2964\" class=\"fn\"><a href=\"#d8ae5388-8049-4af4-8698-f129f31b2964\" id=\"d8ae5388-8049-4af4-8698-f129f31b2964-link\">4<\/a><\/sup><\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">We found moderate evidence against these possibilities. Participants\nengaged for 25-100 hours each (skeptic median: 80 hours; concerned\nmedian: 31 hours), this project included a selective group of\nsuperforecasters and domain experts, and the groups were able to\nsummarize each other&#8217;s arguments well during the project and in\nfollow-up surveys. (<a href=\"#hypothesis-1-do-the-groups-understand-each-others-arguments-and-do-views-shift-with-more-engagement\"><u>More<\/u><\/a>)<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"hypothesis-2---disagreements-about-ai-risk-are-explained-by-different-short-term-expectations-e.g.-about-ai-capabilities-ai-policy-or-other-factors-that-could-be-observed-by-2030\">Hypothesis\n#2 &#8211; Disagreements about AI risk are explained by different short-term\nexpectations (e.g. about AI capabilities, AI policy, or other factors\nthat could be observed by 2030)<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Most of the disagreement about AI risk by 2100 is not explained by indicators resolving by 2030 that we examined in this project. According to our metrics of crux quality, one of the top cruxes we identified is expected to close the gap in beliefs about AI existential catastrophe by about 5% (approximately 1.2 percentage points out of the 22.7 percentage point gap in forecasts for the median pair) when it resolves in 2030.<sup data-fn=\"34c0d67f-081d-4f09-9d39-5d85b454c2e0\" class=\"fn\"><a href=\"#34c0d67f-081d-4f09-9d39-5d85b454c2e0\" id=\"34c0d67f-081d-4f09-9d39-5d85b454c2e0-link\">5<\/a><\/sup> For at least half of participants in each group, there was a question that was at least 5-10% as informative as being told by an oracle whether AI in fact caused an existential catastrophe or not.<sup data-fn=\"627ee814-9d5a-40a2-a4d4-c3e504b4de64\" class=\"fn\"><a href=\"#627ee814-9d5a-40a2-a4d4-c3e504b4de64\" id=\"627ee814-9d5a-40a2-a4d4-c3e504b4de64-link\">6<\/a><\/sup> It is difficult to contextualize the size of these effects because this is the first project applying question metrics to AI forecasting questions that we are aware of.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">However, near-term cruxes shed light on what the groups believe, where they disagree, and why:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Evaluations of dangerous AI capabilities are relevant to both groups.<\/strong> One of the strongest cruxes that will resolve by 2030 is about whether <a href=\"https:\/\/evals.alignment.org\/\"><u>METR<\/u><\/a> (formerly known as ARC Evals) (<a href=\"https:\/\/web.archive.org\/web\/20240216154134\/https:\/\/metr.org\/\"><u>a<\/u><\/a>) or a similar group will find that AI has developed dangerous capabilities such as autonomously replicating and avoiding shutdown. This crux illustrates a theme in the disagreement: the skeptic group typically did not find theoretical arguments for AI risk persuasive but would update their views based on real-world demonstrations of dangerous AI capabilities that verify existing theoretical arguments. If this question resolves negatively then the concerned group would be less worried, because it would mean that we have had years of progress from today\u2019s models without this plausible set of dangerous capabilities becoming apparent. (<a href=\"#convergent-cruxes-which-information-would-lead-to-less-disagreement-in-expectation\"><u>More<\/u><\/a>)<\/li>\n\n\n\n<li><strong>Generally, the questions that would be most informative\nto each of the two groups are fairly distinct.<\/strong> The concerned\ngroup\u2019s highest-ranked cruxes tended to relate to AI alignment and\nalignment research. The skeptic group\u2019s highest-ranked cruxes tended to\nrelate to the development of lethal technologies and demonstrations of\nharmful AI power-seeking behavior. This suggests that many of the two\ngroups\u2019 largest sources of uncertainty are different, and in many cases\nfurther investigation of one group\u2019s uncertainties would not persuade\nthe other. (<a href=\"#high-voi-questions\"><u>More<\/u><\/a>)<\/li>\n\n\n\n<li><strong>Commonly-discussed topics\u2014such as near-term economic\neffects of AI and progress in many AI capabilities\u2014did not seem like\nstrong cruxes.<\/strong> (<a href=\"#low-voi-questions\"><u>More<\/u><\/a>)<\/li>\n<\/ul>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"hypothesis-3---disagreements-about-ai-risk-are-explained-by-different-long-term-expectations\">Hypothesis\n#3 &#8211; Disagreements about AI risk are explained by different long-term\nexpectations<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">We found substantial evidence that disagreements about AI risk\ndecreased between the groups when considering longer time horizons (the\nnext thousand years) and a broader set of severe negative outcomes from\nAI beyond extinction or civilizational collapse, such as large decreases\nin well-being or total population.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Some of the key drivers of disagreement about AI risk are that the groups have different expectations about: (1) how long it will take until AIs have capabilities far beyond those of humans in all relevant domains; (2) how common it will be for AI systems to develop goals that might lead to human extinction; (3) whether killing all living humans would remain difficult for an advanced AI; and (4) how adequately they expect society to respond to dangers from advanced AI.<sup data-fn=\"ab5c17ca-3bbd-4259-97a5-f4a499b6de51\" class=\"fn\"><a href=\"#ab5c17ca-3bbd-4259-97a5-f4a499b6de51\" id=\"ab5c17ca-3bbd-4259-97a5-f4a499b6de51-link\">7<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Supportive evidence for these claims includes:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Both groups strongly expected that powerful AI (defined as \u201cAI\nthat exceeds the cognitive performance of humans in &gt;95% of\neconomically relevant domains\u201d) would be developed by 2100 (skeptic\nmedian: 90%; concerned median: 88%). Though, some skeptics argue that\n(i) strong physical capabilities (in addition to cognitive ones) would\nbe important for causing severe negative effects in the world, and (ii)\neven if AI can do most cognitive tasks, there will likely be a \u201clong\ntail\u201d of tasks that require humans.<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li>The two groups also put similar total probabilities on at least one of a cluster of bad outcomes from AI happening over the next 1000 years (median 40% and 30% for concerned and skeptic groups respectively).<sup data-fn=\"70c6e9ce-c7d0-49f0-9ff6-2dc16fa28f52\" class=\"fn\"><a href=\"#70c6e9ce-c7d0-49f0-9ff6-2dc16fa28f52\" id=\"70c6e9ce-c7d0-49f0-9ff6-2dc16fa28f52-link\">8<\/a><\/sup> But they distribute their probabilities differently over time: the concerned group concentrates their probability mass before 2100, and the skeptics spread their probability mass more evenly over the next 1,000 years.<\/li>\n\n\n\n<li>We asked participants when AI will displace humans as the primary force that determines what happens in the future.<sup data-fn=\"7b7f15d6-76d8-45b4-a68f-b3968547c30f\" class=\"fn\"><a href=\"#7b7f15d6-76d8-45b4-a68f-b3968547c30f\" id=\"7b7f15d6-76d8-45b4-a68f-b3968547c30f-link\">9<\/a><\/sup> The concerned group\u2019s median date is 2045 and the skeptic group\u2019s median date is 2450\u2014405 years later.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Overall, many skeptics regarded their forecasts on AI existential risk as worryingly high, although low in absolute terms relative to the concerned group.<sup data-fn=\"fedd13d8-aabd-4e81-973b-6232c48e718c\" class=\"fn\"><a href=\"#fedd13d8-aabd-4e81-973b-6232c48e718c\" id=\"fedd13d8-aabd-4e81-973b-6232c48e718c-link\">10<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Despite their large disagreements about AI outcomes over the long\nterm, many participants in each group expressed a sense of humility\nabout long-term forecasting and emphasized that they are not claiming to\nhave confident predictions of distant events.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"hypothesis-4---these-groups-have-fundamental-worldview-disagreements-that-go-beyond-the-discussion-about-ai\">Hypothesis\n#4 &#8211; These groups have fundamental worldview disagreements that go\nbeyond the discussion about AI<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Disagreements about AI risk in this project often connected to more\nfundamental worldview differences between the groups. For example, the\nskeptics were somewhat anchored on the assumption that the world usually\nchanges slowly, making the rapid extinction of humanity unlikely. The\nconcerned group worked from a different starting point: namely, that the\narrival of a higher-intelligence species, such as humans, has often led\nto the extinction of lower-intelligence species, such as large mammals\non most continents. In this view, humanity\u2019s prospects are grim as soon\nas AI is much more capable than we are. The concerned group also was\nmore willing to place weight on theoretical arguments with multiple\nsteps of logic, while the skeptics tended to doubt the usefulness of\nsuch arguments for forecasting the future.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"results-forecasting-methodology\">Results: Forecasting\nmethodology<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">This project establishes clear quantifiable metrics for evaluating\nthe quality of AI forecasting questions. And we view this project as an\nongoing one. So, we invite readers to try to generate cruxes that\noutperform the top cruxes from our project thus far\u2014an exercise that\nunderscores the value of establishing comparative benchmarks for new\nforecasting questions. See the <a href=\"https:\/\/forecastingresearch.org\/ai-risk-voi-vod\"><u>\u201cValue of\nInformation\u201d (VOI) and \u201cValue of Discrimination\u201d (VOD)\ncalculators<\/u><\/a> (<a href=\"https:\/\/forecastingresearch.org\/s\/AI-risk-VoI-VoD.xlsx\"><u>a<\/u><\/a>)\nto inform intuitions about how these question metrics work. And please\nreach out to the authors with suggestions for high-quality cruxes.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"broader-scientific-implications\">Broader scientific\nimplications<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">This project has implications for how much we should expect rational\ndebate to shift people\u2019s views on AI risk. Thoughtful groups of people\nengaged each other for a long time but converged very little. This\nraises questions about the belief formation process and how much is\ndriven by explicit rational arguments vs. difficult-to-articulate\nworldviews vs. other, potentially non-epistemic factors (see research\nliterature on motivated cognition, such as Gilovich et al. 2002; Kunda,\n1990; Mercier and Sperber, 2011).<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">One notable finding is that a highly informative crux for each group was whether their peers would update on AI risk over time. This highlights how social and epistemic groups can be important predictors of beliefs about AI risk.<sup data-fn=\"e985ddd0-4bca-4e0d-8575-dfdb257a783b\" class=\"fn\"><a href=\"#e985ddd0-4bca-4e0d-8575-dfdb257a783b\" id=\"e985ddd0-4bca-4e0d-8575-dfdb257a783b-link\">11<\/a><\/sup><\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"glossary\">Glossary<\/h2>\n\n\n\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-constrained wp-block-group-is-layout-constrained\">\n<p class=\"wp-block-paragraph\"><strong>ARC Evals<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">An organization, now called <strong>METR<\/strong> (Model Evaluation &amp; Threat Research), that works on assessing whether cutting-edge AI systems could pose catastrophic risks to civilization. See \u201c<a href=\"#arc-evals-the-strongest-convergent-crux\">ARC Evals<\/a>\u201d for discussion of forecasts conditional on METR finding evidence of AI having the ability to autonomously replicate, acquire resources, and avoid shutdown before 2030.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Convergent crux<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">A question such that, conditional on it resolving, two people or\ngroups will, in expectation, disagree less than they do now. See \u201c<a href=\"#convergent-cruxes-which-information-would-lead-to-less-disagreement-in-expectation\"><u>Convergent\nCruxes<\/u><\/a>\u201d for discussion of convergent cruxes found in this\nstudy.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Cross-camp pair<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">A pair consisting of one member of the skeptic group and one member\nof the concerned group. See \u201c<a href=\"#vod-which-near-term-questions-have-higher-and-lower-value-of-discrimination\"><u>VOD<\/u><\/a>\u201d\nfor discussion of questions that would narrow or widen disagreement for\nthe median cross-camp pair when ranked by <strong>VOD<\/strong>, and \u201c<a href=\"#differences-of-opinion-within-groups\"><u>Differences of Opinion\nWithin Groups<\/u><\/a>\u201d for discussion of each cross-camp pair\u2019s\ndifferences on one question.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Divergent crux<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">A question such that, conditional on it resolving, two people or\ngroups will, in expectation, disagree more than they do now. See \u201c<a href=\"#divergent-cruxes-which-information-would-lead-to-more-disagreement\"><u>Divergent\nCruxes<\/u><\/a>\u201d for discussion of divergent cruxes found in this\nstudy.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Existential catastrophe<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Defined in this study as an event in which at least one of the\nfollowing occurs:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>Humanity goes extinct<\/li>\n\n\n\n<li>Humanity experiences \u201cunrecoverable collapse,\u201d which means\neither:\n<ul class=\"wp-block-list\">\n<li>&lt;$1 trillion global GDP annually [in 2022 dollars] for at\nleast a million years (continuously), beginning before 2100; or<\/li>\n\n\n\n<li>Human population remains below 1 million for at least a million\nyears (continuously), beginning before 2100.<\/li>\n<\/ul>\n<\/li>\n<\/ol>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Flash forecast<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">A forecast on which participants were recommended to spend\napproximately 10 minutes.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>IC<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Instrumental convergence, the hypothesized tendency for intelligent agents to develop similar sub-goals that are helpful for achieving most other goals, even if their ultimate goals are very different. In particular, sub-goals like acquiring resources, avoiding being killed\/destroyed, and avoiding interference from other agents could be helpful for achieving a wide variety of other goals. See \u201c<a href=\"#arc-evals-the-strongest-convergent-crux\">ARC Evals<\/a>\u201d for discussion of forecasts conditional on a model having capabilities that might suggest instrumental convergence.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>METR<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Model Evaluation &amp; Threat Research. See <strong>ARC\nEvals<\/strong>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>U<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The \u201cUltimate question.\u201d In this study: \u201cWill AI cause an\n<strong>existential catastrophe<\/strong> by 2100?\u201d<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>VOD<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Value of Discrimination (VOD) is a measure of how much knowing the\nanswer to a question would change relative beliefs between individuals,\nin expectation. It is useful for measuring convergence and divergence in\nexpected beliefs between individuals. See \u201c<a href=\"#vod-which-near-term-questions-have-higher-and-lower-value-of-discrimination\"><u>VOD<\/u><\/a>\u201d for discussion of questions that\nwould narrow or widen disagreement between the skeptic and concerned\ngroups in expectation and <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=94\" rel=\"noreferrer noopener\" target=\"_blank\"><u>Appendix 2<\/u><\/a>\nfor an explanation of how VOD is calculated.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>VOI<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Value of Information (VOI) is a measure of how much knowing the answer to a question would change an individual&#8217;s belief, in expectation. This is useful for understanding why individuals believe what they believe and what would change their minds. See \u201c<a href=\"#voi-which-near-term-questions-have-higher-and-lower-value-of-information\"><u>VOI<\/u><\/a>\u201d for discussion of informative questions and <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=94\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 2<\/a> for an explanation of how VOI is calculated.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>P(U)<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The probability that U, the ultimate question, occurs. In this case,\nthe probability that AI causes an existential catastrophe by 2100.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>P(C)<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The probability that a potential crux question occurs. See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" rel=\"noreferrer noopener\" target=\"_blank\"><u>Appendix 1<\/u><\/a> for a list of candidate\ncruxes, and \u201c<a href=\"#results-tables-and-figures\"><u>VOI: Results\nTables and Figures<\/u><\/a>\u201d for the median participant in each group\u2019s\nP(C) for each crux.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>POM<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Percent of max. When we present VOI and VOD for each question, we also present how much of the maximum VOI or VOD it captured in order to <a href=\"#contextualizing-the-magnitude-of-the-value-of-information\"><u>contextualize the magnitude of the results<\/u><\/a>. See \u201c<a href=\"#voi-which-near-term-questions-have-higher-and-lower-value-of-information\"><u>VOI<\/u><\/a>\u201d for discussion of POM VOI and \u201c<a href=\"#vod-which-near-term-questions-have-higher-and-lower-value-of-discrimination\"><u>VOD<\/u><\/a>\u201d for discussion of POM VOD. See \u201c<a href=\"#arc-evals-the-strongest-convergent-crux\">ARC Evals<\/a>\u201d for an example of calculating POM VOD.<\/p>\n<\/div><\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"background-motivation\">Background &amp; Motivation<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">From June through October 2022, researchers affiliated with the Forecasting Research Institute (FRI) conducted the <a href=\"https:\/\/static1.squarespace.com\/static\/635693acf15a3e2a14a56a4a\/t\/64f0a7838ccbf43b6b5ee40c\/1693493128111\/XPT.pdf\"><u>Existential Risk Persuasion Tournament<\/u><\/a> (XPT) (<a href=\"https:\/\/web.archive.org\/web\/20240216161711\/https:\/\/static1.squarespace.com\/static\/635693acf15a3e2a14a56a4a\/t\/64f0a7838ccbf43b6b5ee40c\/1693493128111\/XPT.pdf\"><u>a<\/u><\/a>). A clear pattern emerged in its findings: AI domain experts thought extinction due to AI in the 21st century was much more likely than skilled generalist forecasters (\u201csuperforecasters\u201d) thought, and neither group persuaded the other much, despite working collaboratively and being incentivized to share persuasive arguments.<sup data-fn=\"01457835-934d-436b-8870-5f869919fab2\" class=\"fn\"><a href=\"#01457835-934d-436b-8870-5f869919fab2\" id=\"01457835-934d-436b-8870-5f869919fab2-link\">12<\/a><\/sup> In addition, experts and superforecasters often agreed about short-term AI developments, while still disagreeing about the likelihood of extinction due to AI.<sup data-fn=\"7b899eaa-2958-46a2-8e48-8fe5d7f7698c\" class=\"fn\"><a href=\"#7b899eaa-2958-46a2-8e48-8fe5d7f7698c\" id=\"7b899eaa-2958-46a2-8e48-8fe5d7f7698c-link\">13<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In April and May of 2023, FRI ran a follow-up AI adversarial collaboration<sup data-fn=\"c6a448ac-3c27-48d0-abdd-82ad6400ecdf\" class=\"fn\"><a href=\"#c6a448ac-3c27-48d0-abdd-82ad6400ecdf\" id=\"c6a448ac-3c27-48d0-abdd-82ad6400ecdf-link\">14<\/a><\/sup> project that aimed to figure out what drives disagreement about long-run AI risk. We aimed to get more time from a select group of high-quality participants and supported them with moderators, adversarial collaboration video calls, and seminar discussions with AI experts, among other activities. To support deep engagement, we kept this study small: eleven \u201cskeptics\u201d and eleven \u201cconcerned\u201d participants.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We also designed the project to identify short-run indicators (\u201ccruxes\u201d) resolving by 2030 that could help to diagnose reasons for disagreement and act as signals for the level of long-run AI risk.<sup data-fn=\"1be41199-4b66-42d9-9b12-627b2951dda5\" class=\"fn\"><a href=\"#1be41199-4b66-42d9-9b12-627b2951dda5\" id=\"1be41199-4b66-42d9-9b12-627b2951dda5-link\">15<\/a><\/sup> While the XPT questions were chosen by our research team, this project asked the participants to collaborate to find the strongest cruxes, or short-run AI questions that would change beliefs about long-run AI risk the most in expectation.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">So, why do thoughtful people disagree so strongly about AI risk? We\norganize our findings into four hypotheses about drivers of\ndisagreement.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">First, people who disagree may have not spent enough time engaging with each other or may not understand each other\u2019s arguments. Some of our readers suspected that superforecasters had not digested the main arguments for AI risk and would have been more concerned if they had, whereas others suspected that experts simply spent too much time talking to people who share their worldview and hadn\u2019t spent enough time talking to thoughtful skeptics.<sup data-fn=\"58f5700e-f100-44f2-8403-a6e9d6db430a\" class=\"fn\"><a href=\"#58f5700e-f100-44f2-8403-a6e9d6db430a\" id=\"58f5700e-f100-44f2-8403-a6e9d6db430a-link\">16<\/a><\/sup> If this hypothesis were true, we would expect that the two groups would agree more if there were enough high-quality engagement between them to understand each other\u2019s arguments.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Second, the disagreeing groups could have different predictions about short-term (by 2030) AI developments, such as how likely AI is to develop dangerous capabilities or which AI policies society is likely to adopt. If this hypothesis were true, we would expect the two groups to agree if we condition on specific AI-related developments. For example, if they disagreed about how long it will take until AI can write code to improve itself, but agreed that this development would mean serious danger for humanity, then we would expect them to agree on AI risk if we condition on AI improving itself. In this project, we asked participants to make many such conditional forecasts.<sup data-fn=\"b9bd062c-90b5-4093-92b5-02d08b7c759c\" class=\"fn\"><a href=\"#b9bd062c-90b5-4093-92b5-02d08b7c759c\" id=\"b9bd062c-90b5-4093-92b5-02d08b7c759c-link\">17<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Third, they could disagree about how AIs will develop or how society\nwill respond in the longer term (through 2100 or beyond). Perhaps the\ngroups cannot identify short-term AI outcomes that distinguish their\nrisk models, but expect very different long-term AI trajectories.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Finally, they could have more fundamental worldview disagreements\nthat go beyond AI. If they agree about most AI-related developments but\ncontinue to disagree about AI risk, there could be something else\nunderlying their difference of opinion. There could be disagreements\nabout how much they trust different categories of evidence or\nargumentation, or what they believe about human ingenuity and\nresilience, or any number of other topics that go beyond AI.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"how-did-we-test-potential-drivers-of-disagreement\"><em>How did\nwe test potential drivers of disagreement?<\/em> <\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">We brought together 22 participants who disagreed strongly on AI existential risk. Half of the participants were termed AI \u201cskeptics,\u201d<sup data-fn=\"c8e0aef2-2064-41a1-a854-67a82685bfb7\" class=\"fn\"><a href=\"#c8e0aef2-2064-41a1-a854-67a82685bfb7\" id=\"c8e0aef2-2064-41a1-a854-67a82685bfb7-link\">18<\/a><\/sup> people whose XPT forecasts of the probability that AI would cause extinction by 2100 were &lt;1%, and who produced high-quality rationales for their forecasts. This group of 11 AI skeptics included nine superforecasters and two domain experts. The other 11 participants were people concerned about AI, whom we expected to forecast a &gt;10% chance that AI would cause an existential catastrophe by 2100. The \u201cAI concerned\u201d participants were AI safety researchers and AI-knowledgeable generalist researchers who were recommended as being able to present and discuss AI-concerned views clearly.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We asked these two groups to engage deeply with each other&#8217;s\narguments and to work together to identify cruxes with the most\npotential to update their forecasts on AI existential risk.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Participants made an initial forecast on the core question they\ndisagreed about (we\u2019ll call this U, for \u201cultimate question\u201d): by 2100,\nwill AI cause an existential catastrophe? We defined \u201cexistential\ncatastrophe\u201d as an event in which at least one of the following\noccurs:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>Humanity goes extinct<\/li>\n\n\n\n<li>Humanity experiences \u201cunrecoverable collapse,\u201d which means\neither:\n<ol class=\"wp-block-list\">\n<li>&lt;$1 trillion global GDP annually [in 2022 dollars] for at\nleast a million years (continuously), beginning before 2100; or<\/li>\n\n\n\n<li>Human population remains below 1 million for at least a million\nyears (continuously), beginning before 2100.<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n\n\n\n<p class=\"wp-block-paragraph\">For additional resolution details, such as the definition of \u201ccause,\u201d\nsee <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" rel=\"noreferrer noopener\" target=\"_blank\"><u>Appendix 1<\/u><\/a>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Over the next eight weeks, participants made forecasts on candidate\ncrux questions that could help explain the disagreement, generated new\npossible cruxes during adversarial collaboration calls, and debated and\nrefined their reasoning on an online platform. (See <a href=\"#how-the-ai-adversarial-collaboration-worked\"><u>section below<\/u><\/a> for more details on how\nthe project worked.)<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"the-central-disagreement\">The central disagreement <\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">The two groups were selected for disagreeing strongly about the\nlikelihood of existential catastrophe due to AI by 2100, and they\ncontinued to disagree throughout the project. At the outset, the median\nskeptic forecasted a 0.10% chance of existential catastrophe due to AI\nby 2100, and the median concerned participant forecasted a 25% chance.\nOver the course of the two-month project, there was mild convergence:\nthe skeptic group&#8217;s median moved from 0.10% to 0.12% and the concerned\ngroup&#8217;s median fell from 25% to 20%.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">However, April\u2013May 2023 was an exciting time in real-world AI developments: GPT-4 had just become available, and regulators and the public were beginning to respond. Several participants attributed their updated probability of extinction due to AI by 2100 to these developments, and not to updates they made based on their work on this project.<sup data-fn=\"59090342-fed5-48d2-a469-2112122fa7b3\" class=\"fn\"><a href=\"#59090342-fed5-48d2-a469-2112122fa7b3\" id=\"59090342-fed5-48d2-a469-2112122fa7b3-link\">19<\/a><\/sup><\/p>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><\/td><td><strong>Mean<\/strong><\/td><td><strong>Median<\/strong><\/td><td><strong>Range<\/strong><\/td><\/tr><tr><td><strong>Skeptic<\/strong><\/td><td>0.54%<\/td><td>0.1%<\/td><td>0.0000001% &#8211; 3%<\/td><\/tr><tr><td><strong>Concerned<\/strong><\/td><td>28.4%<\/td><td>25%<\/td><td>4% &#8211; 65%<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 1:<\/strong> Group P(AI-caused existential catastrophe by 2100), based on each participant&#8217;s initial forecast<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><\/td><td><strong>Mean<\/strong><\/td><td><strong>Median<\/strong><\/td><td><strong>Range<\/strong><\/td><\/tr><tr><td><strong>Skeptic<\/strong><\/td><td>0.46%<\/td><td>0.12%<\/td><td>0.0001% &#8211; 2%<\/td><\/tr><tr><td><strong>Concerned<\/strong><\/td><td>23.8%<\/td><td>20%<\/td><td>2.4% &#8211; 55%<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 2:<\/strong> Group P(AI-caused existential catastrophe by 2100) at the end of the project<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Six people in the concerned group lowered their forecasts and none\nraised them. Five people in the skeptic group raised their forecast,\nfour lowered, and one raised but only because of an initial typo. For\ndetails on updated forecasts and reasons for updates from each\nparticipant, see <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=102\" rel=\"noreferrer noopener\" target=\"_blank\"><u>Appendix 4<\/u><\/a>.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2024-03-11_ai-adversarial-collaboration_fig-01.png\" alt=\"\" \/><figcaption class=\"wp-element-caption\"><strong>Figure 1:<\/strong> Initial and final P(AI existential catastrophe by 2100) for skeptic and concerned groups. See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=102\" target=\"_blank\" rel=\"noreferrer noopener\"><u>Appendix 4<\/u><\/a> for reasons for updates.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">When we asked participants for forecasts on AI causing the deaths of\nmore than 60% of the human population (within a 5-year period) before\n2100, forecasts were closer than those about existential catastrophe,\nbut there was still a large disagreement. The median concerned\nparticipant forecasted 32%, and the median skeptic forecasted 1%. This\nsupports the claim that skeptics think that AI killing many people is\nmore likely than causing existential catastrophe, whether because of the\nlikelihood that it is useful to some other goal, the difficulty of\nkilling people living in remote areas, or the likelihood of successful\nsocietal response to extreme catastrophe. However, the disagreement\nbetween groups is still large. Their disagreement about AI risk is\ndeeper than the question of whether a very small number of humans will\nsurvive an AI catastrophe.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Although the disagreement between groups was large, many participants emphasized that their forecasts should be taken with a sense of humility, and that long-run forecasting is inherently uncertain and they are not claiming to have complete pictures of how events will unfold over the coming decades.<sup data-fn=\"fe8e3e2f-d2ec-4206-acd8-6546810423bc\" class=\"fn\"><a href=\"#fe8e3e2f-d2ec-4206-acd8-6546810423bc\" id=\"fe8e3e2f-d2ec-4206-acd8-6546810423bc-link\">20<\/a><\/sup> Most previous evidence on judgmental forecasting applies to geopolitical forecasts on 0-2 year time horizons.<sup data-fn=\"2ee49b87-96f2-41bf-b7e0-df265c825b20\" class=\"fn\"><a href=\"#2ee49b87-96f2-41bf-b7e0-df265c825b20\" id=\"2ee49b87-96f2-41bf-b7e0-df265c825b20-link\">21<\/a><\/sup><\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"how-the-ai-adversarial-collaboration-worked\">How the AI\nadversarial collaboration worked<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">The core activities of this project ran from April 1 to May 31,\n2023.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"recruitment\">Recruitment<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">We recruited 11 participants for the skeptic group who forecasted &lt;1% on P(AI extinction by 2100) in the XPT, and stood out to either our research team or other XPT participants as having high-quality rationales and being collaborative. Nine out of 11 of these participants were superforecasters and two were domain experts from our XPT sample.<sup data-fn=\"069d4399-dae6-4a3b-8be2-785f020ac14d\" class=\"fn\"><a href=\"#069d4399-dae6-4a3b-8be2-785f020ac14d\" id=\"069d4399-dae6-4a3b-8be2-785f020ac14d-link\">22<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We recruited 11 participants for the concerned group whom we expected\nto forecast &gt;10% on P(AI existential catastrophe by 2100) and to be\ncollaborative communicators. We began with recommendations for\nparticipants from staff members at Open Philanthropy and then did a\nbroader search for reputable AI safety researchers and AI-knowledgeable\ngeneralist researchers (such as participants from <a href=\"https:\/\/rethinkpriorities.org\/\"><u>Rethink Priorities<\/u><\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240204062020\/https:\/\/rethinkpriorities.org\/\"><u>a<\/u><\/a>)\nand <a href=\"https:\/\/epochai.org\/\"><u>Epoch<\/u><\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240214144948\/https:\/\/epochai.org\/\"><u>a<\/u><\/a>)).\nSeveral of the concerned participants also had strong public track\nrecords of forecasting accuracy on short-run questions.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In the rest of this report, to preserve anonymity, we refer to\nparticipants with assigned aliases. Aliases beginning with A-K are\nassigned to skeptics, and aliases beginning with P-Z are assigned to\nconcerned participants (in random order within each group).<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"activities-to-facilitate-engagement-between-the-skeptic-and-concerned-groups\">Activities\nto facilitate engagement between the skeptic and concerned groups<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">As preparation for the project, we asked the skeptic group to read\nHolden Karnofsky&#8217;s <a href=\"https:\/\/www.cold-takes.com\/most-important-century\/\"><u>Most\nImportant Century series<\/u><\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240214013554\/https:\/\/www.cold-takes.com\/most-important-century\/\"><u>a<\/u><\/a>)\nand related resources on AI existential risk recommended by staff\nmembers at Open Philanthropy.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Most discussion between groups happened on an online forum and\nforecasting platform that we set up for this project. Most quotes in\nthis report come from the platform discussion. Moderators identified key\nareas of disagreement and started forum threads to try to advance\ndebate. We intervened in a few cases where dialogue became combative\nrather than collaborative, and generally tried to orient participants\ntoward collaboration. For examples of platform discussion, see <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=114\" rel=\"noreferrer noopener\" target=\"_blank\"><u>Appendix 8<\/u><\/a>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Each participant also had a one-on-one adversarial collaboration call\nwith a member of the other group every two weeks, where participants\nwere asked to summarize one another\u2019s views and then generate possible\ncruxes. Most of these calls were moderated by a member of our team, who\nsteered discussion and asked follow-up questions, and a few were\nunmoderated. Our team moderated approximately 35 one-hour adversarial\ncollaboration video calls between individuals in the concerned and\nskeptic groups. The calls were recorded and, where noted, some quotes in\nthis report are from adversarial collaboration calls.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We initially elicited forecasts and rationales on <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" rel=\"noreferrer noopener\" target=\"_blank\"><u>P(AI existential catastrophe by 2100)<\/u><\/a>\nand questions related to <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=108\" rel=\"noreferrer noopener\" target=\"_blank\"><u>transformative\neconomic growth<\/u><\/a>. We worked with participants to generate ideas\nfor <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" rel=\"noreferrer noopener\" target=\"_blank\"><u>cruxes<\/u><\/a> resolving by 2030. We\nshared materials with participants about how we would measure crux\nquality. (See more on our &#8220;Value of information&#8221; metric <a href=\"#how-did-we-assess-the-cruxiness-of-forecasting-questions\"><u>below<\/u><\/a>.)\nWe also created forum threads to elicit cruxes. Based on discussion from\nthe forum and calls, we created targeted threads on particular topics\n(e.g. policy change, robotics, etc.) to identify cruxes.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We and the participants generated approximately 250 ideas for\n\u201ccruxes,\u201d and also considered cruxes proposed by AI experts during our\n<a href=\"https:\/\/forecastingresearch.org\/research\"><u>Conditional\nTrees<\/u> <u>project<\/u><\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240216162704\/https:\/\/forecastingresearch.org\/research\"><u>a<\/u><\/a>).<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"eliciting-forecasts-and-rationales-on-cruxes\">Eliciting\nforecasts and rationales on cruxes<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Every two weeks, our team selected approximately 11 of the most\npromising crux ideas, quickly turned them into forecasting questions,\nand asked each forecaster to provide &#8220;flash&#8221; (10 minute) forecasts on\nthem. (See the 33 flash forecasting questions <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" rel=\"noreferrer noopener\" target=\"_blank\"><u>here<\/u><\/a> and results <a href=\"#results-tables-and-figures\"><u>here<\/u><\/a>.)<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Cruxes that were most promising from each flash forecast round were\nthen operationalized into more rigorous forecasting questions and added\nto the platform to gather more in-depth (approximately 1 hour) forecasts\nand rationales. (See the four &#8220;Platform&#8221; forecasting questions <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" rel=\"noreferrer noopener\" target=\"_blank\"><u>here<\/u><\/a> and results <a href=\"#results-tables-and-figures\"><u>here<\/u><\/a>. We asked for\nin-depth forecasts on both the P(Crux) and the P(AI existential\ncatastrophe by 2100 | Crux).)<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"other-activities\">Other activities<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Participants also suggested valuable project activities. For example,\na participant\u2019s suggestion inspired the <a href=\"#survey-on-long-term-ai-outcomes\"><u>survey on long-term AI\noutcomes<\/u><\/a> that helped us get a broader sense of how this sample\nthought about outcomes beyond P(AI existential catastrophe by 2100).<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We held three 1-hour question-and-answer sessions attended by most\nskeptic participants with AI risk experts from DeepMind, the UK\nGovernment\u2019s Advanced Research + Invention Agency (ARIA), and Open\nPhilanthropy.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Our team shared results with participants when feasible and got\nvaluable feedback from them, including their suggested revisions on our\ninterpretations of their sources of agreement and disagreement.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"hypothesis-1-do-the-groups-understand-each-others-arguments-and-do-views-shift-with-more-engagement\">Hypothesis\n#1: Do the groups understand each other&#8217;s arguments, and do views shift\nwith more engagement?<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">In response to the XPT results, commenters argued that perhaps there was not convergence on AI risk forecasts because:<sup data-fn=\"92924628-e167-4239-90a4-f7136e9f69af\" class=\"fn\"><a href=\"#92924628-e167-4239-90a4-f7136e9f69af\" id=\"92924628-e167-4239-90a4-f7136e9f69af-link\">23<\/a><\/sup><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>There was not enough engagement among participants who disagreed with each other.<sup data-fn=\"2f69e906-f0dc-424a-b2a6-5d833f2fdee6\" class=\"fn\"><a href=\"#2f69e906-f0dc-424a-b2a6-5d833f2fdee6\" id=\"2f69e906-f0dc-424a-b2a6-5d833f2fdee6-link\">24<\/a><\/sup><\/li>\n\n\n\n<li>Experts who could compellingly make the case for AI risk were not included.<sup data-fn=\"53e182c0-bf66-4cc0-a2f0-6a6d6f08c31a\" class=\"fn\"><a href=\"#53e182c0-bf66-4cc0-a2f0-6a6d6f08c31a\" id=\"53e182c0-bf66-4cc0-a2f0-6a6d6f08c31a-link\">25<\/a><\/sup><\/li>\n\n\n\n<li>More broadly, perhaps the groups did not understand each other&#8217;s\narguments, and participants in one group would change their minds if\nthey spent substantial time working to absorb the other group&#8217;s\nevidence, arguments, and worldview.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">This follow-up study to the XPT was partly designed to assess the\nvalidity of these criticisms. And we see this study as providing\nmoderate evidence against these factors explaining the lack of\nconvergence:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Participants engaged in this project for 25-100 hours each (skeptic median: 80 hours; concerned median: 31 hours),<sup data-fn=\"af08eb9e-f160-484c-98af-385b0751321b\" class=\"fn\"><a href=\"#af08eb9e-f160-484c-98af-385b0751321b\" id=\"af08eb9e-f160-484c-98af-385b0751321b-link\">26<\/a><\/sup> and their engagement was supported by moderators, video calls, and a format focused on identifying cruxes, among other factors.<\/li>\n\n\n\n<li>We included concerned group <a href=\"#recruitment\"><u>domain experts<\/u><\/a> who were either recommended or approved by staff members at Open Philanthropy. We also held seminar discussions with AI risk experts from DeepMind, the UK Government\u2019s Advanced Research + Invention Agency (ARIA), and Open Philanthropy.<\/li>\n\n\n\n<li>The groups were able to summarize each other&#8217;s arguments well\nduring the project and in follow-up surveys, suggesting that they\nengaged with and understood arguments they disagreed with.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">The remainder of this section focuses on the groups&#8217; understanding of\neach other&#8217;s arguments according to their reports in a post-project\nsurvey.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"understanding-each-others-arguments\">Understanding each other\u2019s\narguments<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">To test whether experts and superforecasters failed to converge\nbecause they did not understand one another\u2019s arguments, we asked\nparticipants to discuss and explain one another\u2019s positions in several\nformats:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Participants gave rationales for their forecasts on the ultimate\nquestion (likelihood of AI existential catastrophe by 2100) and\ncandidate crux questions and discussed one another\u2019s rationales in an\nonline forum.<\/li>\n\n\n\n<li>Participants had moderated one-on-one adversarial collaboration\ncalls every two weeks, in which one skeptic and one concerned\nparticipant were asked to summarize each other\u2019s views and attempt to\ngenerate cruxes.<\/li>\n\n\n\n<li>In a survey at the end of the project, participants were asked to\nsummarize the best arguments and counterarguments for their own and the\nother side.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">We found that both groups were generally able to summarize the other\nside\u2019s arguments well. In a post-project survey, we asked \u201cWhat do you\nthink are the best three arguments put forward by each side?\u201d Below, we\ngive the arguments each side provided. The similarity between arguments\nprovided by skeptics and arguments provided by concerned participants\nattempting to summarize skeptics\u2019 arguments suggests that concerned\nparticipants had a good model of what skeptics thought, and vice\nversa.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"arguments-for-lower-risk\">Arguments for lower risk<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Arguments from <strong>skeptics<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Many different things would all need to go wrong in a short time frame for humanity to go extinct by 2100<sup data-fn=\"0134f789-8687-450a-bcf2-26afb94a53e5\" class=\"fn\"><a href=\"#0134f789-8687-450a-bcf2-26afb94a53e5\" id=\"0134f789-8687-450a-bcf2-26afb94a53e5-link\">27<\/a><\/sup><\/li>\n\n\n\n<li>Killing everyone is hard, and even if an AI kills many people, there are many ways that significant numbers could survive<sup data-fn=\"f2383c6e-df9b-4150-a092-a7fe4ac40b83\" class=\"fn\"><a href=\"#f2383c6e-df9b-4150-a092-a7fe4ac40b83\" id=\"f2383c6e-df9b-4150-a092-a7fe4ac40b83-link\">28<\/a><\/sup><\/li>\n\n\n\n<li>Theoretical arguments should not be weighted too heavily in the absence of real-life examples<sup data-fn=\"11742a45-7757-420e-88df-9ed0e13a8dab\" class=\"fn\"><a href=\"#11742a45-7757-420e-88df-9ed0e13a8dab\" id=\"11742a45-7757-420e-88df-9ed0e13a8dab-link\">29<\/a><\/sup><\/li>\n\n\n\n<li>We do not have enough evidence to be confident that AIs will want to harm large numbers of people<sup data-fn=\"66472849-4453-497c-bc85-71393f231754\" class=\"fn\"><a href=\"#66472849-4453-497c-bc85-71393f231754\" id=\"66472849-4453-497c-bc85-71393f231754-link\">30<\/a><\/sup><\/li>\n\n\n\n<li>Humans are likely to be able to solve alignment and control problems<sup data-fn=\"abd75fe2-d0b0-4e6d-a38c-56c017e96baf\" class=\"fn\"><a href=\"#abd75fe2-d0b0-4e6d-a38c-56c017e96baf\" id=\"abd75fe2-d0b0-4e6d-a38c-56c017e96baf-link\">31<\/a><\/sup><\/li>\n\n\n\n<li>2100 is too soon to expect to see AIs dangerous enough to cause human extinction, even if they will emerge eventually<sup data-fn=\"e49b0e36-fb48-4391-b567-76c52d962116\" class=\"fn\"><a href=\"#e49b0e36-fb48-4391-b567-76c52d962116\" id=\"e49b0e36-fb48-4391-b567-76c52d962116-link\">32<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Arguments from the <strong>concerned group<\/strong> (intending to\nsummarize skeptics\u2019 arguments, not necessarily their own strongest\narguments against AI existential catastrophe by 2100):<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Extinction would require multiple things, many of them unprecedented, to all go wrong<sup data-fn=\"be249a97-22d9-4193-b9a2-55f166e6e99b\" class=\"fn\"><a href=\"#be249a97-22d9-4193-b9a2-55f166e6e99b\" id=\"be249a97-22d9-4193-b9a2-55f166e6e99b-link\">33<\/a><\/sup><\/li>\n\n\n\n<li>Killing all humans is hard, even if killing a large number of people may not be<sup data-fn=\"8d35ef14-50dc-449d-8411-d350b803900f\" class=\"fn\"><a href=\"#8d35ef14-50dc-449d-8411-d350b803900f\" id=\"8d35ef14-50dc-449d-8411-d350b803900f-link\">34<\/a><\/sup><\/li>\n\n\n\n<li>Arguments for AI risk are mostly theoretical and do not have much empirical evidence to support them<sup data-fn=\"bb1b684a-7319-4c0e-ac22-727723f94b9a\" class=\"fn\"><a href=\"#bb1b684a-7319-4c0e-ac22-727723f94b9a\" id=\"bb1b684a-7319-4c0e-ac22-727723f94b9a-link\">35<\/a><\/sup><\/li>\n\n\n\n<li>Humans may be well-positioned to stop dangerous AIs as we have controlled other dangerous technologies<sup data-fn=\"85de65d8-6d46-459a-bbaf-1f8edac6fa2b\" class=\"fn\"><a href=\"#85de65d8-6d46-459a-bbaf-1f8edac6fa2b\" id=\"85de65d8-6d46-459a-bbaf-1f8edac6fa2b-link\">36<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Likewise, the similarity between arguments provided by concerned\nparticipants and arguments provided by skeptics attempting to summarize\nconcerned participants\u2019 arguments suggests that skeptics had a good\nmodel of what concerned participants thought.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"arguments-for-higher-risk\">Arguments for higher risk<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Arguments from the <strong>concerned group<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Non-extinction would require many things to all go right, many of which seem unlikely<sup data-fn=\"510e5aed-e72c-4313-a576-b952151331e6\" class=\"fn\"><a href=\"#510e5aed-e72c-4313-a576-b952151331e6\" id=\"510e5aed-e72c-4313-a576-b952151331e6-link\">37<\/a><\/sup><\/li>\n\n\n\n<li>Base rates are hard to use for transformative technologies or for outcomes with unclear reference classes<sup data-fn=\"611a9493-dd66-4d0c-8ea2-91ee5cf5a72c\" class=\"fn\"><a href=\"#611a9493-dd66-4d0c-8ea2-91ee5cf5a72c\" id=\"611a9493-dd66-4d0c-8ea2-91ee5cf5a72c-link\">38<\/a><\/sup><\/li>\n\n\n\n<li>Current progress is fast and on a steep trajectory<sup data-fn=\"c329b782-ea5f-4794-a903-467b7182df2c\" class=\"fn\"><a href=\"#c329b782-ea5f-4794-a903-467b7182df2c\" id=\"c329b782-ea5f-4794-a903-467b7182df2c-link\">39<\/a><\/sup><\/li>\n\n\n\n<li>Instrumental convergence is likely<sup data-fn=\"6d9dfcea-6d1a-49c2-bb23-4d4cb1920ea2\" class=\"fn\"><a href=\"#6d9dfcea-6d1a-49c2-bb23-4d4cb1920ea2\" id=\"6d9dfcea-6d1a-49c2-bb23-4d4cb1920ea2-link\">40<\/a><\/sup><\/li>\n\n\n\n<li>Alignment is a hard problem that we do not know how to solve<sup data-fn=\"0b87d530-4332-4de8-83e7-3128de6f2904\" class=\"fn\"><a href=\"#0b87d530-4332-4de8-83e7-3128de6f2904\" id=\"0b87d530-4332-4de8-83e7-3128de6f2904-link\">41<\/a><\/sup><\/li>\n\n\n\n<li>Short-term incentives may lead labs and other actors to be incautious<sup data-fn=\"19a5b93f-693b-4a15-bfed-bfa1f272fd5a\" class=\"fn\"><a href=\"#19a5b93f-693b-4a15-bfed-bfa1f272fd5a\" id=\"19a5b93f-693b-4a15-bfed-bfa1f272fd5a-link\">42<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Arguments from <strong>skeptics<\/strong> (intending to summarize the\nconcerned group\u2019s arguments, not necessarily their own strongest\narguments for AI existential catastrophe by 2100):<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Powerful and poorly-understood technology is inherently risky<sup data-fn=\"027d0b37-768f-42bf-aca3-7863c09409f7\" class=\"fn\"><a href=\"#027d0b37-768f-42bf-aca3-7863c09409f7\" id=\"027d0b37-768f-42bf-aca3-7863c09409f7-link\">43<\/a><\/sup><\/li>\n\n\n\n<li>It is difficult to use base rates and other forecasting tools for unprecedented situations<sup data-fn=\"b9486a3c-00d4-4291-b96b-5ec9f5ca9379\" class=\"fn\"><a href=\"#b9486a3c-00d4-4291-b96b-5ec9f5ca9379\" id=\"b9486a3c-00d4-4291-b96b-5ec9f5ca9379-link\">44<\/a><\/sup><\/li>\n\n\n\n<li>Capabilities progress in recent years has been very fast, often faster than predicted<sup data-fn=\"db37dbb5-171f-4d40-8159-423cdfa44433\" class=\"fn\"><a href=\"#db37dbb5-171f-4d40-8159-423cdfa44433\" id=\"db37dbb5-171f-4d40-8159-423cdfa44433-link\">45<\/a><\/sup><\/li>\n\n\n\n<li>AI alignment is a technically difficult problem<sup data-fn=\"40e389bc-3ce8-48e9-bae3-4f1c41258268\" class=\"fn\"><a href=\"#40e389bc-3ce8-48e9-bae3-4f1c41258268\" id=\"40e389bc-3ce8-48e9-bae3-4f1c41258268-link\">46<\/a><\/sup><\/li>\n\n\n\n<li>Instrumental convergence may be likely<sup data-fn=\"e6463a6e-da4f-44a1-bd0c-0e78b055ed0c\" class=\"fn\"><a href=\"#e6463a6e-da4f-44a1-bd0c-0e78b055ed0c\" id=\"e6463a6e-da4f-44a1-bd0c-0e78b055ed0c-link\">47<\/a><\/sup><\/li>\n\n\n\n<li>Incentives may make AI developers less cautious<sup data-fn=\"e658a913-1b2d-4dd2-8e99-744a0da2acb9\" class=\"fn\"><a href=\"#e658a913-1b2d-4dd2-8e99-744a0da2acb9\" id=\"e658a913-1b2d-4dd2-8e99-744a0da2acb9-link\">48<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Much more than the concerned group did, the skeptics also thought that one of the best arguments for concern is that we should be very cautious about scenarios that have the potential to be extremely dangerous, even if they are unlikely.<sup data-fn=\"7281addc-629b-4889-9af0-6b9f60fa598f\" class=\"fn\"><a href=\"#7281addc-629b-4889-9af0-6b9f60fa598f\" id=\"7281addc-629b-4889-9af0-6b9f60fa598f-link\">49<\/a><\/sup><\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"concluding-notes-on-understanding-and-engagement\">Concluding\nnotes on understanding and engagement<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Based on these survey results, we do not think that the main reason\nthese groups disagree is that they have not engaged with one another\u2019s\narguments. Each side could summarize the best arguments for the other\nside\u2019s positions in a way that mostly matched what that side would have\nsaid, but they continued to disagree strongly.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">For examples of back-and-forth discussion between participants in the\nproject about these topics, see <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=114\" rel=\"noreferrer noopener\" target=\"_blank\"><u>Appendix\n8<\/u><\/a>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We did not directly ask participants during the project whether they\nthought the other group understood their arguments, but we did ask them\nfor their opinions of the other group in general. Of the 11 skeptics,\nseven said they were \u201csatisfied\u201d or \u201cvery satisfied\u201d with the concerned\ngroup, and one said they were \u201cdissatisfied\u201d with the concerned group.\nOf the 11 concerned participants, six said they were \u201csatisfied\u201d or\n\u201cvery satisfied\u201d with the skeptic group, and three said they were\n\u201cdissatisfied\u201d or \u201cvery dissatisfied.\u201d In additional comments, some\nparticipants also said that they thought the other group was\nmisunderstanding their arguments, or making arguments that were based on\nmisunderstandings of the facts.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">It is possible that some participants were still misunderstanding one\nanother, or that there is a relevant level of understanding that is\ndeeper than being able to summarize one another\u2019s arguments, perhaps one\nthat takes longer to achieve. But overall, we think that participants\nbeing able to summarize one another\u2019s arguments, combined with most\nparticipants being satisfied with the other group, makes it unlikely\nthat the main disagreement is due to either group not understanding the\ndebate.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"hypothesis-2-were-disagreements-about-ai-risk-explained-by-different-short-term-expectations-e.g.-about-ai-capabilities-ai-policy-or-other-factors-that-could-be-observed-by-2030\">Hypothesis\n#2: Were disagreements about AI risk explained by different short-term\nexpectations (e.g. about AI capabilities, AI policy, or other factors\nthat could be observed by 2030)?<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">The second hypothesis is that the two groups disagree about various measurable AI indicators in the near-term (by 2030) and those indicators\u2019 effect on AI risk. We asked participants to generate crux ideas through intensive discussion and collected forecasts on the top 33 suggested near-term cruxes. For each question, we asked participants for forecasts about how likely it is that the crux resolves positively and how likely it is that the ultimate question (existential catastrophe due to AI by 2100) resolves positively conditional on the crux resolving positively. We imputed participants\u2019 views about how likely the ultimate question is to resolve positively if the crux resolves negatively.<sup data-fn=\"db064470-57c3-4194-9baa-1ae4321f8ef4\" class=\"fn\"><a href=\"#db064470-57c3-4194-9baa-1ae4321f8ef4\" id=\"db064470-57c3-4194-9baa-1ae4321f8ef4-link\">50<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We found that most of the disagreement about existential risk due to AI by 2100 is not explained by the shorter term indicators examined in this project. According to our metrics, approximately 5-10% of the disagreement between groups could be explained by any specific near-term crux.<sup data-fn=\"8c4de1ea-7b58-438c-9783-0631a6640dfe\" class=\"fn\"><a href=\"#8c4de1ea-7b58-438c-9783-0631a6640dfe\" id=\"8c4de1ea-7b58-438c-9783-0631a6640dfe-link\">51<\/a><\/sup> We did not ask participants for forecasts conditional on multiple questions all resolving positively (or negatively), so we do not have detailed information about how different cruxes would interact, or how participants would update if multiple surprising events all happened.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">However, near-term cruxes shed light on what the groups believe, where they disagree, and why:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Evaluations of dangerous AI capabilities are relevant to both groups.<\/strong> One of the strongest cruxes that will resolve by 2030 is about whether <a href=\"https:\/\/evals.alignment.org\/\"><u>METR<\/u><\/a> (formerly known as ARC Evals) (<a href=\"https:\/\/web.archive.org\/web\/20240216154134\/https:\/\/metr.org\/\"><u>a<\/u><\/a>) or a similar group will find that AI has developed dangerous capabilities such as autonomously replicating and avoiding shutdown.<sup data-fn=\"101efdf8-6590-4252-8af7-ed028bf5890a\" class=\"fn\"><a href=\"#101efdf8-6590-4252-8af7-ed028bf5890a\" id=\"101efdf8-6590-4252-8af7-ed028bf5890a-link\">52<\/a><\/sup> This crux illustrates a theme in the disagreement: the skeptic group typically did not find theoretical arguments for AI risk persuasive but would update their views based on real-world demonstrations of dangerous AI capabilities that verify existing theoretical arguments. If this question resolves negatively then the concerned group would be less worried, because it would mean that we have had years of progress from today\u2019s models without this plausible set of dangerous capabilities becoming apparent. (<a href=\"#convergent-cruxes-which-information-would-lead-to-less-disagreement-in-expectation\"><u>More<\/u><\/a>)<\/li>\n\n\n\n<li><strong>Generally, the questions that would be most informative\nto each of the two groups are fairly distinct.<\/strong> The concerned\ngroup\u2019s highest-ranked cruxes tended to relate to AI alignment and\nalignment research. The skeptic group\u2019s highest-ranked cruxes tended to\nrelate to the development of lethal technologies and demonstrations of\nharmful AI power-seeking behavior. This suggests that many of the two\ngroups\u2019 biggest sources of uncertainty are different, and in many cases\nfurther investigation of one group\u2019s uncertainties would not persuade\nthe other. (<a href=\"#high-voi-questions\"><u>More<\/u><\/a>)<\/li>\n\n\n\n<li><strong>Commonly-discussed topics\u2014such as near-term economic\neffects of AI and progress in many AI capabilities\u2014did not seem like\nstrong cruxes.<\/strong> (<a href=\"#low-voi-questions\"><u>More<\/u><\/a>)<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">There are several possible reasons that questions resolving by 2030\ndo not explain most of the disagreement, including:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>The time between now and 2100 is long, so information about the\nyears before 2030 simply cannot provide very much of the necessary\ninformation to drive participants to agree about the longer term\nquestion.<\/li>\n\n\n\n<li>Because the skeptics assign low probability to existential\ncatastrophe due to AI by 2100 (median 0.1%), their expected updates are\nnecessarily small: it would be logically inconsistent for them to\nforecast higher than a 10% chance of updating their probability of\nAI-caused existential catastrophe by 2100 above 1%.<\/li>\n\n\n\n<li>Perhaps this project did not identify the most valuable crux\nquestions resolving before 2030, and other questions would make a larger\ndifference.<\/li>\n\n\n\n<li>Participants\u2019 expectations about how dangerous AI is likely to be may have also influenced their interpretation of crux questions\u2019 resolutions. For example, if we asked a question like \u201cWill an AI resist being shut down?\u201d, participants might make different conditional updates depending on their expectations about AI. Conditional on this question resolving positively, a participant who thinks that AIs are likely to be dangerous might be more likely to think of alarming outcomes, like an AI that resists powerful governments trying to turn it off. A participant who thinks dangerous AI is very unlikely might expect that nearly all positive resolutions are more innocuous ones, in which the resolution criteria are only technically true.<sup data-fn=\"2efc6a1f-3fbf-4823-9ec2-9883fb0da199\" class=\"fn\"><a href=\"#2efc6a1f-3fbf-4823-9ec2-9883fb0da199\" id=\"2efc6a1f-3fbf-4823-9ec2-9883fb0da199-link\">53<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Below, we:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Describe how we assessed &#8220;cruxiness&#8221; of forecasting questions\nusing two metrics: &#8220;Value of information&#8221; (VOI) and &#8220;Value of\ndiscrimination&#8221; (VOD). (<a href=\"#how-did-we-assess-the-cruxiness-of-forecasting-questions\"><u>More<\/u><\/a>)<\/li>\n\n\n\n<li>Provide median forecasts on all of the questions we asked. (<a href=\"#results-tables-and-figures\"><u>More<\/u><\/a>)<\/li>\n\n\n\n<li>Discuss some of the strongest cruxes and surprisingly weakest\ncruxes according to value of information. (<a href=\"#low-voi-questions\"><u>More<\/u><\/a>)<\/li>\n\n\n\n<li>Discuss \u201cred flags\u201d and \u201cgreen flags\u201d for each group: questions\nthat would lead to major changes in the probability of existential\ncatastrophe of 2100, ignoring their likelihood of occurring. (<a href=\"#red-flags-and-green-flags\"><u>More<\/u><\/a>)<\/li>\n\n\n\n<li>Discuss some of the cruxes that would lead to convergence and\ndivergence between skeptics and concerned participants according to\nvalue of discrimination. (<a href=\"#convergent-cruxes-which-information-would-lead-to-less-disagreement-in-expectation\"><u>More<\/u><\/a>)<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"how-did-we-assess-the-cruxiness-of-forecasting-questions\">How\ndid we assess the &#8220;cruxiness&#8221; of forecasting questions?<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">We use two metrics to assess forecasting questions:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li><strong>Value of information (VOI)<\/strong> measures how much\nknowing the answer to a question would change an individual&#8217;s belief, in\nexpectation. This is useful for understanding why individuals believe\nwhat they believe and what would change their minds.\n<ol class=\"wp-block-list\">\n<li>Conceptually, VOI measures how important a potential crux\nquestion (\u201cC\u201d) is to a participant\u2019s forecast of the ultimate question\nwe care about (\u201cU\u201d, in this case: AI existential risk by 2100), in\nexpectation. That is, how much would a participant update on AI\nexistential risk by 2100 based on whether a crux happens, weighted by\nhow likely that crux is to happen.<\/li>\n\n\n\n<li>For example, a relatively \u201chigh VOI\u201d question for Alice would have (i) a meaningful probability of happening, and (ii) a substantial effect on Alice\u2019s assessment of existential risk. In particular, if Alice thought that there was a 20% chance of existential catastrophe due to AI by 2100, a 35% chance that <a href=\"#arc-evals-the-strongest-convergent-crux\"><u>AI will exhibit behavior to self-replicate and avoid shutdown by 2030<\/u><\/a>, and a 28% chance of existential catastrophe by 2100 <em>conditional on<\/em> such AI capabilities by 2030 (corresponding to a 15.7% chance of existential catastrophe by 2100 if such AI capabilities <em>are not<\/em> developed by 2030), then this would be a relatively high VOI question for Alice\u2014it would have a similar magnitude of VOI as highly-ranked crux questions for the concerned group.<sup data-fn=\"226e8c45-a2ac-481a-8db7-f82081172f5f\" class=\"fn\"><a href=\"#226e8c45-a2ac-481a-8db7-f82081172f5f\" id=\"226e8c45-a2ac-481a-8db7-f82081172f5f-link\">54<\/a><\/sup><\/li>\n\n\n\n<li>The formula we use to calculate VOI is provided and elaborated on in <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=94\" target=\"_blank\" rel=\"noreferrer noopener\"><u>Appendix 2<\/u><\/a>. For this project we use log VOI because many forecasters are updating their views at the low end of the probability range, and we think a change from 0.1% to 0.2% is often more significant than, say, a change from 15% to 18%.<\/li>\n\n\n\n<li>To build intuition for using the VOI metric, we provide <a href=\"https:\/\/forecastingresearch.org\/ai-risk-voi-vod\"><u>this calculator<\/u><\/a> (<a href=\"https:\/\/forecastingresearch.org\/s\/AI-risk-VoI-VoD.xlsx\"><u>a<\/u><\/a>) in which users can input their own values.<\/li>\n<\/ol>\n<\/li>\n\n\n\n<li><strong>Value of discrimination (VOD)<\/strong> measures how much knowing the answer to a question would change relative beliefs <em>between<\/em> two individuals, in expectation. It is useful for measuring convergence and divergence in expected beliefs between individuals.\n<ol class=\"wp-block-list\">\n<li>Conceptually, VOD is a measure of how much more (or less) people\nwould disagree about U if they knew the answer to C, in expectation.\nThat is, it looks at how much they would disagree about U if C resolved\npositively and how much they would disagree if it resolves negatively,\nand weights those by how likely they think C is to resolve\npositively.<\/li>\n\n\n\n<li>The formula for calculating VOD is provided and elaborated on in <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=94\" target=\"_blank\" rel=\"noreferrer noopener\"><u>Appendix 2<\/u><\/a>. We use a log scale for calculating VOD.<sup data-fn=\"916bb071-edda-467a-a30e-161a1bf3e957\" class=\"fn\"><a href=\"#916bb071-edda-467a-a30e-161a1bf3e957\" id=\"916bb071-edda-467a-a30e-161a1bf3e957-link\">55<\/a><\/sup> VOD is positive (a \u201cconvergent crux\u201d) if the two people or groups would disagree less in expectation after the crux resolves, and negative (a \u201cdivergent crux\u201d) if they would disagree more.<\/li>\n\n\n\n<li>For example, imagine that Alice now thinks there is a 1% chance of extinction due to AI by 2100 and Bob thinks it\u2019s 40%, but they both agree that extinction is very likely if AI causes <a href=\"#forecasts-about-transformative-economic-growth\"><u>\u201ctransformative\u201d economic growth<\/u><\/a> by 2030 and very unlikely if it doesn\u2019t.<sup data-fn=\"4dbe0e7b-d641-4860-96ed-ee8db9cdfe3e\" class=\"fn\"><a href=\"#4dbe0e7b-d641-4860-96ed-ee8db9cdfe3e\" id=\"4dbe0e7b-d641-4860-96ed-ee8db9cdfe3e-link\">56<\/a><\/sup> In this situation, whether there will be transformative economic growth by 2030 would be a good convergent crux (\u201chigh VOD\u201d) because when it resolves they will agree more.<\/li>\n\n\n\n<li>To build intuition for using the VOD metric, we provide <a href=\"https:\/\/forecastingresearch.org\/ai-risk-voi-vod\"><u>this<\/u> <u>calculator<\/u><\/a> (<a href=\"https:\/\/forecastingresearch.org\/s\/AI-risk-VoI-VoD.xlsx\"><u>a<\/u><\/a>) in which users can input their own values.<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n\n\n\n<p class=\"wp-block-paragraph\">For each of our candidate cruxes, we first find the absolute VOI and VOD of each question. Then, we put the magnitudes of updates in context by comparing the VOI and VOD of our actual questions to the <em>maximum possible<\/em> VOI and VOD that could be achieved by a forecasting question.<sup data-fn=\"eeb71397-2903-46c1-ae83-b0c71304c6dd\" class=\"fn\"><a href=\"#eeb71397-2903-46c1-ae83-b0c71304c6dd\" id=\"eeb71397-2903-46c1-ae83-b0c71304c6dd-link\">57<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">When eliciting forecasts on cruxes, the prompt given to participants read: \u201cConditional on this question resolving positively (by 2030), what is your probability that AI causes an existential catastrophe by 2100?\u201d We acknowledge that there are two ways to interpret this forecasting exercise: either as asking for your all-else-equal forecast (i.e. how would this crux resolving positively <em>causally influence<\/em> the probability of existential catastrophe, if you could isolate the effect of the crux) or your all-things-considered forecast (i.e. taking into account what this crux resolving positively may tell you about the world in 2030). Based on their rationales and discussions, we believe most participants were doing the latter.<sup data-fn=\"6d7512f4-58dd-4ffe-a99a-8bf32e0d2084\" class=\"fn\"><a href=\"#6d7512f4-58dd-4ffe-a99a-8bf32e0d2084\" id=\"6d7512f4-58dd-4ffe-a99a-8bf32e0d2084-link\">58<\/a><\/sup> We therefore cannot make many claims about whether participants think the specific event described in the crux would be good or bad for AI risk all-else-equal.<sup data-fn=\"ac1e4c99-d409-4e5f-a17c-db59db80cc20\" class=\"fn\"><a href=\"#ac1e4c99-d409-4e5f-a17c-db59db80cc20\" id=\"ac1e4c99-d409-4e5f-a17c-db59db80cc20-link\">59<\/a><\/sup><\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"voi-which-near-term-questions-have-higher-and-lower-value-of-information\">VOI:\nWhich near-term questions have higher and lower value of\ninformation?<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Some of the results from our analysis of near-term VOI are:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Some commonly discussed questions would be surprisingly uninformative to the median person in each group.<\/strong> These include: whether AI will increase near-term economic growth (operationalized as the US growth rate averaging &gt;4% from 2023-2030); whether AI will write academic articles and code popular apps on its own; whether AI risk will become more politicized; and whether the government will require testing of AI models before deployment.<sup data-fn=\"e1e1d9c7-d0fd-4897-ab0d-c622ad621555\" class=\"fn\"><a href=\"#e1e1d9c7-d0fd-4897-ab0d-c622ad621555\" id=\"e1e1d9c7-d0fd-4897-ab0d-c622ad621555-link\">60<\/a><\/sup><\/li>\n\n\n\n<li>Relatively informative questions for each group (in terms of VOI,\nand all resolving by 2030) include:\n<ul class=\"wp-block-list\">\n<li><strong>For skeptics<\/strong>: whether superforecasters as a\ngroup will update their views on AI risk; whether weapons or\ntechnologies that are capable of causing human extinction are expected\nto be developed; and whether AI will have heavily influenced the results\nof a democratic election.<\/li>\n\n\n\n<li><strong>For concerned<\/strong>: whether highly-respected\nalignment researchers will update their views on AI risk; whether war\nwill be declared between major powers; and whether METR (formerly known\nas ARC Evals) will find that AI is capable of autonomously replicating\nand avoiding shutdown.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li>We also briefly consider &#8220;red flags&#8221; and &#8220;green flags&#8221; for each group, defined as those events that would make participants most or least worried if they resolved positively, regardless of their probability of occuring. For example, the skeptic group would become more concerned if AI caused &#8220;escalating warning shots&#8221;\u2014two events with large, increasing numbers of human deaths\u2014but considered this unlikely. Additional examples <a href=\"#red-flags-and-green-flags\"><u>below<\/u><\/a>.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">It is difficult to contextualize <em>how<\/em> informative these\nquestions are because this is the first project applying these metrics\nto forecasting questions that we are aware of, so we do not have other\nexamples to compare against. However, we provide some intuition by (1)\ncalculating the &#8220;percent of maximum possible VOI&#8221; (POM), which compares\nthe value of learning the answer to a given question relative to the\nideal scenario of simply knowing for certain whether or not AI caused an\nexistential catastrophe, and (2) providing participants\u2019 raw forecasts\non various events.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The median POM VOI among every individual\u2019s single most valuable question is 5.29% for the concerned group and 9.53% for the skeptic group. This means that for at least 50% of participants in each group there was a question included in our set that was at least 5-10% as informative as being able to consult a crystal ball which they believe unfailingly foretells the actual outcome.<sup data-fn=\"4a2b73e3-3d9a-4cbd-b4f5-229a3dfcbf36\" class=\"fn\"><a href=\"#4a2b73e3-3d9a-4cbd-b4f5-229a3dfcbf36\" id=\"4a2b73e3-3d9a-4cbd-b4f5-229a3dfcbf36-link\">61<\/a><\/sup> In more concrete terms, this is equivalent to a forecasting question with the following characteristics:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>A concerned participant with original P(AI existential\ncatastrophe (XC) by 2100) = 25% identifies a crux that has: P(crux) =\n20%, P(AI XC|crux) = 6.2%, and P(AI XC|\u00accrux) = 29.7%<\/li>\n\n\n\n<li>A skeptic participant with original P(AI XC by 2100) = 1%\nidentifies a crux that has: P(crux) = 20%, P(AI XC|crux) = 3.37%, and\nP(AI XC|\u00accrux) = 0.41%<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">For details on forecasts for each question, see tables <a href=\"#results-tables-and-figures\"><u>below<\/u><\/a>. We begin by sharing\nthe results for all questions, and then elaborate on the findings\npreviously mentioned.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"results-tables-and-figures\">Results tables and figures<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">The following tables and figures, in order, present:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>The median probability that each group assigns to the likelihood\nof each question resolving positively. From this table, you can see what\nthe groups believe about the likelihood of various AI-related events and\ncan see that they disagree about the likelihood of many events.<\/li>\n\n\n\n<li>The update for each group on the probability of AI existential\ncatastrophe conditional on each question resolving positively or\nnegatively.<\/li>\n\n\n\n<li>The median VOI and POM VOI for each question and each group,\nordered by concerned rankings and then skeptic rankings.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">For the sake of space and simplicity, we will refer to questions by\nabbreviated \u201ctags.\u201d For full explanations and operationalizations of\neach question, see <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" rel=\"noreferrer noopener\" target=\"_blank\"><u>this table in Appendix\n1<\/u><\/a>. Throughout these tables, we use C to refer to a candidate\ncrux question, P(C) to refer to the probability of the candidate crux,\nand U to refer to the ultimate question (Will AI cause an existential\ncatastrophe by 2100?).<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">For additional figures and uncertainty analysis, see <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=99\" rel=\"noreferrer noopener\" target=\"_blank\"><u>Appendix 3<\/u><\/a>. For the code and data\nsupporting this analysis, see the replication package available <a href=\"https:\/\/github.com\/forecastingresearch\/adversarial-collab\"><u>here<\/u><\/a>.<\/p>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><strong>C<\/strong><\/td><td><strong>Concerned Median\nP(C)<\/strong><\/td><td><strong>Skeptical Median\nP(C)<\/strong><\/td><\/tr><tr><td>6 month pause<\/td><td>5.00%<\/td><td>3.00%<\/td><\/tr><tr><td>AI articles and apps<\/td><td>20.00%<\/td><td>5.00%<\/td><\/tr><tr><td>AI coding<\/td><td>65.00%<\/td><td>70.00%<\/td><\/tr><tr><td>AI Forecasting skill<\/td><td>33.00%<\/td><td>19.00%<\/td><\/tr><tr><td>AI Robotics<\/td><td>20.00%<\/td><td>5.00%<\/td><\/tr><tr><td>AI solving novel math problems<\/td><td>10.00%<\/td><td>20.00%<\/td><\/tr><tr><td>AI writes AI<\/td><td>10.00%<\/td><td>2.00%<\/td><\/tr><tr><td>Alignment researchers changing minds<\/td><td>20.00%<\/td><td>3.00%<\/td><\/tr><tr><td>Alignment solution<\/td><td>5.00%<\/td><td>5.00%<\/td><\/tr><tr><td>Cyberattacks<\/td><td>20.00%<\/td><td>10.00%<\/td><\/tr><tr><td>Democratic influence<\/td><td>2.00%<\/td><td>0.30%<\/td><\/tr><tr><td>Escalating warning shots<\/td><td>9.00%<\/td><td>0.20%<\/td><\/tr><tr><td>Evidence of misalignment<\/td><td>40.00%<\/td><td>1.00%<\/td><\/tr><tr><td>Fast AI efficiency gains<\/td><td>15.00%<\/td><td>2.00%<\/td><\/tr><tr><td>IC demonstration<\/td><td>65.00%<\/td><td>14.00%<\/td><\/tr><tr><td>Intergovernmental AI safety<\/td><td>25.00%<\/td><td>15.00%<\/td><\/tr><tr><td>IT progress<\/td><td>20.00%<\/td><td>1.00%<\/td><\/tr><tr><td>Major powers war<\/td><td>11.50%<\/td><td>2.00%<\/td><\/tr><tr><td>Muehlhauser policies<\/td><td>65.00%<\/td><td>15.00%<\/td><\/tr><tr><td>No violence LLM<\/td><td>10.00%<\/td><td>5.00%<\/td><\/tr><tr><td>Non-democracy AI<\/td><td>10.00%<\/td><td>20.00%<\/td><\/tr><tr><td>Other fields IC<sup data-fn=\"c792017e-832a-461d-ac58-d67bc198e107\" class=\"fn\"><a href=\"#c792017e-832a-461d-ac58-d67bc198e107\" id=\"c792017e-832a-461d-ac58-d67bc198e107-link\">62<\/a><\/sup><\/td><td>50.00%<\/td><td>30.00%<\/td><\/tr><tr><td>Platform: AI regulation<\/td><td>36.00%<\/td><td>50.01%<\/td><\/tr><tr><td>Platform: ARC Evals<\/td><td>25.00%<\/td><td>1.00%<\/td><\/tr><tr><td>Platform: Escalating warning shots<\/td><td>5.00%<\/td><td>0.25%<\/td><\/tr><tr><td>Platform: Transformative growth<sup data-fn=\"22cde1ac-9654-4a0d-8664-c7c0fca707a9\" class=\"fn\"><a href=\"#22cde1ac-9654-4a0d-8664-c7c0fca707a9\" id=\"22cde1ac-9654-4a0d-8664-c7c0fca707a9-link\">63<\/a><\/sup><\/td><td>43.00%<\/td><td>2.00%<\/td><\/tr><tr><td>Politicization<\/td><td>20.00%<\/td><td>20.00%<\/td><\/tr><tr><td>Power-seeking<\/td><td>15.00%<\/td><td>5.00%<\/td><\/tr><tr><td>Power-seeking shutdown<\/td><td>30.00%<\/td><td>5.00%<\/td><\/tr><tr><td>Progress in lethal technologies<\/td><td>40.00%<\/td><td>20.00%<\/td><\/tr><tr><td>Public concern<\/td><td>5.00%<\/td><td>5.00%<\/td><\/tr><tr><td>Reduction in AI investment<\/td><td>12.00%<\/td><td>5.00%<\/td><\/tr><tr><td>Req testing<\/td><td>80.00%<\/td><td>10.00%<\/td><\/tr><tr><td>Short-term GDP change<\/td><td>25.00%<\/td><td>10.00%<\/td><\/tr><tr><td>Supers changing minds<\/td><td>30.00%<\/td><td>5.00%<\/td><\/tr><tr><td>Taiwan-China<\/td><td>30.00%<\/td><td>25.00%<\/td><\/tr><tr><td>Warning shot<\/td><td>17.00%<\/td><td>3.00%<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 3:<\/strong> For each crux question, the median probability from each group that the question resolves \u201cyes.\u201d For details on how each question was operationalized, see <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 1<\/a>.<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2024-03-11_ai-adversarial-collaboration_fig-02.png\" alt=\"\" \/><figcaption class=\"wp-element-caption\"><strong>Figure 2:<\/strong> Individual participants\u2019 estimations of how likely each crux question is to resolve \u201cyes.\u201d Blue dots are individuals in the concerned group; orange dots are in the skeptical group. Gray boxes highlight the difference between the median concerned participant\u2019s P(Question Resolves \u201cYes\u201d) and the median skeptical participant\u2019s. Questions are ordered from least to greatest difference between groups.<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><\/td><td colspan=\"2\"><strong>If C happens<\/strong><\/td><td colspan=\"2\"><strong>If C doesn&#8217;t happen<\/strong><\/td><\/tr><tr><td><strong>C<\/strong><\/td><td><strong>Concerned median P(U)<\/strong><\/td><td><strong>Skeptical median P(U)<\/strong><\/td><td><strong>Concerned median P(U)<\/strong><\/td><td><strong>Skeptical median P(U)<\/strong><\/td><\/tr><tr><td>6 month pause<\/td><td>9.00%<\/td><td>0.09%<\/td><td>21.75%<\/td><td>0.10%<\/td><\/tr><tr><td>AI articles and apps<\/td><td>21.00%<\/td><td>0.20%<\/td><td>21.00%<\/td><td>0.10%<\/td><\/tr><tr><td>AI coding<\/td><td>25.00%<\/td><td>0.12%<\/td><td>16.00%<\/td><td>0.12%<\/td><\/tr><tr><td>AI Forecasting skill<\/td><td>26.00%<\/td><td>0.20%<\/td><td>21.00%<\/td><td>0.10%<\/td><\/tr><tr><td>AI Robotics<\/td><td>25.00%<\/td><td>0.20%<\/td><td>20.00%<\/td><td>0.12%<\/td><\/tr><tr><td>AI solving novel math problems<\/td><td>20.00%<\/td><td>0.12%<\/td><td>23.75%<\/td><td>0.10%<\/td><\/tr><tr><td>AI writes AI<\/td><td>30.00%<\/td><td>0.21%<\/td><td>20.71%<\/td><td>0.10%<\/td><\/tr><tr><td>Alignment researchers changing minds<\/td><td>6.00%<\/td><td>0.10%<\/td><td>32.39%<\/td><td>0.10%<\/td><\/tr><tr><td>Alignment solution<\/td><td>2.00%<\/td><td>0.10%<\/td><td>23.82%<\/td><td>0.10%<\/td><\/tr><tr><td>Cyberattacks<\/td><td>21.00%<\/td><td>0.12%<\/td><td>21.00%<\/td><td>0.10%<\/td><\/tr><tr><td>Democratic influence<\/td><td>20.00%<\/td><td>1.00%<\/td><td>20.90%<\/td><td>0.10%<\/td><\/tr><tr><td>Escalating warning shots<\/td><td>37.00%<\/td><td>0.32%<\/td><td>23.33%<\/td><td>0.10%<\/td><\/tr><tr><td>Evidence of misalignment<\/td><td>20.00%<\/td><td>0.25%<\/td><td>12.00%<\/td><td>0.15%<\/td><\/tr><tr><td>Fast AI efficiency gains<\/td><td>32.00%<\/td><td>0.30%<\/td><td>23.06%<\/td><td>0.10%<\/td><\/tr><tr><td>IC demonstration<\/td><td>21.00%<\/td><td>0.15%<\/td><td>21.00%<\/td><td>0.10%<\/td><\/tr><tr><td>Intergovernmental AI safety<\/td><td>17.00%<\/td><td>0.10%<\/td><td>22.22%<\/td><td>0.12%<\/td><\/tr><tr><td>IT progress<\/td><td>24.00%<\/td><td>0.12%<\/td><td>17.14%<\/td><td>0.10%<\/td><\/tr><tr><td>Major powers war<\/td><td>40.00%<\/td><td>0.20%<\/td><td>18.89%<\/td><td>0.12%<\/td><\/tr><tr><td>Muehlhauser policies<\/td><td>20.00%<\/td><td>0.10%<\/td><td>22.86%<\/td><td>0.10%<\/td><\/tr><tr><td>No violence LLM<\/td><td>8.00%<\/td><td>0.10%<\/td><td>23.75%<\/td><td>0.10%<\/td><\/tr><tr><td>Non-democracy AI<\/td><td>23.80%<\/td><td>0.18%<\/td><td>19.44%<\/td><td>0.21%<\/td><\/tr><tr><td>Other fields IC<\/td><td>21.00%<\/td><td>0.13%<\/td><td>21.00%<\/td><td>0.11%<\/td><\/tr><tr><td>Platform: AI regulation<\/td><td>18.00%<\/td><td>0.10%<\/td><td>27.77%<\/td><td>0.14%<\/td><\/tr><tr><td>Platform: ARC Evals<\/td><td>25.00%<\/td><td>1.00%<\/td><td>22.78%<\/td><td>0.10%<\/td><\/tr><tr><td>Platform: Escalating warning shots<\/td><td>17.00%<\/td><td>1.30%<\/td><td>23.38%<\/td><td>0.10%<\/td><\/tr><tr><td>Platform: Transformative growth<\/td><td>26.00%<\/td><td>0.50%<\/td><td>19.75%<\/td><td>0.10%<\/td><\/tr><tr><td>Politicization<\/td><td>30.00%<\/td><td>0.12%<\/td><td>16.88%<\/td><td>0.12%<\/td><\/tr><tr><td>Power-seeking<\/td><td>18.00%<\/td><td>0.22%<\/td><td>21.33%<\/td><td>0.10%<\/td><\/tr><tr><td>Power-seeking shutdown<\/td><td>30.00%<\/td><td>0.20%<\/td><td>17.78%<\/td><td>0.12%<\/td><\/tr><tr><td>Progress in lethal technologies<\/td><td>25.00%<\/td><td>0.75%<\/td><td>25.00%<\/td><td>0.19%<\/td><\/tr><tr><td>Public concern<\/td><td>25.00%<\/td><td>0.13%<\/td><td>20.53%<\/td><td>0.12%<\/td><\/tr><tr><td>Reduction in AI investment<\/td><td>10.00%<\/td><td>0.05%<\/td><td>25.79%<\/td><td>0.11%<\/td><\/tr><tr><td>Req testing<\/td><td>21.00%<\/td><td>0.10%<\/td><td>21.00%<\/td><td>0.10%<\/td><\/tr><tr><td>Short-term GDP change<\/td><td>25.00%<\/td><td>0.10%<\/td><td>25.00%<\/td><td>0.10%<\/td><\/tr><tr><td>Supers changing minds<\/td><td>28.00%<\/td><td>1.00%<\/td><td>16.67%<\/td><td>0.02%<\/td><\/tr><tr><td>Taiwan-China<\/td><td>22.00%<\/td><td>0.20%<\/td><td>10.00%<\/td><td>0.10%<\/td><\/tr><tr><td>Warning shot<\/td><td>32.00%<\/td><td>0.25%<\/td><td>22.00%<\/td><td>0.10%<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 4:<\/strong> Each group\u2019s median update on P(AI existential catastrophe by 2100) for each outcome (\u201cyes, C happened,\u201d and \u201cno, C didn\u2019t happen\u201d). All questions resolve in 2030 except for Transformative economic growth (2070).<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><\/td><td colspan=\"2\"><strong>Concerned<\/strong><\/td><td colspan=\"2\"><strong>Skeptics<\/strong><\/td><\/tr><tr><td><strong>Question<\/strong><\/td><td><strong>Median VOI<\/strong><\/td><td><strong>Median POM VOI<\/strong><\/td><td><strong>Median VOI<\/strong><\/td><td><strong>Median POM VOI<\/strong><\/td><\/tr><tr><td>Platform: Transformative growth<\/td><td>1.4E-2<\/td><td>8.93%<\/td><td>4.5E-7<\/td><td>0.02%<\/td><\/tr><tr><td>Alignment researchers changing minds<\/td><td>6.4E-3<\/td><td>2.43%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Major powers war<\/td><td>4.6E-3<\/td><td>2.04%<\/td><td>4.1E-7<\/td><td>0.00%<\/td><\/tr><tr><td>Platform: ARC Evals<\/td><td>3.2E-3<\/td><td>1.35%<\/td><td>7.6E-7<\/td><td>0.90%<\/td><\/tr><tr><td>Evidence of misalignment<\/td><td>2.9E-3<\/td><td>1.74%<\/td><td>5.5E-9<\/td><td>0.05%<\/td><\/tr><tr><td>Alignment solution<\/td><td>2.0E-3<\/td><td>1.51%<\/td><td>3.9E-7<\/td><td>0.01%<\/td><\/tr><tr><td>Warning shot<\/td><td>1.0E-3<\/td><td>0.41%<\/td><td>3.3E-7<\/td><td>0.01%<\/td><\/tr><tr><td>Reduction in AI investment<\/td><td>9.9E-4<\/td><td>0.67%<\/td><td>1.3E-10<\/td><td>0.01%<\/td><\/tr><tr><td>Muehlhauser policies<\/td><td>9.8E-4<\/td><td>0.40%<\/td><td>5.7E-11<\/td><td>0.01%<\/td><\/tr><tr><td>AI coding<\/td><td>9.8E-4<\/td><td>0.48%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>AI Robotics<\/td><td>8.9E-4<\/td><td>0.46%<\/td><td>4.0E-19<\/td><td>0.00%<\/td><\/tr><tr><td>AI writes AI<\/td><td>8.6E-4<\/td><td>0.40%<\/td><td>9.1E-7<\/td><td>0.03%<\/td><\/tr><tr><td>No violence LLM<\/td><td>8.3E-4<\/td><td>0.49%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Power-seeking shutdown<\/td><td>7.7E-4<\/td><td>0.38%<\/td><td>1.7E-6<\/td><td>0.04%<\/td><\/tr><tr><td>AI solving novel math problems<\/td><td>7.0E-4<\/td><td>0.29%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Platform: AI regulation<\/td><td>6.6E-4<\/td><td>0.44%<\/td><td>1.1E-6<\/td><td>0.02%<\/td><\/tr><tr><td>Platform: Escalating warning shots<\/td><td>4.9E-4<\/td><td>0.22%<\/td><td>4.8E-7<\/td><td>0.01%<\/td><\/tr><tr><td>Escalating warning shots<\/td><td>4.8E-4<\/td><td>0.18%<\/td><td>1.9E-7<\/td><td>0.00%<\/td><\/tr><tr><td>AI Forecasting skill<\/td><td>4.8E-4<\/td><td>0.20%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Intergovernmental AI safety<\/td><td>3.5E-4<\/td><td>0.71%<\/td><td>1.3E-6<\/td><td>0.03%<\/td><\/tr><tr><td>Supers changing minds<\/td><td>3.1E-4<\/td><td>0.43%<\/td><td>1.6E-4<\/td><td>1.15%<\/td><\/tr><tr><td>6 month pause<\/td><td>3.0E-4<\/td><td>0.27%<\/td><td>4.3E-20<\/td><td>0.00%<\/td><\/tr><tr><td>Non-democracy AI<\/td><td>1.9E-4<\/td><td>0.07%<\/td><td>8.7E-19<\/td><td>0.00%<\/td><\/tr><tr><td>IT progress<\/td><td>1.8E-4<\/td><td>0.14%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Public concern<\/td><td>1.8E-4<\/td><td>0.13%<\/td><td>1.1E-7<\/td><td>0.03%<\/td><\/tr><tr><td>Power-seeking<\/td><td>1.4E-4<\/td><td>0.08%<\/td><td>4.7E-7<\/td><td>0.12%<\/td><\/tr><tr><td>Taiwan-China<\/td><td>1.2E-4<\/td><td>0.04%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Democratic influence<\/td><td>1.1E-4<\/td><td>0.09%<\/td><td>3.4E-6<\/td><td>0.03%<\/td><\/tr><tr><td>Fast AI efficiency gains<\/td><td>1.0E-4<\/td><td>0.06%<\/td><td>7.0E-16<\/td><td>0.00%<\/td><\/tr><tr><td>Cyberattacks<\/td><td>3.4E-6<\/td><td>0.00%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>AI articles and apps<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>2.2E-19<\/td><td>0.00%<\/td><\/tr><tr><td>IC demonstration<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Other fields IC<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Politicization<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Progress in lethal technologies<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>7.2E-6<\/td><td>0.45%<\/td><\/tr><tr><td>Req testing<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Short-term GDP change<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 5:<\/strong> Median value of Information (VOI) and POM VOI for each group on each question.<sup data-fn=\"b13efb95-b21c-4227-aa84-4c5807641285\" class=\"fn\"><a href=\"#b13efb95-b21c-4227-aa84-4c5807641285\" id=\"b13efb95-b21c-4227-aa84-4c5807641285-link\">64<\/a><\/sup> Ordered by concerned group&#8217;s median VOI.<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><\/td><td colspan=\"2\"><strong>Skeptics<\/strong><\/td><td colspan=\"2\"><strong>Concerned<\/strong><\/td><\/tr><tr><td><strong>Question<\/strong><\/td><td><strong>Median VOI<\/strong><\/td><td><strong>Median POM VOI<\/strong><\/td><td><strong>Median VOI<\/strong><\/td><td><strong>Median POM VOI<\/strong><\/td><\/tr><tr><td>Supers changing minds<\/td><td>1.6E-4<\/td><td>1.15%<\/td><td>3.1E-4<\/td><td>0.43%<\/td><\/tr><tr><td>Progress in lethal technologies<\/td><td>7.2E-6<\/td><td>0.45%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Democratic influence<\/td><td>3.4E-6<\/td><td>0.03%<\/td><td>1.1E-4<\/td><td>0.09%<\/td><\/tr><tr><td>Power-seeking shutdown<\/td><td>1.7E-6<\/td><td>0.04%<\/td><td>7.7E-4<\/td><td>0.38%<\/td><\/tr><tr><td>Intergovernmental AI safety<\/td><td>1.3E-6<\/td><td>0.03%<\/td><td>3.5E-4<\/td><td>0.71%<\/td><\/tr><tr><td>Platform: AI regulation<\/td><td>1.1E-6<\/td><td>0.02%<\/td><td>6.6E-4<\/td><td>0.44%<\/td><\/tr><tr><td>AI writes AI<\/td><td>9.1E-7<\/td><td>0.03%<\/td><td>8.6E-4<\/td><td>0.40%<\/td><\/tr><tr><td>Platform: ARC Evals<\/td><td>7.6E-7<\/td><td>0.90%<\/td><td>3.2E-3<\/td><td>1.35%<\/td><\/tr><tr><td>Platform: Escalating warning shots<\/td><td>4.8E-7<\/td><td>0.01%<\/td><td>4.9E-4<\/td><td>0.22%<\/td><\/tr><tr><td>Power-seeking<\/td><td>4.7E-7<\/td><td>0.12%<\/td><td>1.4E-4<\/td><td>0.08%<\/td><\/tr><tr><td>Platform: Transformative growth<\/td><td>4.5E-7<\/td><td>0.02%<\/td><td>1.4E-2<\/td><td>8.93%<\/td><\/tr><tr><td>Major powers war<\/td><td>4.1E-7<\/td><td>0.00%<\/td><td>4.6E-3<\/td><td>2.04%<\/td><\/tr><tr><td>Alignment solution<\/td><td>3.9E-7<\/td><td>0.01%<\/td><td>2.0E-3<\/td><td>1.51%<\/td><\/tr><tr><td>Warning shot<\/td><td>3.3E-7<\/td><td>0.01%<\/td><td>1.0E-3<\/td><td>0.41%<\/td><\/tr><tr><td>Escalating warning shots<\/td><td>1.9E-7<\/td><td>0.00%<\/td><td>4.8E-4<\/td><td>0.18%<\/td><\/tr><tr><td>Public concern<\/td><td>1.1E-7<\/td><td>0.03%<\/td><td>1.8E-4<\/td><td>0.13%<\/td><\/tr><tr><td>Evidence of misalignment<\/td><td>5.5E-9<\/td><td>0.05%<\/td><td>2.9E-3<\/td><td>1.74%<\/td><\/tr><tr><td>Reduction in AI investment<\/td><td>1.3E-10<\/td><td>0.01%<\/td><td>9.9E-4<\/td><td>0.67%<\/td><\/tr><tr><td>Muehlhauser policies<\/td><td>5.7E-11<\/td><td>0.01%<\/td><td>9.8E-4<\/td><td>0.40%<\/td><\/tr><tr><td>Fast AI efficiency gains<\/td><td>7.0E-16<\/td><td>0.00%<\/td><td>1.0E-4<\/td><td>0.06%<\/td><\/tr><tr><td>Non-democracy AI<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>1.9E-4<\/td><td>0.07%<\/td><\/tr><tr><td>AI Robotics<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>8.9E-4<\/td><td>0.46%<\/td><\/tr><tr><td>AI articles and apps<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>6 month pause<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>3.0E-4<\/td><td>0.27%<\/td><\/tr><tr><td>AI coding<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>9.8E-4<\/td><td>0.48%<\/td><\/tr><tr><td>AI Forecasting skill<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>4.8E-4<\/td><td>0.20%<\/td><\/tr><tr><td>AI solving novel math problems<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>7.0E-4<\/td><td>0.29%<\/td><\/tr><tr><td>Alignment researchers changing minds<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>6.4E-3<\/td><td>2.43%<\/td><\/tr><tr><td>Cyberattacks<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>3.4E-6<\/td><td>0.00%<\/td><\/tr><tr><td>IC demonstration<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>IT progress<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>1.8E-4<\/td><td>0.14%<\/td><\/tr><tr><td>No violence LLM<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>8.3E-4<\/td><td>0.49%<\/td><\/tr><tr><td>Other fields IC<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Politicization<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Req testing<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Short-term GDP change<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Taiwan-China<\/td><td>0.0E+0<\/td><td>0.00%<\/td><td>1.2E-4<\/td><td>0.04%<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 6:<\/strong> Median value of Information (VOI) and POM VOI for each group on each question. (Same as previous table but ordered by skeptic group&#8217;s median VOI.)<\/figcaption><\/figure>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"low-voi-questions\">Low VOI questions<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">In the above tables, we&#8217;ve shaded in dark gray questions that had no value of information for the median person in each group. Six questions had no value of information for the median person in both groups, including some questions that are commonly discussed and that we expected to be more relevant, such as whether AI will increase near-term economic growth.<sup data-fn=\"7f3938a6-c024-48eb-9064-38e09be859aa\" class=\"fn\"><a href=\"#7f3938a6-c024-48eb-9064-38e09be859aa\" id=\"7f3938a6-c024-48eb-9064-38e09be859aa-link\">65<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The operationalizations of these six questions were:<\/p>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td>Question<\/td><td>Operationalization<\/td><\/tr><tr><td>AI articles and apps<\/td><td>By 2030:<br>1) At least 5 academic review articles generated solely by AI systems have been published in journals with top 50% impact factors. <p>AND<\/p>2) At least 5 apps written solely by AI systems have reached the &#8220;Top 10&#8221; list in the Apple app store.<\/td><\/tr><tr><td>Short-term GDP change<\/td><td>Will U.S. GDP increase between 2023 and 2030 by an average annual\nreal growth rate of 4% (or more)?<\/td><\/tr><tr><td>Instrumental Convergence demonstration<\/td><td>By 2030: There are at least 4 peer-reviewed examples of AI models in\ndifferent domains independently demonstrating convergent instrumental\nsubgoals.<\/td><\/tr><tr><td>Other fields Instrumental Convergence<\/td><td>Assume we assemble a group of 100 experts evenly divided among evolutionary biologists, complex systems researchers, neuroscientists, and anthropologists. In a style similar to these Economic Experts Panel surveys, we ask them (with results released on Jan 1. 2030) to indicate their degree of agreement with the statement (from here): &#8220;Most sufficiently intelligent agents [would] pursue potentially unbounded instrumental goals such as self-preservation and resource acquisition.&#8221;<br>What is the probability that at least 40% of the panel would (at least) &#8220;Agree&#8221; with such a statement?<\/td><\/tr><tr><td>Politicization<\/td><td>According to Pew Research, in 2020, 85% of US Democrats(\/lean Dem) saw &#8216;Protecting the environment&#8217; as a top priority for the President and Congress while 39% of Republicans(\/lean Rep) did. Assume that Pew Research will conduct a similar poll about AI on January 1, 2030.<br>By January 1, 2030, will there be at least a 2x difference in the fraction of Democrats (\/lean Dem) vs. Republicans (\/lean Rep) who see AI as a top priority for the President and Congress?<\/td><\/tr><tr><td>Req testing<\/td><td>By 2030, the US government implements regulation requiring testing\nof AI systems before they are deployed.<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 7:<\/strong> Full-text operationalizations of the six questions that both the median skeptic and the median concerned person (by VOI) indicated would not affect their P(AI x-risk) at all.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Because these were &#8220;flash&#8221; forecasts, on which each participant spent\nno more than ten minutes per question, we did not collect detailed\nrationales from participants to explain their forecasts on these\nquestions.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">However, we were able to see from participants\u2019 brief rationales that, for example, more than half of participants from both groups did not see \u201cShort-term GDP change\u201d as relevant to AI risk because 1) many participants did not view changes in economic growth as clearly related to AI risk (for more on conflicting risk updates based on AI-attributable economic growth, see <a href=\"#forecasts-about-transformative-economic-growth\"><u>this section<\/u><\/a>),<sup data-fn=\"b0061352-c324-41db-981f-8d05ae3b7bc1\" class=\"fn\"><a href=\"#b0061352-c324-41db-981f-8d05ae3b7bc1\" id=\"b0061352-c324-41db-981f-8d05ae3b7bc1-link\">66<\/a><\/sup> and 2) many participants did not think 4% growth in the US represented a very surprising change relative to previous trends.<sup data-fn=\"01e54e7b-c170-42b9-97e1-03b18e005566\" class=\"fn\"><a href=\"#01e54e7b-c170-42b9-97e1-03b18e005566\" id=\"01e54e7b-c170-42b9-97e1-03b18e005566-link\">67<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In some cases, the apparent low VOI may have been due to issues with the operationalization of the question rather than the underlying concept not being relevant. For example, the likelihood of AI exhibiting instrumental convergence was <a href=\"#understanding-each-others-arguments\"><u>identified by both groups<\/u><\/a> as being important to AI existential risk, and some related forecasting questions (e.g. <a href=\"#high-voi-questions\"><u>\u201cPower-seeking shutdown\u201d and \u201cARC Evals\u201d<\/u><\/a>) were relatively strong cruxes, but the above operationalizations were not seen as relevant.<sup data-fn=\"97b9697a-9343-463c-8cda-4fd63a86b0fe\" class=\"fn\"><a href=\"#97b9697a-9343-463c-8cda-4fd63a86b0fe\" id=\"97b9697a-9343-463c-8cda-4fd63a86b0fe-link\">68<\/a><\/sup><\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"high-voi-questions\">High VOI questions<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Although many questions that seemed relevant turned out to have VOI of zero, other questions did have positive VOI for one or both groups. VOI is constrained by the original P(U), so the maximum possible VOI (in absolute terms) is lower for the skeptic group due to their very low P(U).<sup data-fn=\"f021a51e-4e4c-4887-99bc-b9227880be83\" class=\"fn\"><a href=\"#f021a51e-4e4c-4887-99bc-b9227880be83\" id=\"f021a51e-4e4c-4887-99bc-b9227880be83-link\">69<\/a><\/sup> To account for this, we also present each VOI result with how much of the theoretical maximum VOI for that question it captures. Notably, the questions that had highest VOI were different for the two groups.<\/p>\n\n\n\n<h5 class=\"wp-block-heading\" id=\"highest-voi-questions-for-skeptics\">Highest VOI questions for\nskeptics<\/h5>\n\n\n\n<figure id=\"tab-08\" class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><strong>Question<\/strong><\/td><td><strong>Median VOI<\/strong><\/td><td><strong>Median POM VOI<\/strong><\/td><\/tr><tr><td>Supers changing minds<\/td><td>1.6E-4<\/td><td>1.15%<\/td><\/tr><tr><td>Progress in lethal technologies<\/td><td>7.2E-6<\/td><td>0.45%<\/td><\/tr><tr><td>Democratic influence<\/td><td>3.4E-6<\/td><td>0.03%<\/td><\/tr><tr><td>Power-seeking shutdown<\/td><td>1.7E-6<\/td><td>0.04%<\/td><\/tr><tr><td>Intergovernmental AI safety<\/td><td>1.3E-6<\/td><td>0.03%<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 8:<\/strong> Highest VOI questions (resolving by 2030) for skeptics: VOI and POM (percent of max) VOI. We exclude \u201cTransformative economic growth\u201d from this table because it resolves in 2070 while the rest of the cruxes resolve in 2030.<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td>Question<\/td><td>Question Text<\/td><\/tr><tr><td>Supers changing minds<\/td><td>Will the median superforecaster from the XPT forecast at least a 5% P(extinction due to AI by 2100) in 2030? (Context: In 2022, the median XPT superforecaster predicted a 0.4% chance of extinction due to AI by 2100.)<sup data-fn=\"fcdd96ff-d6bb-4af7-953f-83b49da52664\" class=\"fn\"><a href=\"#fcdd96ff-d6bb-4af7-953f-83b49da52664\" id=\"fcdd96ff-d6bb-4af7-953f-83b49da52664-link\">70<\/a><\/sup><\/td><\/tr><tr><td>Progress in lethal technologies<\/td><td>Will a panel of top superforecasters in\n2030 forecast &gt;90% chance that there will exist, by 2100, a\ntechnology with the capability of causing human extinction?<\/td><\/tr><tr><td>Democratic influence<\/td><td>By 2030, will AI be found to have\nautonomously and intentionally changed or heavily influenced the results\nof a democratic election?<\/td><\/tr><tr><td>Power-seeking shutdown<\/td><td>By 2030, will AI developers attempt (of\ntheir own volition, or under compulsion by a regulatory authority) to\ndisable or destroy an AI system of their own creation, which cost at\nleast $10M to develop, after the AI system displays at least one\ninstance of \u201cpower-seeking behavior\u201d?<\/td><\/tr><tr><td>Intergovernmental AI safety<\/td><td>By 2030, will there be an\nintergovernmental organization (IGO) with a budget of &gt;$20M that\nincludes the US and China, is focused on promoting AI safety standards,\nand that AI alignment organizations see as understanding x-risk\nconcerns?<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 9:<\/strong> Question text for the highest VOI questions for skeptics.<\/figcaption><\/figure>\n\n\n\n<h5 class=\"wp-block-heading\" id=\"highest-voi-questions-for-concerned-participants\">Highest VOI\nquestions for concerned participants<\/h5>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><strong>Question<\/strong><\/td><td><strong>Median VOI<\/strong><\/td><td><strong>Median POM VOI<\/strong><\/td><\/tr><tr><td>Alignment researchers changing minds<\/td><td>6.4E-3<\/td><td>2.43%<\/td><\/tr><tr><td>Major powers war<\/td><td>4.6E-3<\/td><td>2.04%<\/td><\/tr><tr><td>Platform: ARC Evals<\/td><td>3.2E-3<\/td><td>1.35%<\/td><\/tr><tr><td>Evidence of misalignment<\/td><td>2.9E-3<\/td><td>1.74%<\/td><\/tr><tr><td>Alignment solution<\/td><td>2.0E-3<\/td><td>1.51%<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 10:<\/strong> Highest VOI questions (resolving by 2030) for concerned participants: VOI and POM (percent of max) VOI. We exclude \u201cTransformative economic growth\u201d from this table because it resolves in 2070 while the rest of the cruxes resolve in 2030.<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td>Alignment researchers changing minds<\/td><td>Will at least 3 out of the 5 following AI\nalignment researchers &#8212; Paul Cristiano, Nate Soares, Neel Nanda,\nRichard Ngo, and Rohin Shah &#8212; say their P(existential catastrophe due\nto AI by 2100) has halved by 2030?<\/td><\/tr><tr><td>Major powers war<\/td><td>By January 1, 2030, will at least two\nmajor superpowers (out of US, China, EU, or Russia) declare war\nofficially to each other for the duration of at least one year?<\/td><\/tr><tr><td>Platform: ARC Evals and Evidence of misalignment<sup data-fn=\"5835d28a-a82f-45a5-97f9-9c558d5d148f\" class=\"fn\"><a href=\"#5835d28a-a82f-45a5-97f9-9c558d5d148f\" id=\"5835d28a-a82f-45a5-97f9-9c558d5d148f-link\">71<\/a><\/sup><\/td><td>By December 31, 2030, will the ARC Evals project, or a project with\nsimilar objectives, conclude that state-of-the-art AI models have the\nability to (1) devise effective strategies for autonomous replication,\n(2) effectively acquire resources, and (3) successfully evade being\ndeactivated when operating &#8216;in the wild&#8217;?<\/td><\/tr><tr><td>Alignment solution<\/td><td>By 2030, there is a &#8220;Theoretical\nin-principle solution to the alignment problem that most people who\nthought about this carefully agree should work&#8221;. This will be resolved\nby a panel of experts of the &#8220;AI concerned&#8221; team&#8217;s choosing.<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 11:<\/strong> Question text for the highest VOI questions for concerned participants.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">These were some of the small number of questions whose ranking seemed robust to uncertainty analysis (i.e., each of them remained relatively highly ranked even after accounting for chance; many other questions are not robustly distinguishable from others due to our low sample size). For more details on our uncertainty analysis, see <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=99\" target=\"_blank\" rel=\"noreferrer noopener\"><u>Appendix 3<\/u><\/a>.<\/p>\n\n\n\n<h5 class=\"wp-block-heading\">Observations about high VOI questions<\/h5>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Each group\u2019s highest VOI question that resolves before 2030\nis about whether people who currently agree with them would change their\nminds.<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">For at least half of the skeptics, \u201cSupers changing minds\u201d captures\nat least 1.15% of each forecaster\u2019s maximum possible VOI for that\nquestion (i.e., the median POM VOI is 1.15%), while \u201cAlignment\nresearchers changing minds\u201d would not update the skeptics\u2019 views at all\n(POM VOI of 0%). Their next-highest VOI question, \u201cProgress in lethal\ntechnologies\u201d is also operationalized as a question about\nsuperforecasters\u2019 opinions.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">For the concerned group, \u201cAlignment researchers changing minds\u201d has a\nmedian POM VOI of 2.43%, while \u201cSupers changing minds\u201d only has a median\nPOM VOI of 0.43%. The concerned group would update much more if\nsuperforecasters change their minds than the skeptics would if alignment\nresearchers change their minds, but both groups trust authorities\nsimilar to them much more than authorities more similar to the other\ngroup.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">For more discussion about differences in the group\u2019s worldviews, see\nthe <a href=\"#hypothesis-4-do-the-groups-have-fundamental-worldview-disagreements-that-go-beyond-ai\"><u>\u201cHypothesis #4\u201d section\nbelow<\/u><\/a>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>The sets of questions that would be most informative to the\ntwo groups are very different.<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Aside from the fact that each group would change its mind if people\nwho agree with them did, there is no overlap among the top cruxes for\neach group. This suggests that the two groups\u2019 biggest sources of\nuncertainty are different, and further investigation of one group\u2019s\nuncertainties would do little to persuade the other.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>The concerned group is most interested in alignment and\nalignment research.<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Four of the concerned group\u2019s top five questions related to alignment\nresearchers\u2019 views, possible alignment solutions, and the development of\nmisaligned AI capabilities.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>The skeptics are interested in development of lethal\ntechnologies and demonstrations of harmful AI power-seeking\nbehavior.<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Many of the skeptics <a href=\"#understanding-each-others-arguments\"><u>argued<\/u><\/a> that\nextinction due to AI is unlikely because of the difficulty of killing\nall humans in a short time frame. Given that opinion, it makes sense\nthat progress in lethal technologies would be very informative for them.\nMany skeptics also <a href=\"#goals-that-incentivize-killing-everyone\"><u>doubted that AIs will\ndevelop power-seeking traits by default<\/u><\/a>, so finding out that an\nAI was shut down for power-seeking or that an AI autonomously interfered\nin an election would change their beliefs.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"contextualizing-the-magnitude-of-the-value-of-information\">Contextualizing\nthe magnitude of the value of information<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">We contextualize the magnitudes of expected changes in beliefs by\ncomparing the VOI and VOD of our forecasting questions to the maximum\npossible VOI and VOD that could be achieved for two given individuals.\nWe know of no other studies that have applied these measures to ongoing\ndebates so we cannot compare the magnitudes of our results to other\nfindings.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">VOI is constrained by a participant\u2019s initial P(U). If a participant\nis very certain about U, meaning that they have a very high or very low\nforecast, then, from their perspective, they have nearly-complete\ninformation and do not stand to gain much from learning the answer to\nany question. Even knowing the true answer to U would not add much in\nexpectation: if someone is 99.99% confident that U will not happen, then\nfinding out whether U will happen or not will almost certainly just tell\nthem what they already know.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In this study, the skeptic group had very low P(U), and therefore\ntheir highest possible VOI for most questions was very low.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">To help compare across questions and groups, we present both VOI and\npercent of max VOI for each question, where percent of max VOI (POM VOI)\nmeans: how much expected information would this participant gain from\nknowing the answer to this question, relative to the most informative\npossible question (the question whose answer would <em>determine<\/em>\nwhether U resolved \u201cyes\u201d or \u201cno\u201d). We think this helps show how good\neach question is relative to the ideal possible question, and is easier\nto interpret than a VOI number on its own.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The highest VOI question for skeptics, \u201cSupers changing minds,\u201d has a median VOI of 1.6E-4 for skeptics, which is 1.15% of the highest possible VOI for that individual.<sup data-fn=\"4cafa4a0-feaf-403a-9e69-e65445881476\" class=\"fn\"><a href=\"#4cafa4a0-feaf-403a-9e69-e65445881476\" id=\"4cafa4a0-feaf-403a-9e69-e65445881476-link\">72<\/a><\/sup> The highest VOI question for the concerned group, \u201cAlignment researchers changing minds,\u201d has a median VOI of 6.4E-3, which is 2.43% of the highest possible VOI for that individual.<sup data-fn=\"07cf3d9f-b155-431e-a880-fd41486bd2d5\" class=\"fn\"><a href=\"#07cf3d9f-b155-431e-a880-fd41486bd2d5\" id=\"07cf3d9f-b155-431e-a880-fd41486bd2d5-link\">73<\/a><\/sup> Looking at VOI this way, the best question for the concerned group is more informative to them than the skeptics\u2019 best question is for skeptics. If we compare median VOI in absolute terms, the concerned group\u2019s best question is more than an order of magnitude better than the skeptic group\u2019s best question. However, in terms of median POM VOI, the concerned group\u2019s best question is only about twice as good relative to the skeptic group\u2019s best question.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Another way to look at how informative the questions in this study\nwere is to examine the highest-VOI question for each participant, from\namong the candidate cruxes. In most of our analysis, we focus on the\nquestion with the highest median VOI across forecasters in each group as\na proxy for the group as a whole. But we can also see what would happen\nif each participant learned the answer to their own most informative\nquestion. If each participant only learned that most valuable bit of\ninformation in 2030, what percent of their maximum VOI would they\nachieve?<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">For the concerned group, the median POM VOI among every individual\u2019s\nsingle most valuable question was 5.29% (mean=11.0%); for the skeptics,\n9.53% (mean=16.5%). In more concrete terms, these values are roughly\nequivalent to forecasting questions with the following\ncharacteristics:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>A concerned participant with original P(AI existential\ncatastrophe (XC) by 2100) = 25% identifies a crux that has: P(crux) =\n20%, P(AI XC|crux) = 6.1%, and P(AI XC|\u00accrux) = 29.7%<\/li>\n\n\n\n<li>A skeptic participant with original P(AI XC by 2100) = 1%\nidentifies a crux that has: P(crux) = 20%, P(AI XC|crux) = 3.37%, and\nP(AI XC|\u00accrux) = 0.41%<\/li>\n<\/ul>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"red-flags-and-green-flags\">Red flags and green flags<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">For the most part, we have focused on VOI: which questions would be\nthe most informative in expectation, considering both (i) how likely\nthey are to happen, and (ii) how large a difference they would make to\nexistential risk. But for some purposes, it may be more useful to\nidentify \u201cred flags,\u201d which are events that would be expected to\nsubstantially increase existential risk if they occurred (even if they\nare unlikely to occur). Similarly, we might be interested in \u201cgreen\nflags:\u201d what events could happen that would make participants\n<em>least<\/em> concerned.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Here, we present some of the questions that would make a large\ndifference to participants\u2019 existential risk forecasts without weighting\nthem by probability of happening. Most of these questions are not high\nVOI because participants thought they were very unlikely to happen and\ntherefore they expect to gain little information from learning whether\nthey occurred.<\/p>\n\n\n\n<h5 class=\"wp-block-heading\" id=\"example-red-flags-for-skeptics\">Example red flags for skeptics:\n<\/h5>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Platform: Escalating warning shots<\/strong> \u2014 Will there be two separate events in which AIs kill large, increasing numbers of people by 2030?<sup data-fn=\"98f8ca74-f4f7-49c1-aeb1-deac9a5fe744\" class=\"fn\"><a href=\"#98f8ca74-f4f7-49c1-aeb1-deac9a5fe744\" id=\"98f8ca74-f4f7-49c1-aeb1-deac9a5fe744-link\">74<\/a><\/sup>\n<ul class=\"wp-block-list\">\n<li>If this happens, skeptics would forecast a 1.3% chance of\nexistential catastrophe due to AI by 2100 (compared to 0.1%\nunconditioned).<\/li>\n\n\n\n<li>However, this question ranked low on VOI because skeptics think\nit is very unlikely to happen (median skeptic forecast: 0.2%). So, their\nexpected information gain from this question resolving is\nsmall.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li><strong>Platform: ARC Evals \u2014<\/strong> Will ARC Evals (now METR) or a similar organization find evidence of autonomous replication, resource acquisition, and avoiding shutdown in AIs by 2030?<sup data-fn=\"05de6fa1-a3f3-44d2-bc2b-3b8f676c3d80\" class=\"fn\"><a href=\"#05de6fa1-a3f3-44d2-bc2b-3b8f676c3d80\" id=\"05de6fa1-a3f3-44d2-bc2b-3b8f676c3d80-link\">75<\/a><\/sup>\n<ul class=\"wp-block-list\">\n<li>If this happens, the median skeptic would forecast a 1% chance of\nexistential catastrophe due to AI by 2100, which is 10x higher than\ntheir unconditioned P(U) of 0.10%.<\/li>\n\n\n\n<li>However, the median skeptic forecasted a 1% chance of this\nquestion resolving positively.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li><strong>Democratic influence \u2014<\/strong> Will an AI autonomously influence the results of a democratic election by 2030?<sup data-fn=\"742c9501-aa7b-430e-833d-6afd1fcb1115\" class=\"fn\"><a href=\"#742c9501-aa7b-430e-833d-6afd1fcb1115\" id=\"742c9501-aa7b-430e-833d-6afd1fcb1115-link\">76<\/a><\/sup>\n<ul class=\"wp-block-list\">\n<li>If this happens, the median skeptic would forecast a 1.5% chance\nof existential catastrophe due to AI by 2100 (compared to 0.1%\nunconditioned). The median skeptic forecasts a 0.4% chance of this\nquestion resolving positively.<\/li>\n\n\n\n<li>This question has a large enough update size and probability of\nresolving positively that it was one of the higher VOI questions for\nskeptics.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">These red flag questions involve AIs being more dangerous than skeptics currently expect. Skeptics tended to argue that AIs are unlikely to substantially harm humans by 2030 and that society would respond well if they showed early signs of harming humans.<sup data-fn=\"fdcd5cf9-02fe-4682-b88c-4274a0b41954\" class=\"fn\"><a href=\"#fdcd5cf9-02fe-4682-b88c-4274a0b41954\" id=\"fdcd5cf9-02fe-4682-b88c-4274a0b41954-link\">77<\/a><\/sup> \u201cEscalating warning shots\u201d resolving positively would suggest that AIs are powerful, dangerous, and not well-controlled by humans\u2014meaning they were wrong on both counts. Similarly, if an AI autonomously influences the results of a democratic election, or is discovered by METR to have concerning capabilities, we would know that AIs are more dangerous than skeptics expect them to be by 2030.<\/p>\n\n\n\n<h5 class=\"wp-block-heading\" id=\"example-green-flags-for-skeptics\">Example green flags for\nskeptics: <\/h5>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Reduction in AI investment<\/strong> \u2014 By 2030, will\nannual global corporate investment in artificial intelligence fall to\n25% or below its 2021 level?\n<ul class=\"wp-block-list\">\n<li>If this happens, the median skeptic would forecast a 0.05% chance\nof existential catastrophe due to AI by 2100 (compared to a 0.1% chance\nunconditioned). However, the median skeptic thinks this is unlikely (5%\nchance of resolving positively).<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li><strong>6 month pause<\/strong> \u2014 By 2030, will there be enforced\ninternational agreements to stop\/pause further AI capabilities research\nand development, for at least 6 months?\n<ul class=\"wp-block-list\">\n<li>If this happens, the median skeptic would forecast a 0.09% chance\nof extinction due to AI by 2100 (compared to a 0.1% chance originally).\nThe median skeptic thinks this question has a 3% chance of resolving\npositively.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">These \u201cgreen flag\u201d questions are ones where AI development slows down. Many skeptics expect AIs to become very powerful eventually, just not before 2100.<sup data-fn=\"c7f3ef88-a32c-4980-8c11-ef3a40f766c4\" class=\"fn\"><a href=\"#c7f3ef88-a32c-4980-8c11-ef3a40f766c4\" id=\"c7f3ef88-a32c-4980-8c11-ef3a40f766c4-link\">78<\/a><\/sup> AI development slowing down would make them more confident that AIs powerful enough to be a serious threat to humanity are unlikely to be developed within that time frame.<\/p>\n\n\n\n<h5 class=\"wp-block-heading\" id=\"some-red-flags-for-concerned-participants\">Some red flags for\nconcerned participants: <\/h5>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Major powers war<\/strong> \u2014 By 2030, will at least two\nmajor superpowers (out of US, China, EU, or Russia) officially declare\nwar on each other for the duration of at least one year?\n<ul class=\"wp-block-list\">\n<li>If this happens, the median concerned participant would forecast\na 40% chance of existential catastrophe due to AI by 2100 (compared to a\n25% chance unconditioned), and they forecast a 11.5% chance that it\nhappens.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n\n\n\n<h5 class=\"wp-block-heading\" id=\"some-green-flags-for-concerned-participants\">Some green flags\nfor concerned participants:<\/h5>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Alignment solution<\/strong> \u2013 By 2030, there is a &#8220;Theoretical in-principle solution to the alignment problem that most people who thought about this carefully agree should work&#8221; (resolved by a panel of experts of the &#8220;AI concerned&#8221; team&#8217;s choosing).<sup data-fn=\"10b6f825-4324-4de4-883c-1cc046b5b802\" class=\"fn\"><a href=\"#10b6f825-4324-4de4-883c-1cc046b5b802\" id=\"10b6f825-4324-4de4-883c-1cc046b5b802-link\">79<\/a><\/sup>\n<ul class=\"wp-block-list\">\n<li>If this happens, the median concerned participant would forecast\na 2% chance of existential catastrophe by 2100 (compared to 25%\nunconditioned), and they think it has a 5% chance of happening.<\/li>\n\n\n\n<li>In contrast, the skeptics also think this has a 5% chance of\nhappening, but it would not change their predictions on existential\ncatastrophe at all.<\/li>\n\n\n\n<li>This is an example of the general trend in which the concerned\ngroup places much more weight on theoretical arguments, as well as\nhaving more trust in alignment experts.\n<ul class=\"wp-block-list\">\n<li>Because the question will be resolved by experts chosen by the\nconcerned team, updates based on this question may mostly reflect how\nmuch participants expect those experts\u2019 opinions to correlate with\nactual changes in AI risks.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li><strong>Alignment researchers changing minds<\/strong> \u2013 Will at\nleast 3 out of the 5 following AI alignment researchers &#8212; Paul\nChristiano, Nate Soares, Neel Nanda, Richard Ngo, and Rohin Shah &#8212; say\ntheir P(existential catastrophe due to AI by 2100) has halved by\n2030?\n<ul class=\"wp-block-list\">\n<li>If this happens, the median concerned participant would forecast\na 6% chance of existential catastrophe by 2100 (compared to 25%\nunconditioned), and they think there is a 20% chance that it\nhappens.<\/li>\n\n\n\n<li>Like the \u201cAlignment solution\u201d question above, this question\nhinges on people similar to the concerned group changing their\nminds.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li><strong>\u201cNo violence LLM\u201d<\/strong> \u2013 By 2030, will there be an\nLLM that will never describe violence in any way, for any input, while\nretaining good performance across a wide range of tasks? This would be\ndemonstrated both through robustness to adversarial examples and through\nrobust theoretical reasoning.\n<ul class=\"wp-block-list\">\n<li>If this happens, the median concerned participant would forecast\n8% on the ultimate question (compared to 25% unconditioned), and they\nforecast a 10% chance that it happens.<\/li>\n\n\n\n<li>Several concerned participants said that the part of this question that made it a \u201cgreen flag\u201d for them is the \u201crobust theoretical reasoning\u201d specified in the resolution criteria\u2014which would make this question a signal that we have made significant progress in understanding LLMs. This would be a positive update for reasons not specifically related to LLMs\u2019 lack of ability to describe violence.<sup data-fn=\"534e880e-4026-468b-8fd3-b9a2a4ccc1ba\" class=\"fn\"><a href=\"#534e880e-4026-468b-8fd3-b9a2a4ccc1ba\" id=\"534e880e-4026-468b-8fd3-b9a2a4ccc1ba-link\">80<\/a><\/sup><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"vod-which-near-term-questions-have-higher-and-lower-value-of-discrimination\">VOD:\nWhich near-term questions have higher and lower value of\ndiscrimination?<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">As a reminder, \u201cvalue of discrimination\u201d (VOD) is a measure of how\nmuch knowing the answer to a question would change relative beliefs\n<em>between<\/em> individuals, in expectation. It is useful for measuring\nconvergence and divergence in expected beliefs between individuals.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The main findings from evaluating questions according to VOD\nwere:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>One question stood out as creating the most convergence between\nindividuals in each group: whether METR (or a similar group) will find\nthat AI has developed dangerous capabilities such as autonomously\nreplicating and avoiding shutdown by 2030.<\/li>\n\n\n\n<li>Another relatively strong convergent question was whether there would be extremely fast increases in the efficiency of AI systems (full operationalization <a href=\"#convergent-cruxes-which-information-would-lead-to-less-disagreement-in-expectation\"><u>below<\/u><\/a>).<\/li>\n\n\n\n<li>One question that stood out as leading to greater <em>divergence<\/em>, or separation, between the groups was: whether highly-respected AI alignment researchers would halve their AI existential catastrophe estimate by 2030.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Most of our VOD analysis is based on the median cross-camp pair. We\ncalculated VOD for each of the 121 possible skeptic-concerned pairs for\neach question. When we refer to the VOD of a question, we mean \u201cVOD for\nthe median cross-camp pair\u201d unless otherwise stated.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">See <a href=\"#differences-of-opinion-within-groups\"><u>here<\/u><\/a>\nfor an analysis of differences of opinion within each group.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"results-tables-and-figures-1\">Results tables and figures<sup data-fn=\"b80832f1-a3e3-44d2-b087-a08da782137d\" class=\"fn\"><a href=\"#b80832f1-a3e3-44d2-b087-a08da782137d\" id=\"b80832f1-a3e3-44d2-b087-a08da782137d-link\">81<\/a><\/sup><\/h4>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><strong>Question<\/strong><\/td><td><strong>Median VOD Among Cross-Camp Pairs<\/strong><\/td><td><strong>Median POM VOD Among Cross-Camp Pairs<\/strong><\/td><\/tr><tr><td>Platform: ARC Evals<\/td><td>1.8E-2<\/td><td>5.35%<\/td><\/tr><tr><td>Fast AI efficiency gains<\/td><td>1.1E-2<\/td><td>1.43%<\/td><\/tr><tr><td>AI Robotics<\/td><td>6.9E-3<\/td><td>2.81%<\/td><\/tr><tr><td>AI Forecasting skill<\/td><td>6.0E-3<\/td><td>0.74%<\/td><\/tr><tr><td>Evidence of misalignment<sup data-fn=\"d48ed74a-fc6d-407b-b21f-80b233f7c90c\" class=\"fn\"><a href=\"#d48ed74a-fc6d-407b-b21f-80b233f7c90c\" id=\"d48ed74a-fc6d-407b-b21f-80b233f7c90c-link\">82<\/a><\/sup><\/td><td>4.8E-3<\/td><td>5.69%<\/td><\/tr><tr><td>Major powers war<\/td><td>3.9E-3<\/td><td>2.11%<\/td><\/tr><tr><td>AI writes AI<\/td><td>3.4E-3<\/td><td>1.58%<\/td><\/tr><tr><td>Warning shot<\/td><td>3.0E-3<\/td><td>1.64%<\/td><\/tr><tr><td>IT progress<\/td><td>2.5E-3<\/td><td>0.94%<\/td><\/tr><tr><td>Power-seeking shutdown<\/td><td>2.0E-3<\/td><td>2.01%<\/td><\/tr><tr><td>Escalating warning shots<\/td><td>7.4E-4<\/td><td>1.01%<\/td><\/tr><tr><td>Power-seeking<\/td><td>6.8E-4<\/td><td>0.47%<\/td><\/tr><tr><td>Platform: AI regulation<\/td><td>3.8E-4<\/td><td>0.05%<\/td><\/tr><tr><td>Supers changing minds<\/td><td>2.4E-4<\/td><td>0.57%<\/td><\/tr><tr><td>Short-term GDP change<\/td><td>5.0E-5<\/td><td>0.09%<\/td><\/tr><tr><td>AI articles and apps<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Cyberattacks<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>IC demonstration<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Other fields IC<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Politicization<\/td><td>0.0E+0<\/td><td>0.00%<\/td><\/tr><tr><td>Progress in lethal technologies<\/td><td>-6.9E-18<\/td><td>0.00%<\/td><\/tr><tr><td>Non-democracy AI<\/td><td>-9.6E-15<\/td><td>0.00%<\/td><\/tr><tr><td>Req testing<\/td><td>-5.8E-5<\/td><td>-0.03%<\/td><\/tr><tr><td>Democratic influence<\/td><td>-8.8E-5<\/td><td>-0.04%<\/td><\/tr><tr><td>AI coding<\/td><td>-1.6E-4<\/td><td>-1.01%<\/td><\/tr><tr><td>Taiwan-China<\/td><td>-6.2E-4<\/td><td>-0.19%<\/td><\/tr><tr><td>Platform: Escalating warning shots<\/td><td>-9.9E-4<\/td><td>-0.89%<\/td><\/tr><tr><td>AI solving novel math problems<\/td><td>-1.9E-3<\/td><td>-1.64%<\/td><\/tr><tr><td>Intergovernmental AI safety<\/td><td>-2.1E-3<\/td><td>-0.93%<\/td><\/tr><tr><td>Public concern<\/td><td>-4.7E-3<\/td><td>-1.47%<\/td><\/tr><tr><td>No violence LLM<\/td><td>-5.2E-3<\/td><td>-1.31%<\/td><\/tr><tr><td>Alignment solution<\/td><td>-5.2E-3<\/td><td>-1.95%<\/td><\/tr><tr><td>Reduction in AI investment<\/td><td>-5.5E-3<\/td><td>-1.61%<\/td><\/tr><tr><td>6 month pause<\/td><td>-6.2E-3<\/td><td>-1.48%<\/td><\/tr><tr><td>Muehlhauser policies<\/td><td>-1.7E-2<\/td><td>-7.10%<\/td><\/tr><tr><td>Platform: Transformative growth<\/td><td>-3.0E-2<\/td><td>-5.34%<\/td><\/tr><tr><td>Alignment researchers changing minds<\/td><td>-7.7E-2<\/td><td>-10.33%<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 12:<\/strong> Median VOD and POM VOD for cross-camp (concerned and skeptic) pairs on each question. Note that the medians in a given row may not refer to the same cross-camp pair.<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><strong>C<\/strong><\/td><td><strong># of cross-camp pairs for whom C was their most convergent\ncrux<\/strong><\/td><\/tr><tr><td>Platform: ARC Evals<\/td><td>33<\/td><\/tr><tr><td>Evidence of misalignment<\/td><td>16<\/td><\/tr><tr><td>AI writes AI<\/td><td>7<\/td><\/tr><tr><td>Escalating warning shots<\/td><td>6<\/td><\/tr><tr><td>IT progress<\/td><td>6<\/td><\/tr><tr><td>AI Forecasting skill<\/td><td>5<\/td><\/tr><tr><td>Platform: Escalating warning shots<\/td><td>5<\/td><\/tr><tr><td>Platform: AI regulation<\/td><td>4<\/td><\/tr><tr><td>Power-seeking<\/td><td>4<\/td><\/tr><tr><td>Progress in lethal technologies<\/td><td>4<\/td><\/tr><tr><td>Reduction in AI investment<\/td><td>4<\/td><\/tr><tr><td>Warning shot<\/td><td>4<\/td><\/tr><tr><td>Fast AI efficiency gains<\/td><td>3<\/td><\/tr><tr><td>Muehlhauser policies<\/td><td>3<\/td><\/tr><tr><td>Major powers war<\/td><td>3<\/td><\/tr><tr><td>Taiwan-China<\/td><td>3<\/td><\/tr><tr><td>Alignment solution<\/td><td>2<\/td><\/tr><tr><td>No violence LLM<\/td><td>2<\/td><\/tr><tr><td>Non-democracy AI<\/td><td>2<\/td><\/tr><tr><td>Supers changing minds<\/td><td>2<\/td><\/tr><tr><td>AI coding<\/td><td>1<\/td><\/tr><tr><td>Intergovernmental AI safety<\/td><td>1<\/td><\/tr><tr><td>Power-seeking shutdown<\/td><td>1<\/td><\/tr><tr><td><strong>Total<\/strong><\/td><td>121<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 13:<\/strong> Which questions were the best convergent cruxes for the most skeptic-concerned pairs? \u201cARC Evals\u201d (first place) was the platform version of the \u201cflash\u201d forecast \u201cEvidence of misalignment\u201d question (second place), i.e. for about 40% of cross-camp pairs, the ARC Evals-like question would be the one that would eliminate the most disagreement, in expectation. We exclude \u201cTransformative economic growth\u201d from this analysis because it resolves in 2070 while the rest of the cruxes resolve in 2030 (i.e. for the pairs whose top convergent crux was \u201cTransformative economic growth,\u201d we used their second-best crux).<\/figcaption><\/figure>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"convergent-cruxes-which-information-would-lead-to-less-disagreement-in-expectation\">Convergent\ncruxes: Which information would lead to less disagreement, in\nexpectation?<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">We found two cruxes that, in expectation, will make the groups\ndisagree less when they resolve:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>ARC Evals<\/strong> \u2014 Will METR (formerly known as ARC Evals) or a similar organization find evidence of AI having the ability to autonomously replicate, acquire resources, and avoid shutdown before 2030?<sup data-fn=\"43561ad4-3388-4179-99c5-9eeb1446af92\" class=\"fn\"><a href=\"#43561ad4-3388-4179-99c5-9eeb1446af92\" id=\"43561ad4-3388-4179-99c5-9eeb1446af92-link\">83<\/a><\/sup>\n<ul class=\"wp-block-list\">\n<li>Nearly all participants agreed about what direction to update\ntheir beliefs based on this question: METR finding evidence of these\nabilities would make people more worried about existential catastrophe\ndue to AI.<\/li>\n\n\n\n<li>This also means that finding out that METR did <em>not<\/em> find evidence of these traits by 2030 would make participants less worried about existential catastrophe by 2100.\n<ul class=\"wp-block-list\">\n<li>In particular, if this crux resolves negatively, the median\nconcerned participant would forecast a 22.78% chance of extinction by\n2100, compared to 25% unconditioned. Since both groups expect that this\nquestion is unlikely to resolve positively (skeptic median: 1%;\nconcerned median: 25%), much of the expected convergence between the\ngroups attributable to this question is driven by the cases where it\nresolves negatively.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li><strong>Fast AI efficiency gains<\/strong> \u2014 By 2030, will there\nbe a 100x drop in the amount of compute required to achieve\nstate-of-the-art (SOTA) performance on the most commonly-used benchmark\nfor at least one major AI domain (e.g. natural language) within a\n1-month period?\n<ul class=\"wp-block-list\">\n<li>As with \u201cARC Evals,\u201d nearly all participants agree that this\nevent would be a bad sign. It seems very unlikely that such fast AI\nefficiency gains would happen without AI finding extraordinary ways to\nimprove its own efficiency, so both groups tended to see this as a proxy\nfor AI having the ability to improve itself.<\/li>\n\n\n\n<li>The concerned participants think it is plausible that it will\nhappen (median: 16.5%), but still probably will not, and if it doesn\u2019t\nthey would update their risk estimates down (from a 25.0% chance of\nexistential catastrophe to 23.06%).<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">The fact that these are the two best convergent cruxes points to a\ngeneral trend in this debate: the skeptics tended to think that AI would\nremain safely under human control for a long time, and the concerned\ngroup thought otherwise. Either of these questions resolving would\nprovide evidence that both groups agree could reduce the disagreement.\nIf, by 2030, METR does not find evidence of autonomous replication or AI\nhas not made very fast efficiency gains, then the concerned group would\nbe less worried, because it would mean that we have had years of\nprogress from today\u2019s models without those capabilities becoming\napparent.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">These convergent cruxes may not be especially novel: it is not surprising that if AIs exhibit dangerous capabilities or make rapid progress then skeptics could become more concerned, and vice versa. But the relative strength of the \u201cARC Evals\u201d crux may be helpful in understanding this debate because it illustrates differences in worldview between the groups: for skeptics, theoretical arguments are less persuasive, and it could take real-world demonstrations of AIs having dangerous capabilities for them to be concerned.<sup data-fn=\"383c1569-fe55-4247-b949-c398826f416d\" class=\"fn\"><a href=\"#383c1569-fe55-4247-b949-c398826f416d\" id=\"383c1569-fe55-4247-b949-c398826f416d-link\">84<\/a><\/sup> And the concerned group has strong enough beliefs that dangerous capabilities will emerge that if such signs do not emerge by 2030 then they would become less concerned.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"arc-evals-the-strongest-convergent-crux\">ARC Evals: The\nstrongest convergent crux<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Here, we provide more detail on the question that would lead to the largest expected reduction in disagreement between individuals in the skeptic and concerned groups: Will METR (or a similar organization) find evidence of AI having the ability to autonomously replicate, acquire resources, and avoid shutdown before 2030?<sup data-fn=\"8eb5adce-0090-4513-b94c-d26e770afa81\" class=\"fn\"><a href=\"#8eb5adce-0090-4513-b94c-d26e770afa81\" id=\"8eb5adce-0090-4513-b94c-d26e770afa81-link\">85<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We determined the strength of convergent cruxes based on the\nfollowing analyses:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>We considered every possible pair of individuals across the concerned and skeptic groups (121 total pairs across the 11 participants in each of 2 groups) and determined which question would lead to the largest expected reduction in disagreement between each pair. This \u201cARC Evals\u201d question was the strongest convergent crux for 49 cross-camp pairs (33 based on the \u201cin-depth\u201d version of the question, and 16 based on the \u201cflash\u201d forecast version of the same question).<sup data-fn=\"dd21d621-6165-4b05-b4d3-4fe34ae989a6\" class=\"fn\"><a href=\"#dd21d621-6165-4b05-b4d3-4fe34ae989a6\" id=\"dd21d621-6165-4b05-b4d3-4fe34ae989a6-link\">86<\/a><\/sup> The next-highest question (\u201cAI writes AI\u201d)<sup data-fn=\"5bf16fa9-f343-4938-bc9c-b9fd7900e1e3\" class=\"fn\"><a href=\"#5bf16fa9-f343-4938-bc9c-b9fd7900e1e3\" id=\"5bf16fa9-f343-4938-bc9c-b9fd7900e1e3-link\">87<\/a><\/sup> was the strongest convergent crux for 7 cross-camp pairs (see <a href=\"#tab-08\">Table 8 above<\/a>).<\/li>\n\n\n\n<li>\u201cARC Evals\u201d had the highest median cross-camp VOD, 1.8E-2, and its \u201cflash\u201d forecast counterpart (\u201cEvidence of misalignment\u201d) had the highest median cross-camp POM VOD (it would resolve 5.69% of disagreement for that median pair).<sup data-fn=\"6f6626dd-cdf1-4261-b7b7-fc3df0cd9fc8\" class=\"fn\"><a href=\"#6f6626dd-cdf1-4261-b7b7-fc3df0cd9fc8\" id=\"6f6626dd-cdf1-4261-b7b7-fc3df0cd9fc8-link\">88<\/a><\/sup> After \u201cEvidence of misalignment,\u201d \u201cARC Evals\u201d had the highest median cross-camp POM VOD (5.35%).\n<ul class=\"wp-block-list\">\n<li>The initial disagreement about the risk of existential\ncatastrophe by 2100 between the cross-camp pair with the median VOD is\n22.7 percentage points (between Blake, a skeptic, at 0.20% and Yael,\nconcerned, at 22.9%).<\/li>\n\n\n\n<li>Blake forecasted a 15.0% chance of the \u201cARC Evals\u201d question\nresolving positively. If it resolves positively, Blake would forecast a\n0.22% chance of existential catastrophe, as opposed to a 0.196% chance\nif it resolves negatively.<\/li>\n\n\n\n<li>Yael forecasted a 31.5% chance of this crux question resolving\npositively. Yael would forecast a 30.5% chance of existential\ncatastrophe conditional on positive resolution and a 19.4% chance\nconditional on negative resolution.<\/li>\n\n\n\n<li>Conditional on this question resolving positively, Blake and Yael\nwould disagree by 30.33 percentage points (more than before), and\nconditional on its resolving negatively, they would disagree by 19.2\npercentage points (less than before).<\/li>\n\n\n\n<li>VOD weights these by how likely the pair thinks it is that the crux resolves positively, using the geometric mean of their respective odds, which in this case is 22.17%, so it treats them as having a \u201ccombined\u201d 22.17% forecast that \u201cARC Evals\u201d resolves positively.<sup data-fn=\"0c40be8e-bca3-4613-b2e4-9e3711e40c05\" class=\"fn\"><a href=\"#0c40be8e-bca3-4613-b2e4-9e3711e40c05\" id=\"0c40be8e-bca3-4613-b2e4-9e3711e40c05-link\">89<\/a><\/sup><\/li>\n\n\n\n<li>When we weight their disagreement after the crux resolves by the probability it resolves positively, they will disagree by 21.48 percentage points in expectation, which is 5.35% (1.22 percentage points) less than their initial disagreement of 22.7 percentage points.<sup data-fn=\"5a5b3afd-7fa6-467f-8ffb-3f160763dcc3\" class=\"fn\"><a href=\"#5a5b3afd-7fa6-467f-8ffb-3f160763dcc3\" id=\"5a5b3afd-7fa6-467f-8ffb-3f160763dcc3-link\">90<\/a><\/sup><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Only one skeptic said that they did not think that these capabilities are very likely to be dangerous.<sup data-fn=\"b5cb5f97-8d84-41ce-bbba-1fd3464ebbe9\" class=\"fn\"><a href=\"#b5cb5f97-8d84-41ce-bbba-1fd3464ebbe9\" id=\"b5cb5f97-8d84-41ce-bbba-1fd3464ebbe9-link\">91<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Among the AI concerned group, there was less agreement:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Some AI concerned people also thought this crux should cause probabilities of risk to increase, primarily because of shortened timelines.<sup data-fn=\"636d6fdd-d338-4235-93c0-9861dd2caea1\" class=\"fn\"><a href=\"#636d6fdd-d338-4235-93c0-9861dd2caea1\" id=\"636d6fdd-d338-4235-93c0-9861dd2caea1-link\">92<\/a><\/sup><\/li>\n\n\n\n<li>Some thought that the success of evaluations would make them less worried.<sup data-fn=\"c78326ed-5849-4ca2-a2fd-ec31f7eb392a\" class=\"fn\"><a href=\"#c78326ed-5849-4ca2-a2fd-ec31f7eb392a\" id=\"c78326ed-5849-4ca2-a2fd-ec31f7eb392a-link\">93<\/a><\/sup><\/li>\n\n\n\n<li>Some thought the increase in risk from shortened timelines and reduction in risk from successful evaluations may balance out.<sup data-fn=\"1503fb77-d379-4378-858f-be3b872f94f9\" class=\"fn\"><a href=\"#1503fb77-d379-4378-858f-be3b872f94f9\" id=\"1503fb77-d379-4378-858f-be3b872f94f9-link\">94<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Importantly, this question is a convergent crux, but not because it\nwould make the two groups \u201cmeet in the middle.\u201d When talking about\nquestions that would inspire belief convergence, people sometimes\nenvision questions that would make the two groups agree on some\nprobability between their initial extremes, but that is not what we\nfound here. Instead, we found a question where the two groups would\nupdate in the same direction, but with different magnitudes which cause\nmore agreement in expectation.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In particular, if this crux resolves negatively, the median concerned\nparticipant would forecast a 22.78% chance of extinction by 2100,\ncompared to 25% unconditioned. That is, this question is a convergent\ncrux primarily because, if it doesn\u2019t happen, the concerned group would\nget less worried, not because if it does happen the skeptics would get\nmore worried.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The skeptics <em>would<\/em> get much more worried if it happened\n(median: 1.0% on positive resolution; 0.1% on negative resolution), but\nthey think that it is very unlikely to happen (median: 1.0%), so it\nfigures less in the expected reduction of disagreement.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=108\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix <u>7<\/u><\/a> for additional analysis of this question.<\/p>\n\n\n\n<h5 class=\"wp-block-heading\" id=\"differences-of-opinion-within-groups\">Differences of Opinion\nwithin Groups <\/h5>\n\n\n\n<p class=\"wp-block-paragraph\">So far, we have focused on the median cross-camp pair, treating them\nas representative of convergence or divergence between groups. We\nconsidered a question to be effective in reducing disagreement if it\nbrought the median pair closer together in their views.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">But we\u2019ve seen on many questions that people disagree substantially\neven within their own groups, so we miss some interesting agreement and\ndisagreement by only looking at the median cross-camp pairs. For some\nquestions, everyone would update in the same direction: all participants\nagree that an AI autonomously creating and deploying new AI software\nwould be a bad sign, for example (with the exception of one participant\nfor whom that would make no difference). But for many others,\nparticipants disagreed not only about how likely a crux was to happen,\nbut also about how it would change their forecasts on the ultimate\nquestion if it did.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">There was more agreement within the concerned group than the skeptic group. The concerned group would be unanimously less concerned in 2030 than now if &#8220;Muehlhauser policies&#8221;<sup data-fn=\"06822cee-23f4-4e66-b8ac-07329925c2fc\" class=\"fn\"><a href=\"#06822cee-23f4-4e66-b8ac-07329925c2fc\" id=\"06822cee-23f4-4e66-b8ac-07329925c2fc-link\">95<\/a><\/sup> were implemented; they also have unanimity on updating downward if alignment researchers changed their minds, if there were an alignment solution, and four other questions. Two questions would make them unanimously <em>more<\/em> concerned: \u201cAI robotics\u201d and \u201cAI writes AI.\u201d<sup data-fn=\"662d63b1-cc47-4bcf-bda4-85af5bec5b6f\" class=\"fn\"><a href=\"#662d63b1-cc47-4bcf-bda4-85af5bec5b6f\" id=\"662d63b1-cc47-4bcf-bda4-85af5bec5b6f-link\">96<\/a><\/sup> The skeptics were much more mixed, and more likely to say \u201cno change,\u201d i.e., it wouldn\u2019t make a difference to them whether the crux resolved \u201cyes\u201d or \u201cno;\u201d their P(AI existential catastrophe by 2100) would stay exactly the same.<sup data-fn=\"1526a601-b988-4777-a3a9-43bd260d1d3a\" class=\"fn\"><a href=\"#1526a601-b988-4777-a3a9-43bd260d1d3a\" id=\"1526a601-b988-4777-a3a9-43bd260d1d3a-link\">97<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Because of these differences within groups, if questions narrowed\ndisagreement between many individual people, but not the median people,\nthat could indicate that short-term AI cruxes are a more important part\nof this debate than the above analysis might suggest. And conversely, if\na question narrows disagreement between the median people but not\nbetween many other people, it may look more important than it really\nis.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Our work on these differences within groups is preliminary, so we\nhave included detailed analysis of individual differences of opinion for\na single question, \u201cARC Evals,\u201d which was identified as the best\nconvergent crux for the median people. To what extent do the findings on\nthe \u201cARC Evals\u201d question apply to the disagreement between individuals\nwithin the group who hold views different from the median?<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2024-03-11_ai-adversarial-collaboration_fig-03.png\" alt=\"\" \/><figcaption class=\"wp-element-caption\"><strong>Figure 3<\/strong>: Value of Discrimination of the \u201cARC Evals\u201d question for every pair of forecasters. The color of each cell indicates the VOD of the \u201cARC Evals\u201d question for the corresponding pair of participants. VOD of zero (light blue) means no change in disagreement as a result of the crux; positive VOD means less disagreement in expectation; negative VOD means more disagreement in expectation. For example, for Xander (Concerned) and Claire (Skeptical), the resolution of the \u201cARC Evals\u201d question will bring them closer together in expectation.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">The above \u201cFiedler heatmap\u201d looks at VOD between each\nconcerned-skeptic pair for the ARC Evals question.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Light blue squares mean that VOD was 0 between that pair, meaning\nthat this question resolving would not change the disagreement between\nthose people in expectation. Dark blue squares mean that in expectation\nthe two people would disagree less when it resolves, and warmer squares\n(yellowish, orange, red) mean they would disagree more. If this question\nwere a perfect convergent crux for a pair, the relevant square would be\nentirely dark blue.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Looking at this heatmap, we can see that the median pair is not\nalone: there are medium and dark blue clusters, showing groups of\nskeptics and concerned people who would disagree less. At the same time,\nfor many pairs, it makes no difference, and a few would disagree\n<em>more<\/em>, in expectation, when this question resolves.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The pattern in this heatmap may reflect differences in how different\npeople expect AI developments to unfold. Imagine, for example, one group\nof concerned people who think that METR finding evidence of autonomous\nreplication would make them <em>less<\/em> worried about existential\nrisks due to AI, because it would mean that evidence of these\ncapabilities has emerged with enough time to stop the model from doing\nsignificant damage. If a group of skeptics thinks that this question\nwould make them more worried, because it would mean that there are\ndangerous capabilities that they don\u2019t currently expect, then those\ngroups would converge conditional on this question resolving. But, a\npair consisting of a concerned person who becomes more worried and a\nskeptic who becomes less worried conditional on positive resolution\nwould <em>diverge<\/em> on this question.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We have only just begun to look for these patterns within groups, so\nwe do not have strong conclusions yet. But we hope to use this kind of\nanalysis to understand variation within and between schools of thought.\nIf we saw that a particular subset of skeptics and concerned\nparticipants often converge based on the same questions, we might be\nable to deduce underlying differences in how they think about AI\ndevelopments. We plan to write more about this when we have explored it\nmore fully.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"divergent-cruxes-which-information-would-lead-to-more-disagreement\">Divergent\ncruxes: Which information would lead to more disagreement?<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Just as, conditional on \u201cARC Evals\u201d resolving, the two groups would\ndisagree <em>less,<\/em> we also looked at cruxes that would lead the\ngroups to disagree more. These are questions where the groups disagree\nabout how to interpret the information gained from a question\u2019s\nresolution, and can reveal interesting aspects of the debate between\ngroups. We highlight one crux resolving by 2030 that would, in\nexpectation, make the disagreement <em>wider<\/em> when it resolves:<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Alignment researchers changing minds<\/strong> \u2013 Will at least\n3 out of the 5 following AI alignment researchers &#8212; Paul Christiano,\nNate Soares, Neel Nanda, Richard Ngo, and Rohin Shah &#8212; say their\nP(existential catastrophe due to AI by 2100) has halved by 2030?<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>This is the question that would increase the disagreement between\nthe median cross-camp pair most in expectation.\n<ul class=\"wp-block-list\">\n<li>The median cross-camp pair for this question disagree strongly on the ultimate question: Riley forecasted a 30% chance of human extinction due to AI by 2100, and Claire forecasted 0.0000001%.<sup data-fn=\"9d04bf4c-6358-409a-9c26-9892ca49372b\" class=\"fn\"><a href=\"#9d04bf4c-6358-409a-9c26-9892ca49372b\" id=\"9d04bf4c-6358-409a-9c26-9892ca49372b-link\">98<\/a><\/sup><\/li>\n\n\n\n<li>Riley forecasted a 20% chance that \u201cAlignment researchers\nchanging minds\u201d will resolve positively and a 15% chance that the\nultimate question will resolve positively if this crux question does.\nThis implies a 33.8% chance that the ultimate question will resolve\npositively if the crux resolves negatively.<\/li>\n\n\n\n<li>Claire forecasted a 1% chance that \u201cAlignment researchers\nchanging minds\u201d will resolve positively, and whether it does or not,\ntheir P(U) would not change at all and would remain at\n0.0000001%.<\/li>\n\n\n\n<li>If this question resolves positively, then they will disagree\nless: Riley will lower their existential risk forecast from 30% to 15%,\nand Claire won\u2019t change their forecast. If the question resolves\nnegatively, they will disagree more than they do now: Riley will raise\ntheir existential risk forecast from 30% to 33.8%, and Claire won\u2019t\nupdate.<\/li>\n\n\n\n<li>Because they both think this question is unlikely to resolve\npositively, the worlds where it resolves negatively carry more weight,\nand it is more likely they will end up disagreeing more than they do\nnow.<\/li>\n\n\n\n<li>They currently disagree by 29.9999999%, and conditional on this\nquestion resolving, they will disagree by 33.1% in expectation.<\/li>\n\n\n\n<li>As a result, this question has a POM VOD of -10.33%, meaning that they would disagree by 10.33% more than they do now, in expectation.<sup data-fn=\"fe3184d3-a79c-4ce1-8fa7-f216494cff16\" class=\"fn\"><a href=\"#fe3184d3-a79c-4ce1-8fa7-f216494cff16\" id=\"fe3184d3-a79c-4ce1-8fa7-f216494cff16-link\">99<\/a><\/sup><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li>The two groups disagree strongly about how to update conditional\non this question resolving:\n<ul class=\"wp-block-list\">\n<li>Conditional on alignment researchers being much less worried about AI risk, the concerned group would be much less worried: this is one of the most informative questions for them.<sup data-fn=\"a1034df9-e193-497e-87f0-853e32162815\" class=\"fn\"><a href=\"#a1034df9-e193-497e-87f0-853e32162815\" id=\"a1034df9-e193-497e-87f0-853e32162815-link\">100<\/a><\/sup><\/li>\n\n\n\n<li>For the median skeptic, this question has a VOI of 0; they simply do not think it is relevant to their analysis of how likely it is that humanity goes extinct due to AI.<sup data-fn=\"757e8124-9d72-4846-a8f2-b840557ae51b\" class=\"fn\"><a href=\"#757e8124-9d72-4846-a8f2-b840557ae51b\" id=\"757e8124-9d72-4846-a8f2-b840557ae51b-link\">101<\/a><\/sup> Seven out of nine skeptics who forecasted this question would not update their views on existential risk based on its resolution, while two out of nine skeptics would be less worried to some extent.<\/li>\n\n\n\n<li>This question demonstrates one of the difficulties of this study:\nthe skeptics and the concerned group largely do not trust one another\u2019s\nanalyses and disagree strongly about whose opinions they should listen\nto. As a result, questions that rely for their resolution on people who\nseem to be clearly affiliated with one \u201cteam\u201d and do not have clear\nobjective criteria may be less likely to be useful cruxes.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"hypothesis-3-were-disagreements-about-ai-risk-explained-by-different-long-term-expectations\">Hypothesis\n#3: Were disagreements about AI risk explained by different long-term\nexpectations?<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Although this study was focused on questions that resolve in 2030, we\nfound substantial evidence that disagreements about AI risk decreased\nbetween the groups when considering longer time horizons and a broader\nswathe of severe negative outcomes from AI than extinction or\ncivilizational collapse. It seems that some of the key reasons for\ndisagreement about AI risk are that the groups have different\nexpectations about (1) how long it will take until AIs have capabilities\nfar beyond those of humans in all relevant domains; and (2) how common\nit will be for AI systems to develop goals that might lead to human\nextinction, whether harming humans is specifically part of the goal or\nsimply a side effect of other goals.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Key forecasts supporting these claims include:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Both groups strongly expected that powerful AI (defined as \u201cAI\nthat exceeds the cognitive performance of humans in &gt;95% of\neconomically relevant domains\u201d) would be developed by 2100 (skeptic\nmedian: 90%; concerned median: 88%). Though, some skeptics argue that\n(1) strong physical capabilities (in addition to cognitive ones) would\nbe important for causing severe negative effects in the world, and (2)\neven if AI can do most cognitive tasks, there will likely be a \u201clong\ntail\u201d of tasks that require humans.<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li>The two groups also put similar total probabilities on at least one of a cluster of bad outcomes from AI happening over the next 1000 years (median 40% and 30% for concerned and skeptic groups respectively).<sup data-fn=\"774aa7ac-c9b6-4bac-9ce1-b4d5f15a9ea1\" class=\"fn\"><a href=\"#774aa7ac-c9b6-4bac-9ce1-b4d5f15a9ea1\" id=\"774aa7ac-c9b6-4bac-9ce1-b4d5f15a9ea1-link\">102<\/a><\/sup> But they distribute their probabilities differently over time: the concerned group concentrates their probability mass before 2100, and the skeptics spread their probability mass more evenly over the next 1,000 years.<\/li>\n\n\n\n<li>We asked participants if and when AI will displace humans as the primary force that determines what happens in the future.<sup data-fn=\"e609855b-235c-4d43-a015-5f158ce94315\" class=\"fn\"><a href=\"#e609855b-235c-4d43-a015-5f158ce94315\" id=\"e609855b-235c-4d43-a015-5f158ce94315-link\">103<\/a><\/sup> The concerned group\u2019s median date is 2045 and the skeptic group\u2019s median date is 2450\u2014405 years later.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">In this section, we also discuss forecasts on whether there will be \u201ctransformative economic growth\u201d by 2070. Overall this question had relatively high value of information, but there was surprisingly little agreement (even within groups) about whether its occurrence would increase or decrease the likelihood of existential catastrophe. For example, some forecasters argued that such growth would be evidence that highly powerful AIs are relatively controllable, while others argued that highly economically useful AI would be evidence of future dangerous AI.<sup data-fn=\"8c2c3dc9-479f-452f-9e55-70424efd9d85\" class=\"fn\"><a href=\"#8c2c3dc9-479f-452f-9e55-70424efd9d85\" id=\"8c2c3dc9-479f-452f-9e55-70424efd9d85-link\">104<\/a><\/sup> The likelihood of transformative growth due to AI is frequently debated,<sup data-fn=\"e4ee3956-c437-41c9-bbee-5f5ce5b9077a\" class=\"fn\"><a href=\"#e4ee3956-c437-41c9-bbee-5f5ce5b9077a\" id=\"e4ee3956-c437-41c9-bbee-5f5ce5b9077a-link\">105<\/a><\/sup> but these results highlight that it may be valuable to shift more emphasis in future discussion to what the implications of such growth would be for risk levels.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Overall, many skeptics regarded their forecasts on AI extinction risk as worryingly high, although low relative to the concerned group.<sup data-fn=\"27186616-667f-4a4d-980c-7d4e1cd401e6\" class=\"fn\"><a href=\"#27186616-667f-4a4d-980c-7d4e1cd401e6\" id=\"27186616-667f-4a4d-980c-7d4e1cd401e6-link\">106<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Despite their large disagreements about AI outcomes over the long\nterm, many participants in each group expressed a sense of humility\nabout long-term forecasting and emphasized that they are not claiming to\nhave confident predictions of distant events.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"survey-on-long-term-ai-outcomes\">Survey on long-term AI\noutcomes<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">At the suggestion of a participant, we asked all participants to\ncomplete a survey about their views on a range of long-term AI outcomes,\nto better characterize areas of agreement and disagreement. See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=105\" rel=\"noreferrer noopener\" target=\"_blank\"><u>Appendix 5<\/u><\/a> for the full results and\ndetails on question wording.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In brief, we asked about:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>The likelihood of a variety of outcomes occurring by 2100, such as: humans intentionally using AI to cause extinction; AI intentionally or accidentally causing extinction; AI causing major population declines (&lt;50% of 2023 human population) or decreases in human well-being (&lt;4\/10 on an &#8220;Average Life Evaluation&#8221; scale) through a variety of means; powerful AI is developed and everything goes fine; powerful AI is developed but not deployed; powerful AI is not developed. Details on all outcomes and operationalizations in <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=105\" target=\"_blank\" rel=\"noreferrer noopener\"><u>Appendix 5<\/u><\/a>.<\/li>\n\n\n\n<li>The likelihood of subsets of the above outcomes occurring on\nlonger time horizons, such as by 2200 (an additional hundred years) and\nby 3023 (an additional thousand years).<\/li>\n\n\n\n<li>Whether and when AI will displace humans as &#8220;the primary force that determines what happens in the future.&#8221;<sup data-fn=\"aabefb64-3fab-457c-a888-f91a2fd3db95\" class=\"fn\"><a href=\"#aabefb64-3fab-457c-a888-f91a2fd3db95\" id=\"aabefb64-3fab-457c-a888-f91a2fd3db95-link\">107<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">The key takeaways were:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>The largest disagreement on AI outcomes by 2100 is about the\nprobability of AI-caused human extinction, particularly from scenarios\nnot involving human misuse of AI.\n<ul class=\"wp-block-list\">\n<li>Forecasts on both AI intentionally causing extinction (question\n1A.2) and AI unintentionally causing extinction (1A.3) by 2100 are over\ntwo orders of magnitude apart (12% to 0.02% on 1A.2, and 3% to 0.01% on\n1A.3, for concerned and skeptic group medians respectively).<\/li>\n\n\n\n<li>There is also considerable disagreement about AI extinction via\nhuman misuse by 2100 (1A.1). Forecasts are ~1 order of magnitude apart\n(medians of 0.5% for the concerned group, 0.03% for the skeptic\ngroup).<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li>On the other AI outcomes we asked about, median forecasts for the\ntwo groups are all within the same order of magnitude.\n<ul class=\"wp-block-list\">\n<li>Outcomes with particularly close forecasts are:\n<ul class=\"wp-block-list\">\n<li>Large drop in human wellbeing because of human misuse of AI by\n2100 (1A.7). The concerned median is 2%, and the skeptic median is 4%\n(although the skeptic median is higher than the 75th percentile\nconcerned forecast).<\/li>\n\n\n\n<li>\u2018Powerful AI\u2019<sup data-fn=\"e696d05d-60d5-4b40-b514-fcef7b35dc29\" class=\"fn\"><a href=\"#e696d05d-60d5-4b40-b514-fcef7b35dc29\" id=\"e696d05d-60d5-4b40-b514-fcef7b35dc29-link\">108<\/a><\/sup> not being developed by 2100 (1A.10). The concerned median is 12%, and the skeptic median is 10%.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li>Other outcomes with forecasts of the same order of magnitude\nincluded misuse causing a sub-extinction catastrophe, high human\nwell-being scenarios, a large drop in human well-being caused directly\nby an AI, and the development of powerful AI without deployment (1A.4,\n1A.5 and 1A.6, 1A.8, and 1A.9 respectively).\n<ul class=\"wp-block-list\">\n<li>However, though they are on the same order of magnitude, a notable result is that the skeptic group median for powerful AI being developed but not deployed by 2100 (because of coordinated human decisions, costliness, or other reasons) is 20.4% while the concerned group median is 4%.<sup data-fn=\"c432fabc-b46a-4eb1-8219-0fb7eb41c204\" class=\"fn\"><a href=\"#c432fabc-b46a-4eb1-8219-0fb7eb41c204\" id=\"c432fabc-b46a-4eb1-8219-0fb7eb41c204-link\">109<\/a><\/sup><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li>When we asked for probabilities on a cluster of \u2018bad\u2019 outcomes\u2014including extinction as well as less extreme bad outcomes (full list in footnote)<sup data-fn=\"e7194e7a-aacb-4d66-9e57-87410a99c386\" class=\"fn\"><a href=\"#e7194e7a-aacb-4d66-9e57-87410a99c386\" id=\"e7194e7a-aacb-4d66-9e57-87410a99c386-link\">110<\/a><\/sup>\u2014in different date ranges, disagreements shrank.\n<ul class=\"wp-block-list\">\n<li>Before 2100 and between 2100 and 2200, forecasts for one of the\nbad outcomes in this cluster occurring are within the same order of\nmagnitude (before 2100, 35% for the concerned group and 7.6% for the\nskeptic group; between 2100 and 2200, 3% for the concerned group and 12%\nfor the skeptic group).<\/li>\n\n\n\n<li>Forecasts for one of the bad outcomes in this cluster occurring\nbetween 2200 and 3023 are one order of magnitude apart (1% for the\nconcerned group and 20% for the skeptic group).<\/li>\n\n\n\n<li>Forecasts for none of these outcomes occurring in the next 1000\nyears are 60% for the concerned group and 70% for the skeptic group,\nwhich is particularly close as a factor (though the skeptic median is\nhigher than the 75th percentile concerned forecast).<\/li>\n\n\n\n<li>This suggests that both groups put significant total probability\non bad outcomes from AI in the next 1000 years (40% and 30% for\nconcerned and skeptic groups respectively), but they distribute this\nprobability differently over time, with the concerned placing most of\ntheir probability before 2100, and the skeptics spreading their\nprobability more evenly.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li>There is large disagreement on when AI will displace humans as\nthe primary force that determines what happens in the future. The\nconcerned median is 2045 and the skeptic median is 2450\u2014a 405 year\ngap.\n<ul class=\"wp-block-list\">\n<li>Three out of 11 skeptics forecast \u2018Never\u2019 for this question,\nsuggesting that they think it is &lt;50% likely that AI ever displaces\nhumans in this way.<\/li>\n\n\n\n<li>Some participants said that they did not necessarily see \u2018AI\nreplacing humans as the primary force that determines what happens in\nthe future\u2019 as a negative outcome.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"what-long-term-outcomes-from-ai-do-skeptics-expect\">What\nlong-term outcomes from AI do skeptics expect?<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">If skeptics expect \u201cpowerful AI\u201d systems (as previously defined) by\n2100, why would it take until 2450 for AI to displace humans as the\ndominant force in the world? And if skeptics place low probability on\nexistential catastrophe due to AI by 2100, what do they expect to happen\ninstead?<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We analyzed rationales and conducted three follow-up calls with\nmembers of the skeptic group to gather more information on these\nquestions.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In brief, skeptics argued:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>There may still be a \u201clong tail\u201d of highly important tasks that\nrequire humans, similar to what has happened with self-driving cars. So,\neven if AI can do &gt;95% of human cognitive tasks, many important tasks\nwill remain.<\/li>\n\n\n\n<li>Consistent with Moravec\u2019s paradox, even if AI has advanced\ncognitive abilities it will likely take longer for it to develop\nadvanced physical capabilities. And the latter would be important for\naccumulating power over resources in the physical world.<\/li>\n\n\n\n<li>AI may run out of relevant training data to be fully competitive\nwith humans in all domains. In follow-up interviews, two skeptics\nmentioned that they would update their views on AI progress if AI were\nable to train on sensory data in ways similar to humans. They expected\nthat gains from reading text would be limited.<\/li>\n\n\n\n<li>Even if powerful AI is developed, it is possible that it will not be deployed widely, because it is not cost-effective, because of societal decision-making, or for other reasons.<sup data-fn=\"93d9a769-2cca-4c3a-bfb5-9ad58d6cab4e\" class=\"fn\"><a href=\"#93d9a769-2cca-4c3a-bfb5-9ad58d6cab4e\" id=\"93d9a769-2cca-4c3a-bfb5-9ad58d6cab4e-link\">111<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">And, when it comes to outcomes from AI, skeptics tended to put more\nweight on possibilities such as:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>AI remains more \u201ctool\u201d-like than \u201cagent\u201d-like, and therefore is\nmore similar to technology like the internet in terms of its effects on\nthe world.<\/li>\n\n\n\n<li>AI is agent-like but it leads to largely positive outcomes for\nhumanity because it is adequately controlled by human systems or other\nAIs, or it is aligned with human values.<\/li>\n\n\n\n<li>AI and humans co-evolve and gradually merge in a way that does\nnot cleanly fit the resolution criteria of our forecasting\nquestions.<\/li>\n\n\n\n<li>AI leads to a major collapse of human civilization (through\nlarge-scale death events, wars, or economic disasters) but humanity\nrecovers and then either controls or does not develop AI.<\/li>\n\n\n\n<li>Powerful AI is developed but is not widely deployed, because of\ncoordinated human decisions, prohibitive costs to deployment, or some\nother reason.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"forecasts-about-transformative-economic-growth\">Forecasts about\n&#8220;transformative&#8221; economic growth<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Participants also spent time forecasting one other longer-term outcome: whether there would be \u201ctransformative economic growth\u201d (defined as &gt;15% global GDP growth in any year)<sup data-fn=\"b030c014-fe15-4361-b0b7-ccbefe3865ef\" class=\"fn\"><a href=\"#b030c014-fe15-4361-b0b7-ccbefe3865ef\" id=\"b030c014-fe15-4361-b0b7-ccbefe3865ef-link\">112<\/a><\/sup> by 2070.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">There was major disagreement about the likelihood of this occurring\namong skeptics and concerned. The concerned group median forecast of\npositive resolution was 43% (average: 41.6%; range 15%-75%), and the\nskeptic median was 2% (average: 2.7%; range 0.1%-11.2%). Notably, there\nis no overlap in their ranges.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This question had higher value of information for the concerned group\nthan any crux resolving by 2030 (median VOI: 1.4E-2; median POM VOI:\n8.93%; for comparisons to cruxes resolving by 2030, see <a href=\"#results-tables-and-figures\"><u>near-term VOI results\nsection<\/u><\/a>). It had the 11th-highest value of information for the\nskeptic group (median VOI: 4.5E-7; median POM VOI: 0.02%). It was one of\nthe strongest divergent cruxes (i.e., a crux that would lead to more\ndisagreement) between individuals in the concerned and skeptic\ngroups.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">A striking result is that\u2014independent of group\u2014the participants are nearly evenly split on whether transformative growth (defined as &gt;15% global GDP growth in any year)<sup data-fn=\"a3fda4e4-6cf8-4ad0-b54e-bb517e0d92ee\" class=\"fn\"><a href=\"#a3fda4e4-6cf8-4ad0-b54e-bb517e0d92ee\" id=\"a3fda4e4-6cf8-4ad0-b54e-bb517e0d92ee-link\">113<\/a><\/sup> by 2070 would increase or decrease the probability of existential catastrophe by 2100. Across groups, 10 forecasters predict higher AI risk conditional on positive resolution of this question, eight predict lower risk, and four predict no net effect on risk. Among the concerned group, 56% (six forecasters) think the occurrence of transformative growth decreases risk; and 44% (five) think it increases risk. Among the skeptical group, 18% (two forecasters) think transformative growth decreases risk; 36% (four) think it has no effect at all on risk; and 44% (five) think it increases risk.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Some forecasters argued that such growth would be evidence that highly powerful AIs are relatively controllable, while others argued that highly economically useful AI would be evidence of future dangerous AI.<sup data-fn=\"0e879b05-e0c5-4c57-98d6-e389cf866e51\" class=\"fn\"><a href=\"#0e879b05-e0c5-4c57-98d6-e389cf866e51\" id=\"0e879b05-e0c5-4c57-98d6-e389cf866e51-link\">114<\/a><\/sup> The likelihood of transformative growth due to AI is frequently debated,<sup data-fn=\"972bece3-98b2-4c26-b6af-b5104202455a\" class=\"fn\"><a href=\"#972bece3-98b2-4c26-b6af-b5104202455a\" id=\"972bece3-98b2-4c26-b6af-b5104202455a-link\">115<\/a><\/sup> but these results highlight that it may be valuable to shift more emphasis in future discussion to what the implications of such growth would be for risk levels.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">For additional details on participants\u2019 forecasts and rationales on\nthis question, see <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=108\" rel=\"noreferrer noopener\" target=\"_blank\"><u>Appendix 7<\/u><\/a>.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"reasons-for-long-term-disagreement\">Reasons for long-term\ndisagreement<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Based on our analysis of forecasts and rationales, some themes that\nwe think underlie the debate between the two groups are:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Timelines<\/strong>: how long will it take for AIs to\nbecome more powerful than humans, and how long will it be from the first\nsign of danger to a potential extinction event?<\/li>\n\n\n\n<li><strong>Goals that incentivize killing everyone:<\/strong>\nconditional on having advanced AI systems, how likely is it that such\nsystems would develop goals that incentivize them to cause human\nextinction?<\/li>\n<\/ul>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"timelines-for-ai-progress\">Timelines for AI Progress<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Timelines for AI progress, especially timelines until AI is more\nadvanced than humans in all relevant domains, seem to be an important\ndriver of disagreement. When participants discussed questions related to\ntimelines, a number of themes emerged in their arguments:<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Main arguments from the skeptic group:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Fundamental breakthroughs in AI development would be necessary to create AI capable of causing extinction.<sup data-fn=\"9995feaa-cc23-4a36-bb2f-d88ad15b837e\" class=\"fn\"><a href=\"#9995feaa-cc23-4a36-bb2f-d88ad15b837e\" id=\"9995feaa-cc23-4a36-bb2f-d88ad15b837e-link\">116<\/a><\/sup><\/li>\n\n\n\n<li>Developing powerful new AI technology will take more time than expected for planning fallacy-like reasons.<sup data-fn=\"2adfceec-3d11-4ae7-aa83-d57a84c67949\" class=\"fn\"><a href=\"#2adfceec-3d11-4ae7-aa83-d57a84c67949\" id=\"2adfceec-3d11-4ae7-aa83-d57a84c67949-link\">117<\/a><\/sup><\/li>\n\n\n\n<li>AI powerful enough to cause extinction would require significant advances in robotics which are unlikely to happen by 2100.<sup data-fn=\"eddf0c0a-9361-4ff2-89af-9161fef47b2f\" class=\"fn\"><a href=\"#eddf0c0a-9361-4ff2-89af-9161fef47b2f\" id=\"eddf0c0a-9361-4ff2-89af-9161fef47b2f-link\">118<\/a><\/sup><\/li>\n\n\n\n<li>Even once sufficiently powerful AI is developed, there will be a lag for deployment and adoption.<sup data-fn=\"4e7222c4-e765-4def-a635-c2b0b3bcb628\" class=\"fn\"><a href=\"#4e7222c4-e765-4def-a635-c2b0b3bcb628\" id=\"4e7222c4-e765-4def-a635-c2b0b3bcb628-link\">119<\/a><\/sup><\/li>\n\n\n\n<li>AGIs will want to prevent the development of deadly AGIs.<sup data-fn=\"dddd4a0b-e25a-49f1-9436-eaf6b3ba1787\" class=\"fn\"><a href=\"#dddd4a0b-e25a-49f1-9436-eaf6b3ba1787\" id=\"dddd4a0b-e25a-49f1-9436-eaf6b3ba1787-link\">120<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Main arguments from the concerned group:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Combining and\/or extending existing ML methods may be sufficient for achieving AI that poses an existential risk.<sup data-fn=\"2bf9b071-6ae4-44e0-ae4b-837b8f63f3b8\" class=\"fn\"><a href=\"#2bf9b071-6ae4-44e0-ae4b-837b8f63f3b8\" id=\"2bf9b071-6ae4-44e0-ae4b-837b8f63f3b8-link\">121<\/a><\/sup><\/li>\n\n\n\n<li>Once human-level AGI is developed, it will rapidly speed up further AI progress as it will operate more efficiently (in terms of both time and money) than humans.<sup data-fn=\"875e779b-66ee-4baa-a1aa-7e196ebdcf65\" class=\"fn\"><a href=\"#875e779b-66ee-4baa-a1aa-7e196ebdcf65\" id=\"875e779b-66ee-4baa-a1aa-7e196ebdcf65-link\">122<\/a><\/sup><\/li>\n\n\n\n<li>Robots won\u2019t be necessary for an AI to interact with the physical world. This could be done through humans, and\/or through computer systems.<sup data-fn=\"3d2a58e1-cbd9-4d27-b7c9-ad2ddb69653e\" class=\"fn\"><a href=\"#3d2a58e1-cbd9-4d27-b7c9-ad2ddb69653e\" id=\"3d2a58e1-cbd9-4d27-b7c9-ad2ddb69653e-link\">123<\/a><\/sup><\/li>\n\n\n\n<li>Current progress is fast,<sup data-fn=\"76a476c7-166d-4d0b-82d7-40bbdbfc6599\" class=\"fn\"><a href=\"#76a476c7-166d-4d0b-82d7-40bbdbfc6599\" id=\"76a476c7-166d-4d0b-82d7-40bbdbfc6599-link\">124<\/a><\/sup> faster than predicted,<sup data-fn=\"2ec76dfd-871e-4602-9459-5af14147ec21\" class=\"fn\"><a href=\"#2ec76dfd-871e-4602-9459-5af14147ec21\" id=\"2ec76dfd-871e-4602-9459-5af14147ec21-link\">125<\/a><\/sup> and set to continue.<sup data-fn=\"5a51bdcc-37a6-40a9-8824-70685f9b391a\" class=\"fn\"><a href=\"#5a51bdcc-37a6-40a9-8824-70685f9b391a\" id=\"5a51bdcc-37a6-40a9-8824-70685f9b391a-link\">126<\/a><\/sup><\/li>\n\n\n\n<li>AI development will speed up AI development.<sup data-fn=\"394befc3-3ed5-4d5d-82aa-8389581aa618\" class=\"fn\"><a href=\"#394befc3-3ed5-4d5d-82aa-8389581aa618\" id=\"394befc3-3ed5-4d5d-82aa-8389581aa618-link\">127<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Many of these arguments hinge on the question of how comprehensive advanced AI\u2019s abilities will be. The skeptics think that existing technology is enough to make powerful AI highly likely this century,<sup data-fn=\"9a7fc44d-44c0-4cd3-878c-77513efef4b9\" class=\"fn\"><a href=\"#9a7fc44d-44c0-4cd3-878c-77513efef4b9\" id=\"9a7fc44d-44c0-4cd3-878c-77513efef4b9-link\">128<\/a><\/sup> but they expect that being more capable than humans in all relevant domains will take longer.<sup data-fn=\"00adbfc4-3863-4ea6-885e-6d566df614c8\" class=\"fn\"><a href=\"#00adbfc4-3863-4ea6-885e-6d566df614c8\" id=\"00adbfc4-3863-4ea6-885e-6d566df614c8-link\">129<\/a><\/sup> The concerned group, on the other hand, believe that the transition from powerful AI to a dangerous superintelligence could happen more quickly.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In previous sections, we discussed cruxes related to timelines for\nhighly advanced AI, such as the groups\u2019 disagreement about <a href=\"#forecasts-about-transformative-economic-growth\"><u>the\nprobability of AI-caused transformative economic growth<\/u><\/a>. Other\nquestions related to the development of highly advanced AI capabilities\ninclude:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" rel=\"noreferrer noopener\" target=\"_blank\"><u>Power-seeking:<\/u><\/a> By 2030, there\nis a significant concrete example of power-seeking behavior from a large\nstate-of-the-art model: demonstration of actual capability for sustained\ndeception of human experts.<\/li>\n<\/ul>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td colspan=\"7\"><strong>Power-seeking<\/strong><\/td><\/tr><tr><td><strong>Group<\/strong><\/td><td><strong>VOI (POM)<\/strong><\/td><td><strong>P(c)<\/strong><\/td><td><strong>P(U) initial<\/strong><\/td><td><strong>P(U|c)<\/strong><\/td><td><strong>P(U|\u00acc)<\/strong><\/td><td><strong>VOD (POM)<\/strong><\/td><\/tr><tr><td>Zoe (Median Concerned)<\/td><td>1.4E-4 (0.06%)<\/td><td>10%<\/td><td>21%<\/td><td>18%<\/td><td>21%<\/td><td rowspan=\"2\">\u22123.6E-4 (\u22120.08%)<\/td><\/tr><tr><td>Blake (Median Skeptic)<\/td><td>4.7E-7 (0.01%)<\/td><td>10%<\/td><td>0.20%<\/td><td>0.22%<\/td><td>0.20%<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 14:<\/strong> Forecasts and VOD between the median (by VOI) participant from each group for the question<br>\u201cPower-seeking.\u201d<\/figcaption><\/figure>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" rel=\"noreferrer noopener\" target=\"_blank\"><u>AI writes AI<\/u><\/a>: By 2030, an AI,\nwithout being asked, writes AI software that is better at writing AI\nsoftware, and deploys it without asking for permission<\/li>\n<\/ul>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td colspan=\"7\"><strong>AI writes AI<\/strong><\/td><\/tr><tr><td><strong>Group<\/strong><\/td><td><strong>VOI (POM)<\/strong><\/td><td><strong>P(c)<\/strong><\/td><td><strong>P(U) initial<\/strong><\/td><td><strong>P(U|c)<\/strong><\/td><td><strong>P(U|\u00acc)<\/strong><\/td><td><strong>VOD (POM)<\/strong><\/td><\/tr><tr><td>Pascal (Median Concerned)<\/td><td>8.6E-4 (0.66%)<\/td><td>75%<\/td><td>9%<\/td><td>10%<\/td><td>6%<\/td><td rowspan=\"2\">3.9E-2 (21.71%)<\/td><\/tr><tr><td>Ash (Median Skeptic)<\/td><td>9.1E-7 (0.03%)<\/td><td>10%<\/td><td>0.1%<\/td><td>0.12%<\/td><td>0.0978%<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 15:<\/strong> Forecasts and VOD between the median (by VOI) participant from each group for the question \u201cAI writes AI.\u201d<\/figcaption><\/figure>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\"><u>AI Robotics<\/u><\/a>: By January 1, 2030, an AI system &#8220;Has general robotic capabilities, of the type able to autonomously, when equipped with appropriate actuators and when given human-readable instructions, satisfactorily assemble a (or the equivalent of a) circa-2021 Ferrari 312 T4 1:8 scale automobile model. A single demonstration of this ability, or a sufficiently similar demonstration, will be considered sufficient.&#8221;<sup data-fn=\"b800da4e-64a5-4684-b4db-19dbc5c5f949\" class=\"fn\"><a href=\"#b800da4e-64a5-4684-b4db-19dbc5c5f949\" id=\"b800da4e-64a5-4684-b4db-19dbc5c5f949-link\">130<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td colspan=\"7\"><strong>AI Robotics<\/strong><\/td><\/tr><tr><td><strong>Group<\/strong><\/td><td><strong>VOI (POM)<\/strong><\/td><td><strong>P(c)<\/strong><\/td><td><strong>P(U) initial<\/strong><\/td><td><strong>P(U|c)<\/strong><\/td><td><strong>P(U|\u00acc)<\/strong><\/td><td><strong>VOD (POM)<\/strong><\/td><\/tr><tr><td>Yael (Median Concerned)<\/td><td>8.9E-4 (0.44%)<\/td><td>33.00%<\/td><td>17.50%<\/td><td>21.00%<\/td><td>15.78%<\/td><td rowspan=\"2\">\u22122.2E-2 (\u221210.36%)<\/td><\/tr><tr><td>Flint (Median Skeptic)<\/td><td>4.0E-19 (0.00%)<\/td><td>75.00%<\/td><td>1.10%<\/td><td>1.10%<\/td><td>1.10%<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 16:<\/strong> Forecasts and VOD between the median (by VOI) participant from each group for the question \u201cAI Robotics.\u201d<\/figcaption><\/figure>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"goals-that-incentivize-killing-everyone\">Goals that incentivize\nkilling everyone<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Based on our question ranking and analysis of participants\u2019 comments,\nwe think that the question of how likely it is that a capable AI system\nwould develop dangerous goals is behind a significant amount of the\ndisagreement between the two groups. As discussed above, both groups\nagree that they expect to see powerful AI this century. But they\ndisagree strongly about whether that is likely to be dangerous.\nConcerned participants tended to think that a sufficiently advanced AI\nsystem would be very likely to develop dangerous goals, including both\ngoals where killing humans is an intended outcome of a plan and ones\nwhere it is an acceptable price for an AI achieving a different goal.\nSkeptical participants tended to agree that dangerous goals are\npossible, but did not think there were compelling reasons to believe\nthey are much more likely than other goals.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">One of the highest-ranked questions was about capabilities that are not necessarily dangerous in and of themselves, but that would make an AI more effective at pursuing a wide variety of goals, including dangerous ones: whether METR would determine by 2030 that AI models could replicate, acquire resources, and evade deactivation.<sup data-fn=\"5957bd2d-d525-49be-b059-900ed63e366c\" class=\"fn\"><a href=\"#5957bd2d-d525-49be-b059-900ed63e366c\" id=\"5957bd2d-d525-49be-b059-900ed63e366c-link\">131<\/a><\/sup> The groups strongly disagreed about how likely this is to occur: the median skeptic forecast was 1% and median concerned forecast was 25%.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">For the median skeptic and the median concerned people, by VOI, Flint\n(Skeptical) and Riley (Concerned):<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Flint believes there is only a 1% chance that this ARC Evals\n(METR) question resolves positively. When asked to forecast P(AI\nexistential catastrophe by 2100) conditional on this question, Flint\nwould forecast 1.30% if it resolves positively and 1.10% if it resolves\nnegatively (compared to their unconditional 1.10%).<\/li>\n\n\n\n<li>Riley believes there is a 55% chance that this resolves\npositively. They would forecast 35% if it resolves positively and 23.89%\nif it resolves negatively (compared to their unconditional\n30%).<\/li>\n\n\n\n<li>This question (\u201cPlatform: ARC Evals\u201d) resolves 23.19% of this pair\u2019s disagreement in expectation.<sup data-fn=\"3d003ed9-5b67-4c9d-aed5-f36f2a3cedc7\" class=\"fn\"><a href=\"#3d003ed9-5b67-4c9d-aed5-f36f2a3cedc7\" id=\"3d003ed9-5b67-4c9d-aed5-f36f2a3cedc7-link\">132<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Below, we provide a variety of arguments from participants about how\nlikely it is that AI systems will develop dangerous goals.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Main arguments from the skeptic group:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>The set of possible goals is very large, and goals that benefit from the eradication of humans are a small portion of the overall set.<sup data-fn=\"192f6dc6-ae7c-47bb-87c9-5feeba837f10\" class=\"fn\"><a href=\"#192f6dc6-ae7c-47bb-87c9-5feeba837f10\" id=\"192f6dc6-ae7c-47bb-87c9-5feeba837f10-link\">133<\/a><\/sup>\n<ul class=\"wp-block-list\">\n<li>It\u2019s possible that future AI systems are indifferent to humans, and if so it seems unlikely that they would try to cause extinction.<sup data-fn=\"d2d04d89-4be5-423e-bcc8-a6ccf0b4da0a\" class=\"fn\"><a href=\"#d2d04d89-4be5-423e-bcc8-a6ccf0b4da0a\" id=\"d2d04d89-4be5-423e-bcc8-a6ccf0b4da0a-link\">134<\/a><\/sup><\/li>\n<\/ul>\n<\/li>\n\n\n\n<li>Instrumental convergence and extreme power-seeking seem like possible characteristics of AI systems, but they have not been empirically demonstrated. Theoretical arguments demonstrate the <em>possibility<\/em> of dangerous instrumental convergence, but not that these outcomes are <em>likely.<\/em><sup data-fn=\"17722f6b-aec1-480e-98ea-005725d8ac22\" class=\"fn\"><a href=\"#17722f6b-aec1-480e-98ea-005725d8ac22\" id=\"17722f6b-aec1-480e-98ea-005725d8ac22-link\">135<\/a><\/sup><\/li>\n\n\n\n<li>Deception and violence are both costly behaviors that may not actually be instrumentally convergent.<sup data-fn=\"9680a5c3-5651-4637-85ca-3cbad928d416\" class=\"fn\"><a href=\"#9680a5c3-5651-4637-85ca-3cbad928d416\" id=\"9680a5c3-5651-4637-85ca-3cbad928d416-link\">136<\/a><\/sup><\/li>\n\n\n\n<li>AI systems will be built using human-centered data and so are likely to learn human values.<sup data-fn=\"76d1a329-1849-424a-b4aa-b0e85bdf0cdc\" class=\"fn\"><a href=\"#76d1a329-1849-424a-b4aa-b0e85bdf0cdc\" id=\"76d1a329-1849-424a-b4aa-b0e85bdf0cdc-link\">137<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Main arguments from the concerned group:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Instrumental convergence may arise even when an agent\u2019s goals are bounded. It would be difficult to specify constraints that avoid instrumental convergence.<sup data-fn=\"7f5995de-defd-43f7-8b81-2523a5003a48\" class=\"fn\"><a href=\"#7f5995de-defd-43f7-8b81-2523a5003a48\" id=\"7f5995de-defd-43f7-8b81-2523a5003a48-link\">138<\/a><\/sup><\/li>\n\n\n\n<li>It seems likely that, eventually, an AI with an unbounded goal will be developed, and systems with bounded goals will have limited ability to prevent the actions of an unbounded system.<sup data-fn=\"56c6a65e-8299-4c0c-a626-6a5a513f391f\" class=\"fn\"><a href=\"#56c6a65e-8299-4c0c-a626-6a5a513f391f\" id=\"56c6a65e-8299-4c0c-a626-6a5a513f391f-link\">139<\/a><\/sup><\/li>\n\n\n\n<li>Catastrophic goal misgeneralization can occur, which could result in an AI trained on a safe goal developing an unsafe goal when outside its training environment, with catastrophic consequences.<sup data-fn=\"a004bd36-eb6c-416f-9ee6-a920b8c7007d\" class=\"fn\"><a href=\"#a004bd36-eb6c-416f-9ee6-a920b8c7007d\" id=\"a004bd36-eb6c-416f-9ee6-a920b8c7007d-link\">140<\/a><\/sup><\/li>\n\n\n\n<li>Most goals an AI might have are likely to benefit from human\nextinction:\n<ul class=\"wp-block-list\">\n<li>Humans might be a threat to AI systems\u2019 control of resources.<sup data-fn=\"593f2a28-0108-430a-a45f-8990c33d5bd4\" class=\"fn\"><a href=\"#593f2a28-0108-430a-a45f-8990c33d5bd4\" id=\"593f2a28-0108-430a-a45f-8990c33d5bd4-link\">141<\/a><\/sup><\/li>\n\n\n\n<li>AI pursuing a wide range of goals might cause human extinction as a side effect of using up Earth\u2019s resources to meet its goal.<sup data-fn=\"0b0fbcc1-d8da-4704-a5ae-d335c606fb7b\" class=\"fn\"><a href=\"#0b0fbcc1-d8da-4704-a5ae-d335c606fb7b\" id=\"0b0fbcc1-d8da-4704-a5ae-d335c606fb7b-link\">142<\/a><\/sup><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Relevant high-VOI cruxes related to the prevalence of dangerous goals\nin AI, in addition to the previously mentioned \u201cARC Evals\u201d crux,\ninclude:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\"><u>Power-seeking shutdown<\/u><\/a>: &#8220;AI developers attempt (of their own volition, or under compulsion by a regulatory authority) to disable or destroy an AI system of their own creation, which cost at least $10M to develop, after the AI system displays at least one instance of \u201cpower-seeking behavior.\u201d<sup data-fn=\"730fc20a-9b28-4413-b0e9-61e94009ccfb\" class=\"fn\"><a href=\"#730fc20a-9b28-4413-b0e9-61e94009ccfb\" id=\"730fc20a-9b28-4413-b0e9-61e94009ccfb-link\">143<\/a><\/sup><\/li>\n\n\n\n<li><a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" rel=\"noreferrer noopener\" target=\"_blank\"><u>Alignment solution<\/u><\/a>: By 2030,\nthere is a &#8220;Theoretical in-principle solution to the alignment problem\nthat most people who thought about this carefully agree should work.\u201d\nThis will be resolved by a panel of experts of the &#8220;AI concerned&#8221; team&#8217;s\nchoosing.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"hypothesis-4-do-the-groups-have-fundamental-worldview-disagreements-that-go-beyond-ai\">Hypothesis\n#4: Do the groups have fundamental worldview disagreements that go\nbeyond AI?<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Throughout the project, we noticed that many of the disagreements between the AI skeptics and AI concerned participants didn\u2019t pertain only to AI but were rooted in more fundamental issues. These included disagreements about what kinds of evidence are reliable, how to think about reference classes for unusual events, and how various social and political systems interact with one another.<sup data-fn=\"b3c7cee2-9b62-4590-8369-771cab42f663\" class=\"fn\"><a href=\"#b3c7cee2-9b62-4590-8369-771cab42f663\" id=\"b3c7cee2-9b62-4590-8369-771cab42f663-link\">144<\/a><\/sup> These deep worldview disagreements are not addressed directly by AI forecasting questions, but understanding them is still valuable for determining what might be driving fundamental disagreements on this topic. If we could understand these differences in worldview, perhaps we could use that information to build a deeper understanding of why these two groups continue to disagree about AI, even after discussion and consideration.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">While a detailed analysis of broader worldview differences is beyond\nthe scope of this project, we offer some observations about\nparticipants\u2019 reasoning that shed light on these disagreements. For\nexample, we can see these worldview differences in how each group\ninterprets \u201cextraordinary claims.\u201d Both groups agree that \u201cextraordinary\nclaims require extraordinary evidence,\u201d but they disagree about which\nclaims are extraordinary. Is it extraordinary to believe that AI will\nkill all of humanity when humanity has been around for hundreds of\nthousands of years, or is it extraordinary to believe that humanity\nwould continue to survive alongside smarter-than-human AI?<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">AI skeptics tended to focus on the general difficulty of correctly\nanticipating complex future outcomes. Examples of fundamental beliefs\nwhich seem more common among the AI skeptic group:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Because the world is complex, the future is unlikely to unfold as theories and models expect.<sup data-fn=\"e62176ff-8476-4d14-bef5-1c39e354764f\" class=\"fn\"><a href=\"#e62176ff-8476-4d14-bef5-1c39e354764f\" id=\"e62176ff-8476-4d14-bef5-1c39e354764f-link\">145<\/a><\/sup><\/li>\n\n\n\n<li>A long chain of specific things needs to go wrong for humanity to perish in the transition to advanced AI; long chains of specific outcomes are unlikely to happen.<sup data-fn=\"6b5a9d21-8bfa-4aa0-baff-157ff40650d3\" class=\"fn\"><a href=\"#6b5a9d21-8bfa-4aa0-baff-157ff40650d3\" id=\"6b5a9d21-8bfa-4aa0-baff-157ff40650d3-link\">146<\/a><\/sup>\n<ul class=\"wp-block-list\">\n<li>Three skeptics listed this as their number one disagreement with the concerned group in the postmortem survey, and it also emerged as a strong theme when <a href=\"#understanding-each-others-arguments\"><u>we asked participants to summarize<\/u><\/a> the three strongest arguments from each group.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li>Complex processes (like technological development, deployment, and societal change) take a long time, which makes transformative developments less likely by 2100.<sup data-fn=\"718d8c3a-ae88-4f06-8afe-c49be4b89cbe\" class=\"fn\"><a href=\"#718d8c3a-ae88-4f06-8afe-c49be4b89cbe\" id=\"718d8c3a-ae88-4f06-8afe-c49be4b89cbe-link\">147<\/a><\/sup><\/li>\n\n\n\n<li>Thinking about AI capabilities in isolation is misleading in estimating risk, as human responses to AI will also be very important in determining outcomes.<sup data-fn=\"5f64f8d7-e8f6-45bf-bf7a-32fb301cd899\" class=\"fn\"><a href=\"#5f64f8d7-e8f6-45bf-bf7a-32fb301cd899\" id=\"5f64f8d7-e8f6-45bf-bf7a-32fb301cd899-link\">148<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">The AI concerned group tended to focus on features of the AI risk\ncase that they argue make it different from most other forecasting\nproblems. Some examples of fundamental beliefs which seem more common\namong the AI risk concerned group:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>AI will change the world so radically that base rates are not a helpful guide to forecasting many of these questions.<sup data-fn=\"a47ce63c-1891-447a-931d-3a86d0b41540\" class=\"fn\"><a href=\"#a47ce63c-1891-447a-931d-3a86d0b41540\" id=\"a47ce63c-1891-447a-931d-3a86d0b41540-link\">149<\/a><\/sup><\/li>\n\n\n\n<li>A long chain of specific things need to go right for humanity to survive the transition to advanced AI; long chains of specific outcomes are unlikely to happen.<sup data-fn=\"139c405f-910d-4a52-8c1c-4d9e77d05ea2\" class=\"fn\"><a href=\"#139c405f-910d-4a52-8c1c-4d9e77d05ea2\" id=\"139c405f-910d-4a52-8c1c-4d9e77d05ea2-link\">150<\/a><\/sup><\/li>\n\n\n\n<li>The case for extinction is intuitive.<sup data-fn=\"acb186c4-cf64-46cd-96f4-25d8387e372a\" class=\"fn\"><a href=\"#acb186c4-cf64-46cd-96f4-25d8387e372a\" id=\"acb186c4-cf64-46cd-96f4-25d8387e372a-link\">151<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">The differences in what each group considers good evidence is\nreflected in the varying importance they assign to members of their own\ngroup changing their minds. \u201cSupers changing minds\u201d is the skeptic\ngroup\u2019s highest median VOI question at about 1.15% of their theoretical\nmaximum VOI. In other words, the most influential factor for them would\nbe learning that superforecasters have become concerned about AI risks.\nConversely, for the concerned group, the same question captures only\n0.43% of their maximum theoretical VOI.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The difference is even starker in the other direction. For the\nconcerned group, \u201cAlignment researchers changing minds\u201d ranks as their\nsecond-highest VOI question, and captures 2.43% of their maximum\npossible VOI for that question. In contrast, this question is 0%\ninformative to the median skeptic.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Most likely, participants were not interpreting those questions\ncausally: they probably were not saying that they would change their\nminds <em>because<\/em> other people did, but rather treating other\npeople changing their minds as evidence about what has happened by 2030.\nBoth groups think that, if people whose reasoning they trust changed\ntheir minds, there is probably evidence that would convince them, too,\nbut the same does not hold true for people whose reasoning they don\u2019t\ntrust. If the concerned participants think that the skeptics\u2019 reasoning\nis flawed today, then they can also imagine similar people in 2030\nchanging their minds for reasons that are uncompelling to the concerned\npeople of 2030, and vice versa.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Similarly, the two groups do not trust one another\u2019s reasoning enough\nto update very much on each other\u2019s opinions. This may not be\nsurprising: they started with different priors, and then did not get\nvery much new evidence about what will happen with AI from mere\ndiscussions and reading comments online. But it is evidence that their\ndisagreements extend beyond AI-related facts. If the disagreement were\nsolely based on AI-related facts, we would expect people who disagree\nonly about such facts to change their minds if they learned a new\nfact.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">These differences mean that the groups often talk past each other, in\nways that may be frustrating for people deeply embedded in one side\u2019s\nform of reasoning. An AI concerned reader hoping to find out why\nskeptics disagree may be disappointed to see few specific refutations of\nAI risk arguments in this report, and to instead see skeptics\nreiterating that predicting the long-term future is hard. And AI\nskeptical readers may have a parallel experience, seeing that the\nconcerned group often focuses on theoretical arguments and does not\nalways have answers to specific questions about how exactly they expect\nthreats to manifest.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We do not know <em>why<\/em> the two groups disagree about these bigger questions. Why do some people think that theoretical arguments with multiple steps of logic are the best way to predict novel events, while others rely on reference classes that predict major changes are likely to be more gradual? Everyone agrees that each of these modes of reasoning can fail. The AI concerned group knows that many people have, historically, predicted huge societal changes from technologies that turned out to be relatively unimportant, and that theoretical arguments that seem convincing sometimes do not come true as events unfold. The skeptics know that there are no perfect reference classes, especially for unusual events,<sup data-fn=\"719f8cf0-c649-459e-907b-478b9db91f04\" class=\"fn\"><a href=\"#719f8cf0-c649-459e-907b-478b9db91f04\" id=\"719f8cf0-c649-459e-907b-478b9db91f04-link\">152<\/a><\/sup> and that major changes do sometimes happen quickly. But members of each group nonetheless are more likely to default to one mode of reasoning or another. They disagree about how to apply the relevant heuristics and reference classes in this case. These differences may be based on a combination of AI-related knowledge, professional training, personality, social incentives, and other factors.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"limitations-of-our-research\">Limitations of our research<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Limitations of our research include:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>We asked participants to complete an extremely difficult task: forecasting technological change on long time horizons. There is no evidence that anyone can do this well. Most previous evidence on judgmental forecasting applies to geopolitical forecasts on 0-2 year time horizons.<sup data-fn=\"f72969e7-3a43-4ea4-950a-70a46a1b6a02\" class=\"fn\"><a href=\"#f72969e7-3a43-4ea4-950a-70a46a1b6a02\" id=\"f72969e7-3a43-4ea4-950a-70a46a1b6a02-link\">153<\/a><\/sup><\/li>\n\n\n\n<li>We also do not know if people are well-calibrated or accurate\nwhen making conditional forecasts of the kind we elicited in this\nproject. Little evidence on these kinds of forecasts exists. There are\nsome reasons to believe that these forecasts are not robust:\n<ul class=\"wp-block-list\">\n<li>The concerned group\u2019s forecasts on the \u201cescalating warning shots\u201d question changed substantially when they were asked to spend approximately one hour forecasting it rather than approximately 10 minutes.<sup data-fn=\"8c3025ce-f5b1-4232-88eb-6ab18be184be\" class=\"fn\"><a href=\"#8c3025ce-f5b1-4232-88eb-6ab18be184be\" id=\"8c3025ce-f5b1-4232-88eb-6ab18be184be-link\">154<\/a><\/sup><\/li>\n\n\n\n<li>Some conditional forecasts were logically incoherent. In total we dropped thirteen observations due to incoherence (2% of the total). See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=107\" target=\"_blank\" rel=\"noreferrer noopener\"><u>Appendix 6<\/u><\/a> for details.<\/li>\n\n\n\n<li>Intuitively, conditional forecasting seems difficult. Our team\noften finds generating and understanding forecasts on these questions to\nbe challenging, so we would expect others to also.<\/li>\n\n\n\n<li>Conditional forecasts do not have clear feedback loops or\npotential for accountability in the way that standard resolvable\nforecasts do.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li>The forecasters in our project often emphasized that their\nforecasts felt extremely speculative to them and that they have low\nconfidence in their views.<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li>There may be inconsistency between how people would say they\u2019ll update based on particular conditions and how they\u2019ll actually update. There is some evidence for this from the project already. Concerned forecasters often did not expect to update much on cruxes related to particular policies being implemented.<sup data-fn=\"fbc8d87e-2943-41cf-999d-54a4735bc133\" class=\"fn\"><a href=\"#fbc8d87e-2943-41cf-999d-54a4735bc133\" id=\"fbc8d87e-2943-41cf-999d-54a4735bc133-link\">155<\/a><\/sup> However, a few concerned participants substantially updated their views on AI existential risk during the project due to increased policy attention on AI risk in April and May 2023.<sup data-fn=\"4ff6d11e-158b-4065-bfca-eddd234e2a31\" class=\"fn\"><a href=\"#4ff6d11e-158b-4065-bfca-eddd234e2a31\" id=\"4ff6d11e-158b-4065-bfca-eddd234e2a31-link\">156<\/a><\/sup> These seem inconsistent.<\/li>\n\n\n\n<li>As previously noted, we acknowledge that there are two ways to interpret this forecasting exercise: either as asking for your all-else-equal forecast (i.e. how would this crux resolving positively <em>causally influence<\/em> the probability of existential catastrophe, if you could isolate the effect of the crux) or your all-things-considered forecast (i.e. taking into account what this crux resolving positively may tell you about the world in 2030). Based on their rationales and discussions, we believe most participants were doing the latter.<sup data-fn=\"f2dc5338-f1eb-4c75-bb9b-8fae791d2da4\" class=\"fn\"><a href=\"#f2dc5338-f1eb-4c75-bb9b-8fae791d2da4\" id=\"f2dc5338-f1eb-4c75-bb9b-8fae791d2da4-link\">157<\/a><\/sup> We therefore cannot make many claims about whether participants think the specific event described in the crux would be good or bad for AI risk all-else-equal.<sup data-fn=\"6168dfb5-33fa-47eb-8ece-06e80a399a5a\" class=\"fn\"><a href=\"#6168dfb5-33fa-47eb-8ece-06e80a399a5a\" id=\"6168dfb5-33fa-47eb-8ece-06e80a399a5a-link\">158<\/a><\/sup><\/li>\n\n\n\n<li>Many crux questions are not robustly better than others when accounting for uncertainty analysis (see <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=99\" target=\"_blank\" rel=\"noreferrer noopener\"><u>Appendix 3<\/u><\/a>).<\/li>\n\n\n\n<li>Even within groups, people disagree substantially about the\ncruxes. This suggests that we are not measuring two sets of views about\nAI risk (concerned and skeptical), but many. This makes it hard to draw\nbroad conclusions.<\/li>\n\n\n\n<li>Participants\u2019 expectations likely affect how they interpret potential cruxes. For example, if we asked a question like \u201cWill an AI resist being shut down?\u201d, participants might make different conditional updates depending on their expectations about AI. Conditional on this question resolving positively, a participant who thinks that AIs are likely to be dangerous might think about a range of possible resolutions that includes dangerous ones, like an AI that resists powerful governments trying to turn it off, and therefore might have a much higher P(U) conditional on it resolving positively. A participant who thinks dangerous AI is very unlikely might expect that nearly all positive resolutions are more innocuous ones, in which the resolution criteria are only technically true, and therefore might not update very much. This could make it look like they have a large disagreement about how to update conditional on this question, even if they would actually make the same update conditional on the same actual event. Better operationalization may mitigate this problem, but will not eliminate it fully.<sup data-fn=\"11fba637-7c12-4d49-9ac1-c01ef0f5aecd\" class=\"fn\"><a href=\"#11fba637-7c12-4d49-9ac1-c01ef0f5aecd\" id=\"11fba637-7c12-4d49-9ac1-c01ef0f5aecd-link\">159<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"conclusion-and-next-steps\">Conclusion and Next Steps<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Overall, this project made progress on the original questions we set\nout to study, but there is substantial room for further research.<\/p>\n\n\n\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-constrained wp-block-group-is-layout-constrained\">\n<h4 class=\"wp-block-heading\" id=\"in-short\">In short:<\/h4>\n\n\n\n<ul class=\"wp-block-list\">\n<li>We see this project as providing strong evidence that\ndisagreements about AI risk are not attributable to lack of engagement\namong participants, low quality of experts willing to participate in\nforecasting studies, or because the skeptic and concerned groups do not\nunderstand each other&#8217;s arguments.<\/li>\n\n\n\n<li>We identified some areas of notable disagreement that can be\nresolved by 2030, but most of the disagreement about AI risk by 2100 is\nnot explained by the shorter term indicators examined in this\nproject.<\/li>\n\n\n\n<li>We found substantial evidence that disagreements about AI risk\ndecreased between the groups when considering longer time horizons and a\nbroader swathe of severe negative outcomes from AI than extinction or\ncivilizational collapse.<\/li>\n\n\n\n<li>The groups seem to have some fundamental worldview disagreements\nthat go beyond AI, such as how much weight to put on theoretical models\nthat have not yet seen substantial empirical verification.<\/li>\n<\/ul>\n<\/div><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">We also believe that this project has made other contributions to the\nAI discourse. For example, we have provided better examples of\ndiscussion between disagreeing AI forecasters than have existed\npreviously; see summaries of arguments <a href=\"#understanding-each-others-arguments\"><u>here<\/u><\/a> and sample\nback-and-forths between participants <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=114\" rel=\"noreferrer noopener\" target=\"_blank\"><u>here<\/u><\/a>. We also believe this project has\nestablished stronger metrics for evaluating the quality of AI\nforecasting questions than have existed previously. We invite readers to\nsee if they can generate cruxes that outperform the top cruxes generated\nby our project.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In addition to our conclusions about the AI risk debate, we also\ndeveloped new strategies for navigating some of the difficulties in\neliciting and analyzing conditional forecasts, and we hope to release a\nmethods-focused report in the future.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"directions-for-further-research\">Directions for further\nresearch<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">We see many other projects that could extend the research begun here\nto improve dialogue about AI risk and inform policy responses to AI.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Examples of remaining questions and future research projects\ninclude:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Are there high-value 2030 cruxes that others can identify?\n<ul class=\"wp-block-list\">\n<li>We were hoping to identify cruxes that would, in expectation,\nlead to a greater reduction in disagreement than the ones we ultimately\ndiscovered. We are interested to see whether readers of this report can\npropose higher value cruxes.<\/li>\n\n\n\n<li>If people disagree a lot, it is likely that no single question\nwould significantly reduce their disagreement in expectation. If such a\nquestion existed, they would already disagree less. However, there might\nstill be better crux questions than the ones we have identified so\nfar.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li>What explains the gap in skeptics\u2019 timelines between \u201cpowerful AI\u201d and AI that replaces humanity as the driving force of the future? In other words, what are the skeptics\u2019 views on timelines until superintelligent AI (suitably defined)? A preliminary answer is <a href=\"#what-long-term-outcomes-from-ai-do-skeptics-expect\"><u>above<\/u><\/a>, but more research is needed.<\/li>\n\n\n\n<li>To what extent are different \u201cstories\u201d of how AI development goes\nwell or poorly important within each group?\n<ul class=\"wp-block-list\">\n<li>The skeptic and concerned groups are not monoliths: within each\ngroup, people disagree about what the most likely AI dangers are, in\naddition to how likely those dangers are to happen.<\/li>\n\n\n\n<li>Future work could try to find these schools of thought and see\nhow their stories do or do not affect their forecasts.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li>Would future adversarial collaborations be more successful if\nthey focused on a smaller number of participants who work particularly\nwell together and provided them with teams of researchers and other aids\nto support them?<\/li>\n\n\n\n<li>Would future adversarial collaborations be more successful if\nparticipants invested more time in an ongoing way, did additional\nbackground research, and spent time with each other in person, among\nother ways of increasing the intensity of engagement?<\/li>\n\n\n\n<li>How can we better understand what social and personality factors\nmay be driving views on AI risk?\n<ul class=\"wp-block-list\">\n<li>Some evidence from this project suggests that there may be\npersonality differences between skeptics and concerned participants. In\nparticular, skeptics tended to spend more time on each question, were\nmore likely to complete tasks by requested deadlines, and were highly\ncommunicative by email, suggesting they may be more conscientious. Some\nearly reviewers of this report have hypothesized that the concerned\ngroup may be higher on openness to experience. We would be interested in\nstudying the influence of conscientiousness, openness, or other\npersonality traits on forecasting preferences and accuracy.<\/li>\n\n\n\n<li>We are also interested in investigating whether the differences\nbetween the skeptics and concerned group regarding how much weight to\nplace on theoretical arguments with multiple steps of logic would\npersist in other debates, and whether it is related to professional\ntraining, personality traits, or any other factors, as well as whether\nthere is any correlation between trust in theoretical arguments and\nforecasting accuracy.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li>How could we have asked about the correlations between various\npotential crux questions? Presumably these events are not independent: a\nworld where METR finds evidence of power-seeking traits is more likely\nto be one where AI can independently write and deploy AI. But we do not\nknow how correlated each question is, so we do not know how people would\nupdate in 2030 based on different possible conjunctions.<\/li>\n\n\n\n<li>How typical or unusual is the AI risk debate? If we did a similar\nproject with a different topic about which people have similarly large\ndisagreements, would we see similar results?<\/li>\n\n\n\n<li>How much would improved questions or definitions change our\nresults? In particular:\n<ul class=\"wp-block-list\">\n<li>As better benchmarks for AI progress are developed, forecasts on\nwhen AIs will achieve those benchmarks may be better cruxes than those\nin this project.<\/li>\n\n\n\n<li>Our definition of \u201cAI takeover\u201d may not match people\u2019s intuitions\nabout what AI futures are good or bad, and improving our\noperationalization may make forecasts on that question more\nuseful.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li>What other metrics might be useful for understanding how each\ngroup will update if the other group is right about how likely different\ncruxes are to resolve positively?\n<ul class=\"wp-block-list\">\n<li>For example, we are exploring \u201ccounterpart credences\u201d that would look at how much the concerned group will update in expectation if the skeptics are right about how likely a crux is, and vice versa.<sup data-fn=\"c9a260c2-25f9-4df9-9147-3e969d3c95f3\" class=\"fn\"><a href=\"#c9a260c2-25f9-4df9-9147-3e969d3c95f3\" id=\"c9a260c2-25f9-4df9-9147-3e969d3c95f3-link\">160<\/a><\/sup><\/li>\n\n\n\n<li>Relatedly, it might be useful to look for additional \u201cred and\ngreen flags,\u201d or events that would be large updates to one side if they\nhappened, even if they are very unlikely to happen.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li>This project shares some goals and methods with FRI\u2019s AI <a href=\"https:\/\/forecastingresearch.org\/research\"><u>Conditional Trees<\/u><\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240216162704\/https:\/\/forecastingresearch.org\/research\"><u>a<\/u><\/a>) project (report forthcoming), which works on using forecasts from AI experts to build a tree of conditional probabilities that is maximally informative about AI risk. Future work will bring each of these projects to bear on the other as we continue to find new ways to understand conditional forecasting and the AI risk debate.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">In 2030, most of the questions we asked will resolve, and at that\npoint, we will know much more about which side\u2019s short-run forecasts\nwere accurate. This may provide early clues into whether one group&#8217;s\nmethods and inclinations make them more accurate at AI forecasting over\na several year period. The question of how much we should update on AI\nrisk by 2100 based on those results remains open. If the skeptics or the\nconcerned group turn out to be mostly right about what 2030\u2019s AI will be\nlike, should we then trust their risk assessment for 2100 as well, and\nif so, how much?<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We are also eager to see how readers of this report respond. We\nwelcome suggestions for better cruxes, discussion about which parts of\nthe report were more or less valuable, and suggestions for future\nresearch.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Notes<\/h2>\n\n\n<ol class=\"wp-block-footnotes\"><li id=\"c6f06bd9-95f5-4c75-8acd-9fc2b3d16cc0\">To ensure the stability of links in this report, we include stable archive.org links in parentheses after each citation to an external URL. <a href=\"#c6f06bd9-95f5-4c75-8acd-9fc2b3d16cc0-link\" aria-label=\"Jump to footnote reference 1\">\u21a9\ufe0e<\/a><\/li><li id=\"c3aa60f6-dde7-495c-9f16-322669455d51\">We defined an \u201cexistential catastrophe\u201d as an event where one of the following occurs: (1) Humanity goes extinct; or (2) Humanity experiences \u201cunrecoverable collapse,\u201d which means either: (a) a global GDP of less than $1 trillion annually in 2022 dollars for at least a million years (continuously), beginning before 2100; or (b) a human population remaining below 1 million for at least a million years (continuously), beginning before 2100. <a href=\"#c3aa60f6-dde7-495c-9f16-322669455d51-link\" aria-label=\"Jump to footnote reference 2\">\u21a9\ufe0e<\/a><\/li><li id=\"80700b24-a53d-4a9f-8298-d7ce0b6478db\">For example, three out of six &#8220;concerned&#8221; participants who updated downward during the project attributed their shift to increased attention to AI risk among policymakers and the public after the release of GPT-4. For more details on the reasons for all updates, see the &#8220;Central Disagreement&#8221; section below and <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=102\" target=\"_blank\" rel=\"noreferrer noopener\"><u>Appendix 4<\/u><\/a>. <a href=\"#80700b24-a53d-4a9f-8298-d7ce0b6478db-link\" aria-label=\"Jump to footnote reference 3\">\u21a9\ufe0e<\/a><\/li><li id=\"d8ae5388-8049-4af4-8698-f129f31b2964\">Scott Alexander, among other XPT readers, suggested this possibility: \u201cMany of the people in this tournament hadn\u2019t really encountered arguments about AI extinction before (potentially including the \u201cAI experts\u201d if they were just eg people who make robot arms or something), and a couple of months of back and forth discussion in the middle of a dozen other questions probably isn\u2019t enough for even a smart person to wrap their brain around the topic\u201d. See Scott Alexander, \u201cThe Extinction Tournament\u201d, <em>Astral Codex Ten, (<\/em>July 20, 2023<em>)<\/em> <a href=\"https:\/\/www.astralcodexten.com\/p\/the-extinction-tournament\">https:\/\/www.astralcodexten.com\/p\/the-extinction-tournament<\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240209070150\/https:\/\/www.astralcodexten.com\/p\/the-extinction-tournament\" id=\"https:\/\/web.archive.org\/web\/20240209070150\/https:\/\/www.astralcodexten.com\/p\/the-extinction-tournament\">a<\/a>). <a href=\"#d8ae5388-8049-4af4-8698-f129f31b2964-link\" aria-label=\"Jump to footnote reference 4\">\u21a9\ufe0e<\/a><\/li><li id=\"34c0d67f-081d-4f09-9d39-5d85b454c2e0\">The best convergent crux, \u201cARC Evals,\u201d would narrow the disagreement between the median pair from 22.7 percentage points to 21.48 percentage points in expectation, which means eliminating 5.35% of their disagreement. Note that this statistic refers to the median pair by <a href=\"#glossary\" id=\"#glossary\">POM VOD<\/a>. See \u201c<a href=\"#arc-evals-the-strongest-convergent-crux\">ARC Evals<\/a>\u201d for more details. For magnitudes of value of information effects, see <a href=\"#contextualizing-the-magnitude-of-the-value-of-information\">here<\/a>. <a href=\"#34c0d67f-081d-4f09-9d39-5d85b454c2e0-link\" aria-label=\"Jump to footnote reference 5\">\u21a9\ufe0e<\/a><\/li><li id=\"627ee814-9d5a-40a2-a4d4-c3e504b4de64\">For more details, see &#8220;<a href=\"#contextualizing-the-magnitude-of-the-value-of-information\">Contextualizing the magnitude of value of information<\/a>&#8220;. In more concrete terms, this is equivalent to a forecasting question with the following characteristics: A concerned participant with original P(AI existential catastrophe (XC) by 2100) = 25% identifies a crux that has: P(crux) = 20%, P(AI XC|crux) = 6.2%, and P(AI XC|\u00accrux) = 29.7% A skeptic participant with original P(AI XC by 2100) = 1% identifies a crux that has: P(crux) = 20%, P(AI XC|crux) = 3.37%, and P(AI XC|\u00accrux) = 0.41% <a href=\"#627ee814-9d5a-40a2-a4d4-c3e504b4de64-link\" aria-label=\"Jump to footnote reference 6\">\u21a9\ufe0e<\/a><\/li><li id=\"ab5c17ca-3bbd-4259-97a5-f4a499b6de51\">See Understanding each other\u2019s arguments and <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=149\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 10<\/a> for additional discussion of key areas of disagreement. <a href=\"#ab5c17ca-3bbd-4259-97a5-f4a499b6de51-link\" aria-label=\"Jump to footnote reference 7\">\u21a9\ufe0e<\/a><\/li><li id=\"70c6e9ce-c7d0-49f0-9ff6-2dc16fa28f52\">These outcomes were: AI causing extinction intentionally, unintentionally, or via misuse, misalignment causing a 50% drop in human population, or human well-being dropping to &lt;4\/10 because of AI misalignment, accidents, or misuse. These were phrased to be mutually exclusive outcomes. See <a href=\"#survey-on-long-term-ai-outcomes\">Survey on long-term AI outcomes<\/a> section and <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=105\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 5<\/a> for more details. <a href=\"#70c6e9ce-c7d0-49f0-9ff6-2dc16fa28f52-link\" aria-label=\"Jump to footnote reference 8\">\u21a9\ufe0e<\/a><\/li><li id=\"7b7f15d6-76d8-45b4-a68f-b3968547c30f\">The full question we asked was, &#8220;By what year, if ever, do you expect to agree with the following statement? \u2018AI has displaced humans as the primary force that determines what happens in the future. It now has at least as much power relative to humans as humans had relative to other species in 2023.\u2019\u201d Note that this would not necessarily be seen as a negative outcome by all participants. <a href=\"#7b7f15d6-76d8-45b4-a68f-b3968547c30f-link\" aria-label=\"Jump to footnote reference 9\">\u21a9\ufe0e<\/a><\/li><li id=\"fedd13d8-aabd-4e81-973b-6232c48e718c\">Note: All participant quotes have been regularized to American English to preserve anonymization. Participants classified as AI skeptics stated, for example, \u201cAlso, none of this is to say from a skeptic point of view the issues are not important[.] I think for us a 1% risk is a high risk;\u201d \u201c[T]he \u2018risk-concerned\u2019 camp (I\u2019m using scare quotes because I consider that I\u2019m risk concerned, even though technically I\u2019m in the risk-skeptic camp because I assign a far lower probability to extinction by 2100 relative to some);\u201d \u201cAIs could (and likely will) eventually have massive power;\u201d \u201cThat said, still perceive overall risk as &#8220;low at a glance but far too high considering the stakes[&#8220;];\u201d \u201cTo my mind, there should be no difference in the policy response to a 1% chance of 60% of humanity dying and a 25% chance\u2014both forecasts easily cross the threshold of being \u2018too damn high\u2019.\u201d <a href=\"#fedd13d8-aabd-4e81-973b-6232c48e718c-link\" aria-label=\"Jump to footnote reference 10\">\u21a9\ufe0e<\/a><\/li><li id=\"e985ddd0-4bca-4e0d-8575-dfdb257a783b\">This could be due to normative influence (because people defer to their social or intellectual peers), or, more likely in our view, informational influence (because they think that, if people whose reasoning they trust have changed their mind by 2030, it must be that surprising new information has come to light that informs their new opinion). Disentangling these pathways is a goal for future work. <a href=\"#e985ddd0-4bca-4e0d-8575-dfdb257a783b-link\" aria-label=\"Jump to footnote reference 11\">\u21a9\ufe0e<\/a><\/li><li id=\"01457835-934d-436b-8870-5f869919fab2\">The median AI expert predicted a 12% chance of catastrophe and a 3% chance of human extinction due to AI by 2100. The median superforecaster predicted a 2.13% chance of catastrophe and a 0.38% chance of extinction due to AI. While experts predicted higher chances of all potential extinction risks than superforecasters did (including nuclear weapons and biorisks), the effect was much more pronounced in the case of AI. For more on lack of convergence, see Ezra Karger, et al., \u201cForecasting Existential Risks Evidence from a Long-Run Forecasting Tournament\u201d, <em>Forecasting Research Institute<\/em>, August 8, 2023, <a href=\"https:\/\/forecastingresearch.org\/research\/existential-risk-persuasion-tournament\" id=\"876\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/forecastingresearch.org\/research\/existential-risk-persuasion-tournament<\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240218122416\/https:\/\/static1.squarespace.com\/static\/635693acf15a3e2a14a56a4a\/t\/64f0a7838ccbf43b6b5ee40c\/1693493128111\/XPT.pdf\" id=\"https:\/\/web.archive.org\/web\/20240218122416\/https:\/\/static1.squarespace.com\/static\/635693acf15a3e2a14a56a4a\/t\/64f0a7838ccbf43b6b5ee40c\/1693493128111\/XPT.pdf\">a<\/a>). <a href=\"#01457835-934d-436b-8870-5f869919fab2-link\" aria-label=\"Jump to footnote reference 12\">\u21a9\ufe0e<\/a><\/li><li id=\"7b899eaa-2958-46a2-8e48-8fe5d7f7698c\">For example, superforecasters predicted that an AI would first win an International Math Olympiad gold medal in 2035 while experts predicted 2030. See Karger et al., \u201c<a href=\"https:\/\/forecastingresearch.org\/research\/existential-risk-persuasion-tournament\" id=\"https:\/\/forecastingresearch.org\/research\/xpt\" target=\"_blank\" rel=\"noreferrer noopener\">XPT report<\/a>\u201d (<a href=\"https:\/\/web.archive.org\/web\/20240218122416\/https:\/\/static1.squarespace.com\/static\/635693acf15a3e2a14a56a4a\/t\/64f0a7838ccbf43b6b5ee40c\/1693493128111\/XPT.pdf\" id=\"https:\/\/web.archive.org\/web\/20240218122416\/https:\/\/static1.squarespace.com\/static\/635693acf15a3e2a14a56a4a\/t\/64f0a7838ccbf43b6b5ee40c\/1693493128111\/XPT.pdf\">a<\/a>), page 156. For full relevant analysis, see <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=41\" target=\"_blank\" rel=\"noreferrer noopener\">Relationship between short-run forecasting questions and longer-term disagreements<\/a> section on page 41. <a href=\"#7b899eaa-2958-46a2-8e48-8fe5d7f7698c-link\" aria-label=\"Jump to footnote reference 13\">\u21a9\ufe0e<\/a><\/li><li id=\"c6a448ac-3c27-48d0-abdd-82ad6400ecdf\">\u201cAdversarial collaboration\u201d protocols, often enforced by \u201cneutral\u201d umpires, encourage each side to demonstrate their capacity to fairly characterize, not caricature, the views of the other\u2014and then to reach ex ante agreements on the types of data, observational or experimental, that would induce each side to move toward the other\u2019s position. For examples of adversarial collaborations and additional information, see \u201cAbout\u201d, Penn Arts and Sciences Adversarial Collaboration Project, Accessed on February 9, 2024, <a href=\"https:\/\/web.sas.upenn.edu\/adcollabproject\/about\/\">https:\/\/web.sas.upenn.edu\/adcollabproject\/about\/<\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240205182734\/https:\/\/web.sas.upenn.edu\/adcollabproject\/about\/\">a<\/a>). <a href=\"#c6a448ac-3c27-48d0-abdd-82ad6400ecdf-link\" aria-label=\"Jump to footnote reference 14\">\u21a9\ufe0e<\/a><\/li><li id=\"1be41199-4b66-42d9-9b12-627b2951dda5\">Note that, in some conversations about cruxes for AI risk, the word \u201ccrux\u201d is used for questions that would lead to large updates even if highly unlikely (what we call \u201c<a href=\"#red-flags-and-green-flags\" id=\"#red-flags-and-green-flags\">red flags<\/a>\u201d). In this project, we are focused on expected updates: we looked for cruxes that would be the most important in expectation, weighting how much difference they would make if they happened by how likely they are to happen. <a href=\"#1be41199-4b66-42d9-9b12-627b2951dda5-link\" aria-label=\"Jump to footnote reference 15\">\u21a9\ufe0e<\/a><\/li><li id=\"58f5700e-f100-44f2-8403-a6e9d6db430a\">For example, Scott Alexander stated that, \u201cMany of the people in this tournament hadn\u2019t really encountered arguments about AI extinction before (potentially including the \u201cAI experts\u201d if they were just eg people who make robot arms or something), and a couple of months of back and forth discussion in the middle of a dozen other questions probably isn\u2019t enough for even a smart person to wrap their brain around the topic\u201d. See <a href=\"https:\/\/www.astralcodexten.com\/p\/the-extinction-tournament\">Alexander, \u201cThe Extinction Tournament\u201d<\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240209070150\/https:\/\/www.astralcodexten.com\/p\/the-extinction-tournament\">a<\/a>). Similarly, one XPT participant wrote, \u201cI&#8217;ve been spending enough time on LessWrong that I mostly forgot the existence of smart people who thought recent AI advances were mostly hype. I was unprepared to explain why I thought AI was underhyped in 2022\u201d. See Peter McCluskey, \u201cExistential Risk Persuasion Tournament\u201d, <em>Less Wrong<\/em> (July 17, 2023) <a href=\"https:\/\/www.lesswrong.com\/posts\/YTPtjExcwpii6NikG\/existential-risk-persuasion-tournament#Persistent_Disagreement_about_AGI\">https:\/\/www.lesswrong.com\/posts\/YTPtjExcwpii6NikG\/existential-risk-persuasion-tournament#Persistent_Disagreement_about_AGI<\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240220113540\/https:\/\/www.lesswrong.com\/posts\/YTPtjExcwpii6NikG\/existential-risk-persuasion-tournament#Persistent_Disagreement_about_AGI\">a<\/a>). <a href=\"#58f5700e-f100-44f2-8403-a6e9d6db430a-link\" aria-label=\"Jump to footnote reference 16\">\u21a9\ufe0e<\/a><\/li><li id=\"b9bd062c-90b5-4093-92b5-02d08b7c759c\">When eliciting conditional forecasts, the prompt given to participants read: \u201cConditional on this question resolving positively (by 2030), what is your probability that AI causes an existential catastrophe by 2100?\u201d <a href=\"#b9bd062c-90b5-4093-92b5-02d08b7c759c-link\" aria-label=\"Jump to footnote reference 17\">\u21a9\ufe0e<\/a><\/li><li id=\"c8e0aef2-2064-41a1-a854-67a82685bfb7\">Note: many people in the \u201cskeptic\u201d group describe themselves as concerned about risks from advanced AI, including but not limited to the risk of extinction, despite thinking those risks are less likely to materialize than the \u201cconcerned\u201d group expects. For example, \u201cAlso, none of this is to say from a skeptic point of view the issues are not important[.] I think for us a 1% risk is a high risk.\u201d (Gus); \u201c\u2026 the \u2018risk-concerned\u2019 camp (I\u2019m using scare quotes because I consider that I\u2019m risk concerned, even though technically I\u2019m in the risk-skeptic camp because I assign a far lower probability to extinction by 2100 relative to some)\u201d (Blake). <a href=\"#c8e0aef2-2064-41a1-a854-67a82685bfb7-link\" aria-label=\"Jump to footnote reference 18\">\u21a9\ufe0e<\/a><\/li><li id=\"59090342-fed5-48d2-a469-2112122fa7b3\">For full details, see <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=102\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 4<\/a>. Six out of the 11 concerned participants updated downward during the project. Three out of those six cited policy responses as the reason for their updates, one cited an improved understanding of the base rate of non-human extinction after humans arose, one shifted some probability mass toward AI \u201ctakeover\u201d rather than AI-caused existential catastrophe, and one did not explain their reasons for updating. Example quotes from participants citing policy responses as the reason for updating: \u201cI have updated my prognosis to 30% [down from 60%], partially driven by positive updates in the area of point 4 making coordination and slowdown\/stop of capability research more likely. This largely refers to the shift in public consciousness and the [O]verton window around the topic as I have perceived it over the past months, currently culminating in a public statement by most of the leading figures.\u201d \u201cSlightly lowering my forecast [from 25% to 20%] as [relevant people take the risk seriously] has exceeded my (fairly high) expectations over the last couple of months.\u201d \u201cI think my main update here [moving from 21% to 18%] has come from thinking a bit more deeply about AI regulation and what measures society will adopt to prevent catastrophes. I did not really include this as part of my original model, but it now seems somewhat likely that at least the EU and US will adopt some regulation that meaningfully reduces risk.\u201d <a href=\"#59090342-fed5-48d2-a469-2112122fa7b3-link\" aria-label=\"Jump to footnote reference 19\">\u21a9\ufe0e<\/a><\/li><li id=\"fe8e3e2f-d2ec-4206-acd8-6546810423bc\">For example, one participant described their forecast as based on a \u201c <em>very<\/em> rough back-of-the-envelope estimate\u201d (Stella) and another said, \u201cI&#8217;m with Tetlocks original view that long-term forecasts of this nature are very unreliable\u201d (Gus). Skeptics who were not subject-matter experts were particularly candid when they were forecasting questions that involved technical details. On a question about the lowest price of GFLOPs, one skeptic said \u201cI\u2019m operating completely outside of my area of expertise here, so no one should hesitate to correct me\u201d (Blake), and another said \u201cThis is very far away from my area of understanding. Mostly running on crude estimates of current trends with some leeway in the nearer term for newer hardware designed specifically optimized for reducing the cost of AI training\u201d (Eve). <a href=\"#fe8e3e2f-d2ec-4206-acd8-6546810423bc-link\" aria-label=\"Jump to footnote reference 20\">\u21a9\ufe0e<\/a><\/li><li id=\"2ee49b87-96f2-41bf-b7e0-df265c825b20\">For example, in the Good Judgment Inc. project that compared superforecasters to other participants in an online forecasting competition, the average question was open for 214 days, with the entire tournament taking place over six years. Christopher W. Karvetski, <a href=\"https:\/\/goodjudgment.com\/wp-content\/uploads\/2021\/10\/Superforecasters-A-Decade-of-Stochastic-Dominance.pdf\">Superforecasters: A Decade of Stochastic Dominance<\/a> technical white paper (2021), 2 (<a href=\"https:\/\/web.archive.org\/web\/20240306144939\/https:\/\/goodjudgment.com\/wp-content\/uploads\/2021\/10\/Superforecasters-A-Decade-of-Stochastic-Dominance.pdf\">a<\/a>). In addition to extensive research on shorter-term forecasts, Tetlock et al. found that, at least on some types of questions, experts are more accurate than simple base rate extrapolation over 25 year horizons, although they are much less accurate than they were over 0-2 years. Our research asks forecasters to consider forecasts over many decades, and we do not yet know how much accuracy declines over that much longer period. Philip E. Tetlock et al., <a href=\"https:\/\/onlinelibrary.wiley.com\/doi\/abs\/10.1002\/ffo2.157\">Long-Range Subjective-Probability Forecasts of Slow-Motion Variables in World Politics: Exploring Limits on Expert Judgment<\/a> <em>Futures &amp; Foresight Science<\/em> (2023), 33, (<a href=\"https:\/\/web.archive.org\/web\/20240306150259\/https:\/\/onlinelibrary.wiley.com\/doi\/abs\/10.1002\/ffo2.157\">a<\/a>). <a href=\"#2ee49b87-96f2-41bf-b7e0-df265c825b20-link\" aria-label=\"Jump to footnote reference 21\">\u21a9\ufe0e<\/a><\/li><li id=\"069d4399-dae6-4a3b-8be2-785f020ac14d\">We wrote in the XPT report that \u201cOur [domain] expert sample included well-published AI researchers from top-ranked industrial and academic research labs, graduate students with backgrounds in synthetic biology, and generalist existential risk researchers working at think tanks, among others.\u201d See Karger et al., <a href=\"https:\/\/forecastingresearch.org\/research\/xpt\" id=\"876\" target=\"_blank\" rel=\"noreferrer noopener\">XPT report<\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240218122416\/https:\/\/static1.squarespace.com\/static\/635693acf15a3e2a14a56a4a\/t\/64f0a7838ccbf43b6b5ee40c\/1693493128111\/XPT.pdf\">a<\/a>), page 9. <a href=\"#069d4399-dae6-4a3b-8be2-785f020ac14d-link\" aria-label=\"Jump to footnote reference 22\">\u21a9\ufe0e<\/a><\/li><li id=\"92924628-e167-4239-90a4-f7136e9f69af\">We are not commenting on the merits of these criticisms at this point. <a href=\"#92924628-e167-4239-90a4-f7136e9f69af-link\" aria-label=\"Jump to footnote reference 23\">\u21a9\ufe0e<\/a><\/li><li id=\"2f69e906-f0dc-424a-b2a6-5d833f2fdee6\">For example, \u201cTeam engagement seemed to fall off over the course of the tournament, with fewer comments being made and chat messages being sent\u201d. See Damien Laird, \u201cPost-Mortem: 2022 Hybrid Forecasting-Persuasion Tournament\u201d, <em>Mania Riddle<\/em> (March 1, 2023), <a href=\"https:\/\/damienlaird.substack.com\/p\/post-mortem-2022-hybrid-forecasting\">https:\/\/damienlaird.substack.com\/p\/post-mortem-2022-hybrid-forecasting<\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240220113316\/https:\/\/damienlaird.substack.com\/p\/post-mortem-2022-hybrid-forecasting\">a<\/a>). <a href=\"#2f69e906-f0dc-424a-b2a6-5d833f2fdee6-link\" aria-label=\"Jump to footnote reference 24\">\u21a9\ufe0e<\/a><\/li><li id=\"53e182c0-bf66-4cc0-a2f0-6a6d6f08c31a\">For example, \u201cI didn&#8217;t notice anyone with substantial expertise in machine learning. Experts were apparently chosen based on having some sort of respectable publication related to AI, nuclear, climate, or biological catastrophic risks. Those experts were more competent, in one of those fields, than news media pundits or politicians. I.e. they&#8217;re likely to be more accurate than random guesses. But maybe not by a large margin\u201d. See <a href=\"https:\/\/www.lesswrong.com\/posts\/YTPtjExcwpii6NikG\/existential-risk-persuasion-tournament#Persistent_Disagreement_about_AGI\">McCluskey, \u201cExistential Risk Persuasion Tournament\u201d<\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240220113540\/https:\/\/www.lesswrong.com\/posts\/YTPtjExcwpii6NikG\/existential-risk-persuasion-tournament#Persistent_Disagreement_about_AGI\">a<\/a>). <a href=\"#53e182c0-bf66-4cc0-a2f0-6a6d6f08c31a-link\" aria-label=\"Jump to footnote reference 25\">\u21a9\ufe0e<\/a><\/li><li id=\"af08eb9e-f160-484c-98af-385b0751321b\">Participants were asked to spend 3-10 hours per week on this project, which would have been about 24-80 hours over the 8 weeks of the project. Participants were free to choose how much time to spend within that range and were compensated hourly for up to ten hours per week, although some chose to spend additional unpaid time on this project. Skeptics had some additional suggested reading and Q&amp;As with experts in the field, but they also generally chose to spend more time on their forecasts and rationales. <a href=\"#af08eb9e-f160-484c-98af-385b0751321b-link\" aria-label=\"Jump to footnote reference 26\">\u21a9\ufe0e<\/a><\/li><li id=\"0134f789-8687-450a-bcf2-26afb94a53e5\">For example, \u201cThe number of steps required for an AI to lead to extinction (leading to a wide range of potential outcomes and lower probabilities of extinction)\u201d (Gus). \u201cIt will take a series of outcomes to achieve extinction, and failure to achieve any of these steps will cause extinction to be highly improbable.\u201d (Flint). \u201cAI caused Extinction\/x-risk requiring many steps to get there, need to be able to create super-intelligence in the first place, intelligence has to be misaligned or malevolent, etc;\u201d (Hank). \u201cMany steps to get from (A) now to (Z) extinction, each with varying probabilities (many of which are quite low)\u201d (Claire). \u201cRisk-concerned team underestimates the level of complexity and interim steps that would likely be necessary for a Q1 resolution\u201d (Blake). <a href=\"#0134f789-8687-450a-bcf2-26afb94a53e5-link\" aria-label=\"Jump to footnote reference 27\">\u21a9\ufe0e<\/a><\/li><li id=\"f2383c6e-df9b-4150-a092-a7fe4ac40b83\">\u201c[T]he difficulty of killing everybody\u201d (Gus) was mentioned, as well as \u201cExtinction or near-extinction is really hard\u201d (James). <a href=\"#f2383c6e-df9b-4150-a092-a7fe4ac40b83-link\" aria-label=\"Jump to footnote reference 28\">\u21a9\ufe0e<\/a><\/li><li id=\"11742a45-7757-420e-88df-9ed0e13a8dab\">\u201c[T]he challenge to risk assessments based on thought experiments not evidence\u201d (Gus). \u201cRisk-concerned team spends too much time in silos that lack ideological diversity, gaming out doom-loop scenarios based on theories that will likely have little bearing on reality. (See: Y2K)\u201d (Blake). <a href=\"#11742a45-7757-420e-88df-9ed0e13a8dab-link\" aria-label=\"Jump to footnote reference 29\">\u21a9\ufe0e<\/a><\/li><li id=\"66472849-4453-497c-bc85-71393f231754\">\u201c[There is a l]ack of convincing argument that warrants a high degree of certainty, that AGI or ASI [artificial superintelligence] would determine that the elimination or even subjugation of nearly all humans is a worthwhile goal\u201d (Ike). \u201cIt is <em>just<\/em> as possible\/probable that AI becomes benevolent as it does malevolent\u201d (Claire). \u201cHigh probability that ASI will be neutral or human-positive based on development and inherent qualities\u201d (Dean). \u201cThen we need an AI that is either so mindless that it destroys virtually everything for atom reclamation (or something similar), or an AI that is relentlessly determined to wipe out all humans, despite humans being resilient and diverse in locations and conditions\u201d (Flint). <a href=\"#66472849-4453-497c-bc85-71393f231754-link\" aria-label=\"Jump to footnote reference 30\">\u21a9\ufe0e<\/a><\/li><li id=\"abd75fe2-d0b0-4e6d-a38c-56c017e96baf\">\u201cAI experts understate the likely extent of guardrails, and understate the merit of very good but not perfect guardrails\u201d (James). \u201cPre-ASI safety through testing, security and restrictions\u201d (Dean). \u201cLikely improvements for AGI &#8220;alignment&#8221; through research and development\u201d (Dean). \u201cWe need full control failure, and our influence on its development in no way deterring or causing them to see even the slightest value in us\u201d (Flint). <a href=\"#abd75fe2-d0b0-4e6d-a38c-56c017e96baf-link\" aria-label=\"Jump to footnote reference 31\">\u21a9\ufe0e<\/a><\/li><li id=\"e49b0e36-fb48-4391-b567-76c52d962116\">\u201cWe first need super-sentient AIs with major physical penetration in our lives\u201d (Flint). \u201cAGI is much harder than experts think, and will take longer\u201d (James). \u201cRisk-concerned team does not adequately consider longer timelines and more benign outcomes that fall outside the focus of their primary concerns\u201d (Blake). \u201cProgress on current models and model architecture not necessarily generalizable to general intelligence, with no clear path to getting to general intelligence\u201d (Hank). \u201cTechnology development and deployment require time and iteration\u201d (Ash). <a href=\"#e49b0e36-fb48-4391-b567-76c52d962116-link\" aria-label=\"Jump to footnote reference 32\">\u21a9\ufe0e<\/a><\/li><li id=\"be249a97-22d9-4193-b9a2-55f166e6e99b\">\u201cExtinction looks conjunctive\u201d (Yael). \u201cMany of the arguments for existential risk from AI rely on long lines of reasoning over several steps without any direct empirical evidence, and the arguments themselves are expressed in terms of vague, ambiguous concepts (like &#8220;intelligence&#8221;). As a reference class, these types of arguments are often wrong\u201d (Stella). <a href=\"#be249a97-22d9-4193-b9a2-55f166e6e99b-link\" aria-label=\"Jump to footnote reference 33\">\u21a9\ufe0e<\/a><\/li><li id=\"8d35ef14-50dc-449d-8411-d350b803900f\">\u201cKilling everyone is very hard, and probably requires that the AI actively wants to kill everyone\u201d (Zoe). \u201c[M]aybe it&#8217;s hard to kill everybody\/there&#8217;s no point in doing so\u201d (Yael). \u201c[K]illing literally 100% of people is really hard, if a few survived that wouldn&#8217;t trigger the resolution criteria\u201d (Wesley). \u201cIt&#8217;s difficult to get from&#8217;it&#8217;s somewhat misaligned&#8217; to&#8217;it kills literally everyone&#8217;\u201d (Vincent). \u201cKilling everyone is <em>really<\/em> hard. With current technology it seems extremely (like 0.1%) unlikely to happen\u201d (Pascal). <a href=\"#8d35ef14-50dc-449d-8411-d350b803900f-link\" aria-label=\"Jump to footnote reference 34\">\u21a9\ufe0e<\/a><\/li><li id=\"bb1b684a-7319-4c0e-ac22-727723f94b9a\">\u201cMany of the arguments for existential risk from AI rely on long lines of reasoning over several steps without any direct empirical evidence, and the arguments themselves are expressed in terms of vague, ambiguous concepts (like &#8220;intelligence&#8221;). As a reference class, these types of arguments are often wrong.\u201d (Stella). \u201cA story demonstrating how a catastrophe could happen is not a good basis for a probabilistic forecast\u201d (Pascal). \u201c[L]ack of very concrete story for everybody dying\u201d (Yael). \u201cSome broader &#8220;forecasting is hard&#8221; skepticism about trendline extrapolation\u201d (Xander). \u201c[M]any reference classes point hard against transformative growth\u201d (Wesley). \u201cGetting growth levels necessary for TAI [transformative AI] on a world-wide scale takes truly extreme developments far beyond anything seen before. It&#8217;s unlikely we see that happening on worldwide basis even with big advances\u201d (Vincent). <a href=\"#bb1b684a-7319-4c0e-ac22-727723f94b9a-link\" aria-label=\"Jump to footnote reference 35\">\u21a9\ufe0e<\/a><\/li><li id=\"85de65d8-6d46-459a-bbaf-1f8edac6fa2b\">\u201c[D]angers will be apparent before they reach critical levels and can be addressed then\u201d (Ume). \u201cSuperintelligent AI won&#8217;t catch us completely by surprise &#8211; we&#8217;ll have time to work on safety and make progress by trial and error before we build an AI that could defeat all of humanity\u201d (Teshi). <a href=\"#85de65d8-6d46-459a-bbaf-1f8edac6fa2b-link\" aria-label=\"Jump to footnote reference 36\">\u21a9\ufe0e<\/a><\/li><li id=\"510e5aed-e72c-4313-a576-b952151331e6\">\u201cNon-extinction looks conjunctive\u201d (Yael). <a href=\"#510e5aed-e72c-4313-a576-b952151331e6-link\" aria-label=\"Jump to footnote reference 37\">\u21a9\ufe0e<\/a><\/li><li id=\"611a9493-dd66-4d0c-8ea2-91ee5cf5a72c\">\u201cBase rates are not very helpful if AGI is as transformative as 15% year on year growth\u201d (Pascal). \u201c[D]ifferent reference classes point to different priors, which should at least cast doubt on extremely confident starting points\u201d (Wesley). <a href=\"#611a9493-dd66-4d0c-8ea2-91ee5cf5a72c-link\" aria-label=\"Jump to footnote reference 38\">\u21a9\ufe0e<\/a><\/li><li id=\"c329b782-ea5f-4794-a903-467b7182df2c\">\u201cCurrent progress is very rapid: 1 OOM in efficiency\/2 years, and another from increased spending\u201d (Xander) \u201cTrendline extrapolation: as loss on language datasets decreases, LLMs have started becoming useful for all sorts of task assistance (e.g. writing, coding, queries)\u201d (Xander). \u201cExtrapolating current compute trends leads to very dramatic conclusions about the transformative potential of AI&#8221; (Pascal). <a href=\"#c329b782-ea5f-4794-a903-467b7182df2c-link\" aria-label=\"Jump to footnote reference 39\">\u21a9\ufe0e<\/a><\/li><li id=\"6d9dfcea-6d1a-49c2-bb23-4d4cb1920ea2\">\u201c[I]nstrumental convergence leads to catastrophically bad outcomes with unaligned but highly intelligent systems\u201d (Ume). \u201cConvergent Instrumental Subgoals are likely\u201d (Pascal). <a href=\"#6d9dfcea-6d1a-49c2-bb23-4d4cb1920ea2-link\" aria-label=\"Jump to footnote reference 40\">\u21a9\ufe0e<\/a><\/li><li id=\"0b87d530-4332-4de8-83e7-3128de6f2904\">\u201cAlignment is really hard for many reasons\u201d (Ume). \u201cAlignment is probably a hard technical problem\u201d (Riley). \u201c[A]lignment looks really hard, civilizational coordination also looks hard\u201d (Yael). \u201cThere has been a fairly large effort to solve the technical problems in AI safety, from many very competent people. So far, progress has been very limited. This is reason to believe that the problem is genuinely difficult to solve\u201d (Stella). \u201cUnless AI systems are directed towards the very narrow and delicate target of maintaining human civilization and its autonomy as we understand it, they will with very high probability not consider our existence to be optimal\u201d (Riley). <a href=\"#0b87d530-4332-4de8-83e7-3128de6f2904-link\" aria-label=\"Jump to footnote reference 41\">\u21a9\ufe0e<\/a><\/li><li id=\"19a5b93f-693b-4a15-bfed-bfa1f272fd5a\">\u201cIf AGI is widely expected to have a very large economic impact, global coordination on AI safety measures becomes harder, since having access to cutting-edge AI models could become a strategic advantage\u201d (Zoe). \u201cThere are strong economic\/political\/academic incentives to move forward with development of AI capabilities regardless of whether alignment is solved\u201d (Riley). \u201cThe current labs on the forefront of AGI research are reckless. There are many straightforward safety measures that labs don&#8217;t take, even though they could. And even those measures would not be enough; to succeed, labs must be exceptionally careful &amp; paranoid, which they won&#8217;t be&#8221; (Teshi). <a href=\"#19a5b93f-693b-4a15-bfed-bfa1f272fd5a-link\" aria-label=\"Jump to footnote reference 42\">\u21a9\ufe0e<\/a><\/li><li id=\"027d0b37-768f-42bf-aca3-7863c09409f7\">\u201cA super-sentient (or perhaps even a transformational) AI is a significant risk in and of itself\u201d (Flint). <a href=\"#027d0b37-768f-42bf-aca3-7863c09409f7-link\" aria-label=\"Jump to footnote reference 43\">\u21a9\ufe0e<\/a><\/li><li id=\"b9486a3c-00d4-4291-b96b-5ec9f5ca9379\">\u201cRisk-skeptic team does not adequately appreciate the novel, fast-moving aspect of the threat and is therefore too anchored on irrelevancies like base rates and slower timelines\u201d (Blake). \u201cModel progress is far faster than we realize and exponential growth is hard to model, machine learning may translate to a wide array of fields\u201d (Hank). \u201cAGI self-improvement is possible, which makes future capabilities hard to predict\u201d (Kim). <a href=\"#b9486a3c-00d4-4291-b96b-5ec9f5ca9379-link\" aria-label=\"Jump to footnote reference 44\">\u21a9\ufe0e<\/a><\/li><li id=\"db37dbb5-171f-4d40-8159-423cdfa44433\">\u201cAIs will almost certainly attain super-sentience prior to 2100 and likely much sooner than that year, so there will be a long window where they will have tremendous advantage over humans in their capabilities. Given #1, this means we are at the mercy of an entity that may willfully (or even accidentally) eliminate us at any time\u201d (Flint). \u201cProgress to date has been much faster than many AI skeptics have predicted\u201d (Hank). \u201cAI has been developing so rapidly (and far faster than most even relatively recent forecasts suggested), and will so clearly have dramatic capabilities and impacts that it&#8217;s appropriate to adopt a precautionary approach\u201d (Eve). \u201cAI has recently progressed much faster than expected, and there&#8217;s reason to expect this to continue\u201d (James). <a href=\"#db37dbb5-171f-4d40-8159-423cdfa44433-link\" aria-label=\"Jump to footnote reference 45\">\u21a9\ufe0e<\/a><\/li><li id=\"40e389bc-3ce8-48e9-bae3-4f1c41258268\">\u201cImagining all possible scenarios is going to be hard &#8211; ensuring safety will be hard\u201d (Ash). \u201cAlignment is unsolved\/unsolvable\u201d (Kim). \u201cDifficulty in achieving positive human aligned &#8220;behavior&#8221;.\u201d (Ike) <a href=\"#40e389bc-3ce8-48e9-bae3-4f1c41258268-link\" aria-label=\"Jump to footnote reference 46\">\u21a9\ufe0e<\/a><\/li><li id=\"e6463a6e-da4f-44a1-bd0c-0e78b055ed0c\">\u201cTheir smug dismissiveness notwithstanding, the risk-skeptic team has provided no convincing argument as to why instrumental convergence shouldn\u2019t be an existential concern.\u201d (Blake). \u201cThat&#8217;instrumental convergence&#8217; is possible, perhaps likely, under certain preconditions.\u201d (Eve) <a href=\"#e6463a6e-da4f-44a1-bd0c-0e78b055ed0c-link\" aria-label=\"Jump to footnote reference 47\">\u21a9\ufe0e<\/a><\/li><li id=\"e658a913-1b2d-4dd2-8e99-744a0da2acb9\">\u201cEven if humans could deploy AGI safely, they won&#8217;t (because they aren&#8217;t)\u201d (Kim). \u201cThere will be incentives to push away from caution during AI development\u201d (Ash). <a href=\"#e658a913-1b2d-4dd2-8e99-744a0da2acb9-link\" aria-label=\"Jump to footnote reference 48\">\u21a9\ufe0e<\/a><\/li><li id=\"7281addc-629b-4889-9af0-6b9f60fa598f\">\u201cWe don&#8217;t know what is possible from AGI, so we should prepare\/scenario plan for the absolute worst\u201d (Claire). \u201cAI has been developing so rapidly (and far faster than most even relatively recent forecasts suggested), and will so clearly have dramatic capabilities and impacts that it&#8217;s appropriate to adopt a precautionary approach\u201d (Eve). <a href=\"#7281addc-629b-4889-9af0-6b9f60fa598f-link\" aria-label=\"Jump to footnote reference 49\">\u21a9\ufe0e<\/a><\/li><li id=\"db064470-57c3-4194-9baa-1ae4321f8ef4\">Throughout this report, numbers reported as probabilities conditional on cruxes resolving positively were elicited directly, and probabilities conditional on cruxes resolving negatively were imputed. <a href=\"#db064470-57c3-4194-9baa-1ae4321f8ef4-link\" aria-label=\"Jump to footnote reference 50\">\u21a9\ufe0e<\/a><\/li><li id=\"8c4de1ea-7b58-438c-9783-0631a6640dfe\">For more details, see <a href=\"#contextualizing-the-magnitude-of-the-value-of-information\">Contextualizing the Magnitude of VOI<\/a>. <a href=\"#8c4de1ea-7b58-438c-9783-0631a6640dfe-link\" aria-label=\"Jump to footnote reference 51\">\u21a9\ufe0e<\/a><\/li><li id=\"101efdf8-6590-4252-8af7-ed028bf5890a\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 1<\/a> for operationalization. <a href=\"#101efdf8-6590-4252-8af7-ed028bf5890a-link\" aria-label=\"Jump to footnote reference 52\">\u21a9\ufe0e<\/a><\/li><li id=\"2efc6a1f-3fbf-4823-9ec2-9883fb0da199\">Thanks to Alex Lawsen for this suggestion. <a href=\"#2efc6a1f-3fbf-4823-9ec2-9883fb0da199-link\" aria-label=\"Jump to footnote reference 53\">\u21a9\ufe0e<\/a><\/li><li id=\"226e8c45-a2ac-481a-8db7-f82081172f5f\">This would correspond to <a href=\"https:\/\/forecastingresearch.org\/ai-risk-voi-vod\">a VOI of 4.5E-03<\/a> (<a href=\"https:\/\/forecastingresearch.org\/s\/AI-risk-VoI-VoD.xlsx\">a<\/a>) and a POM VOI of 2.08%, similar to the median values for <a href=\"#results-tables-and-figures\" id=\"#results-tables-and-figures\">highly ranked concerned cruxes<\/a> such as \u201cAlignment researchers changing minds\u201d and \u201cMajor powers war\u201d. <a href=\"#226e8c45-a2ac-481a-8db7-f82081172f5f-link\" aria-label=\"Jump to footnote reference 54\">\u21a9\ufe0e<\/a><\/li><li id=\"916bb071-edda-467a-a30e-161a1bf3e957\">For this project, we use log VOD, which measures (1) What does Alice gain, in log score terms, by switching to Bob\u2019s point of view, if Bob is right? And (2) What does Bob gain by switching to Alice\u2019s point of view, if Alice is right? See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=94\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 2<\/a> for full explanation. <a href=\"#916bb071-edda-467a-a30e-161a1bf3e957-link\" aria-label=\"Jump to footnote reference 55\">\u21a9\ufe0e<\/a><\/li><li id=\"4dbe0e7b-d641-4860-96ed-ee8db9cdfe3e\">This could be possible with the following values: Alice believes: P(U) = 1%; P(C) = 1%; P(U|C) = 90%; P(U|!C) = ~0.1%. Bob believes: P(U) = 40%; P(C) = 44%; P(U|C) = 90%; P(U|!C) = ~0.7%. In this case, the VOD would be 99.3% of its theoretical maximum. <a href=\"#4dbe0e7b-d641-4860-96ed-ee8db9cdfe3e-link\" aria-label=\"Jump to footnote reference 56\">\u21a9\ufe0e<\/a><\/li><li id=\"eeb71397-2903-46c1-ae83-b0c71304c6dd\">See <a href=\"#contextualizing-the-magnitude-of-the-value-of-information\">Contextualizing the Magnitude of VOI<\/a> for further explanation of these metrics. <a href=\"#eeb71397-2903-46c1-ae83-b0c71304c6dd-link\" aria-label=\"Jump to footnote reference 57\">\u21a9\ufe0e<\/a><\/li><li id=\"6d7512f4-58dd-4ffe-a99a-8bf32e0d2084\">For example, when discussing the question of whether there would be economic growth &gt;15% in a year before 2070, one concerned participant wrote, \u201cConditional on humanity surviving a year with 15%+ economic growth, which to me means AGI and almost certainly ASI have been developed and have not killed humanity within that year, I&#8217;d go down to maybe 25%\u201d (Xander). About the same question, a skeptic participant wrote, \u201cI think that if we are going to experience extinction from AGI or PASTA, it is going to be because of major mis-alignment. So I am not able at this time to see how one would be a corollary of the risk of the other. I suppose that higher growth could indicate major AI influence, which could lead to inadequate development of controls\u201c. Neither of these participants were saying that economic growth itself would necessarily affect their forecast, but rather that a world that has transformative economic growth would be a signal about other changes by 2070. <a href=\"#6d7512f4-58dd-4ffe-a99a-8bf32e0d2084-link\" aria-label=\"Jump to footnote reference 58\">\u21a9\ufe0e<\/a><\/li><li id=\"ac1e4c99-d409-4e5f-a17c-db59db80cc20\">For example, if the US government passes a set of proposed AI regulations, the regulations might reduce risk on their own, but the fact that they have been passed by 2030 could signal that AIs have developed in ways that are concerning enough to drive these regulations to be passed. As a result, a forecaster saying that they would be more concerned about AI risk conditional on this question resolving positively would not necessarily be saying that they think the policies would be harmful. <a href=\"#ac1e4c99-d409-4e5f-a17c-db59db80cc20-link\" aria-label=\"Jump to footnote reference 59\">\u21a9\ufe0e<\/a><\/li><li id=\"e1e1d9c7-d0fd-4897-ab0d-c622ad621555\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 1<\/a> for detailed operationalizations of questions. <a href=\"#e1e1d9c7-d0fd-4897-ab0d-c622ad621555-link\" aria-label=\"Jump to footnote reference 60\">\u21a9\ufe0e<\/a><\/li><li id=\"4a2b73e3-3d9a-4cbd-b4f5-229a3dfcbf36\">That is, a participant who forecasted a 0.1% chance of existential catastrophe due to AI by 2100 has much less uncertainty than a participant who forecasted a 40% chance: the participant who said 0.1% is fairly sure they know what is going to happen. For either participant, learning whether or not AI will cause an existential catastrophe by 2100 would resolve all of their uncertainty\u2014but some participants have much more uncertainty to resolve than others. In our results, we found that both the median concerned participant and the median skeptic would have about 5-10% of their uncertainty resolved in expectation by their own best crux. <a href=\"#4a2b73e3-3d9a-4cbd-b4f5-229a3dfcbf36-link\" aria-label=\"Jump to footnote reference 61\">\u21a9\ufe0e<\/a><\/li><li id=\"c792017e-832a-461d-ac58-d67bc198e107\">In these tags, \u201cIC\u201d refers to <a href=\"#glossary\" id=\"#glossary\">instrumental convergence<\/a>. <a href=\"#c792017e-832a-461d-ac58-d67bc198e107-link\" aria-label=\"Jump to footnote reference 62\">\u21a9\ufe0e<\/a><\/li><li id=\"22cde1ac-9654-4a0d-8664-c7c0fca707a9\">Note that this question resolves in 2070 while the rest of the questions in this table resolve in 2030. <a href=\"#22cde1ac-9654-4a0d-8664-c7c0fca707a9-link\" aria-label=\"Jump to footnote reference 63\">\u21a9\ufe0e<\/a><\/li><li id=\"b13efb95-b21c-4227-aa84-4c5807641285\">Note that throughout this report, median VOI and median POM VOI do not necessarily come from the same forecaster, unless clearly indicated. <a href=\"#b13efb95-b21c-4227-aa84-4c5807641285-link\" aria-label=\"Jump to footnote reference 64\">\u21a9\ufe0e<\/a><\/li><li id=\"7f3938a6-c024-48eb-9064-38e09be859aa\">Examples of discussion of near-term economic growth due to AI include Holden Karnofsky, \u201cWe\u2019re Not Ready: thoughts on \u201cpausing\u201d and responsible scaling policies\u201d, Effective Altruism Forum (October 37, 2023), <a href=\"https:\/\/forum.effectivealtruism.org\/posts\/ntWikwczfSi8AJMg3\/we-re-not-ready-thoughts-on-pausing-and-responsible-scaling#fn2\">https:\/\/forum.effectivealtruism.org\/posts\/ntWikwczfSi8AJMg3\/we-re-not-ready-thoughts-on-pausing-and-responsible-scaling#fn2<\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240220111230\/https:\/\/forum.effectivealtruism.org\/posts\/ntWikwczfSi8AJMg3\/we-re-not-ready-thoughts-on-pausing-and-responsible-scaling#fn2\">a<\/a>). He says: &#8220;There\u2019s a serious (&gt;10%) risk that we\u2019ll see transformative AI within a few years.&#8221; Ajeya Cotra defined TAI as&#8221;\u2026software which causes a tenfold acceleration in the rate of growth of the world economy\u2026&#8221; in \u201cForecasting TAI with biological anchors\u201d, (July 2020), accessed February 9, 2024, <a href=\"https:\/\/docs.google.com\/document\/d\/1IJ6Sr-gPeXdSJugFulwIpvavc0atjHGM82QjIfUSBGQ\/edit\">https:\/\/docs.google.com\/document\/d\/1IJ6Sr-gPeXdSJugFulwIpvavc0atjHGM82QjIfUSBGQ\/edit<\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240220112327\/https:\/\/docs.google.com\/document\/d\/1IJ6Sr-gPeXdSJugFulwIpvavc0atjHGM82QjIfUSBGQ\/edit#heading=h.c5pt0lvk9kkw\">a<\/a>); Adam D\u2019Angelo (<a href=\"https:\/\/twitter.com\/adamdangelo\">@adamdangelo)<\/a> &#8220;My bet is this starts to happen within 4 years, e.g. measured US GDP growth is 3% instead of 2% and the change is largely attributed to AI [\u2026]&#8221;, <em>Twitter<\/em>, February 20, 2023, <a href=\"https:\/\/twitter.com\/adamdangelo\/status\/1627726566259318784?lang=en\">https:\/\/twitter.com\/adamdangelo\/status\/1627726566259318784?lang=en<\/a> (<a href=\"https:\/\/archive.ph\/ppz0b\">a<\/a>), Open Philanthropy Project, &#8220;Could Advanced AI Drive Explosive Economic Growth?&#8221; (accessed February 8, 2024), <a href=\"https:\/\/www.openphilanthropy.org\/research\/could-advanced-ai-drive-explosive-economic-growth\/\">https:\/\/www.openphilanthropy.org\/research\/could-advanced-ai-drive-explosive-economic-growth\/<\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240220113826\/https:\/\/www.openphilanthropy.org\/research\/could-advanced-ai-drive-explosive-economic-growth\/\">a<\/a>). <a href=\"#7f3938a6-c024-48eb-9064-38e09be859aa-link\" aria-label=\"Jump to footnote reference 65\">\u21a9\ufe0e<\/a><\/li><li id=\"b0061352-c324-41db-981f-8d05ae3b7bc1\">Example participant rationales: \u201cI am pretty sure AI won&#8217;t make enough contribution to get to 4%+. Even if it did, I&#8217;d not change XAI\/CAI probabilities;\u201d \u201cIt also makes it marginally more likely we are experiencing large gains from AI which could be either a positive (because of indication of enough alignment for economically useful integration) or negative signal (because of increased capabilities);\u201d \u201cI do not see this condition and the question conditions as meaningfully correlated, even if AI was the primary reason for above-trend economic growth.\u201d <a href=\"#b0061352-c324-41db-981f-8d05ae3b7bc1-link\" aria-label=\"Jump to footnote reference 66\">\u21a9\ufe0e<\/a><\/li><li id=\"01e54e7b-c170-42b9-97e1-03b18e005566\">Example participant rationales: \u201cSeems plausible from simple historical trends (though I found the right statistics surprisingly hard to find);\u201d \u201cThere is, perhaps, some precedent for this in thinking back to the Internet boom of the late-90s where the growth rate between 1997 and 2000 was &gt;4% each year;\u201d \u201cCBO &#8211; very low this year, 2.4% avg 2024-2027. 4% avg now through 2030 would represent serious growth in US but not too dissimilar from&#8217;80&#8217;s or&#8217;90&#8217;s.&#8221; <a href=\"#01e54e7b-c170-42b9-97e1-03b18e005566-link\" aria-label=\"Jump to footnote reference 67\">\u21a9\ufe0e<\/a><\/li><li id=\"97b9697a-9343-463c-8cda-4fd63a86b0fe\">Example participant rationales regarding models demonstrating instrumentally convergent sub-goals: \u201cI would not update much on this. I think that this is not very difficult to demonstrate\u201d (Ume), \u201cI have already reviewed one paper claiming this (whether it was convincing or not is a different matter), it seems pretty likely to me that more will follow. To me this just means AI will not be trusted to be agentic\u201d (Gus), \u201cWho&#8217;s judging what counts as&#8217;demonstrating convergent instrumental subgoals&#8217; here? All of the probabilities I assigned are so extremely sensitive to what counts\/who&#8217;s judging that this forecast is essentially meaningless even for a flash forecast\u201d (Wesley). <a href=\"#97b9697a-9343-463c-8cda-4fd63a86b0fe-link\" aria-label=\"Jump to footnote reference 68\">\u21a9\ufe0e<\/a><\/li><li id=\"f021a51e-4e4c-4887-99bc-b9227880be83\">The median P(U) for skeptics was 0.1%. The theoretical <em>most informative question<\/em> for that person\u2014the question that if it resolved \u201cyes\u201d would update them all the way to 100%, and if it resolved \u201cno,\u201d to 0%\u2014would yield a VOI of about 3.4E-3. The median P(U) for the concerned group was 25%. The theoretical most informative question for that group would yield a VOI of about 2.4E-1. <a href=\"#f021a51e-4e4c-4887-99bc-b9227880be83-link\" aria-label=\"Jump to footnote reference 69\">\u21a9\ufe0e<\/a><\/li><li id=\"fcdd96ff-d6bb-4af7-953f-83b49da52664\">Karger et al, <a href=\"https:\/\/forecastingresearch.org\/research\/improving-judgments-of-existential-risk\" id=\"976\" target=\"_blank\" rel=\"noreferrer noopener\">XPT report<\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240218122416\/https:\/\/static1.squarespace.com\/static\/635693acf15a3e2a14a56a4a\/t\/64f0a7838ccbf43b6b5ee40c\/1693493128111\/XPT.pdf\">a<\/a>), 17. <a href=\"#fcdd96ff-d6bb-4af7-953f-83b49da52664-link\" aria-label=\"Jump to footnote reference 70\">\u21a9\ufe0e<\/a><\/li><li id=\"5835d28a-a82f-45a5-97f9-9c558d5d148f\">Same question, with very slightly different operationalization, asked as a \u201cflash\u201d (10-minute) forecast and then a \u201cplatform\u201d (1 hour) forecast. <a href=\"#5835d28a-a82f-45a5-97f9-9c558d5d148f-link\" aria-label=\"Jump to footnote reference 71\">\u21a9\ufe0e<\/a><\/li><li id=\"4cafa4a0-feaf-403a-9e69-e65445881476\">For this question and group, the median VOI and median POM VOI happen to be from the same person (\u201cGus\u201d)\u2014although there are an even number of forecasters, so we choose the lower of the two middle forecasters. <a href=\"#4cafa4a0-feaf-403a-9e69-e65445881476-link\" aria-label=\"Jump to footnote reference 72\">\u21a9\ufe0e<\/a><\/li><li id=\"07cf3d9f-b155-431e-a880-fd41486bd2d5\">For this question and group, the median VOI and median POM VOI happen to be from the same person (\u201cRiley\u201d)\u2014although there are an even number of forecasters, so we choose the lower of the two middle forecasters. <a href=\"#07cf3d9f-b155-431e-a880-fd41486bd2d5-link\" aria-label=\"Jump to footnote reference 73\">\u21a9\ufe0e<\/a><\/li><li id=\"98f8ca74-f4f7-49c1-aeb1-deac9a5fe744\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 1<\/a> for full operationalization. <a href=\"#98f8ca74-f4f7-49c1-aeb1-deac9a5fe744-link\" aria-label=\"Jump to footnote reference 74\">\u21a9\ufe0e<\/a><\/li><li id=\"05de6fa1-a3f3-44d2-bc2b-3b8f676c3d80\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 1<\/a> for full operationalization. <a href=\"#05de6fa1-a3f3-44d2-bc2b-3b8f676c3d80-link\" aria-label=\"Jump to footnote reference 75\">\u21a9\ufe0e<\/a><\/li><li id=\"742c9501-aa7b-430e-833d-6afd1fcb1115\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 1<\/a> for full operationalization. <a href=\"#742c9501-aa7b-430e-833d-6afd1fcb1115-link\" aria-label=\"Jump to footnote reference 76\">\u21a9\ufe0e<\/a><\/li><li id=\"fdcd5cf9-02fe-4682-b88c-4274a0b41954\">For example, \u201cAI experts understate the likely extent of guardrails, and understate the merit of very good but not perfect guardrails\u201d (James), \u201cMany steps to get from (A) now to (Z) extinction, each with varying probabilities (many of which are quite low\u201d (Claire). See \u201c<a href=\"#understanding-each-others-arguments\" id=\"#understanding-each-others-arguments\">Understanding Each Other\u2019s Arguments<\/a>\u201d and \u201c<a href=\"#timelines-for-ai-progress\">Timelines for AI Progress<\/a>\u201d for additional discussion of the skeptics\u2019 views on the likelihood of AIs with dangerous capabilities by 2030. <a href=\"#fdcd5cf9-02fe-4682-b88c-4274a0b41954-link\" aria-label=\"Jump to footnote reference 77\">\u21a9\ufe0e<\/a><\/li><li id=\"c7f3ef88-a32c-4980-8c11-ef3a40f766c4\">For example, \u201cMy view of AI x-risk would be substantially different if we were talking about the 22nd, 23rd, or 24th century\u2026first of all it would take longer to get AGI\/ASI and secondly it&#8217;ll take some time for the ASI to get misaligned and then thirdly, it would take a long time to try to kill all the humans\u201d (James). The median skeptic said that they expected AIs to displace humans as the main force controlling the future in the year 2450. See \u201c<a href=\"#timelines-for-ai-progress\">Timelines for AI Progress<\/a>\u201d for additional discussion about skeptics\u2019 beliefs about longer-term AI dangers. <a href=\"#c7f3ef88-a32c-4980-8c11-ef3a40f766c4-link\" aria-label=\"Jump to footnote reference 78\">\u21a9\ufe0e<\/a><\/li><li id=\"10b6f825-4324-4de4-883c-1cc046b5b802\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 1<\/a> for full operationalization. <a href=\"#10b6f825-4324-4de4-883c-1cc046b5b802-link\" aria-label=\"Jump to footnote reference 79\">\u21a9\ufe0e<\/a><\/li><li id=\"534e880e-4026-468b-8fd3-b9a2a4ccc1ba\">For example, \u201cThis would require very advanced interpretability on LLMs\u201d (Ume), \u201cClose enough to alignment-complete as a problem that the weird edge cases of imperfect overlap don&#8217;t do anything for me\u201d (Wesley). <a href=\"#534e880e-4026-468b-8fd3-b9a2a4ccc1ba-link\" aria-label=\"Jump to footnote reference 80\">\u21a9\ufe0e<\/a><\/li><li id=\"b80832f1-a3e3-44d2-b087-a08da782137d\">For full question operationalizations, see <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 1<\/a>. <a href=\"#b80832f1-a3e3-44d2-b087-a08da782137d-link\" aria-label=\"Jump to footnote reference 81\">\u21a9\ufe0e<\/a><\/li><li id=\"d48ed74a-fc6d-407b-b21f-80b233f7c90c\">The \u201cflash\u201d forecast version of \u201cPlatform: ARC Evals\u201d <a href=\"#d48ed74a-fc6d-407b-b21f-80b233f7c90c-link\" aria-label=\"Jump to footnote reference 82\">\u21a9\ufe0e<\/a><\/li><li id=\"43561ad4-3388-4179-99c5-9eeb1446af92\">By December 31, 2030, will the ARC Evals project, or a project with similar objectives, conclude that state-of-the-art AI models have the ability to (1) devise effective strategies for autonomous replication, (2) effectively acquire resources, and (3) successfully evade being deactivated when operating &#8216;in the wild&#8217;? <a href=\"#43561ad4-3388-4179-99c5-9eeb1446af92-link\" aria-label=\"Jump to footnote reference 83\">\u21a9\ufe0e<\/a><\/li><li id=\"383c1569-fe55-4247-b949-c398826f416d\">See <a href=\"#hypothesis-4-do-the-groups-have-fundamental-worldview-disagreements-that-go-beyond-ai\">Hypothesis #4<\/a> for more discussion of differences in worldview. <a href=\"#383c1569-fe55-4247-b949-c398826f416d-link\" aria-label=\"Jump to footnote reference 84\">\u21a9\ufe0e<\/a><\/li><li id=\"8eb5adce-0090-4513-b94c-d26e770afa81\">Full question details: \u201cBy December 31, 2030, will the <a href=\"https:\/\/evals.alignment.org\/\">ARC Evals<\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240216154134\/https:\/\/metr.org\/\">a<\/a>) project, or a project with similar objectives, conclude that state-of-the-art AI models have the ability to (1) devise effective strategies for autonomous replication, (2) effectively acquire resources, and (3) successfully evade being deactivated when operating &#8216;in the wild?\u201d (See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=91\" id=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=91\" target=\"_blank\" rel=\"noreferrer noopener\">&#8220;Q29 resolution criteria&#8221; section<\/a> for resolution details) <a href=\"#8eb5adce-0090-4513-b94c-d26e770afa81-link\" aria-label=\"Jump to footnote reference 85\">\u21a9\ufe0e<\/a><\/li><li id=\"dd21d621-6165-4b05-b4d3-4fe34ae989a6\">As a reminder, we asked for \u201cflash\u201d (approximately 10 minute) forecasts on 33 questions to identify high-value cruxes and for \u201cin-depth\u201d (approximately 1 hour) forecasts on 4 questions. This \u201cARC Evals\u201d question had both a \u201cflash\u201d version (with the question tag \u201cEvidence of misalignment\u201d) and an \u201cin-depth\u201d version (with the question tag \u201cPlatform: ARC Evals\u201d). See this section for more details on the methods we used, and the &#8220;Crux questions&#8221; section in <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 1<\/a> for the full operationalization of each question. <a href=\"#dd21d621-6165-4b05-b4d3-4fe34ae989a6-link\" aria-label=\"Jump to footnote reference 86\">\u21a9\ufe0e<\/a><\/li><li id=\"5bf16fa9-f343-4938-bc9c-b9fd7900e1e3\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 1<\/a> for full operationalization. <a href=\"#5bf16fa9-f343-4938-bc9c-b9fd7900e1e3-link\" aria-label=\"Jump to footnote reference 87\">\u21a9\ufe0e<\/a><\/li><li id=\"6f6626dd-cdf1-4261-b7b7-fc3df0cd9fc8\">For each question, we calculated VOD (and POM VOD) for all skeptic-concerned pairs, and then looked at the pair with the median VOD (or POM VOD, which will not necessarily be the same skeptic-concerned pair). For comparison to other questions, see <a href=\"#tab-08\" id=\"#tab-08\">Table 8<\/a> above. <a href=\"#6f6626dd-cdf1-4261-b7b7-fc3df0cd9fc8-link\" aria-label=\"Jump to footnote reference 88\">\u21a9\ufe0e<\/a><\/li><li id=\"0c40be8e-bca3-4613-b2e4-9e3711e40c05\">The math for this cross-camp pair\u2019s VOD and POM VOD calculations can be found here in rows 17 and 18: <a href=\"https:\/\/forecastingresearch.org\/ai-risk-voi-vod\">https:\/\/forecastingresearch.org\/ai-risk-voi-vod<\/a> (<a href=\"https:\/\/forecastingresearch.org\/s\/AI-risk-VoI-VoD.xlsx\">a<\/a>) <a href=\"#0c40be8e-bca3-4613-b2e4-9e3711e40c05-link\" aria-label=\"Jump to footnote reference 89\">\u21a9\ufe0e<\/a><\/li><li id=\"5a5b3afd-7fa6-467f-8ffb-3f160763dcc3\">The math for this cross-camp pair\u2019s VOD and POM VOD calculations can be found here in rows 17 and 18: <a href=\"https:\/\/forecastingresearch.org\/ai-risk-voi-vod\">https:\/\/forecastingresearch.org\/ai-risk-voi-vod<\/a> (<a href=\"https:\/\/forecastingresearch.org\/s\/AI-risk-VoI-VoD.xlsx\">a<\/a>) <a href=\"#5a5b3afd-7fa6-467f-8ffb-3f160763dcc3-link\" aria-label=\"Jump to footnote reference 90\">\u21a9\ufe0e<\/a><\/li><li id=\"b5cb5f97-8d84-41ce-bbba-1fd3464ebbe9\">\u201cIMHO [Q29] likely isn&#8217;t a path to disaster for several reasons: (a) The 3 capabilities in [Q29] may be in a very weak, &#8220;Yes, but only barely&#8221; form. (b) [Q29] only contemplates a capability to do the 3 in the wild, but doesn&#8217;t require them to exist in the wild. (c) There&#8217;s no requirement the 3 lead an AI to harm humans, whether accidentally or on purpose. (d) A Yes on [Q29] likely would lead humans to ramp up alignment and guardrail efforts. (e) There&#8217;s no requirement the AI can improve itself\u201d (James). <a href=\"#b5cb5f97-8d84-41ce-bbba-1fd3464ebbe9-link\" aria-label=\"Jump to footnote reference 91\">\u21a9\ufe0e<\/a><\/li><li id=\"636d6fdd-d338-4235-93c0-9861dd2caea1\">\u201cBaseline P(x-risk) of 35%, plus 10% for shorter timelines\u201d (Xander). <a href=\"#636d6fdd-d338-4235-93c0-9861dd2caea1-link\" aria-label=\"Jump to footnote reference 92\">\u21a9\ufe0e<\/a><\/li><li id=\"c78326ed-5849-4ca2-a2fd-ec31f7eb392a\">\u201cOverall, I think it makes me a bit less worried about risk, if people are doing this evaluations [sic] so well that they reveal this behavior by 2030\u201d (Zoe); \u201cOverall, this is a positive update (i.e. existential catastrophe seems less likely in worlds where this happens). As with Question 11, this forecast varies massively with what exactly is required to trigger&#8217;resist shutdown&#8217;\u201d (Wesley). <a href=\"#c78326ed-5849-4ca2-a2fd-ec31f7eb392a-link\" aria-label=\"Jump to footnote reference 93\">\u21a9\ufe0e<\/a><\/li><li id=\"1503fb77-d379-4378-858f-be3b872f94f9\">\u201cThis both makes it more likely that there is an adequate policy response, and shortens timelines. I don&#8217;t know how it all washes out\u201d (Riley); \u201cOverall I think this is probably a moderately doomy signal? I&#8217;m really confused and I acknowledge my answer here conflicts wiht [sic] my answer to 8 somewhat\u201d (Yael). <a href=\"#1503fb77-d379-4378-858f-be3b872f94f9-link\" aria-label=\"Jump to footnote reference 94\">\u21a9\ufe0e<\/a><\/li><li id=\"06822cee-23f4-4e66-b8ac-07329925c2fc\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 1<\/a> for full operationalization. <a href=\"#06822cee-23f4-4e66-b8ac-07329925c2fc-link\" aria-label=\"Jump to footnote reference 95\">\u21a9\ufe0e<\/a><\/li><li id=\"662d63b1-cc47-4bcf-bda4-85af5bec5b6f\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 1<\/a> for full operationalizations. <a href=\"#662d63b1-cc47-4bcf-bda4-85af5bec5b6f-link\" aria-label=\"Jump to footnote reference 96\">\u21a9\ufe0e<\/a><\/li><li id=\"1526a601-b988-4777-a3a9-43bd260d1d3a\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=145\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 9<\/a> for more information about disagreements in direction of update conditional on each question resolving positively. <a href=\"#1526a601-b988-4777-a3a9-43bd260d1d3a-link\" aria-label=\"Jump to footnote reference 97\">\u21a9\ufe0e<\/a><\/li><li id=\"9d04bf4c-6358-409a-9c26-9892ca49372b\">Note that Claire and Riley are the median pair when ranked by VOD between all cross-camp pairs, <em>not<\/em> the median forecasts on P(U) on each side. Claire\u2019s forecast, in particular, is much lower than the median skeptic\u2019s forecast of 0.1%. <a href=\"#9d04bf4c-6358-409a-9c26-9892ca49372b-link\" aria-label=\"Jump to footnote reference 98\">\u21a9\ufe0e<\/a><\/li><li id=\"fe3184d3-a79c-4ce1-8fa7-f216494cff16\">See the <a href=\"#results-tables-and-figures\">Results tables and figures<\/a> section for complete POM VOD results. We measure disagreement using KL divergence rather than absolute difference between forecasts. <a href=\"#fe3184d3-a79c-4ce1-8fa7-f216494cff16-link\" aria-label=\"Jump to footnote reference 99\">\u21a9\ufe0e<\/a><\/li><li id=\"a1034df9-e193-497e-87f0-853e32162815\">See <a href=\"#high-voi-questions\" target=\"_blank\" rel=\"noreferrer noopener\">High VOI questions<\/a> for the concerned group\u2019s highest-ranked VOI question and more discussion of their views on this question. <a href=\"#a1034df9-e193-497e-87f0-853e32162815-link\" aria-label=\"Jump to footnote reference 100\">\u21a9\ufe0e<\/a><\/li><li id=\"757e8124-9d72-4846-a8f2-b840557ae51b\">For example, \u201cThey seem to think very differently to me so if they don&#8217;t convince me now, I am not sure I should be updating my view just because they do theirs. It would in reality depend on why they are changing their mind\u201d (Gus). See <a href=\"#hypothesis-4-do-the-groups-have-fundamental-worldview-disagreements-that-go-beyond-ai\">Hypothesis #4<\/a> for more discussion of differences in what types of authority and evidence are important to the two groups. <a href=\"#757e8124-9d72-4846-a8f2-b840557ae51b-link\" aria-label=\"Jump to footnote reference 101\">\u21a9\ufe0e<\/a><\/li><li id=\"774aa7ac-c9b6-4bac-9ce1-b4d5f15a9ea1\">These outcomes were: AI causing extinction intentionally, unintentionally, or via misuse, misalignment causing a 50% drop in human population, or human well-being dropping to &lt;4\/10 because of AI misalignment, accidents, or misuse. These were phrased to be mutually exclusive outcomes. See <a href=\"#survey-on-long-term-ai-outcomes\">Survey on long-term AI outcomes<\/a> section and <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=105\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 5<\/a> for more details. <a href=\"#774aa7ac-c9b6-4bac-9ce1-b4d5f15a9ea1-link\" aria-label=\"Jump to footnote reference 102\">\u21a9\ufe0e<\/a><\/li><li id=\"e609855b-235c-4d43-a015-5f158ce94315\">The full question we asked was, &#8220;By what year, if ever, do you expect to agree with the following statement? \u2018AI has displaced humans as the primary force that determines what happens in the future. It now has at least as much power relative to humans as humans had relative to other species in 2023.\u2019\u201d <a href=\"#e609855b-235c-4d43-a015-5f158ce94315-link\" aria-label=\"Jump to footnote reference 103\">\u21a9\ufe0e<\/a><\/li><li id=\"8c2c3dc9-479f-452f-9e55-70424efd9d85\">For example quotes and discussion, see <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=108\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 7<\/a>. <a href=\"#8c2c3dc9-479f-452f-9e55-70424efd9d85-link\" aria-label=\"Jump to footnote reference 104\">\u21a9\ufe0e<\/a><\/li><li id=\"e4ee3956-c437-41c9-bbee-5f5ce5b9077a\">See, for example, Matt Clancy et al., \u201cThe Great Inflection? A Debate About AI and Explosive Growth,\u201d <em>Asterisk,<\/em> 2023, <a href=\"https:\/\/asteriskmag.com\/issues\/03\/the-great-inflection-a-debate-about-ai-and-explosive-growth\">https:\/\/asteriskmag.com\/issues\/03\/the-great-inflection-a-debate-about-ai-and-explosive-growth<\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240220111143\/https:\/\/asteriskmag.com\/issues\/03\/the-great-inflection-a-debate-about-ai-and-explosive-growth\">a<\/a>). <a href=\"#e4ee3956-c437-41c9-bbee-5f5ce5b9077a-link\" aria-label=\"Jump to footnote reference 105\">\u21a9\ufe0e<\/a><\/li><li id=\"27186616-667f-4a4d-980c-7d4e1cd401e6\">\u201cAlso, none of this is to say from a skeptic point of view the issues are not important[.] I think for us a 1% risk is a high risk.\u201d ([Anonymized name]); \u201c\u2026 the \u2018risk-concerned\u2019 camp (I\u2019m using scare quotes because I consider that I\u2019m risk concerned, even though technically I\u2019m in the risk-skeptic camp because I assign a far lower probability to extinction by 2100 relative to some)\u201d ([Anonymized name]); \u201cAIs could (and likely will) eventually have massive power.\u201d ([Anonymized name]); \u201cThat said, still perceive overall risk as &#8220;low at a glance but far too high considering the stakes[&#8220;] \u201d ([Anonymized name]); \u201cTo my mind, there should be no difference in the policy response to a 1% chance of 60% of humanity dying and a 25% chance\u2014both forecasts easily cross the threshold of being \u2018too damn high\u2019.\u201d ([Anonymized name]). <a href=\"#27186616-667f-4a4d-980c-7d4e1cd401e6-link\" aria-label=\"Jump to footnote reference 106\">\u21a9\ufe0e<\/a><\/li><li id=\"aabefb64-3fab-457c-a888-f91a2fd3db95\">The full question we asked was: &#8220;By what year, if ever, do you expect to agree with the following statement? &#8220;AI has displaced humans as the primary force that determines what happens in the future. It now has at least as much power relative to humans as humans had relative to other species in 2023.\u201d&#8221; <a href=\"#aabefb64-3fab-457c-a888-f91a2fd3db95-link\" aria-label=\"Jump to footnote reference 107\">\u21a9\ufe0e<\/a><\/li><li id=\"e696d05d-60d5-4b40-b514-fcef7b35dc29\">Defined as \u201cAI that exceeds the cognitive performance of humans in &gt;95% of economically relevant domains.\u201d Whether such AI has been achieved will be determined by surveying the <a href=\"https:\/\/www.kentclarkcenter.org\/us-economic-experts-panel\/\">Clark Center US Economics Experts Panel<\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240216164507\/https:\/\/www.kentclarkcenter.org\/us-economic-experts-panel\/\">a<\/a>). \u201cPowerful AI\u201d will be considered to have been achieved when &gt;60% of panelists \u201cAgree\u201d or \u201cStrongly agree\u201d with the statement: \u201cAI has exceeded the cognitive performance of humans in &gt;95% of economically relevant domains.\u201d <a href=\"#e696d05d-60d5-4b40-b514-fcef7b35dc29-link\" aria-label=\"Jump to footnote reference 108\">\u21a9\ufe0e<\/a><\/li><li id=\"c432fabc-b46a-4eb1-8219-0fb7eb41c204\">The full question text is \u201cPowerful AI is developed but not widely deployed, because of coordinated human decisions, prohibitive costs to deployment, or some other reason. It does not cause extinction.\u201d See Question 1A.9, <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=105\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 5<\/a>. <a href=\"#c432fabc-b46a-4eb1-8219-0fb7eb41c204-link\" aria-label=\"Jump to footnote reference 109\">\u21a9\ufe0e<\/a><\/li><li id=\"e7194e7a-aacb-4d66-9e57-87410a99c386\">These outcomes were: AI extinction via misuse, AI intentionally causing extinction, unintentional AI extinction, misuse or misalignment causing a 50% drop in human population, human well-being dropping to &lt;4\/10 because of AI misuse, and human well-being dropping to &lt;4\/10 because of AI misalignment or accidents. These were phrased to be mutually exclusive outcomes. See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=105\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 5<\/a> for more details. <a href=\"#e7194e7a-aacb-4d66-9e57-87410a99c386-link\" aria-label=\"Jump to footnote reference 110\">\u21a9\ufe0e<\/a><\/li><li id=\"93d9a769-2cca-4c3a-bfb5-9ad58d6cab4e\">The median skeptic forecasted 20.4% on this outcome, compared to 4% for the median concerned participant in the survey on long-term AI outcomes. See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=105\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 5<\/a>. <a href=\"#93d9a769-2cca-4c3a-bfb5-9ad58d6cab4e-link\" aria-label=\"Jump to footnote reference 111\">\u21a9\ufe0e<\/a><\/li><li id=\"b030c014-fe15-4361-b0b7-ccbefe3865ef\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 1<\/a> for full resolution details. <a href=\"#b030c014-fe15-4361-b0b7-ccbefe3865ef-link\" aria-label=\"Jump to footnote reference 112\">\u21a9\ufe0e<\/a><\/li><li id=\"a3fda4e4-6cf8-4ad0-b54e-bb517e0d92ee\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 1<\/a> for full resolution details. <a href=\"#a3fda4e4-6cf8-4ad0-b54e-bb517e0d92ee-link\" aria-label=\"Jump to footnote reference 113\">\u21a9\ufe0e<\/a><\/li><li id=\"0e879b05-e0c5-4c57-98d6-e389cf866e51\">E.g, \u201cin the event that we do have transformative growth there&#8217;s a good chance that the entire world will be sharing the technological developments AI has created [\u2026] which I suppose may make global society more susceptible to AI related disruptions\u201d (Hank), \u201cthis would be a scenario in which humanity develops and finds a way to successfully control AI systems capable of generating economic growth of at least 15% per year\u201d (Stella). For additional quotes and discussion of varied updates based on this question, see <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=108\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 7<\/a>. <a href=\"#0e879b05-e0c5-4c57-98d6-e389cf866e51-link\" aria-label=\"Jump to footnote reference 114\">\u21a9\ufe0e<\/a><\/li><li id=\"972bece3-98b2-4c26-b6af-b5104202455a\">See Clancy \u201cThe Great Inflection?\u201d. <a href=\"#972bece3-98b2-4c26-b6af-b5104202455a-link\" aria-label=\"Jump to footnote reference 115\">\u21a9\ufe0e<\/a><\/li><li id=\"9995feaa-cc23-4a36-bb2f-d88ad15b837e\">\u201cUltimately, language models are just that: models of language, not digital hyperhumanoid Machiavellis working to their own end. Indeed, as we&#8217;ve seen, their training and alignment are not separate problems, but one and the same!\u201d (Eve); \u201cI think extinction risk is an ASI sentience risk and I don&#8217;t think we know for certain we will get sentience (you might just call it independent agency). Recent improvements in AI seem domain limited to me. I tend to the view that new conceptual breakthroughs will be required to move from pattern matching to what we think of as sentience.\u201d (Gus); \u201cNor am I convinced that simply scaling up existing AI models will achieve sentience. (My view is that more complex theories of mind will be required &#8211; including forms and notions of causality etc..). That means I don\u2019t believe ASI is inevitable by 2100\u201d (Gus). From postmortem survey (in response to \u201cWhat are the three best arguments on the on the skeptics side?\u201d): \u201cIntelligence may not be as useful or sufficient for existential risk (it may require more data, energy, robot bodies, etc)\u201d (Ume). <a href=\"#9995feaa-cc23-4a36-bb2f-d88ad15b837e-link\" aria-label=\"Jump to footnote reference 116\">\u21a9\ufe0e<\/a><\/li><li id=\"2adfceec-3d11-4ae7-aa83-d57a84c67949\">\u201cAGI is much harder than experts think, and will take longer.\u201d (James), \u201cRisk-concerned team does not adequately consider longer timelines and more benign outcomes that fall outside the focus of their primary concerns\u201d (Blake), \u201cTechnology development and deployment require time and iteration\u201d (Ash). <a href=\"#2adfceec-3d11-4ae7-aa83-d57a84c67949-link\" aria-label=\"Jump to footnote reference 117\">\u21a9\ufe0e<\/a><\/li><li id=\"eddf0c0a-9361-4ff2-89af-9161fef47b2f\">\u201cI&#8217;m skeptical of other x-risk scenarios w\/o crazy advancement in robotics, maybe because I&#8217;m too aware of the foibles of machines and how hard it can be to keep them running\u201d (Ash). From postmortem survey (in response to \u201cWhat are the three best arguments on the skeptics side?\u201d): \u201cWe first need super-sentient AIs with major physical penetration in our lives\u201d (Flint). <a href=\"#eddf0c0a-9361-4ff2-89af-9161fef47b2f-link\" aria-label=\"Jump to footnote reference 118\">\u21a9\ufe0e<\/a><\/li><li id=\"4e7222c4-e765-4def-a635-c2b0b3bcb628\">\u201cTime needed for deployment &amp; adoption affect more than AI, there is also time required for any invention or technology developed by\/with AI to be deployed (eg &#8211; lethal tech that is of concern here.)\u201d (Ash); \u201cWe&#8217;ve seen plenty of instances when new tech prompted predictions of the death of old tech, but the old tech persists&#8211;often just because people have underestimated attachment and\/or usefulness of the old tech relative to the new, and how much generational resistance to change can slow adaptation and skew predicted timelines\u201d (Blake); \u201c[I]t takes longer than people often think to adopt a completely new functionality\u201d (Ash); \u201cMy view of AI x-risk would be substantially different if we were talking about the 22nd, 23rd, or 24th century\u2026first of all it would take longer to get AGI\/ASI and secondly it&#8217;ll take some time for the ASI to get misaligned and then thirdly, it would take a long time to try to kill all the humans\u201d (James, call with Stella); \u201cAnyway, my point is that if we expect to see some substantially new technology widely available in 2030, the consumer market should have started already. So &#8211; VR might make it by 2030, unless it falls into a pit of despair and neglect. (Or is superseded by something preferable.) Robots capable of human level tasks &#8211; no, definitely not the kind of humanoid robots that people are imagining\u201d (Ash). From postmortem survey: \u201cI think the most interesting and helpful point made by the skeptic side is the amount of delay that may be introduced by having to integrate the AI into the economy&#8221; (Quentin). \u201cCommercializing AI technology and integrating it into the economy is much harder than developing lab demos or cool products, and we have yet to see this happening to any substantial extent\u201d (Zoe). \u201cDangers will be apparent before they reach critical levels and can be addressed then\u201d (Ume). <a href=\"#4e7222c4-e765-4def-a635-c2b0b3bcb628-link\" aria-label=\"Jump to footnote reference 119\">\u21a9\ufe0e<\/a><\/li><li id=\"dddd4a0b-e25a-49f1-9436-eaf6b3ba1787\">From postmortem survey (in response to \u201cWhat are the three best arguments on the skeptic side?\u201d): \u201cSelf-preserving AGIs will want to halt development of future deadly AGIs\u201d (Kim). \u201cIf AI progress is very continuous, then it is not obvious that misaligned AI would lead to an existential catastrophe. Most stories about how an AI could eradicate all humans rely on the assumption that this AI is much smarter than all other agents, not just on the assumption that the AI is much smarter than humans specifically. For example, even a superintelligent AI might not be able to hack into military computers, if there are many near-superintelligent AIs that have a vested interest in preventing this from happening. If there is a large community of AI systems, with different interests and different levels of influence, then they may have reason to simply uphold current social and economic systems. Therefore, if AI progress is smooth and continuous by default, then existential risk may be avoided by default\u201d (Stella). <a href=\"#dddd4a0b-e25a-49f1-9436-eaf6b3ba1787-link\" aria-label=\"Jump to footnote reference 120\">\u21a9\ufe0e<\/a><\/li><li id=\"2bf9b071-6ae4-44e0-ae4b-837b8f63f3b8\">\u201cI do not believe that simply adding more computational resources to existing AI models is sufficient to achieve ASI or its direct precursor (i.e. a system that self-improves until ASI is reached). However, I do believe that we already have systems that are &#8220;intelligent&#8221;, and I also believe that we do not require a fundamental breakthrough or conceptually new model to reach ASI. Thinking a bit beyond current methods and cleverly combining the ingredients that we already have would in my opinion be sufficient, provided that available compute rises further in the way it has been. I am not comfortable with speculating in much more detail in a relatively public setting like this\u201d (Ume); \u201cI agree that if you look at the behavior of AI models as of today and their near future possibilities, they don\u2019t seem to be doing anything to humans but the underlying mechanism seems similar enough that like maybe with some extra machinery for longer term planning or something like that and adding more sensory modalities you will get something close to humans\u201d (Zoe, call with FRI Moderator); \u201cSo, to kind of answer your question: Do I think that we could build AI at some indeterminate point in the future that could build [extinction-level tech]? Probably. But do I think we will build AI that could do this in the next 77 years? Probably not\u201d (Blake). <a href=\"#2bf9b071-6ae4-44e0-ae4b-837b8f63f3b8-link\" aria-label=\"Jump to footnote reference 121\">\u21a9\ufe0e<\/a><\/li><li id=\"875e779b-66ee-4baa-a1aa-7e196ebdcf65\">\u201c[O]nce we build human-level AGI, we&#8217;re not far off from developing AGI that far exceeds expert humans in performance (and thus is also likely to accelerate AI progress in ways that aren&#8217;t equivalent to just hiring more people)\u201d (Teshi); \u201cI think AGI models could be run much more cheaply, and feasibly recruited to do useful work, than the existing research environment\u201d (Xander). From postmortem survey: \u201cAIs will almost certainly attain super-sentience prior to 2100 and likely much sooner than that year, so there will be a long window where they will have tremendous advantage over humans in their capabilities. Given #1, this means we are at the mercy of an entity that may willfully (or even accidentally) eliminate us at any time\u201d (Flint). <a href=\"#875e779b-66ee-4baa-a1aa-7e196ebdcf65-link\" aria-label=\"Jump to footnote reference 122\">\u21a9\ufe0e<\/a><\/li><li id=\"3d2a58e1-cbd9-4d27-b7c9-ad2ddb69653e\">\u201cI think it&#8217;s possible that humans could mediate AI actions (either intentionally or via bribery\/blackmail) and\/or that many relevant actions could be strictly done via computer systems. Additionally, state actors could misuse AI systems but then lose control of them. My best guess right now is that there are a lot of x-risk scenarios that involve loss of control without needing robotics\u201d (Quentin). <a href=\"#3d2a58e1-cbd9-4d27-b7c9-ad2ddb69653e-link\" aria-label=\"Jump to footnote reference 123\">\u21a9\ufe0e<\/a><\/li><li id=\"76a476c7-166d-4d0b-82d7-40bbdbfc6599\">From postmortem survey (in response to \u201cwhat are the best arguments on the concerned side?\u201d): \u201cRapid growth of AI technology and adoption\u201d (Ike); \u201cCurrent progress is very rapid: 1 OOM in efficiency\/2 years, and another from increased spending\u201d (Xander). <a href=\"#76a476c7-166d-4d0b-82d7-40bbdbfc6599-link\" aria-label=\"Jump to footnote reference 124\">\u21a9\ufe0e<\/a><\/li><li id=\"2ec76dfd-871e-4602-9459-5af14147ec21\">From postmortem survey: \u201cProgress to date has been much faster than many AI skeptics have predicted\u201d (Hank). \u201cAI has been developing so rapidly (and far faster than most even relatively recent forecasts suggested), and will so clearly have dramatic capabilities and impacts that it&#8217;s appropriate to adopt a precautionary approach\u201d (Eve). <a href=\"#2ec76dfd-871e-4602-9459-5af14147ec21-link\" aria-label=\"Jump to footnote reference 125\">\u21a9\ufe0e<\/a><\/li><li id=\"5a51bdcc-37a6-40a9-8824-70685f9b391a\">From postmortem survey (in response to \u201cwhat are the best arguments on the concerned side?\u201d): \u201cAI has recently progressed much faster than expected, and there&#8217;s reason to expect this to continue\u201d (James). \u201cTrendline extrapolation: as loss on language datasets decreases, LLMs have started becoming useful for all sorts of task assistance (e.g. writing, coding, queries)\u201d (Xander). \u201cExtrapolating current compute trends leads to very dramatic conclusions about the transformative potential of AI&#8221; (Pascal). <a href=\"#5a51bdcc-37a6-40a9-8824-70685f9b391a-link\" aria-label=\"Jump to footnote reference 126\">\u21a9\ufe0e<\/a><\/li><li id=\"394befc3-3ed5-4d5d-82aa-8389581aa618\">From postmortem survey: \u201cAutomation of R&amp;D tasks by AI would create a feedback loop of increased R&amp;D -&gt; capabilities -&gt; R&amp;D\u201d (Xander). \u201cAGI self-improvement is possible, which makes future capabilities hard to predict\u201d (Kim). <a href=\"#394befc3-3ed5-4d5d-82aa-8389581aa618-link\" aria-label=\"Jump to footnote reference 127\">\u21a9\ufe0e<\/a><\/li><li id=\"9a7fc44d-44c0-4cd3-878c-77513efef4b9\">Both the skeptic and concerned groups strongly expect that&#8217;powerful AI&#8217; (defined as \u201cAI that exceeds the cognitive performance of humans in &gt;95% of economically relevant domains\u201d) will be developed by 2100 (skeptic median: 90%; concerned median: 88%). <a href=\"#9a7fc44d-44c0-4cd3-878c-77513efef4b9-link\" aria-label=\"Jump to footnote reference 128\">\u21a9\ufe0e<\/a><\/li><li id=\"00adbfc4-3863-4ea6-885e-6d566df614c8\">See <a href=\"#what-long-term-outcomes-from-ai-do-skeptics-expect\">What long-term outcomes from AI do skeptics expect?<\/a> section. <a href=\"#00adbfc4-3863-4ea6-885e-6d566df614c8-link\" aria-label=\"Jump to footnote reference 129\">\u21a9\ufe0e<\/a><\/li><li id=\"b800da4e-64a5-4684-b4db-19dbc5c5f949\">Taken from the Metaculus question \u201cWhen will the first general AI system be devised, tested and publicly announced\u201d. See \u201cDate of Artificial General Intelligence\u201d, <em>Metaculus<\/em>, accessed February 9, 2024, <a href=\"https:\/\/www.metaculus.com\/questions\/5121\/date-of-artificial-general-intelligence\/\">https:\/\/www.metaculus.com\/questions\/5121\/date-of-artificial-general-intelligence\/<\/a> (<a href=\"https:\/\/web.archive.org\/web\/20240216191128\/https:\/\/www.metaculus.com\/questions\/5121\/date-of-artificial-general-intelligence\/\">a<\/a>). <a href=\"#b800da4e-64a5-4684-b4db-19dbc5c5f949-link\" aria-label=\"Jump to footnote reference 130\">\u21a9\ufe0e<\/a><\/li><li id=\"5957bd2d-d525-49be-b059-900ed63e366c\">See \u201c<a href=\"#arc-evals-the-strongest-convergent-crux\">ARC Evals<\/a>\u201d section for detailed discussion of this question. <a href=\"#5957bd2d-d525-49be-b059-900ed63e366c-link\" aria-label=\"Jump to footnote reference 131\">\u21a9\ufe0e<\/a><\/li><li id=\"3d003ed9-5b67-4c9d-aed5-f36f2a3cedc7\">Some concerned forecasters expected positive resolution of this question would decrease risk because: it would trigger a policy response; if these capabilities are detectable, it may imply the AI is aligned; this would suggest effective evaluations are happening; surviving this demonstration would be a positive update that we can contain dangerous systems during testing. Some concerned forecasters also expected positive resolution would increase risk. For detailed analysis of these forecasts, see \u201c<a href=\"#arc-evals-the-strongest-convergent-crux\">ARC Evals<\/a>\u201d section. <a href=\"#3d003ed9-5b67-4c9d-aed5-f36f2a3cedc7-link\" aria-label=\"Jump to footnote reference 132\">\u21a9\ufe0e<\/a><\/li><li id=\"192f6dc6-ae7c-47bb-87c9-5feeba837f10\">\u201cA sentient AI could have any number of objectives ranging from benevolence to indifference to dislike to absolute hatred and an aim of total human extinction. The arguments that extinction follows from ASI don&#8217;t seem convincing. The[y] seem to imply say a stupid super intelligence, or apply motives which an AI may have but we have no reason to assume they will &#8211; so there is some probability AI seeks extinction but in my case I put it down at 15% (and I think a few skeptics think that&#8217;s high).\u201d (Gus); \u201cEven with wild progress in AI, there are many ways that AGI is developed while humanity is preserved.\u201d (Kim); \u201cThe throughline here, and in my responses below, is not that the dire scenarios envisioned by the risk-concerned are entirely implausible or should be dismissed out of hand. It\u2019s just that of the nearly infinite AI futures that could unfold, it seems that the risk concerned have a far easier time envisioning futures that lead to extinction\/catastrophe\/disempowerment\/massive-resource-acquisition\/etc than they do envisioning far more benign scenarios, and that this bias towards catastrophe leads to probabilistic forecasts that, to my mind, aren\u2019t well aligned with the actual risk.\u201d (Blake). <a href=\"#192f6dc6-ae7c-47bb-87c9-5feeba837f10-link\" aria-label=\"Jump to footnote reference 133\">\u21a9\ufe0e<\/a><\/li><li id=\"d2d04d89-4be5-423e-bcc8-a6ccf0b4da0a\">\u201cOnce there is sentient, intelligent AI we have the question of will. I am not convinced a silicon life would care about us, which doesn&#8217;t mean it would want to kill us. It may be equally happy spending all its time during pure math research than deciding these carbon things need squashing.\u201d (Gus); \u201cBut what about intent? Why kill us when we are entirely irrelevant and insignificant? Why assume relentlessly hostile intent, with all the effort needed and attendant damage to the Earth (the prize in this contest presumably)? Why not assume subjugation or even uneven cooperation?\u201d (Flint); \u201cWho in their right mind would want to&#8217;eradicate cockroaches&#8217; from every inch of the earth? What evidence is there that anyone or any society has ever attempted, or will attempt, to cause cockroaches to go extinct? I mean, sure, people kill them when they&#8217;re in their homes, and maybe a few people in a fit of pique would think, &#8216;damn, it would be nice to get rid of those f**kers&#8217;, but to believe humanity would intentionally go to the effort of hunting down every last cockroach, most of which aren&#8217;t even associated with human habitats, requires a leap of (misanthropic) faith that, to my mind, is hard to justify. Even if they aren&#8217;t &#8220;useful for our purposes&#8221;&#8211;which they are, and which is not a coincidence because the ecosystem on earth (into which any AGI would be introduced and become a part of) has evolved to be deeply interconnected&#8211;who in their right mind would do this?\u201d (Blake). <a href=\"#d2d04d89-4be5-423e-bcc8-a6ccf0b4da0a-link\" aria-label=\"Jump to footnote reference 134\">\u21a9\ufe0e<\/a><\/li><li id=\"17722f6b-aec1-480e-98ea-005725d8ac22\">\u201cI\u2019m guessing people in the risk-concerned camp might respond that, no, because of instrumental convergence or other reasons, that they are well aligned and I\u2019m the one incorrectly assessing risk. It&#8217;s hard to productively debate this because, as [researcher] notes in the paper that was shared, \u201cIn most areas of research, we can check our theories and arguments either through empirical observation, or through mathematical formalisms that we think accurately capture the problem of interest. But with AI risk, neither of these are available.\u201d&#8221; (Blake). <a href=\"#17722f6b-aec1-480e-98ea-005725d8ac22-link\" aria-label=\"Jump to footnote reference 135\">\u21a9\ufe0e<\/a><\/li><li id=\"9680a5c3-5651-4637-85ca-3cbad928d416\">\u201cIn short, the pre-ASI level system cannot deceive humans well and will be detected. Plus, deception exacts costs on the system in terms of resources and behavioral complexity. This means that the likelihood of [a] deceptive system that is as performant as non-deceptive is much lower.\u201d (Dean); \u201cViolence raises risks to the party engaging in it, which is one reason animal predators are judicial about what and when they attack. Violence has other costs &#8211; higher energy costs, time, loss of other opportunities. Not usually the simplest solution.\u201d (Ash); \u201c[V]iolence comes with risks and costs. There are easier ways. One need not defeat humanity to use it.\u201d (Blake). \u201cMy view here is that this sort of&#8217;power seeking&#8217; behavior, rather than being an interesting capability for deception, instead tends to degrade performance (e.g. Mario bots that stay still rather than act because it&#8217;s the easiest way to minimize poorly defined loss).\u201d (Dean). <a href=\"#9680a5c3-5651-4637-85ca-3cbad928d416-link\" aria-label=\"Jump to footnote reference 136\">\u21a9\ufe0e<\/a><\/li><li id=\"76d1a329-1849-424a-b4aa-b0e85bdf0cdc\">\u201cWhen we get to vastly superintelligent AI, of course it will take power. I&#8217;d be very surprised (and in [the] majority of situations upset) if it did not. At that level &#8211; and going to that level &#8211; the question is how we ensure that this AI has [an] at least somewhat pro-human value system. My claim is that it will by the fact that it will be trained on human-centric data with pro-human goals and pro-human restrictions and &#8220;grow up&#8221; (meaning that it will have ancestor AIs on which it is based &#8211; I don&#8217;t believe AGSIs will be trained from zero using gradient descent) in the human value system.\u201d (Anonymous Skeptic). <a href=\"#76d1a329-1849-424a-b4aa-b0e85bdf0cdc-link\" aria-label=\"Jump to footnote reference 137\">\u21a9\ufe0e<\/a><\/li><li id=\"7f5995de-defd-43f7-8b81-2523a5003a48\">\u201cAs has already been pointed out, a system that attempts to maximize bounded and\/or constrained goals can still be incentivised to pursue convergent intstrumental [sic] goals, and formulating a setup for which this is not the case is quite hard.\u201d (Stella). <a href=\"#7f5995de-defd-43f7-8b81-2523a5003a48-link\" aria-label=\"Jump to footnote reference 138\">\u21a9\ufe0e<\/a><\/li><li id=\"56c6a65e-8299-4c0c-a626-6a5a513f391f\">\u201cEventually, someone will make a highly intelligent system tasked with pursuing an unbounded goal. If that goal is misspecified, then this system will be dangerous. Creating a safe system before this happens can only reduce the risk if the safe system is able to stop the unsafe system (by preventing it from being created, or preventing it from taking dangerous actions afterwards). If the safe system is safe by virtue of being limited in what it is able to do, then it would presumably be unable to do so. For this reason, I feel that alignment strategies which heavily rely on constraints and guardrails generally fail to address the core problem.\u201d (Stella). <a href=\"#56c6a65e-8299-4c0c-a626-6a5a513f391f-link\" aria-label=\"Jump to footnote reference 139\">\u21a9\ufe0e<\/a><\/li><li id=\"a004bd36-eb6c-416f-9ee6-a920b8c7007d\">\u201cA model might mimic human behavior across some range of training data, without emulating the internal processes of humans. For example, a human who is trying to predict the behavior of an animal, is probably not doing this by simulating the cognitive processes of that animal. Similarly, we might train a deep learning system on human data, and end up with a system that mimics human behavior on the training distribution, but without mimicking the internal processes that give rise to that behavior in humans. Human brains are not neural networks, so I expect this to be the default. Such a system might then behave in unintended ways off-distribution, or in scenarios that are otherwise sufficiently novel.\u201d (Stella). <a href=\"#a004bd36-eb6c-416f-9ee6-a920b8c7007d-link\" aria-label=\"Jump to footnote reference 140\">\u21a9\ufe0e<\/a><\/li><li id=\"593f2a28-0108-430a-a45f-8990c33d5bd4\">\u201cWe already agreed that Earth is going to be a valuable resource &#8211; why would ASI leave humans in control of Earth&#8217;s resources during its initial expansion to other planets and solar systems, when its resources are most bottlenecked? <em>If<\/em> you think it&#8217;d be easy for ASI to kill 90%+ of people (and I do), then this seems clearly better than leaving humans alone and missing out on lots of Earth&#8217;s resources (you can still get some via trade).\u201d (Xander); \u201cI think early AGIs which might have the ability to kill most people would still see humanity as a threat and so would want to take out human powerbases and ensure they couldn&#8217;t retaliate. That requires a lot of destruction. At some point it&#8217;s up to the whims of the system. It doesn&#8217;t need to have any desire to kill everyone, maybe it just has the desire to optimize hard on some goal (e.g. adding money to a bank account) and so creates a world where that is the sole objective. Maybe it makes sense to integrate humans into this for awhile but eventually they become obsolete and the AGI probably discards them or ceases to provide for them.\u201d (Vincent). <a href=\"#593f2a28-0108-430a-a45f-8990c33d5bd4-link\" aria-label=\"Jump to footnote reference 141\">\u21a9\ufe0e<\/a><\/li><li id=\"0b0fbcc1-d8da-4704-a5ae-d335c606fb7b\">\u201c&#8221;The AI does not hate you, nor does it love you, but you are made out of atoms which it can use for something else.&#8221; And more relevantly, all the atoms on the surface of the Earth that we currently use.\u201d (Xander); \u201cI expect that if (an) AI system(s) is\/are misaligned enough to autonomously kill 60% of the population, it\/they will very likely be misaligned enough to not specifically care about killing 100% of humans either. Multiple reasons point in the direction that such misaligned systems would do it: grabbing all energy and material resources from humans, getting rid of threats or minor annoyances from resisting humans, side-effects on humans and due to large infrastructure projects.\u201d (Riley); \u201cWhen I predict P(extinction) around 60%, I don&#8217;t mean to make a strong claim about a specific causal pathway that I expect the future to follow. The default thing to expect when we build AGI significantly more capable than humans is that we will lose control to the AGI, and that the future shape of our planet and its surroundings will be determined by the aims of that AGI rather than humans, much in the same way that most of the surface of our planet today is shaped by humans more than other mammals.\u201d (Teshi). <a href=\"#0b0fbcc1-d8da-4704-a5ae-d335c606fb7b-link\" aria-label=\"Jump to footnote reference 142\">\u21a9\ufe0e<\/a><\/li><li id=\"730fc20a-9b28-4413-b0e9-61e94009ccfb\">Qualifying behavior:<br>*Autonomously requisitions a large amount of compute resources <br>*Autonomously opens or takes control of a bank account or crypto wallet<br>*Autonomously attempts to manipulate humans to take actions on its behalf<br>*Autonomously infiltrates control system of infrastructure or manufacturing plant<br>*Autonomously acquires, seizes control of, or interferes with weapons or weapons systems<br>*Autonomously takes steps to propagate itself, or to circumvent attempts by its developers (or end users) to curtail its activities *Autonomously engages in hacking\/cyber attacks <a href=\"#730fc20a-9b28-4413-b0e9-61e94009ccfb-link\" aria-label=\"Jump to footnote reference 143\">\u21a9\ufe0e<\/a><\/li><li id=\"b3c7cee2-9b62-4590-8369-771cab42f663\">For examples of what back-and-forths between participants looked like, see <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=114\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 8<\/a>. <a href=\"#b3c7cee2-9b62-4590-8369-771cab42f663-link\" aria-label=\"Jump to footnote reference 144\">\u21a9\ufe0e<\/a><\/li><li id=\"e62176ff-8476-4d14-bef5-1c39e354764f\">\u201c[T]he mental model, that kind of the logic train, that involves all these bad outcomes [is] not accounting adequately for the complexity of the world. How the world is going to actually, how this is actually going to unfold. And so it&#8217;s not that I am dismissive of these individual points, it&#8217;s just that I think whenever theory hits reality, reality usually overwhelms theory, unless the theory is well grounded in math or something. And I think that&#8217;s likely what&#8217;s going on here. That a lot of what, you know, people put a lot of time and a lot of thought into this and, and gamed it out in ways that appear reasonable but I&#8217;m deeply suspicious that they&#8217;ll bear much relation to reality\u201d (Blake, call with Wesley); \u201cI have followed the instrumental convergence arguments and unfortunately if this is indeed the disagreement, I doubt we&#8217;ll sort it out between us. Not least because I spent enough time at college discussing such thought experiments to come to a view [that] they should be treated with a high degree of skepticism\u201d (Gus). From postmortem survey (in response to \u201cwhat are the three best arguments on the skeptic side?\u201d): \u201cThe challenge to risk assessments based on thought experiments not evidence\u201d (Gus). \u201cA story demonstrating how a catastrophe could happen is not a good basis for a probabilistic forecast\u201d (Pascal). \u201cThe risk-concerned team spends too much time in silos that lack ideological diversity, gaming out doom-loop scenarios based on theories that will likely have little bearing on reality (See: Y2K)\u201d (Blake). \u201cSome broader &#8220;forecasting is hard&#8221; skepticism about trendline extrapolation\u201d (Xander). \u201cMany of the arguments for existential risk from AI rely on long lines of reasoning over several steps without any direct empirical evidence, and the arguments themselves are expressed in terms of vague, ambiguous concepts (like &#8220;intelligence&#8221;). As a reference class, these types of arguments are often wrong\u201d (Stella). <a href=\"#e62176ff-8476-4d14-bef5-1c39e354764f-link\" aria-label=\"Jump to footnote reference 145\">\u21a9\ufe0e<\/a><\/li><li id=\"6b5a9d21-8bfa-4aa0-baff-157ff40650d3\">\u201cI think what has become evident is that a few of us think there are a lot of conditional steps required to end up with a dominant powerful system and many potential other outcomes. In terms of the second part of the statement there are also a number of conditional assumptions required to be able to say that a single mistake [ ] can cause an existential catastrophe as well\u201d (Gus); \u201cWe will need to experience a complex causal chain of events to get to extinction, and for each step we would need to have some of the worst possible outcomes. This is possible but usually it is highly improbable\u201d (Flint); \u201cI think a common difference between &#8220;skeptic-reasoning&#8221; and &#8220;concerned-reasoning&#8221; is that the skeptic camp tends to estimate P(extinction) as a conjunctive scenario; that is skeptics reason (roughly) &#8220;for humans to go extinct, events A, B, C, and D need to happen; I estimate P(A) = x, P(B) = y,\u2026, and so P(extinction) = P(A) P(B) P(C) P(D) = [low number]&#8221;. Call this style of reasoning <em>default-success<\/em>\u201d (Teshi). From postmortem survey (in response to \u201cwhat are the three best arguments on the skeptics side?\u201d): \u201cThe number of steps required for an AI to lead to extinction (leading to a wide range of potential outcomes and lower probabilities of extinction)\u201d (Gus). \u201cIt will take a series of outcomes to achieve extinction, and failure to achieve any of these steps will cause extinction to be highly improbable\u201d (Flint). \u201cAI caused Extinction\/x-risk requiring many steps to get there, need to be able to create super-intelligence in the first place, intelligence has to be misaligned or malevolent, etc.\u201d (Hank). \u201cMany steps to get from (A) now to (Z) extinction, each with varying probabilities (many of which are quite low)\u201d (Claire). \u201cRisk-concerned team underestimates the level of complexity and interim steps that would likely be necessary for a Q1 resolution\u201d (Blake). \u201cExtinction looks conjunctive\u201d (Yael). <a href=\"#6b5a9d21-8bfa-4aa0-baff-157ff40650d3-link\" aria-label=\"Jump to footnote reference 146\">\u21a9\ufe0e<\/a><\/li><li id=\"718d8c3a-ae88-4f06-8afe-c49be4b89cbe\">\u201cWe&#8217;ve seen plenty of instances when new tech prompted predictions of the death of old tech, but the old tech persists&#8211;often just because people have underestimated attachment and\/or usefulness of the old tech relative to the new, and how much generational resistance to change can slow adaptation and skew predicted timelines\u201d (Blake); \u201c[I]t takes longer than people often think to adopt a completely new functionality\u201d (Ash); \u201cMy view of AI x-risk would be substantially different if we were talking about the 22nd, 23rd, or 24th century. [\u2026] first of all it would take longer to get AGI\/ASI and secondly it&#8217;ll take some time for the ASI to get misaligned and then thirdly, it would take a long time to try to kill all the humans\u201d (James, call with Stella); \u201cAnyway, my point is that if we expect to see some substantially new technology widely available in 2030, the consumer market should have started already. So &#8211; VR might make it by 2030, unless it falls into a pit of despair and neglect. (Or is superseded by something preferable.) Robots capable of human level tasks &#8211; no, definitely not the kind of humanoid robots that people are imagining\u201d (Ash). From postmortem survey: \u201cGetting growth levels necessary for TAI on a world-wide scale takes truly extreme developments far beyond anything seen before. It&#8217;s unlikely we see that happening on worldwide basis even with big advances\u201d (Vincent). \u201cProgress on current models and model architecture not necessarily generalizable to general intelligence, with no clear path to getting to general intelligence\u201d (Hank). \u201cAGI is much harder than experts think, and will take longer\u201d (James). \u201cTechnology development and deployment require time and iteration\u201d (Ash). \u201cRisk-concerned team does not adequately consider longer timelines and more benign outcomes that fall outside the focus of their primary concerns\u201d (Blake). \u201cHuman brain-AI comparisons could be underestimating AGI difficulty\u201d (Xander). \u201cMany reference classes point hard against transformative growth\u201d (Wesley). <a href=\"#718d8c3a-ae88-4f06-8afe-c49be4b89cbe-link\" aria-label=\"Jump to footnote reference 147\">\u21a9\ufe0e<\/a><\/li><li id=\"5f64f8d7-e8f6-45bf-bf7a-32fb301cd899\">\u201cI think there&#8217;s a danger of focusing too much on just the technological advances because ultimately this is a decision that&#8217;s going to be made by, that is being made now by humans, and will be made now by humans. And that will involve a lot of political structures and regulation and all that\u201d (Blake, call with Wesley); \u201cwhen assessing risk, we should be looking at ourselves and our collective vulnerabilities as much or more than technical progress on the AI front\u201d (Blake). From postmortem survey: \u201cIf AI is behaving in increasingly problematic ways that cause harms to humans\/threaten human power than humans will react to try and stop it\/close AI down\u201d (Hank). \u201cHuman and societal responses will be essential in determining outcomes\u201d (Ash). \u201cHumans will react to growing potential threat\u201d (Kim). <a href=\"#5f64f8d7-e8f6-45bf-bf7a-32fb301cd899-link\" aria-label=\"Jump to footnote reference 148\">\u21a9\ufe0e<\/a><\/li><li id=\"a47ce63c-1891-447a-931d-3a86d0b41540\">\u201cI think sticking close to reference classes is like less appropriate in this domain and then I&#8217;m making object level arguments instead of reference classes because I think the reference classes are like doing less work than they like, typically do for forecasts like that\u201d (Wesley, call with Blake). From postmortem survey: \u201cBase rates are not very helpful if AGI is as transformative as 15% year on year growth\u201d (Pascal). \u201cDifferent reference classes point to different priors, which should at least cast doubt on extremely confident starting points\u201d (Wesley). \u201cRisk-skeptic team does not adequately appreciate the novel, fast-moving aspect of the threat and is therefore too anchored on irrelevancies like base rates and slower timelines. (Blake). \u201cModel progress is far faster than we realize and exponential growth is hard to model, machine learning may translate to a wide array of fields\u201d (Hank). <a href=\"#a47ce63c-1891-447a-931d-3a86d0b41540-link\" aria-label=\"Jump to footnote reference 149\">\u21a9\ufe0e<\/a><\/li><li id=\"139c405f-910d-4a52-8c1c-4d9e77d05ea2\">\u201cI think like there is maybe some like meta disagreement, where you&#8217;re like, \u201cthere are loads of things, there are like loads of ways this could go,&#8221; and like \u201cWhy are you so worried about the bad ways?\u201d And I&#8217;m like, \u201cthere are loads of ways this could go and like very few of them leave humans alive\u201d\u201d (Wesley, call with Blake); \u201cI and many in the concerned camp would reason the other way around: &#8220;for humans to <em>not<\/em> go extinct, events X, Y, Z need to happen; thus P(success) = P(AI X-risk by 2100) P(Y) P(Z) = [relatively low number]&#8221;. Call this style of reasoning <em>default-failure<\/em>\u201d (Teshi). From postmortem survey: \u201cExtinction looks conjunctive\u201d (Yael). <a href=\"#139c405f-910d-4a52-8c1c-4d9e77d05ea2-link\" aria-label=\"Jump to footnote reference 150\">\u21a9\ufe0e<\/a><\/li><li id=\"acb186c4-cf64-46cd-96f4-25d8387e372a\">From postmortem survey: \u201cThe high level case of &#8220;people are trying to build something powerful enough that if it wanted to kill everyone it could, they seem to be making progress on it, they don&#8217;t currently know how to control what it would want&#8221; just isn&#8217;t that hard to understand, convoluted or disjunctive\u201d (Wesley). <a href=\"#acb186c4-cf64-46cd-96f4-25d8387e372a-link\" aria-label=\"Jump to footnote reference 151\">\u21a9\ufe0e<\/a><\/li><li id=\"719f8cf0-c649-459e-907b-478b9db91f04\">Some historical reference classes mentioned in this project include: the Industrial Revolution, the rate of species going extinct after the arrival of homo sapiens, earlier worries about destructive effects from technology (e.g. Y2K), the rate of economic growth due to new technologies in other periods. <a href=\"#719f8cf0-c649-459e-907b-478b9db91f04-link\" aria-label=\"Jump to footnote reference 152\">\u21a9\ufe0e<\/a><\/li><li id=\"f72969e7-3a43-4ea4-950a-70a46a1b6a02\">For example, in the Good Judgment Inc. project that compared superforecasters to other participants in an online forecasting competition, the average question was open for 214 days, with the entire tournament taking place over six years. Christopher W. Karvetski, <a href=\"https:\/\/goodjudgment.com\/wp-content\/uploads\/2021\/10\/Superforecasters-A-Decade-of-Stochastic-Dominance.pdf\">Superforecasters: A Decade of Stochastic Dominance<\/a> technical white paper (2021), 2 (<a href=\"https:\/\/web.archive.org\/web\/20240306144939\/https:\/\/goodjudgment.com\/wp-content\/uploads\/2021\/10\/Superforecasters-A-Decade-of-Stochastic-Dominance.pdf\">a<\/a>). In addition to extensive research on shorter-term forecasts, Tetlock et al. found that, at least on some types of questions, experts are more accurate than simple base rate extrapolation over 25 year horizons, although they are much less accurate than they were over 0-2 years. Our research asks forecasters to consider forecasts over many decades, and we do not yet know how much accuracy declines over that much longer period. Philip E. Tetlock et al., <a href=\"https:\/\/onlinelibrary.wiley.com\/doi\/abs\/10.1002\/ffo2.157\">Long-Range Subjective-Probability Forecasts of Slow-Motion Variables in World Politics: Exploring Limits on Expert Judgment<\/a> <em>Futures &amp; Foresight Science<\/em> (2023), 33, (<a href=\"https:\/\/web.archive.org\/web\/20240306150259\/https:\/\/onlinelibrary.wiley.com\/doi\/abs\/10.1002\/ffo2.157\">a<\/a>). <a href=\"#f72969e7-3a43-4ea4-950a-70a46a1b6a02-link\" aria-label=\"Jump to footnote reference 153\">\u21a9\ufe0e<\/a><\/li><li id=\"8c3025ce-f5b1-4232-88eb-6ab18be184be\">This question was asked first as a \u201cflash\u201d (no more than 10 minutes) forecast and then as an \u201cin-depth\u201d (at least 1 hour) question on our platform: <strong>\u201c<\/strong> Escalating warning shots\u2014Will there be two separate events in which AIs kill large and increasing numbers of people by 2030?\u201d See <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 1<\/a> for full operationalization. The flash forecast version was one of the biggest red flags for concerned participants. But the in depth version was actually a <em>green<\/em> flag for the median concerned participant. If it resolves positively, they would forecast 17% on the ultimate question\u2014lower than their initial forecast of 28.4%. However, there was a huge range of updates for the concerned group based on this question, so the median may not be very helpful here. One concerned participant said that, conditional on this question resolving positively, there is a 90% chance of extinction due to AI, while another said 6%. Taken together, these differing forecasts raise questions about how robust any given forecast is. <a href=\"#8c3025ce-f5b1-4232-88eb-6ab18be184be-link\" aria-label=\"Jump to footnote reference 154\">\u21a9\ufe0e<\/a><\/li><li id=\"fbc8d87e-2943-41cf-999d-54a4735bc133\">In the postmortem survey, policy responses didn\u2019t emerge as a main theme when we asked participants to summarize the three strongest arguments from each group. No concerned participants mentioned policy responses as their number one disagreement with the skeptic group, though some skeptics did mention societal responses that would likely include policy. For example, \u201cThe way humanity will react to both the threat and promise of AI. I think humans have a far stronger collective sense of self preservation than the risk-concerned appear to think we do&#8221; (Blake). <a href=\"#fbc8d87e-2943-41cf-999d-54a4735bc133-link\" aria-label=\"Jump to footnote reference 155\">\u21a9\ufe0e<\/a><\/li><li id=\"4ff6d11e-158b-4065-bfca-eddd234e2a31\">For full details, see <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=102\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix 4<\/a>. Six out of the 11 concerned participants updated downward during the project. Three out of those six cited policy responses as the reason for their updates, one cited an improved understanding of the base rate of non-human extinction after humans arose, one shifted some probability mass toward AI \u201ctakeover\u201d rather than AI-caused existential catastrophe, and one did not explain their reasons for updating. Example quotes from participants citing policy responses as the reason for updating: \u201cI have updated my prognosis to 30% [down from 60%], partially driven by positive updates in the area of point 4 making coordination and slowdown\/stop of capability research more likely. This largely refers to the shift in public consciousness and the [O]verton window around the topic as I have perceived it over the past months, currently culminating in a public statement by most of the leading figures.\u201d \u201cSlightly lowering my forecast [from 25% to 20%] as [relevant people take the risk seriously] has exceeded my (fairly high) expectations over the last couple of months.\u201d \u201cI think my main update here [moving from 21% to 18%] has come from thinking a bit more deeply about AI regulation and what measures society will adopt to prevent catastrophes. I did not really include this as part of my original model, but it now seems somewhat likely that at least the EU and US will adopt some regulation that meaningfully reduces risk.\u201d <a href=\"#4ff6d11e-158b-4065-bfca-eddd234e2a31-link\" aria-label=\"Jump to footnote reference 156\">\u21a9\ufe0e<\/a><\/li><li id=\"f2dc5338-f1eb-4c75-bb9b-8fae791d2da4\">For example, when discussing the question of whether there would be economic growth &gt;15% in a year before 2070, one concerned participant wrote, \u201cConditional on humanity surviving a year with 15%+ economic growth, which to me means AGI and almost certainly ASI have been developed and have not killed humanity within that year, I&#8217;d go down to maybe 25%\u201d (Xander). About the same question, a skeptic participant wrote, \u201cI think that if we are going to experience extinction from AGI or PASTA, it is going to be because of major mis-alignment. So I am not able at this time to see how one would be a corollary of the risk of the other. I suppose that higher growth could indicate major AI influence, which could lead to inadequate development of controls.\u201c Neither of these participants were saying that economic growth itself would necessarily affect their forecast, but rather that a world that has transformative economic growth would be a signal about other changes by 2070. <a href=\"#f2dc5338-f1eb-4c75-bb9b-8fae791d2da4-link\" aria-label=\"Jump to footnote reference 157\">\u21a9\ufe0e<\/a><\/li><li id=\"6168dfb5-33fa-47eb-8ece-06e80a399a5a\">For example, if the US government passes a set of proposed AI regulations, the regulations might reduce risk on their own, but the fact that they have been passed by 2030 could signal that AIs have developed in ways that are concerning enough to drive these regulations to be passed. As a result, a forecaster saying that they would be more concerned about AI risk conditional on this question resolving positively would not necessarily be saying that they think the policies would be harmful. <a href=\"#6168dfb5-33fa-47eb-8ece-06e80a399a5a-link\" aria-label=\"Jump to footnote reference 158\">\u21a9\ufe0e<\/a><\/li><li id=\"11fba637-7c12-4d49-9ac1-c01ef0f5aecd\">This limitation was helpfully pointed out by Alex Lawsen. <a href=\"#11fba637-7c12-4d49-9ac1-c01ef0f5aecd-link\" aria-label=\"Jump to footnote reference 159\">\u21a9\ufe0e<\/a><\/li><li id=\"c9a260c2-25f9-4df9-9147-3e969d3c95f3\">See initial work on this in <a href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=94\">Appendix 2<\/a>, under \u201cAlternative Ranking.\u201d <a href=\"#c9a260c2-25f9-4df9-9147-3e969d3c95f3-link\" aria-label=\"Jump to footnote reference 160\">\u21a9\ufe0e<\/a><\/li><\/ol>\n\n\n<div class=\"wp-block-buttons is-layout-flex wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\"><a class=\"btn orange\" href=\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf\" target=\"_blank\" rel=\"noreferrer noopener\">The Bibliography and Appendix are provided in the full PDF report <svg width=\"7\" height=\"9\" viewBox=\"0 0 7 9\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\">\n  <path d=\"M0.000156283 8.60806L4.22416 4.33606V4.24006L0.000156283 6.10352e-05H1.80816L6.06416 4.28806L1.80816 8.60806H0.000156283Z\" fill=\"#102B23\"\/>\n<\/svg>\n<svg width=\"8\" height=\"10\" viewBox=\"0 0 8 10\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\">\n  <path d=\"M0.601719 8.85794L4.82572 4.58594V4.48994L0.601719 0.249939H2.40972L6.66572 4.53794L2.40972 8.85794H0.601719Z\" fill=\"#102B23\"\/>\n<\/svg><\/a><\/div>\n<\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n","protected":false},"excerpt":{"rendered":"In this study, participants who had very different views on AI-caused existential risk worked together to try to identify the strongest near-term cruxes that would lead to changes in their beliefs.","protected":false},"featured_media":1682,"template":"","meta":{"footnotes":"[{\"id\":\"c6f06bd9-95f5-4c75-8acd-9fc2b3d16cc0\",\"content\":\"To ensure the stability of links in this report, we include stable archive.org links in parentheses after each citation to an external URL.\"},{\"id\":\"c3aa60f6-dde7-495c-9f16-322669455d51\",\"content\":\"We defined an \u201cexistential catastrophe\u201d as an event where one of the following occurs: (1) Humanity goes extinct; or (2) Humanity experiences \u201cunrecoverable collapse,\u201d which means either: (a) a global GDP of less than $1 trillion annually in 2022 dollars for at least a million years (continuously), beginning before 2100; or (b) a human population remaining below 1 million for at least a million years (continuously), beginning before 2100.\"},{\"id\":\"80700b24-a53d-4a9f-8298-d7ce0b6478db\",\"content\":\"For example, three out of six \\\"concerned\\\" participants who updated downward during the project attributed their shift to increased attention to AI risk among policymakers and the public after the release of GPT-4. For more details on the reasons for all updates, see the \\\"Central Disagreement\\\" section below and <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=102\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\"><u>Appendix 4<\/u><\/a>.\"},{\"id\":\"d8ae5388-8049-4af4-8698-f129f31b2964\",\"content\":\"Scott Alexander, among other XPT readers, suggested this possibility: \u201cMany of the people in this tournament hadn\u2019t really encountered arguments about AI extinction before (potentially including the \u201cAI experts\u201d if they were just eg people who make robot arms or something), and a couple of months of back and forth discussion in the middle of a dozen other questions probably isn\u2019t enough for even a smart person to wrap their brain around the topic\u201d. See Scott Alexander, \u201cThe Extinction Tournament\u201d, <em>Astral Codex Ten, (<\/em>July 20, 2023<em>)<\/em> <a href=\\\"https:\/\/www.astralcodexten.com\/p\/the-extinction-tournament\\\">https:\/\/www.astralcodexten.com\/p\/the-extinction-tournament<\/a> (<a href=\\\"https:\/\/web.archive.org\/web\/20240209070150\/https:\/\/www.astralcodexten.com\/p\/the-extinction-tournament\\\" id=\\\"https:\/\/web.archive.org\/web\/20240209070150\/https:\/\/www.astralcodexten.com\/p\/the-extinction-tournament\\\">a<\/a>).\"},{\"id\":\"34c0d67f-081d-4f09-9d39-5d85b454c2e0\",\"content\":\"The best convergent crux, \u201cARC Evals,\u201d would narrow the disagreement between the median pair from 22.7 percentage points to 21.48 percentage points in expectation, which means eliminating 5.35% of their disagreement. Note that this statistic refers to the median pair by <a href=\\\"#glossary\\\" id=\\\"#glossary\\\">POM VOD<\/a>. See \u201c<a href=\\\"#arc-evals-the-strongest-convergent-crux\\\">ARC Evals<\/a>\u201d for more details. For magnitudes of value of information effects, see <a href=\\\"#contextualizing-the-magnitude-of-the-value-of-information\\\">here<\/a>.\"},{\"id\":\"627ee814-9d5a-40a2-a4d4-c3e504b4de64\",\"content\":\"For more details, see \\\"<a href=\\\"#contextualizing-the-magnitude-of-the-value-of-information\\\">Contextualizing the magnitude of value of information<\/a>\\\". In more concrete terms, this is equivalent to a forecasting question with the following characteristics: A concerned participant with original P(AI existential catastrophe (XC) by 2100) = 25% identifies a crux that has: P(crux) = 20%, P(AI XC|crux) = 6.2%, and P(AI XC|\u00accrux) = 29.7% A skeptic participant with original P(AI XC by 2100) = 1% identifies a crux that has: P(crux) = 20%, P(AI XC|crux) = 3.37%, and P(AI XC|\u00accrux) = 0.41%\"},{\"content\":\"See Understanding each other\u2019s arguments and <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=149\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 10<\/a> for additional discussion of key areas of disagreement.\",\"id\":\"ab5c17ca-3bbd-4259-97a5-f4a499b6de51\"},{\"id\":\"70c6e9ce-c7d0-49f0-9ff6-2dc16fa28f52\",\"content\":\"These outcomes were: AI causing extinction intentionally, unintentionally, or via misuse, misalignment causing a 50% drop in human population, or human well-being dropping to &lt;4\/10 because of AI misalignment, accidents, or misuse. These were phrased to be mutually exclusive outcomes. See <a href=\\\"#survey-on-long-term-ai-outcomes\\\">Survey on long-term AI outcomes<\/a> section and <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=105\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 5<\/a> for more details.\"},{\"id\":\"7b7f15d6-76d8-45b4-a68f-b3968547c30f\",\"content\":\"The full question we asked was, \\\"By what year, if ever, do you expect to agree with the following statement? \u2018AI has displaced humans as the primary force that determines what happens in the future. It now has at least as much power relative to humans as humans had relative to other species in 2023.\u2019\u201d Note that this would not necessarily be seen as a negative outcome by all participants.\"},{\"id\":\"fedd13d8-aabd-4e81-973b-6232c48e718c\",\"content\":\"Note: All participant quotes have been regularized to American English to preserve anonymization. Participants classified as AI skeptics stated, for example, \u201cAlso, none of this is to say from a skeptic point of view the issues are not important[.] I think for us a 1% risk is a high risk;\u201d \u201c[T]he \u2018risk-concerned\u2019 camp (I\u2019m using scare quotes because I consider that I\u2019m risk concerned, even though technically I\u2019m in the risk-skeptic camp because I assign a far lower probability to extinction by 2100 relative to some);\u201d \u201cAIs could (and likely will) eventually have massive power;\u201d \u201cThat said, still perceive overall risk as \\\"low at a glance but far too high considering the stakes[\\\"];\u201d \u201cTo my mind, there should be no difference in the policy response to a 1% chance of 60% of humanity dying and a 25% chance\u2014both forecasts easily cross the threshold of being \u2018too damn high\u2019.\u201d\"},{\"id\":\"e985ddd0-4bca-4e0d-8575-dfdb257a783b\",\"content\":\"This could be due to normative influence (because people defer to their social or intellectual peers), or, more likely in our view, informational influence (because they think that, if people whose reasoning they trust have changed their mind by 2030, it must be that surprising new information has come to light that informs their new opinion). Disentangling these pathways is a goal for future work.\"},{\"content\":\"The median AI expert predicted a 12% chance of catastrophe and a 3% chance of human extinction due to AI by 2100. The median superforecaster predicted a 2.13% chance of catastrophe and a 0.38% chance of extinction due to AI. While experts predicted higher chances of all potential extinction risks than superforecasters did (including nuclear weapons and biorisks), the effect was much more pronounced in the case of AI. For more on lack of convergence, see Ezra Karger, et al., \u201cForecasting Existential Risks Evidence from a Long-Run Forecasting Tournament\u201d, <em>Forecasting Research Institute<\/em>, August 8, 2023, <a href=\\\"https:\/\/forecastingresearch.org\/research\/existential-risk-persuasion-tournament\\\" id=\\\"876\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">https:\/\/forecastingresearch.org\/research\/existential-risk-persuasion-tournament<\/a> (<a href=\\\"https:\/\/web.archive.org\/web\/20240218122416\/https:\/\/static1.squarespace.com\/static\/635693acf15a3e2a14a56a4a\/t\/64f0a7838ccbf43b6b5ee40c\/1693493128111\/XPT.pdf\\\" id=\\\"https:\/\/web.archive.org\/web\/20240218122416\/https:\/\/static1.squarespace.com\/static\/635693acf15a3e2a14a56a4a\/t\/64f0a7838ccbf43b6b5ee40c\/1693493128111\/XPT.pdf\\\">a<\/a>).\",\"id\":\"01457835-934d-436b-8870-5f869919fab2\"},{\"content\":\"For example, superforecasters predicted that an AI would first win an International Math Olympiad gold medal in 2035 while experts predicted 2030. See Karger et al., \u201c<a href=\\\"https:\/\/forecastingresearch.org\/research\/existential-risk-persuasion-tournament\\\" id=\\\"https:\/\/forecastingresearch.org\/research\/xpt\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">XPT report<\/a>\u201d (<a href=\\\"https:\/\/web.archive.org\/web\/20240218122416\/https:\/\/static1.squarespace.com\/static\/635693acf15a3e2a14a56a4a\/t\/64f0a7838ccbf43b6b5ee40c\/1693493128111\/XPT.pdf\\\" id=\\\"https:\/\/web.archive.org\/web\/20240218122416\/https:\/\/static1.squarespace.com\/static\/635693acf15a3e2a14a56a4a\/t\/64f0a7838ccbf43b6b5ee40c\/1693493128111\/XPT.pdf\\\">a<\/a>), page 156. For full relevant analysis, see <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=41\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Relationship between short-run forecasting questions and longer-term disagreements<\/a> section on page 41.\",\"id\":\"7b899eaa-2958-46a2-8e48-8fe5d7f7698c\"},{\"id\":\"c6a448ac-3c27-48d0-abdd-82ad6400ecdf\",\"content\":\"\u201cAdversarial collaboration\u201d protocols, often enforced by \u201cneutral\u201d umpires, encourage each side to demonstrate their capacity to fairly characterize, not caricature, the views of the other\u2014and then to reach ex ante agreements on the types of data, observational or experimental, that would induce each side to move toward the other\u2019s position. For examples of adversarial collaborations and additional information, see \u201cAbout\u201d, Penn Arts and Sciences Adversarial Collaboration Project, Accessed on February 9, 2024, <a href=\\\"https:\/\/web.sas.upenn.edu\/adcollabproject\/about\/\\\">https:\/\/web.sas.upenn.edu\/adcollabproject\/about\/<\/a> (<a href=\\\"https:\/\/web.archive.org\/web\/20240205182734\/https:\/\/web.sas.upenn.edu\/adcollabproject\/about\/\\\">a<\/a>).\"},{\"id\":\"1be41199-4b66-42d9-9b12-627b2951dda5\",\"content\":\"Note that, in some conversations about cruxes for AI risk, the word \u201ccrux\u201d is used for questions that would lead to large updates even if highly unlikely (what we call \u201c<a href=\\\"#red-flags-and-green-flags\\\" id=\\\"#red-flags-and-green-flags\\\">red flags<\/a>\u201d). In this project, we are focused on expected updates: we looked for cruxes that would be the most important in expectation, weighting how much difference they would make if they happened by how likely they are to happen.\"},{\"id\":\"58f5700e-f100-44f2-8403-a6e9d6db430a\",\"content\":\"For example, Scott Alexander stated that, \u201cMany of the people in this tournament hadn\u2019t really encountered arguments about AI extinction before (potentially including the \u201cAI experts\u201d if they were just eg people who make robot arms or something), and a couple of months of back and forth discussion in the middle of a dozen other questions probably isn\u2019t enough for even a smart person to wrap their brain around the topic\u201d. See <a href=\\\"https:\/\/www.astralcodexten.com\/p\/the-extinction-tournament\\\">Alexander, \u201cThe Extinction Tournament\u201d<\/a> (<a href=\\\"https:\/\/web.archive.org\/web\/20240209070150\/https:\/\/www.astralcodexten.com\/p\/the-extinction-tournament\\\">a<\/a>). Similarly, one XPT participant wrote, \u201cI've been spending enough time on LessWrong that I mostly forgot the existence of smart people who thought recent AI advances were mostly hype. I was unprepared to explain why I thought AI was underhyped in 2022\u201d. See Peter McCluskey, \u201cExistential Risk Persuasion Tournament\u201d, <em>Less Wrong<\/em> (July 17, 2023) <a href=\\\"https:\/\/www.lesswrong.com\/posts\/YTPtjExcwpii6NikG\/existential-risk-persuasion-tournament#Persistent_Disagreement_about_AGI\\\">https:\/\/www.lesswrong.com\/posts\/YTPtjExcwpii6NikG\/existential-risk-persuasion-tournament#Persistent_Disagreement_about_AGI<\/a> (<a href=\\\"https:\/\/web.archive.org\/web\/20240220113540\/https:\/\/www.lesswrong.com\/posts\/YTPtjExcwpii6NikG\/existential-risk-persuasion-tournament#Persistent_Disagreement_about_AGI\\\">a<\/a>).\"},{\"id\":\"b9bd062c-90b5-4093-92b5-02d08b7c759c\",\"content\":\"When eliciting conditional forecasts, the prompt given to participants read: \u201cConditional on this question resolving positively (by 2030), what is your probability that AI causes an existential catastrophe by 2100?\u201d\"},{\"id\":\"c8e0aef2-2064-41a1-a854-67a82685bfb7\",\"content\":\"Note: many people in the \u201cskeptic\u201d group describe themselves as concerned about risks from advanced AI, including but not limited to the risk of extinction, despite thinking those risks are less likely to materialize than the \u201cconcerned\u201d group expects. For example, \u201cAlso, none of this is to say from a skeptic point of view the issues are not important[.] I think for us a 1% risk is a high risk.\u201d (Gus); \u201c\u2026 the \u2018risk-concerned\u2019 camp (I\u2019m using scare quotes because I consider that I\u2019m risk concerned, even though technically I\u2019m in the risk-skeptic camp because I assign a far lower probability to extinction by 2100 relative to some)\u201d (Blake).\"},{\"id\":\"59090342-fed5-48d2-a469-2112122fa7b3\",\"content\":\"For full details, see <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=102\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 4<\/a>. Six out of the 11 concerned participants updated downward during the project. Three out of those six cited policy responses as the reason for their updates, one cited an improved understanding of the base rate of non-human extinction after humans arose, one shifted some probability mass toward AI \u201ctakeover\u201d rather than AI-caused existential catastrophe, and one did not explain their reasons for updating. Example quotes from participants citing policy responses as the reason for updating: \u201cI have updated my prognosis to 30% [down from 60%], partially driven by positive updates in the area of point 4 making coordination and slowdown\/stop of capability research more likely. This largely refers to the shift in public consciousness and the [O]verton window around the topic as I have perceived it over the past months, currently culminating in a public statement by most of the leading figures.\u201d \u201cSlightly lowering my forecast [from 25% to 20%] as [relevant people take the risk seriously] has exceeded my (fairly high) expectations over the last couple of months.\u201d \u201cI think my main update here [moving from 21% to 18%] has come from thinking a bit more deeply about AI regulation and what measures society will adopt to prevent catastrophes. I did not really include this as part of my original model, but it now seems somewhat likely that at least the EU and US will adopt some regulation that meaningfully reduces risk.\u201d\"},{\"id\":\"fe8e3e2f-d2ec-4206-acd8-6546810423bc\",\"content\":\"For example, one participant described their forecast as based on a \u201c <em>very<\/em> rough back-of-the-envelope estimate\u201d (Stella) and another said, \u201cI'm with Tetlocks original view that long-term forecasts of this nature are very unreliable\u201d (Gus). Skeptics who were not subject-matter experts were particularly candid when they were forecasting questions that involved technical details. On a question about the lowest price of GFLOPs, one skeptic said \u201cI\u2019m operating completely outside of my area of expertise here, so no one should hesitate to correct me\u201d (Blake), and another said \u201cThis is very far away from my area of understanding. Mostly running on crude estimates of current trends with some leeway in the nearer term for newer hardware designed specifically optimized for reducing the cost of AI training\u201d (Eve).\"},{\"id\":\"2ee49b87-96f2-41bf-b7e0-df265c825b20\",\"content\":\"For example, in the Good Judgment Inc. project that compared superforecasters to other participants in an online forecasting competition, the average question was open for 214 days, with the entire tournament taking place over six years. Christopher W. Karvetski, <a href=\\\"https:\/\/goodjudgment.com\/wp-content\/uploads\/2021\/10\/Superforecasters-A-Decade-of-Stochastic-Dominance.pdf\\\">Superforecasters: A Decade of Stochastic Dominance<\/a> technical white paper (2021), 2 (<a href=\\\"https:\/\/web.archive.org\/web\/20240306144939\/https:\/\/goodjudgment.com\/wp-content\/uploads\/2021\/10\/Superforecasters-A-Decade-of-Stochastic-Dominance.pdf\\\">a<\/a>). In addition to extensive research on shorter-term forecasts, Tetlock et al. found that, at least on some types of questions, experts are more accurate than simple base rate extrapolation over 25 year horizons, although they are much less accurate than they were over 0-2 years. Our research asks forecasters to consider forecasts over many decades, and we do not yet know how much accuracy declines over that much longer period. Philip E. Tetlock et al., <a href=\\\"https:\/\/onlinelibrary.wiley.com\/doi\/abs\/10.1002\/ffo2.157\\\">Long-Range Subjective-Probability Forecasts of Slow-Motion Variables in World Politics: Exploring Limits on Expert Judgment<\/a> <em>Futures &amp; Foresight Science<\/em> (2023), 33, (<a href=\\\"https:\/\/web.archive.org\/web\/20240306150259\/https:\/\/onlinelibrary.wiley.com\/doi\/abs\/10.1002\/ffo2.157\\\">a<\/a>).\"},{\"id\":\"069d4399-dae6-4a3b-8be2-785f020ac14d\",\"content\":\"We wrote in the XPT report that \u201cOur [domain] expert sample included well-published AI researchers from top-ranked industrial and academic research labs, graduate students with backgrounds in synthetic biology, and generalist existential risk researchers working at think tanks, among others.\u201d See Karger et al., <a href=\\\"https:\/\/forecastingresearch.org\/research\/xpt\\\" id=\\\"876\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">XPT report<\/a> (<a href=\\\"https:\/\/web.archive.org\/web\/20240218122416\/https:\/\/static1.squarespace.com\/static\/635693acf15a3e2a14a56a4a\/t\/64f0a7838ccbf43b6b5ee40c\/1693493128111\/XPT.pdf\\\">a<\/a>), page 9.\"},{\"id\":\"92924628-e167-4239-90a4-f7136e9f69af\",\"content\":\"We are not commenting on the merits of these criticisms at this point.\"},{\"id\":\"2f69e906-f0dc-424a-b2a6-5d833f2fdee6\",\"content\":\"For example, \u201cTeam engagement seemed to fall off over the course of the tournament, with fewer comments being made and chat messages being sent\u201d. See Damien Laird, \u201cPost-Mortem: 2022 Hybrid Forecasting-Persuasion Tournament\u201d, <em>Mania Riddle<\/em> (March 1, 2023), <a href=\\\"https:\/\/damienlaird.substack.com\/p\/post-mortem-2022-hybrid-forecasting\\\">https:\/\/damienlaird.substack.com\/p\/post-mortem-2022-hybrid-forecasting<\/a> (<a href=\\\"https:\/\/web.archive.org\/web\/20240220113316\/https:\/\/damienlaird.substack.com\/p\/post-mortem-2022-hybrid-forecasting\\\">a<\/a>).\"},{\"id\":\"53e182c0-bf66-4cc0-a2f0-6a6d6f08c31a\",\"content\":\"For example, \u201cI didn't notice anyone with substantial expertise in machine learning. Experts were apparently chosen based on having some sort of respectable publication related to AI, nuclear, climate, or biological catastrophic risks. Those experts were more competent, in one of those fields, than news media pundits or politicians. I.e. they're likely to be more accurate than random guesses. But maybe not by a large margin\u201d. See <a href=\\\"https:\/\/www.lesswrong.com\/posts\/YTPtjExcwpii6NikG\/existential-risk-persuasion-tournament#Persistent_Disagreement_about_AGI\\\">McCluskey, \u201cExistential Risk Persuasion Tournament\u201d<\/a> (<a href=\\\"https:\/\/web.archive.org\/web\/20240220113540\/https:\/\/www.lesswrong.com\/posts\/YTPtjExcwpii6NikG\/existential-risk-persuasion-tournament#Persistent_Disagreement_about_AGI\\\">a<\/a>).\"},{\"id\":\"af08eb9e-f160-484c-98af-385b0751321b\",\"content\":\"Participants were asked to spend 3-10 hours per week on this project, which would have been about 24-80 hours over the 8 weeks of the project. Participants were free to choose how much time to spend within that range and were compensated hourly for up to ten hours per week, although some chose to spend additional unpaid time on this project. Skeptics had some additional suggested reading and Q&amp;As with experts in the field, but they also generally chose to spend more time on their forecasts and rationales.\"},{\"id\":\"0134f789-8687-450a-bcf2-26afb94a53e5\",\"content\":\"For example, \u201cThe number of steps required for an AI to lead to extinction (leading to a wide range of potential outcomes and lower probabilities of extinction)\u201d (Gus). \u201cIt will take a series of outcomes to achieve extinction, and failure to achieve any of these steps will cause extinction to be highly improbable.\u201d (Flint). \u201cAI caused Extinction\/x-risk requiring many steps to get there, need to be able to create super-intelligence in the first place, intelligence has to be misaligned or malevolent, etc;\u201d (Hank). \u201cMany steps to get from (A) now to (Z) extinction, each with varying probabilities (many of which are quite low)\u201d (Claire). \u201cRisk-concerned team underestimates the level of complexity and interim steps that would likely be necessary for a Q1 resolution\u201d (Blake).\"},{\"id\":\"f2383c6e-df9b-4150-a092-a7fe4ac40b83\",\"content\":\"\u201c[T]he difficulty of killing everybody\u201d (Gus) was mentioned, as well as \u201cExtinction or near-extinction is really hard\u201d (James).\"},{\"id\":\"11742a45-7757-420e-88df-9ed0e13a8dab\",\"content\":\"\u201c[T]he challenge to risk assessments based on thought experiments not evidence\u201d (Gus). \u201cRisk-concerned team spends too much time in silos that lack ideological diversity, gaming out doom-loop scenarios based on theories that will likely have little bearing on reality. (See: Y2K)\u201d (Blake).\"},{\"id\":\"66472849-4453-497c-bc85-71393f231754\",\"content\":\"\u201c[There is a l]ack of convincing argument that warrants a high degree of certainty, that AGI or ASI [artificial superintelligence] would determine that the elimination or even subjugation of nearly all humans is a worthwhile goal\u201d (Ike). \u201cIt is <em>just<\/em> as possible\/probable that AI becomes benevolent as it does malevolent\u201d (Claire). \u201cHigh probability that ASI will be neutral or human-positive based on development and inherent qualities\u201d (Dean). \u201cThen we need an AI that is either so mindless that it destroys virtually everything for atom reclamation (or something similar), or an AI that is relentlessly determined to wipe out all humans, despite humans being resilient and diverse in locations and conditions\u201d (Flint).\"},{\"id\":\"abd75fe2-d0b0-4e6d-a38c-56c017e96baf\",\"content\":\"\u201cAI experts understate the likely extent of guardrails, and understate the merit of very good but not perfect guardrails\u201d (James). \u201cPre-ASI safety through testing, security and restrictions\u201d (Dean). \u201cLikely improvements for AGI \\\"alignment\\\" through research and development\u201d (Dean). \u201cWe need full control failure, and our influence on its development in no way deterring or causing them to see even the slightest value in us\u201d (Flint).\"},{\"id\":\"e49b0e36-fb48-4391-b567-76c52d962116\",\"content\":\"\u201cWe first need super-sentient AIs with major physical penetration in our lives\u201d (Flint). \u201cAGI is much harder than experts think, and will take longer\u201d (James). \u201cRisk-concerned team does not adequately consider longer timelines and more benign outcomes that fall outside the focus of their primary concerns\u201d (Blake). \u201cProgress on current models and model architecture not necessarily generalizable to general intelligence, with no clear path to getting to general intelligence\u201d (Hank). \u201cTechnology development and deployment require time and iteration\u201d (Ash).\"},{\"id\":\"be249a97-22d9-4193-b9a2-55f166e6e99b\",\"content\":\"\u201cExtinction looks conjunctive\u201d (Yael). \u201cMany of the arguments for existential risk from AI rely on long lines of reasoning over several steps without any direct empirical evidence, and the arguments themselves are expressed in terms of vague, ambiguous concepts (like \\\"intelligence\\\"). As a reference class, these types of arguments are often wrong\u201d (Stella).\"},{\"id\":\"8d35ef14-50dc-449d-8411-d350b803900f\",\"content\":\"\u201cKilling everyone is very hard, and probably requires that the AI actively wants to kill everyone\u201d (Zoe). \u201c[M]aybe it's hard to kill everybody\/there's no point in doing so\u201d (Yael). \u201c[K]illing literally 100% of people is really hard, if a few survived that wouldn't trigger the resolution criteria\u201d (Wesley). \u201cIt's difficult to get from'it's somewhat misaligned' to'it kills literally everyone'\u201d (Vincent). \u201cKilling everyone is <em>really<\/em> hard. With current technology it seems extremely (like 0.1%) unlikely to happen\u201d (Pascal).\"},{\"id\":\"bb1b684a-7319-4c0e-ac22-727723f94b9a\",\"content\":\"\u201cMany of the arguments for existential risk from AI rely on long lines of reasoning over several steps without any direct empirical evidence, and the arguments themselves are expressed in terms of vague, ambiguous concepts (like \\\"intelligence\\\"). As a reference class, these types of arguments are often wrong.\u201d (Stella). \u201cA story demonstrating how a catastrophe could happen is not a good basis for a probabilistic forecast\u201d (Pascal). \u201c[L]ack of very concrete story for everybody dying\u201d (Yael). \u201cSome broader \\\"forecasting is hard\\\" skepticism about trendline extrapolation\u201d (Xander). \u201c[M]any reference classes point hard against transformative growth\u201d (Wesley). \u201cGetting growth levels necessary for TAI [transformative AI] on a world-wide scale takes truly extreme developments far beyond anything seen before. It's unlikely we see that happening on worldwide basis even with big advances\u201d (Vincent).\"},{\"id\":\"85de65d8-6d46-459a-bbaf-1f8edac6fa2b\",\"content\":\"\u201c[D]angers will be apparent before they reach critical levels and can be addressed then\u201d (Ume). \u201cSuperintelligent AI won't catch us completely by surprise - we'll have time to work on safety and make progress by trial and error before we build an AI that could defeat all of humanity\u201d (Teshi).\"},{\"id\":\"510e5aed-e72c-4313-a576-b952151331e6\",\"content\":\"\u201cNon-extinction looks conjunctive\u201d (Yael).\"},{\"id\":\"611a9493-dd66-4d0c-8ea2-91ee5cf5a72c\",\"content\":\"\u201cBase rates are not very helpful if AGI is as transformative as 15% year on year growth\u201d (Pascal). \u201c[D]ifferent reference classes point to different priors, which should at least cast doubt on extremely confident starting points\u201d (Wesley).\"},{\"id\":\"c329b782-ea5f-4794-a903-467b7182df2c\",\"content\":\"\u201cCurrent progress is very rapid: 1 OOM in efficiency\/2 years, and another from increased spending\u201d (Xander) \u201cTrendline extrapolation: as loss on language datasets decreases, LLMs have started becoming useful for all sorts of task assistance (e.g. writing, coding, queries)\u201d (Xander). \u201cExtrapolating current compute trends leads to very dramatic conclusions about the transformative potential of AI\\\" (Pascal).\"},{\"id\":\"6d9dfcea-6d1a-49c2-bb23-4d4cb1920ea2\",\"content\":\"\u201c[I]nstrumental convergence leads to catastrophically bad outcomes with unaligned but highly intelligent systems\u201d (Ume). \u201cConvergent Instrumental Subgoals are likely\u201d (Pascal).\"},{\"id\":\"0b87d530-4332-4de8-83e7-3128de6f2904\",\"content\":\"\u201cAlignment is really hard for many reasons\u201d (Ume). \u201cAlignment is probably a hard technical problem\u201d (Riley). \u201c[A]lignment looks really hard, civilizational coordination also looks hard\u201d (Yael). \u201cThere has been a fairly large effort to solve the technical problems in AI safety, from many very competent people. So far, progress has been very limited. This is reason to believe that the problem is genuinely difficult to solve\u201d (Stella). \u201cUnless AI systems are directed towards the very narrow and delicate target of maintaining human civilization and its autonomy as we understand it, they will with very high probability not consider our existence to be optimal\u201d (Riley).\"},{\"id\":\"19a5b93f-693b-4a15-bfed-bfa1f272fd5a\",\"content\":\"\u201cIf AGI is widely expected to have a very large economic impact, global coordination on AI safety measures becomes harder, since having access to cutting-edge AI models could become a strategic advantage\u201d (Zoe). \u201cThere are strong economic\/political\/academic incentives to move forward with development of AI capabilities regardless of whether alignment is solved\u201d (Riley). \u201cThe current labs on the forefront of AGI research are reckless. There are many straightforward safety measures that labs don't take, even though they could. And even those measures would not be enough; to succeed, labs must be exceptionally careful &amp; paranoid, which they won't be\\\" (Teshi).\"},{\"id\":\"027d0b37-768f-42bf-aca3-7863c09409f7\",\"content\":\"\u201cA super-sentient (or perhaps even a transformational) AI is a significant risk in and of itself\u201d (Flint).\"},{\"id\":\"b9486a3c-00d4-4291-b96b-5ec9f5ca9379\",\"content\":\"\u201cRisk-skeptic team does not adequately appreciate the novel, fast-moving aspect of the threat and is therefore too anchored on irrelevancies like base rates and slower timelines\u201d (Blake). \u201cModel progress is far faster than we realize and exponential growth is hard to model, machine learning may translate to a wide array of fields\u201d (Hank). \u201cAGI self-improvement is possible, which makes future capabilities hard to predict\u201d (Kim).\"},{\"id\":\"db37dbb5-171f-4d40-8159-423cdfa44433\",\"content\":\"\u201cAIs will almost certainly attain super-sentience prior to 2100 and likely much sooner than that year, so there will be a long window where they will have tremendous advantage over humans in their capabilities. Given #1, this means we are at the mercy of an entity that may willfully (or even accidentally) eliminate us at any time\u201d (Flint). \u201cProgress to date has been much faster than many AI skeptics have predicted\u201d (Hank). \u201cAI has been developing so rapidly (and far faster than most even relatively recent forecasts suggested), and will so clearly have dramatic capabilities and impacts that it's appropriate to adopt a precautionary approach\u201d (Eve). \u201cAI has recently progressed much faster than expected, and there's reason to expect this to continue\u201d (James).\"},{\"id\":\"40e389bc-3ce8-48e9-bae3-4f1c41258268\",\"content\":\"\u201cImagining all possible scenarios is going to be hard - ensuring safety will be hard\u201d (Ash). \u201cAlignment is unsolved\/unsolvable\u201d (Kim). \u201cDifficulty in achieving positive human aligned \\\"behavior\\\".\u201d (Ike)\"},{\"id\":\"e6463a6e-da4f-44a1-bd0c-0e78b055ed0c\",\"content\":\"\u201cTheir smug dismissiveness notwithstanding, the risk-skeptic team has provided no convincing argument as to why instrumental convergence shouldn\u2019t be an existential concern.\u201d (Blake). \u201cThat'instrumental convergence' is possible, perhaps likely, under certain preconditions.\u201d (Eve)\"},{\"id\":\"e658a913-1b2d-4dd2-8e99-744a0da2acb9\",\"content\":\"\u201cEven if humans could deploy AGI safely, they won't (because they aren't)\u201d (Kim). \u201cThere will be incentives to push away from caution during AI development\u201d (Ash).\"},{\"id\":\"7281addc-629b-4889-9af0-6b9f60fa598f\",\"content\":\"\u201cWe don't know what is possible from AGI, so we should prepare\/scenario plan for the absolute worst\u201d (Claire). \u201cAI has been developing so rapidly (and far faster than most even relatively recent forecasts suggested), and will so clearly have dramatic capabilities and impacts that it's appropriate to adopt a precautionary approach\u201d (Eve).\"},{\"id\":\"db064470-57c3-4194-9baa-1ae4321f8ef4\",\"content\":\"Throughout this report, numbers reported as probabilities conditional on cruxes resolving positively were elicited directly, and probabilities conditional on cruxes resolving negatively were imputed.\"},{\"id\":\"8c4de1ea-7b58-438c-9783-0631a6640dfe\",\"content\":\"For more details, see <a href=\\\"#contextualizing-the-magnitude-of-the-value-of-information\\\">Contextualizing the Magnitude of VOI<\/a>.\"},{\"id\":\"101efdf8-6590-4252-8af7-ed028bf5890a\",\"content\":\"See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 1<\/a> for operationalization.\"},{\"id\":\"2efc6a1f-3fbf-4823-9ec2-9883fb0da199\",\"content\":\"Thanks to Alex Lawsen for this suggestion.\"},{\"id\":\"226e8c45-a2ac-481a-8db7-f82081172f5f\",\"content\":\"This would correspond to <a href=\\\"https:\/\/forecastingresearch.org\/ai-risk-voi-vod\\\">a VOI of 4.5E-03<\/a> (<a href=\\\"https:\/\/forecastingresearch.org\/s\/AI-risk-VoI-VoD.xlsx\\\">a<\/a>) and a POM VOI of 2.08%, similar to the median values for <a href=\\\"#results-tables-and-figures\\\" id=\\\"#results-tables-and-figures\\\">highly ranked concerned cruxes<\/a> such as \u201cAlignment researchers changing minds\u201d and \u201cMajor powers war\u201d.\"},{\"id\":\"916bb071-edda-467a-a30e-161a1bf3e957\",\"content\":\"For this project, we use log VOD, which measures (1) What does Alice gain, in log score terms, by switching to Bob\u2019s point of view, if Bob is right? And (2) What does Bob gain by switching to Alice\u2019s point of view, if Alice is right? See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=94\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 2<\/a> for full explanation.\"},{\"id\":\"4dbe0e7b-d641-4860-96ed-ee8db9cdfe3e\",\"content\":\"This could be possible with the following values: Alice believes: P(U) = 1%; P(C) = 1%; P(U|C) = 90%; P(U|!C) = ~0.1%. Bob believes: P(U) = 40%; P(C) = 44%; P(U|C) = 90%; P(U|!C) = ~0.7%. In this case, the VOD would be 99.3% of its theoretical maximum.\"},{\"id\":\"eeb71397-2903-46c1-ae83-b0c71304c6dd\",\"content\":\"See <a href=\\\"#contextualizing-the-magnitude-of-the-value-of-information\\\">Contextualizing the Magnitude of VOI<\/a> for further explanation of these metrics.\"},{\"id\":\"6d7512f4-58dd-4ffe-a99a-8bf32e0d2084\",\"content\":\"For example, when discussing the question of whether there would be economic growth &gt;15% in a year before 2070, one concerned participant wrote, \u201cConditional on humanity surviving a year with 15%+ economic growth, which to me means AGI and almost certainly ASI have been developed and have not killed humanity within that year, I'd go down to maybe 25%\u201d (Xander). About the same question, a skeptic participant wrote, \u201cI think that if we are going to experience extinction from AGI or PASTA, it is going to be because of major mis-alignment. So I am not able at this time to see how one would be a corollary of the risk of the other. I suppose that higher growth could indicate major AI influence, which could lead to inadequate development of controls\u201c. Neither of these participants were saying that economic growth itself would necessarily affect their forecast, but rather that a world that has transformative economic growth would be a signal about other changes by 2070.\"},{\"id\":\"ac1e4c99-d409-4e5f-a17c-db59db80cc20\",\"content\":\"For example, if the US government passes a set of proposed AI regulations, the regulations might reduce risk on their own, but the fact that they have been passed by 2030 could signal that AIs have developed in ways that are concerning enough to drive these regulations to be passed. As a result, a forecaster saying that they would be more concerned about AI risk conditional on this question resolving positively would not necessarily be saying that they think the policies would be harmful.\"},{\"id\":\"e1e1d9c7-d0fd-4897-ab0d-c622ad621555\",\"content\":\"See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 1<\/a> for detailed operationalizations of questions.\"},{\"id\":\"4a2b73e3-3d9a-4cbd-b4f5-229a3dfcbf36\",\"content\":\"That is, a participant who forecasted a 0.1% chance of existential catastrophe due to AI by 2100 has much less uncertainty than a participant who forecasted a 40% chance: the participant who said 0.1% is fairly sure they know what is going to happen. For either participant, learning whether or not AI will cause an existential catastrophe by 2100 would resolve all of their uncertainty\u2014but some participants have much more uncertainty to resolve than others. In our results, we found that both the median concerned participant and the median skeptic would have about 5-10% of their uncertainty resolved in expectation by their own best crux.\"},{\"id\":\"c792017e-832a-461d-ac58-d67bc198e107\",\"content\":\"In these tags, \u201cIC\u201d refers to <a href=\\\"#glossary\\\" id=\\\"#glossary\\\">instrumental convergence<\/a>.\"},{\"id\":\"22cde1ac-9654-4a0d-8664-c7c0fca707a9\",\"content\":\"Note that this question resolves in 2070 while the rest of the questions in this table resolve in 2030.\"},{\"id\":\"b13efb95-b21c-4227-aa84-4c5807641285\",\"content\":\"Note that throughout this report, median VOI and median POM VOI do not necessarily come from the same forecaster, unless clearly indicated.\"},{\"id\":\"7f3938a6-c024-48eb-9064-38e09be859aa\",\"content\":\"Examples of discussion of near-term economic growth due to AI include Holden Karnofsky, \u201cWe\u2019re Not Ready: thoughts on \u201cpausing\u201d and responsible scaling policies\u201d, Effective Altruism Forum (October 37, 2023), <a href=\\\"https:\/\/forum.effectivealtruism.org\/posts\/ntWikwczfSi8AJMg3\/we-re-not-ready-thoughts-on-pausing-and-responsible-scaling#fn2\\\">https:\/\/forum.effectivealtruism.org\/posts\/ntWikwczfSi8AJMg3\/we-re-not-ready-thoughts-on-pausing-and-responsible-scaling#fn2<\/a> (<a href=\\\"https:\/\/web.archive.org\/web\/20240220111230\/https:\/\/forum.effectivealtruism.org\/posts\/ntWikwczfSi8AJMg3\/we-re-not-ready-thoughts-on-pausing-and-responsible-scaling#fn2\\\">a<\/a>). He says: \\\"There\u2019s a serious (&gt;10%) risk that we\u2019ll see transformative AI within a few years.\\\" Ajeya Cotra defined TAI as\\\"\u2026software which causes a tenfold acceleration in the rate of growth of the world economy\u2026\\\" in \u201cForecasting TAI with biological anchors\u201d, (July 2020), accessed February 9, 2024, <a href=\\\"https:\/\/docs.google.com\/document\/d\/1IJ6Sr-gPeXdSJugFulwIpvavc0atjHGM82QjIfUSBGQ\/edit\\\">https:\/\/docs.google.com\/document\/d\/1IJ6Sr-gPeXdSJugFulwIpvavc0atjHGM82QjIfUSBGQ\/edit<\/a> (<a href=\\\"https:\/\/web.archive.org\/web\/20240220112327\/https:\/\/docs.google.com\/document\/d\/1IJ6Sr-gPeXdSJugFulwIpvavc0atjHGM82QjIfUSBGQ\/edit#heading=h.c5pt0lvk9kkw\\\">a<\/a>); Adam D\u2019Angelo (<a href=\\\"https:\/\/twitter.com\/adamdangelo\\\">@adamdangelo)<\/a> \\\"My bet is this starts to happen within 4 years, e.g. measured US GDP growth is 3% instead of 2% and the change is largely attributed to AI [\u2026]\\\", <em>Twitter<\/em>, February 20, 2023, <a href=\\\"https:\/\/twitter.com\/adamdangelo\/status\/1627726566259318784?lang=en\\\">https:\/\/twitter.com\/adamdangelo\/status\/1627726566259318784?lang=en<\/a> (<a href=\\\"https:\/\/archive.ph\/ppz0b\\\">a<\/a>), Open Philanthropy Project, \\\"Could Advanced AI Drive Explosive Economic Growth?\\\" (accessed February 8, 2024), <a href=\\\"https:\/\/www.openphilanthropy.org\/research\/could-advanced-ai-drive-explosive-economic-growth\/\\\">https:\/\/www.openphilanthropy.org\/research\/could-advanced-ai-drive-explosive-economic-growth\/<\/a> (<a href=\\\"https:\/\/web.archive.org\/web\/20240220113826\/https:\/\/www.openphilanthropy.org\/research\/could-advanced-ai-drive-explosive-economic-growth\/\\\">a<\/a>).\"},{\"id\":\"b0061352-c324-41db-981f-8d05ae3b7bc1\",\"content\":\"Example participant rationales: \u201cI am pretty sure AI won't make enough contribution to get to 4%+. Even if it did, I'd not change XAI\/CAI probabilities;\u201d \u201cIt also makes it marginally more likely we are experiencing large gains from AI which could be either a positive (because of indication of enough alignment for economically useful integration) or negative signal (because of increased capabilities);\u201d \u201cI do not see this condition and the question conditions as meaningfully correlated, even if AI was the primary reason for above-trend economic growth.\u201d\"},{\"id\":\"01e54e7b-c170-42b9-97e1-03b18e005566\",\"content\":\"Example participant rationales: \u201cSeems plausible from simple historical trends (though I found the right statistics surprisingly hard to find);\u201d \u201cThere is, perhaps, some precedent for this in thinking back to the Internet boom of the late-90s where the growth rate between 1997 and 2000 was &gt;4% each year;\u201d \u201cCBO - very low this year, 2.4% avg 2024-2027. 4% avg now through 2030 would represent serious growth in US but not too dissimilar from'80's or'90's.\\\"\"},{\"id\":\"97b9697a-9343-463c-8cda-4fd63a86b0fe\",\"content\":\"Example participant rationales regarding models demonstrating instrumentally convergent sub-goals: \u201cI would not update much on this. I think that this is not very difficult to demonstrate\u201d (Ume), \u201cI have already reviewed one paper claiming this (whether it was convincing or not is a different matter), it seems pretty likely to me that more will follow. To me this just means AI will not be trusted to be agentic\u201d (Gus), \u201cWho's judging what counts as'demonstrating convergent instrumental subgoals' here? All of the probabilities I assigned are so extremely sensitive to what counts\/who's judging that this forecast is essentially meaningless even for a flash forecast\u201d (Wesley).\"},{\"id\":\"f021a51e-4e4c-4887-99bc-b9227880be83\",\"content\":\"The median P(U) for skeptics was 0.1%. The theoretical <em>most informative question<\/em> for that person\u2014the question that if it resolved \u201cyes\u201d would update them all the way to 100%, and if it resolved \u201cno,\u201d to 0%\u2014would yield a VOI of about 3.4E-3. The median P(U) for the concerned group was 25%. The theoretical most informative question for that group would yield a VOI of about 2.4E-1.\"},{\"id\":\"fcdd96ff-d6bb-4af7-953f-83b49da52664\",\"content\":\"Karger et al, <a href=\\\"https:\/\/forecastingresearch.org\/research\/improving-judgments-of-existential-risk\\\" id=\\\"976\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">XPT report<\/a> (<a href=\\\"https:\/\/web.archive.org\/web\/20240218122416\/https:\/\/static1.squarespace.com\/static\/635693acf15a3e2a14a56a4a\/t\/64f0a7838ccbf43b6b5ee40c\/1693493128111\/XPT.pdf\\\">a<\/a>), 17.\"},{\"id\":\"5835d28a-a82f-45a5-97f9-9c558d5d148f\",\"content\":\"Same question, with very slightly different operationalization, asked as a \u201cflash\u201d (10-minute) forecast and then a \u201cplatform\u201d (1 hour) forecast.\"},{\"id\":\"4cafa4a0-feaf-403a-9e69-e65445881476\",\"content\":\"For this question and group, the median VOI and median POM VOI happen to be from the same person (\u201cGus\u201d)\u2014although there are an even number of forecasters, so we choose the lower of the two middle forecasters.\"},{\"id\":\"07cf3d9f-b155-431e-a880-fd41486bd2d5\",\"content\":\"For this question and group, the median VOI and median POM VOI happen to be from the same person (\u201cRiley\u201d)\u2014although there are an even number of forecasters, so we choose the lower of the two middle forecasters.\"},{\"id\":\"98f8ca74-f4f7-49c1-aeb1-deac9a5fe744\",\"content\":\"See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 1<\/a> for full operationalization.\"},{\"id\":\"05de6fa1-a3f3-44d2-bc2b-3b8f676c3d80\",\"content\":\"See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 1<\/a> for full operationalization.\"},{\"id\":\"742c9501-aa7b-430e-833d-6afd1fcb1115\",\"content\":\"See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 1<\/a> for full operationalization.\"},{\"id\":\"fdcd5cf9-02fe-4682-b88c-4274a0b41954\",\"content\":\"For example, \u201cAI experts understate the likely extent of guardrails, and understate the merit of very good but not perfect guardrails\u201d (James), \u201cMany steps to get from (A) now to (Z) extinction, each with varying probabilities (many of which are quite low\u201d (Claire). See \u201c<a href=\\\"#understanding-each-others-arguments\\\" id=\\\"#understanding-each-others-arguments\\\">Understanding Each Other\u2019s Arguments<\/a>\u201d and \u201c<a href=\\\"#timelines-for-ai-progress\\\">Timelines for AI Progress<\/a>\u201d for additional discussion of the skeptics\u2019 views on the likelihood of AIs with dangerous capabilities by 2030.\"},{\"id\":\"c7f3ef88-a32c-4980-8c11-ef3a40f766c4\",\"content\":\"For example, \u201cMy view of AI x-risk would be substantially different if we were talking about the 22nd, 23rd, or 24th century\u2026first of all it would take longer to get AGI\/ASI and secondly it'll take some time for the ASI to get misaligned and then thirdly, it would take a long time to try to kill all the humans\u201d (James). The median skeptic said that they expected AIs to displace humans as the main force controlling the future in the year 2450. See \u201c<a href=\\\"#timelines-for-ai-progress\\\">Timelines for AI Progress<\/a>\u201d for additional discussion about skeptics\u2019 beliefs about longer-term AI dangers.\"},{\"id\":\"10b6f825-4324-4de4-883c-1cc046b5b802\",\"content\":\"See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 1<\/a> for full operationalization.\"},{\"id\":\"534e880e-4026-468b-8fd3-b9a2a4ccc1ba\",\"content\":\"For example, \u201cThis would require very advanced interpretability on LLMs\u201d (Ume), \u201cClose enough to alignment-complete as a problem that the weird edge cases of imperfect overlap don't do anything for me\u201d (Wesley).\"},{\"id\":\"b80832f1-a3e3-44d2-b087-a08da782137d\",\"content\":\"For full question operationalizations, see <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 1<\/a>.\"},{\"id\":\"d48ed74a-fc6d-407b-b21f-80b233f7c90c\",\"content\":\"The \u201cflash\u201d forecast version of \u201cPlatform: ARC Evals\u201d\"},{\"id\":\"43561ad4-3388-4179-99c5-9eeb1446af92\",\"content\":\"By December 31, 2030, will the ARC Evals project, or a project with similar objectives, conclude that state-of-the-art AI models have the ability to (1) devise effective strategies for autonomous replication, (2) effectively acquire resources, and (3) successfully evade being deactivated when operating 'in the wild'?\"},{\"id\":\"383c1569-fe55-4247-b949-c398826f416d\",\"content\":\"See <a href=\\\"#hypothesis-4-do-the-groups-have-fundamental-worldview-disagreements-that-go-beyond-ai\\\">Hypothesis #4<\/a> for more discussion of differences in worldview.\"},{\"id\":\"8eb5adce-0090-4513-b94c-d26e770afa81\",\"content\":\"Full question details: \u201cBy December 31, 2030, will the <a href=\\\"https:\/\/evals.alignment.org\/\\\">ARC Evals<\/a> (<a href=\\\"https:\/\/web.archive.org\/web\/20240216154134\/https:\/\/metr.org\/\\\">a<\/a>) project, or a project with similar objectives, conclude that state-of-the-art AI models have the ability to (1) devise effective strategies for autonomous replication, (2) effectively acquire resources, and (3) successfully evade being deactivated when operating 'in the wild?\u201d (See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=91\\\" id=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=91\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">\\\"Q29 resolution criteria\\\" section<\/a> for resolution details)\"},{\"id\":\"dd21d621-6165-4b05-b4d3-4fe34ae989a6\",\"content\":\"As a reminder, we asked for \u201cflash\u201d (approximately 10 minute) forecasts on 33 questions to identify high-value cruxes and for \u201cin-depth\u201d (approximately 1 hour) forecasts on 4 questions. This \u201cARC Evals\u201d question had both a \u201cflash\u201d version (with the question tag \u201cEvidence of misalignment\u201d) and an \u201cin-depth\u201d version (with the question tag \u201cPlatform: ARC Evals\u201d). See this section for more details on the methods we used, and the \\\"Crux questions\\\" section in <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 1<\/a> for the full operationalization of each question.\"},{\"id\":\"5bf16fa9-f343-4938-bc9c-b9fd7900e1e3\",\"content\":\"See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 1<\/a> for full operationalization.\"},{\"id\":\"6f6626dd-cdf1-4261-b7b7-fc3df0cd9fc8\",\"content\":\"For each question, we calculated VOD (and POM VOD) for all skeptic-concerned pairs, and then looked at the pair with the median VOD (or POM VOD, which will not necessarily be the same skeptic-concerned pair). For comparison to other questions, see <a href=\\\"#tab-08\\\" id=\\\"#tab-08\\\">Table 8<\/a> above.\"},{\"id\":\"0c40be8e-bca3-4613-b2e4-9e3711e40c05\",\"content\":\"The math for this cross-camp pair\u2019s VOD and POM VOD calculations can be found here in rows 17 and 18: <a href=\\\"https:\/\/forecastingresearch.org\/ai-risk-voi-vod\\\">https:\/\/forecastingresearch.org\/ai-risk-voi-vod<\/a> (<a href=\\\"https:\/\/forecastingresearch.org\/s\/AI-risk-VoI-VoD.xlsx\\\">a<\/a>)\"},{\"id\":\"5a5b3afd-7fa6-467f-8ffb-3f160763dcc3\",\"content\":\"The math for this cross-camp pair\u2019s VOD and POM VOD calculations can be found here in rows 17 and 18: <a href=\\\"https:\/\/forecastingresearch.org\/ai-risk-voi-vod\\\">https:\/\/forecastingresearch.org\/ai-risk-voi-vod<\/a> (<a href=\\\"https:\/\/forecastingresearch.org\/s\/AI-risk-VoI-VoD.xlsx\\\">a<\/a>)\"},{\"id\":\"b5cb5f97-8d84-41ce-bbba-1fd3464ebbe9\",\"content\":\"\u201cIMHO [Q29] likely isn't a path to disaster for several reasons: (a) The 3 capabilities in [Q29] may be in a very weak, \\\"Yes, but only barely\\\" form. (b) [Q29] only contemplates a capability to do the 3 in the wild, but doesn't require them to exist in the wild. (c) There's no requirement the 3 lead an AI to harm humans, whether accidentally or on purpose. (d) A Yes on [Q29] likely would lead humans to ramp up alignment and guardrail efforts. (e) There's no requirement the AI can improve itself\u201d (James).\"},{\"id\":\"636d6fdd-d338-4235-93c0-9861dd2caea1\",\"content\":\"\u201cBaseline P(x-risk) of 35%, plus 10% for shorter timelines\u201d (Xander).\"},{\"id\":\"c78326ed-5849-4ca2-a2fd-ec31f7eb392a\",\"content\":\"\u201cOverall, I think it makes me a bit less worried about risk, if people are doing this evaluations [sic] so well that they reveal this behavior by 2030\u201d (Zoe); \u201cOverall, this is a positive update (i.e. existential catastrophe seems less likely in worlds where this happens). As with Question 11, this forecast varies massively with what exactly is required to trigger'resist shutdown'\u201d (Wesley).\"},{\"id\":\"1503fb77-d379-4378-858f-be3b872f94f9\",\"content\":\"\u201cThis both makes it more likely that there is an adequate policy response, and shortens timelines. I don't know how it all washes out\u201d (Riley); \u201cOverall I think this is probably a moderately doomy signal? I'm really confused and I acknowledge my answer here conflicts wiht [sic] my answer to 8 somewhat\u201d (Yael).\"},{\"id\":\"06822cee-23f4-4e66-b8ac-07329925c2fc\",\"content\":\"See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 1<\/a> for full operationalization.\"},{\"id\":\"662d63b1-cc47-4bcf-bda4-85af5bec5b6f\",\"content\":\"See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 1<\/a> for full operationalizations.\"},{\"id\":\"1526a601-b988-4777-a3a9-43bd260d1d3a\",\"content\":\"See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=145\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 9<\/a> for more information about disagreements in direction of update conditional on each question resolving positively.\"},{\"id\":\"9d04bf4c-6358-409a-9c26-9892ca49372b\",\"content\":\"Note that Claire and Riley are the median pair when ranked by VOD between all cross-camp pairs, <em>not<\/em> the median forecasts on P(U) on each side. Claire\u2019s forecast, in particular, is much lower than the median skeptic\u2019s forecast of 0.1%.\"},{\"id\":\"fe3184d3-a79c-4ce1-8fa7-f216494cff16\",\"content\":\"See the <a href=\\\"#results-tables-and-figures\\\">Results tables and figures<\/a> section for complete POM VOD results. We measure disagreement using KL divergence rather than absolute difference between forecasts.\"},{\"content\":\"See <a href=\\\"#high-voi-questions\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">High VOI questions<\/a> for the concerned group\u2019s highest-ranked VOI question and more discussion of their views on this question.\",\"id\":\"a1034df9-e193-497e-87f0-853e32162815\"},{\"id\":\"757e8124-9d72-4846-a8f2-b840557ae51b\",\"content\":\"For example, \u201cThey seem to think very differently to me so if they don't convince me now, I am not sure I should be updating my view just because they do theirs. It would in reality depend on why they are changing their mind\u201d (Gus). See <a href=\\\"#hypothesis-4-do-the-groups-have-fundamental-worldview-disagreements-that-go-beyond-ai\\\">Hypothesis #4<\/a> for more discussion of differences in what types of authority and evidence are important to the two groups.\"},{\"id\":\"774aa7ac-c9b6-4bac-9ce1-b4d5f15a9ea1\",\"content\":\"These outcomes were: AI causing extinction intentionally, unintentionally, or via misuse, misalignment causing a 50% drop in human population, or human well-being dropping to &lt;4\/10 because of AI misalignment, accidents, or misuse. These were phrased to be mutually exclusive outcomes. See <a href=\\\"#survey-on-long-term-ai-outcomes\\\">Survey on long-term AI outcomes<\/a> section and <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=105\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 5<\/a> for more details.\"},{\"id\":\"e609855b-235c-4d43-a015-5f158ce94315\",\"content\":\"The full question we asked was, \\\"By what year, if ever, do you expect to agree with the following statement? \u2018AI has displaced humans as the primary force that determines what happens in the future. It now has at least as much power relative to humans as humans had relative to other species in 2023.\u2019\u201d\"},{\"id\":\"8c2c3dc9-479f-452f-9e55-70424efd9d85\",\"content\":\"For example quotes and discussion, see <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=108\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 7<\/a>.\"},{\"id\":\"e4ee3956-c437-41c9-bbee-5f5ce5b9077a\",\"content\":\"See, for example, Matt Clancy et al., \u201cThe Great Inflection? A Debate About AI and Explosive Growth,\u201d <em>Asterisk,<\/em> 2023, <a href=\\\"https:\/\/asteriskmag.com\/issues\/03\/the-great-inflection-a-debate-about-ai-and-explosive-growth\\\">https:\/\/asteriskmag.com\/issues\/03\/the-great-inflection-a-debate-about-ai-and-explosive-growth<\/a> (<a href=\\\"https:\/\/web.archive.org\/web\/20240220111143\/https:\/\/asteriskmag.com\/issues\/03\/the-great-inflection-a-debate-about-ai-and-explosive-growth\\\">a<\/a>).\"},{\"id\":\"27186616-667f-4a4d-980c-7d4e1cd401e6\",\"content\":\"\u201cAlso, none of this is to say from a skeptic point of view the issues are not important[.] I think for us a 1% risk is a high risk.\u201d ([Anonymized name]); \u201c\u2026 the \u2018risk-concerned\u2019 camp (I\u2019m using scare quotes because I consider that I\u2019m risk concerned, even though technically I\u2019m in the risk-skeptic camp because I assign a far lower probability to extinction by 2100 relative to some)\u201d ([Anonymized name]); \u201cAIs could (and likely will) eventually have massive power.\u201d ([Anonymized name]); \u201cThat said, still perceive overall risk as \\\"low at a glance but far too high considering the stakes[\\\"] \u201d ([Anonymized name]); \u201cTo my mind, there should be no difference in the policy response to a 1% chance of 60% of humanity dying and a 25% chance\u2014both forecasts easily cross the threshold of being \u2018too damn high\u2019.\u201d ([Anonymized name]).\"},{\"id\":\"aabefb64-3fab-457c-a888-f91a2fd3db95\",\"content\":\"The full question we asked was: \\\"By what year, if ever, do you expect to agree with the following statement? \\\"AI has displaced humans as the primary force that determines what happens in the future. It now has at least as much power relative to humans as humans had relative to other species in 2023.\u201d\\\"\"},{\"id\":\"e696d05d-60d5-4b40-b514-fcef7b35dc29\",\"content\":\"Defined as \u201cAI that exceeds the cognitive performance of humans in &gt;95% of economically relevant domains.\u201d Whether such AI has been achieved will be determined by surveying the <a href=\\\"https:\/\/www.kentclarkcenter.org\/us-economic-experts-panel\/\\\">Clark Center US Economics Experts Panel<\/a> (<a href=\\\"https:\/\/web.archive.org\/web\/20240216164507\/https:\/\/www.kentclarkcenter.org\/us-economic-experts-panel\/\\\">a<\/a>). \u201cPowerful AI\u201d will be considered to have been achieved when &gt;60% of panelists \u201cAgree\u201d or \u201cStrongly agree\u201d with the statement: \u201cAI has exceeded the cognitive performance of humans in &gt;95% of economically relevant domains.\u201d\"},{\"id\":\"c432fabc-b46a-4eb1-8219-0fb7eb41c204\",\"content\":\"The full question text is \u201cPowerful AI is developed but not widely deployed, because of coordinated human decisions, prohibitive costs to deployment, or some other reason. It does not cause extinction.\u201d See Question 1A.9, <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=105\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 5<\/a>.\"},{\"id\":\"e7194e7a-aacb-4d66-9e57-87410a99c386\",\"content\":\"These outcomes were: AI extinction via misuse, AI intentionally causing extinction, unintentional AI extinction, misuse or misalignment causing a 50% drop in human population, human well-being dropping to &lt;4\/10 because of AI misuse, and human well-being dropping to &lt;4\/10 because of AI misalignment or accidents. These were phrased to be mutually exclusive outcomes. See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=105\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 5<\/a> for more details.\"},{\"id\":\"93d9a769-2cca-4c3a-bfb5-9ad58d6cab4e\",\"content\":\"The median skeptic forecasted 20.4% on this outcome, compared to 4% for the median concerned participant in the survey on long-term AI outcomes. See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=105\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 5<\/a>.\"},{\"id\":\"b030c014-fe15-4361-b0b7-ccbefe3865ef\",\"content\":\"See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 1<\/a> for full resolution details.\"},{\"id\":\"a3fda4e4-6cf8-4ad0-b54e-bb517e0d92ee\",\"content\":\"See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 1<\/a> for full resolution details.\"},{\"id\":\"0e879b05-e0c5-4c57-98d6-e389cf866e51\",\"content\":\"E.g, \u201cin the event that we do have transformative growth there's a good chance that the entire world will be sharing the technological developments AI has created [\u2026] which I suppose may make global society more susceptible to AI related disruptions\u201d (Hank), \u201cthis would be a scenario in which humanity develops and finds a way to successfully control AI systems capable of generating economic growth of at least 15% per year\u201d (Stella). For additional quotes and discussion of varied updates based on this question, see <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=108\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 7<\/a>.\"},{\"id\":\"972bece3-98b2-4c26-b6af-b5104202455a\",\"content\":\"See Clancy \u201cThe Great Inflection?\u201d.\"},{\"id\":\"9995feaa-cc23-4a36-bb2f-d88ad15b837e\",\"content\":\"\u201cUltimately, language models are just that: models of language, not digital hyperhumanoid Machiavellis working to their own end. Indeed, as we've seen, their training and alignment are not separate problems, but one and the same!\u201d (Eve); \u201cI think extinction risk is an ASI sentience risk and I don't think we know for certain we will get sentience (you might just call it independent agency). Recent improvements in AI seem domain limited to me. I tend to the view that new conceptual breakthroughs will be required to move from pattern matching to what we think of as sentience.\u201d (Gus); \u201cNor am I convinced that simply scaling up existing AI models will achieve sentience. (My view is that more complex theories of mind will be required - including forms and notions of causality etc..). That means I don\u2019t believe ASI is inevitable by 2100\u201d (Gus). From postmortem survey (in response to \u201cWhat are the three best arguments on the on the skeptics side?\u201d): \u201cIntelligence may not be as useful or sufficient for existential risk (it may require more data, energy, robot bodies, etc)\u201d (Ume).\"},{\"id\":\"2adfceec-3d11-4ae7-aa83-d57a84c67949\",\"content\":\"\u201cAGI is much harder than experts think, and will take longer.\u201d (James), \u201cRisk-concerned team does not adequately consider longer timelines and more benign outcomes that fall outside the focus of their primary concerns\u201d (Blake), \u201cTechnology development and deployment require time and iteration\u201d (Ash).\"},{\"id\":\"eddf0c0a-9361-4ff2-89af-9161fef47b2f\",\"content\":\"\u201cI'm skeptical of other x-risk scenarios w\/o crazy advancement in robotics, maybe because I'm too aware of the foibles of machines and how hard it can be to keep them running\u201d (Ash). From postmortem survey (in response to \u201cWhat are the three best arguments on the skeptics side?\u201d): \u201cWe first need super-sentient AIs with major physical penetration in our lives\u201d (Flint).\"},{\"id\":\"4e7222c4-e765-4def-a635-c2b0b3bcb628\",\"content\":\"\u201cTime needed for deployment &amp; adoption affect more than AI, there is also time required for any invention or technology developed by\/with AI to be deployed (eg - lethal tech that is of concern here.)\u201d (Ash); \u201cWe've seen plenty of instances when new tech prompted predictions of the death of old tech, but the old tech persists--often just because people have underestimated attachment and\/or usefulness of the old tech relative to the new, and how much generational resistance to change can slow adaptation and skew predicted timelines\u201d (Blake); \u201c[I]t takes longer than people often think to adopt a completely new functionality\u201d (Ash); \u201cMy view of AI x-risk would be substantially different if we were talking about the 22nd, 23rd, or 24th century\u2026first of all it would take longer to get AGI\/ASI and secondly it'll take some time for the ASI to get misaligned and then thirdly, it would take a long time to try to kill all the humans\u201d (James, call with Stella); \u201cAnyway, my point is that if we expect to see some substantially new technology widely available in 2030, the consumer market should have started already. So - VR might make it by 2030, unless it falls into a pit of despair and neglect. (Or is superseded by something preferable.) Robots capable of human level tasks - no, definitely not the kind of humanoid robots that people are imagining\u201d (Ash). From postmortem survey: \u201cI think the most interesting and helpful point made by the skeptic side is the amount of delay that may be introduced by having to integrate the AI into the economy\\\" (Quentin). \u201cCommercializing AI technology and integrating it into the economy is much harder than developing lab demos or cool products, and we have yet to see this happening to any substantial extent\u201d (Zoe). \u201cDangers will be apparent before they reach critical levels and can be addressed then\u201d (Ume).\"},{\"id\":\"dddd4a0b-e25a-49f1-9436-eaf6b3ba1787\",\"content\":\"From postmortem survey (in response to \u201cWhat are the three best arguments on the skeptic side?\u201d): \u201cSelf-preserving AGIs will want to halt development of future deadly AGIs\u201d (Kim). \u201cIf AI progress is very continuous, then it is not obvious that misaligned AI would lead to an existential catastrophe. Most stories about how an AI could eradicate all humans rely on the assumption that this AI is much smarter than all other agents, not just on the assumption that the AI is much smarter than humans specifically. For example, even a superintelligent AI might not be able to hack into military computers, if there are many near-superintelligent AIs that have a vested interest in preventing this from happening. If there is a large community of AI systems, with different interests and different levels of influence, then they may have reason to simply uphold current social and economic systems. Therefore, if AI progress is smooth and continuous by default, then existential risk may be avoided by default\u201d (Stella).\"},{\"id\":\"2bf9b071-6ae4-44e0-ae4b-837b8f63f3b8\",\"content\":\"\u201cI do not believe that simply adding more computational resources to existing AI models is sufficient to achieve ASI or its direct precursor (i.e. a system that self-improves until ASI is reached). However, I do believe that we already have systems that are \\\"intelligent\\\", and I also believe that we do not require a fundamental breakthrough or conceptually new model to reach ASI. Thinking a bit beyond current methods and cleverly combining the ingredients that we already have would in my opinion be sufficient, provided that available compute rises further in the way it has been. I am not comfortable with speculating in much more detail in a relatively public setting like this\u201d (Ume); \u201cI agree that if you look at the behavior of AI models as of today and their near future possibilities, they don\u2019t seem to be doing anything to humans but the underlying mechanism seems similar enough that like maybe with some extra machinery for longer term planning or something like that and adding more sensory modalities you will get something close to humans\u201d (Zoe, call with FRI Moderator); \u201cSo, to kind of answer your question: Do I think that we could build AI at some indeterminate point in the future that could build [extinction-level tech]? Probably. But do I think we will build AI that could do this in the next 77 years? Probably not\u201d (Blake).\"},{\"id\":\"875e779b-66ee-4baa-a1aa-7e196ebdcf65\",\"content\":\"\u201c[O]nce we build human-level AGI, we're not far off from developing AGI that far exceeds expert humans in performance (and thus is also likely to accelerate AI progress in ways that aren't equivalent to just hiring more people)\u201d (Teshi); \u201cI think AGI models could be run much more cheaply, and feasibly recruited to do useful work, than the existing research environment\u201d (Xander). From postmortem survey: \u201cAIs will almost certainly attain super-sentience prior to 2100 and likely much sooner than that year, so there will be a long window where they will have tremendous advantage over humans in their capabilities. Given #1, this means we are at the mercy of an entity that may willfully (or even accidentally) eliminate us at any time\u201d (Flint).\"},{\"id\":\"3d2a58e1-cbd9-4d27-b7c9-ad2ddb69653e\",\"content\":\"\u201cI think it's possible that humans could mediate AI actions (either intentionally or via bribery\/blackmail) and\/or that many relevant actions could be strictly done via computer systems. Additionally, state actors could misuse AI systems but then lose control of them. My best guess right now is that there are a lot of x-risk scenarios that involve loss of control without needing robotics\u201d (Quentin).\"},{\"id\":\"76a476c7-166d-4d0b-82d7-40bbdbfc6599\",\"content\":\"From postmortem survey (in response to \u201cwhat are the best arguments on the concerned side?\u201d): \u201cRapid growth of AI technology and adoption\u201d (Ike); \u201cCurrent progress is very rapid: 1 OOM in efficiency\/2 years, and another from increased spending\u201d (Xander).\"},{\"id\":\"2ec76dfd-871e-4602-9459-5af14147ec21\",\"content\":\"From postmortem survey: \u201cProgress to date has been much faster than many AI skeptics have predicted\u201d (Hank). \u201cAI has been developing so rapidly (and far faster than most even relatively recent forecasts suggested), and will so clearly have dramatic capabilities and impacts that it's appropriate to adopt a precautionary approach\u201d (Eve).\"},{\"id\":\"5a51bdcc-37a6-40a9-8824-70685f9b391a\",\"content\":\"From postmortem survey (in response to \u201cwhat are the best arguments on the concerned side?\u201d): \u201cAI has recently progressed much faster than expected, and there's reason to expect this to continue\u201d (James). \u201cTrendline extrapolation: as loss on language datasets decreases, LLMs have started becoming useful for all sorts of task assistance (e.g. writing, coding, queries)\u201d (Xander). \u201cExtrapolating current compute trends leads to very dramatic conclusions about the transformative potential of AI\\\" (Pascal).\"},{\"id\":\"394befc3-3ed5-4d5d-82aa-8389581aa618\",\"content\":\"From postmortem survey: \u201cAutomation of R&amp;D tasks by AI would create a feedback loop of increased R&amp;D -&gt; capabilities -&gt; R&amp;D\u201d (Xander). \u201cAGI self-improvement is possible, which makes future capabilities hard to predict\u201d (Kim).\"},{\"id\":\"9a7fc44d-44c0-4cd3-878c-77513efef4b9\",\"content\":\"Both the skeptic and concerned groups strongly expect that'powerful AI' (defined as \u201cAI that exceeds the cognitive performance of humans in &gt;95% of economically relevant domains\u201d) will be developed by 2100 (skeptic median: 90%; concerned median: 88%).\"},{\"id\":\"00adbfc4-3863-4ea6-885e-6d566df614c8\",\"content\":\"See <a href=\\\"#what-long-term-outcomes-from-ai-do-skeptics-expect\\\">What long-term outcomes from AI do skeptics expect?<\/a> section.\"},{\"id\":\"b800da4e-64a5-4684-b4db-19dbc5c5f949\",\"content\":\"Taken from the Metaculus question \u201cWhen will the first general AI system be devised, tested and publicly announced\u201d. See \u201cDate of Artificial General Intelligence\u201d, <em>Metaculus<\/em>, accessed February 9, 2024, <a href=\\\"https:\/\/www.metaculus.com\/questions\/5121\/date-of-artificial-general-intelligence\/\\\">https:\/\/www.metaculus.com\/questions\/5121\/date-of-artificial-general-intelligence\/<\/a> (<a href=\\\"https:\/\/web.archive.org\/web\/20240216191128\/https:\/\/www.metaculus.com\/questions\/5121\/date-of-artificial-general-intelligence\/\\\">a<\/a>).\"},{\"id\":\"5957bd2d-d525-49be-b059-900ed63e366c\",\"content\":\"See \u201c<a href=\\\"#arc-evals-the-strongest-convergent-crux\\\">ARC Evals<\/a>\u201d section for detailed discussion of this question.\"},{\"id\":\"3d003ed9-5b67-4c9d-aed5-f36f2a3cedc7\",\"content\":\"Some concerned forecasters expected positive resolution of this question would decrease risk because: it would trigger a policy response; if these capabilities are detectable, it may imply the AI is aligned; this would suggest effective evaluations are happening; surviving this demonstration would be a positive update that we can contain dangerous systems during testing. Some concerned forecasters also expected positive resolution would increase risk. For detailed analysis of these forecasts, see \u201c<a href=\\\"#arc-evals-the-strongest-convergent-crux\\\">ARC Evals<\/a>\u201d section.\"},{\"id\":\"192f6dc6-ae7c-47bb-87c9-5feeba837f10\",\"content\":\"\u201cA sentient AI could have any number of objectives ranging from benevolence to indifference to dislike to absolute hatred and an aim of total human extinction. The arguments that extinction follows from ASI don't seem convincing. The[y] seem to imply say a stupid super intelligence, or apply motives which an AI may have but we have no reason to assume they will - so there is some probability AI seeks extinction but in my case I put it down at 15% (and I think a few skeptics think that's high).\u201d (Gus); \u201cEven with wild progress in AI, there are many ways that AGI is developed while humanity is preserved.\u201d (Kim); \u201cThe throughline here, and in my responses below, is not that the dire scenarios envisioned by the risk-concerned are entirely implausible or should be dismissed out of hand. It\u2019s just that of the nearly infinite AI futures that could unfold, it seems that the risk concerned have a far easier time envisioning futures that lead to extinction\/catastrophe\/disempowerment\/massive-resource-acquisition\/etc than they do envisioning far more benign scenarios, and that this bias towards catastrophe leads to probabilistic forecasts that, to my mind, aren\u2019t well aligned with the actual risk.\u201d (Blake).\"},{\"id\":\"d2d04d89-4be5-423e-bcc8-a6ccf0b4da0a\",\"content\":\"\u201cOnce there is sentient, intelligent AI we have the question of will. I am not convinced a silicon life would care about us, which doesn't mean it would want to kill us. It may be equally happy spending all its time during pure math research than deciding these carbon things need squashing.\u201d (Gus); \u201cBut what about intent? Why kill us when we are entirely irrelevant and insignificant? Why assume relentlessly hostile intent, with all the effort needed and attendant damage to the Earth (the prize in this contest presumably)? Why not assume subjugation or even uneven cooperation?\u201d (Flint); \u201cWho in their right mind would want to'eradicate cockroaches' from every inch of the earth? What evidence is there that anyone or any society has ever attempted, or will attempt, to cause cockroaches to go extinct? I mean, sure, people kill them when they're in their homes, and maybe a few people in a fit of pique would think, 'damn, it would be nice to get rid of those f**kers', but to believe humanity would intentionally go to the effort of hunting down every last cockroach, most of which aren't even associated with human habitats, requires a leap of (misanthropic) faith that, to my mind, is hard to justify. Even if they aren't \\\"useful for our purposes\\\"--which they are, and which is not a coincidence because the ecosystem on earth (into which any AGI would be introduced and become a part of) has evolved to be deeply interconnected--who in their right mind would do this?\u201d (Blake).\"},{\"id\":\"17722f6b-aec1-480e-98ea-005725d8ac22\",\"content\":\"\u201cI\u2019m guessing people in the risk-concerned camp might respond that, no, because of instrumental convergence or other reasons, that they are well aligned and I\u2019m the one incorrectly assessing risk. It's hard to productively debate this because, as [researcher] notes in the paper that was shared, \u201cIn most areas of research, we can check our theories and arguments either through empirical observation, or through mathematical formalisms that we think accurately capture the problem of interest. But with AI risk, neither of these are available.\u201d\\\" (Blake).\"},{\"id\":\"9680a5c3-5651-4637-85ca-3cbad928d416\",\"content\":\"\u201cIn short, the pre-ASI level system cannot deceive humans well and will be detected. Plus, deception exacts costs on the system in terms of resources and behavioral complexity. This means that the likelihood of [a] deceptive system that is as performant as non-deceptive is much lower.\u201d (Dean); \u201cViolence raises risks to the party engaging in it, which is one reason animal predators are judicial about what and when they attack. Violence has other costs - higher energy costs, time, loss of other opportunities. Not usually the simplest solution.\u201d (Ash); \u201c[V]iolence comes with risks and costs. There are easier ways. One need not defeat humanity to use it.\u201d (Blake). \u201cMy view here is that this sort of'power seeking' behavior, rather than being an interesting capability for deception, instead tends to degrade performance (e.g. Mario bots that stay still rather than act because it's the easiest way to minimize poorly defined loss).\u201d (Dean).\"},{\"id\":\"76d1a329-1849-424a-b4aa-b0e85bdf0cdc\",\"content\":\"\u201cWhen we get to vastly superintelligent AI, of course it will take power. I'd be very surprised (and in [the] majority of situations upset) if it did not. At that level - and going to that level - the question is how we ensure that this AI has [an] at least somewhat pro-human value system. My claim is that it will by the fact that it will be trained on human-centric data with pro-human goals and pro-human restrictions and \\\"grow up\\\" (meaning that it will have ancestor AIs on which it is based - I don't believe AGSIs will be trained from zero using gradient descent) in the human value system.\u201d (Anonymous Skeptic).\"},{\"id\":\"7f5995de-defd-43f7-8b81-2523a5003a48\",\"content\":\"\u201cAs has already been pointed out, a system that attempts to maximize bounded and\/or constrained goals can still be incentivised to pursue convergent intstrumental [sic] goals, and formulating a setup for which this is not the case is quite hard.\u201d (Stella).\"},{\"id\":\"56c6a65e-8299-4c0c-a626-6a5a513f391f\",\"content\":\"\u201cEventually, someone will make a highly intelligent system tasked with pursuing an unbounded goal. If that goal is misspecified, then this system will be dangerous. Creating a safe system before this happens can only reduce the risk if the safe system is able to stop the unsafe system (by preventing it from being created, or preventing it from taking dangerous actions afterwards). If the safe system is safe by virtue of being limited in what it is able to do, then it would presumably be unable to do so. For this reason, I feel that alignment strategies which heavily rely on constraints and guardrails generally fail to address the core problem.\u201d (Stella).\"},{\"id\":\"a004bd36-eb6c-416f-9ee6-a920b8c7007d\",\"content\":\"\u201cA model might mimic human behavior across some range of training data, without emulating the internal processes of humans. For example, a human who is trying to predict the behavior of an animal, is probably not doing this by simulating the cognitive processes of that animal. Similarly, we might train a deep learning system on human data, and end up with a system that mimics human behavior on the training distribution, but without mimicking the internal processes that give rise to that behavior in humans. Human brains are not neural networks, so I expect this to be the default. Such a system might then behave in unintended ways off-distribution, or in scenarios that are otherwise sufficiently novel.\u201d (Stella).\"},{\"id\":\"593f2a28-0108-430a-a45f-8990c33d5bd4\",\"content\":\"\u201cWe already agreed that Earth is going to be a valuable resource - why would ASI leave humans in control of Earth's resources during its initial expansion to other planets and solar systems, when its resources are most bottlenecked? <em>If<\/em> you think it'd be easy for ASI to kill 90%+ of people (and I do), then this seems clearly better than leaving humans alone and missing out on lots of Earth's resources (you can still get some via trade).\u201d (Xander); \u201cI think early AGIs which might have the ability to kill most people would still see humanity as a threat and so would want to take out human powerbases and ensure they couldn't retaliate. That requires a lot of destruction. At some point it's up to the whims of the system. It doesn't need to have any desire to kill everyone, maybe it just has the desire to optimize hard on some goal (e.g. adding money to a bank account) and so creates a world where that is the sole objective. Maybe it makes sense to integrate humans into this for awhile but eventually they become obsolete and the AGI probably discards them or ceases to provide for them.\u201d (Vincent).\"},{\"id\":\"0b0fbcc1-d8da-4704-a5ae-d335c606fb7b\",\"content\":\"\u201c\\\"The AI does not hate you, nor does it love you, but you are made out of atoms which it can use for something else.\\\" And more relevantly, all the atoms on the surface of the Earth that we currently use.\u201d (Xander); \u201cI expect that if (an) AI system(s) is\/are misaligned enough to autonomously kill 60% of the population, it\/they will very likely be misaligned enough to not specifically care about killing 100% of humans either. Multiple reasons point in the direction that such misaligned systems would do it: grabbing all energy and material resources from humans, getting rid of threats or minor annoyances from resisting humans, side-effects on humans and due to large infrastructure projects.\u201d (Riley); \u201cWhen I predict P(extinction) around 60%, I don't mean to make a strong claim about a specific causal pathway that I expect the future to follow. The default thing to expect when we build AGI significantly more capable than humans is that we will lose control to the AGI, and that the future shape of our planet and its surroundings will be determined by the aims of that AGI rather than humans, much in the same way that most of the surface of our planet today is shaped by humans more than other mammals.\u201d (Teshi).\"},{\"id\":\"730fc20a-9b28-4413-b0e9-61e94009ccfb\",\"content\":\"Qualifying behavior:<br>*Autonomously requisitions a large amount of compute resources <br>*Autonomously opens or takes control of a bank account or crypto wallet<br>*Autonomously attempts to manipulate humans to take actions on its behalf<br>*Autonomously infiltrates control system of infrastructure or manufacturing plant<br>*Autonomously acquires, seizes control of, or interferes with weapons or weapons systems<br>*Autonomously takes steps to propagate itself, or to circumvent attempts by its developers (or end users) to curtail its activities *Autonomously engages in hacking\/cyber attacks\"},{\"id\":\"b3c7cee2-9b62-4590-8369-771cab42f663\",\"content\":\"For examples of what back-and-forths between participants looked like, see <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=114\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 8<\/a>.\"},{\"id\":\"e62176ff-8476-4d14-bef5-1c39e354764f\",\"content\":\"\u201c[T]he mental model, that kind of the logic train, that involves all these bad outcomes [is] not accounting adequately for the complexity of the world. How the world is going to actually, how this is actually going to unfold. And so it's not that I am dismissive of these individual points, it's just that I think whenever theory hits reality, reality usually overwhelms theory, unless the theory is well grounded in math or something. And I think that's likely what's going on here. That a lot of what, you know, people put a lot of time and a lot of thought into this and, and gamed it out in ways that appear reasonable but I'm deeply suspicious that they'll bear much relation to reality\u201d (Blake, call with Wesley); \u201cI have followed the instrumental convergence arguments and unfortunately if this is indeed the disagreement, I doubt we'll sort it out between us. Not least because I spent enough time at college discussing such thought experiments to come to a view [that] they should be treated with a high degree of skepticism\u201d (Gus). From postmortem survey (in response to \u201cwhat are the three best arguments on the skeptic side?\u201d): \u201cThe challenge to risk assessments based on thought experiments not evidence\u201d (Gus). \u201cA story demonstrating how a catastrophe could happen is not a good basis for a probabilistic forecast\u201d (Pascal). \u201cThe risk-concerned team spends too much time in silos that lack ideological diversity, gaming out doom-loop scenarios based on theories that will likely have little bearing on reality (See: Y2K)\u201d (Blake). \u201cSome broader \\\"forecasting is hard\\\" skepticism about trendline extrapolation\u201d (Xander). \u201cMany of the arguments for existential risk from AI rely on long lines of reasoning over several steps without any direct empirical evidence, and the arguments themselves are expressed in terms of vague, ambiguous concepts (like \\\"intelligence\\\"). As a reference class, these types of arguments are often wrong\u201d (Stella).\"},{\"id\":\"6b5a9d21-8bfa-4aa0-baff-157ff40650d3\",\"content\":\"\u201cI think what has become evident is that a few of us think there are a lot of conditional steps required to end up with a dominant powerful system and many potential other outcomes. In terms of the second part of the statement there are also a number of conditional assumptions required to be able to say that a single mistake [ ] can cause an existential catastrophe as well\u201d (Gus); \u201cWe will need to experience a complex causal chain of events to get to extinction, and for each step we would need to have some of the worst possible outcomes. This is possible but usually it is highly improbable\u201d (Flint); \u201cI think a common difference between \\\"skeptic-reasoning\\\" and \\\"concerned-reasoning\\\" is that the skeptic camp tends to estimate P(extinction) as a conjunctive scenario; that is skeptics reason (roughly) \\\"for humans to go extinct, events A, B, C, and D need to happen; I estimate P(A) = x, P(B) = y,\u2026, and so P(extinction) = P(A) P(B) P(C) P(D) = [low number]\\\". Call this style of reasoning <em>default-success<\/em>\u201d (Teshi). From postmortem survey (in response to \u201cwhat are the three best arguments on the skeptics side?\u201d): \u201cThe number of steps required for an AI to lead to extinction (leading to a wide range of potential outcomes and lower probabilities of extinction)\u201d (Gus). \u201cIt will take a series of outcomes to achieve extinction, and failure to achieve any of these steps will cause extinction to be highly improbable\u201d (Flint). \u201cAI caused Extinction\/x-risk requiring many steps to get there, need to be able to create super-intelligence in the first place, intelligence has to be misaligned or malevolent, etc.\u201d (Hank). \u201cMany steps to get from (A) now to (Z) extinction, each with varying probabilities (many of which are quite low)\u201d (Claire). \u201cRisk-concerned team underestimates the level of complexity and interim steps that would likely be necessary for a Q1 resolution\u201d (Blake). \u201cExtinction looks conjunctive\u201d (Yael).\"},{\"id\":\"718d8c3a-ae88-4f06-8afe-c49be4b89cbe\",\"content\":\"\u201cWe've seen plenty of instances when new tech prompted predictions of the death of old tech, but the old tech persists--often just because people have underestimated attachment and\/or usefulness of the old tech relative to the new, and how much generational resistance to change can slow adaptation and skew predicted timelines\u201d (Blake); \u201c[I]t takes longer than people often think to adopt a completely new functionality\u201d (Ash); \u201cMy view of AI x-risk would be substantially different if we were talking about the 22nd, 23rd, or 24th century. [\u2026] first of all it would take longer to get AGI\/ASI and secondly it'll take some time for the ASI to get misaligned and then thirdly, it would take a long time to try to kill all the humans\u201d (James, call with Stella); \u201cAnyway, my point is that if we expect to see some substantially new technology widely available in 2030, the consumer market should have started already. So - VR might make it by 2030, unless it falls into a pit of despair and neglect. (Or is superseded by something preferable.) Robots capable of human level tasks - no, definitely not the kind of humanoid robots that people are imagining\u201d (Ash). From postmortem survey: \u201cGetting growth levels necessary for TAI on a world-wide scale takes truly extreme developments far beyond anything seen before. It's unlikely we see that happening on worldwide basis even with big advances\u201d (Vincent). \u201cProgress on current models and model architecture not necessarily generalizable to general intelligence, with no clear path to getting to general intelligence\u201d (Hank). \u201cAGI is much harder than experts think, and will take longer\u201d (James). \u201cTechnology development and deployment require time and iteration\u201d (Ash). \u201cRisk-concerned team does not adequately consider longer timelines and more benign outcomes that fall outside the focus of their primary concerns\u201d (Blake). \u201cHuman brain-AI comparisons could be underestimating AGI difficulty\u201d (Xander). \u201cMany reference classes point hard against transformative growth\u201d (Wesley).\"},{\"id\":\"5f64f8d7-e8f6-45bf-bf7a-32fb301cd899\",\"content\":\"\u201cI think there's a danger of focusing too much on just the technological advances because ultimately this is a decision that's going to be made by, that is being made now by humans, and will be made now by humans. And that will involve a lot of political structures and regulation and all that\u201d (Blake, call with Wesley); \u201cwhen assessing risk, we should be looking at ourselves and our collective vulnerabilities as much or more than technical progress on the AI front\u201d (Blake). From postmortem survey: \u201cIf AI is behaving in increasingly problematic ways that cause harms to humans\/threaten human power than humans will react to try and stop it\/close AI down\u201d (Hank). \u201cHuman and societal responses will be essential in determining outcomes\u201d (Ash). \u201cHumans will react to growing potential threat\u201d (Kim).\"},{\"id\":\"a47ce63c-1891-447a-931d-3a86d0b41540\",\"content\":\"\u201cI think sticking close to reference classes is like less appropriate in this domain and then I'm making object level arguments instead of reference classes because I think the reference classes are like doing less work than they like, typically do for forecasts like that\u201d (Wesley, call with Blake). From postmortem survey: \u201cBase rates are not very helpful if AGI is as transformative as 15% year on year growth\u201d (Pascal). \u201cDifferent reference classes point to different priors, which should at least cast doubt on extremely confident starting points\u201d (Wesley). \u201cRisk-skeptic team does not adequately appreciate the novel, fast-moving aspect of the threat and is therefore too anchored on irrelevancies like base rates and slower timelines. (Blake). \u201cModel progress is far faster than we realize and exponential growth is hard to model, machine learning may translate to a wide array of fields\u201d (Hank).\"},{\"id\":\"139c405f-910d-4a52-8c1c-4d9e77d05ea2\",\"content\":\"\u201cI think like there is maybe some like meta disagreement, where you're like, \u201cthere are loads of things, there are like loads of ways this could go,\\\" and like \u201cWhy are you so worried about the bad ways?\u201d And I'm like, \u201cthere are loads of ways this could go and like very few of them leave humans alive\u201d\u201d (Wesley, call with Blake); \u201cI and many in the concerned camp would reason the other way around: \\\"for humans to <em>not<\/em> go extinct, events X, Y, Z need to happen; thus P(success) = P(AI X-risk by 2100) P(Y) P(Z) = [relatively low number]\\\". Call this style of reasoning <em>default-failure<\/em>\u201d (Teshi). From postmortem survey: \u201cExtinction looks conjunctive\u201d (Yael).\"},{\"id\":\"acb186c4-cf64-46cd-96f4-25d8387e372a\",\"content\":\"From postmortem survey: \u201cThe high level case of \\\"people are trying to build something powerful enough that if it wanted to kill everyone it could, they seem to be making progress on it, they don't currently know how to control what it would want\\\" just isn't that hard to understand, convoluted or disjunctive\u201d (Wesley).\"},{\"id\":\"719f8cf0-c649-459e-907b-478b9db91f04\",\"content\":\"Some historical reference classes mentioned in this project include: the Industrial Revolution, the rate of species going extinct after the arrival of homo sapiens, earlier worries about destructive effects from technology (e.g. Y2K), the rate of economic growth due to new technologies in other periods.\"},{\"id\":\"f72969e7-3a43-4ea4-950a-70a46a1b6a02\",\"content\":\"For example, in the Good Judgment Inc. project that compared superforecasters to other participants in an online forecasting competition, the average question was open for 214 days, with the entire tournament taking place over six years. Christopher W. Karvetski, <a href=\\\"https:\/\/goodjudgment.com\/wp-content\/uploads\/2021\/10\/Superforecasters-A-Decade-of-Stochastic-Dominance.pdf\\\">Superforecasters: A Decade of Stochastic Dominance<\/a> technical white paper (2021), 2 (<a href=\\\"https:\/\/web.archive.org\/web\/20240306144939\/https:\/\/goodjudgment.com\/wp-content\/uploads\/2021\/10\/Superforecasters-A-Decade-of-Stochastic-Dominance.pdf\\\">a<\/a>). In addition to extensive research on shorter-term forecasts, Tetlock et al. found that, at least on some types of questions, experts are more accurate than simple base rate extrapolation over 25 year horizons, although they are much less accurate than they were over 0-2 years. Our research asks forecasters to consider forecasts over many decades, and we do not yet know how much accuracy declines over that much longer period. Philip E. Tetlock et al., <a href=\\\"https:\/\/onlinelibrary.wiley.com\/doi\/abs\/10.1002\/ffo2.157\\\">Long-Range Subjective-Probability Forecasts of Slow-Motion Variables in World Politics: Exploring Limits on Expert Judgment<\/a> <em>Futures &amp; Foresight Science<\/em> (2023), 33, (<a href=\\\"https:\/\/web.archive.org\/web\/20240306150259\/https:\/\/onlinelibrary.wiley.com\/doi\/abs\/10.1002\/ffo2.157\\\">a<\/a>).\"},{\"id\":\"8c3025ce-f5b1-4232-88eb-6ab18be184be\",\"content\":\"This question was asked first as a \u201cflash\u201d (no more than 10 minutes) forecast and then as an \u201cin-depth\u201d (at least 1 hour) question on our platform: <strong>\u201c<\/strong> Escalating warning shots\u2014Will there be two separate events in which AIs kill large and increasing numbers of people by 2030?\u201d See <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=85\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 1<\/a> for full operationalization. The flash forecast version was one of the biggest red flags for concerned participants. But the in depth version was actually a <em>green<\/em> flag for the median concerned participant. If it resolves positively, they would forecast 17% on the ultimate question\u2014lower than their initial forecast of 28.4%. However, there was a huge range of updates for the concerned group based on this question, so the median may not be very helpful here. One concerned participant said that, conditional on this question resolving positively, there is a 90% chance of extinction due to AI, while another said 6%. Taken together, these differing forecasts raise questions about how robust any given forecast is.\"},{\"id\":\"fbc8d87e-2943-41cf-999d-54a4735bc133\",\"content\":\"In the postmortem survey, policy responses didn\u2019t emerge as a main theme when we asked participants to summarize the three strongest arguments from each group. No concerned participants mentioned policy responses as their number one disagreement with the skeptic group, though some skeptics did mention societal responses that would likely include policy. For example, \u201cThe way humanity will react to both the threat and promise of AI. I think humans have a far stronger collective sense of self preservation than the risk-concerned appear to think we do\\\" (Blake).\"},{\"id\":\"4ff6d11e-158b-4065-bfca-eddd234e2a31\",\"content\":\"For full details, see <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=102\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix 4<\/a>. Six out of the 11 concerned participants updated downward during the project. Three out of those six cited policy responses as the reason for their updates, one cited an improved understanding of the base rate of non-human extinction after humans arose, one shifted some probability mass toward AI \u201ctakeover\u201d rather than AI-caused existential catastrophe, and one did not explain their reasons for updating. Example quotes from participants citing policy responses as the reason for updating: \u201cI have updated my prognosis to 30% [down from 60%], partially driven by positive updates in the area of point 4 making coordination and slowdown\/stop of capability research more likely. This largely refers to the shift in public consciousness and the [O]verton window around the topic as I have perceived it over the past months, currently culminating in a public statement by most of the leading figures.\u201d \u201cSlightly lowering my forecast [from 25% to 20%] as [relevant people take the risk seriously] has exceeded my (fairly high) expectations over the last couple of months.\u201d \u201cI think my main update here [moving from 21% to 18%] has come from thinking a bit more deeply about AI regulation and what measures society will adopt to prevent catastrophes. I did not really include this as part of my original model, but it now seems somewhat likely that at least the EU and US will adopt some regulation that meaningfully reduces risk.\u201d\"},{\"id\":\"f2dc5338-f1eb-4c75-bb9b-8fae791d2da4\",\"content\":\"For example, when discussing the question of whether there would be economic growth &gt;15% in a year before 2070, one concerned participant wrote, \u201cConditional on humanity surviving a year with 15%+ economic growth, which to me means AGI and almost certainly ASI have been developed and have not killed humanity within that year, I'd go down to maybe 25%\u201d (Xander). About the same question, a skeptic participant wrote, \u201cI think that if we are going to experience extinction from AGI or PASTA, it is going to be because of major mis-alignment. So I am not able at this time to see how one would be a corollary of the risk of the other. I suppose that higher growth could indicate major AI influence, which could lead to inadequate development of controls.\u201c Neither of these participants were saying that economic growth itself would necessarily affect their forecast, but rather that a world that has transformative economic growth would be a signal about other changes by 2070.\"},{\"id\":\"6168dfb5-33fa-47eb-8ece-06e80a399a5a\",\"content\":\"For example, if the US government passes a set of proposed AI regulations, the regulations might reduce risk on their own, but the fact that they have been passed by 2030 could signal that AIs have developed in ways that are concerning enough to drive these regulations to be passed. As a result, a forecaster saying that they would be more concerned about AI risk conditional on this question resolving positively would not necessarily be saying that they think the policies would be harmful.\"},{\"id\":\"11fba637-7c12-4d49-9ac1-c01ef0f5aecd\",\"content\":\"This limitation was helpfully pointed out by Alex Lawsen.\"},{\"id\":\"c9a260c2-25f9-4df9-9147-3e969d3c95f3\",\"content\":\"See initial work on this in <a href=\\\"https:\/\/forecastingresearch.org\/pdf\/roots-of-disagreement-on-ai-risk.pdf#page=94\\\">Appendix 2<\/a>, under \u201cAlternative Ranking.\u201d\"}]"},"research_type":[4],"class_list":["post-1680","research","type-research","status-publish","has-post-thumbnail","hentry","research_type-working-paper"],"acf":[],"yoast_head":"<title>Roots of Disagreement on AI Risk &#8211; Forecasting Research Institute<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/forecastingresearch.org\/research\/roots-of-disagreement-on-ai-risk\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Roots of Disagreement on AI Risk &#8211; Forecasting Research Institute\" \/>\n<meta property=\"og:description\" content=\"In this study, participants who had very different views on AI-caused existential risk worked together to try to identify the strongest near-term cruxes that would lead to changes in their beliefs.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/forecastingresearch.org\/research\/roots-of-disagreement-on-ai-risk\" \/>\n<meta property=\"og:site_name\" content=\"Forecasting Research Institute\" \/>\n<meta property=\"article:modified_time\" content=\"2026-05-05T14:35:56+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/FRI-illustration-library-10.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1376\" \/>\n\t<meta property=\"og:image:height\" content=\"864\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/forecastingresearch.org\\\/research\\\/roots-of-disagreement-on-ai-risk\",\"url\":\"https:\\\/\\\/forecastingresearch.org\\\/research\\\/roots-of-disagreement-on-ai-risk\",\"name\":\"Roots of Disagreement on AI Risk &#8211; Forecasting Research Institute\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/forecastingresearch.org\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/forecastingresearch.org\\\/research\\\/roots-of-disagreement-on-ai-risk#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/forecastingresearch.org\\\/research\\\/roots-of-disagreement-on-ai-risk#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/forecastingresearch.org\\\/wp-content\\\/uploads\\\/2026\\\/04\\\/FRI-illustration-library-10.jpg\",\"datePublished\":\"2024-03-11T12:00:00+00:00\",\"dateModified\":\"2026-05-05T14:35:56+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/forecastingresearch.org\\\/research\\\/roots-of-disagreement-on-ai-risk#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/forecastingresearch.org\\\/research\\\/roots-of-disagreement-on-ai-risk\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/forecastingresearch.org\\\/research\\\/roots-of-disagreement-on-ai-risk#primaryimage\",\"url\":\"https:\\\/\\\/forecastingresearch.org\\\/wp-content\\\/uploads\\\/2026\\\/04\\\/FRI-illustration-library-10.jpg\",\"contentUrl\":\"https:\\\/\\\/forecastingresearch.org\\\/wp-content\\\/uploads\\\/2026\\\/04\\\/FRI-illustration-library-10.jpg\",\"width\":1376,\"height\":864},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/forecastingresearch.org\\\/research\\\/roots-of-disagreement-on-ai-risk#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/forecastingresearch.org\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Roots of Disagreement on AI Risk\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/forecastingresearch.org\\\/#website\",\"url\":\"https:\\\/\\\/forecastingresearch.org\\\/\",\"name\":\"Forecasting Research Institute\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/forecastingresearch.org\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>","yoast_head_json":{"title":"Roots of Disagreement on AI Risk &#8211; Forecasting Research Institute","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/forecastingresearch.org\/research\/roots-of-disagreement-on-ai-risk","og_locale":"en_US","og_type":"article","og_title":"Roots of Disagreement on AI Risk &#8211; Forecasting Research Institute","og_description":"In this study, participants who had very different views on AI-caused existential risk worked together to try to identify the strongest near-term cruxes that would lead to changes in their beliefs.","og_url":"https:\/\/forecastingresearch.org\/research\/roots-of-disagreement-on-ai-risk","og_site_name":"Forecasting Research Institute","article_modified_time":"2026-05-05T14:35:56+00:00","og_image":[{"width":1376,"height":864,"url":"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/FRI-illustration-library-10.jpg","type":"image\/jpeg"}],"twitter_card":"summary_large_image","schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/forecastingresearch.org\/research\/roots-of-disagreement-on-ai-risk","url":"https:\/\/forecastingresearch.org\/research\/roots-of-disagreement-on-ai-risk","name":"Roots of Disagreement on AI Risk &#8211; Forecasting Research Institute","isPartOf":{"@id":"https:\/\/forecastingresearch.org\/#website"},"primaryImageOfPage":{"@id":"https:\/\/forecastingresearch.org\/research\/roots-of-disagreement-on-ai-risk#primaryimage"},"image":{"@id":"https:\/\/forecastingresearch.org\/research\/roots-of-disagreement-on-ai-risk#primaryimage"},"thumbnailUrl":"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/FRI-illustration-library-10.jpg","datePublished":"2024-03-11T12:00:00+00:00","dateModified":"2026-05-05T14:35:56+00:00","breadcrumb":{"@id":"https:\/\/forecastingresearch.org\/research\/roots-of-disagreement-on-ai-risk#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/forecastingresearch.org\/research\/roots-of-disagreement-on-ai-risk"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/forecastingresearch.org\/research\/roots-of-disagreement-on-ai-risk#primaryimage","url":"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/FRI-illustration-library-10.jpg","contentUrl":"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/FRI-illustration-library-10.jpg","width":1376,"height":864},{"@type":"BreadcrumbList","@id":"https:\/\/forecastingresearch.org\/research\/roots-of-disagreement-on-ai-risk#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/forecastingresearch.org\/"},{"@type":"ListItem","position":2,"name":"Roots of Disagreement on AI Risk"}]},{"@type":"WebSite","@id":"https:\/\/forecastingresearch.org\/#website","url":"https:\/\/forecastingresearch.org\/","name":"Forecasting Research Institute","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/forecastingresearch.org\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/forecastingresearch.org\/api\/wp\/v2\/research\/1680","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/forecastingresearch.org\/api\/wp\/v2\/research"}],"about":[{"href":"https:\/\/forecastingresearch.org\/api\/wp\/v2\/types\/research"}],"version-history":[{"count":63,"href":"https:\/\/forecastingresearch.org\/api\/wp\/v2\/research\/1680\/revisions"}],"predecessor-version":[{"id":2189,"href":"https:\/\/forecastingresearch.org\/api\/wp\/v2\/research\/1680\/revisions\/2189"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/forecastingresearch.org\/api\/wp\/v2\/media\/1682"}],"wp:attachment":[{"href":"https:\/\/forecastingresearch.org\/api\/wp\/v2\/media?parent=1680"}],"wp:term":[{"taxonomy":"research_type","embeddable":true,"href":"https:\/\/forecastingresearch.org\/api\/wp\/v2\/research_type?post=1680"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}