{"id":1355,"date":"2025-11-10T12:00:00","date_gmt":"2025-11-10T12:00:00","guid":{"rendered":"https:\/\/forecastingresearch.org\/?post_type=research&#038;p=1355"},"modified":"2026-05-05T11:23:12","modified_gmt":"2026-05-05T11:23:12","slug":"longitudinal-expert-ai-panel-leap-working-paper","status":"publish","type":"research","link":"https:\/\/forecastingresearch.org\/research\/longitudinal-expert-ai-panel-leap-working-paper","title":{"rendered":"The Longitudinal Expert AI Panel: Understanding Expert Views on AI Capabilities, Adoption, and Impact"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\" id=\"abstract\">Abstract<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Public debates about AI revolve around bold claims and counterclaims,\nbut rarely culminate in precise, falsifiable forecasts of AI\ncapabilities, adoption, and impact. The Longitudinal Expert AI Panel\n(LEAP) improves this signal-to-noise ratio by gathering such forecasts,\nmonthly, from a carefully chosen panel of 339 experts spanning industry,\nacademia, and policy. The median expert foresees that by 2030 AI will be\nresponsible for 7% of U.S. electricity usage, assist in 18% of work\nhours in the U.S., and provide daily companionship for 15% of\nadults\u2014roughly 7x, 4x, and 2.5x current levels, respectively. The median\nexpert also gives a 60% chance that AI systems solve or substantially\nassist in solving a Millennium Prize Problem by 2040, which would be a\nmajor achievement in mathematics. There is substantial within-individual\nuncertainty and between-individual disagreement among experts, each\naccounting for roughly half of the total variation in expert forecasts\nacross all questions. Nevertheless, the vast majority of LEAP forecasts\nfall far short of the warnings from AI lab leaders about imminent\nartificial superintelligence. We analyze 1.7 million words of\nparticipant rationales to provide a complementary qualitative overview\nof the key mechanisms underpinning fast and slow forecasts of AI\nprogress.<\/p>\n\n\n\n<div class=\"wp-block-buttons is-horizontal is-content-justification-space-between is-layout-flex wp-container-core-buttons-is-layout-1c7eb588 wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\"><a class=\"btn orange\" href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf\" target=\"_blank\" rel=\"noreferrer noopener\">View the full PDF report <svg width=\"7\" height=\"9\" viewBox=\"0 0 7 9\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\">\n  <path d=\"M0.000156283 8.60806L4.22416 4.33606V4.24006L0.000156283 6.10352e-05H1.80816L6.06416 4.28806L1.80816 8.60806H0.000156283Z\" fill=\"#102B23\"\/>\n<\/svg>\n<svg width=\"8\" height=\"10\" viewBox=\"0 0 8 10\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\">\n  <path d=\"M0.601719 8.85794L4.82572 4.58594V4.48994L0.601719 0.249939H2.40972L6.66572 4.53794L2.40972 8.85794H0.601719Z\" fill=\"#102B23\"\/>\n<\/svg><\/a><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-group is-vertical is-layout-flex wp-container-core-group-is-layout-4fc3f8e1 wp-block-group-is-layout-flex\">\n<details class=\"wp-block-details is-layout-flow wp-block-details-is-layout-flow\"><summary>Acknowledgments<\/summary>\n<p class=\"wp-block-paragraph\">We thank Victoria Schmidt, Morgane Bascle, and Jonah Black for\nresearch assistance.<\/p>\n<\/details>\n\n\n\n<details class=\"wp-block-details is-layout-flow wp-block-details-is-layout-flow\"><summary>Disclaimers<\/summary>\n<p class=\"wp-block-paragraph\">The views expressed in this paper do not necessarily represent the\nviews of the Federal Reserve Bank of Chicago or the Federal Reserve\nSystem. This research was funded with support from Open Philanthropy and\nCraig Falls.<\/p>\n<\/details>\n<\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"executive-summary\">Executive Summary<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Despite the clashing narratives around AI, there is little work systematically mapping the full spectrum of views among experts (computer scientists, economists, technologists) and the general public. What do these groups believe about AI&#8217;s future capabilities, adoption, and effects? And why do they believe what they do? The Longitudinal Expert AI Panel (LEAP) fills this gap with monthly surveys tracking the quantified beliefs of experts, historically accurate forecasters (\u201csuperforecasters\u201d),<sup data-fn=\"349340c1-373d-4eb1-85d3-28f508f33d90\" class=\"fn\"><a href=\"#349340c1-373d-4eb1-85d3-28f508f33d90\" id=\"349340c1-373d-4eb1-85d3-28f508f33d90-link\">1<\/a><\/sup> and the general public.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Policymakers and stakeholders routinely consult experts to ground\ntheir decision-making in coherent, informed beliefs, especially when\nfaced with new technologies and high levels of uncertainty. LEAP\nfacilitates this process by cutting through the anecdote and speculation\nthat currently dominates AI discourse. We specifically target prominent\nexperts whom policymakers would be most inclined to consult on the\nprogression of AI capabilities and their impact. Expert participants\ninclude top-cited AI and ML scientists, prominent economists, key\ntechnical staff at frontier AI companies, and influential policy experts\nfrom a broad range of NGOs. Our goal is to provide a clear summary of\nexpert views, analyzed within and across these key groups.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Since we launched LEAP in June 2025, we have completed three survey waves focused on (1) high-level predictions about AI progress; (2) the application of AI to scientific discovery; and (3) the adoption and social impact of AI. Experts provided thoughtful engagement, spending a median of 44 minutes per survey and writing over 460,000 words of rationales explaining their beliefs, across all surveys and questions.<sup data-fn=\"d15cee71-1108-43b3-b329-2dec7e6e33f9\" class=\"fn\"><a href=\"#d15cee71-1108-43b3-b329-2dec7e6e33f9\" id=\"d15cee71-1108-43b3-b329-2dec7e6e33f9-link\">2<\/a><\/sup> In this paper, we share results from these first three waves along with a detailed methodological description of the project as a whole.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Across the first three waves of LEAP, five patterns stand out:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>Experts expect sizable near\u2011term societal effects from AI by\n2040.<\/li>\n\n\n\n<li>Substantial disagreement and uncertainty underlie expert\nforecasts.<\/li>\n\n\n\n<li>The median expert expects much slower progress than prominent\nleaders of frontier AI labs.<\/li>\n\n\n\n<li>Experts anticipate faster progress than the public on most\noutcomes.<\/li>\n\n\n\n<li>Experts and superforecasters mostly agree. Where they disagree,\nexperts tend to expect more AI progress. Also, there are no systematic\ndifferences between the predictions of different types of experts:\ncomputer scientists, economists, industry professionals, and policy\nprofessionals.<\/li>\n<\/ol>\n\n\n\n<p class=\"wp-block-paragraph\">First, <strong>experts expect sizable near\u2011term societal effects<\/strong>: by 2030, the median expert predicts that 18%<sup data-fn=\"d99b5bf5-3708-4392-8e7b-df7289f0c6aa\" class=\"fn\"><a href=\"#d99b5bf5-3708-4392-8e7b-df7289f0c6aa\" id=\"d99b5bf5-3708-4392-8e7b-df7289f0c6aa-link\">3<\/a><\/sup> of all U.S. work hours will be assisted by generative AI, up from 4.1% in November 2024 (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Bick et al. 2025<\/a>);<sup data-fn=\"66522d7b-d668-4484-884c-38aa6b6f8528\" class=\"fn\"><a href=\"#66522d7b-d668-4484-884c-38aa6b6f8528\" id=\"66522d7b-d668-4484-884c-38aa6b6f8528-link\">4<\/a><\/sup> AI training and deployment will consume 7% of U.S. electricity;<sup data-fn=\"92219e5a-e381-4fb7-a009-03912e8393c4\" class=\"fn\"><a href=\"#92219e5a-e381-4fb7-a009-03912e8393c4\" id=\"92219e5a-e381-4fb7-a009-03912e8393c4-link\">5<\/a><\/sup><sup>,<\/sup><sup data-fn=\"fa6b7dcf-4e3b-48d1-9e52-b21ae96cd313\" class=\"fn\"><a href=\"#fa6b7dcf-4e3b-48d1-9e52-b21ae96cd313\" id=\"fa6b7dcf-4e3b-48d1-9e52-b21ae96cd313-link\">6<\/a><\/sup> autonomous vehicles will provide 20% of U.S. ride-hailing trips, and annual global private investment (as reported by Our World in Data)<sup data-fn=\"a7e95b27-899d-4afa-bdac-b90b259a26e9\" class=\"fn\"><a href=\"#a7e95b27-899d-4afa-bdac-b90b259a26e9\" id=\"a7e95b27-899d-4afa-bdac-b90b259a26e9-link\">7<\/a><\/sup> will reach $260 billion, up from the $130 billion reported total for this series in 2024. The median expert predicts that by 2030, 15% of adults will self-report using AI for companionship, emotional support, social interaction, or simulated relationships at least once daily, up from 6% today. By 2040, that number doubles to 30% of adults. Experts expect substantial improvements in the ability of AI systems to complete difficult math questions: the median expert believes that AI systems will achieve performance of 75% on the FrontierMath benchmark<sup data-fn=\"4657eacc-2029-4fa9-8213-fa8ffe075025\" class=\"fn\"><a href=\"#4657eacc-2029-4fa9-8213-fa8ffe075025\" id=\"4657eacc-2029-4fa9-8213-fa8ffe075025-link\">8<\/a><\/sup> by 2030; and 23% of experts expect saturation of the benchmark.<sup data-fn=\"ee90ef6f-442e-457d-b06d-42db793b5c9e\" class=\"fn\"><a href=\"#ee90ef6f-442e-457d-b06d-42db793b5c9e\" id=\"ee90ef6f-442e-457d-b06d-42db793b5c9e-link\">9<\/a><\/sup> The median expert also believes it is more likely than not (a 60% chance) that AI solves or substantially assists in solving a Millennium Prize Problem by 2040.<sup data-fn=\"8dae5ba0-5fad-4f9b-90c4-526705f192b7\" class=\"fn\"><a href=\"#8dae5ba0-5fad-4f9b-90c4-526705f192b7\" id=\"8dae5ba0-5fad-4f9b-90c4-526705f192b7-link\">10<\/a><\/sup> We summarize these expert forecasts in the figure below.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-02.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure:<\/strong> Median expert forecasts for various questions. We display the 10<sup>th<\/sup>, 25<sup>th<\/sup>, 50<sup>th<\/sup>, 75<sup>th<\/sup>, and 90<sup>th<\/sup> percentiles of the median forecasts given by experts at each date. For example, if 25% of experts give a median forecast of $10 billion or less, the 25<sup>th<\/sup> percentile series in the graph will lie at $10 billion; these series are <em>not<\/em> confidence intervals. Where available, we include a historical baseline in light blue.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Even experts with lower forecasts of AI progress, capabilities, and adoption still expect substantial impacts relative to current levels. Recall that, by 2030, the median expert predicts that 18% of all U.S. work hours will be assisted by generative AI, up from 4.1% in November 2024 <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">(Bick et al. 2025<\/a>);<sup data-fn=\"b6814db1-908d-4c41-a402-60e57447bb67\" class=\"fn\"><a href=\"#b6814db1-908d-4c41-a402-60e57447bb67\" id=\"b6814db1-908d-4c41-a402-60e57447bb67-link\">11<\/a><\/sup> however, respondents were shown a historical baseline value of 2%, based on an earlier version of the cited paper. We also asked forecasters for their 25<sup>th<\/sup> percentile forecast,<sup data-fn=\"63d51702-88b7-44e8-946b-9c7393b547e7\" class=\"fn\"><a href=\"#63d51702-88b7-44e8-946b-9c7393b547e7\" id=\"63d51702-88b7-44e8-946b-9c7393b547e7-link\">12<\/a><\/sup> and panelists on this question gave a forecast of 9%, still more than a fourfold increase from the historical baseline level provided to forecasters at the time of the survey. In other words, the median expert gives a 75% chance that at least 9% of work hours will be assisted by generative AI in 2030.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Second, <strong>substantial disagreement and uncertainty underlie expert forecasts<\/strong>. The top quartile of experts estimate that the majority of revenue from newly approved U.S. drugs in 2040 will be from AI-discovered drugs, but the bottom quartile of experts predicts that less than 10% of new drug sales will be from AI-discovered drugs.<sup data-fn=\"dbfb006f-8326-4aa7-9c86-c321c5a2b32e\" class=\"fn\"><a href=\"#dbfb006f-8326-4aa7-9c86-c321c5a2b32e\" id=\"dbfb006f-8326-4aa7-9c86-c321c5a2b32e-link\">13<\/a><\/sup> On the question of whether AI will independently solve or substantially assist in solving a Millennium Prize Problem by 2040, a quarter of experts think it is quite likely (&gt;81% chance), while another quarter of experts believe it is unlikely (&lt;30% chance). On many questions, individual experts also express substantial uncertainty in their own forecasts. We construct a composite measure of the total variation in expert beliefs in the figure below (for two questions), taking into account both the within-forecaster uncertainty and the between-forecaster disagreement. We discuss the methodology in greater detail in <a href=\"#uncertainty-and-disagreement\"><u>Uncertainty and Disagreement<\/u><\/a>, where we describe the underlying assumptions that we rely on to construct these figures. These \u201cpooled distributions\u201d represent the full distribution of outcomes predicted by the average expert. We find that, across all forecasting questions where we allow forecasters to express their uncertainty, within-forecaster uncertainty explains 49% of the total variation in forecasts, while 51% of variation is explained by between-forecaster disagreement.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-04.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure:<\/strong> Pooled distributions for expert forecasts on <em>Work Hours Assisted by Generative AI<\/em> (top panels) and <em>FrontierMath<\/em> <em>scores<\/em> (bottom panels). These pooled distributions combine within-expert uncertainty and between-expert disagreement. Densities are normalized to the same peak for comparability. See <a href=\"#uncertainty-and-disagreement\">Uncertainty and Disagreement<\/a> for details.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Third, <strong>the median expert expects much slower progress than prominent leaders of frontier AI labs<\/strong>. These lab leaders predict human-level or superhuman AI by 2026\u20132029, while most of our expert panel rejects these shorter timelines. We ask respondents to forecast whether the average LEAP panelist will say, in 2030, that we are closest to a \u201cslow-,\u201d \u201cmoderate-,\u201d or \u201crapid-\u201d AI progress scenario. We define these scenarios in detail in the appendix (see <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=105\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix E.I. 4. General AI Progress<\/a>). The average expert thinks that 23% of LEAP panelists in 2030 will say the world most closely mirrors a \u201crapid\u201d AI progress scenario that, of our three scenarios, most closely matches some of the lab leaders\u2019 claims. On the other hand, the average expert believes that 28% of panelists will indicate that progress plateaued at close-to-current levels, with fewer improvements in capabilities relative to today (a \u201cslow\u201d AI progress scenario).<sup data-fn=\"83eadd21-22ab-44de-83c1-39f0f652accd\" class=\"fn\"><a href=\"#83eadd21-22ab-44de-83c1-39f0f652accd\" id=\"83eadd21-22ab-44de-83c1-39f0f652accd-link\">14<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Fourth, <strong>experts anticipate faster progress than the public<\/strong><sup data-fn=\"e264401d-38b7-41d7-8e5a-6e77ec5c63f2\" class=\"fn\"><a href=\"#e264401d-38b7-41d7-8e5a-6e77ec5c63f2\" id=\"e264401d-38b7-41d7-8e5a-6e77ec5c63f2-link\">15<\/a><\/sup> on most outcomes (e.g., the accuracy AI systems will achieve on the FrontierMath benchmark by 2030: median expert 75% vs. median member of the public 50%, <em>p &lt; 0.001<\/em>; generative AI work assistance by 2030: median expert 18% vs. median member of the public 10%, <em>p &lt; 0.001<\/em>). Experts predict a 32% chance that AI will be at least as impactful as a \u201ctechnology of the millennium\u201d\u2014like the printing press or the Industrial Revolution\u2014whereas the public gives this only a 22% chance. Across forecasts that exhibit a clear relationship with AI capabilities and impacts, a randomly selected expert is 16% more likely than a randomly selected member of the public to predict faster progress than would be expected by random chance.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We summarize some of the differences in aggregate forecasts in the\nfigure below.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-05.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure:<\/strong> Differences between the expert and public median 50<sup>th<\/sup> percentile forecasts for several questions where the unit is a percentage. Points indicate the median of each group\u2019s 50<sup>th<\/sup> percentile forecasts. We apply transformations to create valenced forecasts, where values closer to the left indicate slower progress and values to the right indicate faster progress.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Fifth, <strong>highly accurate forecasters (superforecasters) and\nexperts are largely aligned, with superforecasters expecting slightly\nless progress overall<\/strong>. Differences across expert\nsubgroups\u2014those specializing in computer science or economics, those\nworking in industry, and those working in the policy and think tank\nspace\u2014are small and rarely statistically significant. We summarize some\nof the differences in aggregate forecasts in the two figures below. The\nfirst compares experts and superforecasters, while the second groups by\ncategory of expertise.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-07.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure:<\/strong> Differences between the expert and superforecaster median 50<sup>th<\/sup> percentile forecasts for several questions where the unit is a percentage. Points indicate the median of each group\u2019s 50<sup>th<\/sup> percentile forecasts. We apply transformations to create valenced forecasts, where values closer to the left indicate slower progress and values to the right indicate faster progress.<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-09.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure:<\/strong> Expert category median 50<sup>th<\/sup> percentile forecasts for several questions where the unit is a percentage. Points indicate the median of 50<sup>th<\/sup> percentile forecasts for each category. We apply transformations to create valenced forecasts, where values closer to the left indicate slower progress and values to the right indicate faster progress.<\/figcaption><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"introduction\">Introduction<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">As artificial intelligence (AI) reshapes culture,<sup data-fn=\"9fa4990e-410c-4d97-a3de-a3b1e47a6f5d\" class=\"fn\"><a href=\"#9fa4990e-410c-4d97-a3de-a3b1e47a6f5d\" id=\"9fa4990e-410c-4d97-a3de-a3b1e47a6f5d-link\">16<\/a><\/sup> science,<sup data-fn=\"57335b56-eb62-469f-a244-c2dbbcd32a50\" class=\"fn\"><a href=\"#57335b56-eb62-469f-a244-c2dbbcd32a50\" id=\"57335b56-eb62-469f-a244-c2dbbcd32a50-link\">17<\/a><\/sup> labor markets,<sup data-fn=\"1f53487e-20fd-4bed-9901-426a33ef65ec\" class=\"fn\"><a href=\"#1f53487e-20fd-4bed-9901-426a33ef65ec\" id=\"1f53487e-20fd-4bed-9901-426a33ef65ec-link\">18<\/a><\/sup> and the aggregate economy,<sup data-fn=\"ae2ddc53-1ace-4f33-8208-e055f402bd19\" class=\"fn\"><a href=\"#ae2ddc53-1ace-4f33-8208-e055f402bd19\" id=\"ae2ddc53-1ace-4f33-8208-e055f402bd19-link\">19<\/a><\/sup> experts debate its value, risks, and how quickly it will integrate into everyday life. Leaders of AI companies forecast transformative AI systems that cure all diseases,<sup data-fn=\"17c90b83-c2e9-4427-8fc0-369dd9597752\" class=\"fn\"><a href=\"#17c90b83-c2e9-4427-8fc0-369dd9597752\" id=\"17c90b83-c2e9-4427-8fc0-369dd9597752-link\">20<\/a><\/sup> replace whole classes of jobs,<sup data-fn=\"d2c807a0-8b97-43c6-a1f7-3c7c1e3fdd3f\" class=\"fn\"><a href=\"#d2c807a0-8b97-43c6-a1f7-3c7c1e3fdd3f\" id=\"d2c807a0-8b97-43c6-a1f7-3c7c1e3fdd3f-link\">21<\/a><\/sup> and supercharge GDP growth by the 2030s.<sup data-fn=\"865aa83d-5511-4739-98e3-e025ab768709\" class=\"fn\"><a href=\"#865aa83d-5511-4739-98e3-e025ab768709\" id=\"865aa83d-5511-4739-98e3-e025ab768709-link\">22<\/a><\/sup> Skeptics see small gains at best, with AI\u2019s impact amounting to little more than a modest boost in productivity\u2014if anything at all (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Acemoglu 2024<\/a>).<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Despite these clashing narratives, there is little work systematically mapping the full spectrum of views among computer scientists, economists, technologists in the private sector, and the public. What do these groups believe about AI&#8217;s future capabilities, adoption, and effects? Why do they believe what they do, and what mechanisms support those beliefs? Prior surveys offer opinions, but rarely comprehensively quantify those opinions, hampering policy guidance and evaluation. We fill this gap with the Longitudinal Expert AI Panel (LEAP), a monthly survey tracking the probabilistic forecasts of experts, historically accurate forecasters (\u201csuperforecasters\u201d),<sup data-fn=\"579fdc13-94d7-4f0e-9228-b858d3ceee3b\" class=\"fn\"><a href=\"#579fdc13-94d7-4f0e-9228-b858d3ceee3b\" id=\"579fdc13-94d7-4f0e-9228-b858d3ceee3b-link\">23<\/a><\/sup> and the public.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Policymakers and stakeholders routinely consult experts to ground their decision-making in coherent, informed beliefs, especially when faced with new technologies and high levels of uncertainty. LEAP is designed to facilitate this process by cutting through the anecdote and speculation that currently dominates AI discourse. We specifically target prominent experts whom policymakers would be most inclined to consult regarding the progression of AI capabilities and its technological impact. LEAP expert invitees (and participants) include top-cited AI and ML scientists, prominent economists, key technical staff at frontier AI companies, and influential policy experts from a broad range of NGOs. Our goal is to provide a clear summary of expert views, analyzed both across and within these key groups. To address concerns that our respondents might be a biased or selective sample, we reweight our data to account for different response propensities within our expert sampling frame, providing a representative summary of our expert frame\u2019s beliefs.<sup data-fn=\"9fc23d2a-f118-4f4e-8692-d1172ee6fddc\" class=\"fn\"><a href=\"#9fc23d2a-f118-4f4e-8692-d1172ee6fddc\" id=\"9fc23d2a-f118-4f4e-8692-d1172ee6fddc-link\">24<\/a><\/sup> The <a href=\"#panel-construction\">Panel Construction<\/a> section contains more details. This reweighting leaves our headline conclusions unchanged (see <a href=\"#sensitivity-of-results-to-reweighting\">Sensitivity of Results to Reweighting<\/a>).<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We evaluate LEAP respondents\u2019 forecasts against detailed, pre-specified resolution criteria\u2014avoiding the problem of ambiguous resolution criteria that introduce noise in forecasts by permitting both different interpretations of a question by forecasters, and disagreement about how a question is later resolved. In contrast, clear resolution criteria put the debate in common terms and enable policymakers to understand the range of opinion. LEAP\u2019s resolution criteria are also used to measure accuracy\u2014and tie respondents\u2019 compensation to this accuracy\u2014to encourage participants to provide high-quality forecasts. Lastly, we can use this accuracy measure to identify the most accurate forecasters and explore how their views compare to other participants.<sup data-fn=\"889cae6c-7ce7-4e15-8357-1ccdecf3e7a0\" class=\"fn\"><a href=\"#889cae6c-7ce7-4e15-8357-1ccdecf3e7a0\" id=\"889cae6c-7ce7-4e15-8357-1ccdecf3e7a0-link\">25<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">LEAP captures not just forecasts, but participants\u2019 rationales: 1.7\nmillion words of detailed explanations across the first three survey\nwaves of detailed explanations. Over 600,000 of these words come from\nour expert and superforecaster samples, with the remainder coming from\npublic respondents. We use this data to identify key sources of\ndisagreement and to analyze why participants express significant\nuncertainty about the future effects of AI.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Since we launched LEAP in June 2025, we have completed three survey\nwaves focused on (1) high-level predictions about AI progress; (2) the\napplication of AI to scientific discovery; and (3) widespread adoption\nand societal impact. In this paper, we share results from these first\nthree waves along with a detailed methodological description of the\nproject as a whole.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In what follows, we first present aggregate forecasts from experts. Aggregation exploits the \u201cwisdom of the crowd\u201d phenomenon, in which the accuracy of aggregated predictions exceeds the accuracy of a large majority of their constituent parts. This practice is supported by work in numerous fields, such as prediction markets (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Bassamboo, Cui, and Moreno 2015<\/a>), political forecasting (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Sj\u00f6berg 2009<\/a>, <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Murr 2011<\/a>), and more (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Hueffer et al. 2023<\/a>, <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Adjodah et al. 2021<\/a>). Aggregated predictions of many individuals tend to be remarkably accurate (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Da and Huang 2020<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Lichtendahl Jr, Grushka-Cockayne, and Pfeifer 2013<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Surowiecki 2004<\/a>).<sup data-fn=\"93825f9e-0f1b-4259-92b6-3e3c903b5ea5\" class=\"fn\"><a href=\"#93825f9e-0f1b-4259-92b6-3e3c903b5ea5\" id=\"93825f9e-0f1b-4259-92b6-3e3c903b5ea5-link\">26<\/a><\/sup> While aggregate forecasts are helpful, it is important to understand the extent of disagreement between experts and where individuals have more and less uncertainty in their forecasts. We discuss our approach to answering these questions below in the <a href=\"#uncertainty-and-disagreement\"><u>Uncertainty and Disagreement<\/u><\/a> section<strong>.<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We will continue to field new LEAP surveys every month for at least\nthree years. We will revisit questions from previous waves beginning in\nyear two, in order to assess how expert views are changing, both in\naggregate and within individuals. First, when we report results,\nparticipants can compare their own forecasts to other participants. As\nforecasts resolve, participants will be able to see how the accuracy of\ntheir past forecasts compares to other participants and adjust their\nfuture forecasts accordingly. We will also explore the relationship\nbetween short-term accuracy and long-run forecasts about AI\ncapabilities, effects, and diffusion.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In this report, we discuss LEAP in the context of prior AI forecasting surveys (<a href=\"#connection-to-prior-work\"><u>Connection to Prior Work<\/u><\/a>), detail the procedures we use to build our panel (<a href=\"#panel-construction\"><u>Panel Construction<\/u><\/a>), describe the surveys and forecasting questions (<a href=\"#monthly-surveys-and-forecasting-questions\">Monthly Surveys and Forecasting Questions<\/a>), share preliminary results (<a href=\"#results\"><u>Results<\/u><\/a>), outline plans for future work (<a href=\"#next-steps\"><u>Next Steps<\/u><\/a>), and conclude (<a href=\"#conclusion\"><u>Conclusion<\/u><\/a>).<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"connection-to-prior-work\">Connection to Prior Work<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">While prior work on AI progress has taken a variety of forms, we\nfocus here on studies that, like LEAP, use surveys of the AI expert\ncommunity and the general public to measure the range of opinions and\nexpectations on the progress of AI, its adoption, and the associated\nsocietal impacts. This type of work faces five challenges. First, it is\ndifficult to specify a comprehensive sampling frame, and nonresponse\nbias is challenging to avoid. Both of these hurdles can distort\naggregate estimates. Second, views on AI could change over time. Third,\nambiguous resolution criteria introduce noise in forecasts and limit our\nability to identify the most accurate forecasters, whom policymakers may\nwant to rely on in the future. Fourth, while we might care about\nlong-term outcomes, it is challenging to assess the credibility of\nlong-term forecasts on short-run decision timelines. Fifth, participants\noften devote limited bandwidth to surveys, and, even when thoughtfully\nengaged, might not report their true beliefs on forecasting questions,\ninstead using the survey to express their preferences. While no survey\ncan fully overcome these limitations, we designed LEAP to thoughtfully\nengage with and mitigate each of these five challenges.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"challenge-1.-narrow-frames-and-selective-response-can-distort-estimates-of-ai-progress\">Challenge\n1. Narrow frames and selective response can distort estimates of AI\nprogress<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">A survey can fail to represent aggregate views in two ways. First, the sampling frame might fail to capture the population of interest.<sup data-fn=\"064d21f7-b3d1-4317-bef2-4e8847e58d17\" class=\"fn\"><a href=\"#064d21f7-b3d1-4317-bef2-4e8847e58d17\" id=\"064d21f7-b3d1-4317-bef2-4e8847e58d17-link\">27<\/a><\/sup> Second, survey respondents might systematically differ from nonrespondents. Both problems affect AI forecasting surveys.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Most surveys of AI experts that predate LEAP target participants in top AI\/ML or computer science conference proceedings (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Muller and Bostrom 2014<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Walsh 2017<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Grace et al. 2018<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Zhang et al. 2021<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Stein-Perlman 2022<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Zhang et al. 2022<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Grace et al. 2024<\/a>).<sup data-fn=\"61f5b523-52cd-481a-a891-9ef64a3e70b1\" class=\"fn\"><a href=\"#61f5b523-52cd-481a-a891-9ef64a3e70b1\" id=\"61f5b523-52cd-481a-a891-9ef64a3e70b1-link\">28<\/a><\/sup> LEAP seeks to broaden the range of opinions as well as the expertise of participants and thus defines <em>four<\/em> target populations of experts: computer scientists researching topics in AI, economists studying the economic impacts of AI, AI industry professionals, and AI policy professionals. LEAP is the first survey to comprehensively measure the beliefs of each of these groups on AI-related questions so that we can compare each group\u2019s forecasts and accuracy in the future. The <a href=\"#sampling\"><u>Sampling<\/u><\/a> section details our sampling procedures.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">LEAP additionally includes members of the general public<sup data-fn=\"6346c640-6888-47ea-ba4b-d11ef7f533f7\" class=\"fn\"><a href=\"#6346c640-6888-47ea-ba4b-d11ef7f533f7\" id=\"6346c640-6888-47ea-ba4b-d11ef7f533f7-link\">29<\/a><\/sup> and forecasters with a demonstrated track record of uniquely high accuracy on forecasting tasks.<sup data-fn=\"1f9b4d46-19b4-462a-b266-41e84b18c234\" class=\"fn\"><a href=\"#1f9b4d46-19b4-462a-b266-41e84b18c234\" id=\"1f9b4d46-19b4-462a-b266-41e84b18c234-link\">30<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Publicly available demographic data on AI experts is often collected and used to stratify results in these studies, but have not historically been used to reweight results to match a target expert population (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Grace et al. 2018<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Zhang et al. 2022<\/a>). Many studies find associations between collected demographics and outlook,<sup data-fn=\"ae7ac65a-0fe7-47f1-b51b-2b2c644162e8\" class=\"fn\"><a href=\"#ae7ac65a-0fe7-47f1-b51b-2b2c644162e8\" id=\"ae7ac65a-0fe7-47f1-b51b-2b2c644162e8-link\">31<\/a><\/sup> suggesting that results may be sensitive to nonresponse bias. LEAP directly addresses issues of nonresponse bias by using common survey weighting methods, discussed in greater detail in <a href=\"#reweighting\">Reweighting<\/a>.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"challenge-2.-views-on-ai-can-change-over-time\">Challenge 2.\nViews on AI can change over time<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">One-time surveys reflect respondent views at a given time, so they cannot track the evolution of individual or group opinions over time. As a panel survey that will continue for many years,<sup data-fn=\"6d01365e-a7d7-4422-b04e-aeafbefcdbd7\" class=\"fn\"><a href=\"#6d01365e-a7d7-4422-b04e-aeafbefcdbd7\" id=\"6d01365e-a7d7-4422-b04e-aeafbefcdbd7-link\">32<\/a><\/sup> LEAP enables this type of tracking over time.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"challenge-3.-ambiguous-forecasting-questions-complicate-ex-post-evaluation-and-comparison-between-forecasters\">Challenge\n3. Ambiguous forecasting questions complicate <em>ex post<\/em>\nevaluation and comparison between forecasters<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">When forecasting questions lack detailed resolution criteria,\nassessing accuracy and comparing forecasts is fraught. If a question is\nambiguous, forecasters may have different interpretations of how the\nquestion will be resolved, so those with disparate views can each claim\nto have superior accuracy. In contrast, forecasting questions with\nprespecified resolution criteria permit evaluation of forecasting\naccuracy after events resolve, which proves crucial for the subsequent\ntwo challenges.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Early surveys of AI experts focused on measuring the range of opinions on AI (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Muller and Bostrom 2014<\/a>) or the relationship between those opinions and other forecasts (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Walsh 2017<\/a>) rather than defining resolvable forecasting questions. Research that followed set a goal of generating useful, more resolvable, forecasts. Grace et al. (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2018<\/a>) ask experts to forecast on AI progress milestones. Stein-Perlman and Grace (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2022<\/a>) and Grace et al. (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2024<\/a>) use slightly modified versions of the earlier questions. Zhang et al. (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2022<\/a>) modify many of the AI milestone forecasts in Grace et al. (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2018<\/a>). LEAP builds on these recent efforts and attempts to create clear resolution criteria for all questions, establishing a longitudinal panel of beliefs from the same set of experts to understand how opinions change over time and to better quantify uncertainty and disagreement across a wider set of experts. O\u2019Donovan et al. (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2025<\/a>) and McClain et al. (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2025<\/a>) present forward-looking questions but do not ask respondents for forecasts; Pew asks respondents their opinions on the impact of AI over the next twenty years but does not generate measurable outcomes (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">McClain et al. 2025<\/a>). LEAP forecasting questions use precise and specific language and are designed with measurable outcomes; survey respondents are provided with the resolution criteria at the time of the forecast, a feature that enables us to address the remaining two challenges. <a href=\"#monthly-surveys-and-forecasting-questions\"><u>Monthly Surveys and Forecasting Questions<\/u><\/a> discusses this feature in greater detail.<sup data-fn=\"7f3f6025-dedc-4c9c-82a9-1eda5a37187f\" class=\"fn\"><a href=\"#7f3f6025-dedc-4c9c-82a9-1eda5a37187f\" id=\"7f3f6025-dedc-4c9c-82a9-1eda5a37187f-link\">33<\/a><\/sup><\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"challenge-4.-long-term-forecasts-are-difficult-to-evaluate-in-the-near-term\">Challenge\n4. Long-term forecasts are difficult to evaluate in the near-term<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">When forecasters disagree about the long-run, how can policymakers\nevaluate those forecasts on short-run timelines? One potential path\nforward is to evaluate the short-run accuracy of forecasters, relying on\nthe most accurate forecasters over the short-run to better understand\nlong-run outcomes. Resolving this challenge first requires clear\nresolution criteria, discussed in the preceding challenge. However, it\nalso requires forecasts on both short- and long-run questions.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">When using fixed-year format questions, earlier work used 10-, 20-, and 50-year time frames for predictions (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Muller and Bostrom 2014<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Grace et al. 2018<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Zhang et al. 2019<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Zhang et al. 2022<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Stein-Perlman and Grace 2022<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Grace et al. 2024<\/a>). LEAP includes questions with a wide array of resolution dates, including both near- (as early as the end of 2025) and long-term resolution, allowing us to explore whether short-term accuracy correlates with longer-run beliefs.<sup data-fn=\"84f8937b-1ffb-4a3f-94a4-ba5d8b3bba48\" class=\"fn\"><a href=\"#84f8937b-1ffb-4a3f-94a4-ba5d8b3bba48\" id=\"84f8937b-1ffb-4a3f-94a4-ba5d8b3bba48-link\">34<\/a><\/sup><\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"challenge-5.-forecasting-is-time-intensive-and-cognitively-taxing-and-truthfulness-is-rarely-incentivized\">Challenge\n5. Forecasting is time-intensive and cognitively taxing, and\ntruthfulness is rarely incentivized<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Expert and public participants have limited time to complete surveys, and long surveys risk dissuading potential participants from completion. Three features of LEAP counteract this challenge. First, we provide historical baselines and relevant background information for each question (see <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=98\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix E.I. Survey Questions: Wave 1<\/a> for representative examples). Second, the survey instrument contains interactive interfaces\u2014which integrate historical baseline data where available\u2014to facilitate the forecasting process (see <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=89\" type=\"link\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=89\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix B.V. Survey instrument<\/a>). Third, much like other surveys, LEAP compensates forecasters for their participation. In the first three waves, the median expert respondent spent 44 minutes on each survey, while the median member of the public and superforecaster spent 29 and 90 minutes, respectively. In contrast, the American Trends Panel (ATP) from Pew Research, a popular public opinion survey, targets ten to fifteen minutes per survey (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Pew Research Center 2024<\/a>).<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Thoughtful engagement need not translate into forecasters reporting their true beliefs. If forecasters\u2019 rewards are not positively related to the quality of their forecasts, they lack the incentive to provide accurate forecasts. Much past work relies on traditional participation payments that are independent of forecasting accuracy (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Grace et al. 2018<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Grace et al. 2024<\/a>). Surveys of the general public (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Zhang 2019<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">McClain et al. 2025<\/a>) tend to provide similar recruitment incentives. In contrast, LEAP\u2019s detailed resolution criteria on timely forecasting questions allow us to incentivize participants to provide accurate forecasts by tying rewards to proper scoring rules (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Brier 1950<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Jose and Winkler 2009<\/a>; see <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=87\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=87\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix B.III. Scoring<\/a>). Past work demonstrates that proper scoring rules yield more accurate forecasts than unincentivized forecasts (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Karger et al. 2021<\/a>).<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"panel-construction\">Panel Construction<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">We discuss in this section our procedures for sampling and\nreweighting, as well as summarize the characteristics of our panel.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"sampling\">Sampling<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">We target prominent experts whom policymakers, business and nonprofit\nleaders, and other stakeholders would be most inclined to consult\nregarding the progression of AI capabilities and its technological\nimpact. Our complete panel consists of experts, forecasters with a\ndemonstrated track record of high, differential accuracy on forecasting\ntasks (\u201csuperforecasters\u201d), and members of the general public.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"expert-sampling\">Expert Sampling<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">We target four expert communities. First, we include computer scientists researching topics in AI by including top-cited authors, stratified by age, and the authors of the top-rated papers at leading AI and ML conferences. Second, we identify leading economists, both across fields and within the subfield of economics focused on the economic effects of AI and new technology. We include top-cited authors of papers on AI and technology, members of the U.S. Economic Experts Panel (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Clark Center 2025<\/a>), and attendees of economics conferences on AI. Third, we include industry professionals, identified via their contributions to frontier models or employment at AI-related companies with extensive fundraising. Fourth, we identify institutions leading the discussion on AI development, policy, and impacts and invite research staff.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We sample from two other sources and sort them into one of the four communities above. First, we invited the honorees from TIME\u2019s 100 Most Influential People in AI in 2023 and 2024 (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Barker Bonomo and Javed 2024<\/a>). Second, we allowed invited respondents to recommend other qualified candidates for the survey, yielding 172 additional invitees. In order to filter this group, we require that an individual:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>meets the requirements of another sampling category;<\/li>\n\n\n\n<li>has over 1,000 academic citations; or<\/li>\n\n\n\n<li>has over 300 academic citations if a PhD student or postgraduate\nresearcher.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">These requirements excluded only 7 of the recommended candidates that\nultimately enrolled. After exclusion, the referred group has 11.6 years\nof experience on average and 75% have a postgraduate degree. Like other\nexpert sample expansions, referred contacts are not included in frame\ntargets, but their responses do receive positive weights through the\nreweighting process.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Within each community, we build our initial frame by identifying potential respondents from a number of sources, described in greater detail in <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=71\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix A. Panel Construction and Sampling<\/a>. We largely create non-probability samples composed of all respondents who meet criteria for inclusion in the frame, but we randomly sample from some sources that yielded a large frame (e.g., industry and policy professionals).<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">To reach sufficient respondent counts, we expand beyond our initial frame in a number of categories below\u2014this results in our \u201cfull frame,\u201d as shown below. To correct for the consequent change in sample composition, we use our initial frame to define reweighting targets (see <a href=\"#reweighting\">Reweighting<\/a> for further details on reweighting and <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=71\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix A. Panel Construction and Sampling<\/a> for a detailed specification of our initial frame). While individuals identified in these late expansions do not contribute to our targets for weighting, they do receive positive weights in our results.<\/p>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><strong>Expert Type<\/strong><\/td><td><strong>Initial Frame Count<\/strong><\/td><td><strong>Full Frame Count<\/strong><\/td><td><strong>Participant Count<\/strong><\/td><\/tr><tr><td>Computer Science<\/td><td>454<\/td><td>719<\/td><td>61<\/td><\/tr><tr><td>Economics<\/td><td>391<\/td><td>773<\/td><td>66<\/td><\/tr><tr><td>Industry<\/td><td>561<\/td><td>1,640<\/td><td>57<\/td><\/tr><tr><td>Policy<\/td><td>367<\/td><td>881<\/td><td>96<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 1:<\/strong> Counts by expert category in our initial frame (target population), full frame (the initial frame plus expanded sampling), and participant pool (individuals who have completed at least 1 survey).<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">When an individual enrolls in LEAP, we collect information on their\n<em>primary<\/em> affiliation, which we use to reassign their category.\nFor example, a respondent might have been sampled through a well-cited\ncomputer science publication, but they currently work at a leading AI\nlab. In such cases, we use their current affiliation, as self-reported\nin the enrollment survey, as their assigned category. However, we use\nthe initial, rather than updated, categorization to define our weighting\ntargets, as we intend to measure the characteristics of the broader\npopulation reflected by this sampling group, rather than the identity of\nparticular individuals. Additionally, we only have this updated\naffiliation information for enrollees and not experts in the broader\nsampling frame who do not enroll. This section will accordingly use\nthese initial classifications. All sample statistics outside of this\nsubsection instead use this final classification, rather than a\nclassification based on how we sampled an individual.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"superforecaster-sampling\">Superforecaster Sampling<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">We include a sample of superforecasters, sourced through Good Judgment Inc., a company that maintains and adds to a list of highly accurate forecasters, many of whom were among the top 2% of forecasters by accuracy in IARPA\u2019s ACE tournament (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Good Judgment Inc. n.d.<\/a>). These superforecasters have a demonstrated track record of providing the most accurate forecasts across a wide array of topics.<sup data-fn=\"6401e4e8-3ac9-49bd-9b42-4cf3fa0a82de\" class=\"fn\"><a href=\"#6401e4e8-3ac9-49bd-9b42-4cf3fa0a82de\" id=\"6401e4e8-3ac9-49bd-9b42-4cf3fa0a82de-link\">35<\/a><\/sup> We invited 67 superforecasters through this search.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"public-sampling\">Public Sampling<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">We include in our sample highly-engaged participants<sup data-fn=\"e994d151-c382-4662-965f-c28352b5d1d5\" class=\"fn\"><a href=\"#e994d151-c382-4662-965f-c28352b5d1d5\" id=\"e994d151-c382-4662-965f-c28352b5d1d5-link\">36<\/a><\/sup> from past FRI research projects on the CloudResearch Connect platform. We initially invited approximately 2,600 individuals. To ensure representativeness across underrepresented populations in our initial sample, we also reach out to additional respondents who are either (1) over the age of 50 and identify as Republican; or (2) have a high school degree (or equivalent) as their highest level of completed education.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"reweighting\">Reweighting<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Individuals with certain viewpoints might be disproportionately likely to respond to our survey conditional on receiving an invite, skewing our results towards these viewpoints associated with a high propensity to respond to the survey. To address concerns about nonresponse bias, we use a standard approach in the public polling field, raking, to adjust aggregate statistics to be representative of the sampling frames.<sup data-fn=\"0409d0c2-0b62-47b0-a0c0-2506997a6fdc\" class=\"fn\"><a href=\"#0409d0c2-0b62-47b0-a0c0-2506997a6fdc\" id=\"0409d0c2-0b62-47b0-a0c0-2506997a6fdc-link\">37<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">These adjustments do not substantially change any of our results. The section <a href=\"#sensitivity-of-results-to-reweighting\">Sensitivity of Results to Reweighting<\/a> shows the impact of weighting on our aggregate results. We default to reporting weighted summary statistics for any results in this paper, but any tests or statistics related to differences in distributions (Mann\u2013Whitney U tests and Cliff\u2019s <em>\u03b4<\/em>) are currently unweighted.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">For the expert sample, we use our <em>initial<\/em> invite pool to generate benchmarks for reweighting. <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=71\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix A. Panel Construction and Sampling<\/a> provides more information on the composition and selection of this initial invite pool. We reweight on years of relevant experience, age, affiliation with Effective Altruism,<sup data-fn=\"0f8c5e8c-45d5-4684-9430-7bd8115fa6ee\" class=\"fn\"><a href=\"#0f8c5e8c-45d5-4684-9430-7bd8115fa6ee\" id=\"0f8c5e8c-45d5-4684-9430-7bd8115fa6ee-link\">38<\/a><\/sup> gender, continent, education, and affiliation with top AI labs.<sup data-fn=\"67c8e553-de80-40a5-8016-37d549db3725\" class=\"fn\"><a href=\"#67c8e553-de80-40a5-8016-37d549db3725\" id=\"67c8e553-de80-40a5-8016-37d549db3725-link\">39<\/a><\/sup> We also equally weight participants based on their category of expertise.<sup data-fn=\"b7a4e61c-0e08-472b-8df1-2381b8e75f7b\" class=\"fn\"><a href=\"#b7a4e61c-0e08-472b-8df1-2381b8e75f7b\" id=\"b7a4e61c-0e08-472b-8df1-2381b8e75f7b-link\">40<\/a><\/sup> Respondents from expanded sampling and referred contacts do not contribute to our frame targets for weighting, but they do receive positive weights in our results. The target populations for each reweighting category are displayed in <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=83\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix A.VIII. Reweighting<\/a>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We do not reweight our superforecaster sample.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">For the public sample, we reweight on age, gender, race\/ethnicity, household income, educational attainment, and political party identification. We target U.S. population demographics.<sup data-fn=\"8c49426e-4ad8-40a9-85a3-f09d3efd32ca\" class=\"fn\"><a href=\"#8c49426e-4ad8-40a9-85a3-f09d3efd32ca\" id=\"8c49426e-4ad8-40a9-85a3-f09d3efd32ca-link\">41<\/a><\/sup> After our initial public invites, we conducted two targeted recruitment waves in cells with low response counts in our initial sample: first, we recruited individuals who were over age 50 and identified as Republicans. Second, we recruited individuals with a high school degree (or equivalent) as their highest level of completed education.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We assessed our sample for differences in experience, prominence\n(measured by citations), ideology (measured by affiliation with\nEffective Altruism), and top-lab affiliation. First, there are similar\nlevels of experience in the sampling frame and actual respondents; 50%\nof respondents have more than 10 years of experience in their field,\ncontrasted with 56% of invitees. Second, there is no clear difference in\nacademic or research prominence for the invitees versus respondents; for\nexample, top-cited computer scientist respondents averaged 120,000\ncitations, similar to the 148,000 in our invited pool. Third, however,\nthere is a clear difference in ideological affiliation; we found that\n28% of respondents had ties to Effective Altruism, in contrast with 14%\nof invitees. But, when we downweighted this group of respondents to\nmatch the invitee proportion during reweighting, we saw no meaningful\nchange in aggregate results. Lastly, there is also a difference in\nleading lab membership. 18% of invitees were employed at one of the\ntop-20 AI labs (as defined above), but only 8% of our respondents belong\nto this group.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Table 2 presents a comparison of several key demographic and\nprofessional features between the target sampling frame and the Wave 1\nrespondent population before and after weighting. The table illustrates\nhow our reweighting process adjusted the respondent sample to more\nclosely reflect the target population\u2019s composition.<\/p>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><strong>Characteristic<\/strong><\/td><td><strong>Target<\/strong><\/td><td><strong>Responded\n(unweighted)<\/strong><\/td><td><strong>Responded\n(weighted)<\/strong><\/td><\/tr><tr><td>Average age<\/td><td>40 years<\/td><td>37 years<\/td><td>40 years<\/td><\/tr><tr><td>Affiliated with Effective Altruism<\/td><td>14%<\/td><td>28%<\/td><td>12%<\/td><\/tr><tr><td>Men<\/td><td>77%<\/td><td>78%<\/td><td>78%<\/td><\/tr><tr><td>Lives in North America<\/td><td>60%<\/td><td>67%<\/td><td>62%<\/td><\/tr><tr><td>Average years of experience<\/td><td>14 years<\/td><td>12 years<\/td><td>13 years<\/td><\/tr><tr><td>Has a postgraduate degree<\/td><td>75%<\/td><td>77%<\/td><td>79%<\/td><\/tr><tr><td>Affiliated with \u201cIndustry\u201d category<\/td><td>25%<sup data-fn=\"3e1cd8cc-472b-4a6f-b1ab-ab02415bd022\" class=\"fn\"><a href=\"#3e1cd8cc-472b-4a6f-b1ab-ab02415bd022\" id=\"3e1cd8cc-472b-4a6f-b1ab-ab02415bd022-link\">42<\/a><\/sup><\/td><td>23%<\/td><td>25%<\/td><\/tr><tr><td>Affiliated with top AI lab<\/td><td>18%<\/td><td>8%<\/td><td>16%<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 2:<\/strong> Comparisons between characteristics of the sampling frame population and the population of unweighted\/weighted Wave 1 respondents.<\/figcaption><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"respondent-characteristics\">Respondent Characteristics<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">339 experts provided complete forecasts for at least one survey. We\nreport respondent counts by domain of expertise in Table 3.<\/p>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><strong>Expert type<\/strong><\/td><td><strong>Number of\nrespondents<\/strong><\/td><\/tr><tr><td>Computer scientists<\/td><td>76<\/td><\/tr><tr><td>AI Industry employees<\/td><td>76<\/td><\/tr><tr><td>Economists<\/td><td>68<\/td><\/tr><tr><td>AI policy experts<\/td><td>119<\/td><\/tr><tr><td><strong>Total<\/strong><\/td><td><strong>339<\/strong><\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 3:<\/strong> Number of respondents per expert category. Respondents completed at least one of the first three survey waves.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">To better characterize these experts, our respondent sample\nincludes:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><em>Top computer scientists:<\/em> 41 of our 76 computer science experts (54%) are professors, and 30 of these 41 (73%) are from top-20 institutions (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Berger 2025<\/a>). Twenty-three (30%) had top-rated (top-40 or better) papers at NeurIPS or ICLR in recent years, and eight others (11%) are PhD students or postdocs who are highly cited according to our criteria.<sup data-fn=\"31f3e343-943a-4f7c-b254-30b03fd86a60\" class=\"fn\"><a href=\"#31f3e343-943a-4f7c-b254-30b03fd86a60\" id=\"31f3e343-943a-4f7c-b254-30b03fd86a60-link\">43<\/a><\/sup> Ten (13%) are among the 200 top-cited authors in AI (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">OpenAlex n.d.<\/a>). This category also includes researchers at academic and non-academic research institutions. Our CS respondents have a median of 7,100 citations (for the 95% of panelists for whom data is available).<\/li>\n\n\n\n<li><em>AI industry experts:<\/em> 20 of our 76 industry respondents (26%) work for one of five leading AI companies: OpenAI, Anthropic, Google DeepMind, Meta, and Nvidia. Twenty-one of the remaining industry respondents (28% of the total) work for either a top AI company (top-20 model providers, by training compute, as measured by EpochAI 2024b), were identified as contributors to top-15 LLMs according to training compute or performance on Chatbot Arena in our sampling procedure (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Epoch AI 2024b<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">LMArena 2024<\/a>), or work for one of the top-30 AI-related companies, as measured by total funds raised (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Crunchbase 2025<\/a>). The remaining respondents were recategorized from our CS literature sampling pools, referral sampling, or other categories. Our industry respondents have a median of 9,100 citations (for the 59% of panelists for whom data is available).<\/li>\n\n\n\n<li><em>Top AI economists:<\/em> 54 of our 68 economist respondents (79%) are professors, and 30 (44%) are from top-50 economics institutions (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">RePEc 2025<\/a>).<sup data-fn=\"98cb1a57-9113-4df3-b1a4-4ac0efac4e9d\" class=\"fn\"><a href=\"#98cb1a57-9113-4df3-b1a4-4ac0efac4e9d\" id=\"98cb1a57-9113-4df3-b1a4-4ac0efac4e9d-link\">44<\/a><\/sup> Our economist respondents have a median of 2,200 citations (for the 96% of panelists for whom data is available).<\/li>\n\n\n\n<li><em>Policy and think tank group:<\/em> Of our 119 AI policy\nrespondents, 75 (60%) work for the following most-represented\norganizations (unordered): Brookings, RAND, Epoch AI, Federation of\nAmerican Scientists, Center for Security and Emerging Technology, AI\nNow, Carnegie Endowment, Foundation for American Innovation, Stanford\u2019s\nInstitute for Human-Centered Artificial Intelligence and related groups,\nGovAI, Institute for AI Policy and Strategy, Future of Life Institute,\nInstitute for Law &amp; AI, Center for a New American Security, Data\n&amp; Society Research Institute, Abundance Institute, and the Centre\nfor International Governance Innovation.<\/li>\n\n\n\n<li><em>TIME 100:<\/em> Our panel includes 12 honorees from TIME\u2019s 100 Most Influential People in AI in 2023 and 2024 (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Bajekal 2023<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Barker Bonomo and Javed 2024<\/a>). TIME 100 honorees are categorized by their expertise and distributed among the above categories.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">In addition to domain experts, respondents included:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>60 highly accurate forecasters (\u201csuperforecasters\u201d), based on\nperformance in prior geopolitical forecasting tournaments.<\/li>\n\n\n\n<li>1,400 members of the public, largely consisting of especially\nengaged participants in previous research, reweighted to be nationally\nrepresentative of the U.S.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">We report the count of respondents in each wave in Figure 1 below. The drop in expert completions from Wave 1 to Wave 2 was much larger (23%) than from Wave 2 to 3 (4%).<sup data-fn=\"41549962-1a59-45e2-a739-8bb71e8a1b2f\" class=\"fn\"><a href=\"#41549962-1a59-45e2-a739-8bb71e8a1b2f\" id=\"41549962-1a59-45e2-a739-8bb71e8a1b2f-link\">45<\/a><\/sup><\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-01.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure 1.<\/strong> The number of survey completions per participant group, across the first three survey waves of LEAP.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">In the first three waves, the median expert respondent spent 44 minutes on each survey,<sup data-fn=\"10c0bd6e-dc02-4b64-8601-2383555bcf25\" class=\"fn\"><a href=\"#10c0bd6e-dc02-4b64-8601-2383555bcf25\" id=\"10c0bd6e-dc02-4b64-8601-2383555bcf25-link\">46<\/a><\/sup> while the median member of the public and superforecaster spent 29 and 90 minutes, respectively. Respondents provided 1.7 million words of rationales across three waves. Over 600,000 of these words come from our superforecaster and expert samples, with the remainder from public respondents.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"monthly-surveys-and-forecasting-questions\">Monthly Surveys and\nForecasting Questions<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">We conduct surveys approximately every month, each consisting of 5\u20136 forecasting questions. We expect each survey to take experts approximately 30\u201340 minutes to complete, and respondents are informed of this time estimate. Respondents receive a standardized payment for each survey they complete.<sup data-fn=\"2a066104-34d3-43c9-916f-65ad3ecd00c2\" class=\"fn\"><a href=\"#2a066104-34d3-43c9-916f-65ad3ecd00c2\" id=\"2a066104-34d3-43c9-916f-65ad3ecd00c2-link\">47<\/a><\/sup> In addition to quantitative forecasts, we collect rationales for each forecasting question, in the form of plain text. We use these rationale data to understand the underlying reasons for forecaster responses.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">LEAP includes forecasting questions across five categories:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li><em>AI inputs:<\/em> drivers of AI progress like investment,\nelectricity consumption, and other AI R&amp;D inputs, such as\ntalent.<\/li>\n\n\n\n<li><em>AI capabilities:<\/em> measures of AI progress such as\nbenchmark performance on difficult and consequential tasks.<\/li>\n\n\n\n<li><em>AI adoption:<\/em> drivers of AI impact such as the prevalence\nof AI applications and the intensity of AI use in economically\nmeaningful and high-stakes contexts.<\/li>\n\n\n\n<li><em>AI impacts:<\/em> downstream societal effects, such as AI\nincidents and labor market disruption.<\/li>\n\n\n\n<li><em>AI scenarios:<\/em> bundles of outcomes that represent different trajectories for the technology. See an example from Wave 1 <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=105\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=105\" target=\"_blank\" rel=\"noreferrer noopener\">here (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=105\" target=\"_blank\" rel=\"noreferrer noopener\">General AI Progress<\/a>)<\/a>.<\/li>\n<\/ol>\n\n\n\n<p class=\"wp-block-paragraph\">We source forecasting questions and topics from academic papers, technical and policy reports, prediction platforms, public writings by leading AI figures, our past research output, our academic advisory board, and suggestions from survey respondents. We then create resolvable forecasting questions from these various input sources. You can view example questions with associated resolution criteria in <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=95\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=95\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix E. Survey Questions<\/a>. The resolution criteria are intended to reduce noise in forecast collection by minimizing the space for different question interpretations among forecasters, as well as permit accuracy assessment.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Second, we ask various types of forecasting questions:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li><em>Probabilistic:<\/em> We ask participants to assign a probability to a binary or discrete event. For example, \u201cWill AI solve or substantially assist in solving a Millennium Prize Problem in mathematics by the following resolution dates?\u201d (Wave 2)<sup data-fn=\"b8186e41-dd7f-49b2-820c-dc0836183480\" class=\"fn\"><a href=\"#b8186e41-dd7f-49b2-820c-dc0836183480\" id=\"b8186e41-dd7f-49b2-820c-dc0836183480-link\">48<\/a><\/sup><\/li>\n\n\n\n<li><em>Quantile:<\/em> We ask participants to forecast quantiles of a continuous outcome (typically the 25<sup>th<\/sup>, 50<sup>th<\/sup>, and 75<sup>th<\/sup> percentiles). For example, \u201cWhat will be the highest percentage accuracy achieved by an AI model on FrontierMath, by the following resolution dates?\u201d (Wave 1)<sup data-fn=\"fbb1f957-9891-42fa-8c14-a32faa4e4b21\" class=\"fn\"><a href=\"#fbb1f957-9891-42fa-8c14-a32faa4e4b21\" id=\"fbb1f957-9891-42fa-8c14-a32faa4e4b21-link\">49<\/a><\/sup><\/li>\n\n\n\n<li><em>Point Estimate:<\/em> We ask participants for a point estimate of a continuous outcome. For example, \u201cBy the end of 2030, what percent of LEAP expert panelists will agree that each of the following is a serious cognitive limitation of state-of-the-art AI systems?\u201d (Wave 2; list omitted for brevity but available in <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=110\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=110\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix E.II. Survey Questions: Wave 2<\/a>).<\/li>\n<\/ol>\n\n\n\n<p class=\"wp-block-paragraph\">Scoring first requires us to resolve the values being forecasted. Then, these scoring rules are used to provide accuracy prizes to incentivize truthful reporting. Respondents are provided with detailed resolution criteria and relevant historical baseline data in order to inform their forecasts. We resolve questions using either ground truth data or data generated from LEAP itself. We discuss these methods in greater detail in <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=85\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=85\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix B. Monthly Surveys and Forecasting Questions<\/a>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We score rationales with a combination of human and LLM judges, and\nprovide prizes to the highest quality rationales in each survey wave. We\ndo not publicly share our scoring criteria to prevent gaming by\nparticipants.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"results\">Results<\/h2>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"uncertainty-and-disagreement\">Uncertainty and Disagreement<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">On many questions, we ask respondents to express their uncertainty in the form of quantile forecasts. We describe quantile forecasts in greater detail in <a href=\"#monthly-surveys-and-forecasting-questions\">Monthly Surveys and Forecasting Questions<\/a>. This approach allows us to quantify two sources of variation: <em>between-expert disagreement<\/em> (i.e., how much experts disagree with each other) and <em>within-expert uncertainty<\/em> (i.e., how wide each expert\u2019s predicted range is). To measure total variation, we generate a \u201cpooled\u201d distribution of respondent beliefs, representing the full variation in expert views.<sup data-fn=\"9753e07e-ab53-40f2-a948-77cfa0cacb13\" class=\"fn\"><a href=\"#9753e07e-ab53-40f2-a948-77cfa0cacb13\" id=\"9753e07e-ab53-40f2-a948-77cfa0cacb13-link\">50<\/a><\/sup> We discuss this procedure in greater detail in <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=92\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix C. Pooled Distribution Estimation<\/a>. We plot these pooled distributions for several questions in <a href=\"#fig-04\">Figure 4<\/a>, and we then decompose the variation in expert views into the proportion due to within-forecaster uncertainty and the component due to between-forecaster disagreement. For example, if we look at the share of work hours participants forecast to be assisted by generative AI, the variance of the distribution grows over time: the standard deviation is 5.7% in 2025, 13% in 2027, and 22% in 2030, and 65% to 53% of the variation is explained by within-forecaster uncertainty across the three time horizons.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">While this approach to understanding uncertainty and disagreement is used across questions and groups of forecasters through the rest of the paper, we outline it with expert responses to one example question here. By 2030, the median expert predicts that 18% of all U.S. work hours will be assisted by generative AI. We will discuss this finding in more detail below. The 18% median forecast should be interpreted alongside information about variation in beliefs. To capture the full extent of expert uncertainty and disagreement, we construct an aggregate distribution of expert beliefs by combining each expert&#8217;s distributional forecasts on the question into a pooled (mixture) distribution. The 25<sup>th<\/sup>\u201375<sup>th<\/sup> percentile range of this pooled distribution is (7.3%, 34.6%). The ranges represent the total variation of beliefs in the expert pool. We use the law of total variance to decompose the variance of this pooled distribution into between-forecaster disagreement and within-forecaster uncertainty. For example, 53% of the variation in forecasts on this question is due to within-forecaster uncertainty and 47% are due to between-forecaster disagreement. <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=92\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix C. Pooled Distribution Estimation<\/a> contains more details on the method we use for this analysis.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Because fitting a distribution to a respondent\u2019s forecasts relies on\na parametric assumption, especially about the tails of individual\ndistributions, we also present two other measures of variation. First,\nwe present a basic measure of the amount of between-forecaster\ndisagreement in the forecast: the interquartile range of experts\u2019\n50<sup>th<\/sup> percentile forecasts was 9%\u201330%, indicating substantial\nvariation among experts in their predictions of the median outcome. In\nother words, a quarter of all experts believe that 9% of work hours (or\nless) will be assisted by generative AI, and a quarter believe it is\nlikely to be above 30%.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Second, we present a measure of individuals\u2019 uncertainty based on the quantile forecasts we elicited. In addition to the median, we elicited 25<sup>th<\/sup> and 75<sup>th<\/sup> percentile quantiles from each forecaster on this question. The median 25<sup>th<\/sup> percentile forecast across all experts was 9% and the median 75<sup>th<\/sup> percentile forecast was 28%. This 9%\u201328% range cannot be interpreted as a typical confidence interval, but it can indicate the degree of uncertainty forecasters had about the outcome of interest.<sup data-fn=\"ab935eb6-3a6a-45e4-84b4-e5c4214e247c\" class=\"fn\"><a href=\"#ab935eb6-3a6a-45e4-84b4-e5c4214e247c\" id=\"ab935eb6-3a6a-45e4-84b4-e5c4214e247c-link\">51<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In the footnotes for each point estimate, we report three measures of\nuncertainty as follows: we first summarize the total variation, pooling\nall expert beliefs and discussing the variation in that distribution. We\nthen report the variation in median forecasts. If we only collected a\ncentral estimate, we do not report additional statistics. If we did\ncollect other quantile forecasts, we report the median forecasts across\nexperts of those quantiles. For example, on this example question, we\nwould report the median 25<sup>th<\/sup> (and 75<sup>th<\/sup>) percentile\nforecasts for each quantity of interest.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"key-insights\">Key Insights<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">We draw five insights from the results for Waves 1, 2, and 3. First,\nmost experts expect sizable near\u2011term societal effects from AI. Second,\nsubstantial disagreement underlies these forecasts. Third, the median\nexpert expects much less progress than prominent leaders of frontier AI\nlabs. Fourth, experts anticipate faster progress than the public on most\noutcomes. Fifth, highly accurate forecasters (superforecasters) and\nexperts are largely aligned, with superforecasters expecting slightly\nless progress overall. Differences across expert subgroups (CS,\neconomics, industry, policy) are small and rarely statistically\nsignificant, and reweighting our expert sample to match a carefully\nconstructed expert sampling frame leaves our headline conclusions\nunchanged.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"experts-expect-sizable-societal-effects-from-ai-by-2040.\">1.\nExperts expect sizable societal effects from AI by 2040.<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">In particular, the median expert expects substantial impacts on the\nability of AI systems to solve difficult math problems, the use of AI\nfor companionship and work, electricity usage from AI, and investment in\nAI. Even the lower end of the expert belief distribution still implies\nsubstantial impacts of AI:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><em>Work:<\/em> The median expert forecast is that 18% of work hours will be assisted by generative AI in 2030,<sup data-fn=\"98818581-9957-4f4d-ab1f-c4945458f22a\" class=\"fn\"><a href=\"#98818581-9957-4f4d-ab1f-c4945458f22a\" id=\"98818581-9957-4f4d-ab1f-c4945458f22a-link\">52<\/a><\/sup> up from approximately 4.1% in November 2024 (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Bick et al. 2025<\/a>), over a 4x increase.<sup data-fn=\"2b7fae99-fc07-44f2-b81f-3174831227bb\" class=\"fn\"><a href=\"#2b7fae99-fc07-44f2-b81f-3174831227bb\" id=\"2b7fae99-fc07-44f2-b81f-3174831227bb-link\">53<\/a><\/sup> The bottom quartile of experts gives a forecast of 9%, while the top quartile gives a forecast of 30%. The median expert gives a 25% chance the value is 9% or lower (still over a 2x increase), and a 25% chance it exceeds 28%.<sup data-fn=\"8cc15fdf-409f-4f29-98f6-67baaf97c59d\" class=\"fn\"><a href=\"#8cc15fdf-409f-4f29-98f6-67baaf97c59d\" id=\"8cc15fdf-409f-4f29-98f6-67baaf97c59d-link\">54<\/a><\/sup><\/li>\n\n\n\n<li><em>Private AI investment:<\/em> The median expert predicts that Our World in Data will report $260 billion of global private AI investment by 2030, up from the $130 billion baseline for the series in 2024.<sup data-fn=\"23bfde2d-f28b-477d-8655-ea9c96555e8f\" class=\"fn\"><a href=\"#23bfde2d-f28b-477d-8655-ea9c96555e8f\" id=\"23bfde2d-f28b-477d-8655-ea9c96555e8f-link\">55<\/a><\/sup> The median expert gives a 25% chance that investment will be at or below $175 billion, over a third higher than the baseline value, and another 25% chance that investment matches or exceeds $400 billion, just over 3x larger than the baseline level.<\/li>\n\n\n\n<li><em>Electricity usage:<\/em> The median expert predicts that 7% of\nU.S. electricity consumption will be used for training and deploying AI\nsystems in 2030, and close to double that (12%) in 2040. For context, 7%\nis 1.5x today\u2019s entire data-center load, 13% is all of Texas\u2019\nelectricity use, 23% is almost all of the industrial sector\u2019s\nelectricity use, and 40% accounts for all residential electricity use.\nEven experts expecting less electricity consumption give substantial\nmedian forecasts: the bottom quartile of experts still predict values of\n5% in 2030 and 8% in 2040.<\/li>\n\n\n\n<li><em>Math research:<\/em> 23% of experts predict that the FrontierMath benchmark will be saturated by the end of 2030,<sup data-fn=\"c2d6b13f-32f9-40bb-b188-ddff67dc7421\" class=\"fn\"><a href=\"#c2d6b13f-32f9-40bb-b188-ddff67dc7421\" id=\"c2d6b13f-32f9-40bb-b188-ddff67dc7421-link\">56<\/a><\/sup> meaning that AI can autonomously solve a set of math problems that resemble those a math PhD student might spend several days completing. The bottom quarter of experts expect 60% or less of these problems to be solved in the same timeframe, substantially more than the 19% baseline at the time of the survey. By 2040, experts predict it is more likely than not (60%) that AI will substantially assist in solving a Millennium Prize Problem, a set of problems comprising some of the most difficult unsolved mathematical problems.<\/li>\n\n\n\n<li><em>Companionship:<\/em> The median expert predicts that by 2030,\n15% of adults will self-report using AI for companionship, emotional\nsupport, social interaction, or simulated relationships at least once\ndaily, up from 6% today. By 2040, that number doubles to 30% of\nadults.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">To assess the broader scope of AI\u2019s impacts, we asked experts to\nassess \u201cslow\u201d versus \u201cmoderate\u201d or \u201cfast\u201d scenarios for AI progress, and\nhow AI will compare to other historically significant developments such\nas the internet, electricity, and the Industrial Revolution. We\nfound:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><em>Speed of AI progress:<\/em> By 2030, the average expert thinks that 23%<sup data-fn=\"8acd763f-4363-457b-a67c-090e56541707\" class=\"fn\"><a href=\"#8acd763f-4363-457b-a67c-090e56541707\" id=\"8acd763f-4363-457b-a67c-090e56541707-link\">57<\/a><\/sup> of LEAP panelists will say the state of AI most closely mirrors a \u201crapid\u201d progress scenario, which we described as: AI writes Pulitzer Prize-worthy novels, collapses years-long research into days and weeks, outcompetes any human software engineer, and independently develops new cures for cancer.<sup data-fn=\"bdd05175-27a9-4dcd-aa6b-f7c5de77458a\" class=\"fn\"><a href=\"#bdd05175-27a9-4dcd-aa6b-f7c5de77458a\" id=\"bdd05175-27a9-4dcd-aa6b-f7c5de77458a-link\">58<\/a><\/sup> Conversely, they give a 28% chance of a slow-progress scenario, in which AI is a widely useful assisting technology but falls short of transformative impact.<\/li>\n\n\n\n<li><em>Societal impact:<\/em> By 2040, the median expert predicts that the impact of AI will be comparable to a \u201ctechnology of the century,\u201d akin to electricity or automobiles. Experts also give a 32% chance that AI will be at least as impactful as a \u201ctechnology of the millennium,\u201d such as the printing press or the Industrial Revolution and a 16% chance the AI is equally or less impactful than a \u201ctechnology of the year\u201d like the VCR.<sup data-fn=\"f7f7e06f-3b9e-412d-afe6-7537f6485a8d\" class=\"fn\"><a href=\"#f7f7e06f-3b9e-412d-afe6-7537f6485a8d\" id=\"f7f7e06f-3b9e-412d-afe6-7537f6485a8d-link\">59<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">The median expert predicts 2%<sup data-fn=\"f5441114-d01f-46a5-8871-c9bbaeb2942e\" class=\"fn\"><a href=\"#f5441114-d01f-46a5-8871-c9bbaeb2942e\" id=\"f5441114-d01f-46a5-8871-c9bbaeb2942e-link\">60<\/a><\/sup> growth in white-collar jobs between January 2025 and December 2030. This is significantly slower than a recent linear trend, which would predict 6.8% growth. However, we did not collect forecasts on the causal effect of AI on white-collar employment.<sup data-fn=\"41a645ad-d348-452d-9788-c893693a4f45\" class=\"fn\"><a href=\"#41a645ad-d348-452d-9788-c893693a4f45\" id=\"41a645ad-d348-452d-9788-c893693a4f45-link\">61<\/a><\/sup> While some experts expect AI to cause white-collar job loss (See <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=136\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=136\" target=\"_blank\" rel=\"noreferrer noopener\">Occupational Employment Index<\/a>), this question does not allow for a clear understanding of that causal relationship.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We summarize the expert forecasts for these various indicators in\nFigures 2 and 3.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-02.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure 2:<\/strong> Median expert forecasts for various questions. We display the 10<sup>th<\/sup> , 25<sup>th<\/sup> , 50<sup>th<\/sup> , 75<sup>th<\/sup> , and 90<sup>th<\/sup> percentiles of the median forecasts given by experts at each date. For example, if 25% of experts give a median forecast of 5% or less, the 25<sup>th<\/sup> percentile series in the graph will lie at 5%; these series are not confidence intervals. Where available, we include a historical baseline.<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-03.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure 3:<\/strong> Average expert forecasts on the Technological Richter Scale question.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Experts lend insight into the aggregate results above with their rationales. For example, when considering the societal impact of AI (as defined by a technological Richter scale),<sup data-fn=\"cad14124-84e3-405a-8c44-d454bc9b91c3\" class=\"fn\"><a href=\"#cad14124-84e3-405a-8c44-d454bc9b91c3\" id=\"cad14124-84e3-405a-8c44-d454bc9b91c3-link\">62<\/a><\/sup> one expert is well aligned with the median in that they assign the greatest probability to AI being the Technology of the Century, but they also argue:<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\"><em>&#8230;if progress can continue at its present rate, Technology of the Millennium is a possibility. Given the level of investment and the scramble for talent, Technology of the Decade is assured. We are on a long runway stretching back 50 years and have finally achieved liftoff\u2026 AI could conceivably rival the printing press at giving [the] everyman a level of intelligence where it once provided [the] everyman with information. The industrial revolution greatly increased the material productivity of society; AI could provide the same boost for both material and service products by trading electrical energy for intellect.<\/em><\/p>\n<\/blockquote>\n\n\n\n<p class=\"wp-block-paragraph\">Echoing that sentiment, an expert who forecasted a slightly\nabove-the-median impact writes, \u201cWhile there are parallel examples in\nthe rise of agriculture and industrial production, particularly in terms\nof general-purpose innovation (steam, fossil fuels, electricity, etc.),\nAI is unique because it both augments human intelligence and will\neventually surpass it.\u201d<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"experts-disagree-and-express-substantial-uncertainty-about-the-trajectory-of-ai.\">2.\nExperts disagree and express substantial uncertainty about the\ntrajectory of AI.<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">While the median expert predicts substantial AI progress, and a sizable fraction of experts predict fast progress, experts disagree widely. Notably, the top quartile of experts gives a median forecast that 50% of newly approved drug sales in the U.S. in 2040 will be from AI-discovered drugs, compared to a median forecast of just 10% for the bottom quartile of experts.<sup data-fn=\"5d86a372-9fcf-43bb-a540-84cfcdf20a1e\" class=\"fn\"><a href=\"#5d86a372-9fcf-43bb-a540-84cfcdf20a1e\" id=\"5d86a372-9fcf-43bb-a540-84cfcdf20a1e-link\">63<\/a><\/sup> Further, the top quartile of experts gives a forecast of at least 81% that AI will solve or substantially assist in solving a solution to a Millennium Prize Problem by 2040, compared to a forecast of just 30% from the bottom quartile of experts.<sup data-fn=\"d3f78ea9-9e62-4a05-95f9-937aa2e600e9\" class=\"fn\"><a href=\"#d3f78ea9-9e62-4a05-95f9-937aa2e600e9\" id=\"d3f78ea9-9e62-4a05-95f9-937aa2e600e9-link\">64<\/a><\/sup> We use our pooled distributions to express the relative importance of within-forecaster uncertainty and between-forecaster disagreement. We find that, across all forecasting questions that allow forecasters to express their uncertainty, within-forecaster uncertainty explains 49% of the total variation in forecasts, compared to the 51% explained by between-forecaster disagreement.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In Figure 4 below, we plot the pooled distributions for expert\nforecasts on the share of work hours assisted by generative AI and\nFrontierMath scores by the ends of 2025, 2027, and 2030.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\" id=\"fig-04\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-04.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure 4:<\/strong> Pooled distributions for expert forecasts on Work Hours Assisted by Generative AI (top panels) and FrontierMath scores (bottom panels). These pooled distributions combine within-expert uncertainty and between-expert disagreement. Densities are normalized to the same peak for comparability. See <a href=\"#uncertainty-and-disagreement\" id=\"#uncertainty-and-disagreement\">Uncertainty and Disagreement<\/a> for details.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">We report summary statistics for the expert pooled distributions for select questions in Table 4 below. You can find tables for all other relevant questions on the LEAP website.<sup data-fn=\"0f876b32-154f-4ffd-81cd-cb565602b4f8\" class=\"fn\"><a href=\"#0f876b32-154f-4ffd-81cd-cb565602b4f8\" id=\"0f876b32-154f-4ffd-81cd-cb565602b4f8-link\">65<\/a><\/sup><\/p>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><strong>Question<\/strong><\/td><td><strong>Resolution Year<\/strong><\/td><td><strong>25<sup>th<\/sup>\nPctl.<\/strong><\/td><td><strong>50<sup>th<\/sup>\nPctl.<\/strong><\/td><td><strong>75<sup>th<\/sup>\nPctl.<\/strong><\/td><td><strong>Within Share<\/strong><\/td><td><strong>Between Share<\/strong><\/td><\/tr><tr><td rowspan=\"2\">AV Trips<\/td><td>2027<\/td><td>2.7<\/td><td>7.1<\/td><td>20.1<\/td><td>0.53<\/td><td>0.47<\/td><\/tr><tr><td>2030<\/td><td>6.9<\/td><td>19.2<\/td><td>46.2<\/td><td>0.53<\/td><td>0.47<\/td><\/tr><tr><td rowspan=\"3\">Drug Discovery<\/td><td>2027<\/td><td>1.1<\/td><td>2.5<\/td><td>6.8<\/td><td>0.41<\/td><td>0.59<\/td><\/tr><tr><td>2030<\/td><td>2.4<\/td><td>6<\/td><td>15.7<\/td><td>0.41<\/td><td>0.59<\/td><\/tr><tr><td>2040<\/td><td>8.4<\/td><td>22.8<\/td><td>53.8<\/td><td>0.48<\/td><td>0.52<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 4:<\/strong> Summary of expert pooled distributions for U.S. Ride-Hailing Trips by Autonomous Vehicles (AV Trips) and AI-discovered share of Drug Sales (Drug Discovery). \u201cWithin Share\u201d and \u201cBetween Share\u201d denote the proportion of variance from within-forecaster uncertainty and between-forecaster disagreement, respectively.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">The degree of within-forecaster uncertainty revealed above is also\nwell-represented in the rationales. As one expert writes, \u201cPredicting\nwhen such breakthroughs come is notoriously difficult, hence broad\nconfidence intervals,\u201d reflecting a common sentiment. Another expresses\na more expansive view:<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\"><em>My forecasts reflect the ambivalence I feel between competing narratives\u2014one being that AI\u2019s potential to change the course of human history has been overhyped, and that we\u2019ll run out of decent training data, and so on; the other being that we\u2019re in a delusional, pre-Copernican state where we humans still cling to a \u2018we\u2019re the center of the universe\u2019 notion that intelligence is something unique to our species, or at least has to be rooted in biological entities, even as it becomes blindingly self-evident that this is not true.<\/em><\/p>\n<\/blockquote>\n\n\n\n<p class=\"wp-block-paragraph\">The sharp between-forecaster disagreement documented by the forecasts\nis also revealed in the rationales, particularly through their\njuxtaposition. For example, when considering pace-of-AI-progress\nscenarios, one expert writes, \u201cChatGPT was first publicly released in\nlate 2022. I don\u2019t believe what we witnessed over these past 2.5 years\nwould justify expecting rapid progress over the next 5,\u201d while another\noffers, \u201cI think the last three years of progress have been\nqualitatively immense, so the next five years seem like they could lead\nto highly autonomous systems capable of very impressive things.\u201d In\nother instances, the contrast is starker: \u201cI believe that there is wide\nconsensus on the rapid evolution of AI,\u201d writes one expert. \u201cRapid\nprogress scenario is unhinged,\u201d writes another.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"the-median-expert-expects-significantly-less-ai-progress-than-leaders-of-frontier-ai-companies.\">3.\nThe median expert expects significantly less AI progress than leaders of\nfrontier AI companies.<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Leaders of frontier AI companies have made aggressive predictions\nabout AI progress. Dario Amodei, co-founder and CEO of Anthropic,\npredicts:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>January 2025: \u201cBy 2026 or 2027, we will have AI systems that are broadly better than almost all humans at almost all things.\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">World Economic Forum 2025<\/a>)<\/li>\n\n\n\n<li>March 2025: Anthropic also claimed in a response to the U.S. Office of Science and Technology Policy that it anticipates that by 2027 AI systems will exist that equal the intellectual capabilities of \u201cNobel Prize winners across most disciplines\u2014including biology, computer science, mathematics, and engineering.\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">D\u2019Souza 2025<\/a>)<\/li>\n\n\n\n<li>May 2025: Amodei has stated that AI could increase overall unemployment to 10-20% in the next one to five years, a prediction highlighted by Barack Obama. (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Allen and VandeHei 2025<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Obama 2025<\/a>)<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Sam Altman of OpenAI states that:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>January 2025: \u201cI think AGI will probably get developed during [Donald Trump\u2019s second presidential] term, and getting that right seems really important.\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Tyrangiel 2025<\/a>)<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Elon Musk, leader of xAI and Tesla, writes:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>December 2024: \u201cIt is increasingly likely that AI will superset [sic] the intelligence of any single human by the end of 2025 and maybe all humans by 2027\/2028. Probability that AI exceeds the intelligence of all humans combined by 2030 is ~100%.\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Musk 2024<\/a>)<\/li>\n\n\n\n<li>August 2025: When a user posted \u201cBy 2030, all jobs will be replaced by AI and robots,\u201d Musk responded: \u201cYour estimates are about right.\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Musk 2025<\/a>)<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Demis Hassabis, CEO and co-founder of Google DeepMind predicts:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>August 2025: \u201cWe&#8217;ll have something that we could sort of reasonably call AGI, that exhibits all the cognitive capabilities humans have, maybe in the next five to 10 years, possibly the lower end of that.\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Rose 2025<\/a>)<\/li>\n\n\n\n<li>August 2025: \u201cIt\u2019s going to be 10 times bigger than the Industrial Revolution, and maybe 10 times faster.\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Rose 2025<\/a>)<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">While we cannot directly compare these claims to LEAP questions, we\noffer clear evidence based on LEAP forecasts that the median expert\nexpects substantially smaller effects of AI than is expected by frontier\nAI company leaders:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><em>General capabilities:<\/em> Lab leaders predict human-level or superhuman AI by 2026-2029, while our expert panel indicates longer timelines for superhuman capabilities. By 2030, the average expert thinks that 23% of LEAP panelists will say the state of AI most closely mirrors an (\u201crapid\u201d) AI progress scenario that matches some of these claims.<sup data-fn=\"1c277265-12d5-46e7-86a4-18ea84d9eedb\" class=\"fn\"><a href=\"#1c277265-12d5-46e7-86a4-18ea84d9eedb\" id=\"1c277265-12d5-46e7-86a4-18ea84d9eedb-link\">66<\/a><\/sup><sup>,<\/sup><sup data-fn=\"d5650a95-e72c-4f84-9fb3-40613353d20e\" class=\"fn\"><a href=\"#d5650a95-e72c-4f84-9fb3-40613353d20e\" id=\"d5650a95-e72c-4f84-9fb3-40613353d20e-link\">67<\/a><\/sup><\/li>\n\n\n\n<li><em>White-collar jobs:<\/em> The median expert predicts 2% growth in white-collar employment by 2030 (compared to a 6.8% trend extrapolation).<sup data-fn=\"d11df40e-72a1-411b-8fde-3f2d502bf544\" class=\"fn\"><a href=\"#d11df40e-72a1-411b-8fde-3f2d502bf544\" id=\"d11df40e-72a1-411b-8fde-3f2d502bf544-link\">68<\/a><\/sup> This contrasts with Elon Musk&#8217;s suggestion that all jobs might be replaced by 2030.<sup data-fn=\"6feae762-cbb5-456d-8f2d-9c0ecc33f95a\" class=\"fn\"><a href=\"#6feae762-cbb5-456d-8f2d-9c0ecc33f95a\" id=\"6feae762-cbb5-456d-8f2d-9c0ecc33f95a-link\">69<\/a><\/sup> Relatedly, Dario Amodei predicts 10-20% overall unemployment within the next five years.<\/li>\n\n\n\n<li><em>Millennium Prize Problems:<\/em> The median expert gives a 60% chance that AI will substantially assist in solving a Millennium Prize Problem by 2040<sup data-fn=\"88000fb8-dbe0-4c1e-bdba-6f267cd8ccbf\" class=\"fn\"><a href=\"#88000fb8-dbe0-4c1e-bdba-6f267cd8ccbf\" id=\"88000fb8-dbe0-4c1e-bdba-6f267cd8ccbf-link\">70<\/a><\/sup> (and 20% by 2030).<sup data-fn=\"2997cdbe-89c3-4246-aa80-9590826ba717\" class=\"fn\"><a href=\"#2997cdbe-89c3-4246-aa80-9590826ba717\" id=\"2997cdbe-89c3-4246-aa80-9590826ba717-link\">71<\/a><\/sup> Amodei&#8217;s prediction of general \u201cNobel Prize winner\u201d level capabilities by 2026-2027 could imply a much more aggressive timeline, but the implication of Amodei\u2019s predictions are somewhat unclear.<sup data-fn=\"cbd0269b-114c-4ad3-9bd8-1b550056f9d2\" class=\"fn\"><a href=\"#cbd0269b-114c-4ad3-9bd8-1b550056f9d2\" id=\"cbd0269b-114c-4ad3-9bd8-1b550056f9d2-link\">72<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">The rationales help explain why the median expert forecasted slower\nprogress than AI company leaders predict. Some reference specific claims\nmade by company leaders: \u201cAnthropic CEO&#8217;s forecast of 90% of coding in\nthe USA done by AI \u2018within six months\u2019 has been a fantastic dud,\u201d wrote\none expert, and another, \u201cMusk has been talking about autonomous driving\nfor ages, and it\u2019s always been worse in the end than he said.\u201d<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Others offer broader arguments as to why they believe the timelines\npredicted by the leaders of frontier AI companies are unlikely to\nmaterialize. Below are two examples of these arguments:<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\"><em>Radical change in major systems just takes longer than 4-5 years. I also think that [in] many of these domains, even unexpectedly fast advancement in AIs will not easily translate to improvements for quite some time because of unexpected barriers. That is, at least until we have strong artificial general intelligence (AGI), which we will not by 2030. To paraphrase an old saying, every job looks easy for those not actually doing it.<\/em><\/p>\n<\/blockquote>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\"><em>The force of its impact will likely be slowed by bottlenecks in areas AI hasn\u2019t yet conquered. An important concept is that an economic bottleneck grows in significance when productivity elsewhere increases. For instance, if global shipping dramatically increases, a bottleneck in the Suez or Panama Canal becomes much more costly. There likely exist thousands (millions?) of potential bottlenecks in the economy which will only become legible as other processes are sped up by orders of magnitude.<\/em><\/p>\n<\/blockquote>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"experts-predict-much-faster-ai-progress-than-the-general-public.\">4.\nExperts predict much faster AI progress than the general public.<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Of 68 total forecasts<sup data-fn=\"934b8442-9d4f-4d36-8120-1843f29193d6\" class=\"fn\"><a href=\"#934b8442-9d4f-4d36-8120-1843f29193d6\" id=\"934b8442-9d4f-4d36-8120-1843f29193d6-link\">73<\/a><\/sup> (across 14 questions with multiple time horizons and quantiles) with a clear valence of AI capabilities,<sup data-fn=\"9d6dd258-27c6-4bbf-a8a2-d7b72fb8f0f9\" class=\"fn\"><a href=\"#9d6dd258-27c6-4bbf-a8a2-d7b72fb8f0f9\" id=\"9d6dd258-27c6-4bbf-a8a2-d7b72fb8f0f9-link\">74<\/a><\/sup> the general public holds views about AI progress, capabilities, and diffusion that are statistically indistinguishable<sup data-fn=\"7e5ba175-383b-4ed0-a966-7bd43feb4245\" class=\"fn\"><a href=\"#7e5ba175-383b-4ed0-a966-7bd43feb4245\" id=\"7e5ba175-383b-4ed0-a966-7bd43feb4245-link\">75<\/a><\/sup> from experts in 9% of cases, predict less progress<sup data-fn=\"6dc2d733-d15e-453b-8e68-32880d3cca1c\" class=\"fn\"><a href=\"#6dc2d733-d15e-453b-8e68-32880d3cca1c\" id=\"6dc2d733-d15e-453b-8e68-32880d3cca1c-link\">76<\/a><\/sup> than experts in a large majority (71%) of all cases, and predict more progress in 21% of forecasts. Where experts and the public disagree, the public predicts less progress over three times as often as more progress. Across these forecasts that exhibit a clear valence of AI capabilities, a randomly selected expert is 16% more likely than a randomly selected member of the public to predict faster progress than would be expected by random chance. Note, all Mann\u2013Whitney U tests and Cliff\u2019s <em>\u03b4<\/em><sup data-fn=\"2ff68abb-a816-4876-958f-ba26747c8edf\" class=\"fn\"><a href=\"#2ff68abb-a816-4876-958f-ba26747c8edf\" id=\"2ff68abb-a816-4876-958f-ba26747c8edf-link\">77<\/a><\/sup> values are calculated on unweighted data, but we plan to integrate weights into all such analyses in future work. We summarize some of the differences in aggregate forecasts in Figure 5, and Figure 6 plots pooled distributions for experts against those for the public. The questions selected in Figure 5 reflect the progress-valenced questions that easily map onto a percentage scale.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-05.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure 5:<\/strong> Differences between the expert and public median 50<sup>th<\/sup> percentile forecasts for several questions where the unit is a percentage. Points indicate the median of each group\u2019s 50<sup>th<\/sup> percentile forecasts. We apply transformations to create valenced forecasts, where values closer to the left indicate slower progress and values to the right indicate faster progress.<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-06.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure 6:<\/strong> Pooled distributions for expert and public forecasts on various questions across years. These pooled distributions combine within-expert uncertainty and between-expert disagreement. Densities are normalized to the same peak for comparability. See <a href=\"#uncertainty-and-disagreement\">Uncertainty and Disagreement<\/a> for details.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">We report statistical comparisons between experts and the public for select questions in Table 5 below. You can find tables for all other relevant questions on the LEAP website.<sup data-fn=\"888c9df3-1e71-4349-9a83-a8f74371c6ce\" class=\"fn\"><a href=\"#888c9df3-1e71-4349-9a83-a8f74371c6ce\" id=\"888c9df3-1e71-4349-9a83-a8f74371c6ce-link\">78<\/a><\/sup><\/p>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><strong>Question<\/strong><\/td><td><strong>Resolution Year<\/strong><\/td><td><strong>Percentile<\/strong><\/td><td><strong>p-value<\/strong><\/td><td><strong>Cliff&#8217;s <em>\u03b4<\/em><\/strong><\/td><\/tr><tr><td rowspan=\"6\">AV Trips<\/td><td>2027<\/td><td>25<\/td><td>0.017<\/td><td>0.085<\/td><\/tr><tr><td>2027<\/td><td>50<\/td><td>&lt;0.001<\/td><td>0.16<\/td><\/tr><tr><td>2027<\/td><td>75<\/td><td>&lt;0.001<\/td><td>0.23<\/td><\/tr><tr><td>2030<\/td><td>25<\/td><td>&lt;0.001<\/td><td>0.12<\/td><\/tr><tr><td>2030<\/td><td>50<\/td><td>&lt;0.001<\/td><td>0.26<\/td><\/tr><tr><td>2030<\/td><td>75<\/td><td>&lt;0.001<\/td><td>0.35<\/td><\/tr><tr><td rowspan=\"9\">Drug Discovery<\/td><td>2027<\/td><td>25<\/td><td>&lt;0.001<\/td><td>-0.26<\/td><\/tr><tr><td>2027<\/td><td>50<\/td><td>&lt;0.001<\/td><td>-0.19<\/td><\/tr><tr><td>2027<\/td><td>75<\/td><td>0.018<\/td><td>-0.093<\/td><\/tr><tr><td>2030<\/td><td>25<\/td><td>&lt;0.001<\/td><td>-0.14<\/td><\/tr><tr><td>2030<\/td><td>50<\/td><td>0.28<\/td><td>-0.043<\/td><\/tr><tr><td>2030<\/td><td>75<\/td><td>0.15<\/td><td>0.056<\/td><\/tr><tr><td>2040<\/td><td>25<\/td><td>&lt;0.01<\/td><td>0.12<\/td><\/tr><tr><td>2040<\/td><td>50<\/td><td>&lt;0.001<\/td><td>0.28<\/td><\/tr><tr><td>2040<\/td><td>75<\/td><td>&lt;0.001<\/td><td>0.37<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 5:<\/strong> Statistical comparisons for experts and the public for U.S. Ride-Hailing Trips by Autonomous Vehicles (AV Trips) and AI-discovered share of Drug Sales (Drug Discovery). We report p-values from Mann\u2013Whitney U tests and Cliff\u2019s \u03b4 values. A positive value indicates that expert forecasts tend to be larger than public forecasts.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">We summarize some of the major differences below:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><em>Societal impact:<\/em> On average, experts give a 63% chance\nthat AI will be at least as impactful as a \u201ctechnology of the\ncentury\u201d\u2014like electricity or automobiles\u2014whereas the public gives this a\n43% chance. Further, experts give a 32% chance that it will be at least\nas impactful as a \u201ctechnology of the millennium\u201d (akin to the printing\npress or the Industrial Revolution), while the public gives this a 22%\nchance.<\/li>\n\n\n\n<li><em>Autonomous vehicles:<\/em> The public predicts about half as much autonomous vehicle progress as experts by 2030, as suggested by each group\u2019s 50<sup>th<\/sup> percentile forecasts. The median expert in our sample predicts that usage of autonomous vehicles will grow dramatically\u2014from a baseline of 0.27% of all U.S. rideshare trips in Q4 2024 to 20% by the end of 2030.<sup data-fn=\"328076b5-8365-43d5-abc4-cc93d7587a88\" class=\"fn\"><a href=\"#328076b5-8365-43d5-abc4-cc93d7587a88\" id=\"328076b5-8365-43d5-abc4-cc93d7587a88-link\">79<\/a><\/sup> In comparison, the general public predicts 12%<sup data-fn=\"3f0107f8-f99a-4f11-9abf-973f0331d9a1\" class=\"fn\"><a href=\"#3f0107f8-f99a-4f11-9abf-973f0331d9a1\" id=\"3f0107f8-f99a-4f11-9abf-973f0331d9a1-link\">80<\/a><\/sup> (<em>p &lt; 0.001, Cliff\u2019s \u03b4 = 0.26<\/em>).<\/li>\n\n\n\n<li><em>Generative AI use:<\/em> The public predicts about half as\nmuch generative AI use in 2030. Experts predict that 18% of U.S. work\nhours will be assisted by generative AI in 2030, whereas the general\npublic predicts 10% (<em>p &lt; 0.001, Cliff\u2019s \u03b4 = 0.29<\/em>).<\/li>\n\n\n\n<li><em>Mathematics:<\/em> 23% of experts predict that FrontierMath<sup data-fn=\"3467dc4d-ea16-46eb-98f5-47a868923605\" class=\"fn\"><a href=\"#3467dc4d-ea16-46eb-98f5-47a868923605\" id=\"3467dc4d-ea16-46eb-98f5-47a868923605-link\">81<\/a><\/sup> will be saturated by the end of 2030 in the median scenario,<sup data-fn=\"23a598cf-525c-4d4e-ac54-155fb5d62df1\" class=\"fn\"><a href=\"#23a598cf-525c-4d4e-ac54-155fb5d62df1\" id=\"23a598cf-525c-4d4e-ac54-155fb5d62df1-link\">82<\/a><\/sup> meaning that AI can autonomously solve a typical math problem that a math PhD student might spend multiple days completing. Only 6% of the public predict the same, about 3x less.<\/li>\n\n\n\n<li><em>Diffusion into science:<\/em> Experts predict a roughly 10x increase (from 3% to roughly 30%) in AI-engaged papers across Physics, Materials Science, and Medicine between 2022 and 2030.<sup data-fn=\"59e81bf9-27c0-408e-9e27-224496185e50\" class=\"fn\"><a href=\"#59e81bf9-27c0-408e-9e27-224496185e50\" id=\"59e81bf9-27c0-408e-9e27-224496185e50-link\">83<\/a><\/sup> The general public predicts two-thirds as much diffusion into science: that roughly 20% of papers in these fields will be AI-engaged.<sup data-fn=\"80d423ed-cb7d-4acf-8d3e-f74c9ea10fcb\" class=\"fn\"><a href=\"#80d423ed-cb7d-4acf-8d3e-f74c9ea10fcb\" id=\"80d423ed-cb7d-4acf-8d3e-f74c9ea10fcb-link\">84<\/a><\/sup><\/li>\n\n\n\n<li><em>Drug discovery:<\/em> By 2040, the median expert predicts that 25% of sales from newly approved U.S. drugs will be from AI-discovered drugs, compared to 15% for the public (<em>p &lt; 0.001<\/em>,<em>Cliff\u2019s \u03b4 = 0.28<\/em>). The median expert also thinks there\u2019s a 25% chance that AI-discovered drugs will account for more than 43% of recent drug sales, whereas the general public predicts there\u2019s a 25% chance of a share greater than 23%\u2014about half the expert forecast. In contrast, the public expects a larger share from AI-discovered drugs in the short-run, predicting that AI-discovered drugs will account for 2.5% of recent drug sales in 2027; experts predict 1.6% of 2027 sales will be attributed to AI-discovered drugs (<em>p &lt; 0.001, Cliff\u2019s \u03b4 = 0.19)<\/em>.<sup data-fn=\"54630412-1747-4c6c-8b27-9bb8eaf39c0d\" class=\"fn\"><a href=\"#54630412-1747-4c6c-8b27-9bb8eaf39c0d\" id=\"54630412-1747-4c6c-8b27-9bb8eaf39c0d-link\">85<\/a><\/sup><\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Contrary to this result, the public assigns more weight to the \u201cRapid Progress\u201d scenario in the General AI Progress question: the average member of the public expects 26% of LEAP panelists in 2030 will select the rapid scenario (95% confidence interval [25.5%, 26.4%]),<sup data-fn=\"cd6bbf30-b63a-4157-a9b6-136fd4cb82dd\" class=\"fn\"><a href=\"#cd6bbf30-b63a-4157-a9b6-136fd4cb82dd\" id=\"cd6bbf30-b63a-4157-a9b6-136fd4cb82dd-link\">86<\/a><\/sup> compared to 23% for experts (95% confidence interval [22.1%, 23.7%]).<sup data-fn=\"31ae1fcd-f1a7-4a0e-8954-f07e28840e22\" class=\"fn\"><a href=\"#31ae1fcd-f1a7-4a0e-8954-f07e28840e22\" id=\"31ae1fcd-f1a7-4a0e-8954-f07e28840e22-link\">87<\/a><\/sup><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">To assess the extent to which low-effort or relatively lower\ncomprehension from the public could drive these results, we compare\nmembers of our public sample with high forecasting accuracy in other\nstudies to those with low accuracy. We do not find that one group\nsystematically expects more or less progress. <a href=\"#public-accuracy-stratification\"><u>Public Accuracy\nStratification<\/u><\/a> details this analysis. Additionally, while\nwithin-forecaster uncertainty explains 49% of the total variation in\nexpert forecasts, this component explains just 37% of the variation in\npublic forecasts. As forecasting questions resolve, we will compare the\ncalibration (and accuracy) of expert and public forecasts.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"there-are-few-differences-in-prediction-between-superforecasters-and-experts-but-where-there-is-disagreement-experts-tend-to-expect-more-ai-progress.-we-dont-see-systematic-differences-between-the-beliefs-of-computer-scientists-economists-industry-professionals-and-policy-professionals.\">5.\nThere are few differences in prediction between superforecasters and\nexperts, but, where there is disagreement, experts tend to expect more\nAI progress. We don\u2019t see systematic differences between the beliefs of\ncomputer scientists, economists, industry professionals, and policy\nprofessionals.<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\"><em>There are no discernible differences between forecasts from\ndifferent groups of experts<strong>.<\/strong><\/em> Across all pairwise\ncomparisons of expert categories for each of the questions with a clear\nAI progress valence, only 32 out of 408 combinations (7.8%) show\nstatistically significant differences (at a 5% threshold), similar to\nwhat you would expect from chance. This means that computer scientists,\neconomists, industry professionals, and policy professionals largely\npredict similar futures <em>as groups<\/em>, despite there being\nsignificant disagreement about AI among experts. This raises questions\nabout popular narratives that economists tend to be skeptical of AI\nprogress and that employees of AI companies tend to be more optimistic\nabout fast gains in AI capabilities. In other words, while we do see\nwidespread disagreement among experts about the future of AI systems,\ncapabilities, and diffusion, we fail to find evidence that this\ndisagreement is explained by the domain in which experts work. As LEAP\ncontinues, we plan to study what factors most drive expert disagreement.\nHowever, the groups used for these comparisons are subsets of our expert\nsample, so these comparisons are necessarily less powered.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><em>Superforecasters and expert groups predict similar futures.<\/em>\nSuperforecasters are statistically indistinguishable from experts in 69%\nof valenced forecasts, predict less progress than experts in 26% of\nforecasts, and more progress in 4% of forecasts. A randomly selected\nexpert is 9.8% more likely than a randomly selected superforecaster to\npredict faster progress than would be expected by random chance.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><em>Where superforecasters and experts disagree, superforecasters usually (86% of such cases) predict less progress.<\/em> Further, some of these disagreements are quite large<strong>.<\/strong> For example, the median expert predicts that use of autonomous vehicles will grow dramatically\u2014from 0.27% of all U.S. rideshare trips in 2024 to 20% by the end of 2030,<sup data-fn=\"38196ecc-fcfc-4953-b552-fee38be2bf97\" class=\"fn\"><a href=\"#38196ecc-fcfc-4953-b552-fee38be2bf97\" id=\"38196ecc-fcfc-4953-b552-fee38be2bf97-link\">88<\/a><\/sup> whereas the median superforecaster predicts less than half that, 8%<sup data-fn=\"0db06cd8-5f5f-4038-93ff-a68285545c4d\" class=\"fn\"><a href=\"#0db06cd8-5f5f-4038-93ff-a68285545c4d\" id=\"0db06cd8-5f5f-4038-93ff-a68285545c4d-link\">89<\/a><\/sup> (<em>p &lt; 0.001)<\/em>. A randomly selected superforecaster predicts, in the median scenario for 2030, less AV penetration than a randomly selected expert 37% more often than would be expected by random chance. Superforecasters also predict less societal impact from AI and less AI-driven electricity use. Drug discovery is the only setting where superforecasters are more optimistic than experts: By 2040, experts predict that 25% of sales from recently approved U.S. drugs will be from AI-discovered drugs. Superforecasters predict 45%, almost double (<em>p &lt; 0.01<\/em>). Here, a randomly selected superforecaster predicts a higher share, in the median scenario, than a randomly selected expert 23% more often than would be expected by chance.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This pattern is consistent with the follow-up to our Existential Risk Persuasion Tournament, where a small sample of experts were more optimistic about AI progress than around 80 superforecasters about AI progress and capabilities in a 2022 (pre-ChatGPT) survey, although both experts and superforecasters significantly underestimated AI progress by 2025 (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Ku\u010dinskas et al. 2025<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Karger et al. 2025<\/a>). We summarize some of the differences in aggregate forecasts in Figure 7, and Figure 8 plots pooled distributions for experts against superforecasters. In Figure 9, we compare aggregate forecasts of the various expert groups.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-07.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure 7:<\/strong> Differences between the expert and superforecaster median 50<sup>th<\/sup> percentile forecasts for several questions where the unit is a percentage. Points indicate the median of each group\u2019s 50<sup>th<\/sup> percentile forecasts. We apply transformations to create valenced forecasts, where values closer to the left indicate slower progress and values to the right indicate faster progress.<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-08.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure 8:<\/strong> Pooled distributions for expert and superforecaster forecasts on various questions across years. These pooled distributions combine within-expert uncertainty and between-expert disagreement. Densities are normalized to the same peak for comparability. See <a href=\"#uncertainty-and-disagreement\">Uncertainty and Disagreement<\/a> for details.<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-09.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure 9:<\/strong> Expert category median 50<sup>th<\/sup> percentile forecasts for several questions where the unit is a percentage. Points indicate the median of 50<sup>th<\/sup> percentile forecasts for each category. We apply transformations to create valenced forecasts, where values closer to the left indicate slower progress and values to the right indicate faster progress.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">We report statistical comparisons between experts and superforecasters for select questions in Table 6 below. You can find tables for all other relevant questions on the LEAP website.<sup data-fn=\"f6aa4b20-0c7b-4251-a1d4-a3629b227fc0\" class=\"fn\"><a href=\"#f6aa4b20-0c7b-4251-a1d4-a3629b227fc0\" id=\"f6aa4b20-0c7b-4251-a1d4-a3629b227fc0-link\">90<\/a><\/sup><\/p>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><strong>Question<\/strong><\/td><td><strong>Resolution Year<\/strong><\/td><td><strong>Percentile<\/strong><\/td><td><strong>p-value<\/strong><\/td><td><strong>Cliff&#8217;s Delta<\/strong><\/td><\/tr><tr><td rowspan=\"6\">AV Trips<\/td><td>2027<\/td><td>25<\/td><td>&lt;0.001<\/td><td>0.34<\/td><\/tr><tr><td>2027<\/td><td>50<\/td><td>&lt;0.001<\/td><td>0.45<\/td><\/tr><tr><td>2027<\/td><td>75<\/td><td>&lt;0.001<\/td><td>0.44<\/td><\/tr><tr><td>2030<\/td><td>25<\/td><td>&lt;0.001<\/td><td>0.31<\/td><\/tr><tr><td>2030<\/td><td>50<\/td><td>&lt;0.001<\/td><td>0.37<\/td><\/tr><tr><td>2030<\/td><td>75<\/td><td>&lt;0.001<\/td><td>0.32<\/td><\/tr><tr><td rowspan=\"9\">Drug Discovery<\/td><td>2027<\/td><td>25<\/td><td>0.58<\/td><td>0.042<\/td><\/tr><tr><td>2027<\/td><td>50<\/td><td>0.084<\/td><td>0.14<\/td><\/tr><tr><td>2027<\/td><td>75<\/td><td>0.09<\/td><td>0.14<\/td><\/tr><tr><td>2030<\/td><td>25<\/td><td>0.44<\/td><td>0.064<\/td><\/tr><tr><td>2030<\/td><td>50<\/td><td>0.66<\/td><td>0.037<\/td><\/tr><tr><td>2030<\/td><td>75<\/td><td>0.81<\/td><td>0.021<\/td><\/tr><tr><td>2040<\/td><td>25<\/td><td>0.014<\/td><td>-0.2<\/td><\/tr><tr><td>2040<\/td><td>50<\/td><td>&lt;0.01<\/td><td>-0.23<\/td><\/tr><tr><td>2040<\/td><td>75<\/td><td>&lt;0.01<\/td><td>-0.22<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 6:<\/strong> Statistical comparisons for experts and superforecasters for U.S. Ride-Hailing Trips by Autonomous Vehicles (AV Trips) and AI-discovered share of Drug Sales (Drug Discovery). We report p-values from Mann\u2013Whitney U tests and Cliff\u2019s \u03b4 values. A positive value indicates that expert forecasts tend to be larger than superforecaster forecasts.<\/figcaption><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"highlights-from-question-by-question-analyses\">Highlights from\nQuestion-by-Question Analyses<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">To provide a window into some of the details available in the question-level analysis, below we highlight some key reasoning offered by experts in their written rationales. We present these highlights for a handful of questions, but more detail is available in <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=129\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix F. Question-by-Question Results<\/a>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><em>FrontierMath<\/em>:<sup data-fn=\"a8016211-a299-4c2c-a058-715e656b8533\" class=\"fn\"><a href=\"#a8016211-a299-4c2c-a058-715e656b8533\" id=\"a8016211-a299-4c2c-a058-715e656b8533-link\">91<\/a><\/sup> Experts who forecast a high degree of progress on this benchmark often point to recent trends. One argues, \u201cWe&#8217;ve seen jumps of around 5 points<sup data-fn=\"c1891d77-58c4-4aca-8ca2-d9c550549066\" class=\"fn\"><a href=\"#c1891d77-58c4-4aca-8ca2-d9c550549066\" id=\"c1891d77-58c4-4aca-8ca2-d9c550549066-link\">92<\/a><\/sup> on this benchmark every couple of months so far and these jumps will only accelerate as scores approach 50% (benchmark scores tend to be roughly sigmoid-shaped over time).\u201d Many also emphasize inference scaling potential, with one observing that \u201cthe current top scorer is a reasoning model, and the reasoning model paradigm is relatively new; this suggests that rapid improvements are likely as the paradigm evolves,\u201d and another opining that \u201cwith very large amounts of inference compute, it&#8217;s possible that o3 or o4-mini could already get well over 30% today.\u201d Beyond advances in technical capabilities, a significant minority of high-forecast respondents point to the likelihood of sustained investment in math-related AI capabilities due to its prestige and R&amp;D value. Writes one: \u201cMath is highly relevant to many R&amp;D domains, so progress in math has been, and is highly likely to continue to be, a focus for leading AI companies.\u201d Among low-forecast respondents, a common sentiment is that \u201cthe fastest . . . progress is behind us and we are now approaching the flat\/end point portion of the S-Curve of advancement.\u201d Many also express skepticism that inference scaling will be enough to overcome fundamental architectural limitations.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><em>Autonomous vehicles<\/em>:<sup data-fn=\"150075bf-55f0-4ce6-8676-ee214749eb17\" class=\"fn\"><a href=\"#150075bf-55f0-4ce6-8676-ee214749eb17\" id=\"150075bf-55f0-4ce6-8676-ee214749eb17-link\">93<\/a><\/sup> A common consideration among experts who forecast that a high percentage of U.S. ride-hailing trips will eventually be provided by Level 4 autonomous vehicles was that Level 4 technology has a proven track record. One writes, \u201cHistorically, when a technology finally gets to be used in the wild, it improves very rapidly.\u201d Another highlights Waymo&#8217;s exponential expansion, noting that \u201cWaymo is currently more-than-doubling every year,\u201d and that this could result in a data flywheel where \u201cbroader deployment will generate more data, which in turn enhances safety\u2014creating a positive feedback loop.\u201d Conversely, low-forecast respondents often emphasize historical overpromising and argue the current technology is difficult to scale \u201cbecause it requires lots of case-by-case optimization for a particular region (down to individual intersections),\u201d and &#8220;progress in Phoenix or Miami does not generalize easily to New York, Boston, or Chicago.\u201d<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><em>White-collar jobs<\/em>:<sup data-fn=\"15dc6f35-e4d0-4ebb-a421-50422463048e\" class=\"fn\"><a href=\"#15dc6f35-e4d0-4ebb-a421-50422463048e\" id=\"15dc6f35-e4d0-4ebb-a421-50422463048e-link\">94<\/a><\/sup> Experts who forecast that the percentage of white-collar jobs will be higher in the future relative to 2025 often cite historical precedent when arguing that white-collar employment is more likely to adapt than collapse: \u201cHistorically, human labor patterns have experienced quite radical transformations over time, even in established sectors. Emerging technologies, rather than sucking people out of the labor market of white-collar work, are more likely to make them work differently and lead to new white-collar roles that can capitalize on this transition.\u201dLow-forecast respondents tend to view AI&#8217;s speed and cognitive focus as a key differentiator from prior periods of transition: \u201cWhite collar workers&#8230;work on symbolic tasks, generate language, and make decisions based on analysis of data, all tasks to which LLMs are well suited,\u201d writes one. Another points to evidence of layoffs already occurring, particularly in the tech sector: \u201cFrom Intel to Microsoft, many top executives and management staff were laid off to make room for other investments at the organization. Google laid off 10% of its managerial staff last December.\u201d Still another adds, \u201csoftware engineering is already being hugely impacted and this is only accelerating.\u201d<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><em>Speed of AI progress<\/em>:<sup data-fn=\"45326349-1cc9-4056-a9ee-e162931dae19\" class=\"fn\"><a href=\"#45326349-1cc9-4056-a9ee-e162931dae19\" id=\"45326349-1cc9-4056-a9ee-e162931dae19-link\">95<\/a><\/sup> Many experts who forecast a high likelihood that AI progress will be rapid mention the consistent pace of capability improvements, with one noting, \u201chistorically AI system development has followed a steep scaling curve and increases in model size, data and compute have led to rapid capability gains.\u201d Another points out that, \u201cMETR [Model Evaluation and Threat Research] results imply a roughly 4 to 10x improvement in time horizon every year, which means that we&#8217;ll have systems capable of doing weeks or months of work by the late 2020s.\u201d<sup data-fn=\"baac8fbb-0b1e-44ee-b779-3704da1d4dc3\" class=\"fn\"><a href=\"#baac8fbb-0b1e-44ee-b779-3704da1d4dc3\" id=\"baac8fbb-0b1e-44ee-b779-3704da1d4dc3-link\">96<\/a><\/sup> Many also note the potential for a recursive self-improvement feedback loop, in which AI improves itself, to significantly accelerate growth. Slow-progress forecasters often highlight physical constraints, pointing to the need for better training data and the cost of compute\u2014especially as it relates to energy needs: \u201cI expect energy to be the chief bottleneck to AI progress such that it will be a rate-limiter for progress in general.\u201d Another common sentiment shared by slow-progress forecasters is that \u201cfor any of the moderate and rapid progress criteria to be met there would need to be a massive paradigm shift in AI technology,\u201d and that such a shift in the underlying LLM architecture is unlikely to materialize in the near term.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><em>Technological Richter Scale<\/em>:<sup data-fn=\"25fab556-1fa8-43b8-9d4e-72a8e47447c7\" class=\"fn\"><a href=\"#25fab556-1fa8-43b8-9d4e-72a8e47447c7\" id=\"25fab556-1fa8-43b8-9d4e-72a8e47447c7-link\">97<\/a><\/sup> Intended to be analogous to the measurement of earthquake magnitudes, the technological Richter scale instead attempts to measure the impact of technologies. Experts who predict AI will have an exceptionally high impact tend to see it as uniquely positioned to fundamentally restructure society, with one going so far as to suggest it could \u201cchallenge capitalism\u201d and \u201chas the potential to replace human labor in most fields [and] might force our societies to shift to a new economic model.\u201d This transformative potential is believed by many to stem from AI&#8217;s dual nature in that \u201cit both augments human intelligence and will eventually surpass it.\u201d High-impact respondents also point to the quick pace of AI deployment and emphasize the likelihood of rapid AI progress: \u201cPeople fundamentally don&#8217;t think in exponentials. 2040 is a LONG time away, technologically. And AI will modify AI, at which point its improvement will go even more second-order.\u201d Low-impact respondents often question the sustainability of recent growth patterns, with one noting that \u201cupper levels, and in particular 10, require sustained exponential growth; this is unlikely to materialize given that natural growth (e.g., of bacteria) follow a sigmoid shape.\u201d Another points to regulations and physical bottlenecks as constraints and challenges whether AI will deliver tangible benefits sufficient to drive transformation: \u201cThe average citizen will not have much benefit to buy from AI. Improved games or art? Cheaper manufactured goods? A robot to clean your house? How does AI deliver things that humans want, like better, cheaper healthcare?\u201d<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><em>Millennium Prize<\/em>:<sup data-fn=\"303a3c69-d754-4e8c-8bba-63ac24995998\" class=\"fn\"><a href=\"#303a3c69-d754-4e8c-8bba-63ac24995998\" id=\"303a3c69-d754-4e8c-8bba-63ac24995998-link\">98<\/a><\/sup> The most common reason given by experts who think the likelihood is high that AI will solve (or substantially assist in solving) one of the notoriously difficult Millennium Prize math problems is that in January of 2025 the CEO of Google\u2019s DeepMind claimed that, in partnership with a team of mathematicians, they&#8217;re \u201cclose to solving\u201d one of the problems (later identified as Navier-Stokes) within \u201ca year or year and a half\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Ansede 2025<\/a>). Many also point to AI\u2019s gold medals in the International Mathematical Olympiad (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Luong and Lockhart 2025<\/a>) and progress on FrontierMath as evidence of rapid capability growth in mathematical reasoning. Low-forecast respondents generally don&#8217;t mention the DeepMind claim; a few do, but dismiss corporate pronouncements. One instead cites the president of the Clay Mathematics Institute (the organization responsible for awarding the Millennium Prize) who in June of 2025 claimed, \u201cWe&#8217;re very far away from AI being able to say anything serious about any of those problems\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Heaven 2025<\/a>). Regarding purported progress, one mathematician notes: \u201cThe Math Olympiad is targeted toward gifted high school students spending an afternoon on a problem solvable with known techniques\u2026[whereas] Millennium Prize problems can consume entire careers without a solution.&#8221; Multiple low-forecast respondents also note FrontierMath Tier 4 (which poses much harder problems than Tiers 1-3) has &lt;10% solve rates.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><em>Diffusion into science<\/em>:<sup data-fn=\"cabcd0dd-a9c8-4846-8934-d23aa0862b47\" class=\"fn\"><a href=\"#cabcd0dd-a9c8-4846-8934-d23aa0862b47\" id=\"cabcd0dd-a9c8-4846-8934-d23aa0862b47-link\">99<\/a><\/sup> When considering the percentage of publications in the fields of physics, materials science, and medicine that will be \u2018AI-engaged\u2019, experts predicting high engagement commonly extrapolate recent exponential growth, noting the 2018-2022 data shows engagement roughly tripling across all three fields, and emphasize this baseline predates ChatGPT: \u201cWhat we see in the baseline data is just the beginning, resulting from the application of foundational AI models but largely without the generative AI models exploding on the scene.\u201d Many cite domain-specific breakthroughs (AlphaFold in protein folding, GNoME and MatterGen in materials discovery, AI-driven imaging and diagnostics in medicine) as evidence AI can deliver transformative results that will drive rapid diffusion. Low-forecast respondents instead tend to focus on interpretability and reliability issues that could slow diffusion: \u201cAI is a black box that hallucinates,\u201d writes one. Others add that diffusion may be slow due to, \u201cnatural resistance to change from the existing body of researchers in these fields,\u201d or because, in the case of medical studies, AI isn\u2019t useful: \u201cA significant portion of papers are observational, often reporting causal effects. There isn&#8217;t much room for AI in these sorts of papers, as current statistical methods are more reliable and bias-free.\u201d<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><em>Drug discovery<\/em>:<sup data-fn=\"4b26fec0-a80a-4b36-b343-5fc01a5fe3b9\" class=\"fn\"><a href=\"#4b26fec0-a80a-4b36-b343-5fc01a5fe3b9\" id=\"4b26fec0-a80a-4b36-b343-5fc01a5fe3b9-link\">100<\/a><\/sup> Several experts who forecast that AI will accelerate drug discovery-to-market timelines point to faster design-make-test-analyze loops and potentially AI-enabled pharmacodynamic simulations that could streamline clinical trials. Others note AI-discovered drugs already demonstrate significantly higher Phase I success rates, and that extrapolating from current growth trends suggests \u201cthese will constitute a majority of new clinical trial submissions.\u201d Some also point out that discovery-to-market timelines can be shortened significantly during times of crisis via EUAs (emergency use authorizations). Low-forecast respondents commonly emphasize regulatory realities that may limit AI&#8217;s impact on approval timelines. One writes: \u201cGiven that the median time it takes to get through the FDA approval process is over 10 years, and no AI-discovered drugs appear to have started Phase III trials yet,<sup data-fn=\"9eddcef0-6559-409f-a235-4e5b1e3bb50a\" class=\"fn\"><a href=\"#9eddcef0-6559-409f-a235-4e5b1e3bb50a\" id=\"9eddcef0-6559-409f-a235-4e5b1e3bb50a-link\">101<\/a><\/sup> 2027 is likely too soon for many, if any, new AI drugs to be approved.\u201d Although some low-forecast respondents acknowledge improved Phase I results, several make the point that \u201cthe turnaround time between Phase I and approval will not speed up substantially for AI-invented drugs,\u201d because \u201cearly entrants sped through Phase I but then quickly reverted to the mean in Phase II.\u201d<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><em>Electricity<\/em>:<sup data-fn=\"fc555ffd-72de-41b2-93aa-8b26cf617ca0\" class=\"fn\"><a href=\"#fc555ffd-72de-41b2-93aa-8b26cf617ca0\" id=\"fc555ffd-72de-41b2-93aa-8b26cf617ca0-link\">102<\/a><\/sup> When assessing the percent of U.S. electricity consumption that will be used for training and deploying AI systems, high-forecast respondents often emphasize that the massive capital expenditure plans already announced by competing AI companies signal the type of unprecedented infrastructure investment that could result in \u201can explosion of energy usage.\u201d Others note that the geopolitical competition for AI supremacy may trigger an energy arms race, with one forecaster warning, \u201cChina is also investing massive amounts in datacenters,\u201d leading to the possibility that \u201cwe enter an arms race that is mostly determined by who can pump the most electricity into AI.\u201d Low-forecast respondents tend to focus more on potentially formidable constraints, particularly when considering \u201cthe material and political investments necessary to get significant growth\u2014physical data centers, chips, permitting, water for cooling, transmission lines,\u201d and suggest these constraints could push infrastructure development offshore: \u201cMajor developers will possibly respond by increasingly outsourcing the physical infrastructure of data processing to locales outside of the US\u2014there&#8217;s no particular reason why models need to be trained inside of U.S. borders where the economic and political expenses are potentially much higher.\u201d<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><em>Private investment in AI<\/em>:<sup data-fn=\"b41f19c3-e70b-4826-8e0e-f6e36d60d407\" class=\"fn\"><a href=\"#b41f19c3-e70b-4826-8e0e-f6e36d60d407\" id=\"b41f19c3-e70b-4826-8e0e-f6e36d60d407-link\">103<\/a><\/sup> Among experts who predict high future levels of global private investment in AI, most view the current level of investment as fundamentally justified and advance arguments along the lines of, \u201cAI adoption is still in its early stages across many industries,\u201d and \u201cthe strong rebound to ~$130 billion in 2024 is critical. It occurred despite higher interest rates, signaling powerful, non-speculative belief in the transformative potential of generative AI.\u201d In contrast, frequently expressed sentiments among low-forecast respondents include doubts that productivity gains \u201cwill materialize quickly enough to justify high levels of investment,\u201d and that this could lead to the bursting of an AI bubble, with one expert noting, \u201cboth Deutsche Bank and Bain &amp; Co. have just warned that the current AI boom is not sustainable\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Edwards 2025<\/a>) and another likening the current situation to, \u201cthe dot com bubble in 2000.\u201d<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><em>AI companions<\/em>:<sup data-fn=\"37ebf1a6-a5b9-4718-98ec-b418ceec79fc\" class=\"fn\"><a href=\"#37ebf1a6-a5b9-4718-98ec-b418ceec79fc\" id=\"37ebf1a6-a5b9-4718-98ec-b418ceec79fc-link\">104<\/a><\/sup> Many experts who believe that a high proportion of U.S. adults will eventually use AI for companionship cite loneliness as a key driver. One points out that \u201cthe U.S. Surgeon General declar[ed] loneliness an epidemic in 2023, with about half of U.S. adults experiencing measurable levels of loneliness\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Office of the Surgeon General 2023<\/a>). Others predict that as AI capabilities advance, AI companions will become more \u201csophisticated, emotionally intelligent, and capable of forming deeper connections with users,\u201d and that this will facilitate their integration into existing platforms and devices, driving use and normalization to the point where \u201cambient access through devices turns companionship into a series of micro-interactions throughout the day.\u201d Low-forecast respondents tend to believe humans are likely to have a strong preference for human companionship, with one arguing that \u201cmost people would find [AI] companionship unfulfilling, perhaps even viewing reliance on it as a kind of failure.\u201d Others point to lower saturation limits than the number of people who experience frequent loneliness, with one stating: \u201cAbout a quarter of U.S. adults go to therapy.<sup data-fn=\"dbf69356-d11c-4580-8af5-0f30101613d6\" class=\"fn\"><a href=\"#dbf69356-d11c-4580-8af5-0f30101613d6\" id=\"dbf69356-d11c-4580-8af5-0f30101613d6-link\">105<\/a><\/sup> If that&#8217;s the market size, then I expect AI to eventually saturate [at] that.\u201d<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"example-of-a-question-level-analysis-millennium-prize-problem\">Example\nof a Question-Level Analysis: Millennium Prize Problem<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">For each question, we conduct an analysis like the Millennium Prize example below, and present them in <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=129\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix F. Question-by-Question Results<\/a>. Each analysis summarizes the question and background information, summarizes the results, and analyzes rationales to uncover the core differences in view between low and high forecasts. In the first wave alone, experts and superforecasters wrote over 600,000 words supporting their beliefs. Analyzing these rationales alongside predictions provides significantly more context on why experts believe what they believe, and the drivers of disagreement, than the forecast alone.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Question.<\/strong> <em>Will AI solve or substantially assist\nin solving a Millennium Prize Problem in mathematics by 2027, 2030, and\n2040?<\/em><\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Background.<\/strong> The seven Millennium Prize Problems<sup data-fn=\"d5be9c36-5816-48d1-b8dc-7855b561d596\" class=\"fn\"><a href=\"#d5be9c36-5816-48d1-b8dc-7855b561d596\" id=\"d5be9c36-5816-48d1-b8dc-7855b561d596-link\">106<\/a><\/sup> were chosen by the founding Scientific Advisory Board of the Clay Mathematics Institute (CMI) of Cambridge, Massachusetts to be the most significant and difficult mathematics problems unsolved by 2000.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Historical Baseline.<\/strong> As of July 2025, only one of the seven problems has been solved (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Clay Mathematics Institute, 2025<\/a>).<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">For full question background and resolution details, see <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=110\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=110\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix E.II. 1. Millenium Prize<\/a>.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-10.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure 10:<\/strong> In this question, participants make 50<sup>th<\/sup> percentile forecasts for various resolution dates. This figure shows the 10<sup>th<\/sup> , 25<sup>th<\/sup> , 50<sup>th<\/sup> , 75<sup>th<\/sup> , and 90<sup>th<\/sup> percentiles of these 50<sup>th<\/sup> percentile forecasts, split by participant group. The 25<sup>th<\/sup> expert percentile for Dec 2027 represents the number that 25% of experts\u2019 median forecasts are lower than.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Results<\/strong>. Experts estimate a 10% chance that AI will solve or substantially assist in solving a Millennium Prize Problem by 2027,<sup data-fn=\"3f59b267-230c-4c22-8b5b-2095c003c706\" class=\"fn\"><a href=\"#3f59b267-230c-4c22-8b5b-2095c003c706\" id=\"3f59b267-230c-4c22-8b5b-2095c003c706-link\">107<\/a><\/sup> up to 20% by 2030,<sup data-fn=\"0b5a5127-7d50-40c5-9ec3-68f02e8ccf83\" class=\"fn\"><a href=\"#0b5a5127-7d50-40c5-9ec3-68f02e8ccf83\" id=\"0b5a5127-7d50-40c5-9ec3-68f02e8ccf83-link\">108<\/a><\/sup> and <strong>60% by 2040<\/strong>.<sup data-fn=\"cefe5981-7c1a-4add-abf6-6847f9d23bd5\" class=\"fn\"><a href=\"#cefe5981-7c1a-4add-abf6-6847f9d23bd5\" id=\"cefe5981-7c1a-4add-abf6-6847f9d23bd5-link\">109<\/a><\/sup> All categories of experts, superforecasters, and the public largely predict similarly across timescales. However, there is wide disagreement <em>between<\/em> experts: the top quartile of experts think there\u2019s at least an even (50%) chance of AI assistance <strong>by 2030<\/strong>, whereas the bottom quartile of experts think there\u2019s only a 10% chance. The disagreement by 2040 is even larger: the interquartile range for expert medians is 30%\u201381%, while the top decile of experts think there\u2019s a 95% chance and the bottom decile of experts think there\u2019s only a 15% chance.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">For full results tables, see <a href=\"https:\/\/leap.forecastingresearch.org\/reports\/wave2#millennium-prize\"><u>here<\/u><\/a>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The rationales experts wrote to explain their forecasts lend\nconsiderable insight into their core areas of disagreement, in\nparticular:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>DeepMind\/Navier-Stokes<\/strong>: High-forecast respondents frequently cite DeepMind CEO&#8217;s January 2025 statement that, in partnership with a team of mathematicians, they&#8217;re \u201cclose to solving\u201d one of the problems (later identified as Navier-Stokes) within \u201ca year or year and a half\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Ansede 2025<\/a>). This is treated as strong concrete evidence. Low-forecast respondents generally don&#8217;t mention this or dismiss corporate pronouncements. One expert cites the Clay Institute president&#8217;s June 2025 claim: \u201cWe&#8217;re very far away from AI being able to say anything serious about any of those problems\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Heaven 2025<\/a>).<\/li>\n\n\n\n<li><strong>Benchmarks<\/strong>: Many high-forecast respondents point to International Mathematical Olympiad gold medals and FrontierMath progress as evidence of rapid capability growth in mathematical reasoning that will likely continue (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Luong and Lockhart 2025<\/a>). Low-forecast respondents tend to argue these are fundamentally different challenges. One mathematician notes: \u201cThe Math Olympiad is targeted toward gifted high school students spending an afternoon on a problem solvable with known techniques\u2026[whereas] Millennium Prize problems can consume entire careers without a solution.\u201d Multiple forecasters note FrontierMath Tier 4 (which poses much harder problems than Tiers 1-3) has &lt;10% solve rates.<\/li>\n\n\n\n<li><strong>The nature of Millennium problems<\/strong>: High-forecast\nrespondents commonly emphasize that math is verifiable, has clear\nstructure, and that some problems (Navier-Stokes, Birch\u2013Swinnerton-Dyer)\nmay be suited to AI-assisted numerical exploration or pattern\nrecognition. Low-forecast respondents often express doubts that\nMillennium Problems are solvable with the current AI paradigm,\nemphasizing doing so requires \u201cdeep conceptual breakthroughs,\u201d\n\u201cdeveloping new concepts and mathematical rules,\u201d and \u201ctruly out of the\nbox thinking.\u201d One domain expert writes: \u201cThe current generation of AI\ndoes not seem to be able to do this sort of creative mathematical work\nat all. It can apply known techniques and get novel results, but these\nresults would be very easy for top working mathematicians.\u201d<\/li>\n\n\n\n<li><strong>Base rates and timelines<\/strong>: High-forecast\nrespondents mostly don&#8217;t engage with base rates, or they argue that AI\nchanges the game fundamentally. By contrast, many low-forecast\nrespondents emphasize that only one out of seven problems have been\nsolved in the 25 years since the prize was announced, meaning some have\nremained unsolved for more than a century. They also highlight\nMillennium Prize rules: upon the publication of a solution, a minimum of\ntwo years must pass before a prize can be awarded, to allow time for\nadequate verification. (In the case of the one prize that was awarded,\nthe gap between the publication of the solution and the awarding of the\nprize was over seven years.) This, many low-forecast respondents point\nout, renders the 2027\/2030 dates almost impossible regardless of\ntechnical progress.<\/li>\n\n\n\n<li><strong>\u201cSubstantially assist\u201d interpretation<\/strong>:\nHigh-forecast respondents tend toward a broad interpretation\u2014any\nmeaningful acceleration of human-AI hybrid research counts, whereas\nlow-forecast respondents tend toward restrictive interpretation. One\nnotes the resolution criteria require contribution \u201clikely not\nproducible without AI,\u201d which is a higher bar.<\/li>\n\n\n\n<li><strong>Architecture sufficiency<\/strong>: Most high-forecast\nrespondents believe incremental improvements over current LLM\ncapabilities will be sufficient, especially when paired with specialized\ntools (Lean, AlphaProof) and human collaboration. Low-forecast\nrespondents frequently argue the current LLM paradigm fundamentally\ncannot do this. Multiple forecasters say we need \u201centirely new\narchitectures\u201d (neurosymbolic systems were mentioned several times) or\nthat a \u201cpattern matching paradigm doesn&#8217;t extend to the deep creativity\nrequired.\u201d<\/li>\n\n\n\n<li><strong>Difficulty of achieving <em>superhuman<\/em>\nperformance<\/strong>: Although rarely discussed by high-forecast\nrespondents, a few low-forecast respondents expressed doubts that this\ncould be achieved, with one writing, \u201cTraining a model to do math at the\nlevel of human experts might be a qualitatively different ML problem\nfrom training a model to do math<em>surpassing<\/em> expert\ncapabilities. RL training requires creating problems with reward\nfunctions&#8230;We haven&#8217;t achieved that with reasoning post-training\nyet.\u201d<\/li>\n<\/ul>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"high-forecast-rationale-examples\">High-forecast rationale\nexamples:<\/h4>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\"><em>\u201cI guess the elephant in the room is that DeepMind says they are close: The so-called Navier-Stokes Operation, underway for three years with a team of 20 people, has so far been carried out with complete discretion, although the chief of Google DeepMind, Demis Hassabis let slip in a January interview that they are \u2018close to solving a Millennium Prize Problem\u2019 without mentioning which one. \u2018We\u2019ll see that in the next year or year and a half.\u201d<sup data-fn=\"c72e2948-fd92-4df8-9136-0c41fff551be\" class=\"fn\"><a href=\"#c72e2948-fd92-4df8-9136-0c41fff551be\" id=\"c72e2948-fd92-4df8-9136-0c41fff551be-link\">110<\/a><\/sup><\/em><\/p>\n<\/blockquote>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\"><em>\u201cSome of the problems, like the Riemann Hypothesis or the Birch and Swinnerton-Dyer Conjecture, are especially well-suited to AI-supported exploration. They bear a kind of family resemblance to the Four-Color Theorem in their relationship to computer-assisted mathematics. The Four-Color Theorem was famously solved through a hybrid of human conceptual framing and extensive computer verification. As Donald MacKenzie details in his socio-history of that episode [reference below<sup data-fn=\"cf5f7d6c-56ee-4976-a10c-004354151b62\" class=\"fn\"><a href=\"#cf5f7d6c-56ee-4976-a10c-004354151b62\" id=\"cf5f7d6c-56ee-4976-a10c-004354151b62-link\">111<\/a><\/sup>], much of the intellectual labor wasn\u2019t in the computation itself but in formalizing the problem in a way that machines could meaningfully engage with it and in managing the institutional consequences of proof-by-machine.\u201d<\/em><\/p>\n<\/blockquote>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\"><em>\u201cMy optimism that AI could achieve high-level original mathematics is revised upward significantly since the Bubeck announcement about GPT-5 a few weeks ago regarding the first confirmed example of novel mathematical reasoning generated by a LLM.\u201d<sup data-fn=\"52fb1676-507a-472e-b319-e457143e4133\" class=\"fn\"><a href=\"#52fb1676-507a-472e-b319-e457143e4133\" id=\"52fb1676-507a-472e-b319-e457143e4133-link\">112<\/a><\/sup><\/em><\/p>\n<\/blockquote>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"low-forecast-rationale-examples\">Low-forecast rationale\nexamples:<\/h4>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\"><em>\u201cI have domain expertise here as a mathematician. The current generation of AI does not seem to be able to do this sort of creative mathematical work at all. It can apply known techniques, and get novel results, but these results would be very easy for top working mathematicians. The kind of pattern matching paradigm we have seen so far apparently doesn\u2019t extend at all to deep creativity required.\u201d<\/em><\/p>\n<\/blockquote>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\"><em>\u201cIf the millennium problems all require new insights absent from the training data, then current LLM technology is simply not up to the task: we will need instead new AI paradigms that are better at creating non-combinatorial insights (i.e., insights that do not originate from the recombination of patterns already learned by the AI). This will take time: it is not only the time to develop these new AI techniques, but also the time for the humans now riding the wave of machine learning and LLMs to accept that it might be worth their time to look into alternative approaches (more so after extensive efforts to trivialize these approaches as a loss of time). It is the second factor which I think will be the true time bottleneck and could push the resolution of this question further in time.\u201d<\/em><\/p>\n<\/blockquote>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\"><em>\u201cGiven the progress on Tier 3 FrontierMath problems, a Millennium Problem seems well away, notwithstanding bullish predictions from corporate spokespeople with vested interests.\u201d<\/em><\/p>\n<\/blockquote>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\"><em>\u201cI would put 60% as some hard limit on whether any of the conjectures can be solved at all.\u201d<\/em><\/p>\n<\/blockquote>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\"><em>\u201cI don&#8217;t think there are many economic incentives to develop those kinds of systems. Millennium problems are very, very hard &#8211; much harder than most directly economically useful tasks. They require developing new mathematical theories and techniques to even approach them. As far as I know, current top AI models lack this ability, and I don&#8217;t see an easy way for them to obtain such an ability (nor are there many economic incentives for building such abilities into them).\u201d<\/em><\/p>\n<\/blockquote>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"sensitivity-of-results-to-reweighting\">Sensitivity of Results to\nReweighting<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">As described in the <a href=\"#reweighting\">Reweighting<\/a> section, we use a standard approach in the public polling field, raking, to weight aggregate statistics to be representative of the sampling frames. We perform a sensitivity analysis to understand the impact of weighting on aggregate forecasts. We compare the median aggregated forecast to the weighted median aggregated forecast, where positive values of differences indicate that weighting participant responses increases the value of a forecast.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We begin by expressing the difference between the weighted median and\nunweighted median at the question level as a proportion of the forecast\ndispersion. This difference is divided by the standard deviation of the\nunweighted forecasts, within each question-participant group, to\nstandardize the impact across questions with different units. To examine\nthe full distribution of reweighting effects, we calculate this\nstandardized difference for multiple aggregate statistics\u2014including the\n25<sup>th<\/sup> percentile, median, and 75<sup>th<\/sup> percentile of\nthe standardized differences themselves. For example, a 25<sup>th<\/sup>\npercentile value of -0.07 indicates that across all questions, 25% of\nthe standardized reweighting effects fall below -0.07. Table 7 below\nshows a summary of these differences by survey for experts and the\npublic.<\/p>\n\n\n\n<figure class=\"wp-block-table\"><div class=\"table-wrapper\"><table class=\"has-fixed-layout\"><tbody><tr><td><strong>Survey<\/strong><\/td><td><strong>Participant<\/strong><\/td><td><strong>Min<\/strong><\/td><td><strong>p25<\/strong><\/td><td><strong>Median<\/strong><\/td><td><strong>Mean<\/strong><\/td><td><strong>p75<\/strong><\/td><td><strong>Max<\/strong><\/td><\/tr><tr><td>Wave 1: Headliners<\/td><td>Expert<\/td><td>-0.24<\/td><td>-0.07<\/td><td>0<\/td><td>-0.03<\/td><td>0.01<\/td><td>0.2<\/td><\/tr><tr><td>Wave 1: Headliners<\/td><td>Public<\/td><td>-0.2<\/td><td>0<\/td><td>0<\/td><td>0<\/td><td>0<\/td><td>0.12<\/td><\/tr><tr><td>Wave 2: AI for science<\/td><td>Expert<\/td><td>-0.38<\/td><td>0<\/td><td>0<\/td><td>-0.02<\/td><td>0<\/td><td>0.19<\/td><\/tr><tr><td>Wave 2: AI for science<\/td><td>Public<\/td><td>-0.18<\/td><td>0<\/td><td>0.01<\/td><td>0.03<\/td><td>0.06<\/td><td>0.15<\/td><\/tr><tr><td>Wave 3: Broad Adoption of AI<\/td><td>Expert<\/td><td>-0.23<\/td><td>-0.09<\/td><td>0<\/td><td>-0.03<\/td><td>0<\/td><td>0.28<\/td><\/tr><tr><td>Wave 3: Broad Adoption of AI<\/td><td>Public<\/td><td>-0.21<\/td><td>0<\/td><td>0<\/td><td>-0.01<\/td><td>0<\/td><td>0.08<\/td><\/tr><\/tbody><\/table><\/div><figcaption class=\"wp-element-caption\"><strong>Table 7:<\/strong> Summary standardized reweighting effects. The difference between weighted and unweighted medians, expressed in standard deviation units, is summarized across surveys and participant groups.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">These values show how much reweighting shifts the forecast relative\nto the dispersion of forecasts. For example, a value of 0.1 means the\nweighted forecast is 0.1 standard deviations higher than the unweighted\nforecast. This table shows that the median effect on both participant\ntypes and across all surveys is no change in the aggregate result.\nReweighting has a marginally larger effect on expert participants than\nmembers of the public. This histogram below shows the effect of\nreweighting on all the forecasts in more detail.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-11.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure 11:<\/strong> Histogram showing standardized difference between weighted and unweighted aggregate median.<\/figcaption><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"public-accuracy-stratification\">Public Accuracy\nStratification<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Given that we select for experience in AI in our expert sample but not in our public sample, public forecasts could be distorted by either a lack of comprehension or effort. To investigate this concern, we partition our sample by forecasting accuracy on out-of-sample, prior questions. For the 832 participants we can match to a prior forecasting record,<sup data-fn=\"06475ed0-c168-4784-bda4-e7e8682711a3\" class=\"fn\"><a href=\"#06475ed0-c168-4784-bda4-e7e8682711a3\" id=\"06475ed0-c168-4784-bda4-e7e8682711a3-link\">113<\/a><\/sup> we calculate accuracy scores based on performance on 24 forecasting questions asking about near-term (&lt;6 months) events in an earlier research project.<sup data-fn=\"d34e17c1-44b3-48aa-b7da-41c3f2034432\" class=\"fn\"><a href=\"#d34e17c1-44b3-48aa-b7da-41c3f2034432\" id=\"d34e17c1-44b3-48aa-b7da-41c3f2034432-link\">114<\/a><\/sup> We score these forecasts using an S-score, with a lower score indicating better performance.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Public participants are split into two accuracy groups:\n\u201cHigh-accuracy,\u201d representing the most accurate 50% of public\nparticipants, and \u201clow-accuracy\u201d representing the least accurate 50% of\npublic participants. We do not adjust weights after partitioning the\npublic sample.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Of 68 total forecasts with a clear valence of AI capabilities, the\nresults are mixed. The high-accuracy group holds views about AI\nprogress, capabilities, and diffusion that are statistically\nindistinguishable from the low-accuracy group in half of all cases. They\npredict less progress in 28% of cases and more in the remaining 22% of\nforecasts. We summarize some of the differences in aggregate forecasts\nin Figure 12.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" src=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/paper_2025-11-10_longitudinal-expert-ai-panel-ai-capabilities-adoption-impact_fig-12.png\" alt=\"\"\/><figcaption class=\"wp-element-caption\"><strong>Figure 12:<\/strong> Differences between the high- and low-accuracy public median 50<sup>th<\/sup> percentile forecasts for several questions where the unit is a percentage. Points indicate the median of each group\u2019s 50<sup>th<\/sup> percentile forecasts. We apply transformations to create valenced forecasts, where values closer to the left indicate slower progress and values to the right indicate faster progress.<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">We present some additional results in <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=93\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix D. Public Accuracy Stratification<\/a>. We plan to explore the drivers of these disagreements in future work.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"next-steps\">Next Steps<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">This paper reports results from the first three monthly waves of LEAP\nand describes the project methodology in detail. We will elicit\nforecasts roughly each month for the next three years on timely topics\nrelated to the development, capabilities, adoption, and impact of AI.\nBoth the set of forecasting questions and the space of possible analyses\nwill expand over time: additional questions grow the number of\ndatapoints we can test and learn from, and progressive question\nresolution enables a continued sharpening of accuracy measurement. We\nplan to release reports on each new wave of LEAP soon after we complete\ndata collection. Additionally, we will periodically release more\ndetailed reports and papers conducting more extensive and cross-wave\nanalysis. We will discuss some of those analyses below.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Future waves of LEAP will be focused on forecasts related to topics including security and geopolitics, robotics, labor and automation, incidents and harms, and AI safety. For example, we tentatively plan to ask forecasters to predict how much AI will improve the productivity of software engineers, how AI will affect the productivity of workers, and how the use of AI may cause harm. We welcome reader suggestions for LEAP questions or wave themes via outreach to our project team.<sup data-fn=\"f4eaac42-3c24-41ec-860f-14305bca01b1\" class=\"fn\"><a href=\"#f4eaac42-3c24-41ec-860f-14305bca01b1\" id=\"f4eaac42-3c24-41ec-860f-14305bca01b1-link\">115<\/a><\/sup> We now describe planned follow-up work for LEAP.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"accuracy-assessment\">Accuracy Assessment<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">As questions resolve, we will be assessing the accuracy of forecasts to identify particularly accurate individual forecasters within the expert, superforecaster, and public groups, and to assess the relative accuracy of the different expert subgroups within our sample. We will also present forecasters with information on their own past performance, assessing how this feedback translates into accuracy on new forecasting questions. Ku\u010dinskas et al. (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2025<\/a>) perform a similar accuracy analysis, retrospectively analyzing a forecasting study focused on multi-year forecasts about AI-, nuclear-, climate-, and biotechnology-related progress and risks, finding no correlation between short-term accuracy and long-term beliefs in their study context, first introduced in Karger et al. (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2023<\/a>). LEAP is, in many ways, a follow-up to that project, improving on several choices the authors made: LEAP requires that forecasters answer all (or almost all) questions, elicits one-time forecasts, and does not provide a team structure or room for deliberation, since debate in that survey did not generally prove to resolve disagreements (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Karger et al. 2025<\/a>). We will evaluate these findings with significantly more precision in LEAP as questions resolve in 2027 and 2030.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"forecast-updating\">Forecast Updating<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">We plan to re-survey the LEAP sample on many of our questions to\ntrack how respondents\u2019 views evolve over time. For example, one year\nfrom now, how will respondents update their forecasts of whether there\nwill be an AI-reliant solution to a Millennium Prize Problem? How will\nprogress (or a lack of progress) for AI systems on key benchmarks change\nrespondents\u2019 views of longer-run effects of AI?<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"schools-of-thought-analysis-and-crux-finding\">Schools of Thought\nAnalysis and Crux-Finding<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">A \u201cschool of thought\u201d is a cluster of similar responses to a set of forecasting questions. In future work, we will use standard clustering techniques to search for similar groups of forecasters across questions, and we will complement this work with analysis of qualitative information (rationales) for these various schools of thought. Are we able to distinguish consistent differences in sets of forecasts among subsets of our sample\u2014for example, fast versus slow AI progress groups? The search for schools of thought in our forecasting data takes a related but opposite approach to our prior work using adversarial collaborations to identify cruxes for differences of opinion about the likelihood of harms from AI, in which we search for individuals from distinct schools-of-thought and then ask them to forecast on questions about AI on which they disagreed (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Rosenberg et al. 2024<\/a>). Both approaches can help us map beliefs about AI.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Relatedly, are we able to identify \u201ccruxes\u201d\u2014i.e., strongly differential forecasts between schools of thought on near-term questions that enable faster assessment of which school is more likely to be accurate in the long term? This crux-finding effort builds on our earlier work (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">McCaslin et al. 2024<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Rosenberg et al. 2024<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Rosenberg et al. 2025<\/a>), which finds that disagreement on the likelihood of extreme outcomes from AI is not easily resolved by debate, but we can identify nearer-term cruxes that could increase consensus.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"elicitation-experiments\">Elicitation Experiments<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">We have already tested the impact of providing various defaults in the interactive forecasting interfaces (see <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=89\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix B.V. Survey instrument<\/a>) and the ordering of options within questions; we plan to present these results in a future report. Additionally, we plan to experiment with question wording, question ordering, and more.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"expanded-use-of-rationale-data\">Expanded Use of Rationale\nData<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">We are exploring scalable and privacy-preserving methods for directly displaying a subset of rationales for high and low forecasts on particular questions, as part of our forecast explorer.<sup data-fn=\"bccb8797-974a-42b0-801a-cbd08c5a401a\" class=\"fn\"><a href=\"#bccb8797-974a-42b0-801a-cbd08c5a401a\" id=\"bccb8797-974a-42b0-801a-cbd08c5a401a-link\">116<\/a><\/sup> We will provide detailed rationale analyses (similar to the examples above) for all LEAP questions, available in our monthly reports. We are also awarding prizes to respondents for rationale quality, and we will highlight publicly some particularly high-quality rationales.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"public-engagement\">Public Engagement<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">We may experiment with broader public engagement on AI forecasting.\nFor example, we may enable anyone to make their own forecasts on LEAP\nquestions, and provide them with a report on where their forecasts fit\namong different schools of thought.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"conclusion\">Conclusion<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Policymakers, nonprofit and business leaders, and other stakeholders\nroutinely consult experts to base their decisions on the perspective of\nexperts, especially when faced with new technologies and high levels of\nuncertainty. While public discussion of AI and expert anecdotes are\nwidespread, structured quantitative evidence on expert beliefs is\nlacking, impeding effective decision making. With the launch of LEAP, we\nfill an important gap by both measuring the full range of expert\nopinions on AI capability developments and their impact, and by\ncapturing the underlying reasoning and evidence that supports these\nbeliefs.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We have completed three survey waves focused on (1) high-level\npredictions about AI progress; (2) the application of AI to scientific\ndiscovery; and (3) widespread adoption and social impact. The first\nthree rounds of LEAP reveal five key findings.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">First, collectively experts expect sizable societal effects from AI\nby 2040, even if effects materialize more slowly than expected.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Second, and in contrast with the first takeaway, considerable disagreement across experts, and uncertainty within individual experts, underlies these predictions of progress. This dynamic likely arises from multiple sources: the inherent difficulty of forecasting emerging technologies, sharp disagreements between competing schools of thought, and the fundamental uncertainty surrounding AI development and its impact. Forecasting emerging technologies is inherently difficult. For example, historical predictions about fusion power have consistently proven overoptimistic (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Takeda et al. 2023<\/a>). In our own prior work, both domain experts and superforecasters substantially underestimated AI progress (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Ku\u010dinskas et al. 2025<\/a>). Nevertheless, aggregate forecasts remain informative and offer the potential to cut through the noise of disagreement, as wisdom-of-the-crowd effects have proven robust across domains. As the project progresses and forecasting questions resolve, LEAP will evaluate the performance of aggregate and individual forecasts.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Third, expert predictions diverge substantially from the timelines\narticulated by frontier lab executives, with our median expert\nanticipating considerably slower progress. LEAP provides an expert view\nfree from the potential distorting effects of financial incentives that\nmay influence public statements from industry leaders. While the\nhistorical record will be the ultimate yardstick for these predictions,\nLEAP helps us understand what a broader swath of experts expect from\nAI.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Fourth, experts generally forecast faster AI progress than the public\nacross most outcomes, and LEAP will continue to track the evolution of\nexpert and public opinion, especially as the technology begins to be\nmore front-of-mind for the public.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Lastly, we observe consistency in predictions between\nsuperforecasters and experts; in the instances where their views\ndiverge, experts tend to predict somewhat faster AI progress.\nImportantly, we also find substantial consistency across our four\ncategories of experts: computer scientists, economists, industry\nprofessionals, and policy professionals. The forthcoming resolution of\nnear-term predictions will reveal whether specialized domain knowledge\nor general forecasting skill proves more valuable for predicting AI\ntrajectories\u2014a question with significant implications for weighing\ndifferent sources of expertise in technology policy decisions.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Although we designed LEAP to overcome the major challenges that\nconfront AI forecasting efforts, there are still some clear limitations\nof this work.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">First, it is difficult to generate a sample frame that is\nrepresentative of any key group of experts, and nonresponse bias is\ndifficult to avoid, potentially biasing results. We construct\ncomprehensive sampling frames of experts to minimize coverage bias;\nhowever, it is possible that there are experts who hold views different\nfrom those found in our frames. We reweight our data based on frame\ndemographics to reduce nonresponse bias, but our set of target variables\nmight not capture all the variation in opinion. These sources of bias\nmay affect the representativeness of our results.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In addition, while we have taken great care in constructing clear, specific, and resolvable questions, some questions contain inherent ambiguity, and for others, the discontinuation or change of a data source may preclude resolution. It is also possible that, going forward, attrition will affect our ability to measure how views on AI progress and diffusion change over time; indeed, leading research organizations often experience high attrition and low response rates (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">NORC AmeriSpeak 2024<\/a>). We will monitor attrition from LEAP to ensure sufficient sample sizes in future waves as well as to understand if attrition may be biasing our results.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Finally, survey participants typically have limited time for surveys.\nLEAP addresses this through three strategies: providing historical\ncontext and background for each question, offering interactive\ninterfaces with baseline data to streamline forecasting, and providing\nsignificant compensation to participants for their time. These measures\ncontribute to considerable effort by participants\u2014the median expert took\n44 minutes per survey, the median member of the public 29 minutes, and\nthe median superforecaster 90 minutes. But, sharing background and\nbaseline information among all participants reduces their independence\nand may dilute the wisdom-of-the-crowd effect, creating correlated\nforecasts or echo chambers. Additionally, it remains possible that some\nparticipants will not put effort into reporting their true beliefs on\neach question, speeding through the survey because of time constraints\nor disinterest. Unlike previous studies that used fixed payments\nregardless of accuracy, LEAP employs proper scoring rules that link\ncompensation to forecast quality based on clear resolution criteria, in\nan effort to reduce this risk.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">LEAP will continue to explore important questions regarding the\nfuture of AI. Public, high-profile proclamations about the technology\nare not necessarily representative of expert opinion, and we will search\nfor agreement and disagreement among experts, the general public, and\nprofessional forecasters. As LEAP forecasting questions begin to resolve\nas early as the end of 2025, we will assess how short-run accuracy on\nAI-related questions correlates with long-run AI-related beliefs as we\ntry to bring clarity to the many current high-stakes debates about\nAI.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Notes<\/h2>\n\n\n<ol class=\"wp-block-footnotes\"><li id=\"349340c1-373d-4eb1-85d3-28f508f33d90\">Forecasters are denoted \u201csuperforecasters\u201d if they (1) were in the top 2% of the accuracy distribution in a given year of the Intelligence Advanced Research Projects Activity (IARPA) Aggregative Contingent Estimation (ACE) tournament (IARPA ACE Program n.d.; Mellers et al. 2014) or (2) they were a highly accurate performer on Good Judgment Open, an online continuous geopolitical forecasting tournament. Good Judgment Inc., which runs Good Judgment Open, then adds these top forecasters to the \u201csuperforecaster\u201d pool. Most superforecasters come from the first selection criteria. Mellers et al. (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2015<\/a>) finds persistent performance of these superforecasters across several years of geopolitical forecasting. <a href=\"#349340c1-373d-4eb1-85d3-28f508f33d90-link\" aria-label=\"Jump to footnote reference 1\">\u21a9\ufe0e<\/a><\/li><li id=\"d15cee71-1108-43b3-b329-2dec7e6e33f9\">Expert rationales averaged 92 words, with 25% of expert rationales exceeding 100 words and 8% exceeding 200 words. Among superforecasters, 47% of rationales exceeded 100 words and 23% exceeded 200 words. <a href=\"#d15cee71-1108-43b3-b329-2dec7e6e33f9-link\" aria-label=\"Jump to footnote reference 2\">\u21a9\ufe0e<\/a><\/li><li id=\"d99b5bf5-3708-4392-8e7b-df7289f0c6aa\">If not otherwise stated, we report values from the 50<sup>th<\/sup> percentile forecasts given by each expert. We elaborate on the use of quantile forecasting in <a href=\"#monthly-surveys-and-forecasting-questions\">Monthly Surveys and Forecasting Questions<\/a>. <a href=\"#d99b5bf5-3708-4392-8e7b-df7289f0c6aa-link\" aria-label=\"Jump to footnote reference 3\">\u21a9\ufe0e<\/a><\/li><li id=\"66522d7b-d668-4484-884c-38aa6b6f8528\">Respondents were shown a historical baseline value of 2%, based on an earlier version of the cited paper. A new version of the working paper estimates a range of 1.6% to 6.6%. We select the midpoint, 4.1%, as the historical baseline value. <a href=\"#66522d7b-d668-4484-884c-38aa6b6f8528-link\" aria-label=\"Jump to footnote reference 4\">\u21a9\ufe0e<\/a><\/li><li id=\"92219e5a-e381-4fb7-a009-03912e8393c4\">The median expert expects electricity consumption used for AI to rise to 12% by 2040. <a href=\"#92219e5a-e381-4fb7-a009-03912e8393c4-link\" aria-label=\"Jump to footnote reference 5\">\u21a9\ufe0e<\/a><\/li><li id=\"fa6b7dcf-4e3b-48d1-9e52-b21ae96cd313\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=116\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=96\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix E.II., 4. Electricity Consumption<\/a> for information on the baseline estimate of 1.0%. <a href=\"#fa6b7dcf-4e3b-48d1-9e52-b21ae96cd313-link\" aria-label=\"Jump to footnote reference 6\">\u21a9\ufe0e<\/a><\/li><li id=\"a7e95b27-899d-4afa-bdac-b90b259a26e9\">Regarding their private AI investment indicator, Our World in Data (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2025<\/a>) notes: 1. \u201cThe data likely underestimates total global AI investment, as it only captures certain types of private equity transactions, excluding other significant channels and categories of AI-related spending;\u201d 2. \u201cThe source does not fully disclose its methodology and what&#8217;s included or excluded. This means it may not fully capture important areas of AI investment, such as those from publicly traded companies, corporate internal R&amp;D, government funding, public sector initiatives, data center infrastructure, hardware production, semiconductor manufacturing, and expenses for research and talent.\u201d More details on what is likely excluded can be found at Our World in Data (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2025<\/a>). <a href=\"#a7e95b27-899d-4afa-bdac-b90b259a26e9-link\" aria-label=\"Jump to footnote reference 7\">\u21a9\ufe0e<\/a><\/li><li id=\"4657eacc-2029-4fa9-8213-fa8ffe075025\">The FrontierMath benchmark consists of math problems that resemble those a math PhD student might spend several days solving. <a href=\"#4657eacc-2029-4fa9-8213-fa8ffe075025-link\" aria-label=\"Jump to footnote reference 8\">\u21a9\ufe0e<\/a><\/li><li id=\"ee90ef6f-442e-457d-b06d-42db793b5c9e\">This estimate of 23% reflects the fraction of experts whose median forecast is that AI systems will achieve performance of at least 90% (which we call saturation) on Tiers 1\u20133 of FrontierMath. We take the average of the proportions calculated under weak and strict inequality. <a href=\"#ee90ef6f-442e-457d-b06d-42db793b5c9e-link\" aria-label=\"Jump to footnote reference 9\">\u21a9\ufe0e<\/a><\/li><li id=\"8dae5ba0-5fad-4f9b-90c4-526705f192b7\">The Millennium Prize Problems are seven mathematical problems identified by the Clay Mathematics Institute in 2000 as the most important unsolved questions in mathematics (C<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">lay Mathematics Institute n.d.<\/a>). Only one has been solved to date. <a href=\"#8dae5ba0-5fad-4f9b-90c4-526705f192b7-link\" aria-label=\"Jump to footnote reference 10\">\u21a9\ufe0e<\/a><\/li><li id=\"b6814db1-908d-4c41-a402-60e57447bb67\">Respondents were shown a historical baseline value of 2%, based on an earlier version of the cited paper. A new version of the working paper estimates a range of 1.6% to 6.6%. We select the midpoint, 4.1%, as the historical baseline value. <a href=\"#b6814db1-908d-4c41-a402-60e57447bb67-link\" aria-label=\"Jump to footnote reference 11\">\u21a9\ufe0e<\/a><\/li><li id=\"63d51702-88b7-44e8-946b-9c7393b547e7\">A forecaster expects the realized outcome to be below their 25<sup>th<\/sup> percentile forecast in just 25% of cases, compared to 50% of cases for a median forecast. We additionally ask for a 75<sup>th<\/sup> percentile forecast whenever we collect a 25<sup>th<\/sup> percentile forecast. <a href=\"#63d51702-88b7-44e8-946b-9c7393b547e7-link\" aria-label=\"Jump to footnote reference 12\">\u21a9\ufe0e<\/a><\/li><li id=\"dbfb006f-8326-4aa7-9c86-c321c5a2b32e\">Forecasters are asked what percentage of sales revenue from recently FDA-approved drugs will come from those discovered using AI methods available after 2022. See <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=114\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix E.II. 3. Drug Discovery<\/a>. <a href=\"#dbfb006f-8326-4aa7-9c86-c321c5a2b32e-link\" aria-label=\"Jump to footnote reference 13\">\u21a9\ufe0e<\/a><\/li><li id=\"83eadd21-22ab-44de-83c1-39f0f652accd\">Note that for two categorical questions about overall AI scenarios, we report averages instead of medians. Since respondents assign probabilities that sum to 100%, we use average aggregation to maintain this property. <a href=\"#83eadd21-22ab-44de-83c1-39f0f652accd-link\" aria-label=\"Jump to footnote reference 14\">\u21a9\ufe0e<\/a><\/li><li id=\"e264401d-38b7-41d7-8e5a-6e77ec5c63f2\">To assess the extent to which low-effort or relatively lower comprehension from the public could drive these results, we compare members of our public with high levels of forecasting accuracy in other studies to those with low accuracy. We do not find that one group systematically expects more or less progress. <a href=\"#public-accuracy-stratification\">Public Accuracy Stratification<\/a> details this analysis. <a href=\"#e264401d-38b7-41d7-8e5a-6e77ec5c63f2-link\" aria-label=\"Jump to footnote reference 15\">\u21a9\ufe0e<\/a><\/li><li id=\"9fa4990e-410c-4d97-a3de-a3b1e47a6f5d\">While surveys show that fewer than half of Americans report using AI products (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">NORC 2025<\/a>), nearly all (99%) actually use AI-enabled tools like navigation apps, streaming services, and social media weekly (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Maese 2025<\/a>); this gap reveals that AI has become ubiquitous in some applications, but is sometimes invisible to users. <a href=\"#9fa4990e-410c-4d97-a3de-a3b1e47a6f5d-link\" aria-label=\"Jump to footnote reference 16\">\u21a9\ufe0e<\/a><\/li><li id=\"57335b56-eb62-469f-a244-c2dbbcd32a50\">AI is already accelerating scientific discoveries across a wide range of disciplines such as medicine and materials science (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Dai et al. 2025<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Kay 2025<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Russell et al. 2023<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Stanford Medicine News Center 2025<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Sundermier 2024<\/a>). While some of these claims may be overblown or overstated to encourage media attention, AI is certainly affecting many aspects of science and research. <a href=\"#57335b56-eb62-469f-a244-c2dbbcd32a50-link\" aria-label=\"Jump to footnote reference 17\">\u21a9\ufe0e<\/a><\/li><li id=\"1f53487e-20fd-4bed-9901-426a33ef65ec\">Despite public figures predicting extensive job destruction\u2014for example Jamie Dimon speaking at Fortune\u2019s Most Powerful Women summit: \u201c[AI] will eliminate jobs. People should stop sticking their head in the sand\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Gerut 2025<\/a>)\u2014overall employment effects of AI remain small (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Chandar 2025<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Gimbel et al. 2025<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Eckhardt and Goldschlag 2025<\/a>) , the impact of rapid technological change on jobs in the short run is ambiguous, and in the longer run is neutral under standard economic models (e.g., <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Aghion, Jones, and Jones 2018<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Agrawal, Gans, and Goldfarb 2019<\/a>). Some recent evidence suggests that AI may be contributing to significant declines in entry-level hiring for workers in especially AI-exposed occupations (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Brynjolfsson, Chandar, and Chen 2025<\/a>), with those workers experiencing increased wages. This could suggest a negative supply shock, rather than a negative demand shock from substitution to AI. The ambiguity of technological change arises because automating some human processes often augments others (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Agrawal, Gans, and Goldfarb 2023<\/a>). <a href=\"#1f53487e-20fd-4bed-9901-426a33ef65ec-link\" aria-label=\"Jump to footnote reference 18\">\u21a9\ufe0e<\/a><\/li><li id=\"ae2ddc53-1ace-4f33-8208-e055f402bd19\">Jason Furman, in an interview with Ross Douthat, estimates that 92% of the increase in demand in the U.S. in the first half of 2025 comes from categories related to AI investment and services (information processing equipment and software). Accounting for equilibrium effects, Furman posits that roughly half of GDP growth is from the AI boom (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Douthat 2025<\/a>). This view is echoed by Karen Dynan, an economics professor at Harvard University, who argues, \u201cin a mechanical sense it\u2019s fair to say that AI has been the main driver of U.S. GDP growth this year\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Curran and Niquette 2025<\/a>). <a href=\"#ae2ddc53-1ace-4f33-8208-e055f402bd19-link\" aria-label=\"Jump to footnote reference 19\">\u21a9\ufe0e<\/a><\/li><li id=\"17c90b83-c2e9-4427-8fc0-369dd9597752\">Demis Hassabis: \u201cI think one day maybe we can cure all disease with the help of AI.\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Hassabis 2025<\/a>). <a href=\"#17c90b83-c2e9-4427-8fc0-369dd9597752-link\" aria-label=\"Jump to footnote reference 20\">\u21a9\ufe0e<\/a><\/li><li id=\"d2c807a0-8b97-43c6-a1f7-3c7c1e3fdd3f\">\u201cThere Will Be Very Hard Parts like Whole Classes of Jobs Going Away\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Altman 2025<\/a>). <a href=\"#d2c807a0-8b97-43c6-a1f7-3c7c1e3fdd3f-link\" aria-label=\"Jump to footnote reference 21\">\u21a9\ufe0e<\/a><\/li><li id=\"865aa83d-5511-4739-98e3-e025ab768709\">\u201c[A] dream scenario\u2014perhaps a goal to aim for\u2014would be 20% annual GDP growth rate in the developing world\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Amodei 2024<\/a>). <a href=\"#865aa83d-5511-4739-98e3-e025ab768709-link\" aria-label=\"Jump to footnote reference 22\">\u21a9\ufe0e<\/a><\/li><li id=\"579fdc13-94d7-4f0e-9228-b858d3ceee3b\">Forecasters are denoted \u201csuperforecasters\u201d if they (1) were in the top 2% of the accuracy distribution in a given year of the IARPA ACE tournament (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">IARPA ACE Program n.d.<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Mellers et al. 2014<\/a>) or (2) they were a highly accurate performer on Good Judgment Open, an online continuous geopolitical forecasting tournament. Good Judgment Inc., which runs Good Judgment Open, then adds these top forecasters to the \u201csuperforecaster\u201d pool. Most superforecasters come from the first selection criteria. Mellers et al. (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2015<\/a>) finds persistent performance of these superforecasters across several years of geopolitical forecasting. <a href=\"#579fdc13-94d7-4f0e-9228-b858d3ceee3b-link\" aria-label=\"Jump to footnote reference 23\">\u21a9\ufe0e<\/a><\/li><li id=\"9fc23d2a-f118-4f4e-8692-d1172ee6fddc\">\u201cRepresentative\u201d is a contested, and sometimes fraught concept in social science (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Chasalow and Levy 2021<\/a>). Here, we adopt a narrow definition for our expert sample. We specify an expert sampling frame that we believe closely tracks groups of experts that policymakers are most inclined to consult about AI. We then apply standard survey reweighting methods to combat nonresponse bias, reweighting our respondent sample to match the observable characteristics in our initial sampling frame. For our public sample, we use known characteristics of the U.S. population as our reweighting targets. See <a href=\"#reweighting\">Reweighting<\/a> for more details and a discussion of limitations of this approach. <a href=\"#9fc23d2a-f118-4f4e-8692-d1172ee6fddc-link\" aria-label=\"Jump to footnote reference 24\">\u21a9\ufe0e<\/a><\/li><li id=\"889cae6c-7ce7-4e15-8357-1ccdecf3e7a0\">The <a href=\"#monthly-surveys-and-forecasting-questions\">Monthly Surveys and Forecasting Questions<\/a> section contain further detail on resolution criteria. <a href=\"#889cae6c-7ce7-4e15-8357-1ccdecf3e7a0-link\" aria-label=\"Jump to footnote reference 25\">\u21a9\ufe0e<\/a><\/li><li id=\"93825f9e-0f1b-4259-92b6-3e3c903b5ea5\">As typically specified, the \u201cwisdom of the crowd\u201d phenomenon appeals to independent and unbiased judgments, which lead the aggregate to outperform randomly selected components of the aggregate (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Davis-Stober et al. 2014<\/a>). An extensive literature develops improved aggregation approaches (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Baron et al. 2014<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Himmelstein, Budescu, and Han 2023<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Himmelstein, Budescu, and Ho 2023<\/a>), but a key theme of that work is that a simple median performs quite well as an aggregation mechanism across contexts. <a href=\"#93825f9e-0f1b-4259-92b6-3e3c903b5ea5-link\" aria-label=\"Jump to footnote reference 26\">\u21a9\ufe0e<\/a><\/li><li id=\"064d21f7-b3d1-4317-bef2-4e8847e58d17\">LEAP identifies distinct target populations (experts, superforecasters, and the general public) whose views we wish to study. We carefully construct sampling frames, or the part of the target populations that have a chance of being sampled, using multiple data sources to maximize the coverage of these populations. See <a href=\"#sampling\" id=\"#sampling\">Sampling<\/a> for more information on how sampling frames are constructed. <a href=\"#064d21f7-b3d1-4317-bef2-4e8847e58d17-link\" aria-label=\"Jump to footnote reference 27\">\u21a9\ufe0e<\/a><\/li><li id=\"61f5b523-52cd-481a-a891-9ef64a3e70b1\">Top citations in AI publications have also been used to target experts (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Muller and Bostrom 2014<\/a>), as well as a broader search of the literature using targeted publication classification codes (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">O\u2019Donovan et al. 2025<\/a>). <a href=\"#61f5b523-52cd-481a-a891-9ef64a3e70b1-link\" aria-label=\"Jump to footnote reference 28\">\u21a9\ufe0e<\/a><\/li><li id=\"6346c640-6888-47ea-ba4b-d11ef7f533f7\">Participation from the general public in these types of surveys relies on well-established panels (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">McClain et al. 2025<\/a>), online opinion polling platforms (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Zhang and Defoe 2019<\/a>), or by identifying non-experts with a demonstrated interest in AI (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Walsh 2017<\/a>). <a href=\"#6346c640-6888-47ea-ba4b-d11ef7f533f7-link\" aria-label=\"Jump to footnote reference 29\">\u21a9\ufe0e<\/a><\/li><li id=\"1f9b4d46-19b4-462a-b266-41e84b18c234\">These individuals, labeled \u201csuperforecasters,\u201d are distinguished by their ability to sustain high-accuracy forecasts and \u201cavoid regression to the mean\u201d across multiple prediction instances (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Mellers et al. 2015<\/a>). This was demonstrated by findings from three consecutive years of geopolitical forecasting tournaments conducted by the Good Judgment Project under the U.S. Intelligence Advanced Research Projects Activity (IARPA). <a href=\"#1f9b4d46-19b4-462a-b266-41e84b18c234-link\" aria-label=\"Jump to footnote reference 30\">\u21a9\ufe0e<\/a><\/li><li id=\"ae7ac65a-0fe7-47f1-b51b-2b2c644162e8\">Muller and Bostrom (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2014<\/a>) find experts working in theoretical AI were more likely to respond and more likely to be concerned about the negative effects of AI. Grace et al. (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2018<\/a>) find respondents to have less time in the field and lower citation indexes which in turn was associated with more optimistic views on the timing of human-level machine intelligence (HLMI). O\u2019Donovan et al. (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2025<\/a>) find an association between views on AI safety and governance and respondent views on the inevitability of HLMI, as well as an association between their categorization of respondents as AI optimists and the inevitability of AI. <a href=\"#ae7ac65a-0fe7-47f1-b51b-2b2c644162e8-link\" aria-label=\"Jump to footnote reference 31\">\u21a9\ufe0e<\/a><\/li><li id=\"6d01365e-a7d7-4422-b04e-aeafbefcdbd7\">A panel survey is a survey that repeatedly collects data from the same group of respondents over time, and LEAP is the first continuous panel survey of AI experts. There is one other repeated expert sample of AI experts that we have identified in the literature: Zhang et al. (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2022<\/a>) and Stein-Perlman and Grace (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2022<\/a>) do \u201cmatched panel\u201d analysis by matching their respondents to earlier responses in Grace et al. (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2018<\/a>). <a href=\"#6d01365e-a7d7-4422-b04e-aeafbefcdbd7-link\" aria-label=\"Jump to footnote reference 32\">\u21a9\ufe0e<\/a><\/li><li id=\"7f3f6025-dedc-4c9c-82a9-1eda5a37187f\">Nevertheless, the creation of unambiguous forecasting questions remains difficult. We dropped one question on the use of AI-use during K-12 instructional hours from this analysis, due to substantial misinterpretations. <a href=\"#7f3f6025-dedc-4c9c-82a9-1eda5a37187f-link\" aria-label=\"Jump to footnote reference 33\">\u21a9\ufe0e<\/a><\/li><li id=\"84f8937b-1ffb-4a3f-94a4-ba5d8b3bba48\">Some of the cited studies extrapolate forecasts backwards to obtain estimates for earlier dates, but these methods require parametric assumptions and generally do not allow for discontinuities in the time paths of forecasts. <a href=\"#84f8937b-1ffb-4a3f-94a4-ba5d8b3bba48-link\" aria-label=\"Jump to footnote reference 34\">\u21a9\ufe0e<\/a><\/li><li id=\"6401e4e8-3ac9-49bd-9b42-4cf3fa0a82de\">Forecasters are denoted \u201csuperforecasters\u201d if they (1) were in the top 2% of the accuracy distribution in a given year of the IARPA ACE tournament (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">IARPA ACE Program n.d.<\/a>; <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Mellers et al. 2014<\/a>) or (2) they were a highly accurate performer on Good Judgment Open, an online continuous geopolitical forecasting tournament. Good Judgment Inc., which runs Good Judgment Open, then adds these top forecasters to the \u201csuperforecaster\u201d pool. Most superforecasters come from the first selection criteria. Mellers et al. (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2015<\/a>) finds persistent performance of these superforecasters across several years of geopolitical forecasting. <a href=\"#6401e4e8-3ac9-49bd-9b42-4cf3fa0a82de-link\" aria-label=\"Jump to footnote reference 35\">\u21a9\ufe0e<\/a><\/li><li id=\"e994d151-c382-4662-965f-c28352b5d1d5\">Highly-engaged participants were recruited from previous high effort projects spanning multiple weeks. <a href=\"#e994d151-c382-4662-965f-c28352b5d1d5-link\" aria-label=\"Jump to footnote reference 36\">\u21a9\ufe0e<\/a><\/li><li id=\"0409d0c2-0b62-47b0-a0c0-2506997a6fdc\">Pew describes \u201craking,\u201d also known as iterative proportional fitting, as the most common approach to reweighting public opinion surveys, <a href=\"https:\/\/www.pewresearch.org\/methods\/2018\/01\/26\/how-different-weighting-methods-work\/#:~:text=For%20public%20opinion%20surveys%2C%20the,each%20variable%20used%20in%20weighting.\" id=\"https:\/\/www.pewresearch.org\/methods\/2018\/01\/26\/how-different-weighting-methods-work\/#:~:text=For%20public%20opinion%20surveys%2C%20the,each%20variable%20used%20in%20weighting.\">\u201cFor public opinion surveys, the most prevalent method for weighting is iterative proportional fitting, more commonly referred to as raking\u201d<\/a> (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Mercer et al. 2018<\/a>). <a href=\"#0409d0c2-0b62-47b0-a0c0-2506997a6fdc-link\" aria-label=\"Jump to footnote reference 37\">\u21a9\ufe0e<\/a><\/li><li id=\"0f8c5e8c-45d5-4684-9430-7bd8115fa6ee\">Effective Altruism (EA) is a philosophical and social movement focused on directing resources towards improving the world. It is not a monolithic group. Many effective altruists strongly disagree about how best to direct resources towards improving the world, the philosophical framework that determines what an \u2018improvement\u2019 means, the risks one should take to improve the world, and the focus one should place on the short- and long-run in pursuit of improving the world. We consider someone EA-affiliated if they or their employer have or have had funding ties to EA, they publicly endorse EA, or they self-identify as EA-affiliated or EA-adjacent. Further, if their work focuses on EA, AI safety, global catastrophic risks, or existential risks, we consider them to be EA-affiliated. These criteria are permissive and favor overinclusion, as we chose to measure an upper bound of EA affiliation. Nevertheless, the proportion of our frame tagged as having EA affiliation remains relatively small, at 14.3%. <a href=\"#0f8c5e8c-45d5-4684-9430-7bd8115fa6ee-link\" aria-label=\"Jump to footnote reference 38\">\u21a9\ufe0e<\/a><\/li><li id=\"67c8e553-de80-40a5-8016-37d549db3725\">We define \u201ctop AI labs\u201d as the 10 unique labs which provide the 20 most computationally intensive models (in terms of training FLOP) on Epoch AI\u2019s <a href=\"https:\/\/epoch.ai\/data\/ai-models\" id=\"https:\/\/epoch.ai\/data\/ai-models\">Data on AI Models<\/a> table (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Epoch AI 2024b<\/a>). <a href=\"#67c8e553-de80-40a5-8016-37d549db3725-link\" aria-label=\"Jump to footnote reference 39\">\u21a9\ufe0e<\/a><\/li><li id=\"b7a4e61c-0e08-472b-8df1-2381b8e75f7b\">As seen in the table below, we split our expert population into four categories of expertise: Computer Science, Economics, Industry and Policy. For the purposes of reweighting, we equally weight these categories. Our initial frame slightly overrepresented Computer Science and Industry professionals and underrepresented Economics and Policy professionals. <a href=\"#b7a4e61c-0e08-472b-8df1-2381b8e75f7b-link\" aria-label=\"Jump to footnote reference 40\">\u21a9\ufe0e<\/a><\/li><li id=\"8c49426e-4ad8-40a9-85a3-f09d3efd32ca\">We use the <a href=\"https:\/\/doi.org\/10.18128\/D010.V16.0\">IPUMS USA combined Census and American Community Survey (ACS) data<\/a> to derive population targets for all variables except party identification (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Ruggles et al. 2025<\/a>). For party identification, we use <a href=\"https:\/\/www.pewresearch.org\/politics\/2024\/04\/09\/the-partisanship-and-ideology-of-american-voters\/\" id=\"https:\/\/www.pewresearch.org\/politics\/2024\/04\/09\/the-partisanship-and-ideology-of-american-voters\/\">data from Pew Research Center<\/a> (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Nadeem 2024<\/a>). <a href=\"#8c49426e-4ad8-40a9-85a3-f09d3efd32ca-link\" aria-label=\"Jump to footnote reference 41\">\u21a9\ufe0e<\/a><\/li><li id=\"3e1cd8cc-472b-4a6f-b1ab-ab02415bd022\">Note that the actual proportion of participants invited from Industry was 36%. For the purposes of reweighting, each respondent category of expertise was weighted equally as discussed above. <a href=\"#3e1cd8cc-472b-4a6f-b1ab-ab02415bd022-link\" aria-label=\"Jump to footnote reference 42\">\u21a9\ufe0e<\/a><\/li><li id=\"31f3e343-943a-4f7c-b254-30b03fd86a60\">These respondents are either 200 top-cited according to OpenAlex, or part of our age-stratified CS author list. <a href=\"#31f3e343-943a-4f7c-b254-30b03fd86a60-link\" aria-label=\"Jump to footnote reference 43\">\u21a9\ufe0e<\/a><\/li><li id=\"98cb1a57-9113-4df3-b1a4-4ac0efac4e9d\">Research rankings for economics schools and journals, based on publication records. Authors at an institution hosting one of the top-50 economics departments according to RePEc are considered. <a href=\"#98cb1a57-9113-4df3-b1a4-4ac0efac4e9d-link\" aria-label=\"Jump to footnote reference 44\">\u21a9\ufe0e<\/a><\/li><li id=\"41549962-1a59-45e2-a739-8bb71e8a1b2f\">The percentage decrease in completions from Wave 2 to 3 per category were as follows: Economics: 1.7%, Superforecasters: 3.5%, Public: 3.8%, Computer Science: 4.7%, Industry: 5.3%, Policy: 7.2%. <a href=\"#41549962-1a59-45e2-a739-8bb71e8a1b2f-link\" aria-label=\"Jump to footnote reference 45\">\u21a9\ufe0e<\/a><\/li><li id=\"10c0bd6e-dc02-4b64-8601-2383555bcf25\">The survey platform records the active time spent on the survey window on respondents\u2019 devices. If they leave the page (even to do additional research relevant to the survey), the timer is paused. Hence, this measure likely serves as a lower bound for time spent on the survey. <a href=\"#10c0bd6e-dc02-4b64-8601-2383555bcf25-link\" aria-label=\"Jump to footnote reference 46\">\u21a9\ufe0e<\/a><\/li><li id=\"2a066104-34d3-43c9-916f-65ad3ecd00c2\">We provide expert participants with $2,000 per each year of full participation (prorated for the number of surveys they complete, with an expectation that we will complete 12 surveys in a typical year). In other words, experts receive $166.67 per survey completed. We provide superforecasters with $1,000, prorated similarly, or $83.33 per survey completed. We pay public participants in line with CloudResearch platform norms, or $8 per survey completed ($13.71 per hour). <a href=\"#2a066104-34d3-43c9-916f-65ad3ecd00c2-link\" aria-label=\"Jump to footnote reference 47\">\u21a9\ufe0e<\/a><\/li><li id=\"b8186e41-dd7f-49b2-820c-dc0836183480\">Respondents are prompted to give their forecasts in the form of probabilities. <a href=\"#b8186e41-dd7f-49b2-820c-dc0836183480-link\" aria-label=\"Jump to footnote reference 48\">\u21a9\ufe0e<\/a><\/li><li id=\"fbb1f957-9891-42fa-8c14-a32faa4e4b21\">Respondents are prompted to give their forecasts in the form of 25<sup>th<\/sup> , 50<sup>th<\/sup> , and 75<sup>th<\/sup> percentiles. <a href=\"#fbb1f957-9891-42fa-8c14-a32faa4e4b21-link\" aria-label=\"Jump to footnote reference 49\">\u21a9\ufe0e<\/a><\/li><li id=\"9753e07e-ab53-40f2-a948-77cfa0cacb13\">The pooled distribution is not necessarily the optimal way to aggregate forecasts in terms of forecasting accuracy. For example, Ranjan and Gneiting (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2010<\/a>) show that combining well-calibrated forecasts in this fashion yields forecasts that are miscalibrated. Similar results are reported by Lichtendahl et al (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2013<\/a>). <a href=\"#9753e07e-ab53-40f2-a948-77cfa0cacb13-link\" aria-label=\"Jump to footnote reference 50\">\u21a9\ufe0e<\/a><\/li><li id=\"ab935eb6-3a6a-45e4-84b4-e5c4214e247c\">In future work, we will summarize forecaster-level gaps between 25<sup>th<\/sup> and 75<sup>th<\/sup> percentile forecasts. <a href=\"#ab935eb6-3a6a-45e4-84b4-e5c4214e247c-link\" aria-label=\"Jump to footnote reference 51\">\u21a9\ufe0e<\/a><\/li><li id=\"98818581-9957-4f4d-ab1f-c4945458f22a\"><em>Pooled distribution<\/em>: IQR (7.3%\u201334.6%); variance decomposition: 47% between\u2013forecaster disagreement, 53% within\u2013forecaster uncertainty. <br><em>Raw data<\/em>: IQR on the 50<sup>th<\/sup> percentile was (9%\u201330%); median 25<sup>th<\/sup> and 75<sup>th<\/sup> percentile forecasts were 9% and 28% respectively. <br>Uncertainty and disagreement metrics on other claims made in this list can be found in the Monthly Reports at <a href=\"https:\/\/leap.forecastingresearch.org\/reports\/\">https:\/\/leap.forecastingresearch.org\/reports\/<\/a>. <a href=\"#98818581-9957-4f4d-ab1f-c4945458f22a-link\" aria-label=\"Jump to footnote reference 52\">\u21a9\ufe0e<\/a><\/li><li id=\"2b7fae99-fc07-44f2-b81f-3174831227bb\">Respondents were shown a historical baseline value of 2%, based on an earlier version of the cited paper. The most recent draft gives a range of 1.6% to 6.6%. We select the midpoint, 4.1%, as the historical baseline value. <a href=\"#2b7fae99-fc07-44f2-b81f-3174831227bb-link\" aria-label=\"Jump to footnote reference 53\">\u21a9\ufe0e<\/a><\/li><li id=\"8cc15fdf-409f-4f29-98f6-67baaf97c59d\">Sentences of the form, \u201cThe median expert gives an X% chance,\u201d report the median of experts\u2019 X<sup>th<\/sup> percentile forecasts. <a href=\"#8cc15fdf-409f-4f29-98f6-67baaf97c59d-link\" aria-label=\"Jump to footnote reference 54\">\u21a9\ufe0e<\/a><\/li><li id=\"23bfde2d-f28b-477d-8655-ea9c96555e8f\">Regarding their private AI investment indicator, Our World in Data (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2025<\/a>) notes: 1. \u201cThe data likely underestimates total global AI investment, as it only captures certain types of private equity transactions, excluding other significant channels and categories of AI-related spending;\u201d 2. \u201cThe source does not fully disclose its methodology and what&#8217;s included or excluded. This means it may not fully capture important areas of AI investment, such as those from publicly traded companies, corporate internal R&amp;D, government funding, public sector initiatives, data center infrastructure, hardware production, semiconductor manufacturing, and expenses for research and talent.\u201d More details on what is likely excluded can be found at Our World in Data (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2025<\/a>). <a href=\"#23bfde2d-f28b-477d-8655-ea9c96555e8f-link\" aria-label=\"Jump to footnote reference 55\">\u21a9\ufe0e<\/a><\/li><li id=\"c2d6b13f-32f9-40bb-b188-ddff67dc7421\">This estimate of 23% reflects the fraction of experts whose median forecast is that AI systems will achieve performance of at least 90% (which we call saturation) on Tiers 1-3 of FrontierMath. We take the average of the proportions calculated under weak and strict inequality. <a href=\"#c2d6b13f-32f9-40bb-b188-ddff67dc7421-link\" aria-label=\"Jump to footnote reference 56\">\u21a9\ufe0e<\/a><\/li><li id=\"8acd763f-4363-457b-a67c-090e56541707\"><em>Raw data<\/em>: IQR on the 50<sup>th<\/sup> percentile was (10%\u201330%). <a href=\"#8acd763f-4363-457b-a67c-090e56541707-link\" aria-label=\"Jump to footnote reference 57\">\u21a9\ufe0e<\/a><\/li><li id=\"bdd05175-27a9-4dcd-aa6b-f7c5de77458a\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=98\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=98\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix E.I. Survey Questions: Wave 1<\/a> for background information. We ask participants the probability that LEAP panelists will choose \u201cslow progress,\u201d \u201cmoderate progress,\u201d or \u201crapid progress\u201d as best matching the general level of AI progress. <a href=\"#bdd05175-27a9-4dcd-aa6b-f7c5de77458a-link\" aria-label=\"Jump to footnote reference 58\">\u21a9\ufe0e<\/a><\/li><li id=\"f7f7e06f-3b9e-412d-afe6-7537f6485a8d\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=98\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=98\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix E.I. Survey Questions: Wave 1<\/a> for background information <a href=\"#f7f7e06f-3b9e-412d-afe6-7537f6485a8d-link\" aria-label=\"Jump to footnote reference 59\">\u21a9\ufe0e<\/a><\/li><li id=\"f5441114-d01f-46a5-8871-c9bbaeb2942e\"><em>Raw data<\/em>: IQR on the 50<sup>th<\/sup> percentile was (-4%\u20135%) <a href=\"#f5441114-d01f-46a5-8871-c9bbaeb2942e-link\" aria-label=\"Jump to footnote reference 60\">\u21a9\ufe0e<\/a><\/li><li id=\"41a645ad-d348-452d-9788-c893693a4f45\">We have a complementary survey in the field exploring these topics which we plan to release results from in early 2026. <a href=\"#41a645ad-d348-452d-9788-c893693a4f45-link\" aria-label=\"Jump to footnote reference 61\">\u21a9\ufe0e<\/a><\/li><li id=\"cad14124-84e3-405a-8c44-d454bc9b91c3\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=108\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=108\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix E.I. Survey Questions: Wave 1, Question 5, Technological Richter Scale<\/a> for details. <a href=\"#cad14124-84e3-405a-8c44-d454bc9b91c3-link\" aria-label=\"Jump to footnote reference 62\">\u21a9\ufe0e<\/a><\/li><li id=\"5d86a372-9fcf-43bb-a540-84cfcdf20a1e\">The median forecast for this question was 25%. <br><em>Pooled distribution<\/em>: IQR (8.4%\u201353.8%); variance decomposition: 52% between\u2013forecaster disagreement, 48% within\u2013forecaster uncertainty. <br><em>Raw data<\/em>: IQR on the 50<sup>th<\/sup> percentile was (10%\u201350%); median 25<sup>th<\/sup> and 75<sup>th<\/sup> percentile forecasts were 10% and 43% respectively. <a href=\"#5d86a372-9fcf-43bb-a540-84cfcdf20a1e-link\" aria-label=\"Jump to footnote reference 63\">\u21a9\ufe0e<\/a><\/li><li id=\"d3f78ea9-9e62-4a05-95f9-937aa2e600e9\">The median forecast for this question was 60%. <a href=\"#d3f78ea9-9e62-4a05-95f9-937aa2e600e9-link\" aria-label=\"Jump to footnote reference 64\">\u21a9\ufe0e<\/a><\/li><li id=\"0f876b32-154f-4ffd-81cd-cb565602b4f8\">See <a href=\"https:\/\/leap.forecastingresearch.org\/reports\/\">https:\/\/leap.forecastingresearch.org\/reports\/<\/a> to access these tables. <a href=\"#0f876b32-154f-4ffd-81cd-cb565602b4f8-link\" aria-label=\"Jump to footnote reference 65\">\u21a9\ufe0e<\/a><\/li><li id=\"1c277265-12d5-46e7-86a4-18ea84d9eedb\"><em>Raw data<\/em>: The median forecast for this question was 20%. IQR on the 50<sup>th<\/sup> percentile was (10%\u201330%). <a href=\"#1c277265-12d5-46e7-86a4-18ea84d9eedb-link\" aria-label=\"Jump to footnote reference 66\">\u21a9\ufe0e<\/a><\/li><li id=\"d5650a95-e72c-4f84-9fb3-40613353d20e\">See <a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=98\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=98\" target=\"_blank\" rel=\"noreferrer noopener\">Appendix E.I. Survey Questions: Wave 1<\/a> for background information. We ask participants the probability LEAP panelists will choose \u201cslow progress,\u201d \u201cmoderate progress,\u201d or \u201crapid progress\u201d as best matching the general level of AI progress. <a href=\"#d5650a95-e72c-4f84-9fb3-40613353d20e-link\" aria-label=\"Jump to footnote reference 67\">\u21a9\ufe0e<\/a><\/li><li id=\"d11df40e-72a1-411b-8fde-3f2d502bf544\"><em>Raw data<\/em>: IQR on the 50<sup>th<\/sup> percentile was (-4%\u20135%). <a href=\"#d11df40e-72a1-411b-8fde-3f2d502bf544-link\" aria-label=\"Jump to footnote reference 68\">\u21a9\ufe0e<\/a><\/li><li id=\"6feae762-cbb5-456d-8f2d-9c0ecc33f95a\">In a future survey wave, we plan to collect forecasts of the predicted relationship between AI capabilities and employment growth in each sector by asking respondents to forecast employment growth conditional on low-, moderate-, and rapid-progress scenarios. <a href=\"#6feae762-cbb5-456d-8f2d-9c0ecc33f95a-link\" aria-label=\"Jump to footnote reference 69\">\u21a9\ufe0e<\/a><\/li><li id=\"88000fb8-dbe0-4c1e-bdba-6f267cd8ccbf\"><em>Raw data<\/em>: IQR on the 50<sup>th<\/sup> percentile was (30%\u201381%). <a href=\"#88000fb8-dbe0-4c1e-bdba-6f267cd8ccbf-link\" aria-label=\"Jump to footnote reference 70\">\u21a9\ufe0e<\/a><\/li><li id=\"2997cdbe-89c3-4246-aa80-9590826ba717\"><em>Raw data<\/em>: IQR on the 50<sup>th<\/sup> percentile was (10%\u201350%). <a href=\"#2997cdbe-89c3-4246-aa80-9590826ba717-link\" aria-label=\"Jump to footnote reference 71\">\u21a9\ufe0e<\/a><\/li><li id=\"cbd0269b-114c-4ad3-9bd8-1b550056f9d2\">The degree to which progress on Millennium Prize Problems is serial or parallel, as well as the general difficulty of the Problems, complicates this comparison. Eliciting forecasts from multiple experts on consistent forecasting questions with clear resolution criteria helps us bring clarity to debates often plagued by ambiguous definitions. <a href=\"#cbd0269b-114c-4ad3-9bd8-1b550056f9d2-link\" aria-label=\"Jump to footnote reference 72\">\u21a9\ufe0e<\/a><\/li><li id=\"934b8442-9d4f-4d36-8120-1843f29193d6\">Questions include FrontierMath, Autonomous Vehicle Trips, Millennium Prize, Diffusion of AI Across Sciences, Drug Discovery, Electricity Consumption, Cognitive Limitations, AI Investment, Generative AI Use Intensity, Open vs Proprietary Polarity, AI Companions, Barriers to Adoption, General AI progress, and Technological Richter Scale. <a href=\"#934b8442-9d4f-4d36-8120-1843f29193d6-link\" aria-label=\"Jump to footnote reference 73\">\u21a9\ufe0e<\/a><\/li><li id=\"9d6dd258-27c6-4bbf-a8a2-d7b72fb8f0f9\">Forecasting questions with clear valence have an unambiguous directional association with progress. For some questions, like employment by sector, it is unclear whether higher or lower levels of unemployment would be associated with more advanced or less advanced AI progress, so we exclude those questions from this analysis. We also transform some forecasts to establish the progress valence. First, we take the average across all fields in Diffusion of AI Across Sciences. Second, for Cognitive Limitations and Barriers to Adoption, we average across all categories and use the complementary probability. Third, we take the average of closed- and open-weight performance for Open vs Proprietary Polarity. Lastly, we take the values assigned to the \u201cRapid\u201d scenario and TRS levels 8 and above for the General AI Progress and Technological Richter Scale questions, respectively. <a href=\"#9d6dd258-27c6-4bbf-a8a2-d7b72fb8f0f9-link\" aria-label=\"Jump to footnote reference 74\">\u21a9\ufe0e<\/a><\/li><li id=\"7e5ba175-383b-4ed0-a966-7bd43feb4245\">We use Mann\u2013Whitney U tests for equality in distribution unless otherwise stated, with a 5% significance threshold. All Mann\u2013Whitney U tests and Cliff\u2019s <em>\u03b4<\/em> values are currently unweighted. <a href=\"#7e5ba175-383b-4ed0-a966-7bd43feb4245-link\" aria-label=\"Jump to footnote reference 75\">\u21a9\ufe0e<\/a><\/li><li id=\"6dc2d733-d15e-453b-8e68-32880d3cca1c\">We claim a group predicts statistically significantly less progress according to a Mann\u2013Whitney U test <em>and<\/em> a negative Cliff\u2019s <em>\u03b4<\/em>. <a href=\"#6dc2d733-d15e-453b-8e68-32880d3cca1c-link\" aria-label=\"Jump to footnote reference 76\">\u21a9\ufe0e<\/a><\/li><li id=\"2ff68abb-a816-4876-958f-ba26747c8edf\">Cliff\u2019s <em>\u03b4<\/em> performs pairwise comparisons between all values of two empirical distributions. It takes the number of comparisons where the value from the first distribution exceeds the second and subtracts the number of comparisons where the value from the second distribution exceeds the first and reports this difference as a proportion of the count of comparisons. In other words, it reports the probability that a randomly drawn value from the first distribution exceeds a randomly drawn value from the second distribution, over and above what would be expected by pure random chance if the two distributions were identical. <a href=\"#2ff68abb-a816-4876-958f-ba26747c8edf-link\" aria-label=\"Jump to footnote reference 77\">\u21a9\ufe0e<\/a><\/li><li id=\"888c9df3-1e71-4349-9a83-a8f74371c6ce\">See <a href=\"https:\/\/leap.forecastingresearch.org\/reports\/\">https:\/\/leap.forecastingresearch.org\/reports\/<\/a> to access these tables. <a href=\"#888c9df3-1e71-4349-9a83-a8f74371c6ce-link\" aria-label=\"Jump to footnote reference 78\">\u21a9\ufe0e<\/a><\/li><li id=\"328076b5-8365-43d5-abc4-cc93d7587a88\"><em>Pooled distribution<\/em>: IQR (6.9%\u201346.1%); variance decomposition: 47% between\u2013forecaster disagreement, 53% within\u2013forecaster uncertainty. <em>Raw data<\/em>: IQR on the 50<sup>th<\/sup> percentile was (10%\u201340%); median 25<sup>th<\/sup> and 75<sup>th<\/sup> percentile forecasts were 8% and 35% respectively. <a href=\"#328076b5-8365-43d5-abc4-cc93d7587a88-link\" aria-label=\"Jump to footnote reference 79\">\u21a9\ufe0e<\/a><\/li><li id=\"3f0107f8-f99a-4f11-9abf-973f0331d9a1\"><em>Pooled distribution<\/em>: IQR (4%\u201331%); variance decomposition: 66% between\u2013forecaster disagreement, 34% within\u2013forecaster uncertainty. <br><em>Raw data<\/em>: IQR on the 50<sup>th<\/sup> percentile was (5%\u201329%); median 25<sup>th<\/sup> and 75<sup>th<\/sup> percentile forecasts were 5% and 20% respectively. <br>Uncertainty and disagreement metrics on other claims made in this list can be found in the Monthly Reports at <a href=\"https:\/\/leap.forecastingresearch.org\/reports\/\">https:\/\/leap.forecastingresearch.org\/reports\/<\/a>. <a href=\"#3f0107f8-f99a-4f11-9abf-973f0331d9a1-link\" aria-label=\"Jump to footnote reference 80\">\u21a9\ufe0e<\/a><\/li><li id=\"3467dc4d-ea16-46eb-98f5-47a868923605\">&#8220;To gauge the difficulty of FrontierMath problems, we organized a competition at MIT involving around 40 exceptional math undergraduates and subject-matter experts. Participants formed eight teams of four or five members, each with internet access, and had four and a half hours to solve 23 problems. On a subset of 23 tier 1-3 problems, the average team scored 19%, while 35% of the problems were solved collectively across all teams.&#8221; (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Epoch AI 2025<\/a>). <a href=\"#3467dc4d-ea16-46eb-98f5-47a868923605-link\" aria-label=\"Jump to footnote reference 81\">\u21a9\ufe0e<\/a><\/li><li id=\"23a598cf-525c-4d4e-ac54-155fb5d62df1\">This estimate of 23% reflects the fraction of experts whose median forecast is that AI systems will achieve performance of at least 90% (which we call saturation) on Tiers 1\u20133 of FrontierMath. We take the average of the proportions calculated under weak and strict inequality. <a href=\"#23a598cf-525c-4d4e-ac54-155fb5d62df1-link\" aria-label=\"Jump to footnote reference 82\">\u21a9\ufe0e<\/a><\/li><li id=\"59e81bf9-27c0-408e-9e27-224496185e50\">Physics: 32%; Materials Science: 37%; Medicine: 37%. <a href=\"#59e81bf9-27c0-408e-9e27-224496185e50-link\" aria-label=\"Jump to footnote reference 83\">\u21a9\ufe0e<\/a><\/li><li id=\"80d423ed-cb7d-4acf-8d3e-f74c9ea10fcb\">Physics: 27%; Materials Science: 30%; Medicine: 30%. <a href=\"#80d423ed-cb7d-4acf-8d3e-f74c9ea10fcb-link\" aria-label=\"Jump to footnote reference 84\">\u21a9\ufe0e<\/a><\/li><li id=\"54630412-1747-4c6c-8b27-9bb8eaf39c0d\">For this comparison, we switch to reporting Cliff\u2019s \u03b4 values calculated with the public as the first distribution. <a href=\"#54630412-1747-4c6c-8b27-9bb8eaf39c0d-link\" aria-label=\"Jump to footnote reference 85\">\u21a9\ufe0e<\/a><\/li><li id=\"cd6bbf30-b63a-4157-a9b6-136fd4cb82dd\">The median forecast was 23%. <em>Raw data<\/em>: IQR on the 50<sup>th<\/sup> percentile was (12%\u201335%) <a href=\"#cd6bbf30-b63a-4157-a9b6-136fd4cb82dd-link\" aria-label=\"Jump to footnote reference 86\">\u21a9\ufe0e<\/a><\/li><li id=\"31ae1fcd-f1a7-4a0e-8954-f07e28840e22\">The median forecast was 20%. <em>Raw data<\/em>: IQR on the 50<sup>th<\/sup> percentile was (10%\u201330%) <a href=\"#31ae1fcd-f1a7-4a0e-8954-f07e28840e22-link\" aria-label=\"Jump to footnote reference 87\">\u21a9\ufe0e<\/a><\/li><li id=\"38196ecc-fcfc-4953-b552-fee38be2bf97\"><em>Pooled distribution<\/em>: IQR (6.9%\u201346.2%); variance decomposition: 47% between\u2013forecaster disagreement, 53% within\u2013forecaster uncertainty. <br><em>Raw data<\/em>: IQR on the 50<sup>th<\/sup> percentile was (10%\u201340%); median 25<sup>th<\/sup> and 75<sup>th<\/sup> percentile forecasts were 8% and 35% respectively. <a href=\"#38196ecc-fcfc-4953-b552-fee38be2bf97-link\" aria-label=\"Jump to footnote reference 88\">\u21a9\ufe0e<\/a><\/li><li id=\"0db06cd8-5f5f-4038-93ff-a68285545c4d\"><em>Pooled distribution<\/em>: IQR (3%\u201325%); variance decomposition: 48% between\u2013forecaster disagreement, 52% within\u2013forecaster uncertainty. <br><em>Raw data<\/em>: IQR on the 50<sup>th<\/sup> percentile was (3%\u201325%); median 25<sup>th<\/sup> and 75<sup>th<\/sup> percentile forecasts were 4% and 20% respectively. <br>Uncertainty and disagreement metrics on other claims made in this paragraph can be found in the Monthly Reports at <a href=\"https:\/\/leap.forecastingresearch.org\/reports\/\">https:\/\/leap.forecastingresearch.org\/reports\/<\/a>. <a href=\"#0db06cd8-5f5f-4038-93ff-a68285545c4d-link\" aria-label=\"Jump to footnote reference 89\">\u21a9\ufe0e<\/a><\/li><li id=\"f6aa4b20-0c7b-4251-a1d4-a3629b227fc0\">See <a href=\"https:\/\/leap.forecastingresearch.org\/reports\/\">https:\/\/leap.forecastingresearch.org\/reports\/<\/a> to access these tables. <a href=\"#f6aa4b20-0c7b-4251-a1d4-a3629b227fc0-link\" aria-label=\"Jump to footnote reference 90\">\u21a9\ufe0e<\/a><\/li><li id=\"a8016211-a299-4c2c-a058-715e656b8533\">Experts were asked, \u201cWhat will be the highest percentage accuracy achieved by an AI model on FrontierMath, by Jan 1 of 2026, 2028, and 2031?\u201d <a href=\"#a8016211-a299-4c2c-a058-715e656b8533-link\" aria-label=\"Jump to footnote reference 91\">\u21a9\ufe0e<\/a><\/li><li id=\"c1891d77-58c4-4aca-8ca2-d9c550549066\">Actual progress was marginally slower. The top Tier 1-3 accuracy rate rose from 1.03% in June of 2024 to 29% in August of 2025, where it remained as of the publication of this paper. <a href=\"#c1891d77-58c4-4aca-8ca2-d9c550549066-link\" aria-label=\"Jump to footnote reference 92\">\u21a9\ufe0e<\/a><\/li><li id=\"150075bf-55f0-4ce6-8676-ee214749eb17\">Experts were asked, \u201cWhat percentage of U.S. ride-hailing trips will be provided by autonomous vehicles that are classified SAE Level 4 or above in the years 2027 and 2030?\u201d <a href=\"#150075bf-55f0-4ce6-8676-ee214749eb17-link\" aria-label=\"Jump to footnote reference 93\">\u21a9\ufe0e<\/a><\/li><li id=\"15dc6f35-e4d0-4ebb-a421-50422463048e\">Experts were asked, \u201cWhat will the percent change in the number of jobs (compared to Jan 1, 2025) in the U.S. be for white-collar, blue-collar, and service occupations, by Jan 1 of 2028 and 2031?\u201d <a href=\"#15dc6f35-e4d0-4ebb-a421-50422463048e-link\" aria-label=\"Jump to footnote reference 94\">\u21a9\ufe0e<\/a><\/li><li id=\"45326349-1cc9-4056-a9ee-e162931dae19\">Experts were presented with three scenarios that detailed the development of AI capabilities and asked, \u201cAt the end of 2030, what percent of LEAP panelists will choose \u201cslow progress,\u201d \u201cmoderate progress,\u201d or \u201crapid progress\u201d as best matching the general level of AI progress?\u201d <a href=\"#45326349-1cc9-4056-a9ee-e162931dae19-link\" aria-label=\"Jump to footnote reference 95\">\u21a9\ufe0e<\/a><\/li><li id=\"baac8fbb-0b1e-44ee-b779-3704da1d4dc3\">Actual rate of improvement likely falls within this range, especially given recent acceleration trends, but there is considerable uncertainty and domain variability. See: <a href=\"https:\/\/metr.org\/blog\/2025-03-19-measuring-ai-ability-to-complete-long-tasks\/\">https:\/\/metr.org\/blog\/2025-03-19-measuring-ai-ability-to-complete-long-tasks\/<\/a> for more information. <a href=\"#baac8fbb-0b1e-44ee-b779-3704da1d4dc3-link\" aria-label=\"Jump to footnote reference 96\">\u21a9\ufe0e<\/a><\/li><li id=\"25fab556-1fa8-43b8-9d4e-72a8e47447c7\">Nate Silver\u2019s book \u201cOn the Edge\u201d proposes the technological Richter scale (TRS) which, analogous with earthquake magnitudes, rates the impact of technologies on a roughly logarithmic scale with10 representing that greatest impact (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Silver 2024<\/a>). Experts were asked, \u201cAt the end of 2040, what is the probability for AI achieving the following levels of net impact [ranging from 5-10] on human society as compared to the impact of past technological events?\u201d <a href=\"#25fab556-1fa8-43b8-9d4e-72a8e47447c7-link\" aria-label=\"Jump to footnote reference 97\">\u21a9\ufe0e<\/a><\/li><li id=\"303a3c69-d754-4e8c-8bba-63ac24995998\">Experts were asked, \u201cWhat is the probability that AI will solve or substantially assist in solving a Millennium Prize Problem in mathematics by Dec 31 of 2027, 2030, and 2040?\u201d <a href=\"#303a3c69-d754-4e8c-8bba-63ac24995998-link\" aria-label=\"Jump to footnote reference 98\">\u21a9\ufe0e<\/a><\/li><li id=\"cabcd0dd-a9c8-4846-8934-d23aa0862b47\">Experts were asked, \u201cWhat percent of publications in the fields of Physics, Materials Science, and Medicine in 2030 will be \u2018AI-engaged\u2019 as measured in a replication of this study?\u201d (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Duede et al. 2024<\/a>) <a href=\"#cabcd0dd-a9c8-4846-8934-d23aa0862b47-link\" aria-label=\"Jump to footnote reference 99\">\u21a9\ufe0e<\/a><\/li><li id=\"4b26fec0-a80a-4b36-b343-5fc01a5fe3b9\">Experts were asked, \u201cWhat percent of sales of recently approved U.S. drugs will be from AI-discovered drugs and products derived from AI-discovered drugs in the years 2027, 2030 and 2040?\u201d <a href=\"#4b26fec0-a80a-4b36-b343-5fc01a5fe3b9-link\" aria-label=\"Jump to footnote reference 100\">\u21a9\ufe0e<\/a><\/li><li id=\"9eddcef0-6559-409f-a235-4e5b1e3bb50a\">This was true at the time this expert completed the survey. <a href=\"#9eddcef0-6559-409f-a235-4e5b1e3bb50a-link\" aria-label=\"Jump to footnote reference 101\">\u21a9\ufe0e<\/a><\/li><li id=\"fc555ffd-72de-41b2-93aa-8b26cf617ca0\">Experts were asked, \u201cWhat percent of U.S. electricity consumption will be used for training and deploying AI systems in the years 2027, 2030 and 2040?\u201d <a href=\"#fc555ffd-72de-41b2-93aa-8b26cf617ca0-link\" aria-label=\"Jump to footnote reference 102\">\u21a9\ufe0e<\/a><\/li><li id=\"b41f19c3-e70b-4826-8e0e-f6e36d60d407\">Experts were asked, \u201cWhat will be the global private investment (in billion USD) in AI in the years 2027 and 2030?\u201d <a href=\"#b41f19c3-e70b-4826-8e0e-f6e36d60d407-link\" aria-label=\"Jump to footnote reference 103\">\u21a9\ufe0e<\/a><\/li><li id=\"37ebf1a6-a5b9-4718-98ec-b418ceec79fc\">Experts were asked, \u201cWhat proportion of U.S. adults will self-report using AI for companionship at least once daily by Dec 31 of 2027, 2030, and 2040?\u201d <a href=\"#37ebf1a6-a5b9-4718-98ec-b418ceec79fc-link\" aria-label=\"Jump to footnote reference 104\">\u21a9\ufe0e<\/a><\/li><li id=\"dbf69356-d11c-4580-8af5-0f30101613d6\">This claim may refer to the ~23% of U.S. adults who, according to a 2024 KFF (formerly Kaiser Family Foundation) study, \u201csay they received mental health counseling and\/or prescription medication for mental health concerns in the last year.\u201d See Panchal and Lo (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2024<\/a>). <a href=\"#dbf69356-d11c-4580-8af5-0f30101613d6-link\" aria-label=\"Jump to footnote reference 105\">\u21a9\ufe0e<\/a><\/li><li id=\"d5be9c36-5816-48d1-b8dc-7855b561d596\"><em>Link provided to participants<\/em>: <a href=\"https:\/\/www.claymath.org\/millennium-problems\/\">https:\/\/www.claymath.org\/millennium-problems\/<\/a> <a href=\"#d5be9c36-5816-48d1-b8dc-7855b561d596-link\" aria-label=\"Jump to footnote reference 106\">\u21a9\ufe0e<\/a><\/li><li id=\"3f59b267-230c-4c22-8b5b-2095c003c706\"><em>Raw data<\/em>: IQR on the 50<sup>th<\/sup> percentile was (3.0%\u201320.0%). 90<sup>th<\/sup> percentile of median forecast: 44.5. <a href=\"#3f59b267-230c-4c22-8b5b-2095c003c706-link\" aria-label=\"Jump to footnote reference 107\">\u21a9\ufe0e<\/a><\/li><li id=\"0b5a5127-7d50-40c5-9ec3-68f02e8ccf83\"><em>Raw data<\/em>: IQR on the 50<sup>th<\/sup> percentile was (10.0%\u201350.0%). 90<sup>th<\/sup> percentile of median forecast: 65.4. <a href=\"#0b5a5127-7d50-40c5-9ec3-68f02e8ccf83-link\" aria-label=\"Jump to footnote reference 108\">\u21a9\ufe0e<\/a><\/li><li id=\"cefe5981-7c1a-4add-abf6-6847f9d23bd5\"><em>Raw data<\/em>: IQR on the 50<sup>th<\/sup> percentile was (30.3%\u201380.8%). 90<sup>th<\/sup> percentile of median forecast: 95.0. <a href=\"#cefe5981-7c1a-4add-abf6-6847f9d23bd5-link\" aria-label=\"Jump to footnote reference 109\">\u21a9\ufe0e<\/a><\/li><li id=\"c72e2948-fd92-4df8-9136-0c41fff551be\">The expert is referring to and quoting from Ansede (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">2025<\/a>). <a href=\"#c72e2948-fd92-4df8-9136-0c41fff551be-link\" aria-label=\"Jump to footnote reference 110\">\u21a9\ufe0e<\/a><\/li><li id=\"cf5f7d6c-56ee-4976-a10c-004354151b62\">The expert is referring to MacKenzie (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">1999<\/a>). <a href=\"#cf5f7d6c-56ee-4976-a10c-004354151b62-link\" aria-label=\"Jump to footnote reference 111\">\u21a9\ufe0e<\/a><\/li><li id=\"52fb1676-507a-472e-b319-e457143e4133\">The expert appears to be referring to an August 2025 post by OpenAI researcher Sebastien Bubeck (<a href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" id=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">Bubeck 2025<\/a>). <a href=\"#52fb1676-507a-472e-b319-e457143e4133-link\" aria-label=\"Jump to footnote reference 112\">\u21a9\ufe0e<\/a><\/li><li id=\"06475ed0-c168-4784-bda4-e7e8682711a3\">Recall from <a href=\"#public-sampling\">Public Sampling<\/a> that our public sample consists largely of highly engaged, past participants from FRI work. <a href=\"#06475ed0-c168-4784-bda4-e7e8682711a3-link\" aria-label=\"Jump to footnote reference 113\">\u21a9\ufe0e<\/a><\/li><li id=\"d34e17c1-44b3-48aa-b7da-41c3f2034432\">Examples of questions include: (1) What will be the National Average Temperature Rank for May 2025 in the contiguous United States, according to NOAA&#8217;s Climate Data Center, where 1 is the coolest and 130 is the highest rank representing the warmest on record?; and (2) What will be the closing stock price of Meta on 30 May 2025? <a href=\"#d34e17c1-44b3-48aa-b7da-41c3f2034432-link\" aria-label=\"Jump to footnote reference 114\">\u21a9\ufe0e<\/a><\/li><li id=\"f4eaac42-3c24-41ec-860f-14305bca01b1\">Survey: <a href=\"https:\/\/airtable.com\/appGCchUyUTPvT90e\/pagPpnUX2SiiTUNpp\/form\">https:\/\/airtable.com\/appGCchUyUTPvT90e\/pagPpnUX2SiiTUNpp\/form<\/a>. Project Team Contact: leap@forecastingresearch.org. <a href=\"#f4eaac42-3c24-41ec-860f-14305bca01b1-link\" aria-label=\"Jump to footnote reference 115\">\u21a9\ufe0e<\/a><\/li><li id=\"bccb8797-974a-42b0-801a-cbd08c5a401a\">See <a href=\"https:\/\/leap.forecastingresearch.org\/forecasts\">https:\/\/leap.forecastingresearch.org\/forecasts<\/a>. <a href=\"#bccb8797-974a-42b0-801a-cbd08c5a401a-link\" aria-label=\"Jump to footnote reference 116\">\u21a9\ufe0e<\/a><\/li><\/ol>\n\n\n<div class=\"wp-block-buttons is-layout-flex wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\"><a class=\"btn orange\" href=\"https:\/\/forecastingresearch.org\/pdf\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\" target=\"_blank\" rel=\"noreferrer noopener\">References and the appendix are provided in the full PDF report <svg width=\"7\" height=\"9\" viewBox=\"0 0 7 9\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\">\n  <path d=\"M0.000156283 8.60806L4.22416 4.33606V4.24006L0.000156283 6.10352e-05H1.80816L6.06416 4.28806L1.80816 8.60806H0.000156283Z\" fill=\"#102B23\"\/>\n<\/svg>\n<svg width=\"8\" height=\"10\" viewBox=\"0 0 8 10\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\">\n  <path d=\"M0.601719 8.85794L4.82572 4.58594V4.48994L0.601719 0.249939H2.40972L6.66572 4.53794L2.40972 8.85794H0.601719Z\" fill=\"#102B23\"\/>\n<\/svg><\/a><\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-buttons is-layout-flex wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\"><a class=\"btn orange\" href=\"https:\/\/leap.forecastingresearch.org\/\">Visit the LEAP website <svg width=\"7\" height=\"9\" viewBox=\"0 0 7 9\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\">\n  <path d=\"M0.000156283 8.60806L4.22416 4.33606V4.24006L0.000156283 6.10352e-05H1.80816L6.06416 4.28806L1.80816 8.60806H0.000156283Z\" fill=\"#102B23\"\/>\n<\/svg>\n<svg width=\"8\" height=\"10\" viewBox=\"0 0 8 10\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\">\n  <path d=\"M0.601719 8.85794L4.82572 4.58594V4.48994L0.601719 0.249939H2.40972L6.66572 4.53794L2.40972 8.85794H0.601719Z\" fill=\"#102B23\"\/>\n<\/svg><\/a><\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"The Longitudinal Expert AI Panel (LEAP) is a three-year project tracking the views of leading computer scientists, industry professionals, policy researchers, and economists on the trajectory of artificial intelligence. In this paper, we introduce LEAP and summarize findings from the first three waves.","protected":false},"featured_media":1677,"template":"","meta":{"footnotes":"[{\"id\":\"349340c1-373d-4eb1-85d3-28f508f33d90\",\"content\":\"Forecasters are denoted \\u201csuperforecasters\\u201d if they (1) were in the top 2% of the accuracy distribution in a given year of the Intelligence Advanced Research Projects Activity (IARPA) Aggregative Contingent Estimation (ACE) tournament (IARPA ACE Program n.d.; Mellers et al. 2014) or (2) they were a highly accurate performer on Good Judgment Open, an online continuous geopolitical forecasting tournament. Good Judgment Inc., which runs Good Judgment Open, then adds these top forecasters to the \\u201csuperforecaster\\u201d pool. Most superforecasters come from the first selection criteria. Mellers et al. (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2015<\\\/a>) finds persistent performance of these superforecasters across several years of geopolitical forecasting.\"},{\"id\":\"d15cee71-1108-43b3-b329-2dec7e6e33f9\",\"content\":\"Expert rationales averaged 92 words, with 25% of expert rationales exceeding 100 words and 8% exceeding 200 words. Among superforecasters, 47% of rationales exceeded 100 words and 23% exceeded 200 words.\"},{\"id\":\"d99b5bf5-3708-4392-8e7b-df7289f0c6aa\",\"content\":\"If not otherwise stated, we report values from the 50<sup>th<\\\/sup> percentile forecasts given by each expert. We elaborate on the use of quantile forecasting in <a href=\\\"#monthly-surveys-and-forecasting-questions\\\">Monthly Surveys and Forecasting Questions<\\\/a>.\"},{\"id\":\"66522d7b-d668-4484-884c-38aa6b6f8528\",\"content\":\"Respondents were shown a historical baseline value of 2%, based on an earlier version of the cited paper. A new version of the working paper estimates a range of 1.6% to 6.6%. We select the midpoint, 4.1%, as the historical baseline value.\"},{\"id\":\"92219e5a-e381-4fb7-a009-03912e8393c4\",\"content\":\"The median expert expects electricity consumption used for AI to rise to 12% by 2040.\"},{\"id\":\"fa6b7dcf-4e3b-48d1-9e52-b21ae96cd313\",\"content\":\"See <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=116\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=96\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix E.II., 4. Electricity Consumption<\\\/a> for information on the baseline estimate of 1.0%.\"},{\"id\":\"a7e95b27-899d-4afa-bdac-b90b259a26e9\",\"content\":\"Regarding their private AI investment indicator, Our World in Data (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2025<\\\/a>) notes: 1. \\u201cThe data likely underestimates total global AI investment, as it only captures certain types of private equity transactions, excluding other significant channels and categories of AI-related spending;\\u201d 2. \\u201cThe source does not fully disclose its methodology and what's included or excluded. This means it may not fully capture important areas of AI investment, such as those from publicly traded companies, corporate internal R&amp;D, government funding, public sector initiatives, data center infrastructure, hardware production, semiconductor manufacturing, and expenses for research and talent.\\u201d More details on what is likely excluded can be found at Our World in Data (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2025<\\\/a>).\"},{\"id\":\"4657eacc-2029-4fa9-8213-fa8ffe075025\",\"content\":\"The FrontierMath benchmark consists of math problems that resemble those a math PhD student might spend several days solving.\"},{\"id\":\"ee90ef6f-442e-457d-b06d-42db793b5c9e\",\"content\":\"This estimate of 23% reflects the fraction of experts whose median forecast is that AI systems will achieve performance of at least 90% (which we call saturation) on Tiers 1\\u20133 of FrontierMath. We take the average of the proportions calculated under weak and strict inequality.\"},{\"id\":\"8dae5ba0-5fad-4f9b-90c4-526705f192b7\",\"content\":\"The Millennium Prize Problems are seven mathematical problems identified by the Clay Mathematics Institute in 2000 as the most important unsolved questions in mathematics (C<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">lay Mathematics Institute n.d.<\\\/a>). Only one has been solved to date.\"},{\"id\":\"b6814db1-908d-4c41-a402-60e57447bb67\",\"content\":\"Respondents were shown a historical baseline value of 2%, based on an earlier version of the cited paper. A new version of the working paper estimates a range of 1.6% to 6.6%. We select the midpoint, 4.1%, as the historical baseline value.\"},{\"id\":\"63d51702-88b7-44e8-946b-9c7393b547e7\",\"content\":\"A forecaster expects the realized outcome to be below their 25<sup>th<\\\/sup> percentile forecast in just 25% of cases, compared to 50% of cases for a median forecast. We additionally ask for a 75<sup>th<\\\/sup> percentile forecast whenever we collect a 25<sup>th<\\\/sup> percentile forecast.\"},{\"id\":\"dbfb006f-8326-4aa7-9c86-c321c5a2b32e\",\"content\":\"Forecasters are asked what percentage of sales revenue from recently FDA-approved drugs will come from those discovered using AI methods available after 2022. See <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=114\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix E.II. 3. Drug Discovery<\\\/a>.\"},{\"id\":\"83eadd21-22ab-44de-83c1-39f0f652accd\",\"content\":\"Note that for two categorical questions about overall AI scenarios, we report averages instead of medians. Since respondents assign probabilities that sum to 100%, we use average aggregation to maintain this property.\"},{\"id\":\"e264401d-38b7-41d7-8e5a-6e77ec5c63f2\",\"content\":\"To assess the extent to which low-effort or relatively lower comprehension from the public could drive these results, we compare members of our public with high levels of forecasting accuracy in other studies to those with low accuracy. We do not find that one group systematically expects more or less progress. <a href=\\\"#public-accuracy-stratification\\\">Public Accuracy Stratification<\\\/a> details this analysis.\"},{\"id\":\"9fa4990e-410c-4d97-a3de-a3b1e47a6f5d\",\"content\":\"While surveys show that fewer than half of Americans report using AI products (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">NORC 2025<\\\/a>), nearly all (99%) actually use AI-enabled tools like navigation apps, streaming services, and social media weekly (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Maese 2025<\\\/a>); this gap reveals that AI has become ubiquitous in some applications, but is sometimes invisible to users.\"},{\"id\":\"57335b56-eb62-469f-a244-c2dbbcd32a50\",\"content\":\"AI is already accelerating scientific discoveries across a wide range of disciplines such as medicine and materials science (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Dai et al. 2025<\\\/a>; <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Kay 2025<\\\/a>; <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Russell et al. 2023<\\\/a>; <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Stanford Medicine News Center 2025<\\\/a>; <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Sundermier 2024<\\\/a>). While some of these claims may be overblown or overstated to encourage media attention, AI is certainly affecting many aspects of science and research.\"},{\"id\":\"1f53487e-20fd-4bed-9901-426a33ef65ec\",\"content\":\"Despite public figures predicting extensive job destruction\\u2014for example Jamie Dimon speaking at Fortune\\u2019s Most Powerful Women summit: \\u201c[AI] will eliminate jobs. People should stop sticking their head in the sand\\u201d (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Gerut 2025<\\\/a>)\\u2014overall employment effects of AI remain small (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Chandar 2025<\\\/a>; <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Gimbel et al. 2025<\\\/a>; <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Eckhardt and Goldschlag 2025<\\\/a>) , the impact of rapid technological change on jobs in the short run is ambiguous, and in the longer run is neutral under standard economic models (e.g., <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Aghion, Jones, and Jones 2018<\\\/a>; <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Agrawal, Gans, and Goldfarb 2019<\\\/a>). Some recent evidence suggests that AI may be contributing to significant declines in entry-level hiring for workers in especially AI-exposed occupations (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Brynjolfsson, Chandar, and Chen 2025<\\\/a>), with those workers experiencing increased wages. This could suggest a negative supply shock, rather than a negative demand shock from substitution to AI. The ambiguity of technological change arises because automating some human processes often augments others (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Agrawal, Gans, and Goldfarb 2023<\\\/a>).\"},{\"id\":\"ae2ddc53-1ace-4f33-8208-e055f402bd19\",\"content\":\"Jason Furman, in an interview with Ross Douthat, estimates that 92% of the increase in demand in the U.S. in the first half of 2025 comes from categories related to AI investment and services (information processing equipment and software). Accounting for equilibrium effects, Furman posits that roughly half of GDP growth is from the AI boom (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Douthat 2025<\\\/a>). This view is echoed by Karen Dynan, an economics professor at Harvard University, who argues, \\u201cin a mechanical sense it\\u2019s fair to say that AI has been the main driver of U.S. GDP growth this year\\u201d (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Curran and Niquette 2025<\\\/a>).\"},{\"id\":\"17c90b83-c2e9-4427-8fc0-369dd9597752\",\"content\":\"Demis Hassabis: \\u201cI think one day maybe we can cure all disease with the help of AI.\\u201d (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Hassabis 2025<\\\/a>).\"},{\"id\":\"d2c807a0-8b97-43c6-a1f7-3c7c1e3fdd3f\",\"content\":\"\\u201cThere Will Be Very Hard Parts like Whole Classes of Jobs Going Away\\u201d (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Altman 2025<\\\/a>).\"},{\"id\":\"865aa83d-5511-4739-98e3-e025ab768709\",\"content\":\"\\u201c[A] dream scenario\\u2014perhaps a goal to aim for\\u2014would be 20% annual GDP growth rate in the developing world\\u201d (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Amodei 2024<\\\/a>).\"},{\"id\":\"579fdc13-94d7-4f0e-9228-b858d3ceee3b\",\"content\":\"Forecasters are denoted \\u201csuperforecasters\\u201d if they (1) were in the top 2% of the accuracy distribution in a given year of the IARPA ACE tournament (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">IARPA ACE Program n.d.<\\\/a>; <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Mellers et al. 2014<\\\/a>) or (2) they were a highly accurate performer on Good Judgment Open, an online continuous geopolitical forecasting tournament. Good Judgment Inc., which runs Good Judgment Open, then adds these top forecasters to the \\u201csuperforecaster\\u201d pool. Most superforecasters come from the first selection criteria. Mellers et al. (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2015<\\\/a>) finds persistent performance of these superforecasters across several years of geopolitical forecasting.\"},{\"id\":\"9fc23d2a-f118-4f4e-8692-d1172ee6fddc\",\"content\":\"\\u201cRepresentative\\u201d is a contested, and sometimes fraught concept in social science (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Chasalow and Levy 2021<\\\/a>). Here, we adopt a narrow definition for our expert sample. We specify an expert sampling frame that we believe closely tracks groups of experts that policymakers are most inclined to consult about AI. We then apply standard survey reweighting methods to combat nonresponse bias, reweighting our respondent sample to match the observable characteristics in our initial sampling frame. For our public sample, we use known characteristics of the U.S. population as our reweighting targets. See <a href=\\\"#reweighting\\\">Reweighting<\\\/a> for more details and a discussion of limitations of this approach.\"},{\"id\":\"889cae6c-7ce7-4e15-8357-1ccdecf3e7a0\",\"content\":\"The <a href=\\\"#monthly-surveys-and-forecasting-questions\\\">Monthly Surveys and Forecasting Questions<\\\/a> section contain further detail on resolution criteria.\"},{\"id\":\"93825f9e-0f1b-4259-92b6-3e3c903b5ea5\",\"content\":\"As typically specified, the \\u201cwisdom of the crowd\\u201d phenomenon appeals to independent and unbiased judgments, which lead the aggregate to outperform randomly selected components of the aggregate (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Davis-Stober et al. 2014<\\\/a>). An extensive literature develops improved aggregation approaches (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Baron et al. 2014<\\\/a>; <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Himmelstein, Budescu, and Han 2023<\\\/a>; <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Himmelstein, Budescu, and Ho 2023<\\\/a>), but a key theme of that work is that a simple median performs quite well as an aggregation mechanism across contexts.\"},{\"id\":\"064d21f7-b3d1-4317-bef2-4e8847e58d17\",\"content\":\"LEAP identifies distinct target populations (experts, superforecasters, and the general public) whose views we wish to study. We carefully construct sampling frames, or the part of the target populations that have a chance of being sampled, using multiple data sources to maximize the coverage of these populations. See <a href=\\\"#sampling\\\" id=\\\"#sampling\\\">Sampling<\\\/a> for more information on how sampling frames are constructed.\"},{\"id\":\"61f5b523-52cd-481a-a891-9ef64a3e70b1\",\"content\":\"Top citations in AI publications have also been used to target experts (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Muller and Bostrom 2014<\\\/a>), as well as a broader search of the literature using targeted publication classification codes (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">O\\u2019Donovan et al. 2025<\\\/a>).\"},{\"id\":\"6346c640-6888-47ea-ba4b-d11ef7f533f7\",\"content\":\"Participation from the general public in these types of surveys relies on well-established panels (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">McClain et al. 2025<\\\/a>), online opinion polling platforms (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Zhang and Defoe 2019<\\\/a>), or by identifying non-experts with a demonstrated interest in AI (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Walsh 2017<\\\/a>).\"},{\"id\":\"1f9b4d46-19b4-462a-b266-41e84b18c234\",\"content\":\"These individuals, labeled \\u201csuperforecasters,\\u201d are distinguished by their ability to sustain high-accuracy forecasts and \\u201cavoid regression to the mean\\u201d across multiple prediction instances (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Mellers et al. 2015<\\\/a>). This was demonstrated by findings from three consecutive years of geopolitical forecasting tournaments conducted by the Good Judgment Project under the U.S. Intelligence Advanced Research Projects Activity (IARPA).\"},{\"id\":\"ae7ac65a-0fe7-47f1-b51b-2b2c644162e8\",\"content\":\"Muller and Bostrom (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2014<\\\/a>) find experts working in theoretical AI were more likely to respond and more likely to be concerned about the negative effects of AI. Grace et al. (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2018<\\\/a>) find respondents to have less time in the field and lower citation indexes which in turn was associated with more optimistic views on the timing of human-level machine intelligence (HLMI). O\\u2019Donovan et al. (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2025<\\\/a>) find an association between views on AI safety and governance and respondent views on the inevitability of HLMI, as well as an association between their categorization of respondents as AI optimists and the inevitability of AI.\"},{\"id\":\"6d01365e-a7d7-4422-b04e-aeafbefcdbd7\",\"content\":\"A panel survey is a survey that repeatedly collects data from the same group of respondents over time, and LEAP is the first continuous panel survey of AI experts. There is one other repeated expert sample of AI experts that we have identified in the literature: Zhang et al. (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2022<\\\/a>) and Stein-Perlman and Grace (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2022<\\\/a>) do \\u201cmatched panel\\u201d analysis by matching their respondents to earlier responses in Grace et al. (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2018<\\\/a>).\"},{\"id\":\"7f3f6025-dedc-4c9c-82a9-1eda5a37187f\",\"content\":\"Nevertheless, the creation of unambiguous forecasting questions remains difficult. We dropped one question on the use of AI-use during K-12 instructional hours from this analysis, due to substantial misinterpretations.\"},{\"id\":\"84f8937b-1ffb-4a3f-94a4-ba5d8b3bba48\",\"content\":\"Some of the cited studies extrapolate forecasts backwards to obtain estimates for earlier dates, but these methods require parametric assumptions and generally do not allow for discontinuities in the time paths of forecasts.\"},{\"id\":\"6401e4e8-3ac9-49bd-9b42-4cf3fa0a82de\",\"content\":\"Forecasters are denoted \\u201csuperforecasters\\u201d if they (1) were in the top 2% of the accuracy distribution in a given year of the IARPA ACE tournament (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">IARPA ACE Program n.d.<\\\/a>; <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Mellers et al. 2014<\\\/a>) or (2) they were a highly accurate performer on Good Judgment Open, an online continuous geopolitical forecasting tournament. Good Judgment Inc., which runs Good Judgment Open, then adds these top forecasters to the \\u201csuperforecaster\\u201d pool. Most superforecasters come from the first selection criteria. Mellers et al. (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2015<\\\/a>) finds persistent performance of these superforecasters across several years of geopolitical forecasting.\"},{\"id\":\"e994d151-c382-4662-965f-c28352b5d1d5\",\"content\":\"Highly-engaged participants were recruited from previous high effort projects spanning multiple weeks.\"},{\"id\":\"0409d0c2-0b62-47b0-a0c0-2506997a6fdc\",\"content\":\"Pew describes \\u201craking,\\u201d also known as iterative proportional fitting, as the most common approach to reweighting public opinion surveys, <a href=\\\"https:\\\/\\\/www.pewresearch.org\\\/methods\\\/2018\\\/01\\\/26\\\/how-different-weighting-methods-work\\\/#:~:text=For%20public%20opinion%20surveys%2C%20the,each%20variable%20used%20in%20weighting.\\\" id=\\\"https:\\\/\\\/www.pewresearch.org\\\/methods\\\/2018\\\/01\\\/26\\\/how-different-weighting-methods-work\\\/#:~:text=For%20public%20opinion%20surveys%2C%20the,each%20variable%20used%20in%20weighting.\\\">\\u201cFor public opinion surveys, the most prevalent method for weighting is iterative proportional fitting, more commonly referred to as raking\\u201d<\\\/a> (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Mercer et al. 2018<\\\/a>).\"},{\"id\":\"0f8c5e8c-45d5-4684-9430-7bd8115fa6ee\",\"content\":\"Effective Altruism (EA) is a philosophical and social movement focused on directing resources towards improving the world. It is not a monolithic group. Many effective altruists strongly disagree about how best to direct resources towards improving the world, the philosophical framework that determines what an \\u2018improvement\\u2019 means, the risks one should take to improve the world, and the focus one should place on the short- and long-run in pursuit of improving the world. We consider someone EA-affiliated if they or their employer have or have had funding ties to EA, they publicly endorse EA, or they self-identify as EA-affiliated or EA-adjacent. Further, if their work focuses on EA, AI safety, global catastrophic risks, or existential risks, we consider them to be EA-affiliated. These criteria are permissive and favor overinclusion, as we chose to measure an upper bound of EA affiliation. Nevertheless, the proportion of our frame tagged as having EA affiliation remains relatively small, at 14.3%.\"},{\"id\":\"67c8e553-de80-40a5-8016-37d549db3725\",\"content\":\"We define \\u201ctop AI labs\\u201d as the 10 unique labs which provide the 20 most computationally intensive models (in terms of training FLOP) on Epoch AI\\u2019s <a href=\\\"https:\\\/\\\/epoch.ai\\\/data\\\/ai-models\\\" id=\\\"https:\\\/\\\/epoch.ai\\\/data\\\/ai-models\\\">Data on AI Models<\\\/a> table (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Epoch AI 2024b<\\\/a>).\"},{\"id\":\"b7a4e61c-0e08-472b-8df1-2381b8e75f7b\",\"content\":\"As seen in the table below, we split our expert population into four categories of expertise: Computer Science, Economics, Industry and Policy. For the purposes of reweighting, we equally weight these categories. Our initial frame slightly overrepresented Computer Science and Industry professionals and underrepresented Economics and Policy professionals.\"},{\"id\":\"8c49426e-4ad8-40a9-85a3-f09d3efd32ca\",\"content\":\"We use the <a href=\\\"https:\\\/\\\/doi.org\\\/10.18128\\\/D010.V16.0\\\">IPUMS USA combined Census and American Community Survey (ACS) data<\\\/a> to derive population targets for all variables except party identification (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Ruggles et al. 2025<\\\/a>). For party identification, we use <a href=\\\"https:\\\/\\\/www.pewresearch.org\\\/politics\\\/2024\\\/04\\\/09\\\/the-partisanship-and-ideology-of-american-voters\\\/\\\" id=\\\"https:\\\/\\\/www.pewresearch.org\\\/politics\\\/2024\\\/04\\\/09\\\/the-partisanship-and-ideology-of-american-voters\\\/\\\">data from Pew Research Center<\\\/a> (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Nadeem 2024<\\\/a>).\"},{\"id\":\"3e1cd8cc-472b-4a6f-b1ab-ab02415bd022\",\"content\":\"Note that the actual proportion of participants invited from Industry was 36%. For the purposes of reweighting, each respondent category of expertise was weighted equally as discussed above.\"},{\"id\":\"31f3e343-943a-4f7c-b254-30b03fd86a60\",\"content\":\"These respondents are either 200 top-cited according to OpenAlex, or part of our age-stratified CS author list.\"},{\"id\":\"98cb1a57-9113-4df3-b1a4-4ac0efac4e9d\",\"content\":\"Research rankings for economics schools and journals, based on publication records. Authors at an institution hosting one of the top-50 economics departments according to RePEc are considered.\"},{\"id\":\"41549962-1a59-45e2-a739-8bb71e8a1b2f\",\"content\":\"The percentage decrease in completions from Wave 2 to 3 per category were as follows: Economics: 1.7%, Superforecasters: 3.5%, Public: 3.8%, Computer Science: 4.7%, Industry: 5.3%, Policy: 7.2%.\"},{\"id\":\"10c0bd6e-dc02-4b64-8601-2383555bcf25\",\"content\":\"The survey platform records the active time spent on the survey window on respondents\\u2019 devices. If they leave the page (even to do additional research relevant to the survey), the timer is paused. Hence, this measure likely serves as a lower bound for time spent on the survey.\"},{\"id\":\"2a066104-34d3-43c9-916f-65ad3ecd00c2\",\"content\":\"We provide expert participants with $2,000 per each year of full participation (prorated for the number of surveys they complete, with an expectation that we will complete 12 surveys in a typical year). In other words, experts receive $166.67 per survey completed. We provide superforecasters with $1,000, prorated similarly, or $83.33 per survey completed. We pay public participants in line with CloudResearch platform norms, or $8 per survey completed ($13.71 per hour).\"},{\"id\":\"b8186e41-dd7f-49b2-820c-dc0836183480\",\"content\":\"Respondents are prompted to give their forecasts in the form of probabilities.\"},{\"id\":\"fbb1f957-9891-42fa-8c14-a32faa4e4b21\",\"content\":\"Respondents are prompted to give their forecasts in the form of 25<sup>th<\\\/sup> , 50<sup>th<\\\/sup> , and 75<sup>th<\\\/sup> percentiles.\"},{\"id\":\"9753e07e-ab53-40f2-a948-77cfa0cacb13\",\"content\":\"The pooled distribution is not necessarily the optimal way to aggregate forecasts in terms of forecasting accuracy. For example, Ranjan and Gneiting (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2010<\\\/a>) show that combining well-calibrated forecasts in this fashion yields forecasts that are miscalibrated. Similar results are reported by Lichtendahl et al (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2013<\\\/a>).\"},{\"id\":\"ab935eb6-3a6a-45e4-84b4-e5c4214e247c\",\"content\":\"In future work, we will summarize forecaster-level gaps between 25<sup>th<\\\/sup> and 75<sup>th<\\\/sup> percentile forecasts.\"},{\"id\":\"98818581-9957-4f4d-ab1f-c4945458f22a\",\"content\":\"<em>Pooled distribution<\\\/em>: IQR (7.3%\\u201334.6%); variance decomposition: 47% between\\u2013forecaster disagreement, 53% within\\u2013forecaster uncertainty. <br><em>Raw data<\\\/em>: IQR on the 50<sup>th<\\\/sup> percentile was (9%\\u201330%); median 25<sup>th<\\\/sup> and 75<sup>th<\\\/sup> percentile forecasts were 9% and 28% respectively. <br>Uncertainty and disagreement metrics on other claims made in this list can be found in the Monthly Reports at <a href=\\\"https:\\\/\\\/leap.forecastingresearch.org\\\/reports\\\/\\\">https:\\\/\\\/leap.forecastingresearch.org\\\/reports\\\/<\\\/a>.\"},{\"id\":\"2b7fae99-fc07-44f2-b81f-3174831227bb\",\"content\":\"Respondents were shown a historical baseline value of 2%, based on an earlier version of the cited paper. The most recent draft gives a range of 1.6% to 6.6%. We select the midpoint, 4.1%, as the historical baseline value.\"},{\"id\":\"8cc15fdf-409f-4f29-98f6-67baaf97c59d\",\"content\":\"Sentences of the form, \\u201cThe median expert gives an X% chance,\\u201d report the median of experts\\u2019 X<sup>th<\\\/sup> percentile forecasts.\"},{\"id\":\"23bfde2d-f28b-477d-8655-ea9c96555e8f\",\"content\":\"Regarding their private AI investment indicator, Our World in Data (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2025<\\\/a>) notes: 1. \\u201cThe data likely underestimates total global AI investment, as it only captures certain types of private equity transactions, excluding other significant channels and categories of AI-related spending;\\u201d 2. \\u201cThe source does not fully disclose its methodology and what's included or excluded. This means it may not fully capture important areas of AI investment, such as those from publicly traded companies, corporate internal R&amp;D, government funding, public sector initiatives, data center infrastructure, hardware production, semiconductor manufacturing, and expenses for research and talent.\\u201d More details on what is likely excluded can be found at Our World in Data (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2025<\\\/a>).\"},{\"id\":\"c2d6b13f-32f9-40bb-b188-ddff67dc7421\",\"content\":\"This estimate of 23% reflects the fraction of experts whose median forecast is that AI systems will achieve performance of at least 90% (which we call saturation) on Tiers 1-3 of FrontierMath. We take the average of the proportions calculated under weak and strict inequality.\"},{\"id\":\"8acd763f-4363-457b-a67c-090e56541707\",\"content\":\"<em>Raw data<\\\/em>: IQR on the 50<sup>th<\\\/sup> percentile was (10%\\u201330%).\"},{\"id\":\"bdd05175-27a9-4dcd-aa6b-f7c5de77458a\",\"content\":\"See <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=98\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=98\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix E.I. Survey Questions: Wave 1<\\\/a> for background information. We ask participants the probability that LEAP panelists will choose \\u201cslow progress,\\u201d \\u201cmoderate progress,\\u201d or \\u201crapid progress\\u201d as best matching the general level of AI progress.\"},{\"id\":\"f7f7e06f-3b9e-412d-afe6-7537f6485a8d\",\"content\":\"See <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=98\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=98\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix E.I. Survey Questions: Wave 1<\\\/a> for background information\"},{\"id\":\"f5441114-d01f-46a5-8871-c9bbaeb2942e\",\"content\":\"<em>Raw data<\\\/em>: IQR on the 50<sup>th<\\\/sup> percentile was (-4%\\u20135%)\"},{\"id\":\"41a645ad-d348-452d-9788-c893693a4f45\",\"content\":\"We have a complementary survey in the field exploring these topics which we plan to release results from in early 2026.\"},{\"id\":\"cad14124-84e3-405a-8c44-d454bc9b91c3\",\"content\":\"See <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=108\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=108\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix E.I. Survey Questions: Wave 1, Question 5, Technological Richter Scale<\\\/a> for details.\"},{\"id\":\"5d86a372-9fcf-43bb-a540-84cfcdf20a1e\",\"content\":\"The median forecast for this question was 25%. <br><em>Pooled distribution<\\\/em>: IQR (8.4%\\u201353.8%); variance decomposition: 52% between\\u2013forecaster disagreement, 48% within\\u2013forecaster uncertainty. <br><em>Raw data<\\\/em>: IQR on the 50<sup>th<\\\/sup> percentile was (10%\\u201350%); median 25<sup>th<\\\/sup> and 75<sup>th<\\\/sup> percentile forecasts were 10% and 43% respectively.\"},{\"id\":\"d3f78ea9-9e62-4a05-95f9-937aa2e600e9\",\"content\":\"The median forecast for this question was 60%.\"},{\"id\":\"0f876b32-154f-4ffd-81cd-cb565602b4f8\",\"content\":\"See <a href=\\\"https:\\\/\\\/leap.forecastingresearch.org\\\/reports\\\/\\\">https:\\\/\\\/leap.forecastingresearch.org\\\/reports\\\/<\\\/a> to access these tables.\"},{\"id\":\"1c277265-12d5-46e7-86a4-18ea84d9eedb\",\"content\":\"<em>Raw data<\\\/em>: The median forecast for this question was 20%. IQR on the 50<sup>th<\\\/sup> percentile was (10%\\u201330%).\"},{\"id\":\"d5650a95-e72c-4f84-9fb3-40613353d20e\",\"content\":\"See <a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=98\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=98\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Appendix E.I. Survey Questions: Wave 1<\\\/a> for background information. We ask participants the probability LEAP panelists will choose \\u201cslow progress,\\u201d \\u201cmoderate progress,\\u201d or \\u201crapid progress\\u201d as best matching the general level of AI progress.\"},{\"id\":\"d11df40e-72a1-411b-8fde-3f2d502bf544\",\"content\":\"<em>Raw data<\\\/em>: IQR on the 50<sup>th<\\\/sup> percentile was (-4%\\u20135%).\"},{\"id\":\"6feae762-cbb5-456d-8f2d-9c0ecc33f95a\",\"content\":\"In a future survey wave, we plan to collect forecasts of the predicted relationship between AI capabilities and employment growth in each sector by asking respondents to forecast employment growth conditional on low-, moderate-, and rapid-progress scenarios.\"},{\"id\":\"88000fb8-dbe0-4c1e-bdba-6f267cd8ccbf\",\"content\":\"<em>Raw data<\\\/em>: IQR on the 50<sup>th<\\\/sup> percentile was (30%\\u201381%).\"},{\"id\":\"2997cdbe-89c3-4246-aa80-9590826ba717\",\"content\":\"<em>Raw data<\\\/em>: IQR on the 50<sup>th<\\\/sup> percentile was (10%\\u201350%).\"},{\"id\":\"cbd0269b-114c-4ad3-9bd8-1b550056f9d2\",\"content\":\"The degree to which progress on Millennium Prize Problems is serial or parallel, as well as the general difficulty of the Problems, complicates this comparison. Eliciting forecasts from multiple experts on consistent forecasting questions with clear resolution criteria helps us bring clarity to debates often plagued by ambiguous definitions.\"},{\"id\":\"934b8442-9d4f-4d36-8120-1843f29193d6\",\"content\":\"Questions include FrontierMath, Autonomous Vehicle Trips, Millennium Prize, Diffusion of AI Across Sciences, Drug Discovery, Electricity Consumption, Cognitive Limitations, AI Investment, Generative AI Use Intensity, Open vs Proprietary Polarity, AI Companions, Barriers to Adoption, General AI progress, and Technological Richter Scale.\"},{\"id\":\"9d6dd258-27c6-4bbf-a8a2-d7b72fb8f0f9\",\"content\":\"Forecasting questions with clear valence have an unambiguous directional association with progress. For some questions, like employment by sector, it is unclear whether higher or lower levels of unemployment would be associated with more advanced or less advanced AI progress, so we exclude those questions from this analysis. We also transform some forecasts to establish the progress valence. First, we take the average across all fields in Diffusion of AI Across Sciences. Second, for Cognitive Limitations and Barriers to Adoption, we average across all categories and use the complementary probability. Third, we take the average of closed- and open-weight performance for Open vs Proprietary Polarity. Lastly, we take the values assigned to the \\u201cRapid\\u201d scenario and TRS levels 8 and above for the General AI Progress and Technological Richter Scale questions, respectively.\"},{\"id\":\"7e5ba175-383b-4ed0-a966-7bd43feb4245\",\"content\":\"We use Mann\\u2013Whitney U tests for equality in distribution unless otherwise stated, with a 5% significance threshold. All Mann\\u2013Whitney U tests and Cliff\\u2019s <em>\\u03b4<\\\/em> values are currently unweighted.\"},{\"id\":\"6dc2d733-d15e-453b-8e68-32880d3cca1c\",\"content\":\"We claim a group predicts statistically significantly less progress according to a Mann\\u2013Whitney U test <em>and<\\\/em> a negative Cliff\\u2019s <em>\\u03b4<\\\/em>.\"},{\"id\":\"2ff68abb-a816-4876-958f-ba26747c8edf\",\"content\":\"Cliff\\u2019s <em>\\u03b4<\\\/em> performs pairwise comparisons between all values of two empirical distributions. It takes the number of comparisons where the value from the first distribution exceeds the second and subtracts the number of comparisons where the value from the second distribution exceeds the first and reports this difference as a proportion of the count of comparisons. In other words, it reports the probability that a randomly drawn value from the first distribution exceeds a randomly drawn value from the second distribution, over and above what would be expected by pure random chance if the two distributions were identical.\"},{\"id\":\"888c9df3-1e71-4349-9a83-a8f74371c6ce\",\"content\":\"See <a href=\\\"https:\\\/\\\/leap.forecastingresearch.org\\\/reports\\\/\\\">https:\\\/\\\/leap.forecastingresearch.org\\\/reports\\\/<\\\/a> to access these tables.\"},{\"id\":\"328076b5-8365-43d5-abc4-cc93d7587a88\",\"content\":\"<em>Pooled distribution<\\\/em>: IQR (6.9%\\u201346.1%); variance decomposition: 47% between\\u2013forecaster disagreement, 53% within\\u2013forecaster uncertainty. <em>Raw data<\\\/em>: IQR on the 50<sup>th<\\\/sup> percentile was (10%\\u201340%); median 25<sup>th<\\\/sup> and 75<sup>th<\\\/sup> percentile forecasts were 8% and 35% respectively.\"},{\"id\":\"3f0107f8-f99a-4f11-9abf-973f0331d9a1\",\"content\":\"<em>Pooled distribution<\\\/em>: IQR (4%\\u201331%); variance decomposition: 66% between\\u2013forecaster disagreement, 34% within\\u2013forecaster uncertainty. <br><em>Raw data<\\\/em>: IQR on the 50<sup>th<\\\/sup> percentile was (5%\\u201329%); median 25<sup>th<\\\/sup> and 75<sup>th<\\\/sup> percentile forecasts were 5% and 20% respectively. <br>Uncertainty and disagreement metrics on other claims made in this list can be found in the Monthly Reports at <a href=\\\"https:\\\/\\\/leap.forecastingresearch.org\\\/reports\\\/\\\">https:\\\/\\\/leap.forecastingresearch.org\\\/reports\\\/<\\\/a>.\"},{\"id\":\"3467dc4d-ea16-46eb-98f5-47a868923605\",\"content\":\"\\\"To gauge the difficulty of FrontierMath problems, we organized a competition at MIT involving around 40 exceptional math undergraduates and subject-matter experts. Participants formed eight teams of four or five members, each with internet access, and had four and a half hours to solve 23 problems. On a subset of 23 tier 1-3 problems, the average team scored 19%, while 35% of the problems were solved collectively across all teams.\\\" (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Epoch AI 2025<\\\/a>).\"},{\"id\":\"23a598cf-525c-4d4e-ac54-155fb5d62df1\",\"content\":\"This estimate of 23% reflects the fraction of experts whose median forecast is that AI systems will achieve performance of at least 90% (which we call saturation) on Tiers 1\\u20133 of FrontierMath. We take the average of the proportions calculated under weak and strict inequality.\"},{\"id\":\"59e81bf9-27c0-408e-9e27-224496185e50\",\"content\":\"Physics: 32%; Materials Science: 37%; Medicine: 37%.\"},{\"id\":\"80d423ed-cb7d-4acf-8d3e-f74c9ea10fcb\",\"content\":\"Physics: 27%; Materials Science: 30%; Medicine: 30%.\"},{\"id\":\"54630412-1747-4c6c-8b27-9bb8eaf39c0d\",\"content\":\"For this comparison, we switch to reporting Cliff\\u2019s \\u03b4 values calculated with the public as the first distribution.\"},{\"id\":\"cd6bbf30-b63a-4157-a9b6-136fd4cb82dd\",\"content\":\"The median forecast was 23%. <em>Raw data<\\\/em>: IQR on the 50<sup>th<\\\/sup> percentile was (12%\\u201335%)\"},{\"id\":\"31ae1fcd-f1a7-4a0e-8954-f07e28840e22\",\"content\":\"The median forecast was 20%. <em>Raw data<\\\/em>: IQR on the 50<sup>th<\\\/sup> percentile was (10%\\u201330%)\"},{\"id\":\"38196ecc-fcfc-4953-b552-fee38be2bf97\",\"content\":\"<em>Pooled distribution<\\\/em>: IQR (6.9%\\u201346.2%); variance decomposition: 47% between\\u2013forecaster disagreement, 53% within\\u2013forecaster uncertainty. <br><em>Raw data<\\\/em>: IQR on the 50<sup>th<\\\/sup> percentile was (10%\\u201340%); median 25<sup>th<\\\/sup> and 75<sup>th<\\\/sup> percentile forecasts were 8% and 35% respectively.\"},{\"id\":\"0db06cd8-5f5f-4038-93ff-a68285545c4d\",\"content\":\"<em>Pooled distribution<\\\/em>: IQR (3%\\u201325%); variance decomposition: 48% between\\u2013forecaster disagreement, 52% within\\u2013forecaster uncertainty. <br><em>Raw data<\\\/em>: IQR on the 50<sup>th<\\\/sup> percentile was (3%\\u201325%); median 25<sup>th<\\\/sup> and 75<sup>th<\\\/sup> percentile forecasts were 4% and 20% respectively. <br>Uncertainty and disagreement metrics on other claims made in this paragraph can be found in the Monthly Reports at <a href=\\\"https:\\\/\\\/leap.forecastingresearch.org\\\/reports\\\/\\\">https:\\\/\\\/leap.forecastingresearch.org\\\/reports\\\/<\\\/a>.\"},{\"id\":\"f6aa4b20-0c7b-4251-a1d4-a3629b227fc0\",\"content\":\"See <a href=\\\"https:\\\/\\\/leap.forecastingresearch.org\\\/reports\\\/\\\">https:\\\/\\\/leap.forecastingresearch.org\\\/reports\\\/<\\\/a> to access these tables.\"},{\"id\":\"a8016211-a299-4c2c-a058-715e656b8533\",\"content\":\"Experts were asked, \\u201cWhat will be the highest percentage accuracy achieved by an AI model on FrontierMath, by Jan 1 of 2026, 2028, and 2031?\\u201d\"},{\"id\":\"c1891d77-58c4-4aca-8ca2-d9c550549066\",\"content\":\"Actual progress was marginally slower. The top Tier 1-3 accuracy rate rose from 1.03% in June of 2024 to 29% in August of 2025, where it remained as of the publication of this paper.\"},{\"id\":\"150075bf-55f0-4ce6-8676-ee214749eb17\",\"content\":\"Experts were asked, \\u201cWhat percentage of U.S. ride-hailing trips will be provided by autonomous vehicles that are classified SAE Level 4 or above in the years 2027 and 2030?\\u201d\"},{\"id\":\"15dc6f35-e4d0-4ebb-a421-50422463048e\",\"content\":\"Experts were asked, \\u201cWhat will the percent change in the number of jobs (compared to Jan 1, 2025) in the U.S. be for white-collar, blue-collar, and service occupations, by Jan 1 of 2028 and 2031?\\u201d\"},{\"id\":\"45326349-1cc9-4056-a9ee-e162931dae19\",\"content\":\"Experts were presented with three scenarios that detailed the development of AI capabilities and asked, \\u201cAt the end of 2030, what percent of LEAP panelists will choose \\u201cslow progress,\\u201d \\u201cmoderate progress,\\u201d or \\u201crapid progress\\u201d as best matching the general level of AI progress?\\u201d\"},{\"id\":\"baac8fbb-0b1e-44ee-b779-3704da1d4dc3\",\"content\":\"Actual rate of improvement likely falls within this range, especially given recent acceleration trends, but there is considerable uncertainty and domain variability. See: <a href=\\\"https:\\\/\\\/metr.org\\\/blog\\\/2025-03-19-measuring-ai-ability-to-complete-long-tasks\\\/\\\">https:\\\/\\\/metr.org\\\/blog\\\/2025-03-19-measuring-ai-ability-to-complete-long-tasks\\\/<\\\/a> for more information.\"},{\"id\":\"25fab556-1fa8-43b8-9d4e-72a8e47447c7\",\"content\":\"Nate Silver\\u2019s book \\u201cOn the Edge\\u201d proposes the technological Richter scale (TRS) which, analogous with earthquake magnitudes, rates the impact of technologies on a roughly logarithmic scale with10 representing that greatest impact (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Silver 2024<\\\/a>). Experts were asked, \\u201cAt the end of 2040, what is the probability for AI achieving the following levels of net impact [ranging from 5-10] on human society as compared to the impact of past technological events?\\u201d\"},{\"id\":\"303a3c69-d754-4e8c-8bba-63ac24995998\",\"content\":\"Experts were asked, \\u201cWhat is the probability that AI will solve or substantially assist in solving a Millennium Prize Problem in mathematics by Dec 31 of 2027, 2030, and 2040?\\u201d\"},{\"id\":\"cabcd0dd-a9c8-4846-8934-d23aa0862b47\",\"content\":\"Experts were asked, \\u201cWhat percent of publications in the fields of Physics, Materials Science, and Medicine in 2030 will be \\u2018AI-engaged\\u2019 as measured in a replication of this study?\\u201d (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Duede et al. 2024<\\\/a>)\"},{\"id\":\"4b26fec0-a80a-4b36-b343-5fc01a5fe3b9\",\"content\":\"Experts were asked, \\u201cWhat percent of sales of recently approved U.S. drugs will be from AI-discovered drugs and products derived from AI-discovered drugs in the years 2027, 2030 and 2040?\\u201d\"},{\"id\":\"9eddcef0-6559-409f-a235-4e5b1e3bb50a\",\"content\":\"This was true at the time this expert completed the survey.\"},{\"id\":\"fc555ffd-72de-41b2-93aa-8b26cf617ca0\",\"content\":\"Experts were asked, \\u201cWhat percent of U.S. electricity consumption will be used for training and deploying AI systems in the years 2027, 2030 and 2040?\\u201d\"},{\"id\":\"b41f19c3-e70b-4826-8e0e-f6e36d60d407\",\"content\":\"Experts were asked, \\u201cWhat will be the global private investment (in billion USD) in AI in the years 2027 and 2030?\\u201d\"},{\"id\":\"37ebf1a6-a5b9-4718-98ec-b418ceec79fc\",\"content\":\"Experts were asked, \\u201cWhat proportion of U.S. adults will self-report using AI for companionship at least once daily by Dec 31 of 2027, 2030, and 2040?\\u201d\"},{\"id\":\"dbf69356-d11c-4580-8af5-0f30101613d6\",\"content\":\"This claim may refer to the ~23% of U.S. adults who, according to a 2024 KFF (formerly Kaiser Family Foundation) study, \\u201csay they received mental health counseling and\\\/or prescription medication for mental health concerns in the last year.\\u201d See Panchal and Lo (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2024<\\\/a>).\"},{\"id\":\"d5be9c36-5816-48d1-b8dc-7855b561d596\",\"content\":\"<em>Link provided to participants<\\\/em>: <a href=\\\"https:\\\/\\\/www.claymath.org\\\/millennium-problems\\\/\\\">https:\\\/\\\/www.claymath.org\\\/millennium-problems\\\/<\\\/a>\"},{\"id\":\"3f59b267-230c-4c22-8b5b-2095c003c706\",\"content\":\"<em>Raw data<\\\/em>: IQR on the 50<sup>th<\\\/sup> percentile was (3.0%\\u201320.0%). 90<sup>th<\\\/sup> percentile of median forecast: 44.5.\"},{\"id\":\"0b5a5127-7d50-40c5-9ec3-68f02e8ccf83\",\"content\":\"<em>Raw data<\\\/em>: IQR on the 50<sup>th<\\\/sup> percentile was (10.0%\\u201350.0%). 90<sup>th<\\\/sup> percentile of median forecast: 65.4.\"},{\"id\":\"cefe5981-7c1a-4add-abf6-6847f9d23bd5\",\"content\":\"<em>Raw data<\\\/em>: IQR on the 50<sup>th<\\\/sup> percentile was (30.3%\\u201380.8%). 90<sup>th<\\\/sup> percentile of median forecast: 95.0.\"},{\"id\":\"c72e2948-fd92-4df8-9136-0c41fff551be\",\"content\":\"The expert is referring to and quoting from Ansede (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">2025<\\\/a>).\"},{\"id\":\"cf5f7d6c-56ee-4976-a10c-004354151b62\",\"content\":\"The expert is referring to MacKenzie (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">1999<\\\/a>).\"},{\"id\":\"52fb1676-507a-472e-b319-e457143e4133\",\"content\":\"The expert appears to be referring to an August 2025 post by OpenAI researcher Sebastien Bubeck (<a href=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" id=\\\"https:\\\/\\\/forecastingresearch.org\\\/pdf\\\/longitudinal-expert-ai-panel-leap-working-paper.pdf#page=61\\\" target=\\\"_blank\\\" rel=\\\"noreferrer noopener\\\">Bubeck 2025<\\\/a>).\"},{\"id\":\"06475ed0-c168-4784-bda4-e7e8682711a3\",\"content\":\"Recall from <a href=\\\"#public-sampling\\\">Public Sampling<\\\/a> that our public sample consists largely of highly engaged, past participants from FRI work.\"},{\"id\":\"d34e17c1-44b3-48aa-b7da-41c3f2034432\",\"content\":\"Examples of questions include: (1) What will be the National Average Temperature Rank for May 2025 in the contiguous United States, according to NOAA's Climate Data Center, where 1 is the coolest and 130 is the highest rank representing the warmest on record?; and (2) What will be the closing stock price of Meta on 30 May 2025?\"},{\"id\":\"f4eaac42-3c24-41ec-860f-14305bca01b1\",\"content\":\"Survey: <a href=\\\"https:\\\/\\\/airtable.com\\\/appGCchUyUTPvT90e\\\/pagPpnUX2SiiTUNpp\\\/form\\\">https:\\\/\\\/airtable.com\\\/appGCchUyUTPvT90e\\\/pagPpnUX2SiiTUNpp\\\/form<\\\/a>. Project Team Contact: leap@forecastingresearch.org.\"},{\"id\":\"bccb8797-974a-42b0-801a-cbd08c5a401a\",\"content\":\"See <a href=\\\"https:\\\/\\\/leap.forecastingresearch.org\\\/forecasts\\\">https:\\\/\\\/leap.forecastingresearch.org\\\/forecasts<\\\/a>.\"}]"},"research_type":[4],"class_list":["post-1355","research","type-research","status-publish","has-post-thumbnail","hentry","research_type-working-paper"],"acf":[],"yoast_head":"<title>The Longitudinal Expert AI Panel: Understanding Expert Views on AI Capabilities, Adoption, and Impact &#8211; Forecasting Research Institute<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/forecastingresearch.org\/research\/longitudinal-expert-ai-panel-leap-working-paper\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"The Longitudinal Expert AI Panel: Understanding Expert Views on AI Capabilities, Adoption, and Impact &#8211; Forecasting Research Institute\" \/>\n<meta property=\"og:description\" content=\"The Longitudinal Expert AI Panel (LEAP) is a three-year project tracking the views of leading computer scientists, industry professionals, policy researchers, and economists on the trajectory of artificial intelligence. In this paper, we introduce LEAP and summarize findings from the first three waves.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/forecastingresearch.org\/research\/longitudinal-expert-ai-panel-leap-working-paper\" \/>\n<meta property=\"og:site_name\" content=\"Forecasting Research Institute\" \/>\n<meta property=\"article:modified_time\" content=\"2026-05-05T11:23:12+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/illustration_Midjourney_LEAP_paper_waves-1-2-3.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"2560\" \/>\n\t<meta property=\"og:image:height\" content=\"1607\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/forecastingresearch.org\\\/research\\\/longitudinal-expert-ai-panel-leap-working-paper\",\"url\":\"https:\\\/\\\/forecastingresearch.org\\\/research\\\/longitudinal-expert-ai-panel-leap-working-paper\",\"name\":\"The Longitudinal Expert AI Panel: Understanding Expert Views on AI Capabilities, Adoption, and Impact &#8211; Forecasting Research Institute\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/forecastingresearch.org\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/forecastingresearch.org\\\/research\\\/longitudinal-expert-ai-panel-leap-working-paper#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/forecastingresearch.org\\\/research\\\/longitudinal-expert-ai-panel-leap-working-paper#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/forecastingresearch.org\\\/wp-content\\\/uploads\\\/2026\\\/04\\\/illustration_Midjourney_LEAP_paper_waves-1-2-3.jpg\",\"datePublished\":\"2025-11-10T12:00:00+00:00\",\"dateModified\":\"2026-05-05T11:23:12+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/forecastingresearch.org\\\/research\\\/longitudinal-expert-ai-panel-leap-working-paper#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/forecastingresearch.org\\\/research\\\/longitudinal-expert-ai-panel-leap-working-paper\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/forecastingresearch.org\\\/research\\\/longitudinal-expert-ai-panel-leap-working-paper#primaryimage\",\"url\":\"https:\\\/\\\/forecastingresearch.org\\\/wp-content\\\/uploads\\\/2026\\\/04\\\/illustration_Midjourney_LEAP_paper_waves-1-2-3.jpg\",\"contentUrl\":\"https:\\\/\\\/forecastingresearch.org\\\/wp-content\\\/uploads\\\/2026\\\/04\\\/illustration_Midjourney_LEAP_paper_waves-1-2-3.jpg\",\"width\":2560,\"height\":1607},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/forecastingresearch.org\\\/research\\\/longitudinal-expert-ai-panel-leap-working-paper#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/forecastingresearch.org\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"The Longitudinal Expert AI Panel: Understanding Expert Views on AI Capabilities, Adoption, and Impact\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/forecastingresearch.org\\\/#website\",\"url\":\"https:\\\/\\\/forecastingresearch.org\\\/\",\"name\":\"Forecasting Research Institute\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/forecastingresearch.org\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>","yoast_head_json":{"title":"The Longitudinal Expert AI Panel: Understanding Expert Views on AI Capabilities, Adoption, and Impact &#8211; Forecasting Research Institute","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/forecastingresearch.org\/research\/longitudinal-expert-ai-panel-leap-working-paper","og_locale":"en_US","og_type":"article","og_title":"The Longitudinal Expert AI Panel: Understanding Expert Views on AI Capabilities, Adoption, and Impact &#8211; Forecasting Research Institute","og_description":"The Longitudinal Expert AI Panel (LEAP) is a three-year project tracking the views of leading computer scientists, industry professionals, policy researchers, and economists on the trajectory of artificial intelligence. In this paper, we introduce LEAP and summarize findings from the first three waves.","og_url":"https:\/\/forecastingresearch.org\/research\/longitudinal-expert-ai-panel-leap-working-paper","og_site_name":"Forecasting Research Institute","article_modified_time":"2026-05-05T11:23:12+00:00","og_image":[{"width":2560,"height":1607,"url":"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/illustration_Midjourney_LEAP_paper_waves-1-2-3.jpg","type":"image\/jpeg"}],"twitter_card":"summary_large_image","schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/forecastingresearch.org\/research\/longitudinal-expert-ai-panel-leap-working-paper","url":"https:\/\/forecastingresearch.org\/research\/longitudinal-expert-ai-panel-leap-working-paper","name":"The Longitudinal Expert AI Panel: Understanding Expert Views on AI Capabilities, Adoption, and Impact &#8211; Forecasting Research Institute","isPartOf":{"@id":"https:\/\/forecastingresearch.org\/#website"},"primaryImageOfPage":{"@id":"https:\/\/forecastingresearch.org\/research\/longitudinal-expert-ai-panel-leap-working-paper#primaryimage"},"image":{"@id":"https:\/\/forecastingresearch.org\/research\/longitudinal-expert-ai-panel-leap-working-paper#primaryimage"},"thumbnailUrl":"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/illustration_Midjourney_LEAP_paper_waves-1-2-3.jpg","datePublished":"2025-11-10T12:00:00+00:00","dateModified":"2026-05-05T11:23:12+00:00","breadcrumb":{"@id":"https:\/\/forecastingresearch.org\/research\/longitudinal-expert-ai-panel-leap-working-paper#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/forecastingresearch.org\/research\/longitudinal-expert-ai-panel-leap-working-paper"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/forecastingresearch.org\/research\/longitudinal-expert-ai-panel-leap-working-paper#primaryimage","url":"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/illustration_Midjourney_LEAP_paper_waves-1-2-3.jpg","contentUrl":"https:\/\/forecastingresearch.org\/wp-content\/uploads\/2026\/04\/illustration_Midjourney_LEAP_paper_waves-1-2-3.jpg","width":2560,"height":1607},{"@type":"BreadcrumbList","@id":"https:\/\/forecastingresearch.org\/research\/longitudinal-expert-ai-panel-leap-working-paper#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/forecastingresearch.org\/"},{"@type":"ListItem","position":2,"name":"The Longitudinal Expert AI Panel: Understanding Expert Views on AI Capabilities, Adoption, and Impact"}]},{"@type":"WebSite","@id":"https:\/\/forecastingresearch.org\/#website","url":"https:\/\/forecastingresearch.org\/","name":"Forecasting Research Institute","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/forecastingresearch.org\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/forecastingresearch.org\/api\/wp\/v2\/research\/1355","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/forecastingresearch.org\/api\/wp\/v2\/research"}],"about":[{"href":"https:\/\/forecastingresearch.org\/api\/wp\/v2\/types\/research"}],"version-history":[{"count":93,"href":"https:\/\/forecastingresearch.org\/api\/wp\/v2\/research\/1355\/revisions"}],"predecessor-version":[{"id":2176,"href":"https:\/\/forecastingresearch.org\/api\/wp\/v2\/research\/1355\/revisions\/2176"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/forecastingresearch.org\/api\/wp\/v2\/media\/1677"}],"wp:attachment":[{"href":"https:\/\/forecastingresearch.org\/api\/wp\/v2\/media?parent=1355"}],"wp:term":[{"taxonomy":"research_type","embeddable":true,"href":"https:\/\/forecastingresearch.org\/api\/wp\/v2\/research_type?post=1355"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}