LLMs can predict the future as well as—and sometimes better than—humans

Predicting the future, or at least trying to, is the backbone of economics and an augur of how our society evolves. Government policies, investment decisions, and global economic plans are all predicated on estimating what will happen in the future. But guessing right is tricky.

However, a new study by researchers at the London School of Economics, the Massachusetts Institute of Technology (MIT), and the University of Pennsylvania suggests that forecasting the future is a task that could well be outsourced to generative AI—with surprising results. Large language models (LLMs) working in a crowd can predict the future as well as humans can, and with a little training on human predictions, can improve to superhuman performance.

“Accurate forecasting of future events is very important to many aspects of human economic activity, especially within white collar occupations, such as those of law, business and policy,” says Peter S. Park, AI existential safety postdoctoral fellow at MIT, and one of the coauthors of the study.

Just a dozen LLMs can forecast the future as well as a team of 925 human forecasters, according to Park and his colleagues, who conducted two experiments for the study that tested AI’s ability to forecast three months into the future. In the first part of the study, both the 925 humans and the 12 LLMs were asked 31 questions to which the answer was yes or no.

Questions included “Will Hamas lose control of Gaza before 2024?” and “Will there be a US military combat death in the Red Sea before 2024?”

Across all the questions, the LLM responses, taken together, were as accurate as the human forecasters’ responses to the same questions. In the second experiment, the AI models were given the human forecasters’ median prediction for each question to inform their own predictions. Doing so improved the LLMs’ prediction accuracy by between 17% and 28%.
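For readers curious how a "crowd" of forecasts can be combined and graded, here is a minimal sketch in Python. It is an illustration under assumptions, not the study's actual method: the article does not say how the researchers aggregated or scored predictions, so the median aggregation, the Brier score (squared error between a probability and the 0/1 outcome), the function names, and all the numbers below are hypothetical.

```python
from statistics import median


def crowd_forecast(probabilities):
    """Combine individual probability forecasts for one yes/no question.

    Assumption: the crowd answer is the median of the individual forecasts.
    """
    return median(probabilities)


def brier_score(forecast, outcome):
    """Squared error between a probability forecast and the 0/1 outcome.

    Lower is better; 0.0 is a perfect forecast.
    """
    return (forecast - outcome) ** 2


def mean_brier(forecasts_by_question, outcomes):
    """Average Brier score of the crowd forecast across all questions."""
    scores = [
        brier_score(crowd_forecast(forecasts), outcome)
        for forecasts, outcome in zip(forecasts_by_question, outcomes)
    ]
    return sum(scores) / len(scores)


# Made-up forecasts from three hypothetical models on two yes/no questions.
llm_forecasts = [
    [0.10, 0.20, 0.30],  # question 1: probabilities of "yes"
    [0.60, 0.70, 0.90],  # question 2: probabilities of "yes"
]
outcomes = [0, 1]  # hypothetical resolutions: question 1 "no", question 2 "yes"

score = mean_brier(llm_forecasts, outcomes)
print(f"Mean Brier score of the crowd: {score:.3f}")
```

Running the same scoring on a human crowd's forecasts for the same questions would give a number that can be compared directly, which is the kind of head-to-head comparison the article describes.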

“To be honest, I was not surprised [by the results],” Park says. “There are historical trends that have been true for a long time that make it reasonable that AI cognitive capabilities will continue to advance.” The fact that LLMs are trained on vast volumes of data trawled from the internet, and are designed to produce the most predictable, consensus (some would say average) response, also helps explain why LLMs may be strong at prediction. The scale of the data they draw from, and the range of opinions it contains, also helps supercharge the traditional wisdom-of-the-crowd concept that helps make accurate predictions.

The paper’s findings have huge ramifications for our ability to gaze into the metaphorical crystal ball—and for the future employment of human forecasters. As one AI expert put it on X: “Everything is about to get really weird.”

https://www.fastcompany.com/91049323/llms-can-predict-the-future-as-well-as-and-sometimes-better-than-humans?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Created 1y | Mar 7, 2024, 19:50:02

