Edge Master Class 2015: A Short Course in Superforecasting, Class II

Tournaments: Prying Open Closed Minds in Unnecessarily Polarized Debates
Philip Tetlock [8.24.15]



Philip Tetlock: I thought it might be a good idea, before we go into the quality of questions that are input into tournaments, to say a few more words about the nature of the Good Judgment Project and what it meant to win the forecasting tournament, how it won the forecasting tournament, and what inferences you might or might not want to draw from victory in forecasting tournaments in general.

If you were to turn to slide twenty-eight in the book, it raises the question, how much can the Good Judgment Project improve foresight?

On the weekend of July 30th, Edge convened one of its "Master Classes." In the past, these classes have featured short courses taught by people such as psychologist and Nobel Laureate Daniel Kahneman ("A Short Course in Thinking About Thinking"); behavioral economists Richard Thaler and Sendhil Mullainathan, again with Kahneman ("A Short Course in Behavioral Economics"); and genomic researchers George Church and J. Craig Venter ("A Short Course on Synthetic Genomics").

This year, the psychologist and social scientist Philip E. Tetlock presented the findings based on his work on forecasting as part of the Good Judgment Project. In 1984, Tetlock began holding "forecasting tournaments" in which selected candidates were asked questions about the course of events: In the wake of a natural disaster, what policies will be changed in the United States? When will North Korea test nuclear weapons? Candidates examine the questions in teams. They are not necessarily experts, but attentive, shrewd citizens.

Steven Pinker, who has written about Tetlock's work on Superforecasting, noted that "Tetlock is one of the very, very best minds in the social sciences today. He has come up with one brilliant idea after another, and superforecasting is no exception. Everyone agrees that the way to know if an idea is right is to see whether it accurately predicts the future. But which ideas, which methods, which people have an actual, provable track record of non-obvious predictions vindicated by the course of events? The answers will surprise you, and have radical implications for politics, policy, journalism, education, and even epistemology—how we can best gain knowledge about the world we live in."

Among Tetlock's "students" at the Edge weekend were many intellectual heavyweights, including political scientist and National Medal of Science winner Robert Axelrod; psychologist, Nobel Laureate, and recipient of the 2013 Presidential Medal of Freedom Daniel Kahneman; political scientist and Director of Stanford's CASBS Margaret Levi; Google Senior Vice President Salar Kamangar; psychologist and National Medal of Science winner Anne Treisman; roboticist Rodney Brooks, former head of MIT's Computer Science Lab; W. Daniel Hillis, pioneer in massively parallel computation; medical inventor Dean Kamen; and Peter Lee, Corporate Vice President, Microsoft Research, overseeing MSR NExT.

Over the weekend in Napa, Tetlock held five classes, which are being presented by Edge in their entirety (8.5 hours of video and audio) along with accompanying transcripts (61,000 words). Commenting on the event, one of the participants wrote:

"The interesting thing is that this is not about a latest trend that might scale in one or two years, but about real change that might take a decade or two. Also, these masterclasses are not only much more profound than any of the conferences popularizing contemporary intellectualism. The possibility to spend that much time with the clairvoyants in a setting like this also gives you a sense of community so much greater than any of the advertised."



John Brockman
Editor, Edge 

PHILIP E. TETLOCK, Political and Social Scientist, is the Annenberg University Professor at the University of Pennsylvania, with appointments in Wharton, psychology and political science. He is co-leader of the Good Judgment Project, a multi-year forecasting study, the author of Expert Political Judgment and (with Aaron Belkin) Counterfactual Thought Experiments in World Politics, and co-author (with Dan Gardner) of Superforecasting: The Art & Science of Prediction.

CLASS I — Forecasting Tournaments: What We Discover When We Start Scoring Accuracy

It is as though high status pundits have learned a valuable survival skill, and that survival skill is they've mastered the art of appearing to go out on a limb without actually going out on a limb. They say dramatic things but there are vague verbiage quantifiers connected to the dramatic things. It sounds as though they're saying something very compelling and riveting. There's a scenario that's been conjured up in your mind of something either very good or very bad. It's vivid, easily imaginable.

It turns out, on close inspection they're not really saying that's going to happen. They're not specifying the conditions, or a time frame, or likelihood, so there's no way of assessing accuracy. You could say these pundits are just doing what a rational pundit would do because they know that they live in a somewhat stochastic world. They know that it's a world that frequently is going to throw off surprises at them, so to maintain their credibility with their community of co-believers they need to be vague. It's an essential survival skill. There is some considerable truth to that, and forecasting tournaments are a very different way of proceeding. Forecasting tournaments require people to attach explicit probabilities to well-defined outcomes in well-defined time frames so you can keep score.
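
To make "keeping score" concrete, here is a minimal sketch of one widely used scoring rule for probability forecasts, the Brier score. The passage above does not say which rule a given tournament uses, so treat the choice of rule, the function name `brier_score`, and the example numbers as assumptions made purely for illustration. This is the binary form, where 0 is a perfect score and 1 is the worst possible.

```python
def brier_score(forecasts, outcomes):
    """Mean squared difference between forecast probabilities and outcomes.

    forecasts: probabilities in [0, 1] that each event will happen.
    outcomes:  1 if the event happened within the time frame, else 0.
    Illustrative helper, not taken from the source text.
    """
    if len(forecasts) != len(outcomes):
        raise ValueError("forecasts and outcomes must have the same length")
    if not forecasts:
        raise ValueError("at least one forecast is required")
    for p in forecasts:
        if not 0.0 <= p <= 1.0:
            raise ValueError("each forecast must be a probability in [0, 1]")
    squared_errors = [(p - o) ** 2 for p, o in zip(forecasts, outcomes)]
    return sum(squared_errors) / len(squared_errors)


# Hypothetical example: three questions, of which the first two resolved "yes".
outcomes = [1, 1, 0]
hedger = [0.5, 0.5, 0.5]     # never commits either way
committed = [0.9, 0.8, 0.1]  # states explicit, checkable probabilities

print(brier_score(hedger, outcomes))     # 0.25
print(brier_score(committed, outcomes))  # roughly 0.02, the better score
```

A forecaster who always answers fifty-fifty scores 0.25 no matter what happens, so a rule like this rewards explicit probabilities only when they turn out to be well calibrated.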

CLASS II — Tournaments: Prying Open Closed Minds in Unnecessarily Polarized Debates

Tournaments have a scientific value. They help us test a lot of psychological hypotheses about the drivers of accuracy, they help us test statistical ideas; there are a lot of ideas we can test in tournaments. Tournaments have a value inside organizations and businesses. A more accurate probability helps to price options better on Wall Street, so they have value. 

I wanted to focus more on what I see as the wider societal value of tournaments and the potential value of tournaments in depolarizing unnecessarily polarizing policy debates. In short, making us more civilized. ... 

There is well-developed research literature on how to measure accuracy. There is not such well-developed research literature on how to measure the quality of questions. The quality of questions is going to be absolutely crucial if we want tournaments to be able to play a role in tipping the scales of plausibility in important debates, and if you want tournaments to play a role in incentivizing people to behave more reasonably in debates.

CLASS III — Counterfactual History: The Elusive Control Groups in Policy Debates
CLASS IV — Counterfactuals and the Making of (Better) Superforecasters
CLASS V — Condensing it All Into Four Big Problems and a Killer App Solution

For the psychology professor Philip Tetlock, the hunt for Osama Bin Laden is a classic example of the insufficiency of secret-service agencies. When Barack Obama gave the green light for that operation four years ago, he knew he was making one of the most difficult decisions of his life, one that would not only mean life or death for those involved but also sway the course of history and help determine his legacy. The prognoses offered by the secret-service agencies were inconclusive: some put the likelihood of success at 40%, others at 80%. In Zero Dark Thirty, the movie based on this operation, the CIA agent Maya insists she is 100% certain of success. In reality, Obama judged that the chances stood at fifty-fifty and gave the green light against the advice of his secretary of defense.

In Tetlock's view, such imprecisions present an unacceptable risk. Forecasts alleging complete certainty are, of course, unscientific. But Tetlock argues that a historic decision must not be based on imprecise reports. While Obama may have enjoyed luck on a historic scale, with his special task force finding Bin Laden and killing him, Tetlock insists that the work of secret-service agencies must change—fundamentally.

Since the eighties, Tetlock has worked on precisely this endeavor. For four years now he has pursued research at the University of Pennsylvania at the behest of the Intelligence Advanced Research Projects Activity (IARPA), which the NSA and the CIA, together with fourteen other American secret-service agencies, established in 2006 in order to develop new methods for secret-service work in the post-9/11 era. Among IARPA’s divisions are the Office for Anticipating Surprise, the Office of Smart Collection, and the Office of Incisive Analysis.

Psychologists' "forecasting tournaments" capture the interest of the NSA and the CIA

This past weekend Tetlock met with twenty scientists and engineers on a vineyard north of San Francisco. Two European journalists were invited; otherwise, the meeting was closed to the public. Tetlock wanted to discuss the results of his Good Judgment Project, which he has worked on for 24 years. The scientists discuss the project under ideal circumstances: sheltered from the summer heat in the cool living room of a stately Victorian house. With palms in the garden, a front porch and wainscoting, the house exudes colonial splendor. The air is redolent of the roses in the beds in front of the windows and of the precious woods of the furniture. The host is John Brockman of Edge Foundation, Inc. (http://edge.org), which is the best network for such debates in the country. That explains the presence of such intellectual heavyweights as the Nobel Laureate in Economics Daniel Kahneman, the political scientist and National Medal of Science winner Robert Axelrod, the political scientist Margaret Levi, and Google Vice President Salar Kamangar. It isn’t easy to hold one’s own in such a group. Kahneman in particular, the cleverest of them all, is skeptical.

Tetlock begins by recounting the history of the Good Judgment Project. In 1984 he began holding "forecasting tournaments" in which selected candidates are asked questions about the course of events. In the wake of a natural disaster, what policies will be changed in the United States? When will North Korea test nuclear weapons? Candidates examine the questions in teams. They are not necessarily experts, but attentive, shrewd citizens. One of the best forecasters so far is Bill Flack, a former official of the U.S. Department of Agriculture from Nebraska.

