Reference Class Architect
Using historical precedent to forecast accurately.
Overview
This pillar implements reference class forecasting to ground predictions in historical data. It systematically finds comparable past events to create a statistical baseline, which helps avoid cognitive biases and anecdotal evidence.
What It Does
The Reference Class Architect identifies a target event and then defines strict criteria for what constitutes a similar historical case. It constructs a 'reference class' of these past events from historical data. The pillar then analyzes the distribution of outcomes within this class to produce a probabilistic forecast, moving beyond a narrow, single-case view.
Why It Matters
It provides a powerful antidote to the 'planning fallacy' and optimism bias, where forecasters focus too much on the unique details of a situation. By forcing an 'outside view', it grounds predictions in what has actually happened in similar situations before, improving accuracy.
How It Works
First, the prediction market question is deconstructed into its core attributes. Second, historical databases are scanned to find past events that match these attributes based on predefined inclusion and exclusion criteria. Third, these events form the reference class. Finally, the frequency of different outcomes within this class is calculated to establish a base rate probability.
Methodology
The methodology involves defining a feature vector for the target event. A similarity score is used to identify and select historical precedents from data sources, forming the reference class. The primary output is the base rate, calculated as the simple frequency of the outcome of interest within the class. A key part of the analysis is managing the trade-off between a narrow, highly relevant class and a broad, statistically robust one.
Edge & Advantage
This pillar provides an edge by systematically debiasing predictions, replacing subjective intuition with a statistical reality check from relevant historical data.
Key Indicators
-
Class Homogeneity Score
highMeasures the statistical similarity of cases within the reference class to ensure they are genuinely comparable.
-
Sample Size
highThe number of historical cases in the reference class. A larger size provides more statistical confidence.
-
Inclusion/Exclusion Criteria
mediumThe specific, logged rules used to define the reference class, ensuring transparency and repeatability of the analysis.
Data Sources
-
Sources like JSTOR, Google Scholar, and SSRN for studies containing historical data and analysis.
-
Provides national and international data on economics, demographics, and more from sources like the World Bank or Bureau of Labor Statistics.
-
Historical News Archives
Services like LexisNexis or ProQuest that contain archives of news articles detailing past events.
Example Questions This Pillar Answers
- → Will the incumbent party retain control of the Senate in the next election?
- → Will this blockbuster film's opening weekend gross exceed $150 million?
- → Will the proposed merger between two major tech companies receive regulatory approval?
Tags
Use Reference Class Architect on a real market
Run this analytical framework on any Polymarket or Kalshi event contract.
Try PillarLab