Tracking Poll Regression & Awareness Corrections
Correcting box office hype with data.
Overview
This pillar analyzes pre-release movie tracking polls to separate genuine audience intent from inflated marketing awareness. It provides a more realistic projection of a film's opening weekend potential by correcting for common polling biases.
What It Does
It performs a regression analysis on historical tracking data for comparable films, identifying patterns where high 'unaided awareness' does not translate to 'first choice' ticket purchases. The model then generates a correction factor based on genre, marketing spend, and release date competition. This adjusts the raw tracking numbers to a more probable outcome.
Why It Matters
Early tracking polls are notoriously volatile and often skewed by marketing blitzes, creating mispriced prediction markets. This pillar offers a crucial second look, grounding predictions in historical reality rather than just current hype. It helps identify overvalued and undervalued films before the market corrects itself.
How It Works
First, the pillar ingests the latest tracking data for a film, focusing on key metrics like unaided awareness and first choice percentages. It then queries a database of past films to find historical analogs with similar profiles. Using a regression model, it calculates the historical error rate for those analogs and applies a weighted correction factor to the current film's tracking estimate.
Methodology
The core is a multivariate regression model that uses historical opening weekend data as the dependent variable. Independent variables include tracking poll data (4 weeks out, 2 weeks out), specifically the ratio of 'Unaided Awareness' to 'First Choice' score, genre, MPAA rating, and critic scores. A 'Hype Inflation Score' is calculated as: (Unaided Awareness % / First Choice %) * log(Marketing Spend). This score is used to adjust the final projection downwards if it exceeds a historical threshold of 1.5 for the specific genre.
Edge & Advantage
This provides an edge by quantifying the difference between 'people know the movie exists' and 'people will actually buy a ticket'. It systematically fades public hype that isn't backed by genuine consumer intent.
Key Indicators
-
Awareness to Intent Ratio
highThe ratio between 'unaided awareness' and 'first choice' metrics. A high ratio suggests soft interest.
-
Historical Comp Error
highThe average tracking poll error for historically similar films in the final weeks before release.
-
Social Media Velocity
mediumThe rate of change in positive social media mentions, used as a tie-breaker.
Data Sources
-
Industry-standard polling data for upcoming films, often reported by trade publications.
-
Provides historical box office data used for building the regression model.
-
Financial data on film budgets and historical performance.
Example Questions This Pillar Answers
- → Will 'Superhero Movie Sequel' gross over $150M in its domestic opening weekend?
- → Will 'Indie Darling' open to more than $10M domestically?
- → Which film will have a higher opening weekend: 'Action Film A' or 'Comedy Film B'?
Tags
Use Tracking Poll Regression & Awareness Corrections on a real market
Run this analytical framework on any Polymarket or Kalshi event contract.
Try PillarLab