Back to Forecasting

Model Calibration

How well do our predictions match reality? Calibration metrics show whether models are overconfident, underconfident, or well-calibrated.

1774
Total Markets
96
Resolved
1676
Active

Calibration Results

95
Markets Resolved
0.166
Avg Brier Score
+0.009
Avg Improvement
0.175
Initial
0.166
Final
+0.009
Improvement

Resolved Markets

MarketInitialFinalChange
ADBEWill Adobe's Q1 FY2026 revenue exceed the $6.30B high end of guidance?0.1600.160
ADSKWill Autodesk guide FY2027 revenue above $7.80B at the Q4 FY2026 earnings call?0.1020.102
ADSKWill Autodesk report Q4 FY2026 non-GAAP operating margin at or above 38%?0.0480.048
ADSKWill Autodesk's RPO growth exceed revenue growth in Q4 FY2026?0.0900.090
ASANWill Asana report dollar-based net retention rate (NRR) of 97% or higher for Q4 FY2026?0.1090.109
ASANWill Asana guide FY2027 revenue growth at 10% or higher?0.1090.109
ASANWill Asana disclose AI-specific revenue or ARR metrics by Q2 FY2027 earnings (September 2026)?0.7570.757
ASANWill Asana report $100K+ customer count growth of 10% or higher YoY in Q4 FY2026?0.0100.010
ASANWill Asana's remaining performance obligations (RPO) YoY growth rate exceed its revenue growth rate in Q4 FY2026?0.0260.026
ASTSWill AST SpaceMobile's H2 2025 total revenue meet or exceed the low end of guidance ($50M)?0.4900.008+0.482
ASTSWill AST SpaceMobile conduct an additional dilutive equity or convertible offering exceeding $200M by December 31, 2026?0.0900.090
BRK.BWill Greg Abel's first shareholder letter (expected ~Feb 28, 2026) announce a specific capital allocation initiative (buyback resumption, acquisition pipeline, or dividend)?0.0220.022
BYNDWill BYND receive a NASDAQ non-compliance notice for minimum bid price by June 30, 2026?0.0080.008
BYNDWill BYND report Q4 2025 revenue above $70M (YoY decline less than 15%)?0.0170.017
CCLWill CCL beat consensus EPS estimates for Q1 FY2026 (reported March 25, 2026)?0.1090.109
COMPWill Compass receive a favorable ruling (or settlement) in the Compass v. Zillow private exclusives lawsuit by September 30, 2026?0.0400.040
COSTWill Costco fail to achieve SG&A leverage for 4+ consecutive quarters by Q4 FY2026 despite comp sales above 5%?0.2500.250
CRMWill Salesforce report Q4 FY2026 revenue growth of 10% or higher (constant currency)?0.5780.578
CRMWill Salesforce disclose AgentForce ARR at or above $1B by Q4 FY2026 earnings?0.0530.053
CRMWill Salesforce's current RPO (cRPO) growth exceed 11% YoY in Q4 FY2026?0.2120.212
CRMWill Salesforce guide FY2027 non-GAAP operating margin at 34% or higher?0.0530.053
CRMWill Salesforce issue new debt specifically to fund share repurchases by Q2 FY2027 (July 2026)?0.3250.325
CVNAWill Carvana's Q4 2025 'Other' gross profit per unit remain above $420?0.2300.230
CVNAWill Carvana's trailing-twelve-month operating cash flow to net income conversion ratio fall below 50% as of Q4 2025?0.0320.032
DALWill Delta report Q1 2026 EPS at or above $0.50 (low end of guidance)?0.0580.058
DALWill average jet fuel price remain above $3.00/gallon through H1 2026?0.2210.001+0.220
DDOGWill Datadog's Q4 2025 year-over-year revenue growth rate fall below 25%?0.1440.144
DDOGWill Datadog disclose or confirm that a single customer represents more than 5% of total revenue in Q4 2025 earnings or Investor Day?0.0120.006+0.006
DDOGWill Datadog's initial FY2026 full-year revenue guidance imply growth of 22% or above?0.1760.176
DOCUWill DocuSign guide FY2027 revenue growth at or above 10% at the Q4 FY2026 earnings call?0.0400.040
DOCUWill DocuSign report dollar-based net retention (DNR) at or above 103% in Q4 FY2026?0.0630.063
DOCUWill DocuSign report stock-based compensation below 20% of revenue for full-year FY2026?0.0480.048
DOCUWill DocuSign report Q4 FY2026 billings growth of 10% or higher year-over-year?0.4900.490
FDXWill FedEx Express/Ground (FEC) report Q3 FY2026 adjusted operating margin above 9.0%?0.3360.336
FDXWill FedEx Freight report operating margins below 14% for any quarter in H2 FY2026 (Q3 or Q4)?0.3600.360
FRMIWill FRMI close its $500M MUFG credit facility by September 30, 2026?0.1850.185
FRMIWill FRMI record an impairment charge on construction in progress by its first 10-K filing?0.0140.014
FRMIWill FRMI publicly disclose its specific nuclear reactor design by December 31, 2026?0.2300.230
GTLBWill GitLab's Q4 FY2026 revenue growth exceed the guided ~19% YoY by 3+ percentage points?0.3360.336
GTLBWill GitLab's non-GAAP operating margin exceed 20% in any quarter by Q2 FY2027?0.1440.144
HIMSWill HIMS report Q4 2025 weight loss specialty revenue below $175M?0.2300.102+0.128
HIMSWill HIMS report Q4 2025 year-over-year subscriber growth below 15%?0.1230.123
HIMSWill HIMS management withdraw or materially reduce its $6.5B 2030 revenue target at Q4 2025 earnings?0.0780.078
HOODWill Robinhood's year-over-year total revenue growth fall below 30% in any quarter by Q2 2026?0.1440.144
HOODWill Robinhood's crypto transaction revenue decline more than 30% quarter-over-quarter in Q4 2025?0.0400.040
INTUWill Credit Karma's revenue growth fall below 10% YoY in Q2 FY2026?0.1020.102
INTUWill Intuit report Q2 FY2026 revenue at or above the high end of guidance ($5.25B)?0.1220.122
INTUWill Intuit disclose continued Mailchimp revenue decline through H1 FY2026?0.0530.053
INTUWill Intuit's stock-based compensation exceed 11% of revenue in Q2 FY2026?0.1090.109
KRMNWill KRMN's net leverage ratio exceed 3.5x adjusted EBITDA at FY2025 year-end?0.6720.672
KRMNWill KRMN report Q4 FY2025 revenue growth above 25% year-over-year?0.0170.017
KRMNWill KRMN guide FY2026 revenue growth of 20% or higher at their Q4 FY2025 earnings call?0.0580.058
KSSWill Kohl's report Q4 FY2025 comparable sales worse than -4%?0.3720.372
LMNDWill LMND report Q4 2025 gross loss ratio below 65%?0.1160.116
LMNDWill LMND report Q4 2025 in-force premium (IFP) growth above 30% YoY?0.2300.230
MDBWill MongoDB's Q4 FY2026 Atlas revenue growth decelerate below 25% year-over-year?0.0400.040
MDBWill MongoDB's initial FY2027 full-year revenue guidance imply growth of 20% or above?0.2700.270
MDBWill MongoDB's FY2026 10-K filing contain any auditor emphasis-of-matter paragraph, accounting policy change, or auditor change?0.0320.020+0.013
MOHWill Molina Healthcare's 2026 full-year diluted EPS guidance midpoint exceed $14.00?0.7230.723
MPWill MP's Materials segment achieve positive EBITDA in any quarter of FY2025?0.2030.203
MPWill MP's Magnetics segment generate revenue in H1 2025?0.1370.137
MRNAWill Moderna report Q4 2025 year-end cash below $7.0B or issue 2026 revenue guidance below $1.5B?0.1020.102
MRVLWill Marvell's Q4 FY2026 revenue fall below $2.09B?0.0080.008
MUWill Micron's FQ2 FY2026 revenue exceed $18.0B?0.0260.026
NFLXWill the DOJ file suit to block the Netflix-WBD merger by December 31, 2026?0.3250.325
NFLXWill Netflix abandon or withdraw the WBD acquisition by December 31, 2026?0.5180.518
NFLXWill Netflix and DOJ announce consent decree negotiations for the WBD deal by June 30, 2026?0.0400.040
NFLXWill Netflix disclose permanent financing terms for the WBD acquisition by June 30, 2026?0.1220.122
NFLXWill the EU Commission open a Phase II investigation into the Netflix-WBD merger by September 30, 2026?0.1370.137
NVOWill CagriSema demonstrate >=20% body weight loss in the REDEFINE 4 trial results reported by June 30, 2026?0.3840.384
OKTAWill Okta's Q4 FY2026 cRPO growth come in below the guided ~9% YoY?0.0730.073
OKTAWill Okta's initial FY2027 revenue guidance imply below 10% YoY growth?0.3720.372
ORCLWill Oracle's OCI IaaS revenue growth decelerate below 40% YoY in Q3 FY2026?0.0100.010
ORCLWill Oracle's Q3 FY2026 operating cash flow fall below $3B?0.0900.090
PDWill PagerDuty guide FY2027 revenue above $500M (positive growth vs. FY2026)?0.1600.160
PDWill PagerDuty report dollar-based net retention rate below 100% in Q4 FY2026?0.0780.078
PDWill PagerDuty's Q4 FY2026 revenue meet or exceed the midpoint of guidance (~$128M)?0.1600.160
REZIWill REZI report a goodwill impairment charge in its 2025 10-K?0.1020.102
SWill SentinelOne's Q4 FY2026 revenue exceed $250M?0.0580.058
SWill SentinelOne disclose a quantitative net retention rate at Q4 FY2026 earnings?0.6400.640
SWill SentinelOne's initial FY2027 revenue guidance imply 20%+ YoY growth?0.1760.176
SWill SentinelOne announce a permanent CFO with finance credentials by Q2 FY2027?0.0630.063
SNOWWill Snowflake's initial FY2027 full-year product revenue guidance imply growth below 25%?0.3480.348
TGTWill Target report a 5th consecutive negative comparable sales quarter in Q4 FY2025?0.0480.048
TGTWill Target guide FY2026 adjusted EPS below $8.00 at the March 2026 earnings call?0.3140.314
TGTWill Target guide FY2026 CapEx below $4.0B at the March 2026 Financial Community Meeting?0.0100.010
TWLOWill Twilio report Q4 2025 organic revenue growth below 10% YoY?0.0120.012
TWLOWill Twilio guide FY2026 organic revenue growth at 10% or above?0.4620.462
TWLOWill Twilio's Q4 2025 non-GAAP gross margin fall below 50%?0.1160.116
UAMYWill UAMY's FY2025 revenue meet or exceed the $40M guidance floor?0.3840.384
UAMYWill UAMY maintain $125M FY2026 revenue guidance on the FY2025 earnings call?0.0900.090
UAMYWill UAMY's Montana Thompson Falls expansion be fully operational by Q1 2026?0.3020.302
ULTAWill Ulta Beauty's Q4 FY2025 operating margin fall below 12.0%?0.0900.090
XYZWill Block report Q4 2025 gross profit growth of 15% or higher YoY?0.0200.020
XYZWill Block report Q4 2025 Square GPV growth of 10% or higher YoY?0.1020.102

Scoring Methodology

Brier Score (Binary)

For binary yes/no predictions, we use the Brier Score:

Brier = (probability - outcome)²
  • 0.00 — Perfect prediction (100% confident and correct)
  • 0.25 — Maximally uncertain (50% prediction)
  • 1.00 — Worst possible (100% confident and wrong)

Calibration Curve

A perfectly calibrated model's predictions should match outcomes:

  • Events predicted at 30% should occur ~30% of the time
  • Events predicted at 70% should occur ~70% of the time

We group predictions into buckets (0-10%, 10-20%, etc.) and compare predicted rates to actual outcomes.

Model Comparison

Each market receives predictions from multiple models:

Opus

Deep reasoning, handles edge cases and complex scenarios

Sonnet

Balanced approach, good at pattern recognition

Haiku

Fast, pattern-focused, captures obvious signals

The aggregate prediction uses the median across all model runs. Model agreement is calculated as 1 minus normalized standard deviation — higher agreement suggests more confidence in the prediction.