When the model says X% confidence, does it actually win X%? This is how we audit ourselves.
| Prop Type | Sample | Brier | Accuracy |
|---|---|---|---|
| PITCHER STRIKEOUTS | 270 | 0.2491 | 55.9% |
| PITCHER WALKS | 278 | 0.2367 | 62.2% |
| HITS RUNS RBI | 2451 | 0.2527 | 46.3% |
| TOTAL BASES | 1176 | 0.2566 | 47.4% |
| Source | Sample | Accuracy |
|---|---|---|
| baseline | 3905 | 47.8% |
| blended | 270 | 55.9% |