Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Login-facebook
Around SBN: The Animated GIFs Of January

Sabermetrics 101: Expected Wins and Losses

Pythagorean win/loss record is one of the most misused concepts in sabremetrics. That doesn't mean it doesn't have moments where it shines, so let's look at what it is, when to use it, and when not to use it.

Prerequisites for understanding: None.

Prerequisites for derivation: Data.

Star-divide

Triangles

It was Bill James who first noticed the non-linear relationship between runs scored, runs allowed, and wins. It turned out to be relatively easy to predict a team's win-loss record using a simple formula, which very closely resembles trigonometry's Pythagorean Theorem (and I apologise for actually having math in this one):

The formula has been updated frequently (generally by changing the exponent) to match empirical results, but there's a statistical reason for the relationship as well, which is too complex to go into without getting into some serious maths. Regardless, what we need to know for now is that there's both an empirical and logical relationship between runs scored, runs allowed, and wins, and they agree down to some very small details. Pythagorean expectation goes by other names; notably 'Pythaganport' and 'Pythagenpat'. These are both more accurate versions of the original.

Barking Up The Wrong Tree

Teams whose real winning percentages exceed their expected winning percentages are often referred to as 'lucky', and teams who do the opposite are 'unlucky'. This is a crutch, and it's far from statistically rigorous. We should not pretend to be able to extract true talent level from two variables alone, and it's clear that 'luck' strikes far more deeply than in simple runs scored and runs allowed in a season. A team with an expected winning percentage of .500 and an actual record of 77-85 is not 'really' an 81-win team, although it is true that deviations from pythagorean win-loss are subject to regression. While pure pythagorean expectancy is probably a better way of gauging a team than actual wins and losses, we have some far more informative tricks up our sleeve (we'll get to them in good time), and so there's no reason to assume that we're getting the whole truth from runs scored and runs allowed alone. The idea of pythagorean 'luck' is a quick rule of thumb and nothing more.

Another commonly held belief about pythagorean expectation is that its function is to predict wins and losses given the runs scored/runs allowed data. This is not true: it is merely a statement of a relationship, and it's very important not to forget that. There is no need for pythagorean expectancy to take into account run distribution, or bullpen WPA, or any other input in order to increase its predictive value. Doing so detracts from the central relationship, the very core of what makes pythagorean expectancy useful.

So What's It For?

If you shouldn't use pythagorean expectancy to guess at team talent, and you probably shouldn't refine it to more accurately 'retro-predict' actual wins, what exactly is the point of learning about it?

The quick answer is that you can use it to predict wins given expected runs scored and against, perhaps in a projection system. The longer and better answer is that you can use it to derive our win-run conversion. It's stunningly elegant, really. Without this relationship it would be impossible to look at player value without a statistic that didn't inherently include wins above some benchmark. If the game state is the heart of most of the advanced statistics, pythagorean expectancy is the soul.

What Follows?

The win-run conversion; pitching statistics.

Comment 11 comments  |  0 recs  | 

Do you like this story?

Comments

Display:

I'm a bit surprised...

How heavily it’s discounted for real-world application.

I know it’s nowhere near the best for analyzing what a team record should be, but it always seemed like a good lazy starting point, like OPS.

by Sidi on Feb 20, 2010 12:37 AM PST reply actions  

I guess that's a pretty apt comparison...

OPS is pretty old-school too, and isn’t weighted well enough to be a very useful measure. It’s a stat for a rough judgment, but not much more.

by Sidi on Feb 20, 2010 12:41 AM PST up reply actions  

OPS is pretty good.

Certainly better than pythag as a starting point. Anyway, people tend to just take pythag as the be all and end all, which is frustrating.

by Graham MacAree on Feb 20, 2010 9:18 AM PST up reply actions  

Did you just agree with yourself?

Nonetheless, I think you make a fair point.

Just like OPS, pythagorean records have gone so mainstream as to be misused in representing value.

by hcoguy on Feb 20, 2010 1:03 AM PST reply actions  

Over a season, what is the average error or range of error

between a team’s actual runs scored and their projected runs scored? Same for runs allowed?

by Edgar for Pres on Feb 20, 2010 1:51 AM PST reply actions  

Good description

We have strong association between wins and this calculation. There is variance involved – I know you are trying to avoid the issue, but variance is really important because an understanding of it allows someone to understand why these measures are not perfect, but that’s a philosophical issue.

In any case, the point that this relationship provides a jumping off point for explaining how runs come into the mix for calculation of some of the win-related metrics is a really good one.

by New England Fan on Feb 20, 2010 8:33 AM PST reply actions  

Yes, that's where the post really got interesting...

…and then it ended. I assume that we’re getting these in bite-sized chunks and we’ll be getting a whole separate post making the connection between what Pythag tells us runs are worth in terms of wins, and how that gives us WAR for players. (I already understand the relationship, I think, but I’ll appreciate having it in one pithy piece I can link to).

by wandergeist on Feb 20, 2010 9:05 AM PST up reply actions  

If this is the only piece about Pythag

It would be nice to extend it to a little explanation of the 2nd and 3rd order wins that BP uses for their standings. I think I understand what they’re doing but their explanation is pretty terse, and 3rd order wins do seem to hold up to scrutiny a bit better than the straight Pythagorean calculation. (The M’s 2009 results look to be over-achieving by 10 whole wins according to the bare Pythagorean calculation, but only by 2 wins according to the third-order calculation.)

by wandergeist on Feb 20, 2010 9:12 AM PST reply actions  

This is tricky because I don't know exactly what they're doing

Basically they’re not really straight pythag at that point, but adjusting for luck in runs scored and allowed (i.e. run distribution), and then adjusting it again for strength of schedule. I don’t understand a strength of schedule adjustment being necessary in that sort of context, but it is what it is, I suppose.

It’s clearly better than just taking pythagorean win-loss, but it does seem overly convoluted (to me, at least).

by Graham MacAree on Feb 20, 2010 9:16 AM PST up reply actions  

Comments For This Post Are Closed


User Tools

By reading a game thread of your own volition you agree to accept all liability for any and all damage done to your delicate sensibilities.

FanPosts

Community blog posts and discussion.

Recommended FanPosts

Moar_bacon_small
Everything I Know About Jesus Montero

Recent FanPosts

Wbc_029_small
Friday Morning Music Thread
Small
OTDOD - Early February Edition
Agentejebaox3_small
A Statistical Analysis of Mariners' Fan Support
Small
Who will have a better season?
Claw_small
BA's Top 10 M's Prospects
Wbc_029_small
Friday Morning Music Thread
Small
Munenori Kawasaki Predictions!!!
Small
The Longevity and Future Success of Felix Hernandez.
Small
The present vs future conundrum

+ New FanPost All FanPosts >


Sexy People

Wbc_029_small Jeff Sullivan

Small Matthew