Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Login-facebook
Around SBN: Dissecting Nick Diaz's Positive Drug Test

Probability vs. Certainty

Jeff's note: this is required reading. Go over it as many times as is necessary to understand that everything - everything - we do deals with probability, and that nothing is black and white.

Picture the following:

Two friends are arguing over the likely outcome of a set of ten coin tosses.

One declares - quite sensibly, since these coins are known to be fair - that he expects said coins to be distributed evenly.

The other, more radical in thought, feels that they'll come up all heads but one.

The coins are flipped. Nine heads, one tails.

'Ah', says the second friend, quite happy, 'You were wrong. That's why we flip the coins!'

Star-divide

Pretty silly, right? Everyone knows that, cheating aside, coins have a 50-50 split between heads and tails. This obviously isn't to say that every time that 10 coins are flipped, the result is even. If we flipped 10 coins 10,000 times, we would instead see a distribution that looked something very similar to this curve (except it would be jagged):

This is known as a probability density function, and understanding what these are is vital when assessing the strength of a prediction. A PDF essentially gives the expected probability for a whole random of outcomes. In this case, it's a bell curve, but they can of course look much more complicated, and even bell curves can look very different because of what's called variance (essentially the spread of the data). Obviously, friend #1 was wise to predict 5/5, even though if you look closely a true 50/50 split will only occur 24.6% (252/1024) of the time. 9/1? 1% (10/1024). Using PDFs, you can say things like 'the number of heads is 62% likely to be 5 or less', which would be completely accurate even if it didn't turn out that way the next time you ran the experiment.

'What does this have to do with anything?', you ask yourselves. 'I hate numbers!' a heckler in the back calls out. 'Please won't someone love me?' a strange young boy cries.

Well, the thing is, every half-competant baseball analyst is in the business of thinking in terms of these PDFs. No, things are never going to be as simple as a coin toss - these are athletes playing a sport, not random numbers dancing around a spreadsheet - but that's not a prohibitive barrier with all the research that goes on these days. PECOTA? It's not giving a number, it's giving a curve. That's where things like 10%, 50%, and 75% levels come from. You've got a team's Pythag predicted accurately? Great, then you can generate a PDF and say that they have something like a 70% chance to be within +/- 4 games of that.

We don't deal with certainties when we look at this game. Sometimes, it comes across that way to people who've never seen stuff like this before and then have PECOTA dropped on them, but it's really not true at all - 'I don't think the Mariners have a high chance of making the playoffs' is NOT the same as 'We're not going to make the playoffs'. And it's not just analysts who do this - everyone who thinks about the future does, albeit subconsciously. What do you think a scout is doing when he's evaluating prospects his team might want to pick up in the draft?

Anyway, here's the crux of the matter.

We work in probabalistic terms. This means that when you tease a single number out of us, it's going to be our best guess and will probably be wrong. This does not mean that the prediction curve itself is complete bollocks (although sometimes this is in fact the case).

'That's why we play the games' is not an acceptable response to an argument about probability any more than 'that's why we flip the coins' is. An argument against a prediction must be conducted against that prediction's assumptions, rather than with a 'Well we'll just wait and see what actually happens then' because that's just not how probability works. Challenge the mechanism behind the prediction, not the expected outcome. I'm not saying the situation is as black and white as my hypothetical argument between friends, but invoking that statement means that 2 doesn't really understand that 1 was NOT stating with 100% certainty that 5 coins would come up heads.

He was, after all, perfectly correct even though he was wrong. The illusion of certainty is a ghost that many people would do well to stop chasing.

Comment 255 comments  |  1 recs  | 

Do you like this story?

Comments

Display:

You are a very smart and eloquent man.
I wish there was a pocket version of this post that I could carry around with me at all times so that when I try to explain these concepts to people I could make a lick of sense.

Maybe Baker will link to this. I doubt it, but maybe.

--aaron c.

by Aaron Campeau on Feb 9, 2008 2:00 PM PST reply actions  

Yes, exactly.
"You know, as that was coming out of my mouth, I knew that it was wrong."

by JI on Feb 9, 2008 2:04 PM PST reply actions  

Curse you Grey-Ham!
"Kruger! My son tells me, your company shtinks!"-Frank Costanza.

by Coach Owens on Feb 9, 2008 2:21 PM PST reply actions  

Ah.
The strange young boy mentioned in the post makes an appearance.

by Graham MacAree on Feb 9, 2008 2:23 PM PST up reply actions  

You'd think this would be common sense.
But it's not. Football Outsiders correctly argued this last Super Bowl was one of the largest mismatches ever, but when the Giants won countless people took that as proof that their system doesn't work. Or, worse, is fabricated. I hate to be intolerant or elitist, but some people seem, if not incapable, at least retarded from thinking in anything but absolutes. I fear you cannot simply explain away that weakness.
You can ching ching ching cash in on this tragedy.

by John Morgan @ Lookout Landing on Feb 9, 2008 2:30 PM PST reply actions  

Totally agree there
That said, the arrogance of the Statistics crowd on certain sites around this fine internet has become pretty nauseating.

I see where they're coming from, but I also definitely see why the average fan would tell them to go fuck themselves and enjoy some baseball.

by Garces on Feb 9, 2008 2:33 PM PST up reply actions  

It's the average fan's fault
for engaging the statistics crowd in conversation.

And most of us rather do enjoy watching baseball.

by Jeff Sullivan on Feb 9, 2008 2:39 PM PST up reply actions  

There's "fault" on both sides
the "average fan" should know what they're getting into, but in a lot of cases, the statistics crowd can come off as sounding so sure of themselves that everyone else is automatically not just wrong, but a lesser fan because of it, and that's where I have a disconnect.

Like I said below, posts like this are great because they attempt to educate with the data - which is a whole different process from presenting data and saying "this is right, and if you don't believe it you're wrong".  Which is a factual statement, from a data perspective, but it's a bit off-putting from the "average fan" perspective.

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 9, 2008 2:43 PM PST up reply actions  

That's true.
The numbers crowd can definitely have an air of haughtiness from time to time. (I wonder if this isn't in part due to the fact that there's an inverse relationship between scientific/numerical aptitude and the ability to write.)

Where I have a problem is when people come to places like this or USSM and blast us for using too many numbers. That's like waltzing into a Rufus Wainwright show and complaining that he sings too much.

by Jeff Sullivan on Feb 9, 2008 2:55 PM PST up reply actions  

In the academic community...
(and I don't meant to get too obtuse) this same thing plays itself out between economists and every other social scientist at conferences.  The economist is the stats guy.  He explains his position--often, though not always, in a belligerent fashion--and then loses his patience and gives everyone the academic iteration of the bird.

For what it's worth, I happen to like economists and stat-heads.

by skijake1 on Feb 9, 2008 3:17 PM PST up reply actions  

Hey,
Speaking as a Soc. major, watch your mouth, or I might just have to go symbolic-interactionist on your ass.

by Liebkartoffel on Feb 9, 2008 3:30 PM PST up reply actions  

I don't even know what that means
But I assume it's some sort of threat.

My army of bio-robots will be knocking on your door shortly.

by Graham MacAree on Feb 9, 2008 3:31 PM PST up reply actions  

Really?
What significance do you attach to bio-robots? Is it socially appropriate for them to be "knocking at my door" or are they a fundamentally deviant construction? Your choice of the word "door" intrigues me. Do we both attach the same significance to this symbolic door? What does it symbolize--keeping you out or me in? What other barriers are there between us in our relationship as blog commenters? Are there other shared symbols that we employ to further define said relationship? What are they? Is it a dynamic or static relationship? What about blogs? What types of norms and collective behaviors have evolved in blog culture? Does your behavior change within the concept of this blog? Or toward Jeff, the officially established authority figure? What about unofficial authority figures? Where do you feel you fit in within the blog's hierarchy? Do you feel that you're treated differently due to your status? Why?...And I could go on.

Qualitative analysis, bitches--helping mathematically challenged people sound smart for the past two centuries.

by Liebkartoffel on Feb 9, 2008 3:56 PM PST up reply actions  

I'm going to give them anthrax cannons
At least you're not a philosopher, though. My last argument with one involved me laying out an argument for 5 minutes, and him replying with:

'So define "it".'

And then I drank some gin.

by Graham MacAree on Feb 9, 2008 3:59 PM PST up reply actions  

What a cop out.
The standard philosophy argument toward anything confusing sounding or "sciency" is to reply:

"Yeah, but how do you know? Isn't knowledge entirely dependent on perception? I mean, when you really think about it, couldn't you just be a figment of my imagination?"...

At which point you punch them in face and respond with:

"Yeah, well, how'd you perceive that motherfucker?"

by Liebkartoffel on Feb 9, 2008 4:10 PM PST up reply actions  

He was Scottish
The last time I've seen a Scottish person angry, he kicked down the front door of a house and then pissed on a policeman.

Never punch a Scotsman in the face.

by Graham MacAree on Feb 9, 2008 4:45 PM PST up reply actions  

Ah,
Well, the implied corollary is that all bets are off when it comes to Scots.

by Liebkartoffel on Feb 9, 2008 5:22 PM PST up reply actions  

We philosophers get a bad rap
Speaking as a professional philosopher I can promise you that most of us aren't like this.  As for the economists, I love what they do so long as they remember that they aren't engaged in a normative discipline and they stick to making predictive models.  Economists are too quick to use words like "rational" innapropriately, implying that what is rational is simply what maximizes ones own preferences.  In short, economists, keep making models leave the normative stuff to philosophers, that's our turf.

by Uncle Ted on Feb 10, 2008 11:00 AM PST up reply actions  

Social Sciences
tend to disguise simple concepts with large words.  I'm sure you'd understand it.

by Edgar for Pres on Feb 9, 2008 4:36 PM PST up reply actions  

Of course I would
But I'm an engineer and therefore chronically lazy.

by Graham MacAree on Feb 9, 2008 4:37 PM PST up reply actions  

Correction:
Social sciences describe various phenomena that the general populace take for granted but never bother to identify because they're chronically incapable of describing their or other's behavior.

...Or did I just prove your point?

by Liebkartoffel on Feb 9, 2008 5:16 PM PST up reply actions  

Symbolic Interaction!
God, this blog has got it all!

(Is also a Sociology major)

by tworsandtwols on Feb 9, 2008 4:43 PM PST up reply actions  

Nice.
Three cheers for Erving Goffman!

by Liebkartoffel on Feb 9, 2008 5:30 PM PST up reply actions  

Fuck you.
Any time you want to act like an idiot while drunk, I'll point to expectancy theory and laugh at you.
...and now I'm here

by CapSea on Feb 9, 2008 3:59 PM PST up reply actions  

The problem with that logic is that you
set yourself up to be taken advantage of.  I had a stats prof who said a very profound thing that has always stuck with me.  

"Say you a flip a coin and it comes up heads.  Ask the person you are working with what the odds are that the next flip will be tails.  If they say anything greater than 50% do not do business with them.  They will inevitably fail in business based on their inability to understand probability and independent results."

by Sec 108 on Feb 9, 2008 2:44 PM PST up reply actions  

Higher math is not common sense
I have never grown up thinking in numerical terms - my mother was a Spanish teacher and my dad had a Ph.D in psychology, and worked in the public school system most of his life.  So, while I'm OK with basic math, things like this are by no means common sense or easily intuited by a lot of people.

Which is why posts like this are so valuable - the concept is elusive, but if you sit down and think about it it does make good sense.

And this applies especially to sports fans - for most people, sports is an ESCAPE from things like numbers, probabilities, and bell curves.  They craft a sports narrative that fits the heroic model more than it does the mathematical model, and when presented with the math side of things, they refuse to believe it because it doesn't fit their narrative.

Me, on the other hand, I largely choose to ignore the mathematics - but that doesn't mean I don't value them, or want to learn more about them than what I currently know.  But, most people, as you say, deal in absolutes, so there's no room for alternative narratives in their world.  This doesn't make them retarded, nor is it a weakness; people enjoy things up to the level that makes them happy, and that's fine.  More power to them.

Rather than take the intolerant/elitist approach, point your non-stat friends to posts like Graham's, and some of the USSM stuff - don't demonize, educate.  

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 9, 2008 2:39 PM PST up reply actions  

I'd definitely bring them to LL
But I wouldn't link them to USSM unless I wasn't planning on being their friend for very much longer.

Bringing statistics and probability into the game DOES enhance the experience in the way that rooting for players on your Fantasy Football teams does or throwing down cash at the sports book enhances the viewing of whatever your betting on.

Which reminds me: I need to gamble more often!

by Garces on Feb 9, 2008 2:43 PM PST up reply actions  

I think that first line
Might just be the clunkiest ever sentence written.

I blame the California sun.

by Garces on Feb 9, 2008 2:45 PM PST up reply actions  

I always love getting the line
about how numbers detract from a person's ability to get into the game.

Whenever that comes up, I just point the other person to the broken IKEA chair.

by Jeff Sullivan on Feb 9, 2008 2:58 PM PST up reply actions  

Agreed
It's a huge boost to the enjoyment of baseball, more than any other sport.  As silly as win probability is, it's like a bowl full of skittles.

Nobody can resist a bowl full of skittles.

by Garces on Feb 9, 2008 3:00 PM PST up reply actions  

The two advanced stats
I don't ever really follow are pythag and win probability.  Pythag, because I don't get the point of it (and I'll go dig up the post where I asked the question, I don't want this to get into what value Pythag has), and win probability, because frankly, when the M's are down 7-1 in the 7th inning of an away game, it doesn't take a rocket scientist to know that they have a good chance of actually losing that game, so WP doesn't really add much to my enjoyment of the game because quantifying that probability is just depressing.  

But then again, I was always more an M&M guy than a Skittles guy anyway.

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 9, 2008 3:04 PM PST up reply actions  

Something we can all agree on
The peanut butter M&Ms were a mistake.

by Garces on Feb 9, 2008 3:06 PM PST up reply actions  

a-frickin-men
those things were bad.
Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 9, 2008 3:07 PM PST up reply actions  

Both of you had better
shut your motherfucking mouths right now. Them be fighting words.
These pretzels....are making me thirsty!

by Goose on Feb 9, 2008 4:17 PM PST up reply actions  

I think Win Probability is more fun after a game
when you're able to look back on a chart and see how much it mirrored your emotions.

And peanut butter M&M's are delicious. (This is NOT a new diary topic.)

by Jeff Sullivan on Feb 9, 2008 3:10 PM PST up reply actions  

You should have a
'buy author some peanut butter m&m's' button
I fucking hate you Mariners

by kentroyals5 on Feb 9, 2008 3:24 PM PST up reply actions  

Here's the thing though...
You don't need proficiency in math to understand Graham's point. You need only to understand that something can be predictable without being certain. I've met so, so many people who can't wrap their head around that idea. I wholly disagree that it's not a weakness, too. To me, this is exactly the type of logic that allows someone to drink and drive, succeed, and thus conclude that they are somehow capable of drinking and driving when others are not.
You can ching ching ching cash in on this tragedy.

by John Morgan @ Lookout Landing on Feb 9, 2008 3:10 PM PST up reply actions  

It's uncertainty.
people seemingly can't abide uncertainty.  If they don't know the answer to something, they tend to pick one and then believe it firmly, regardless of whether that opinion is supported by the evidence.

Plus, by holding an opinion, they're introducing a confirmation bias into their future discovery of evidence, so even if they learn the truth later they might not believe it.

This is how religion works.  People don't know how the universe functions, so they believe a story that explains it all even though there's no good reason to reach that conclusion.

Statistical probability goes completely over most people's heads because it measures the uncertainty.  Since they can't accept that uncertainty ever exists, any attempt to measure it must therefore be fabrication.  If the outcome of the Superbowl happened as it did, then there must be an explanation as to why, and a typical person will choose an explanation that fits the outcome regardless of whether its supported by any data.

When Billy Beane said "my shit doesn't work in the playoffs", his point was that the outcome of the playoffs, from his point of view, was unexplained variance.  BP's Joe Sheehan has put it better by saying, "In a short series between two playoff calibre teams, any result is reasonably likely."  But since both of these contain explicit recognition of uncertainty, the average person doesn't actually understand what's being said.

Given that the average person doesn't understand any line of reasoning where the outcome is uncertain, is it any surprise when they don't understand the point Graham is making here?  And isn't it scary that we let them vote?

by Llewdor on Feb 9, 2008 10:32 PM PST up reply actions  

That's the attitude right there
Given that the average person doesn't understand any line of reasoning where the outcome is uncertain, is it any surprise when they don't understand the point Graham is making here?  And isn't it scary that we let them vote?

Just because people may not understand that line of reasoning is no reason to be arrogant.  You're not a better person than me just because you grasp that concept faster than I do.

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 10, 2008 8:00 AM PST up reply actions  

It's not arrogance if I'm right
Furthermore, if I grasp a concept faster than you, then I am better than you (at grasping that concept).

This is demonstrably true.

by Llewdor on Feb 10, 2008 11:36 AM PST up reply actions  

Yes, it is
You can be right without being superior about it.  Comments like "isn't it scary that we let them vote?" are just assholish.  

As I said, just because you may be better than me at grasping a particular concept does not make you a better person than me, nor does it allow you to judge my ability like you do when you say "isn't it scary that we let them vote?".

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 10, 2008 12:40 PM PST up reply actions  

Being a "better person"
is a meaningless designation without a more detailed definition.

As for the voting comment, that's only assholish if you think voting is some sort of fundamental right that should never be withheld from anyone (so my suggestion that it be withheld is abhorrent).  But I certainly never espoused that opinion.

Since one of the more important jobs of a government is to act as an economic steward, and most economics relies on just these sorts of probabilistic measurments, allowing people who can't understand the math behind the decisions to influence the decisions is crazy.

by Llewdor on Feb 10, 2008 1:25 PM PST up reply actions  

That's interesting
allowing people who can't understand the math behind the decisions to influence the decisions is crazy.

So in your world, only an educated elite should have any say in the direction of a representative democracy?  Interesting.  

Or, put another way:  If you don't know how the engineering of a combustion engine works, should you be allowed to operate a car?

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 10, 2008 5:01 PM PST up reply actions  

Not disagreeing with you.
But that's the completely wrong argument. He's arguing that you shouldn't allow people that don't know how an Internal Combustion Engine works to design one.

Extrapolating, he's (probably) saying that economists should run the economy. The equivalent argument to the car you brought up would be "noone but economists should be able to have or use money".

by Faux on Feb 11, 2008 9:11 AM PST up reply actions  

Maybe I misunderstood
but when he said:

allowing people who can't understand the math behind the decisions to influence the decisions is crazy.

I read that as saying if I don't understand higher economic theory/mathematics, I should not be allowed to have a say in how those theories get applied (as in, I should not be able to vote).  The car thing was kind of a stretch, but what i was thinking was that it is like driving - when I drive, I need to make decisions about proper speed, acceleration rate (when passing), braking, and the like.  

I have no idea how, when I press the gas or brake pedal, the speed increases or decreases; I just know it does.  And the fact that I don't know how the speed increases does not make me any less competent of a driver than my brother-in-law, the mechanical engineer, who probably does understand.

Voting's the same way - I don't need to know how to apply economic theory in order to be able to cast a vote.  Does it help?  Maybe.  But should it be a requirement?  Never.

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 11, 2008 10:03 AM PST up reply actions  

Not to step on toes
But I just think those are two fundementally different analogies. The way I see it, to stick with your car one:

-Lets say you happen to own stock in Car Company X (CCX). The board of directors recently decided they want to come up with a brand new clean-energy car line, but they're undecided as to which clean-energy to pursue so they're going to put it to a shareholder's vote. You have the right, in this case, to vote on the decision, but should you?

Or, to put it another way, you say:
when I drive, I need to make decisions about proper speed, acceleration rate (when passing), braking, and the like.

But you did go to some sort of school for that. Either driver's ed or possibly you were taught by a parent, whatever. There was training involved. We don't let people who haven't passed a driving test drive, so why do we let people who haven't passed an economics test vote on economic policy?

I think that's more of what he's saying.

by Matthew on Feb 11, 2008 10:13 AM PST up reply actions  

Yeah, I think I get it
but I tend to get a little sensitive when I start hearing ideas that would restrict people's rights to vote.  We let people who haven't passed an economics test vote on economic policy because it's a fundamental right of all Americans to be able to vote - with no preconditions or tests.

You could extend that argument even further, and say that if you aren't a trained educator, you shouldn't be able to vote for your school board, or if you haven't studied the law you shouldn't be able to vote for judges; before long, you're back to the "landed gentry" concept where only a small percentage of the educated elite (read: rich white men) are entitled to cast a vote.

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 11, 2008 10:21 AM PST up reply actions  

I don't think judges should be elected
but that's a different argument, and not one we need to have here.

by Llewdor on Feb 11, 2008 10:31 AM PST up reply actions  

I agree, actually
but as long as they are, I want to be able to have a say.
Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 11, 2008 10:43 AM PST up reply actions  

Yeah, it's a "slippery slope" problem
but I don't think that makes the argument/opinion any less valid, though naturally that's my opinion and I've never put much faith in the voting process in modern America.

by Matthew on Feb 11, 2008 10:40 AM PST up reply actions  

Don't mistake my fervent defense
as an endorsement of how elections work.  In this country, they largely don't.  But I don't think that's the fault of the electorate and its ignorance as much as it is a fault of the system and its entrenchment of the status quo.
Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 11, 2008 10:44 AM PST up reply actions  

I'm not willing to concede that the people
generally make better decisions than the landed gentry would.  The problem with the landed gentry was that they had no incentive to make decisions that benefitted anyone but themselves.

So on one side you have a small group of knowledgeable people making decisions, while on the other side you have a much larger group making uninformed decisions.  I can't honestly say that the democratic option looks superior.

by Llewdor on Feb 12, 2008 10:12 AM PST up reply actions  

I would say if you are part of the landed gentry
then go with the oligarchy.  If not, go with democracy.

Also, just because many people choose not to educate themselves (not everyone, but American Idol does destroy CSPAN in ratings), doesn't mean the theory of democracy isn't good.  You probably know that about 1/2 the population votes and the more educated a person is, the more likely that person will vote.  I believe the key to a well functioning democracy is successfully educating and empowering the population.

by Jed MC on Feb 12, 2008 10:26 AM PST up reply actions  

It's not about better or worse
it's about the ability to choose for yourself.  If the right to vote is suddenly taken from me, for whatever reason (I didn't take physics, I'm not a lawyer, whatever), why should I trust that some anonymous person has my best interest in mind?  

The problem with the landed gentry was that they had no incentive to make decisions that benefitted anyone but themselves.

This is what I like to call "human nature", and it would b ethe same today as it was then.  This is the best argument for allowing everyone to vote that there is.  If only one group is allowed to express opinions, of course those opinions will benefit themselves and themselves only; if every group has that right, the effect is diluted to a point where everyone seems willing to live with it.

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 12, 2008 10:31 AM PST up reply actions  

In an ideal world.
I still cannot fathom why election day is not a national holiday.

by Matthew on Feb 12, 2008 12:17 PM PST up reply actions  

an anonymous person
like say, a delegate or elector?

I know it's pedantic, but I find it humorous how interchangeably the term democracy is used for our republic and how we're a lot closer to an oligarchy than most people realize.

You vote in a primary, to elect delegates to the convention, who, depending on the state, may not even be required to vote per the primary results, which in turn selects the candidate, who goes into the national election (only if it's one of the two main parties) where the public votes again, but for electors who then actually vote to elect the president, and again, they may not be bound to vote as the public in their state did.

Toss in voter fraud and the screwed up system where we only have two parties, nomination within those parties are partly (I'd argue mostly) controlled by the media (see: debates) and lobbyists/PACs and really, are we anything close to a democracy?

by Matthew on Feb 12, 2008 12:26 PM PST up reply actions  

We're pretty damn close
on the city/county/state level.  Presidential elections are not close.  The easy way to fix that is to lose the electoral college entirely, but then there's still the media/lobbyists/PACS to deal with; but abolishing the electoral college would be a good start.

As for elections being a national holiday, it's a great idea - I'd go a step further, and do what Australia does.  Australia requires their citizens to vote, or it's a $100 fine.  They can vote for whoever they want, but they are compelled by law to do so.

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 12, 2008 1:33 PM PST up reply actions  

god no
I'd vehemently oppose being required to vote.

by Matthew on Feb 12, 2008 1:36 PM PST up reply actions  

On what basis?
(not being snarky, just genuinely curious)

If you vote regularly anyway, what difference would it make if there were a law making it compulsory?  And if you don't vote, you would now have an economic incentive to do so.

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 12, 2008 1:40 PM PST up reply actions  

Because to me,
not voting is a choice same as voting for candidate X.

Forcing people to vote is the same, in my mind, as stripping people of the right to vote.

by Matthew on Feb 12, 2008 1:47 PM PST up reply actions  

Interesting point.
Never really thought of it that way.  maybe if "none of the above" were added to the ballot, or something...
Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 12, 2008 1:50 PM PST up reply actions  

Martial law
Also, some educated people don't vote because they realise their single vote has no marginal impact on the outcome, so the non-zero opportunity cost associated with voting is necessarily too high.

For example, in the US presidential race, if you live in a winner-take-all state that always goes Democrat, it simply does not matter for which candidate you vote.  If you vote Democrat, the Democrat wins all the electoral college votes.  If you vote Republican, the Democrat wins all the electoral college votes.  No change; your vote made no material difference.  So why did you bother?

by Llewdor on Feb 12, 2008 2:07 PM PST up reply actions  

I wonder if having
"None of the above" mandatory on all ballots would have prevented the 2004 POTUS election from ever happening. The parties would have had to pick new candidates after the first poll post-convention that would have come back like:

Bush: 20%
Kerry: 20%
Neither: 60%

by Matthew on Feb 12, 2008 2:14 PM PST up reply actions  

I have argued in the past
that people should only be permitted to drive if they've completed a highschool physics course, and thus have an understanding of friction and inertia.

Whenever I'm in Seattle I'm amazed at how many people spin their tires in the rain.  This can only be due to a failure to understand the difference between static and kinetic friction.

I would argue that your lack of awareness (I was going to say "ignorance", but you'd probably take that as some sort of attack) of how your car works does make you a less good driver.  Someone who knows how the vehicle works can improvise in unexpected situations.  If your car has traction control, for example, knowing how it works can help you a lot when your wheels are experiencing disparate traction levels.

by Llewdor on Feb 11, 2008 10:29 AM PST up reply actions  

There's levels of knowledge, though.
I know how my car works (and "ignorance" isn't too far off, but in this case it's a willful ignorance). I know, for instance, how fast it accelerates - I can't tell you the number, but I can tell you from experience that I can get through an intersection in a couple seconds, or whether, if I'm in the curb lane and I have to get into the left lane to make a turn by the next block, I can do that safely or not.

I know all this without having ever taken a physics course.  Even in high school, which was so long ago for me as to be irrelevant.  

If you spend any length of time driving, and are at least marginally aware of cause and effect, you can figure out how well your car works, and figure out where the limits of your car's performance are, from the experience of driving it repeatedly and frequently.  And you don't need a working knowledge of physics in order to figure it out -  you just need to be aware of what you're doing.  

Whenever I'm in Seattle I'm amazed at how many people spin their tires in the rain.  This can only be due to a failure to understand the difference between static and kinetic friction.

"Only" is a very restrictive word.  It could also be due to the person behind you being impatient and tailgating you, thus making you feel pressured into going faster than conditions dictate, and causing your wheels to spin; or it could be that you were on your phone and didn't see the light turn green, so you overcompensate (you shouldn't call while you're driving, but that's a different story).  Driving also requires reacting to other people.

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 11, 2008 10:42 AM PST up reply actions  

There is a huge difference between
knowledge and execution.

I always had trouble with physics, I don't know how to change the oil, and I certainly can't explain how the all-wheel-drive system of a Subaru works.  For nearly 4 years I lived miles down a dirt road in Vermont that was muddy, icy, snowy, sometimes all three, and I never got my vehicle stuck in any bad weather.

To relate this to baseball, how many major league pitchers can explain the physics of a curveball?  I don't know how smart Felix is, but do you think a class in physics would make him a better pitcher?

by Jed MC on Feb 11, 2008 11:37 AM PST up reply actions  

I think a class in physics
would make Mel Stottlemyre a better pitching coach.

Biomechanics would be even better.

by Llewdor on Feb 12, 2008 10:09 AM PST up reply actions  

Agreed, but that is a different argument.
Coaches and teachers should be better if they are educated in their field.  I would say biomechanics and athletics are part of the same field in a similar way that steroid research is in the same field as sports.

by Jed MC on Feb 12, 2008 10:17 AM PST up reply actions  

Very well written.
I am a true believer in statistics. The only problem I have when it comes to statistics in baseball is that there is no control. For statistics to be gathered properly, you need to run an experiment the same way every time and record the randomness in the result.

In baseball, nothing is done the same way every time. Pitchers, hitters, and fielders make adjustments for every situation (this happens as frequently as every pitch). This means the experiment is run differently every time and the results cannot be accurately distributed on a bell curve.

So, in other words, the statistics in baseball are too random to make any proper statistical analysis. This is why neither side of the issue is going to be correct in their predictions.

Therefore, this is why the games are played. The SABR community does make a good effort, though. I trust their numbers more than just some random predication from some random dude.

by Wilder83 on Feb 9, 2008 2:44 PM PST reply actions  

Yeah, we're not going to come close to perfect
But if SABR stuff does even a 1% better job than conventional wisdom, it's a worthwhile effort.

by Graham MacAree on Feb 9, 2008 2:49 PM PST up reply actions  

The SABR community
is definitely improving their ability to find a control. The closer they get, the better their statistics get.

And according to my statistics, the SABR community has done a 43% better job than conventional wisdom (+/- 8.5% margin of error).

by Wilder83 on Feb 9, 2008 3:00 PM PST up reply actions  

Agreed
They're great tool to enhance stat-based previews of teams, players, individual games, etc.

But Baker's got a point.  At the end of they day, they're just predictions.  Adopting an "I'm clearly right and you're wrong...so fuck off" position is extremely hubristic.

by Garces on Feb 9, 2008 2:52 PM PST up reply actions  

Meh
Me saying to the average fan, 'I'm right because I'm smarter than you' is clearly hubris.

Me saying to the average fan, 'Your argument doesn't hold water because of this, this, this, and this' (assuming good research and all that) really isn't.

What tends to go wrong is that people very familiar with all the work done in the blogosphere sometimes forget that not everyone has already read everything about a certain subject, so they'll just bring out a conclusion without siting the mass of analysis behind it.

I can definitely see how it could be interpreted as someone thinking they're superior and pulling numbers out of thin air, but that's never the intent.

by Graham MacAree on Feb 9, 2008 3:10 PM PST up reply actions  

Don't disagree with that at all
But if someone were to say that the SABR-community doesn't foster a contempt to whatever definition of 'average guy trying to enjoy baseball' (note: not me) that you have, then I would have to counter with a rousing "pfft."

by Garces on Feb 9, 2008 3:19 PM PST up reply actions  

Yeah, probably
But I actually find educating people really exciting.

What gets me is when people take carefully crafted arguments and throw them out of the window with 'pfft. Stat nerd. INTANGIBLES!'

And then I flame them.

by Graham MacAree on Feb 9, 2008 3:22 PM PST up reply actions  

Perhaps this is a better explanation
I for one actually like to watch baseball being played, so I hope we do continue to play games, and see lots of remarkable stuff that I wouldn't have bet on, but you know some interesting shit is going to happen.  Richie Sexson winning the game with a homer for instance.

by BlueBaron on Feb 10, 2008 6:36 AM PST up reply actions  

Wait a minute...
What was all this "one in a million" talk???
Copenhagen = WAYYYYY better than Paris

by MarinerintheDistrict on Feb 9, 2008 2:53 PM PST reply actions  

When you start giving me
the standard deviation for your predictions, I'll stop saying 'that's why they play the games.'

by Digger on Feb 9, 2008 2:54 PM PST reply actions  

"Knock yourself out"
Is a pretty prime example of what's being discussed here.

by Garces on Feb 9, 2008 3:05 PM PST up reply actions  

Here's the other point
That I think is often missed...not only will a projection system forced to spit out one number (like a win total for the 08 M's) most often be wrong, but when you deal with 30 different projections (for each team) of course a few of the results are going to end up at one end of the curve or the other.

Just because teams come "out of nowhere" or "crap the bed" doesn't prove there's a flaw in the system, just that there are way too many variables in play for anything approximating a totally accurate prediction.

Kickass Sports Writing - Sportszilla and the Jabber Jocks - www.sportszillablog.com

by Sportszilla on Feb 9, 2008 3:00 PM PST reply actions  

By the way, at least in a Bedard trade context
I think it's important to understand that all the team projection stuff that we've been spitting out is mainly for the purposes of explaining why we're against the move.

The fact that a lot us consider the M's the second-best team in the division isn't going to take anything away from the thrill of watching the season play out. I've enjoyed following longer shots. If you're incapable of separating numbers from your emotions, then you aren't trying hard enough.

by Jeff Sullivan on Feb 9, 2008 3:08 PM PST reply actions  

Well, clearly
But when I pop on to Baker's blog and see this:

Posted by Druv

6:06 PM, Feb 08, 2008
This will likely go down as the worst trade in Mariner history.

I start to wonder what a lot of these people are drinking, because it deserves its own blog.

by Garces on Feb 9, 2008 3:10 PM PST up reply actions  

Those people are insane
I think you'll find it's always best - in every facet of life - to ignore people who occupy the extremes.

by Jeff Sullivan on Feb 9, 2008 3:13 PM PST up reply actions  

This made me laugh:
And for the people who complain about graphs without absolute axis bounds:

Totally a middle finger.

by Garces on Feb 9, 2008 3:16 PM PST up reply actions  

DMZ might be mad to know
you hotlinked his image...

Now i can refresh this page thousands of times and put pressure on his server

by Corco on Feb 9, 2008 3:22 PM PST up reply actions  

He's always mad
But I should have tossed it on my photobucket account.

by Garces on Feb 9, 2008 3:27 PM PST up reply actions  

Maybe I just need to stop going to USSM
Jay R. Says:
I am starting to understand why the Mariners are profitable each year. The vast majority of their fans don't understand baseball and are just happy to see the pretty uniforms and the Moose.

msb Says:
[Oh my god. Well], here is an example of your casual Mariner fan:
"Adam Jones has a primadonna attitude who doesn't run the ball out, and doesn't seem to care. They need to platoon Mike Morse, he's a hustler who'll get to the ball."

I guess my point is...who cares?  Aren't they a fan of the same team?

I love the show Lost because I read all the theories, check the screen cap blogs, etc.  Someone else loves the show just because.

Am I more badass than that person just because I know more about Dharma and can predict the show that much more accurately?  All the other dude wants to do is just let it play out and enjoy the ride.  Where's the hostility coming from?

by Garces on Feb 9, 2008 3:25 PM PST reply actions  

i agree that they have just as much right as i do
to be a fan of the team... bu that doesn't change the fact that some of the shit they say makes me want to gouge my eyes out with an ice pick. Its not really hostility, they just make me want to argue my point more feverishly, and that can be misconstrued as hostility.

by thenatural on Feb 9, 2008 3:30 PM PST up reply actions  

Dude.
Don't ever get in a fight with an idiot. They'll drag you down to their level and beat you with experience.
Yeah, I'm an All-Star.

by MarinerintheDistrict on Feb 9, 2008 3:43 PM PST up reply actions  

Look man
The comment section of that blog is always going to be an absolute mess. Trust me, I spent about 5 hours moderating the Bedard thread last night. The readership is so high that they can't help but attact some bad eggs.

But at this point, it just seems that you're repeatedly going after them under the premise of... I don't know, what?

I realise that as someone who volunteers over there, I'm biased and all that, but I just don't understand the hostility on your end, even while I acknowledge that there have been some really stupid things said over there recently.

by Graham MacAree on Feb 9, 2008 3:30 PM PST up reply actions  

It comes from both sides
let's not pretend that the "average fans" who've been around USSM lately have acted like angels.

by Jeff Sullivan on Feb 9, 2008 3:30 PM PST up reply actions  

Side question
When did I become a pot-smoking hippie?

by Garces on Feb 9, 2008 3:36 PM PST up reply actions  

Dead meme?
Okay, I'm done sidetracking this thread.

by Jeff Sullivan on Feb 9, 2008 3:42 PM PST up reply actions  

My favorite thing to do
Is to sneak up on people who are jaywalking and can't hear my car.  They turn around and I'm idling a few inches away and there's a lot of this:

by Garces on Feb 9, 2008 3:45 PM PST up reply actions  

Can't deny that
It's like the Baker blog comments and the USSM blog comments are involved in some absurdly stupid civil war.

Thank God for this place.

by Garces on Feb 9, 2008 3:37 PM PST up reply actions  

Glad someone else mentioned this.
It seems to be very tit-for-tit at the moment.  If Geoff wasn't seeing to his missus this weekend, I am sure he would have some kind of response to the latest DMZ post about how we suck.

It is all getting kind of silly between them, especially when you consider how high they both were on each other last year.  Ever since Geoff has started to get into low-level numbers, there seems to have been tension between the two.

And to think, I could have chosen to support the Yankees or Red Sox...

by EnglishMariner on Feb 9, 2008 3:54 PM PST up reply actions  

As Jeff mentions in this post
At LL, nothing is black and white.

At Baker's Blog it's all white, and at USSM it's all black, unfortunately.

Combine that with the [snotty comment about your spelling]s, or the STFU AND ENJOY THE GAME STAT GEEKS, it makes you ache for a beer diary.

by Garces on Feb 9, 2008 3:59 PM PST up reply actions  

I'm the one who does the []
You don't really want to see the posts that are under there.

by Graham MacAree on Feb 9, 2008 4:02 PM PST up reply actions  

I stopped reading USSM comments back in
July. It's not worth it.
These pretzels....are making me thirsty!

by Goose on Feb 9, 2008 4:16 PM PST up reply actions  

Here's the thing, though...
When you put in a lot of time and effort educating yourself about something, aren't you allowed to place more value on your own opinions than those of people less educated about said topic than you?

As an example, my main focus in college (before I dropped out to get the hell away from Olympia, so take this with a grain of salt if you so choose,) was urban studies/urban planning. It's something I have studied pretty intensively both in an academic setting and in my own free time. Now, I wouldn't consider myself an expert by any means, but I know my shit pretty well. And I am of the opinion that increasing traffic capacity in order to ease traffic congestion is the wrong way to go about things. I believe this because I have a lot of research and scientific study to back my position. There are people who disagree with me. Many of them are equally well versed in these things, and while I disagree with their position, I am willing to listen with an open mind if they give decent counter arguments and argue their position intelligently. I am not likely to take seriously the position of anyone who's assertion is that adding freeway lanes will ease traffic because, duh, it's just common sense.

My (rather long winded, yikes) point is that it's totally acceptable to dismiss the opinions of people who don't know nearly as much about something as you, especially when those people are using terrible grammar, insulting the intelligence of others, and refusing to acknowledge widely accepted knowledge with little or no evidence to back their positions up. And with as much crap as DMZ, Dave and the other, more well versed posters at USSM (and other places) it shouldn't come as a surprise that they're getting frustrated and testy. I would be, too.  

--aaron c.

by Aaron Campeau on Feb 9, 2008 4:19 PM PST up reply actions  

No, it's not
t's totally acceptable to dismiss the opinions of people who don't know nearly as much about something as you

Wouldn't it be better to educate them as to your point of view, and attempt to convert them, rather than dismissing them?

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 9, 2008 4:22 PM PST up reply actions  

To a point
and that's what the main page posts are for. But I think the authors have just run out of patience for the relentless onslaught of people who don't understand what USSM is talking about. It's too bad, but I hardly blame them for it. I'm actually surprised they lasted as long as they did before becoming dismissive.

If you have one person who doesn't understand a point, it's better to teach than to ignore. But if you have an ungodly number of those people, and if for every one you teach there are five more who don't know anything, it becomes really hard to stay motivated.

by Jeff Sullivan on Feb 9, 2008 4:52 PM PST up reply actions  

People must want to learn
before they can be taught.  If someone is insulting you for presenting a position then it is natural to be an ass back to them.

by Sec 108 on Feb 9, 2008 5:04 PM PST up reply actions  

There's also that
I think you'll find that USSM is quite good about responding to people who ask honest questions/raise intriguing points.

by Jeff Sullivan on Feb 9, 2008 5:05 PM PST up reply actions  

Absolutely
I'd say I've asked probably 10 questions of the authors in the comments and probably seven have been answered kindly by Dave or Derek. The rest are probably (understandably) lost in the shuffle. It's just a simple matter of sounding inquisitive and not dickish, and making sure you've exhausted some other options before asking a dumb question, like, you know, reading the post.

by Teej on Feb 9, 2008 5:30 PM PST up reply actions  

Okay, poor choice of words on my part.
What I mean to imply is that it is acceptable to not put as much stock into the opinions of those who are not as well educated as you are on a given subject, and if well supported, well reasoned arguments are continually dismissed or attacked without merit by them, it becomes acceptable to have a dismissive (and, depending on the person, somewhat smug) attitude towards that person and their opinions.
--aaron c.

by Aaron Campeau on Feb 9, 2008 5:09 PM PST up reply actions  

You analyze Lost?
Seriously?
Willie is ours, and you can't have him

by spittle8 on Feb 9, 2008 10:01 PM PST up reply actions  

No
But can it really be good? I mean, the previews...
Willie is ours, and you can't have him

by spittle8 on Feb 13, 2008 12:34 PM PST up reply actions  

Re
To be honest, I think even if you agree with everything that's ever been written on USSM the tone can still be a bit galling. I agree with them more often than not, but I find it impossible not to resent clicking a button with the words "This comment is free of spelling errors" every time I want to post anything and I'm in stitches by the time I've got through "USSM Orientation: a guide to having a conversation".

It's not surprising it irks some of the readership. If you want to put that in the language of statistics, they're committing a gross individualist fallacy by assuming that every commenter is a borderline illiterate moron on the basis of their experience with a few isolated users.

by anunderwaterguy on Feb 9, 2008 3:47 PM PST reply actions  

Clearly you read Baker's blog.
I think the worst part about that statement, is that it defeats the purpose of building a good team. "That's why we play the games" implies that it really doesn't matter who is on your team, or how you win, or if you win. It takes away the purpose behind watching a sport.
...and now I'm here

by CapSea on Feb 9, 2008 3:53 PM PST reply actions  

USSM is fascinating and all
but I'd appreciate it if we could keep this thread on topic. It's an important subject.

by Jeff Sullivan on Feb 9, 2008 3:58 PM PST reply actions  

The overuse of "dead meme" ...
should be a motherfucking dead meme.

by AK1984 on Feb 9, 2008 4:09 PM PST up reply actions  

The meme police, they live inside of my head
I'll probably still call dead memes out but not by that name.  Creative mockery is such fun.
Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 9, 2008 4:17 PM PST up reply actions  

Geoff Baker's blog attacked USSM
for not understanding why the game is played, or something. So it is somewhat on topic.
...and now I'm here

by CapSea on Feb 9, 2008 4:06 PM PST up reply actions  

I agree.
I was enjoying the discussion about math while I prepared estimates.  It made me think how I am forced to present pricing to customers differently depending on how they react to terms like added value percentage or variable cost data.

I honestly do believe that some people develop an inherent hatred for mathematical analysis at a young age.  As soon as you ask them to swim in those waters they shut down on you.  It is frustrating, but it is how it goes.

by Sec 108 on Feb 9, 2008 4:08 PM PST up reply actions  

When I was a kid
I loved maths and hated English.

Now maths is my weakest subject by miles. Strange.

by Graham MacAree on Feb 9, 2008 4:11 PM PST up reply actions  

I specifically chose work
That required me to do math in my head all day long.

It is possibly the only gift I have so I like to keep it sharp.

by Sec 108 on Feb 9, 2008 4:13 PM PST up reply actions  

I was always the irritating little snot
Who'd ask "why" after every basic math principle.

"So 2x5 is 10."

"Why?"

"It just is."

"Why?"

"Because that's how multiplication works."

"Why?"

"Listen, 2x5 is 10. Just trust me on this."

"You mean, there's only one answer? How boring"--at which point I'm mentally throttled by everybody else in the room.

by Liebkartoffel on Feb 9, 2008 4:23 PM PST up reply actions  

I have a friend
who's a professor of theoretical mathematics.  His entire career revolves around finding out exactly why 2x5=10, and what some alternate answers to that equation would be.  I tried once to ask what he did, and within about 30 seconds of him starting to answer I just kinda went "oooh, shiny!" and that was that.
Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 9, 2008 4:25 PM PST up reply actions  

I find it fascinating, in a way
but I couldn't even begin to know why anybody would want to be in that sort of field.
Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 9, 2008 4:29 PM PST up reply actions  

Because it's super badass!!
It's like the metaphysics of logic.
You can ching ching ching cash in on this tragedy.

by John Morgan @ Lookout Landing on Feb 9, 2008 4:59 PM PST up reply actions  

My brother's a
Linguistics professor. At one point I made the mistake of asking him what the origin of the verb "to be" is. It took me half an hour to escape.

by Liebkartoffel on Feb 9, 2008 4:30 PM PST up reply actions  

I once went over the proof
of why 1+1=2, which is what the rest of arithmetic is based on really. Pretty fun, actually, and not as esoteric as one might think.

by Bearskin Rugburn on Feb 10, 2008 1:37 PM PST up reply actions  

It's not a hatred
(just speaking for myself)

it's a lack of understanding, combined with the occasional presentation tone of "you're not smart enough to understand this so I'll patronize you now" that gets me.

I understand that I'm not as smart in the ways of mathematics as a lot of people, and I'm entirely OK with that; I have other areas of expertise.  What I resent is when I'm made to feel like that lack of knowledge of statistics somehow implies that I'm not a fan in the "right" way.  

I'd rather have someone show me how to swim in those waters than have someone ask me to swim in them; it's a small distinction, but a critical one.

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 9, 2008 4:14 PM PST up reply actions  

Disagreeing with someone
No matter if they're completely right, never makes you a worse fan.

It might make you wrong, but that's neither here nor there on the fanship scale.

by Graham MacAree on Feb 9, 2008 4:17 PM PST up reply actions  

And that's the disconnect
that a lot of non-stat people seem to have; they take the disagreement entirely personally and seem to think that if they're disagreed with it's an attack on their fandom.  99% of my real-world friends are the non-stat variety of baseball fan, and most of them seem to think that stat sites and people tend to "talk down to them".  

And when I try to get people to look past that, and see the data, and show them posts like this, it works; then those people start to read the inane Baker/USSM blog war, and they're right back to square one.

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 9, 2008 4:20 PM PST up reply actions  

Agreed
A response to a totally valid, on topic, non-aggressive argument:


February 10th, 2008 at 9:22 pm
10Dave Says:
Do you even realize that the inputs that the simulations Derek is doing aren't based on Diamond Mind's projections? Any problems you have with its engine have nothing to do with this. If you disagree with the expected performances, you're taking objection to the ZiPS projections, not DMB. And you haven't given us any reason to believe that your opinion is more credible than ZiPS.
And that's just one of your many errors (others - assuming `07 run distribution is true talent level, ignoring Guillen/Wilkerson dropoff, ignoring remarkable health of position players last year, not regressing expected performances from Johjima, Ichiro, Burke, and ignoring the weakness of the bench). Not bad for two paragraphs.

I get the argument that they get a lot of crap, so they're defensive.  I just don't think that's it.  I think it's just that they enjoy being pricks.

by Garces on Feb 11, 2008 1:25 AM PST up reply actions  

Great post
An enjoyable read, and worth bookmarking for referring to in the future, I suspect.

What's worth reiterating here (I know that it's hinted at but not specifically mentioned) is that this is just as relevent in micro situations, where a decision is made to do something on a play. That attempt might not come off, but if the probability of it coming out good was high enough then it was still the right decision

(It's late, and I'm struggling to come up with a good example - can you tell?).

Meanwhile, who'd have thought that, ten years ago when I studied Stats at school, the initials "pdf" would later be superceded in general vocab quite so strongly... for shame

Accidental Mariner - P3 W1 L2 (.333)
Sponsor of Jamie Burke's baseball-reference page

by MarkE on Feb 9, 2008 4:14 PM PST reply actions  

Indeed
Arguably this is a much easier thing to put into practice. There are literally millions of influences which can affect how the results of a trade pan out, but in micro situations, whilst we still can't account for everything, there's such a massive data set that it's possible to do probability calculations with a high level of accuracy. The bunt would be a good example of where this should be used rather than the more traditional method of resorting to unverified heuristics concerning the strategy a particular situation merits.

by anunderwaterguy on Feb 9, 2008 4:24 PM PST up reply actions  

I like Bacon
I hate to advocate drugs, alcohol, violence, or insanity to anyone, but they've always worked for me.

by wadswerth on Feb 9, 2008 4:23 PM PST reply actions  

That's why we play the GAMEZ, fellas!!
And here I thought we learned our lesson when Felix took himself out of the game in his 3rd start last year.  We're no more invincible than the Angels or the Red Sox.

We still have yet to see Bedard reach +200 innings.  I think he can, but it's not a lock.  The negative projections are largely due to the fact that there are more things working against this team than for it.

Still, good or bad, the Bedard trade is such an important trade, one that I still lean in favor of.  I believe it is still possible for this FO to construct a strong team in a short amount of time (yea, talent evalutation is an issue) while signing Bedard to a reasonable or even overblown extension since this team will be perceived as competitive.

But since we "play the games", how bout giving us a +48 homerun season, King Awesome. :)

by ThundaPC on Feb 9, 2008 4:45 PM PST reply actions  

I think one of the biggest steps
projections need to take is to try to do a better job of defining the uncertainty in their projections. I know some try to do it but I dont' really have a clue if its accurate at all. It will be a difficult step but would be very interesting to see. Which projection would you rather see 80 +/- 6 wins or 79 +/- 2 wins? The uncertainty tells you a lot. When it comes down to it, I bet our systems right now are probably much closer to +/- 6 wins than +/-2 wins.

by Edgar for Pres on Feb 9, 2008 4:46 PM PST reply actions  

They don't even predict wins
They predict pythagorean wins, which is another uncertainty.

by Graham MacAree on Feb 9, 2008 4:48 PM PST up reply actions  

Yeah but the run estimates they come out
with are still the main source of error I'm pretty sure.

by Edgar for Pres on Feb 9, 2008 4:53 PM PST up reply actions  

Well
maybe, I'm not totally sure about that though.

by Edgar for Pres on Feb 9, 2008 4:53 PM PST up reply actions  

What I meant is
Standard deviation for wins given a certain pythag is something like 4.8, which is probably greater than the error in predicting the pythag in the first place.

by Graham MacAree on Feb 9, 2008 5:01 PM PST up reply actions  

Pythag
I would have thought also that it's quite difficult to give an explicit uncertainty value when it comes to pythagorean wins. You can, of course, have a standard deviation across the whole of MLB, but if you take the perspective that a particular team's roster construction has the ability to skew pythagorean predictions (the bullpen is always given as the example in that regard) then team's have the potential to be rules unto themselves.

Put another way, you can come up with an uncertainty value for a number of teams in the same analysis "x% of teams will fall within a particular range of their pythagorean win-loss record", but making explicit statements of uncertainty about a specific team could be problematic due to the high potential for outliers related to roster construction.

by anunderwaterguy on Feb 9, 2008 5:07 PM PST up reply actions  

Its totally reasonable to think
that there is an inherent uncertainty associated with using the pythag to calculate wins. Whenever you have an equation with constants (that aren't exact) there will be uncertainty added into your answer.

by Edgar for Pres on Feb 9, 2008 5:11 PM PST up reply actions  

I also think he's assuming
that the error associated with the pythag is probably mostly random. For all the hope that there are ways to consistantly outperform your predicted pythag record, there is very little data to back up this point.

by Edgar for Pres on Feb 9, 2008 5:13 PM PST up reply actions  

Example
An example would be to contrast pythagorean wins with the uncertainty levels in opinion polls. In an opinion poll you have a percentage figure for a particular population which you attempt to measure through the use of a sample. In that situation the properties of the population are largely irrelevant to the accuracy of your measure: so long as you avoid any sampling bias and so long as you sample the necessary number of people, you can be 95% certain that your results are accurate within a particular range.

Pythagorean wins are quite different. If you accept that win-loss records cannot be entirely reduced to the two variables of runs scored and runs allowed (and only the most dogmatic of pythag supporters would argue against this) then the properties of a particular team - whether these are intentional or otherwise - have the potential to alter the accuracy of your measurements. Consequently, I'd imagine it's a little more difficult to come up with an accurate uncertainty figure for pythagorean win projections, than it is with other statistics.

by anunderwaterguy on Feb 9, 2008 6:07 PM PST up reply actions  

Uncertainty from equations is a little
different than uncertainty from polling. If the pythagorean equation is accurate and unbiased then you should be able to associate an uncertainty. You can derive the pythag eq from just assuming general runs allowed and runs scored distributions and get something of the form of the pythag equation (I don't know where but I think this is true). This should make the pythag pretty accurate or at least it roughly models reality. Also, the average pythag win comes out to .500 which is good. It should be fairly simple to come up with how accurate it is. All you need to do is do (Pythag Win% - Actual Win%) and then find the standard devaition for all teams in the past 20 yrs of baseball for example. This should be a big enough sample to get an idea of how accurate it is. So yeah, you can't derive an uncertainty like you can when doing polling but that doesn't mean you can't find the uncertainty.

by Edgar for Pres on Feb 9, 2008 6:30 PM PST up reply actions  

I'm not anti-pythag
I was just musing that it's probably more difficult to come up with accurate measures of uncertainty than it would be for other statistics. It isn't difficult to do if you assume that win-loss records are entirely reducible to RS and RA, but it's a little bit of a catch 22 situation when you start talking about uncertainty. If you assume that deviations from pythag aren't just random circumstance, then you'd have to conclude that the properties of teams (other variables which aren't included in the pythag calculation) can affect your measurements to at least some extent. That would make calculating an uncertainty value a little more difficult because the two variables used in the calculation (RS and RA) would affect individual teams differently based on their own unique properties.

Looking at the standard deviations would allow you to make statements about the general accuracy of pythagorean projections, but at the individual level there's at least the potential for it to break down if a team's specific properties isolate it from more general trends. Anyway, this is a bit of a pedantic point because as you say, the average pythag win% comes out at .500, but it's worth raising nonetheless.

by anunderwaterguy on Feb 9, 2008 6:57 PM PST up reply actions  

I know what you mean
If you looked at the distribution of (Pythag Win% - Actual Win%) and it was normally distributed around .500 then I'd say its probably random error. If it isn't then there is something funky with the equation and we could probably do better. That is one of the tests I use with an equation when I want to see if it fits reality because usually in my experience random error leads to a bell curve.

by Edgar for Pres on Feb 9, 2008 9:10 PM PST up reply actions  

So you're saying if I give you
a set of RS/RA data that is perfect, the stdev for the wins the pythag would give me would be just under 5 wins. Thats pretty shitty. Not surprising, but still pretty bad.

by Edgar for Pres on Feb 9, 2008 5:09 PM PST up reply actions  

Site comment
I hope the next version of this site will allow users to "favorite" posts and diary entires.  This one would definitely make the cut.

(Bookmarking is tedious and because I access the web from a variety of locations it isn't that useful.)

by ryanhealy on Feb 9, 2008 5:47 PM PST reply actions  

Back on topic...
...sometimes the system turns out to not have been constructed with fair 50%/50% coins after all, and you have to go back and figure out why.  The model would have been correct if the coins WERE fair, but since it didn't realize we were using coins weighted 55% towards heads, the results consistently skew in that direction.

Metaphorically, I'm trying to outline the impact of a freaking ridiculous bullpen on these projections.  I'd like to see how data comparisions after this season line up with data comparisons on the 2007 season... how dominant was the bullpen in each year, and how much did we beat Pythag in each year?

I guess I don't look at 2007 as the 90th percentile on its curve.  There are a lot of reasons to distrust that - Sexson died, Lopez died, the defense died, and the rotation died.  You're telling me only 10% of possible scenarios were better than that (generic you, btw - not anyone in particular)?  I think that Pythag is a great tool, but very approximate... and the approximation was particularly bad on the 2007 M's.  I'm not a baseball stats expert, so consider this all hot air if you want... but I'd imagine the 2007 M's being somewhere between 45%-65% on whatever the 'accurate' bell curve should have been.

Any constructive thoughts on the theory appreciated.

World Series or bust!!!

by KingCorran on Feb 9, 2008 6:12 PM PST reply actions  

I'm not an expert by any means.
But my response to you would be this; the 2007 M's were an anomaly. Just as there are times when a fair coin will turn up heads ten times in a row, there are bound to be times, given all of the variables that occur over 162 games, that the Pythag will be off by a fairly wide margin. But it can still be a decent method of evaluating a team.

Just for fun, I went through and looked at the M's Pythag vs. actual W/L record for every year of the franchise's history. The two numbers were exactly the same three times, +-1 nine times, +-2 four times, +-3 four times, +-4 three times, +-5 twice, +-6 twice, +-7 three times, and +-9 once. So the Pythag W/L record was within 4 wins or losses 74% of the time. 74% isn't perfect, but it's close enough that I think it's incorrect to call the system busted based on the biggest gap between actual and Pythag W/L records over 31 seasons.  

 

--aaron c.

by Aaron Campeau on Feb 9, 2008 8:04 PM PST up reply actions  

I guess...
...I wouldn't call the system 'busted' so much as 'incomplete'.  We definitely didn't have a bullpen like that in all of those years.  

I wonder what would happen if we took a look at teams with bullpen profiles similar to the 2007 Mariners (300 ERA+ closer and/or whatever the bullpen's ERA+ as a whole was), and charted out their Pythag differences?  Is there a trend?  Unfortunately, I don't have the time or researching expertise to figure that out, but I'd be curious to see the results if anyone made that comparison.

I don't mind being proven wrong - I guess I just see results that don't make sense intuitively, and want to know why.

World Series or bust!!!

by KingCorran on Feb 9, 2008 8:18 PM PST up reply actions  

That's a fair question.
I find myself wondering if there is a constant among teams who significantly outperform their Pythag. If there is, it would be some interesting knowledge to posses. I'm not sure it would make me change my opinion about the usefulness of the method, but it could help explain the seemingly random noise.

I'm too inexperienced/lazy to do the research myself, but if someone ever tackled it, I think it would be fun to see the results. Of course, for all I know, this has been covered, and the longer-tenured staheads all think I'm an idiot.

--aaron c.

by Aaron Campeau on Feb 9, 2008 8:29 PM PST up reply actions  

Okay, this took some work, but here you go
(Data from 2004-2007)

Chart 1: Correlation between difference from Pythagorean record and bullpen strength (measured by total bullpen WPA). There is a relationship, here - stronger bullpens generally lead to outperforming the Pythagorean record a little bit. However, while the correlation is statistically significant, it isn't all that strong. It also doesn't explain the Mariners' huge gap (7.38 bullpen WPA, 9 game difference); by plugging 7.38 in for x, you get a y between 2-3, not 9. So the M's were both good and lucky.

Chart 2: Correlation between bullpen strength in one year and bullpen strength the next. Weak relationship. In other words, even though the Mariners had a great clutch bullpen in 2007, we can't really say much about how they'll do in 2008. Even if the group performance doesn't change that much, the timing of said performance might, which could make a world of difference.

Chart 3: Correlation between Pythagorean difference in one year and Pythagorean difference the next. No clear relationship at all. Beating your Pythagorean record doesn't appear sustainable.

by Jeff Sullivan on Feb 9, 2008 9:17 PM PST up reply actions  

Thank you sir,
Incredibly enlightening.  So in other words, bullpen may have been a factor - but almost certainly not all of it.

I still find it incredibly odd that the M's had so many bad performances, and still beat Pythag by so much.  Luck, plus a good bullpen, plus two earned run magnets (Weaver + Ramirez), plus something else?

At this point, I would expect some bullpen regression and luck regression, slight-to-modest improvement from Sexson/Lopez, and a large improvement in the rotation to put this team... well, right about 90 wins.  This year, however, I think that Pythag catches up to reality.  That more than anything else seems like the flukey part of this equation... while I don't trust pythag completely, it is generally a very useful tool.

Jeff, thanks very, very much for the insight.  I can't say I'm surprised to see that bullpen excellence can't explain the whole thing away... but neither am I surprised that it has some impact.  Cool stuff.

World Series or bust!!!

by KingCorran on Feb 9, 2008 9:25 PM PST up reply actions  

I think, more than the bullpen last year,
the biggest factor in our beating Pythagoras (outside of dumb luck) was the wildly inconsistent run distributions put up by Weaver/Ramirez/Feierabend/Baek/etc. They weren't putting up bell curves; they were either really good or really bad. This skewed the system in our favor. Unfortunately, it skewed it in a way that we're unlikely to repeat in 2008.

by Jeff Sullivan on Feb 9, 2008 9:30 PM PST up reply actions  

I agree with that
I bet SP WPA standard deviation (or SP R/GS stdev) is also probably correlated to pythag win% - actual win% pretty well.

by Edgar for Pres on Feb 9, 2008 9:42 PM PST up reply actions  

Thats very interesting
I'm going to think about this a little.

by Edgar for Pres on Feb 9, 2008 9:26 PM PST up reply actions  

I think basically
anything that is going to skew your runs distribution away from the "ideal" shape is going to mess up your pythag. The bullpen is one component but I bet blowouts are another big factor. If you don't have a blowout, that gives your bullpen a chance to gain some major WPA (high leverage) but we'd expect bad bullpens to lose as much as good ones win so you wouldn't really expect to see pythag to depend on RP WPA. I think why it does is because WPA doesn't take into account whether a SP or RP is in the game. Since RP tend to be better than a SP you would expect a RP to gain positive WPA on average whenever they pitched. Therefore bullpen WPA is probably well correlated to IP and leverage. If you have high leverage situations (close and not a blowout) then you gain alot of bullpen WPA. If you have a blowout and your bullpen has high IP then you also have more WPA. You might want to be a little careful. It might come back to the whole correlation/causeation business. Overall I'd expect a good bullpen to help a team outperform their pythag but I don't know if its as simple as we all want it to be.

by Edgar for Pres on Feb 9, 2008 9:39 PM PST up reply actions  

Help me
Love this topic. Here's my question about probability that I've always wanted to understand. I get that the odds of any coin flip being heads is 50%. What I don't understand is how that relates to the odds of a longer string. If every coin flip is 50/50 heads/tails, how does that relate to the odds of 100 heads in a row being different than 10 heads in a row? I know I'm missing something big.

by OldFan on Feb 9, 2008 8:32 PM PST reply actions  

Still a bit dense here.
I understand how you get the odds, but when you have the coin that has just flipped 99 heads in a row, on that 100th flip, my head explodes trying to reconcile the 50/50 of a single flip, with the 100th in a row.

Is it just a problem of sample size?

by OldFan on Feb 9, 2008 9:13 PM PST up reply actions  

Nope
The 99 heads are irrelevant at that point.  The question isn't "can I flip 100 heads in a row," but "now that I have flipped 99 heads in a row, can I flip one head in a row?"  The 99 do not affect the outcome of the last one.
World Series or bust!!!

by KingCorran on Feb 9, 2008 9:15 PM PST up reply actions  

The 100th flip
is independent of the 99 that preceded it.

I know this kind of thing can be difficult to grasp at first. It's why I highly recommend that everybody take an intro stats course, or at least read a probability book or two. This kind of thing is everywhere in life, and if you're able to understand it, you'll get a leg up on everyone else who doesn't.

by Jeff Sullivan on Feb 9, 2008 9:23 PM PST up reply actions  

But if nobody understands this
then Las Vegas and its myriad entertainments for the non-gambler like myself can continue to exist.  So, y'know, be careful what you wish for.
Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 9, 2008 9:25 PM PST up reply actions  

A slightly related question
Say the Mariners have lost 9 straight. Assuming they have a league average pitcher against a .500 team (Washburn vs. the Blue Jays, perhaps), would their result for the 10th game be independent, dependent or are there too many variables to put it so simplistically?
When life gives you scurvy, make lemonade.

by Mariner John on Feb 10, 2008 12:39 AM PST up reply actions  

I don't think it'd be completely independent
but the impact of the other variables would be small enough that their odds of winning would still approach 50%.

by Jeff Sullivan on Feb 10, 2008 1:07 AM PST up reply actions  

Here's where I question this...
The difference between flipping a coin and playing a baseball game is that the coin doesn't get fatigued and doesn't suffer injury.  I'd very much believe that coin flipping is very independent on the previous results, whereas a baseball game can very greatly effect the results of the next game.  A major injury to a key player can greatly reduce the chances of winning the game(s) immediately after that game.  Likewise, a washout that forces a double-header before a long road trip can wear a team down enough such that they can't perform to their optimal abilities in the game after the double-header.  

Certainly all teams have this "noise" and it'd be really hard to analyze the data to find anything statistically significant (because of the sample size issues that would likely be involved). But I'm very comfortable accepting that baseball games are clearly not independent of each other, and it's probably worth trying to analyze this further, even if the results aren't "statistically significant" due to sample sizes...  

Ill Ligitamus Non Carberendum

by PositivePaul on Feb 10, 2008 10:34 AM PST up reply actions  

Yeah they aren't independent of each other
but its pretty close.  If all the teams feel the same effects of fatigue then those will kinda roughly cancel.  I think one of the big ones would be playing Boston for 4 games and then going to play the Royals for example.

Almost no outcomes are independent but many (like baseball games) appear pretty close.  We also like to assume that each PA is an independent event even though they aren't really.  We do this because if you want to apply almost any statistics, the first assumption you usually make is that the events are independent of the previous/future events.  In baseball this is much more true than other sports which is why you have seen so much more sucesss with stats.

by Edgar for Pres on Feb 10, 2008 10:45 AM PST up reply actions  

Independent
Every game is independent from every other game during the season, but over the entire season things will even out. Whether you win or lose the 10th game, you will have another 10-game stretch where you win 9 of 10 or 10 of 10, assuming you are a .500 team.

The coin flip example is used to relate how each game is independent from the next. But in the long run, you have a 50% possibility with each flip. Baseball is a bit more complicated because there are so many variables that go into the game. Predicting wins is difficult and you really do not know how good a team is until after the season finishes.

The Mariners were a .543 team in 2007. Theoretically, this means the Mariners had a .543 chance of winning every game. But when the Mariners were playing that 10th game of a 9-game losing streak, there was no way to really know what kind of odds the Mariners had at winning. Factor in the numerous random variables that go into each game, I do not think you can place any odds on an independent game even knowing the overall winning percentage at the end of the season.

I think I have come to realize that the strength of SABR statistics isn't about calculating the probability of a team winning, it is about finding underappreciated strengths and skills of individual players that will help improve a team's unknown probability of winning. And even then, after collecting every player's statistics and running each team's rosters against each other, I do not think you can determine how well a team will perform because you are using random statistics to find a random result.

Forgive me for thinking out loud. I don't know if I even answered your question.

by Wilder83 on Feb 10, 2008 1:29 AM PST up reply actions  

I wasn't asking from Sabermetrics perspective
I was asking as a different example from the coin flip and as a sort of stats question. I was going through my mind wondering whether each game was independent of the others, which it should be in a vacuum. But I was thinking that the other variable (momentum, whether it exists or not, injuries, etc.) that could factor in to it. I was sort of thinking out loud too.

Your response was helpful, however.

When life gives you scurvy, make lemonade.

by Mariner John on Feb 10, 2008 1:42 AM PST up reply actions  

I think if your coin
comes up heads 99 times in a row, it might be worth investigating if its a trick coin.

by tworsandtwols on Feb 10, 2008 2:15 AM PST up reply actions  

Ok, so
The odds of flipping a fair coin 99 times and getting heads in all of them is a very small number (0.5^99) that we can call x.

Once you have flipped those 99 coins, the probability that the next one will be heads is 0.5.

The odds of flipping a fair coin 100 times and getting all heads is (0.5^100) or x/2.

Here's the reconciliation:

Consider the coin flipping exercise as navigating through a tree. I'll show you what I mean for the purposes of just a few flips.

That's just the tree for two flips. As you can see, you simply progress along each branch, taking 50/50 (or 0.5, as I've labelled it) chances at each node until you reach the end. Then, to get the probability of a whole chain, you just sum up the probability of all the branches it takes to get to that point (for example, since the HT/TH outcome can be reached two different ways, we just trace one branch back to the origin (0.5x0.5) and then add it to the other (0.5x0.5).

That's a pretty simple example, but you can expand on it very easily by just adding more flips or changing the probabilities assigned to each branch.

Fastforward to a 100-flip tree. You have just flipped the coin 99 times and gotten heads each time. The probability of this chain of events happenening (going back to the start) is x. The probability of the next coin being heads or tails has nothing to do with the chain - the branch probability still stays the same. However, if you do flip that coin, and it comes up heads, the probability of that chain happening was x/2.

Another way to think about it:

Someone tells you 'I have just flipped 99 coins in a row. They all turned up heads. What is the chance that I will get to 100 heads in a row?'

The answer is pretty clearly 50%.

Someone asks 'What is the probability of flipping 100 coins, and them all turning up heads?'

The answer then is x/2.

The difference between question 1 and question 2 is that 1 just has more information in it. We start very close to the top of the tree and only have to work our way through one branch, rather than all of them. There are very limited outcomes at that point (you're either going to have 99 heads or 100 heads), so the probability of each outcome has changed.

Does that help at all?

by Graham MacAree on Feb 10, 2008 3:37 AM PST up reply actions  

It does help
Two different answers for two different questions, asked at different times.

I appreciate the trouble you all took.

by OldFan on Feb 10, 2008 7:44 PM PST up reply actions  

You can also just work it out
for 1 toss: possible outcomes = H or T chance of 1 heads = 0.5 for 2 tosses: possible outcomes = H/H or H/T or T/H or T/T chance of 1 heads = 0.5 chance of 2 heads = 0.25 for 3 tosses: possible outcomes = H/H/H or H/H/T or H/T/H or H/T/T or T/T/T or T/T/H or T/H/T or T/H/H chance of 1 heads = 2/8 = 0.25 chance of 2 heads = 3/8 = 0.375 chance of 3 heads = 1/8 = 0.125

by Edgar for Pres on Feb 9, 2008 9:17 PM PST up reply actions  

Hmm, damn you HTML formatting
To make the above post readable.

for 1 toss:
possible outcomes = H or T
chance of 1 heads = 0.5

for 2 tosses:
possible outcomes = H/H or H/T or T/H or T/T
chance of 1 heads = 0.5
chance of 2 heads = 0.25

for 3 tosses:
possible outcomes = H/H/H or H/H/T or H/T/H or H/T/T or T/T/T or T/T/H or T/H/T or T/H/H
chance of 1 heads = 2/8 = 0.25
chance of 2 heads = 3/8 = 0.375
chance of 3 heads = 1/8 = 0.125

by Edgar for Pres on Feb 9, 2008 9:29 PM PST up reply actions  

Thanks
I kinda sorta get it. I think my hang up is in framing the question, King's point about it not asking about 100 in a row, but now that I've got the 99...I'm not sure I can ever make this intuitive, but at least I get the logic now. Thanks.

by OldFan on Feb 9, 2008 9:27 PM PST reply actions  

"Dice have no memory"
That's usually how the principle is described.

The coin doesn't know it's in the middle of doing something highly improbable, so the next outcome is unaffected by those circumstances.

by Llewdor on Feb 10, 2008 11:42 AM PST up reply actions  

Potentially Costly
Unfortunately, many (most?) people don't get this concept, and their behavior often reflects this.

A guy walks up to a roulette wheel and notices that red has come up 7 times in a row, so he plops down a large stack of chips on black.  The wheel spins, and a red number comes up.

The guy stomps off, bemoaning his bad luck.  After all, what are the odds of 8 reds coming up in a row?

Of course, pointing out to him that the odds were just under 50/50 once 7 reds had hit, might not be safest course of action at that point in time.

by HopHead on Feb 11, 2008 10:47 AM PST up reply actions  

All I have to say is that
none of you should try to learn statistical mechanics. Its a painful trip down a complex statistics journey but in the end you understand entropy and can describe almost all of the basic chemistry we see around us. Ok that was really nerdy.

by Edgar for Pres on Feb 9, 2008 10:35 PM PST reply actions  

Even statistician aren't that smart
http://sports.espn.go.com/mlb/news/story?id=3239471

Using ERA and WHIP to try to prove points.  Come on, you can do better than that.  You're stats professors.  If you look at ERA+ vs Age you'll see an R^2 = 0.0011.  

by Edgar for Pres on Feb 10, 2008 7:32 PM PST reply actions  

To be fair...
I can almost guarantee you that the professors who analyzed the report know next to nothing about ERA+. They were probably just told by someone that ERA and WHIP were good indicators of pitcher performance. Not that I'm condoning that, I'm just saying what I think happened. If I were told to analyze something in a field that I don't fully understand, I would probably either look around to see what seem to be considered important indicators or I would just look at the things that the person who commissioned me said were important.

But anyways, I don't think that the professors really showed anything. They did bring up the good point that the Clemens report basically cherry-picked statistics to prove its point, but I kind of feel like the analysis that they did suffers from the same problem.

The real moral of the story is how if you use selection bias, you can prove almost anything. Anytime you see a statistical report, it's important to see what assumptions were made in how the data was collected and analyzed to see if any selection bias is taking place. If there is a lot of selection bias that is not accounted for, then the data was most likely used to prove a point, rather than analyzed to determine what is actually happening.

by Jono on Feb 10, 2008 10:29 PM PST up reply actions  

Yeah I totally agree
I expected a little more from some stats profs though.

by Edgar for Pres on Feb 10, 2008 10:45 PM PST up reply actions  

There is more; lots more.
But there's a pesky problem of getting it published. Check out the full report first.

by Matthew on Feb 11, 2008 12:12 AM PST up reply actions  

And I can 100% guarantee you
that the professors who analyzed the report know everything there is to know about ERA+ and are better baseball statisticians than anyone here.

by Matthew on Feb 11, 2008 12:13 AM PST up reply actions  

There you go
You actually know the guys, so I'll certainly take your word for it. I was basing my assumptions from my experiences with stat professors, none of whom give a crap about baseball. I wish that at least one of them did...

I just stumbled upon the paper you did with Jensen regarding throwing ability. It looks interesting. I'll try to give it a read at some point today.

by Jono on Feb 11, 2008 8:07 AM PST up reply actions  

Good
That makes me feel better.

by Edgar for Pres on Feb 11, 2008 8:27 AM PST up reply actions  

I'm much more of a lurker here,
but I love the Mariners and beer just as much as you all.  I guess I'm what you all would label as a "casual fan" because I don't get all crazy about the numbers.  Though how bleeding blue for all of these years could label me as casual, I'm not sure.  I am in no way at all a math guy.  Any math that I can't do in my head just hurts my poor brain too much.  Though you would be incorrect to call me not smart because I can speak 4 languages other than English - my nugget is just wired more towards linguistics than numbers.  Anyway I stumbled on this site less than a year ago and I wanted to let you know that I have learned a shitload more about the game than I previously knew.  I'm glad you all do the numbers so I don't have to.

by coolguyrob on Feb 11, 2008 4:37 AM PST reply actions  

I noticed and replied
But I'm not going to get drawn out into an argument on unfriendly ground, so it's both fairly conciliatory and one and done.

by Graham MacAree on Feb 11, 2008 11:32 AM PST up reply actions  

<sigh>
We hear talk of Seattle's Pythagorean record of 78 wins last season and are told they should come back down to Earth. What about the stats that show that teams outperforming their Pythag one year are likely to do so again the following season? Why do I rarely hear those numbers brought up?

Because it's not true Geoff. That's why.

by Matthew on Feb 11, 2008 11:33 AM PST up reply actions  

Truth is overrated.
Much like freedom. And happiness.
...and now I'm here

by CapSea on Feb 11, 2008 11:39 AM PST up reply actions  

Slowly Geoff is wearing on me
because of how he reacts to posts here and at USSM.  Every Intro to Stats class starts with a discussion of coin flips.  Your post was an Intro level "Welcome to Statistical Analysis" type of thing and he really only seemed to focus on how coins flips are too simple to be applied to baseball.  

Of course coin flips are simple.  However, if you do not get the coin flip idea then you do not have a foundation of knowledge on which to build.  That foundation is necessary to analyze more complex data.

by Sec 108 on Feb 11, 2008 12:04 PM PST up reply actions  

I know
I even thought I'd pointed that out in my post.

At least he picked up on my point that you should argue the assumptions and machinery behind the projections rather than the output, even though I don't think he noticed that that was one of my more important points.

Ah well. At least it got read.

by Graham MacAree on Feb 11, 2008 12:07 PM PST up reply actions  

It's not even just entry-level Stats
I've done my share of theoretical work in Math and Computer Science and we still coach some ideas in those realms in terms of coin flips.

by Matthew on Feb 11, 2008 12:29 PM PST up reply actions  

Well...
He is, first and foremost, a professional newspaper writer.  He was hired on the following principle:
  1. Build readership
  2. ?????
  3. Profit!!!!!
So, yeah, he's pretty darn good at what he does.  
Ill Ligitamus Non Carberendum

by PositivePaul on Feb 11, 2008 12:38 PM PST up reply actions  

By no means do I think he is bad.
In fact I agree he is good at what he does.  I am just finding that he misses the point a good deal of the time which makes HIS opinions and analysis of less value to me.

by Sec 108 on Feb 11, 2008 1:02 PM PST up reply actions  

That is a little true.
When he argues something that is missing the point, it is difficult to take his analysis seriously, which is too bad because he clearly does his research.
...and now I'm here

by CapSea on Feb 11, 2008 1:48 PM PST up reply actions  

I love that people are actually looking for fights
where there aren't any.

I still like Geoff Baker. Sorry dudes.

by Jeff Sullivan on Feb 11, 2008 3:23 PM PST up reply actions  

Hey, he replied to the post
I like him too. Misinterpreting half of it >>> ignoring it, IMO. Yay Geoff.

by Graham MacAree on Feb 11, 2008 3:24 PM PST up reply actions  

makes sense
you're the guy that wrote Moneyball, and that book's a joke, so it follows that this would also be a waste of time.  

/*joe morgan

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Feb 11, 2008 3:13 PM PST up reply actions  

And with that comment
we can add smug and snarky to the list.  

Watch your attitude.  

by marc w on Feb 11, 2008 3:15 PM PST up reply actions  

Comments For This Post Are Closed


User Tools

By reading a game thread of your own volition you agree to accept all liability for any and all damage done to your delicate sensibilities.

FanPosts

Community blog posts and discussion.

Recommended FanPosts

Moar_bacon_small
Everything I Know About Jesus Montero

Recent FanPosts

Wbc_029_small
Friday Morning Music Thread
Small
OTDOD - Early February Edition
Agentejebaox3_small
A Statistical Analysis of Mariners' Fan Support
Small
Who will have a better season?
Claw_small
BA's Top 10 M's Prospects
Wbc_029_small
Friday Morning Music Thread
Small
Munenori Kawasaki Predictions!!!
Small
The Longevity and Future Success of Felix Hernandez.
Small
The present vs future conundrum

+ New FanPost All FanPosts >


Sexy People

Wbc_029_small Jeff Sullivan

Small Matthew