Lookout Landing: An SB Nation Community

Navigation: Jump to content areas:


Sports blogs for fans, by fans.
Around SBN: SB Nation NFL Power Rankings for Week 11

Pitching Components Year-To-Year

We strive to find measurements in pitching that give us high correlations year to year because that helps tell us the measure of pitching that are more stable and stability is what you need in order to do projections. If a pitcher's win total had a year-to-year correlation of exactly zero (it's close), then the best case that we could do for projecting 2008 win totals would be to assign the same, league average, number of wins to each pitcher. That's not very helpful. In fact, it's not helpful at all. Conversely, if we knew that a pitcher's groundball ratio had a correlation of exactly one (it's close) then we could predict 2008's groundball ratios accurately by simply using 2007's totals. That's incredibly useful.

These disparate examples show the full range of possibilities when it comes to projecting future performance. At one extreme, we can use 100% of the pitcher's own totals (i.e. 2007) and ignore everyone else. At the other, we are forced to use 100% of the average and completely disregard what the individual pitcher accomplished.

It's important to know which category each stat falls into and not only to just focus on the ones with high correlation, but also to remove the taint of the stats with low correlation. This is what we call regression. If our numbers shows that a pitcher's strikeout rate has a year-to-year correlation of 0.75 (actual number), that means that for projecting Pitcher X, we would take his 2007 strikeout rate * 0.75 and add in (1 - 0.75) * league average 2007 strikeout rate to arrive at our projection for his 2008 strikeout rate.

Naturally there are far more complex things that go into most projections such as aging curves, park factors and the like, but at their base all projection systems anchor themselves in these regressions in some form or another. So, wouldn't you like to know what those regressions are? Of course you do. To find them out, I went through my pitch data I pulled out all the pitches from starting pitchers who faced at least 200 batters and threw at least 1,000 pitches in a season of which there were 827 such seasons. I then grouped them by player by consecutive year so that the data would only be counted if a pitcher appeared in both year x and year x+1. Here's the results:

Star-divide

There's some pretty important factoids up there. At the very bottom we see why ERA is a terrible measurement. That low correlation goes to show how little control a pitcher has over it. Ditto BABIP (no surprise) and ditto HR/FB which might come as a bit more of a surprise especially if you noticed how incredibly stable GB% is, but as the plot below shows, there appears to be little to zero correlation between a pitcher's GB rate and his HR/FB ratio.

Since HR/FB isn't stable, it's no surprise that HR isn't either. Walks and strikeouts are fairly predictable, but not as good as balls and missed bats which is what leads me to say that if we wanted to construct a finer version of the three true outcomes for pitchers, it would be balls, swinging strikes and groundballs. It dovetails both with the data above and what's logical to me; that the three things we want most in a pitcher is to throw strikes, miss bats and keep the ball on the ground. Most everything else is out of his control.

0 recs  |  Comment 15 comments

Story-email Email Printer Print

Comments

Display:

Never die.
This stuff is awesome.  I am curious though ... across the majors, what is the percentage of strikeouts that are swinging and called?  
And to think, I could have chosen to support the Yankees or Red Sox...

by EnglishMariner on Mar 21, 2008 3:56 PM PDT reply actions   0 recs

Not quite
I haven't looked at the end cause of strikeouts, just total pitches swung and missed at or called for a strike.

League-wide, it's ~30% of strikeouts are on called strikes.

by Matthew on Mar 21, 2008 4:08 PM PDT up reply actions   0 recs

Anyone else think it's strange
That BABIP is so decoupled from LD%?

by Graham on Mar 21, 2008 4:04 PM PDT reply actions   0 recs

oh yeah
odd.  

This is YTY correlation for pitchers, right?  So it could be that guys who play in front of great defenses (esp. FB pitchers) have great BABIPs despite fluctuations in LD%.

I wonder what happens if you only look at guys who switched teams.  

by marc w on Mar 21, 2008 4:13 PM PDT up reply actions   0 recs

I've been talking to Matthew about this
Basically our thought is that the high variance in BABIP isn't pitchers not having the skill to prevent hits but rather a combination of that and high volatility in team defensive ability year to year, which goes pretty strongly against the current sabremetric grain.

We're trying to figure out ways of testing this idea. I'm looking forward to seeing what it yields.

by Graham on Mar 21, 2008 4:17 PM PDT up reply actions   0 recs

Volatility in defensive ability
is a huge issue.   This'll be really important stuff...  

Still, I'd guess that teams that are able to keep 3 OFs together would show SOME stability, at least as far as FBs are concerned - the M's from 2001-2003 come to mind.   It'll be cool to see this by batted ball type.

by marc w on Mar 21, 2008 11:06 PM PDT up reply actions   0 recs

Basically the plan
Is to build exBABIP for a pitcher based on his batted ball lines and overall team defensive ability, then compare it to actual BABIP.

The only problem I can see here is that fielding line drives isn't really a consistent skill...

by Graham on Mar 22, 2008 5:20 AM PDT up reply actions   0 recs

Yes, this is YTY
stay tuned for an upcoming study where I'll look at pitcher's BABIP broken down by batted ball type compared to the team's BABIP broken down by batted ball type.

That should eliminate any defensive bias and if DIPS is right, will show little to no correlation year to year.

by Matthew on Mar 21, 2008 4:18 PM PDT up reply actions   0 recs

I have no idea what to expect.
I'm fearful of it showing any significant correlation.

by Matthew on Mar 21, 2008 4:26 PM PDT up reply actions   0 recs

I think it'd be cool
It's not like it'd discredit DIPS entirely, it'd just show that there's more to measuring pitching than assuming they have no control over BABIP.

by Graham on Mar 21, 2008 4:28 PM PDT up reply actions   0 recs

who in the fuck
allowed >30% of fly balls to leave the park and when did he pitch for the Mariners?

by Mere Tantalisers on Mar 21, 2008 6:45 PM PDT reply actions   0 recs

Also, the HR/F
is really, really surprising. Considering how little variance there is to begin with and the tendency of that number to creep toward 11 (11.6 to be precise, right?)... I'm stunned that its at the bottom of the list.

by Mere Tantalisers on Mar 21, 2008 6:48 PM PDT reply actions   0 recs

Comments For This Post Are Closed


User Tools

By reading a game thread of your own volition you agree to accept all liability for any and all damage done to your delicate sensibilities.
Start posting about the Mariners »

Join SB Nation and dive into communities focused on all your favorite teams.

FanPosts

Community blog posts and discussion.

Recommended FanPosts

Max_small
FootbaLL IV (Nov 21st)
Small
The Sabermetrics of Cats

Recent FanPosts

Laughing_man_elmex_small
AFL Championship game thread
Smell-the-glove_small
OFFTOP 11/19 - The Hand of Fraud
Small
Tony Blengino interview on 710 AM ESPN Seattle's Hot Stove League
Ichirocameron_small
OTFPOTD 11/16: Recipes Edition
Small
Lookout Landing as a communtiy
Small
Royals determined to unload Callaspo...
Ichiro_small
Ackley and Triunfel on MLB TV right now
Small
Xbox LLive, Pt 3

+ New FanPost All FanPosts >


Sexy People

1_small Graham

Small Matthew

Small Jeff