Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Login-facebook
Around SBN: PHOTOS: Mike Moser's Dunk Face Is Spectacular

Seeing the World Pitch by Pitch

INTRODUCTION
Back in February of 2007, I had an informal exchange with John Beamer over J.J. Putz over the merits of Putz's 2006 improvement and whether it was repeatable (scroll down to No. 5). Unsurprisingly, most projection systems did not think so, but I surmised that was because they failed to grasp the driving forces of J.J.'s transformation, the splitter he picked up from Eddie Guardado which completely changed his profile as a pitcher. In literally the blink of an eye J.J. went from just another hard-throwing fastball guy with average control and no secondary out pitch, to a bona fide two plus-pitch pitcher.

While statistical research has done wonders into breaking down the hitting side of the game, including the aging and development curves that apply well to a broad number of hitters, the pitching (and individual defense) side are still lacking. Most notably, the aging and development curves on pitchers (and defenders) is all over the map. Pitchers do not follow a standard curve of development as hitters do. Instead, it's more of a series of plateaus, something that resembles a step-wise function for those of you out there keen with math.

ERA is still the measurement de jour used to evaluate pitchers. However, most of us have come to realize that ERA is not a very good way to quantify a pitcher's contributions. For one, it is incredibly team-dependent, in that a pitcher's ERA is largely influenced by the defense behind the pitcher and we all now generally agree that the pitcher bears little control on a batted ball once it is put into play (the DIPS theory).

For that reason, and several others, we moved onto to other measurements, eventually settling on the three true outcomes: strikeouts, walks and home runs. Those three categories served us pretty well in evaluating pitchers and formed the basis of the DICE, FIP and dERA statistics. Why? Well, they correlated much much better on a year-to-year basis than ERA, suggesting that they are less influenced by random noise and better reflect a pitcher's innate ability.

However, I hope that we can find something better, or at least more complete, than this and so I turned to mining Retrosheet and MLB's Gameday for pitch-by-pitch data. One of the things that we can do right away with this data is analyze how the outcomes are reached. For example, a pitcher's strikeout ratio is highly influenced by his percentage of swinging strikes generated (R^2 around 0.62) while a pitcher's percentage of called strikes alone bears almost no statistical influence on the pitcher's strikeout ratio (R^2 around 0.03).

 little bit of difference here  

Star-divide

AVERAGES
First off we'll cover the average pitcher broken down in two ways (SP vs RP and LH vs RH) across the years involved (2003-7) to provide a typical pitcher profile. Afterwards, we'll look to see if there are any interesting trends from 2003 to present.

I break up the data in two ways, by pitch and by plate appearance and house it all in one table, what I coined the IPORT (for "Individual Pitch and Outcome Report Table") back in 2007. For every pitch thrown there are one of five possible results:

-batter takes for a ball
-batter takes for a strike
-batter swings and misses
-batter swings and fouls it off
-batter swings and keeps it fair

Of those pitches that fall into that last category, balls in play (incl. home runs for now), there are four possible categories for the type:

-groundball
-line drive
-flyball
-infield pop

Individual Pitches

For every (read: 99.99%) plate appearance, it will end in one of six ways:

-batter strikes out
-batter draws a walk
-batter is intentionally walked
-batter is hit
-batter hits a home run
-batter puts the ball in play (obviously not incl. home runs now)
-catcher interferes (we toss these out b/c the pitcher isn't involved)

Outcome Reports

INTERESTING TRENDS?
What's fascinating to me is how little variation there is across all five years. With more years at our disposal we would be able to ferret out some interesting investigations and that is also one of my goals is to extend the coverage of this back to 1989 when Retrosheet first introduced pitch-by-pitch data. The reason I don't right now is because I have concerns over how they marked their batted ball types pre-2003.

Percentage of PAs resulting in a strikeout by RH SP
15.4, 15.5, 15.2, 15.4, 15.6

Percentage of PAs resulting in an unintentional walk by RH RP
7.9, 7.8, 7.7, 7.6, 7.8

Percentage of pitches resulting in a groundball by LH SP
9.1, 8.8, 9.0, 9.0, 8.7

Percentage of pitches resulting in a swinging strike by LH RP
9.4, 9.4, 9.6, 9.6, 9.5

Oh, you wanted actual interesting trends? Oh, hmm... well K/uBB looks like it might be increasing at a snail's rate across all four categories. BABIP might be inching upwards. There's just not much there. The state of the underlying game has been incredibly static for the last five years.

FUN STATS
Ball% vs CalledStr%
If the batter chooses not to swing a pitch, how often is the pitch called a ball compared to called a strike? 68.89% are called a ball.

Foul% vs (LD%+GB%+FB%+IF%)
If the batter swings and makes contact, how often is the pitch fouled off compared to kept fair (incl. home runs)? 47.07% are hit foul.

CONCLUSION
The end goal of all of this will be to identify, out of the myriad of trackable statistics, the key indicators that best indicate future performance. As mentioned, swinging strike percentage is a terrific indicator of future strikeout rates and can help us in determining if a pitcher's raw ability to miss bats is improving or not. That way, we can identify real gains in talent, like Putz's, from lucky stretches like Jarrod Washburn's higher-than-normal strikeout rate in the middle of 2006.

This is all going to take a long while to sift through and run models on, the details of which will interest almost none of you, so for now I wanted get some good and/or interesting referential data in your hands. Future posts dealing with this data will link back to this original writeup and will also feature "IPORT" in the subject line.

Comment 21 comments  |  0 recs  | 

Do you like this story?

Comments

Display:

BABIP
I noticed two things about the listed BABIPs.
  1. Relievers have a lower BABIP than starters, which jives with their lower LD% allowed.
  2. For both starters and relievers, RHP had slightly lower BABIP than LHP.  Could this be because they face more same-handed batters?
So my real question is, are either of these observations meaningful?  Or is it just noise/randomness in a too-small sample size?

by patsfan on Mar 11, 2008 1:20 PM PDT reply actions  

I'd have to run a t-value test to be sure,
but these are actually large samples so I assume the difference is statistically significant.

by Matthew on Mar 11, 2008 1:36 PM PDT up reply actions  

2 runs contrary to some of the
DIPS amendments for change-of-speed lefties from Moyer to Barry Zito who consistently run BABIPs under the expected value.

If I had to guess at why this RH-have-lower-BABIP-on-aggregate phenomenon exists, allow me to put forward the Parrish Postulate, which states that teams give playing time to left handers who are manifestly not major league caliber.  

If that's true, we'll see a little glut of LHPs at the tail end of the BABIP distribution.

by marc w on Mar 11, 2008 2:05 PM PDT up reply actions  

According to The Book
there are a lot more bad righties than bad lefties in MLB, which is why as a whole, left-handed batters enjoy a greater platoon split than right-handed batters.

by patsfan on Mar 11, 2008 3:37 PM PDT up reply actions  

To head off a couple possible questions:
-The ERA vs FIP chart goes back to '89 because that was the original extent of my data before I axed all the pre-2003 stuff. ERA and FIP hold though because it doesn't involve batted ball types.

-There's no datapoints on '99 or '00 because Retrosheet doesn't have pitch-by-pitch data for '99

-The x-axis on the bivariate fit graphs are swing% and called% relative to league average. The y-axis is absolute k%.

by Matthew on Mar 11, 2008 2:04 PM PDT reply actions  

Anyone know?
If there's a site that has swinging strike percentages available to the public?
Kickass Sports Writing - Sportszilla and the Jabber Jocks - www.sportszillablog.com

by Sportszilla on Mar 11, 2008 4:41 PM PDT reply actions  

Baseball-reference
under Pitch Data Summary has it, but you have to multiply StS% by Strk% to get it in the units that I use.

by Matthew on Mar 11, 2008 4:58 PM PDT up reply actions  

Matthew's diaries
Make our website community look a lot more intelligent.
...and now I'm here

by CapSea on Mar 11, 2008 4:50 PM PDT reply actions  

although i have this bad habit
of seeing graphs on this site and instantly tuning out.  

I know it must be really interesting and helpful, but my brain says 'no'

I fucking hate you Mariners

by kentroyals5 on Mar 11, 2008 7:36 PM PDT up reply actions  

Given that during the season
there's a graph every day, it's a wonder you've stuck with us this long.

by Jeff Sullivan on Mar 11, 2008 7:43 PM PDT up reply actions  

On the correlation graph
is each data point the correlation of ERA:ERA and FIP:ERA from on year to the next or ERA:ERA and FIP:FIP from one year to the next? and the data points fall on specific years, not between them, the 89 value of ~0.5, is that 88:89 or 89:90? Just trying to clear things up for myself a bit.

by Bearskin Rugburn on Mar 12, 2008 7:56 AM PDT reply actions  

Cool.
what about the correlation, is it ERA:ERA, FIP:ERA?

by Bearskin Rugburn on Mar 12, 2008 9:08 AM PDT up reply actions  

Can you link any articles that correlate
low FIP to pitcher 'success'? It must have been done at some point, but it seems that switching from ERA to FIP simply because FIP doesn't change as much year to year is not a great reason. It makes great sense of course, but it has to be shown.

Alternately, since FIP is scaled to ERA the numbers should theoretically be the same for any given pitcher given a neutral defense and no contracts signed in blood. has there been any research on what happens the next year to pitchers whose ERAs outstrip their FIPs in any given season? (I'm thinking of guys like Zito and Maddux on the low ERA, high FIP end and, I dunno, Jeremy Bonderman on the other side).

by Bearskin Rugburn on Mar 12, 2008 9:55 AM PDT up reply actions  

It reads like you're coming
from the basis that ERA defines success. It doesn't; not even close. What's "success"? You cannot link low FIP to "success" unless you define "success" to be something tangibly different.

"Alternately, since FIP is scaled to ERA the numbers should theoretically be the same for any given pitcher given a neutral defense and no contracts signed in blood."

I have no idea what you mean here. What numbers?

by Matthew on Mar 12, 2008 10:22 AM PDT up reply actions  

No need to be defensive
I know ERA doesn't define succcess, but something has to. What do we mean when we say a pitcher is good? That he wins a lot of games? That the team wins on days he starts? That he has a low ERA or FIP? At some point, the numbers have to relate to winning ballgames, right?.

What I'm asking for in the second part of my comment is whether over a player's career, ERA will end up about the same as FIP. Since FIP is scaled to ERA, and is supposed to represent true talent level, then given a neutral defense and luck (what I meant by contract signed in blood, ie deal with the devil) one would expect this to happen. By numbers I mean ERA and FIP.

Again, I'm not questioning the rationale behind DIPS, it makes great sense and I buy it. I just wonder if it has been shown statistically to have relevance.

by Bearskin Rugburn on Mar 12, 2008 10:58 AM PDT up reply actions  

I'm not defensive, I'm trying to understand
what you're asking and where you're coming from because it's not intuitive for me.

I would define success as performing the best at the aspects you have sole input in, which for pitchers means measurements including: Ks, BBs, HBPs, HRs, Batted Ball types, and a few other minor things. I feel that (for now) FIP is the best measurement of that. In other words, FIP is what I use to measure success.

As for the second part, the entire reason the addition factor in FIP exists is to make it line up with ERA. So given a large enough sample, yes, ERA and FIP will converge, but that doesn't mean anything.

by Matthew on Mar 12, 2008 11:34 AM PDT up reply actions  

I think I was the one getting defensive
But I'd like to keep on this tack for a bit longer, and feel free to ignore the questions cause I'm just trying to understand this better and its not your job to tutor me in SABR.

Yes, being a good pitcher means doing the things that FIP measures, so can I see how there value in a statistic that is predictive of itself. But take a case like Jeremy Bonderman. His ERA has been consistently above his FIP by a certain factor, and while there are plenty of things that can influence ERA out of a pitcher's control, in his case most of those factors should go in his favor. He's played in front of some terrific defenses, in a large home park, and the sample at this point is large enough to be significant. Is he an exception to the rule (ie that for 99% oof pitchers ERA will regress to FIP given enough time) or are there guys out there who cannot harness their skills and translate them into results? And conversely guys like, I dunno, Moyer (I think Bannister is trying to do this tooo), who don't have the skills but can make the most of what they got?

by Bearskin Rugburn on Mar 12, 2008 12:56 PM PDT up reply actions  

There's always going to be outliers
and you have to free yourself from using ERA as an anchor.

by Matthew on Mar 12, 2008 1:02 PM PDT up reply actions  

Bunts
This is great.  Seems like you might want to add bunt fair, bunt foul to the swing type and pop up bunt and grounder bunt to the result types.

That's just off the top of my head, so maybe not.

by KENtastic @ Lookout Landing on Mar 14, 2008 1:44 PM PDT reply actions  

Comments For This Post Are Closed


User Tools

By reading a game thread of your own volition you agree to accept all liability for any and all damage done to your delicate sensibilities.

FanPosts

Community blog posts and discussion.

Recommended FanPosts

Moar_bacon_small
Everything I Know About Jesus Montero

Recent FanPosts

Wbc_029_small
Friday Morning Music Thread
Small
OTDOD - Early February Edition
Agentejebaox3_small
A Statistical Analysis of Mariners' Fan Support
Small
Who will have a better season?
Claw_small
BA's Top 10 M's Prospects
Wbc_029_small
Friday Morning Music Thread
Small
Munenori Kawasaki Predictions!!!
Small
The Longevity and Future Success of Felix Hernandez.
Small
The present vs future conundrum

+ New FanPost All FanPosts >


Sexy People

Wbc_029_small Jeff Sullivan

Small Matthew