StatCorner.com

Graham's edit: We are aware that the AB and PA counts for both hitters and pitchers are off. We aren't entirely sure why, though, so fixing it is going to require implementing a couple of new tricks, which may take some time.

Matthew's subsequent edit: Stats for years before 2007 should now be fixed.

You've been asking for it and we've been working toward it. Four months in development and over two years in theoretical planning and it's finally here. I'm proud to announce StatCorner.com.

A big big thanks goes to Graham, who's helped a tonne (as he would spell it) with the front-end design and implementation, not to mention his creation of tRA and our joint effort to take a tediously hand-compiled spreadsheet and turn it into a dynamic database and web presentation. Here's what we have to offer.

-Daily updated tRA, as provided by Graham.
-Daily updated wOBA, as provided by Tom Tango.
-and many more core stats for both hitters and pitchers.

There's a lot of stuff yet to come, and the winter months promise some breathing time to implement some major new features, but we felt that now was a good time to release it to the public. It is stable (we hope) and through the end of the season, there will likely be only minor tweaks and feature additions.

In addition to the aforementioned big help from Graham, there are a few other people I have to thank.
-Tom Tango, for letting me publish wOBA.
-David Gassko, for his terrific work on park factors which I've totally cribbed.
-Jeff, for being a long time beta tester of not only the site but also of my ideas.
-Robert, who at some point will provide me with some data that was too dull for even me to compile.
-and finally, to Joel Pineiro, whose bad missed bat ratio in 2006 prompted me to look further into pitch by pitch results and to take the very first steps that have culminated here.

Comments

Question

I always wondered how sites like this, FanGraphs, etc. update every day. Does it have to be done manually, or can it be coded to take the info from elsewhere automatically?

by Fett42 on Aug 14, 2008 12:20 AM PDT

Coded.

For instance, it’s updating as we speak :)

by Matthew on Aug 14, 2008 12:26 AM PDT

Holy crap you guys are good at what you do

just one suggestion – seems like you have some LL-centric memes in your writing (the tRA intro post, for example, calling Matthew the Data Fairy). Might seem a bit more professional, for lack of a better word, to clean it up so non-LLers reading the site won’t think “wait wtf?” when they come across something like that.

The site looks fantastic though – it’s already been bookmarked

The ♥ Jose Lopez ♥ Watch - 146 H, 16 BB, 44 G Left

by seattlebruin on Aug 14, 2008 12:22 AM PDT

baller

Determined, Jonesing Commentor

by Corco on Aug 14, 2008 12:35 AM PDT

This is awesome. Congrats, guys.

I’m excited to be able to find tRA on my own, and that’s what I was expecting with this announcement, but I didn’t know you guys were going to bring wOBA, too. I understand it was available elsewhere (FirstInning, maybe?), but it’ll be nice to have great hitting and pitching metrics at the same site.

So, a somewhat political question: How much do you guys believe in wOBA? Is Tango’s formula, right now, the best “one-stop” offensive number that we have?

by Teej on Aug 14, 2008 12:37 AM PDT

I think so yes

and you were right about FirstInning having wOBA. I always forget they have MLB stats as well, I spend so much time there looking at minor league numbers.

by Matthew on Aug 14, 2008 12:39 AM PDT

Right on.

Well, good luck with the site. We’re spoiled.

by Teej on Aug 14, 2008 12:47 AM PDT

I'm excited for StatCorner 1.0

This thing is still in beta, remember?

Determined, Jonesing Commentor

by Corco on Aug 14, 2008 12:40 AM PDT

Oh yeah, whoops.

But c’mon, you know 2.0 is going to have some amazing new features.

by Wilder. on Aug 14, 2008 12:42 AM PDT

This is really cool.

Good job guys!

JI/Robert '08!

by Fin on Aug 14, 2008 12:54 AM PDT

Another question:

As far as the batted-ball data, you guys use MLB’s stuff and not Fangraphs’ data, right?

Where does Fangraphs get its data? BIS?

by Teej on Aug 14, 2008 12:56 AM PDT

Okay, so this is awesome

I’ve just started reading the book, so appreciation is at an all-time personal high.

The site’s great, guys. Functionally fantastic. I love being able to compare tRA+ season-on-season (GS52 down from 160 to 103?!). Having all this stuff centralised is great. Oh, and being able to flip between RP and SP data is a godsend.

I might have missed this in one of the writeups, but RA stats only seem to appear for 2007 data, no other years. Presumably that’s intentional?

Keep up the fab work, looking forward to continued development.

The english Mariner who isn't EnglishMariner --- P3 W1 L2 (.333)
Sponsor of Jamie Burke's baseball-reference page

by MarkE on Aug 14, 2008 1:08 AM PDT

SB 2.0wned

can’t control the italic function

The english Mariner who isn't EnglishMariner --- P3 W1 L2 (.333)
Sponsor of Jamie Burke's baseball-reference page

by MarkE on Aug 14, 2008 1:09 AM PDT

Congratulations Matthew.

I sent the site to my uncle. He’s a statistician at UWisc and has been writing papers for the Journal of Quantitative Analysis in Sports (about baseball). He should appreciate all you guys do.

...and now I'm here

by CapSea on Aug 14, 2008 2:07 AM PDT

The site is amazing!

One question – did you explain somewhere why your traditional stats appear to differ from what’s at other sites, like baseballreference.com?

For example, you have Ichiro’s Avg. for 2002 at .297, whereas B-R has it at .321. Is this one of the things that’s still in Beta mode?

by Nadingo on Aug 14, 2008 7:01 AM PDT

Hah.

I kind of wish we’d noticed this earlier… that needs fixing.

by Graham MacAree on Aug 14, 2008 9:04 AM PDT

some work still to do

Hey I love the site (in fact it’s already made my list of bookmarked baseball sites along with LL, USSM, and Fangraphs), but there are some issues.

For instance, Ichiro’s stats are wrong. His traditional stats think he hit below .300 his first two years in the league, which clearly didn’t happen. Might want to check that sort of thing.

Otherwise, great work and great site, I will be a consistent visitor.

Celebrated my 20th birthday on 02/05/2006. Not a good day.

by abender20 on Aug 14, 2008 7:11 AM PDT

This is tits.

I live in georegia but i dont see rusia no where not even sound but they says theres tanks should i be worrie-Yahoo Answers

by Phil Hatzenbuehler on Aug 14, 2008 7:38 AM PDT

phenomenal

Do you have an ETA on regressed tRA (seems like all of the related studies would be a first in the sabermetric community)? Hoping you take into account stuff like GB pitchers allowing fewer hits on GBs than FB pitchers.

Any chance of a tRA number on the ERA scale for easy comparison?

Any chance of a tRA for hitters (like PrOPS) that’s properly regressed? (Obviously different regression for hitters than for pitchers.)

my blog // calling BJ Upton lazy is lazy // past results do not guarantee future performance

by Sky Kalkman on Aug 14, 2008 7:59 AM PDT

I believe tRA is meant to be read like ERA

5+ = ugh
4-5 = meh
3-4 = not bad
lower 2 = WOOOOOOOOOOOOOOO!

by BrianL on Aug 14, 2008 8:04 AM PDT

league-average tRA appears to be about a quarter run higher than league-average ERA

because, well, it’s tRA, not tERA. And that makes sense, because the whole earned run/unearned run thing DOESN’T make a lot of sense.

the mental adjustment isn’t difficult, but, you know, most people aren’t going to do it.

my blog // calling BJ Upton lazy is lazy // past results do not guarantee future performance

by Sky Kalkman on Aug 14, 2008 8:12 AM PDT

Yeah, it's on a R/9 scale.

We’re having philosophical difficulty with putting it on an ERA scale, but it’s something we’re thinking about.

by Graham MacAree on Aug 14, 2008 9:13 AM PDT
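
To make the scale question concrete, here is a minimal sketch of one possible rescaling, assuming you simply multiply an R/9 figure by the ratio of league ERA to league RA. Nothing here is StatCorner's method; the function name and the league averages are hypothetical, picked only so that the gap is about a quarter run.

```python
def to_era_scale(tra: float, league_era: float, league_ra: float) -> float:
    """Rescale an R/9 figure so that league average lines up with league ERA."""
    if league_ra <= 0:
        raise ValueError("league_ra must be positive")
    return tra * league_era / league_ra


# Hypothetical league averages, about a quarter run apart.
LEAGUE_RA = 4.75
LEAGUE_ERA = 4.50

# A league-average tRA maps onto league-average ERA.
print(to_era_scale(4.75, LEAGUE_ERA, LEAGUE_RA))

# A better-than-average tRA shrinks by the same proportion.
print(round(to_era_scale(3.80, LEAGUE_ERA, LEAGUE_RA), 2))
```

A straight ratio keeps the ordering of pitchers intact, but it assumes every pitcher's unearned runs scale the same way, which is a strong assumption.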

Regressed tRA (tRA*) already exists, but it's such a pain to code that we haven't got around to putting it on the site.

It doesn’t look at the change in out% on things like FBs and GBs for different pitcher types, but that’s a good suggestion and we’ll take a look at it.

Maybe you’ll get tERA but I don’t really like the idea.

The problem with hitters is that they throw off run and out values when it’s the individual hitter over an entire year. Ichiro does better on GB, worse on FB, etc. So using a tRA type measurement may not be very sensible.

by Graham MacAree on Aug 14, 2008 9:16 AM PDT

can you point me to an explanation of tRA*? thanks

I don’t get your point about hitters being different from pitchers. I realize you don’t really want hitters’ tRA, you want to use the pieces of tRA to get tOPS or twOBA. But just like not all pitchers have the same run values for FBs (isn’t this something you would regress in tRA*?), neither do hitters. You just regress pitchers differently from hitters, because hitters have more of a spread in skill than pitchers in most classical DIPS things like BABIP, IFF/FB, HR/BIA.

my blog // calling BJ Upton lazy is lazy // past results do not guarantee future performance

by Sky Kalkman on Aug 14, 2008 2:03 PM PDT

Each hitter is going to have a ~unique run value for his GB, LD, and FB's

Let’s say that, over the next month, Ichiro and Albert Pujols each put up the following BIP distributions:

60% GB
20% FB
20% LD

What you are proposing would treat both players equally, even though it should be pretty obvious that their numbers in the end would be substantially different.

By facing a large sample of hitters, most of the noise gets teased out for pitchers, and the run values of their balls in play approach the average. It is not so for hitters.

by Jeff Sullivan on Aug 14, 2008 2:22 PM PDT
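
The point above can be sketched with a few lines of arithmetic. The 60/20/20 mix comes from the example; every run value below is invented purely for illustration, and the two hitter profiles are hypothetical stand-ins for a speedy ground-ball hitter and a power hitter rather than measured numbers for any real player.

```python
# Share of balls in play by type, taken from the example above.
BIP_MIX = {"GB": 0.60, "FB": 0.20, "LD": 0.20}

# Runs per ball in play by type. All values are made up for illustration.
LEAGUE_RUN_VALUES = {"GB": 0.05, "FB": 0.15, "LD": 0.40}
SPEEDY_GB_HITTER = {"GB": 0.09, "FB": 0.08, "LD": 0.38}
POWER_HITTER = {"GB": 0.03, "FB": 0.35, "LD": 0.45}


def runs_per_bip(mix, run_values):
    """Weight each batted-ball type's run value by its share of balls in play."""
    return sum(share * run_values[kind] for kind, share in mix.items())


# League-average run values give both hitters the same number...
same_for_both = runs_per_bip(BIP_MIX, LEAGUE_RUN_VALUES)

# ...while hitter-specific run values pull them apart.
speedy = runs_per_bip(BIP_MIX, SPEEDY_GB_HITTER)
power = runs_per_bip(BIP_MIX, POWER_HITTER)

print(f"league values: {same_for_both:.3f}")
print(f"speedy hitter: {speedy:.3f}, power hitter: {power:.3f}")
```

With identical distributions, any gap between the two hitters has to come from the per-type run values, which is exactly what a league-average table hides.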

Bookmarked.

Looking forward to the development of this site.

However, there are layout issues that I think could be improved going forward. And not to nitpick, but why, under the Brewers team entry, is CC Sabathia listed twice? (Once with and once – properly – without the periods after his initials.)

"Maybe Chief has to go and grab somebody from his neck and throw him into the wall."

by esoteric on Aug 14, 2008 8:03 AM PDT

because he appears twice on MLB's XML rosters.

I’m trying to stop that from happening, but if you could tell MLB to stop listing him twice, that would help :)

by Matthew on Aug 14, 2008 8:31 AM PDT
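
One way to guard against this kind of duplicate, sketched under the assumption that the two entries differ only in punctuation and spacing: normalize the name, then keep one entry per team and normalized name. The record layout and function names are hypothetical and say nothing about what MLB's XML actually looks like; if the feed carries a stable player id, keying on that would be more robust.

```python
import re


def name_key(name: str) -> str:
    """Drop periods, collapse whitespace, and lowercase the name."""
    no_periods = name.replace(".", "")
    return re.sub(r"\s+", " ", no_periods).strip().lower()


def dedupe_roster(entries):
    """Keep one entry per (team, normalized name), preferring a spelling without periods."""
    chosen = {}
    for entry in entries:
        key = (entry["team"], name_key(entry["name"]))
        current = chosen.get(key)
        # Keep the first spelling seen, unless a later one has no periods.
        if current is None or ("." in current["name"] and "." not in entry["name"]):
            chosen[key] = entry
    return list(chosen.values())


roster = [
    {"team": "MIL", "name": "C.C. Sabathia"},
    {"team": "MIL", "name": "CC Sabathia"},
    {"team": "MIL", "name": "Ben Sheets"},
]
print([entry["name"] for entry in dedupe_roster(roster)])
```

Because the team is part of the key, a player who changed teams mid-season still appears once under each club.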

As much as you've worked on this, I didn't expect you to roll out the live beta so soon

Nicely done, men.

Let me know if there’s anything I could possibly help with w/r/t compiling numbers for the minors.

by Gomez on Aug 14, 2008 8:49 AM PDT

This site blows!

And by blows, I mean totally rocks my socks.

You guys rule.

"All I’ve ever done is be Juan Pierre when I wear this jersey. They’re sticking it to me this year for whatever reason. "

.280/.324/.316

~Juan Pierre

by Goose on Aug 14, 2008 8:54 AM PDT

I will visit often, and will inform my friends of this resource!

I’ve had a couple of questions on some things for a while, and this seems like a decent place to ask them.

1) Recently, Jeff was lamenting the need for human coding of batted ball types; this is something HITf/x is going to fix, but for now, LD% is a huge determinant of any number of stats – from PrOPS to, obviously, tRA. So the question is, how do we know that this coding is being done well? I thought of this when scanning through the SF Giants rotation on THT. Each pitcher saw a distinct jump in his LD% in 2008 over 2007, even as their RA or FIP dropped (Matt Cain is waaay out of line with his career average, but is having fairly normal results. Lincecum’s given up way more LDs, but lots fewer runs). This can easily be explained by random chance; it’s not like Lincecum’s giving up 50% LDs and an RA of 2.9. But it got me wondering about ‘park’ effects for LDs, which is really more like saying the effects of home scorers at each park.
I checked out the piece that David Gassko wrote on park effects for batted balls, which is great stuff, but it didn’t mention LDs. Maybe it’s in there but wasn’t mentioned in the article. If that’s the case, what’s the YTY correlation for LD rate? Do you see any large (define that how you will) swings in LD% by park from year to year? More basically, do you think that the ambiguity around LD versus FB versus ‘fliner’ hurts tRA in any way? Have you had a chance to compare tRA in 2007 to tRA (so far) in 2008? More or less stable than FIP?

2) Might there be a way to rank parks based on batted ball types? Gassko had those mini ‘leaderboards’ in the THT article; it might be nice just to have a sortable table where you could rank each park on IF or LD or whatnot.

by marc w on Aug 14, 2008 9:09 AM PDT

leaderboards on hopefully everything will roll out in phase 2

It’s on the agenda, big time. After fixing the stat numbers to be 100% perfect, it’s pretty much all I’ll be working on.

by Matthew on Aug 14, 2008 10:26 AM PDT

Congrats

The site works very well.

by G_ on Aug 14, 2008 9:09 AM PDT

Awesome site

Though I noticed that Tex is still listed as a Brave and not an Angel. I’m guessing it’s because you set the rosters before all the trades? If so, you might want to double check those.

the other angels fan

by Eyebrows on Aug 14, 2008 9:12 AM PDT

It's sorted by team name right now unfortunately

and the players appear on every team they’ve played in that season for now.

Thanks for the catch on Tex though, helped me spot an error in the player loading algorithm.

by Matthew on Aug 14, 2008 10:32 AM PDT

Heh

http://www.statcorner.com/pitcherRP.php?id=406550

The ♥ Jose Lopez ♥ Watch - 147 H, 16 BB, 42 G Left

by seattlebruin on Aug 14, 2008 9:18 AM PDT

But you left out his Awesome Factor!

He has an AF of like 672.4 !

the other angels fan

by Eyebrows on Aug 14, 2008 9:20 AM PDT

But pitcher losses!

The ♥ Jose Lopez ♥ Watch - 147 H, 16 BB, 42 G Left

by seattlebruin on Aug 14, 2008 9:21 AM PDT

I don't know why everyone is criticizing Batista

per tRA he’s been unlucky this season!

The ♥ Jose Lopez ♥ Watch - 147 H, 16 BB, 42 G Left

by seattlebruin on Aug 14, 2008 9:24 AM PDT

Maybe tRA is kinda like BABIP for hitters.

It only applies if the pitcher really is major league quality. Batista might be completely done as a pitcher so advanced metrics don’t apply.

I live in georegia but i dont see rusia no where not even sound but they says theres tanks should i be worrie-Yahoo Answers

by Phil Hatzenbuehler on Aug 14, 2008 9:38 AM PDT

Ah.

I’m saying this to myself: WHOOSH!

I live in georegia but i dont see rusia no where not even sound but they says theres tanks should i be worrie-Yahoo Answers

by Phil Hatzenbuehler on Aug 14, 2008 10:23 AM PDT

Awesome job guys.

I was wondering if there is a place (or a place coming) where you could compare the team averages.

I’m just curious to see how certain teams that say they are interested in things like OBP, etc are doing at achieving that. And if teams that are more interested in “grit” are good because they lucked into guys who are good in those categories.

by GhettoBear04 on Aug 14, 2008 9:27 AM PDT

Thanks so much for doing this you guys!

Do you have any desire to jump into the Defensive Stats game?

by Sec 108 on Aug 14, 2008 9:48 AM PDT

Very cool idea.

Would the basic application of that be showing how the defense is affecting the difference between actual results and expected results?

I wish I had more free time because this stuff looks way too fun. Keep it up.

by Sec 108 on Aug 14, 2008 9:56 AM PDT

definitely!

tRA compared to actual RA would contain a bunch of noise, but if you compared outs and expected outs, you’d have something. also expected SLG_BIP and actual SLG_BIP. good idea.

my blog // calling BJ Upton lazy is lazy // past results do not guarantee future performance

by Sky Kalkman on Aug 14, 2008 2:08 PM PDT
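
The outs-versus-expected-outs idea can be sketched as follows. The league out rates per batted-ball type are invented placeholders, not real figures, and the function names are hypothetical; the sketch only shows the shape of the comparison, not a finished defensive metric.

```python
# Hypothetical league-wide probability that each batted-ball type becomes an out.
EXPECTED_OUT_RATE = {"GB": 0.74, "FB": 0.78, "LD": 0.28}


def expected_outs(bip_counts):
    """Outs an average defense would be expected to record on these balls in play."""
    return sum(count * EXPECTED_OUT_RATE[kind] for kind, count in bip_counts.items())


def outs_above_expected(bip_counts, actual_outs):
    """Positive means the defense turned more balls in play into outs than expected."""
    return actual_outs - expected_outs(bip_counts)


# Made-up team sample: 250 balls in play, 172 of them converted into outs.
team_bip = {"GB": 100, "FB": 100, "LD": 50}
print(round(outs_above_expected(team_bip, actual_outs=172), 1))
```

Comparing outs rather than runs sidesteps some of the noise in RA that the comment mentions, since sequencing does not enter into it.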

What, no wOBA numbers for pitchers?

They hit too.

I like using semi-colons; they make me feel smart.

by Llewdor on Aug 14, 2008 10:08 AM PDT

Huzzah!

The A's colors are green and gold.

by mikeA on Aug 14, 2008 10:20 AM PDT

i agree -- awesome glossary

my blog // calling BJ Upton lazy is lazy // past results do not guarantee future performance

by Sky Kalkman on Aug 14, 2008 2:09 PM PDT

I've e-mailed this site to some friends

and already plan on having it replace baseballreference as my go-to site for stats. Right on, guys.

by katal on Aug 14, 2008 10:28 AM PDT

not that i disagree, but i think B-Ref is underrated if just for their BtWins stat

my blog // calling BJ Upton lazy is lazy // past results do not guarantee future performance

by Sky Kalkman on Aug 14, 2008 2:10 PM PDT

Great stuff. Just a few suggestions

Have outcomes only go to 2 decimal places.
Have BABIP only go to 3 decimal places.
On the hitters/batters page an option to sort differently would be nice. (IP, etc.)
Sortable stats for the league. Who has the best tRA? Even a leaderboard would be nice.

Thanks guys.

by Edgar for Pres on Aug 14, 2008 11:22 AM PDT
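
The decimal-place suggestions amount to a display-layer change. A minimal sketch, assuming the stored values keep full precision and only the rendered text is rounded; the function name and sample numbers are hypothetical.

```python
def fmt_stat(value: float, places: int) -> str:
    """Render a stat with a fixed number of decimal places for display."""
    return f"{value:.{places}f}"


print(fmt_stat(0.29871234, 3))  # a BABIP-style rate, three places
print(fmt_stat(17.456789, 2))   # an outcome percentage, two places
```

Rounding only at render time keeps the underlying numbers intact for sorting and for any later calculations.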

Good call Corco.

"All I’ve ever done is be Juan Pierre when I wear this jersey. They’re sticking it to me this year for whatever reason. "

.280/.324/.316

~Juan Pierre

by Goose on Aug 14, 2008 7:04 PM PDT

Nice piece of work!

I’ve been catching references to the work you’ve been putting into this all season, I thought it was an in joke or something. You should charge money!! I’m going to be geeking out on this until Christmas.

by dpseadv on Aug 14, 2008 11:31 AM PDT

Grammar error

on the Authors page. “Currently residing in Seattle, he can reached via e-mail.”

Should read: “Currently residing in Seattle, he can be reached via e-mail.”

Sorry, I’ve been fixing string errors at work for a couple days now.

If it wasn't for college football I'd probably have given up on sports.

by bluemax on Aug 14, 2008 12:52 PM PDT

No need to apologize.

Because so much of it is written inside the code, it makes spell and grammar checking impossible except by eyeballing it. Please let me know any such errors.

by Matthew on Aug 14, 2008 12:57 PM PDT

Hardcoded strings

tsk tsk. Although maybe for websites the rules are different.

If it wasn't for college football I'd probably have given up on sports.

by bluemax on Aug 14, 2008 1:25 PM PDT
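
For anyone wondering what the alternative to hardcoded strings looks like, here is a minimal sketch: user-facing text lives in one lookup table, and a helper dumps it as plain text that a spell checker or a proofreader can read. The keys, the second string, and the function names are all hypothetical, and the site itself may be built quite differently.

```python
# Hypothetical message catalog: all user-facing text in one place.
STRINGS = {
    "author.contact": "Currently residing in Seattle, he can be reached via e-mail.",
    "home.summary": "Daily updated tRA and wOBA for hitters and pitchers.",
}


def text(key: str) -> str:
    """Look up a display string by key instead of typing it inline in page code."""
    return STRINGS[key]


def dump_for_proofreading(strings) -> str:
    """One 'key<TAB>text' line per string, ready to feed to a spell checker."""
    return "\n".join(f"{key}\t{value}" for key, value in sorted(strings.items()))


print(text("author.contact"))
print(dump_for_proofreading(STRINGS))
```

The page code then refers to keys only, so fixing a typo means editing one entry rather than hunting through templates.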

Shouldn't be.

Nice Guys Finish Third - Hopelessly lost, but makin' good time.

by pdb on Aug 14, 2008 1:28 PM PDT

I always type out my text before coding it

though I should spell check it before doing so….

by Gomez on Aug 14, 2008 2:01 PM PDT

Awesome liftoff guys

just one little reaction…
So much green… is that really necessary? Any chance a graphic designer will be employed?

But definitely keep it to a single screen on the front page, and keep the same simple/effective vibe. That part rocks.

by johnbai on Aug 14, 2008 1:05 PM PDT

average vs. replacement question

If going from runs above average to runs above replacement, how many runs is each PA worth? I know Sean Smith’s shortcut is to assign two wins per 700 PAs. Anyone have a better number or good explanation how to figure it out?

my blog // calling BJ Upton lazy is lazy // past results do not guarantee future performance

by Sky Kalkman on Aug 14, 2008 2:20 PM PDT
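
As a rough worked version of the shortcut quoted above: two wins per 700 PA, at an assumed 10 runs per win (a common rule of thumb, not something stated in the thread), comes to 20 runs per 700 PA, or just under 0.03 runs per plate appearance. The function names below are hypothetical.

```python
RUNS_PER_WIN = 10.0                 # rule-of-thumb assumption, not from the thread
REPLACEMENT_WINS_PER_700_PA = 2.0   # the shortcut quoted in the question


def replacement_runs(pa: float) -> float:
    """Runs separating average from replacement level over the given plate appearances."""
    return REPLACEMENT_WINS_PER_700_PA * RUNS_PER_WIN * pa / 700.0


def runs_above_replacement(runs_above_average: float, pa: float) -> float:
    """Shift a runs-above-average figure onto a replacement-level baseline."""
    return runs_above_average + replacement_runs(pa)


print(round(replacement_runs(1), 4))      # runs per single PA
print(runs_above_replacement(5.0, 350))   # a +5 hitter over half a season
```

Changing either constant rescales the answer linearly, so the result is only as good as those two assumptions.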

league adjustment

Am I right that there’s no adjustment for the difference in league-quality between the AL and NL? (Not that any other stat sites do this.)

my blog // calling BJ Upton lazy is lazy // past results do not guarantee future performance

by Sky Kalkman on Aug 14, 2008 2:20 PM PDT

given the half-win difference that most people use for converting between NL and AL...

… how many points of wOBA would this be?

my blog // calling BJ Upton lazy is lazy // past results do not guarantee future performance

by Sky Kalkman on Aug 14, 2008 2:21 PM PDT
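
A back-of-the-envelope answer, under three stated assumptions: the half win applies to a full-time hitter over 700 PA, a win is worth about 10 runs, and wOBA converts to runs through the 1.15 divisor commonly used with Tango's formula. None of these figures come from the thread, and the function name is hypothetical.

```python
WOBA_SCALE = 1.15     # divisor commonly used to turn a wOBA gap into runs per PA
RUNS_PER_WIN = 10.0   # rule-of-thumb assumption
SEASON_PA = 700       # assumed playing time for a full-time hitter


def wins_to_woba(wins: float, pa: float = SEASON_PA) -> float:
    """wOBA gap that is worth the given number of wins over the given plate appearances."""
    runs = wins * RUNS_PER_WIN
    return runs * WOBA_SCALE / pa


delta = wins_to_woba(0.5)
print(f"{delta:.4f}")
```

Under those assumptions the half-win gap works out to roughly eight points of wOBA; a different runs-per-win or playing-time figure moves it proportionally.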

Fantastic, gents.

My most hearty congratulations.

J.K.L.

by Aaron Campeau on Aug 14, 2008 7:40 PM PDT
