So what the heck do you do when you have a very good conceptual argument for causation, but the correlation is nil?
A case study: 2008-09 O.N. Thugs, UPL Basketball.
If you think about the stats that we use (PTS, 3PM, FG%, FT%, AST, BLK, STL, REB, OREB, A/TO), you'd probably guess that 3PM and PTS would be correlated (though not necessarily in a super high way), and you'd figure that the accuracy stats would be correlated with PTS, and also that AST would be correlated with A/TO. But two categories should stand out: OREB and REB. Because each OREB is also a REB. So, if a player is a good offensive rebounder, then he'll likely be a good rebounder. in fact, it would be somewhat shocking if a team was very good at OREB, but bad at REB. Yet somehow, halfway through the season, I was something like 2nd in offensive rebounding (and had the highest rate by far), but was 9th in overall rebounding (with a rate that was about 7th). Eventually, things caught up a bit, and I finished 1st in OREB, and 4th in REB, but even that was a bit strange. So maybe it's reasonable to expect things to even out over time.
The current problem: 2009 O.N. Thugs, UPL Baseball.
If you think about the stats in baseball (R, HR, RBI, SB, OBP, SLG, W, L, SV, K, ERA, and WHIP), the ones that you'd figure to be the most correlated are HR with RBI and HR with R, since each HR you hit guarantees 1 R and 1 RBI. And you expect a little lesser correlation between HR and SLG. But take a look at this:
Team | R | HR | RBI | SB | OBP | SLG | RBI/HR | R/HR |
'90 Reds | 249 | 55 | 229 | 30 | 0.388 | 0.506 | 4.164 | 4.527 |
O.N. Thugs | 256 | 47 | 253 | 43 | 0.397 | 0.471 | 5.383 | 5.447 |
IamJabrone | 248 | 78 | 238 | 41 | 0.358 | 0.495 | 3.051 | 3.179 |
Westy's Sluggers | 247 | 74 | 259 | 37 | 0.389 | 0.522 | 3.500 | 3.338 |
Black Sox | 231 | 66 | 249 | 26 | 0.361 | 0.476 | 3.773 | 3.500 |
Cheeseheads | 230 | 61 | 223 | 43 | 0.346 | 0.458 | 3.656 | 3.770 |
Muddy Mush Heads | 239 | 51 | 195 | 54 | 0.360 | 0.444 | 3.824 | 4.686 |
IStillSuckCurveballs | 234 | 58 | 244 | 14 | 0.382 | 0.489 | 4.207 | 4.034 |
Phatsnapper | 189 | 54 | 219 | 23 | 0.346 | 0.456 | 4.056 | 3.500 |
TheJimmyDixLongballs | 202 | 54 | 226 | 40 | 0.334 | 0.450 | 4.185 | 3.741 |
Benver Droncos | 244 | 63 | 238 | 29 | 0.338 | 0.460 | 3.778 | 3.873 |
Hats for Bats | 224 | 51 | 212 | 40 | 0.338 | 0.424 | 4.157 | 4.392 |
Somehow, I have managed to lead the league in R, and am a close 2nd in RBI. But I am dead last in HR. Compared to the Jabrones, I have 31 fewer HR, which means that I've managed to score 8 more R, despite giving away 31 R from my lack of HR. And historically, R:HR and RBI:HR ratios come in around 4 (just a quick glance suggests that 3.5 to 4.5 are reasonable values to expect. Note, the R:HR can be a lot further off, given the relatively less rare case of guys who score 90+ runs on only 10 or so HR (whereas someone like Adam Dunn, whose 100 RBI on 40 HR is about as low as you'd probably get). But overall, as you look at how UPL teams are put together, you see some stability in these ratios. And then you have the '09 O.N. Thugs, who are at about 5.4 to 1 for both R and RBI.
So what does this mean? I have no clue. Moving forward, you can either make the case that a) I'm due for a bunch of HR since my team is good, but just underachieving right now in power, or b) my team sucks and has been overachieving with everthing other than HR. I think that a) is more likely than b), although I'm definitely biased on this one.
I don't really know what to make of this right now, and will think about this more, but I definitely have had some interesting questions open up regarding how teams should be constructed, if you take a statistical look at the way the UPL is structured. Taking a look at some of these insights is interesting. For example, I'd bet that if you were to take a poll that asked fantasy players which of the 6 offensive categories would be the most useful in predicting fantasy success, you'd probably get HR as the overwhelming answer, with SB being the worst. However, my first look at the numbers suggests that if you were to use only one criteria in evaluating offense, you should look at RBI over anything else (though this is very preliminary, and restricted to historical UPL numbers).
-Chairman (aka O.N. Thugs)