HS vs College pitchers
As you all know - we need pitching, and quality pitching is expensive to buy. The ability to draft quality pitching can make or break teams that can't afford to (or just will not, out of principle) buy the highest quality free agents. I realize that this is probably not the best time to be discussing the draft, but we constantly come back to the `high risk/high reward' vs `safe pick' discussions, whether it is in reference to signing free agents, drafting amateurs, or trade proposals. And with the recent front pages summarizing the Cardinals farm system (courtesy of Red Baron), I decided that I really wanted to know more about the various draft strategies, in particular as those strategies apply to pitching. So I started with one main question: how do high school draftees fare as compared to college draftees? I actually asked the community this question a long time ago buried deeply in some other thread, but received no answers. I haven't found an answer online anywhere (though I may just be looking in the wrong places), so decided to try to answer it myself.
To try to answer this question, I downloaded draft datasheets on all players drafted as pitchers between the years 1999 and 2006 (data from www.baseball-reference.com ). I chose the years based on the assumption that players drafted in 1999, if they are going to get major league experience at all, should have done so by the 2007 season. There may be a few stragglers that are called up after 9 or more years in the minors, but these should be relatively few. This datasheet contains the draft positions for all players, along with the school they were drafted from, and some career stats, including ERA, WHIP, and OPS. Rather than search for stats on all 6202 pitchers drafted in that time, I used the stats provided.
I used the WHIP statistic first as a surrogate for MLB experience. If they have a WHIP, they have major league experience. I also was able to use the school that players were drafted from to determine high school (HS) vs college (Col) draftees. So let's look first at the percentage of draftees that make it to the majors, comparing HS to Col draftees.
Year HS(%) Col(%)
1999 8.2 14.3
2000 10.5 11.1
2001 6.5 9.5
2002 6.5 10.4
2003 3.2 7.5
2004 1.7 4.4
2005 0 2.3
2006 0 0.9
This confirms what many here have known to be true: HS pitchers take longer to reach the major leagues and have a lower likelihood of making it to the majors at all. Further, only 11.8% of pitchers drafted in 1999 have ever pitched at the major league level. I should note here that a few players drafted as pitchers have converted to position players (such as Brad Hawpe of the Rockies), and therefore probably should not be considered as `failures'.
However, making it to the majors is one thing - succeeding at that level is something quite different. And this is really the question that I had - it is often implied that HS draftees have a higher `upside'. So the question is this: do HS draftees that make it actually perform better than Col draftees? To look at success, I used WHIP (may not be ideal, but I wasn't going to try to find more appropriate stats for 6000+ pitchers - and it should be noted that we are dealing with the league as a whole, so team defense or park effects should be relatively standardized, making defense-dependent stats more trustworthy). So how does the average (mean) Col pitcher fare against HS (note: using the median gave similar results)?
Year HS Col
1999 1.538 1.681
2000 1.571 1.688
2001 1.696 1.687
2002 1.730 1.594
2003 1.523 1.497
2004 1.443 1.482
2005 - - 1.589
2006 - - 1.224
What does this tell us? First of all, college pitchers that make it to the majors very quickly are good (low WHIP). Second, good HS pitchers take longer, but after about three years, the good ones start to make it, but they don't perform as well as the first year college pitchers (comparing 2004 HS to 2006 Col). Third, WHIPs for college pitchers generally rise the longer we get from the draft year. I interpret this to mean that the minor league journeymen are slowly being called up, and that the reason that they were in the minor leagues for seven years is that they aren't very good. On the contrary, the HS draftees show a slight decreased for the 1999 and 2000 draft years. Maybe this reflects the `high upside' that we always reference. Those HS draftees can take a long time, but the potential to figure it out and succeed in the bigs doesn't go away. So if you are looking for pitchers quickly (1-2 year - 2005,6 data), it might be advisable to draft a quality college pitcher (such as everyone's favorite trade target, Tim Lincecum or Ian Kennedy or Joba Chamberlain - all 2006 college draftees). Even after six years, the difference in WHIP between Col and HS draftees (years 2001-2004) is minimal. The upside, on average, doesn't appear until years 7-8 (1999 and 2000 drafts).
To come back to the original question, do HS pitchers outperform Col pitchers? I think that to answer that, one has to look at both the rate that the draftee makes it to the majors (MLB%) AND the stats (WHIP, in this case). Let us look just at the years 1999 and 2000 - draft years in which most of the draftees will have pitched in the majors if they ever will.
Draft MLB(%) WHIP
HS 9.15 1.556
Col 12.62 1.684
So if you are drafting 8 years into the future, a HS pitcher is less likely to ever make it to the majors than a college pitcher, but those that do tend to be slightly better pitchers. This does seem to confirm the `high risk: high reward' approach does what it purports to do: a typical HS draftee is less likely to ever see the majors, but more likely to succeed if they do make it. But the high reward only comes to the very patient - 7-8 years after the draft year.
With that being said - should the cardinals, with a relatively low quality farm system, be drafting HS or Col pitchers? Probably the answer lies somewhere in between. Col pitcher can fill out the ranks faster than HS, but over the long term, HS can be slightly better on average. Is the slight difference in WHIP worth the decreased likelihood of making it to the big leagues? I don't know; makes for a nice debate topic though.
0 recs |
38 comments
Comments
Round taken
by indakind on Dec 11, 2007 1:19 PM EST reply actions 0 recs
round effect
Draft A B
HS 1.503 1.672
Col 1.546 1.656
So there is a slight trend there for group A HS draftees being slightly better than Col, but ever so sligth.
by cdb on Dec 11, 2007 2:30 PM EST up reply actions 0 recs
thanks for the answer
by indakind on Dec 11, 2007 3:14 PM EST up reply actions 0 recs
right....
my guess is the answer is usually a resounding "yes," but i'm willing to be shown wrong.
by kindred on Dec 11, 2007 3:31 PM EST up reply actions 0 recs
That's a diary.
I wonder how the numbers are affected by where they are drafted. I know there would be no causation but there might be a correlation.
Maybe like in poker. Sometimes to call to draw to the flush and sometimes you don't. The odds will tell you. Same with pitchers. maybe a pitcher with a scouted score of 72 (or whatever a good scouted score is) factored with chance of success will tell you if you are getting value.
I personally don't like High School pitchers. To much chance of injury. let them get hurt at Cal State Fullerton is what I say.
by Harknights on Dec 11, 2007 1:21 PM EST reply actions 0 recs
Good Diary...
i'd also like to normalize for IP in the majors, but that probably isn't possible without tons of data work. still, i think it is relevant to parse whether these guys are getting September call-ups for the Royals and just throwing a handful of innings, or whether they're pitching full seasons for contending teams. since WHIP is the measure of quality, presumably a few dudes that come up and throws 5 innings of 0.85 WHIP in the majors could skew the numbers. the same could happen at the top of the distribution.
i also would like to see some sort of breakdown of standard deviations. the mean values only tell us so much. high school pitchers could still be more high risk/reward types even if the mean values are exactly the same as those of the college pitchers. the standard deviations could clue us in on that.
but it's still a great diary.
by kindred on Dec 11, 2007 1:56 PM EST reply actions 0 recs
variability
I can provide some estimate of variation though. Draft rounds one to three are A, rounds 4 and up are B. These are the ranges in WHIP (all pitchers that made it to majors are included):
HS-A 1.18 - 1.90 (42 samples)
Col-A 0.75 - 3.32 (106 samples)
HS-B 0.99 - 4.00 (65 samples)
Col-B 0.63 - 4.20 (187 samples)
Or as mean +- Standard deviation
HS-A 1.50 +- 0.2
Col-A 1.55 +- 0.37
HS-B 1.67 +- 0.55
Col-B 1.66 +/- 0.56
So more variation for Col than HS in early rounds, no difference in late rounds.
by cdb on Dec 11, 2007 3:19 PM EST up reply actions 0 recs
nice work...
by kindred on Dec 11, 2007 3:34 PM EST up reply actions 0 recs
Nice work
Players taken in the June draft coming out of college (or with at least two years of college) perform dramatically better than players drafted out of high school.
Since pitchers tend to peak later than hitters—in their late twenties—it makes a ton of sense to draft pitchers who'll be peaking in their cost-controlled seasons, and who you've shown are more likely to contribute in the show.
That being said, I think it's a good idea to draft a few more HS pitchers. With the new CBA, you have 5 years to develop them before they need protection—having some talented arms that can be broken in as relief pitchers before converting to starters (like Wainwright) is a solid way to run a system.
Once things settle down a bit, I'll look into this too. It'd be a fun off-season project to put together a community-wide collaborative draft strategy report.
by liam on Dec 11, 2007 2:39 PM EST reply actions 0 recs
i agree with the last point...
there are some sports economics journals out there now, i believe; if we do it right, there might even be a chance of publication.
by kindred on Dec 11, 2007 3:35 PM EST up reply actions 0 recs
The Answer
Outstanding diary, btw.
by redbirdnation8206 on Dec 11, 2007 4:18 PM EST reply actions 0 recs
One important thing to note here
If HS pitchers, generally, perform better -- they should. They're the better pitchers, by and large.
If college pitchers, generally, perform better (or even as well) -- doesn't that speak to the fact that pitching is so difficult to predict? You'd be taking lesser talent and having it produce as well or better.
So, to your question, which should we draft? The HS pitchers should be the more talented b/c the best of them never make it to college. College pitchers are probably less risky and closer to the majors but rarely have the upside of the HS pitchers. The major infusions of talent come from the HS ranks.
by chuckb on Dec 11, 2007 4:29 PM EST reply actions 0 recs
additionally
I remember an interview with Warren Spahn a few years back when he was talking about the younger players today not throwing enough as kids to develop arm strength and the effects of it on arm strain injuries. College pitchers go through a much more rigorous conditioning program than most high school kids see.
I think this also proves that college pitchers, for the most part, are able to have longer careers and are less likely to get injured after going through a college strength and conditioning program. Obviously there are exceptions (Darren Dreifort please stand up), but generally I believe this is the case. Most college pitchers also are more polished, have better mechanics, and are cheaper to sign, so going after only "great" high school arms and paying the tax seems like a feasible draft policy to me. Regardless, both are usually going to hit the majors around 22 or 23 years old -- those that come up and work heavy loads earlier usually end up petering out because their body couldn't handle it or they had poor mechanics that never got straightened out (Doc Gooden, Steve Avery, Kerry Wood)
The other thing that hasn't been mentioned here is that a lot of college pitchers with average fastballs have to re-learn to use the inside part of the plate many times because of the aluminum college bats. Since jamming hitters with aluminum isn't nearly as effective as jamming hitters using wood, there are a few college pitchers who've had trouble making the adjustment. Mike Mussina would be one who ended up having a nice career, Ben McDonald would be an example of a guy who never made the adjustment despite having ++ stuff.
by fourstick on Dec 11, 2007 5:48 PM EST up reply actions 0 recs
Are they more polished?
As for your strategy of picking HS pitchers early and college pitchers later -- that seems reasonable to me. If you pick HS pitchers later, they're likely to go to college anyway and you may as well try and reap the rewards of the extra coaching that college pitchers have received.
by chuckb on Dec 11, 2007 6:16 PM EST up reply actions 0 recs
my point
My point is that with college kids you generally have a better idea of what you're getting and I think that college pitchers are more reliable, although they may have a higher upside. They also usually cost the club less money in the long run.
by fourstick on Dec 13, 2007 9:49 AM EST up reply actions 0 recs
Flashback to 1997
So, just for fun on a lousy day I decided to take a closer, although admittedly unscientific look, at the draft class of 1997. I chose 1997 solely because it was ten years ago and it seems like anybody who is going to make a mark would have done so by age 28.
Interestingly enough there were sixty pitchers taken in round 1-3, including the supplemental round. The data are as follows:
Summary
HS College
Active 7 6
Done 7 13
DNP 16 11
Total 30 30
Active MLB
IP
Mean 471 706
Min 55 175
Max 1428 1714
Median 242 417
ERA +
Mean 99 101
Min 77 87
Max 118 127
Median 101 99
Retired MLB
IP
Mean 108 216
Min 11 2
Max 228 544
Median 84 188
ERA +
Mean 72 76
Min 63 30
Max 97 95
Median 72 81
From the summary information you can see that college pitchers (63%) were more likely than HS pitchers (47%) to at least get a cup of coffee at the big league level. FWIW, I counted JUCO players as college level. There were only three and only one made it to the show with insignificant results. I don't think it is too surprising to see that college pitchers are more likely to get to the big leagues.
It was interesting to me that a higher number of high schoolers (23%) than college pitchers (20%)are still active in the big leagues. Probably because they are younger than the college guys, but the college guys should only be 31 or 32 years old. Not really a significant difference anyway.
The active pitchers look pretty comparable to me with the college guys having a slight advantage - again probably due to the age difference. One could easily assume that the career accomplishments of each group will be pretty similar. So it comes down to if you like the college guys from this draft, most notably:
Jeff Weaver
Randy Wolf
Scott Linebrink
or the better HS guys:
John Garland
Aaron Cook
Jeremy Affeldt
Rick Ankiel
I have no idea how to deal with the Ankiel factor, but he still was a very good pitcher with a career ERA+ of 118 which would put him in the top 150 of all time if he could have done that over a career.
The only really significant difference in the two popluations is that the college guys who made it for a while and faded away seem to have made a better mark than the HSers. To me, the fact that college pitchers might offer you 188 innings at an ERA+ of 76 versus only 84 career innings at an ERA+ of 72 is statistically interesting, but should have no bearing on draft strategy.
I would like to take a look at all the current big leaguers with ERA+ over 100 and see where they came from. My guess is the answer will be Latin America, Asia, and high school. I will try to put that together in the coming weeks unless someone else wants to step up.
by giveml on Dec 11, 2007 6:29 PM EST reply actions 0 recs
more data
1995-2000
Round Level Mean St Dev Median MLB%
1_5 Col 1.67 0.48 1.55 45.42
1_5 HS 1.64 0.51 1.50 39.24
6_10 Col 1.63 0.40 1.54 21.68
6_10 HS 1.69 0.55 1.57 17.39
11_15 Col 1.69 0.46 1.61 14.70
11_15 HS 1.65 0.54 1.57 13.21
16_20 Col 1.60 0.31 1.52 11.80
16_20 HS 1.56 0.33 1.48 12.12
21_25 Col 1.67 0.46 1.56 9.03
21_25 HS 1.69 0.27 1.70 5.41
25+ Col 1.63 0.47 1.52 5.36
25+ HS 1.61 0.31 1.54 3.76
The differences, with this larger sample size, become even less apparent (by this measure) between HS and Col. There may be a better way of gauging 'upside' or talent than I am using, but this measure is not detecting any upside for HS draftees.
Sorry - I was going to make this a diary edit, but I couldn't get the formatting correct - it was a mess.
by cdb on Dec 11, 2007 7:24 PM EST reply actions 0 recs
So Where Do All the Good Pitchers Come From?
So I started trolling around for data as I am not exactly a sophisticate in this type of analysis. Thankfully I didn't have to go too far to find a comprehensive list of the top pitchers as ranked by ERA+. This may or may not be the ultimate determinant of a pitchers value, but I was pretty sure it was a valid enough measure for trying to rank pitchers. As an ERA+ of 100 represents major league average, I decided to limit my study to pitchers with an ERA+ of 110 and above. Of all the pitchers active through 2007 with at least 1000 IP, there are 38 with an ERA+ on the good side of 110.
These 38 pitchers come from one of three places: they are amateur free agents (AFAs) signed outside the U.S. or they are either high school or college pitchers draft in the MLB amateur draft. I don't know the total number of prospects signed each year from each source, but of the top 38 pitchers the breakdown is as follows:
* 8 AFAs (21%)
* 9 College picks (24%)
* 21 High School picks (55%)
So much for my earlier post that, "I don't really agree with the argument that the best pitchers sign out of high school." Either I was wrong or the minor leagues are a much better place to develop than college. I don't think the disparity could be that great, so it is a pretty safe bet I was wrong.
Some argument could be made that while the HS picks have been more successful some of the very best pitchers have come from both the AFA signees and the college picks. After all, Pedro Martinez, Johan Santana, and Carlos Zambrano are three of the top eight. Three college players, Roger Clemens, Brandon Webb, and Randy Johnson are also in the top eight. That leaves only Roy Oswalt and Greg Maddux to represent the high schoolers. On top of that the average ERA+ for the AFA group is 124, the college group is at 125, and the high school group trails at 119. A better way to look at is who would you want to be in your starting rotation. I would love to have any of these five during their peaks. Ranked by ERA+ each group would be represented as follows:
AFA College HS
Pedro Martinez Brandon Webb Roy Oswalt
Johan Santana Roger Clemens Greg Maddux
Carlos Zambrano Randy Johnson Roy Halladay
Andy Pettitte!! Tim Hudson Mark Buehrle
Bartolo Colon Mike Mussina Derek Lowe
A couple of notes on the pitchers above: I don't know Andy Pettite's history, but my source showed him as an amateur free agent. Doesn't sound right, but as Joaquin would say, "youneverknow". Mark Buehrle doesn't have a college listed, but I thought for sure he went to Jefferson College for at least a year. Maybe he was a draft and follow.
One additional bit of interest, especially to Cardinal fans, is which teams actually identified these guys. I don't think many will be surprised that the Cardinals join the Devil (or not) Rays, Giants, Mariners, Phillies, Reds, Rockies, Royals, and Twins as the only MLB teams not to have signed or drafted a single one of these 38 pitchers. My guess is these are all teams that don't have overseas operations and draft mostly college pitchers. The winner in drafting the most of them is, of course, the Cubs with four. I think that shows the value of drafting high....The Astros, Blue Jays and Braves come in second with three apiece. Interestingly enough, the Astros were fortunate enough to have been the only team that signed two AFAs who made the list. Unfortunately for them, they have the distinction of not having seen a single MLB inning from either Johan Santana or Freddy Garcia.
Finally, it apparently isn't all that hard to identify these guys as 15 of the 30 drafted players on the list were taken in one of the first two rounds. Since eight were AFAs, only 15 players drafted after the second round have put up significantly above average major leagues careers. The latest pitcher drafted was high schooler Kenny Rogers by the White Sox in the 39th round. No college player drafted after the 8th round made the list while nine of the HS guys were drafted in the 8th round or later.
I could use some help drawing conclusions here. It seems like it is not a particularly good idea to draft a college pitcher high unless you are pretty sure he will make it to the bigs pretty quickly. Six of the eight college pitchers on the list were drafted in the first two rounds while Jake Peavy, John Smoltz, and Roy Oswalt were drafted in the 15th, 22nd, and 23rd rounds respectively.
by giveml on Dec 11, 2007 11:37 PM EST reply actions 0 recs
great job
FYI Pettitte was a draft and follow out of HS (NYY, 22rd, 1990) and buehrle was drafted out of college (CWS, 38rd, 1998).
by SleepyCA on Dec 12, 2007 12:28 AM EST up reply actions 0 recs
nice
I am of the opinion right now though, that the cardinals should spend there early round pics on quality college pitchers, when available, based solely on the fact that they have relatively poor pitching depth in the minors. Then use later rounds on 'signability' HSers. This would potentially fill out the ranks more quickly, allow the Major league squad to have genuine options regarding free agency vs calling up rookies (in the near future), and still allow potential HS 'upside' pics to develop at a more leisurely rate.
But I could be convinced otherwise as we collect more data on this.
by cdb on Dec 12, 2007 10:58 AM EST up reply actions 0 recs
different results
This could due to the lack of an innings filter in the data I am using. But it also might be related to the fact that college pitchers tend to arrive at the majors at an older age than HS pitchers (the good ones at least). By using a 1000 inning cutoff, you might be biasing for those that arrive in the league at a younger age - maybe. Or it could be that ERA+ is more sensitive than WHIP.
As an interesting aside, if I look at all pitchers outside the top 25% - that is the mediocre to bad pitchers, there are 436 in total. 157 of them are high school draftees(36%) and 279 are college draftees (64%). Same proportion as the top 25% or 10%.
by cdb on Dec 12, 2007 11:36 AM EST up reply actions 0 recs
does anybody have the numbers
by giveml on Dec 12, 2007 11:54 AM EST up reply actions 0 recs
HS% of draftees
Year number % HS
1995 820 42.2
1996 847 41.7
1997 793 43.3
1998 764 42.4
1999 765 41.3
2000 798 37.1
2001 809 41.7
2002 704 36.9
2003 773 32.4
2004 793 29.0
2005 796 29.0
2006 764 29.0
By draft round (all years combined):
Round Number % HS
1-5 1025 41.2
6-10 930 31.0
11-15 908 28.0
16-20 949 27.7
20-25 892 28.7
26+ 4722 42.9
Interesting to look at the trend in HS draftees over time - seems like fewer HS pitchers are being drafted of late. Also interesting to see the spike in HS draftees in the late rounds (26+).
I don't have data for AFA
by cdb on Dec 12, 2007 12:16 PM EST up reply actions 0 recs
WHIP
by giveml on Dec 12, 2007 11:59 AM EST up reply actions 0 recs
WHIP
I and think that people have issues with BABIP as being random - batting average of balls in play. I think that pitchers have control over the number of balls put into play, but no control over what happens after it is hit.
Maybe some of the more experienced sabermetricians could comment on the shortcomings of WHIP????
by cdb on Dec 12, 2007 12:21 PM EST up reply actions 0 recs
isn't it funny
by giveml on Dec 12, 2007 12:38 PM EST up reply actions 0 recs
yeah...
giveml - great work. i have one question, tho: if it's easily done with your data set, could you lower the minimum innings requirements? i might even lower it down to 150 or 200 IP, so that stud relief pitchers could be included. 1000 IP is a lot, so you've ended up with only 38 observations. a better picture might be drawn by lowering the bottom standard.
another interesting exercise would be to lower the ERA+ requirement to 100. in other words, look at how many of these guys are above-average. to some extent, picking a Santana or Oswalt is like catching lightening in a bottle: it isn't going to happen often. but longer trends of above-average performance might give us a better idea of what's really going on.
by kindred on Dec 12, 2007 7:50 PM EST up reply actions 0 recs
FTR on Buehrle
I met Bob Dunahue (the coach over there) one time about 4 years ago and he told me a story about the relatively new "Mark Buehrle Rule." No coach at Howell North is allowed to cut a player unless Dunahue signs off on it.
by redbirdnation8206 on Dec 13, 2007 2:14 AM EST up reply actions 0 recs
Fantastic diary,
by Alxfritz on Dec 12, 2007 10:56 AM EST reply actions 0 recs
Excellent
by AustinBOB on Dec 12, 2007 3:01 PM EST reply actions 0 recs
what and when
by sportsman on Dec 12, 2007 8:21 PM EST reply actions 0 recs
OK, so here's the story
I don't have a way to identify the population down below 1000 IP, so I will stick with what these data show me. Anyway, if I am drafting and looking for guys who are significantly above average I am for the most part looking for starters. Most relievers just kind of happen. I think it is fair to define a starter who is significantly above average as one who can rack up 1000 IP. That's only 5-7 years of starting.
The numbers are a bit different as the college pitchers fare better with the same percentage of the pool as HS pitchers - 43%. That leaves only 14% for the amateur free agents (i.e. Latins from outside U.S.) Combined with the numbers from the previous post on pitchers with ERA+ of 110 and above, it means that of all the active pitchers with ERA+ above 100 the breakdown is as follows:
AFA 11/59 19%
College 18/59 30%
High School 30/59 51%
Combined with the total draftees numbers provided by cbd indicating that historically a lower percentage of HS pitchers (42% and declining) have been drafted, it is striking that the numbers are skewed so far in favor of the HSers. I guess you could say that a HS pitcher is close to twice as likely as a college pitcher to become an above average MLB starter.
Again, it doesn't seem all that difficult to identify who these guys are as 13/21 or 62% of this group were either AFAs or drafted in the first two rounds. Woody Williams has the distinction of being the lowest drafted college pitcher to make the cut as he was drafted in the 28th round. As most of you are probably aware, Woody was drafted as a shortstop. That means that NOT A SINGLE ONE of the 59 currently active above average starters with over 1000 IP was a college pitcher drafted after the eighth round. I hope Luhnow and company are aware of this. Eight of the HS pitchers on the list, or 14% of the total were drafted after the 8th round.
Thanks to Matt Morris with an ERA+ of 109 the Cardinals were not entirely shut out. He is the only active big league starter with 1000 IP and an ERA+ above average to have been selected by the Cardinals in a period that roughly covers at least a dozen drafts. Again, the overall leader in selecting top pitchers is the Cubs with six.
If you wonder what an ERA+ of 100 buys you there were three of those: Livan Hernandez, Aaron Sele, and Steve Trachsel. Some names that were under 100:
Joel Pineiro 97
Matt Clement 96
Jason Marquis 96
Kip Wells 96
Kyle Lohse 95
Jeff Weaver 93
by giveml on Dec 12, 2007 10:54 PM EST reply actions 0 recs
hmmm
interesting... So much for the "college pitchers are the safe bet" meme. Though 1000IP may be a bit high.
"a HS pitcher is close to twice as likely as a college pitcher to become an above average MLB starter."
did you account for the number of each chosen?
by SleepyCA on Dec 13, 2007 3:22 AM EST up reply actions 0 recs
well, sort of....
As far as the 1000 IP goes, I agree that seems to be a pretty large number. But if you are a GM that wants to tap superior starting pitchers then your measurement window is the six years you can control that pitcher. An above average starter should get you at least 1000 IP in those six years as that is 167 IP per year. Also, that was the only data I could lay my hands on. I think 700 would be a nice cutoff, but I doubt the numbers would be much different.
I guess the corollary to this is that college pitchers, since there are more of them in the big leagues in total, must be more likely to be relief pitchers and below average starters. Or they don't make it to 1000 IP...
by giveml on Dec 13, 2007 10:47 AM EST up reply actions 0 recs
Cards mediocrity
Your ERA+ data is pretty convincing, but I am still puzzled by the discrepency with WHIP - I will try to figure it out soon. But I will say that if the ERA+ 1000 inning data is reliable, there is a good reason the cardinals don't have any representatives on your lists; they don't draft HS pitchers. This table has each team and the percentage of HS pitchers over the years 1995-2006:
Cardinals 21.2
Athletics 22.39
Giants 23.99
Rockies 28.66
Angels 29.17
Blue Jays 30.53
D'backs 31.42
Phillies 31.62
Tigers 31.91
Indians 32.35
Padres 32.57
Yankees 33.8
Astros 34.56
Rangers 35.28
Brewers 35.96
Reds 36.45
Marlins 37.76
WhiteSox 38.94
Mets 40
RedSox 40.69
Twins 41.89
Mariners 42.2
Nationals 43.17
Royals 43.54
Cubs 43.89
Orioles 44.87
Pirates 47.49
D'Rays 48.83
Dodgers 49.07
Braves 53.01
Cardinals are the LEAST likely team in all of MLB to draft a pitcher out of high school over that period.
by cdb on Dec 13, 2007 11:19 AM EST up reply actions 0 recs
Wow.
by the red baron on Dec 13, 2007 8:01 AM EST reply actions 0 recs
muchas gracias
by cdb on Dec 13, 2007 6:52 PM EST up reply actions 0 recs
numbers are the anti-drug...
I love all the discussion this is generating and the great work of other posters suggests that WHIP may not be an ideal metric for determining the value of HS vs Col draftees. I downloaded data from Baseball Prospectus for pitchers in the 2007 season. They don't have ERA+, but they do have RA+, which they claim to be analogous to ERA+ (higher is better). Also included is the VORP statistic, which many love so dearly. I also calculated WHIP from their BB/9 and H/9 data. I merged this data with the data I have on drafts. This new data is only 2007 data, and is also only calculated on active pitchers.
At this point, I realized something - that the draft datasets that I have list simply which name was submitted for each position - which means that people drafted more than once are in the dataset multiple times (i didn't realize this before, because they were sorted by year/draft position). Often a player is drafted out of high school, but does not sign, and is then again drafted out of college - sometimes multiple times (some were drafted four times before they signed). So what this means is that for those pitchers that did not sign when drafted out of high school, but did sign when drafted out of college, I was using the same data for both HS and College. One would expect that this might artificially reduce the discrepency between HS and College. I apologize for the error - it was too much data to handle and I didn't inspect closely enough.
To get more reliable data, I used the dataset from BP (containing only 2007 pitchers), and merged it with the perviously generated draft dataset, so that I now have all the 2007 pitching data and the career WHIP and draft data in one table. 2007 VORP is a pretty good indicator of quality I would think, but one also would expect WHIP and ERA+ or RA+ to reflect quality. I reanalyzed the new dataset, using a new draft status. There are three categories:
H,H - drafted from high school, signed out of high school
H,C - drafted out of high school, signed out of college (multiple drafts)
C,C - drafted out of college, signed out of college
The data below are for 4 metrics - career WHIP, 2007 WHIP, 2007 RA+, and 2007 VORP:
All rounds:
Draft n Car_WHIP 2007_WHIP 2007_RA+ 2007_VORP
C,C 120 1.403 +- 0.202 1.425 +- 0.260 1.162 +- 0.630 13.607 +- 17.276
H,C 45 1.456 +- 0.186 1.454 +- 0.222 1.056 +- 0.367 8.867 +- 15.034
H,H 79 1.439 +- 0.187 1.455 +- 0.255 1.053 +- 0.309 14.400 +- 19.545
First five rounds only:
Draft n Car_WHIP 2007_WHIP 2007_RA+ 2007_VORP
C,C 53 1.354 +- 0.218 1.369 +- 0.285 1.285 +- 0.845 15.821 +- 16.453
H,C 23 1.456 +- 0.161 1.482 +- 0.190 0.926 +- 0.213 8.717 +- 14.855
H,H 51 1.451 +- 0.179 1.481 +- 0.252 1.034 +- 0.308 15.004 +- 19.758
This dataset is more limited than the last, but the data are more reliable, and the trends are essentially the same as the original dataset I was working with. HS draftees, either directly or indirectly (through college) do not appear to outperform college draftees. To more directly compare to giveml's data, the below is for just the top 25% RA+ pitchers (RA+ > 1.2).
Draft n Car_WHIP 2007_WHIP 2007_RA+ 2007_VORP
C,C 40 1.265 +- 0.160 1.203 +- 0.187 1.664 +- 0.869 26.757 +- 16.971
H,C 11 1.309+- 0.164 1.198 +- 0.194 1.539 +- 0.359 20.791 +- 13.619
H,H 24 1.306 +- 0.125 1.254 +- 0.134 1.406 +- 0.221 30.8 +- 18.686
Still, the differences are not exactly apparent when we look at career WHIP or 2007 WHIP. There is a slight trend for VORP and RA+, but in opposite directions - H,H players have a lower (worse) RA+ than C,C, but a higher VORP. Still the variability is huge in all cases for these two derived metrics - much larger than the relative variability for WHIP. So this still leaves us in a bit of a quandary with regards to the difference between the approach I have taken vs that taken by giveml. Why are the results at odds? Part of the problem is that I am using only RA+ data from 2007, while I think giveml's ERA+ data are career data (correct me if I am wrong). I also used WHIP, but WHIP correlates pretty well with RA+, which is supposed to behave like ERA+ (BPs words, not mine): increased WHIP results in lower RA+. (I can show some plots if someone knows how to post images).
But I think part of it comes down to approach - if we look at those we know have had success, we can see that a larger percentage than would be expected are high school draftees. In the 2007 season, almost exactly 50% of the pitchers were drafted out of high school originally. If we look at all the drafts from 1995 - 2006, HSs accounted for 37% of the draftee pitchers. So when we look at those that made it to the majors in 2007 (hindsight is 20:20), there does appear to be an advantage to drafting HSers. But if we look at the average HS vs the average Col pitcher, we can't tell them apart, suggesting that the level at which you are drafted is not very predictive of future success.
by cdb on Dec 13, 2007 4:52 PM EST reply actions 0 recs

by 
















