Viva El Birdos: An SB Nation Community

Navigation: Jump to content areas:


Sports blogs for fans, by fans.
Around SBN: Cal RB Jahvid Best Seriously Injured, Carted Off Field

HS vs College pitchers

As you all know - we need pitching, and quality pitching is expensive to buy.  The ability to draft quality pitching can make or break teams that can't afford to (or just will not, out of principle) buy the highest quality free agents.  I realize that this is probably not the best time to be discussing the draft, but we constantly come back to the `high risk/high reward' vs `safe pick' discussions, whether it is in reference to signing free agents, drafting amateurs, or trade proposals.  And with the recent front pages summarizing the Cardinals farm system (courtesy of Red Baron), I decided that I really wanted to know more about the various draft strategies, in particular as those strategies apply to pitching.  So I started with one main question:  how do high school draftees fare as compared to college draftees?  I actually asked the community this question a long time ago buried deeply in some other thread, but received no answers.  I haven't found an answer online anywhere (though I may just be looking in the wrong places), so decided to try to answer it myself.  

To try to answer this question, I downloaded draft datasheets on all players drafted as pitchers between the years 1999 and 2006 (data from www.baseball-reference.com ).  I chose the years based on the assumption that players drafted in 1999, if they are going to get major league experience at all, should have done so by the 2007 season. There may be a few stragglers that are called up after 9 or more years in the minors, but these should be relatively few. This datasheet contains the draft positions for all players, along with the school they were drafted from, and some career stats, including ERA, WHIP, and OPS.  Rather than search for stats on all 6202 pitchers drafted in that time, I used the stats provided.  

I used the WHIP statistic first as a surrogate for MLB experience.  If they have a WHIP, they have major league experience.  I also was able to use the school that players were drafted from to determine high school (HS) vs college (Col) draftees.  So let's look first at the percentage of draftees that make it to the majors, comparing HS to Col draftees.  

Year    HS(%)    Col(%)
1999    8.2    14.3
2000    10.5    11.1
2001    6.5    9.5
2002    6.5    10.4
2003    3.2    7.5
2004    1.7    4.4
2005    0    2.3
2006    0    0.9

This confirms what many here have known to be true:  HS pitchers take longer to reach the major leagues and have a lower likelihood of making it to the majors at all.  Further, only 11.8% of pitchers drafted in 1999 have ever pitched at the major league level.  I should note here that a few players drafted as pitchers have converted to position players (such as Brad Hawpe of the Rockies), and therefore probably should not be considered as `failures'.  

However, making it to the majors is one thing - succeeding at that level is something quite different.  And this is really the question that I had - it is often implied that HS draftees have a higher `upside'.  So the question is this: do HS draftees that make it actually perform better than Col draftees?  To look at success, I used WHIP (may not be ideal, but I wasn't going to try to find more appropriate stats for 6000+ pitchers - and it should be noted that we are dealing with the league as a whole, so team defense or park effects should be relatively standardized, making defense-dependent stats more trustworthy).   So how does the average (mean) Col pitcher fare against HS (note: using the median gave similar results)?

Year    HS    Col
1999    1.538    1.681
2000    1.571    1.688
2001    1.696    1.687
2002    1.730    1.594
2003    1.523    1.497
2004    1.443    1.482
2005      - -     1.589
2006      - -    1.224

What does this tell us?  First of all, college pitchers that make it to the majors very quickly are good (low WHIP).  Second, good HS pitchers take longer, but after about three years, the good ones start to make it, but they don't perform as well as the first year college pitchers (comparing 2004 HS to 2006 Col).  Third, WHIPs for college pitchers generally rise the longer we get from the draft year.  I interpret this to mean that the minor league journeymen are slowly being called up, and that the reason that they were in the minor leagues for seven years is that they aren't very good.  On the contrary, the HS draftees show a slight decreased for the 1999 and 2000 draft years.  Maybe this reflects the `high upside' that we always reference.  Those HS draftees can take a long time, but the potential to figure it out and succeed in the bigs doesn't go away.  So if you are looking for pitchers quickly (1-2 year - 2005,6 data), it might be advisable to draft a quality college pitcher (such as everyone's favorite trade target, Tim Lincecum or Ian Kennedy or Joba Chamberlain - all 2006 college draftees).  Even after six years, the difference in WHIP between Col and HS draftees (years 2001-2004) is minimal.  The upside, on average, doesn't appear until years 7-8 (1999 and 2000 drafts).  

To come back to the original question, do HS pitchers outperform Col pitchers?  I think that to answer that, one has to look at both the rate that the draftee makes it to the majors (MLB%) AND the stats (WHIP, in this case).  Let us look just at the years 1999 and 2000 - draft years in which most of the draftees will have pitched in the majors if they ever will.  

Draft    MLB(%)    WHIP   
HS     9.15        1.556
Col    12.62        1.684

So if you are drafting 8 years into the future, a HS pitcher is less likely to ever make it to the majors than a college pitcher, but those that do tend to be slightly better pitchers.  This does seem to confirm the `high risk: high reward' approach does what it purports to do: a typical HS draftee is less likely to ever see the majors, but more likely to succeed if they do make it. But the high reward only comes to the very patient - 7-8 years after the draft year.

With that being said - should the cardinals, with a relatively low quality farm system, be drafting HS or Col pitchers?  Probably the answer lies somewhere in between.  Col pitcher can fill out the ranks faster than HS, but over the long term, HS can be slightly better on average. Is the slight difference in WHIP worth the decreased likelihood of making it to the big leagues?  I don't know;  makes for a nice debate topic though.  

0 recs  |  Comment 38 comments

Story-email Email Printer Print

Comments

Display:

Round taken
i applaud your efforts but i have to wonder if this question should only be applied to the first 1-3 rounds of the draft. after round 3 the signing bonuses are much smaller and the likelihood of finding a gem becomes more and more of a crapshoot. in the first 3 rounds i would think that the question is much more relevant. furthermore, i think that the philosophy of college vs. high school would yield larger result gaps. maybe you can narrow down your results to answer this question.

by indakind on Dec 11, 2007 1:19 PM EST reply actions   0 recs

round effect
Since I had the data, I actually looked at the effect of draft round vs WHIP - no correlation.  To more directly address the question you pose, and put it into numbers, group A is only rounds 1-3, group B is all other rounds (data from all pitchers that made it to the majors):

Draft   A        B
HS      1.503    1.672
Col     1.546    1.656

So there is a slight trend there for group A HS draftees being slightly better than Col, but ever so sligth.  

by cdb on Dec 11, 2007 2:30 PM EST up reply actions   0 recs

thanks for the answer
thanks for the answer. one thing i think needs to be noted is that high school pitchers taken in the first round probably require higher signing bonuses than equivalently slotted college players because the high school kid has the option of going to college and can leverage that option in contract negotiations with the team. otoh, the college kid's only option is to go back for his senior year (if a junior) or go to the independent league. thus, high school pitchers become an even higher risk due to their lesser chance of reaching the majors coupled with higher signing terms.

by indakind on Dec 11, 2007 3:14 PM EST up reply actions   0 recs

right....
... so if we're measuring the statistical worth of high school pitchers (as a group) vs. college pitchers (as a group), and we were hoping to come up with a some sort of Expected Value metric, we'd have to discount it by the difference in monetary cost. So, for example, Porcello got $7mn. For a team with relatively certain budgets (e.g. the Cardinals @ ~ $115mn. this year) could that team spend the money better on "lesser" talents (or on other means of development) and get a higher discounted present value -- measured in terms of added marginal wins -- once risk and cost are factored in?

my guess is the answer is usually a resounding "yes," but i'm willing to be shown wrong.

by kindred on Dec 11, 2007 3:31 PM EST up reply actions   0 recs

That's a diary.
Good job.

I wonder how the numbers are affected by where they are drafted. I know there would be no causation but there might be a correlation.

Maybe like in poker. Sometimes to call to draw to the flush and sometimes you don't. The odds will tell you. Same with pitchers. maybe a pitcher with a scouted score of 72 (or whatever a good scouted score is) factored with chance of success will tell you if you are getting value.

I personally don't like High School pitchers. To much chance of injury. let them get hurt at Cal State Fullerton is what I say.

by Harknights on Dec 11, 2007 1:21 PM EST reply actions   0 recs

Good Diary...
... i wish i wasn't so busy, so i could follow up better. i think draft position is probably somewhat important, but i imagine that if you control for that then the college pitchers will look a lot better. there probably aren't many high school pitchers going in the 12th round. so, if a lot of the college pitchers are going in the late rounds, and their top potential is as a middle-reliever, that could skew the numbers, making college pitchers look worse as a group than they really are.

i'd also like to normalize for IP in the majors, but that probably isn't possible without tons of data work. still, i think it is relevant to parse whether these guys are getting September call-ups for the Royals and just throwing a handful of innings, or whether they're pitching full seasons for contending teams. since WHIP is the measure of quality, presumably a few dudes that come up and throws 5 innings of 0.85 WHIP in the majors could skew the numbers. the same could happen at the top of the distribution.

i also would like to see some sort of breakdown of standard deviations. the mean values only tell us so much. high school pitchers could still be more high risk/reward types even if the mean values are exactly the same as those of the college pitchers. the standard deviations could clue us in on that.

but it's still a great diary.

by kindred on Dec 11, 2007 1:56 PM EST reply actions   0 recs

variability
I can't answer all of the questions.  I do know that many of the pitchers fizzled out after a year, so that is one point.  But I don't have that data accessible. And the pitchers that were just called up in september - small sample size.  Can't argue with that.  

I can provide some estimate of variation though. Draft rounds one to three are A, rounds 4 and up are B.  These are the ranges in WHIP (all pitchers that made it to majors are included):

HS-A   1.18 - 1.90 (42 samples)
Col-A  0.75 - 3.32 (106 samples)
HS-B   0.99 - 4.00 (65 samples)
Col-B  0.63 - 4.20 (187 samples)

Or as mean +- Standard deviation

HS-A  1.50 +- 0.2
Col-A 1.55 +- 0.37
HS-B  1.67 +
- 0.55
Col-B 1.66 +/- 0.56

So more variation for Col than HS in early rounds, no difference in late rounds.  

by cdb on Dec 11, 2007 3:19 PM EST up reply actions   0 recs

nice work...
... and sort of counter-intuitive. on the surface, this seems to show that college pitchers can be higher risk/reward than HS pitchers. of course, this only considers those that make the majors, but it's still interesting.

by kindred on Dec 11, 2007 3:34 PM EST up reply actions   0 recs

Nice work
That college pitchers tend to outperform high school pitchers is one of Bill James' tenets:
Players taken in the June draft coming out of college (or with at least two years of college) perform dramatically better than players drafted out of high school.

Since pitchers tend to peak later than hitters—in their late twenties—it makes a ton of sense to draft pitchers who'll be peaking in their cost-controlled seasons, and who you've shown are more likely to contribute in the show.

That being said, I think it's a good idea to draft a few more HS pitchers. With the new CBA, you have 5 years to develop them before they need protection—having some talented arms that can be broken in as relief pitchers before converting to starters (like Wainwright) is a solid way to run a system.

Once things settle down a bit, I'll look into this too. It'd be a fun off-season project to put together a community-wide collaborative draft strategy report.

by liam on Dec 11, 2007 2:39 PM EST reply actions   0 recs

i agree with the last point...
... this could be a good community project for this year. i'm no statistical demon, but i'd be happy to chip in in any way i can.

there are some sports economics journals out there now, i believe; if we do it right, there might even be a chance of publication.

by kindred on Dec 11, 2007 3:35 PM EST up reply actions   0 recs

The Answer
I've always thought that the answer was somewhere in the middle.  There is the occasional arm that is to good to pass up out of the high school ranks.  However, a wiser choice is for the majority of your picks to be college players (not just pitchers).  This data seems to reinforce this, as you're gambling with a lot of money by taking highly touted HS arms.  

Outstanding diary, btw.

by redbirdnation8206 on Dec 11, 2007 4:18 PM EST reply actions   0 recs

One important thing to note here
is that most of the best pitchers never went to college.  They were drafted out of high school and went straight to the professional ranks.  Therefore, comparing HS to college pitchers isn't comparing apples to apples.  This isn't a criticism; rather, it's a reflection of the available talent pool.  

If HS pitchers, generally, perform better -- they should.  They're the better pitchers, by and large.  

If college pitchers, generally, perform better (or even as well) -- doesn't that speak to the fact that pitching is so difficult to predict?  You'd be taking lesser talent and having it produce as well or better.

So, to your question, which should we draft?  The HS pitchers should be the more talented b/c the best of them never make it to college.  College pitchers are probably less risky and closer to the majors but rarely have the upside of the HS pitchers.  The major infusions of talent come from the HS ranks.

by chuckb on Dec 11, 2007 4:29 PM EST reply actions   0 recs

additionally
I agree with your assessment, but I wonder if the best strategy isn't to go after great high school pitchers early and then fill out with solid college kids later on.  For every Maddux and Glavine there's a Steve Avery or Rick Ankiel.  

I remember an interview with Warren Spahn a few years back when he was talking about the younger players today not throwing enough as kids to develop arm strength and the effects of it on arm strain injuries.  College pitchers go through a much more rigorous conditioning program than most high school kids see.

I think this also proves that college pitchers, for the most part, are able to have longer careers and are less likely to get injured after going through a college strength and conditioning program.  Obviously there are exceptions (Darren Dreifort please stand up), but generally I believe this is the case.  Most college pitchers also are more polished, have better mechanics, and are cheaper to sign, so going after only "great" high school arms and paying the tax seems like a feasible draft policy to me.  Regardless, both are usually going to hit the majors around 22 or 23 years old -- those that come up and work heavy loads earlier usually end up petering out because their body couldn't handle it or they had poor mechanics that never got straightened out (Doc Gooden, Steve Avery, Kerry Wood)

The other thing that hasn't been mentioned here is that a lot of college pitchers with average fastballs have to re-learn to use the inside part of the plate many times because of the aluminum college bats.  Since jamming hitters with aluminum isn't nearly as effective as jamming hitters using wood, there are a few college pitchers who've had trouble making the adjustment.  Mike Mussina would be one who ended up having a nice career, Ben McDonald would be an example of a guy who never made the adjustment despite having ++ stuff.

"The Cardinals have won a World Series in THEIR new stadium!" --my Uncle Jim to a heckling Cubs fan

by fourstick on Dec 11, 2007 5:48 PM EST up reply actions   0 recs

Are they more polished?
Do college pitchers have better mechanics?  These might be true but you're comparing college pitchers who are 3 years older w/ 3 years more coaching to the HS pitchers.  Are college pitchers more polished than HS pitchers w/ 3 years of experience in the minors?  Maybe.  Maybe not.  Do they have better mechanics than HS pitchers w/ 3 years of pro experience?  I don't know.

As for your strategy of picking HS pitchers early and college pitchers later -- that seems reasonable to me.  If you pick HS pitchers later, they're likely to go to college anyway and you may as well try and reap the rewards of the extra coaching that college pitchers have received.

by chuckb on Dec 11, 2007 6:16 PM EST up reply actions   0 recs

my point
High School pitchers after three years in the minors might be just as fundamentally sound as a pitcher coming out of college, but if you draft them in high school and pay them for those four years they may turn out to hurt themselves before making an impact.  You've now wasted time with a player for 2 or 3 years when you could have taken a college kid who's pitched for 2 or 3 years and developed his body to the point where he's probably ready to jump into the AA or AAA level within his first 18 months in pro ball.

My point is that with college kids you generally have a better idea of what you're getting and I think that college pitchers are more reliable, although they may have a higher upside.  They also usually cost the club less money in the long run.

"The Cardinals have won a World Series in THEIR new stadium!" --my Uncle Jim to a heckling Cubs fan

by fourstick on Dec 13, 2007 9:49 AM EST up reply actions   0 recs

Flashback to 1997
I have been intrigued by this question for a long time.  It is very difficult to really prove anything, but lots of fun to talk about.  I don't really agree with the argument that the best pitchers sign out of high school.  My son is a D1 pitcher and I have seen several other prospects who were very highly regarded who essentially took themselves out of the draft by raising "signability" issues.

So, just for fun on a lousy day I decided to take a closer, although admittedly unscientific look, at the draft class of 1997.  I chose 1997 solely because it was ten years ago and it seems like anybody who is going to make a mark would have done so by age 28.

Interestingly enough there were sixty pitchers taken in round 1-3, including the supplemental round.  The data are as follows:

Summary       
    HS    College
Active    7    6
Done    7    13
DNP    16    11
Total    30    30

Active  MLB       
IP       
Mean    471    706
Min    55    175
Max    1428    1714
Median    242    417
ERA +       
Mean    99    101
Min    77    87
Max    118    127
Median    101    99

Retired  MLB       
IP       
Mean    108    216
Min    11    2
Max    228    544
Median    84    188
ERA +       
Mean    72    76
Min    63    30
Max    97    95
Median    72    81

From the summary information you can see that college pitchers (63%) were more likely than HS pitchers (47%) to at least get a cup of coffee at the big league level.  FWIW, I counted JUCO players as college level.  There were only three and only one made it to the show with insignificant results.  I don't think it is too surprising to see that college pitchers are more likely to get to the big leagues.  

It was interesting to me that a higher number of high schoolers (23%) than college pitchers (20%)are still active in the big leagues.  Probably because they are younger than the college guys, but the college guys should only be 31 or 32 years old.  Not really a significant difference anyway.

The active pitchers look pretty comparable to me with the college guys having a slight advantage - again probably due to the age difference.  One could easily assume that the career accomplishments of each group will be pretty similar.  So it comes down to if you like the college guys from this draft, most notably:

Jeff Weaver
Randy Wolf
Scott Linebrink

or the better HS guys:

John Garland
Aaron Cook
Jeremy Affeldt
Rick Ankiel

I have no idea how to deal with the Ankiel factor, but he still was a very good pitcher with a career ERA+ of 118 which would put him in the top 150 of all time if he could have done that over a career.

The only really significant difference in the two popluations is that the college guys who made it for a while and faded away seem to have made a better mark than the HSers.  To me, the fact that college pitchers might offer you 188 innings at an ERA+ of 76 versus only 84 career innings at an ERA+ of 72 is statistically interesting, but should have no bearing on draft strategy.

I would like to take a look at all the current big leaguers with ERA+ over 100 and see where they came from.  My guess is the answer will be Latin America, Asia, and high school.  I will try to put that together in the coming weeks unless someone else wants to step up.

Those Pilgrims ain't lookin' so proud now...

by giveml on Dec 11, 2007 6:29 PM EST reply actions   0 recs

more data
After some of the comments, I thought that more data, going back in time, might be beneficial.  Below is the WHIP for all pitchers from draft years 1995-2000 that made it to the majors. WHIP is presented either by mean and standard deviation, or as the median. Also, the last column is the percent that made it to the majors.  Round indicates draft round.  

1995-2000                   
Round    Level    Mean    St Dev    Median    MLB%
1_5    Col    1.67    0.48    1.55    45.42
1_5    HS    1.64    0.51    1.50    39.24
6_10    Col    1.63    0.40    1.54    21.68
6_10    HS    1.69    0.55    1.57    17.39
11_15    Col    1.69    0.46    1.61    14.70
11_15    HS    1.65    0.54    1.57    13.21
16_20    Col    1.60    0.31    1.52    11.80
16_20    HS    1.56    0.33    1.48    12.12
21_25    Col    1.67    0.46    1.56    9.03
21_25    HS    1.69    0.27    1.70    5.41
25+    Col    1.63    0.47    1.52    5.36
25+    HS    1.61    0.31    1.54    3.76

The differences, with this larger sample size, become even less apparent (by this measure) between HS and Col.  There may be a better way of gauging 'upside' or talent than I am using, but this measure is not detecting any upside for HS draftees.  

Sorry - I was going to make this a diary edit, but I couldn't get the formatting correct - it was a mess.  

by cdb on Dec 11, 2007 7:24 PM EST reply actions   0 recs

So Where Do All the Good Pitchers Come From?
I have been thinking about this HS vs. College thing for a while now and it occurred to me I was thinking about it all wrong.  While it is a great thing to draft in a way that maximizes your potential for developing major league talent, I have been thinking about it in terms of quantity rather than quality.   I thought why not just look at the best pitchers and see where they lurk.  After all, if I'm a GM I get a lot more excited about drafting an Albert Pujols than a Skip Schumacher - nothing personal against Skip.

So I started trolling around for data as I am not exactly a sophisticate in this type of analysis.  Thankfully I didn't have to go too far to find a comprehensive list of the top pitchers as ranked by ERA+.   This may or may not be the ultimate determinant of a pitchers value, but I was pretty sure it was a valid enough measure for trying to rank pitchers.  As an ERA+ of 100 represents major league average, I decided to limit my study to pitchers with an ERA+ of 110 and above.  Of  all the pitchers active through 2007 with at least 1000 IP, there are 38 with an ERA+ on the good side of 110.

These 38 pitchers come from one of three places:  they are amateur free agents (AFAs) signed outside the U.S. or they are either high school or college pitchers draft in the MLB amateur draft.  I don't know the total number of prospects signed each year from each source, but of the top 38 pitchers the breakdown is as follows:

*    8 AFAs (21%)
*    9 College picks (24%)
*    21 High School picks (55%)

So much for my earlier post that, "I don't really agree with the argument that the best pitchers sign out of high school."  Either I was wrong or the minor leagues are a much better place to develop than college.  I don't think the disparity could be that great, so it is a pretty safe bet I was wrong.  

Some argument could be made that while the HS picks have been more successful some of the very best pitchers have come from both the AFA signees and the college picks.  After all, Pedro Martinez, Johan Santana, and Carlos Zambrano are three of the top eight.  Three college players, Roger Clemens, Brandon Webb, and Randy Johnson are also in the top eight.  That leaves only Roy Oswalt and Greg Maddux to represent the high schoolers.    On top of that the average ERA+ for the AFA group is 124, the college group is at 125, and the high school group trails at 119.  A better way to look at is who would you want to be in your starting rotation.  I would love to have any of these five during their peaks.  Ranked by ERA+ each group would be represented as follows:

AFA               College      HS
Pedro Martinez      Brandon Webb      Roy Oswalt
Johan Santana      Roger Clemens      Greg Maddux
Carlos Zambrano      Randy Johnson      Roy Halladay
Andy Pettitte!!      Tim Hudson      Mark Buehrle
Bartolo Colon      Mike Mussina      Derek Lowe

A couple of notes on the pitchers above:  I don't know Andy Pettite's history, but my source showed him as an amateur free agent.  Doesn't sound right, but as Joaquin would say, "youneverknow".  Mark Buehrle doesn't have a college listed, but I thought for sure he went to Jefferson College for at least a year.  Maybe he was a draft and follow.

One additional bit of interest, especially to Cardinal fans, is which teams actually identified these guys.  I don't think many will be surprised that the Cardinals join the Devil (or not) Rays, Giants, Mariners, Phillies, Reds, Rockies, Royals, and Twins as the only MLB teams not to have signed or drafted a single one of these 38 pitchers.  My guess is these are all teams that don't have overseas operations and draft mostly college pitchers.   The winner in drafting the most of them is, of course, the Cubs with four.  I think that shows the value of drafting high....The Astros, Blue Jays and Braves come in second with three apiece.  Interestingly enough, the Astros were fortunate enough to have been the only team that signed two AFAs who made the list.  Unfortunately for them, they have the distinction of not having seen a single MLB inning from either Johan Santana or Freddy Garcia.

Finally, it apparently isn't all that hard to identify these guys as 15 of the 30 drafted players on the list were taken in one of the first two rounds.  Since eight were AFAs, only 15 players drafted after the second round have put up significantly above average major leagues careers.  The latest pitcher drafted was high schooler Kenny Rogers by the White Sox in the 39th round.  No college player drafted after the 8th round made the list while nine of the HS guys were drafted in the 8th round or later.

I could use some help drawing conclusions here.  It seems like it is not a particularly good idea to draft a college pitcher high unless you are pretty sure he will make it to the bigs pretty quickly.  Six of the eight college pitchers on the list were drafted in the first two rounds while Jake Peavy, John Smoltz, and Roy Oswalt were drafted in the 15th, 22nd, and 23rd rounds respectively.  

Those Pilgrims ain't lookin' so proud now...

by giveml on Dec 11, 2007 11:37 PM EST reply actions   0 recs

great job
i think you came to a very interesting and correct conclusion here.  It might be interesting to plot these guys on a graph with ERA+ on one axis and draft round on another, with source (HS or College) denoted by color...  I'd be very interested in seeing how that looks.

FYI Pettitte was a draft and follow out of HS (NYY, 22rd, 1990) and buehrle was drafted out of college (CWS, 38rd, 1998).

Christmas lights are an extremely efficient method for converting Christmas Spirit into heat. -anon

by SleepyCA on Dec 12, 2007 12:28 AM EST up reply actions   0 recs

nice
That is interesting... Filtering out the  mediocre to bad pitchers seems to make the trend more clear.  So more of the most successful pitchers were drafted out of high school.  Of course, that still does have to be interpreted in the context of failure rate and time to the majors, but that data does suggest that the perceived 'upside' of HS draftees is real.  

I am of the opinion right now though, that the cardinals should spend there early round pics on quality college pitchers, when available, based solely on the fact that they have relatively poor pitching depth in the minors.  Then use later rounds on 'signability' HSers.  This would potentially fill out the ranks more quickly, allow the Major league squad to have genuine options regarding free agency vs calling up rookies (in the near future), and still allow potential HS 'upside' pics to develop at a more leisurely rate.  

But I could be convinced otherwise as we collect more data on this.

by cdb on Dec 12, 2007 10:58 AM EST up reply actions   0 recs

different results
I tried to do a similar thing using the dataset I have.  THe difference is that I did not use a 1000 inning filter, and that I am using WHIP instead of ERA+.  I took all pitchers drafted between 1995 and 2000, and looked at the distibution in WHIP.  I then took only the top 25% of those pitchers (WHIP of 1.4 or less).  This gives 149 pitchers.  Of those 149, 55 were drafted out of high school (37%) and 94 were drafted out of college (63%).  So almost the opposite trend to what you see.  I see the same trend if I use only the top 10% of pitchers (WHIP < 1.27)

This could due to the lack of an innings filter in the data I am using.  But it also might be related to the fact that college pitchers tend to arrive at the majors at an older age than HS pitchers (the good ones at least).  By using a 1000 inning cutoff, you might be biasing for those that arrive in the league at a younger age - maybe.  Or it could be that ERA+ is more sensitive than WHIP.  

As an interesting aside, if I look at all pitchers outside the top 25% - that is the mediocre to bad pitchers, there are 436 in total.  157 of them are high school draftees(36%) and 279 are college draftees (64%).  Same proportion as the top 25% or 10%.  

by cdb on Dec 12, 2007 11:36 AM EST up reply actions   0 recs

does anybody have the numbers
on total number of hs picks vs college picks on an average basis?  also, how many AFAs get signed each year?
Those Pilgrims ain't lookin' so proud now...

by giveml on Dec 12, 2007 11:54 AM EST up reply actions   0 recs

HS% of draftees
Good point, draftees by year:

Year    number  % HS
1995    820    42.2
1996    847    41.7
1997    793    43.3
1998    764    42.4
1999    765    41.3
2000    798    37.1
2001    809    41.7
2002    704    36.9
2003    773    32.4
2004    793    29.0
2005    796    29.0
2006    764    29.0

By draft round (all years combined):

Round   Number  % HS
1-5    1025    41.2
6-10    930    31.0
11-15    908    28.0
16-20    949    27.7
20-25    892    28.7
26+    4722    42.9

Interesting to look at the trend in HS draftees over time - seems like fewer HS pitchers are being drafted of late.  Also interesting to see the spike in HS draftees in the late rounds (26+).  

I don't have data for AFA

by cdb on Dec 12, 2007 12:16 PM EST up reply actions   0 recs

WHIP
just curious why you chose WHIP?  I can't really justify ERA+, but it seemed like a more independent assessment of a pitchers skill that had been normalized for league and ballpark.  I have heard that the number of hits allowed is really something pitchers don't control.  I am not personally convinced on this, but that is what the people who think harder on this than I am capable of seem to be saying.
Those Pilgrims ain't lookin' so proud now...

by giveml on Dec 12, 2007 11:59 AM EST up reply actions   0 recs

WHIP
I used WHIP becuase it was readily available to be perfectly honest.  The data table I had downloaded had W-L, Saves, ERA, or WHIP. WHIP seems to be the best indicator of ability out of that list.  You are probably right that ERA+ is a better metric though - I just couldn't get my hands on that data without spending months collecting it.  Also, the metric used will not effect the data on the numbers that make it to the majors, but it will effect the interpretation of 'success' at the majors.  

I and think that people have issues with BABIP as being random - batting average of balls in play.  I think that pitchers have control over the number of balls put into play, but no control over what happens after it is hit.  

Maybe some of the more experienced sabermetricians could comment on the shortcomings of WHIP????  

by cdb on Dec 12, 2007 12:21 PM EST up reply actions   0 recs

isn't it funny
how we are creatures of opportunity.  I used ERA+ because it was readily available as well.
Those Pilgrims ain't lookin' so proud now...

by giveml on Dec 12, 2007 12:38 PM EST up reply actions   0 recs

yeah...
... but now i'm wondering what the correlation is between WHIP and ERA+. That is, are the top performers as measured by WHIP generally the top performers as measured by ERA+? at first glance, comparing both of your datasets, they would appear to be negatively correlated, but i can't imagine that is correct.

giveml - great work. i have one question, tho: if it's easily done with your data set, could you lower the minimum innings requirements? i might even lower it down to 150 or 200 IP, so that stud relief pitchers could be included. 1000 IP is a lot, so you've ended up with only 38 observations. a better picture might be drawn by lowering the bottom standard.

another interesting exercise would be to lower the ERA+ requirement to 100. in other words, look at how many of these guys are above-average. to some extent, picking a Santana or Oswalt is like catching lightening in a bottle: it isn't going to happen often. but longer trends of above-average performance might give us a better idea of what's really going on.

by kindred on Dec 12, 2007 7:50 PM EST up reply actions   0 recs

FTR on Buehrle
Buehrle is a local kid who did indeed go to JeffCo.  As I'm sure everyone knows by now, he also didn't play baseball for two years in high school at Francis Howell North, and wasn't exactly a hot prospect coming out of there after his senior year.  Talk about a late bloomer, but better late than never I suppose.

I met Bob Dunahue (the coach over there) one time about 4 years ago and he told me a story about the relatively new "Mark Buehrle Rule."  No coach at Howell North is allowed to cut a player unless Dunahue signs off on it.  

by redbirdnation8206 on Dec 13, 2007 2:14 AM EST up reply actions   0 recs

Fantastic diary,
by the way
I will be brief.

by Alxfritz on Dec 12, 2007 10:56 AM EST reply actions   0 recs

Excellent
diary material, cdb.  Thanks for the contribution.
"Requiescat In Pace - 2007 Zombie Cardinals"

by AustinBOB on Dec 12, 2007 3:01 PM EST reply actions   0 recs

what and when
seems all this boils down to the individual.  clemens probably had a choice out of high school (wild guess), but chose college.  i suspect his career would have been largely the same either way.  the later maturation of pitchers seems on the surface to simply suggest concentrating on HS position and college pitchers is the best way to go.  for college players, the landscape is changing a bit this year with the new transfer rules.  i think these rules will help because it gives the college coaches and the p[layers more of a long term outlook.  i think this will enable college baseball programs to move a bit closer to the talent development factories they are for basketball and football.

by sportsman on Dec 12, 2007 8:21 PM EST reply actions   0 recs

OK, so here's the story
on the guys who are least statistically above average.  More specifically, this is a followup to cdb's request that I look at the pitchers with over 1000 IP and ERA+ ratings from 101-109.  These are the guys Cardinal fans are more likely to have some familiarity with:  the Matt Morrises, Jeff Suppans, and Woody Williamses of the world.  Surprisingly enough there are only 21 starters on this list.  I guess that means if you are only barely above average it is hard to stay around long enough to rack up 1000 IP.  

I don't have a way to identify the population down below 1000 IP, so I will stick with what these data show me.  Anyway, if I am drafting and looking for guys who are significantly above average I am for the most part looking for starters.  Most relievers just kind of happen.  I think it is fair to define a starter who is significantly above average as one who can rack up 1000 IP. That's only 5-7 years of starting.

The numbers are a bit different as the college pitchers fare better with the same percentage of the pool as HS pitchers - 43%.  That leaves only 14% for the amateur free agents (i.e. Latins from outside U.S.)  Combined with the numbers from the previous post on pitchers with ERA+ of 110 and above, it means that of all the active pitchers with ERA+ above 100 the breakdown is as follows:

AFA           11/59     19%
College       18/59     30%
High School   30/59     51%

Combined with the total draftees numbers provided by cbd indicating that historically a lower percentage of HS pitchers (42% and declining) have been drafted, it is striking that the numbers are skewed so far in favor of the HSers.  I guess you could say that a HS pitcher is close to twice as likely as a college pitcher to become an above average MLB starter.

Again, it doesn't seem all that difficult to identify who these guys are as 13/21 or 62% of this group were either AFAs or drafted in the first two rounds.  Woody Williams has the distinction of being the lowest drafted college pitcher to make the cut as he was drafted in the 28th round.  As most of you are probably aware, Woody was drafted as a shortstop.  That means that NOT A SINGLE ONE of the 59 currently active above average starters with over 1000 IP was a college pitcher drafted after the eighth round.  I hope Luhnow and company are aware of this.  Eight of the HS pitchers on the list, or 14% of the total were drafted after the 8th round.

Thanks to Matt Morris with an ERA+ of 109 the Cardinals were not entirely shut out.  He is the only active big league starter with 1000 IP and an ERA+ above average to have been selected by the Cardinals in a period that roughly covers at least a dozen drafts.   Again, the overall leader in selecting top pitchers is the Cubs with six.  

If you wonder what an ERA+ of 100 buys you there were three of those:  Livan Hernandez, Aaron Sele, and Steve Trachsel.  Some names that were under 100:

Joel Pineiro     97
Matt Clement     96
Jason Marquis    96
Kip Wells        96
Kyle Lohse       95
Jeff Weaver      93

Those Pilgrims ain't lookin' so proud now...

by giveml on Dec 12, 2007 10:54 PM EST reply actions   0 recs

hmmm
"NOT A SINGLE ONE of the 59 currently active above average starters with over 1000 IP was a college pitcher drafted after the eighth round."

interesting... So much for the "college pitchers are the safe bet" meme.  Though 1000IP may be a bit high.

"a HS pitcher is close to twice as likely as a college pitcher to become an above average MLB starter."

did you account for the number of each chosen?

Christmas lights are an extremely efficient method for converting Christmas Spirit into heat. -anon

by SleepyCA on Dec 13, 2007 3:22 AM EST up reply actions   0 recs

well, sort of....
I didn't go back and check the draft year of each pitcher and do the detailed calculation.  However, I used cdb's numbers for the relative percentages of HS v. College pitchers drafted since 1995 elsewhere in this thread. My thought was that from 1995-1999 the annual percentage was pretty constant at roughly 42% of the pitchers taken came from high school (about 800)and  58% were from college (about 1,100).  Since over several years the HS pool made up 51% of the above average pitchers vs 30%, the HSers are roughly twice as likely to make it into the group.  I would love to know an actual number, but it makes me tired just to think about the work involved.

As far as the 1000 IP goes, I agree that seems to be a pretty large number.  But if you are a GM that wants to tap superior starting pitchers then your measurement window is the six years you can control that pitcher.  An above average starter should get you at least 1000 IP in those six years as that is 167 IP per year.  Also, that was the only data I could lay my hands on.  I think 700 would be a nice cutoff, but I doubt the numbers would be much different.

I guess the corollary to this is that college pitchers, since there are more of them in the big leagues in total, must be more likely to be relief pitchers and below average starters.  Or they don't make it to 1000 IP...

Those Pilgrims ain't lookin' so proud now...

by giveml on Dec 13, 2007 10:47 AM EST up reply actions   0 recs

Cards mediocrity
Well giveml,

Your ERA+ data is pretty convincing, but I am still puzzled by the discrepency with WHIP - I will try to figure it out soon.  But I will say that if the ERA+ 1000 inning data is reliable, there is a good reason the cardinals don't have any representatives on your lists; they don't draft HS pitchers.  This table has each team and the percentage of HS pitchers over the years 1995-2006:
Cardinals    21.2
Athletics    22.39
Giants    23.99
Rockies    28.66
Angels    29.17
Blue Jays    30.53
D'backs    31.42
Phillies    31.62
Tigers    31.91
Indians    32.35
Padres    32.57
Yankees    33.8
Astros    34.56
Rangers    35.28
Brewers    35.96
Reds    36.45
Marlins    37.76
WhiteSox    38.94
Mets    40
RedSox    40.69
Twins    41.89
Mariners    42.2
Nationals    43.17
Royals    43.54
Cubs    43.89
Orioles    44.87
Pirates    47.49
D'Rays    48.83
Dodgers    49.07
Braves    53.01

Cardinals are the LEAST likely team in all of MLB to draft a pitcher out of high school over that period.  

by cdb on Dec 13, 2007 11:19 AM EST up reply actions   0 recs

Wow.
This is a really great discussion.  
Punch. Drink. Cry.

by the red baron on Dec 13, 2007 8:01 AM EST reply actions   0 recs

muchas gracias
Red Baron,Thanks for the front page mention.  I thought for sure that would bring some more people in. There are a few people that seem to really like this sort of thing, and apparently alot of people that just aren't that interested.

by cdb on Dec 13, 2007 6:52 PM EST up reply actions   0 recs

numbers are the anti-drug...
subtitle: 'let's ignore the mitchell report'.  

I love all the discussion this is generating and the great work of other posters suggests that WHIP may not be an ideal metric for determining the value of HS vs Col draftees.  I downloaded data from Baseball Prospectus for pitchers in the 2007 season.  They don't have ERA+, but they do have RA+, which they claim to be analogous to ERA+ (higher is better).  Also included is the VORP statistic, which many love so dearly.  I also calculated WHIP from their BB/9 and H/9 data.  I merged this data with the data I have on drafts.  This new data is only 2007 data, and is also only calculated on active pitchers.  

At this point, I realized something - that the draft datasets that I have list simply which name was submitted for each position - which means that people drafted more than once are in the dataset multiple times (i didn't realize this before, because they were sorted by year/draft position).  Often a player is drafted out of high school, but does not sign, and is then again drafted out of college - sometimes multiple times (some were drafted four times before they signed).  So what this means is that for those pitchers that did not sign when drafted out of high school, but did sign when drafted out of college, I was using the same data for both HS and College.  One would expect that this might artificially reduce the discrepency between HS and College. I apologize for the error - it was too much data to handle and I didn't inspect closely enough.  

To get more reliable data, I used the dataset from BP (containing only 2007 pitchers), and merged it with the perviously generated draft dataset, so that I now have all the 2007 pitching data and the career WHIP and draft data in one table.  2007 VORP is a pretty good indicator of quality I would think, but one also would expect WHIP and ERA+ or RA+ to reflect quality.  I reanalyzed the new dataset, using a new draft status.  There are three categories:
H,H - drafted from high school, signed out of high school
H,C - drafted out of high school, signed out of college (multiple drafts)
C,C - drafted out of college, signed out of college
The data below are for 4 metrics - career WHIP, 2007 WHIP, 2007 RA+, and 2007 VORP:

All rounds:                   
Draft    n    Car_WHIP    2007_WHIP    2007_RA+    2007_VORP
C,C    120    1.403 +- 0.202    1.425 +- 0.260    1.162 +- 0.630    13.607 +- 17.276
H,C    45    1.456 +- 0.186    1.454 +- 0.222    1.056 +- 0.367    8.867 +- 15.034
H,H    79    1.439 +- 0.187    1.455 +- 0.255    1.053 +- 0.309    14.400 +- 19.545

First five rounds only:                   
Draft    n    Car_WHIP    2007_WHIP    2007_RA+    2007_VORP
C,C    53    1.354 +- 0.218    1.369 +- 0.285    1.285 +- 0.845    15.821 +- 16.453
H,C    23    1.456 +- 0.161    1.482 +- 0.190    0.926 +- 0.213    8.717 +- 14.855
H,H    51    1.451 +- 0.179    1.481 +- 0.252    1.034 +- 0.308    15.004 +- 19.758

This dataset is more limited than the last, but the data are more reliable, and the trends are essentially the same as the original dataset I was working with. HS draftees, either directly or indirectly (through college) do not appear to outperform college draftees.  To more directly compare to giveml's data, the below is for just the top 25% RA+ pitchers (RA+ > 1.2).  

Draft    n    Car_WHIP    2007_WHIP    2007_RA+    2007_VORP
C,C    40    1.265 +- 0.160    1.203 +- 0.187    1.664 +- 0.869    26.757 +- 16.971
H,C    11    1.309+- 0.164    1.198 +- 0.194    1.539 +- 0.359    20.791 +- 13.619
H,H    24    1.306 +- 0.125    1.254 +- 0.134    1.406 +- 0.221    30.8 +- 18.686

Still, the differences are not exactly apparent when we look at career WHIP or 2007 WHIP.  There is a slight trend for VORP and RA+, but in opposite directions - H,H players have a lower (worse) RA+ than C,C, but a higher VORP.  Still the variability is huge in all cases for these two derived metrics - much larger than the relative variability for WHIP.  So this still leaves us in a bit of a quandary with regards to the difference between the approach I have taken vs that taken by giveml.  Why are the results at odds?  Part of the problem is that I am using only RA+ data from 2007, while I think giveml's ERA+ data are career data (correct me if I am wrong).    I also used WHIP, but WHIP correlates pretty well with RA+, which is supposed to behave like ERA+ (BPs words, not mine): increased WHIP results in lower RA+.  (I can show some plots if someone knows how to post images).  

But I think part of it comes down to approach - if we look at those we know have had success, we can see that a larger percentage than would be expected are high school draftees.  In the 2007 season, almost exactly 50% of the pitchers were drafted out of high school originally.  If we look at all the drafts from 1995 - 2006, HSs accounted for 37% of the draftee pitchers.  So when we look at those that made it to the majors in 2007 (hindsight is 20:20), there does appear to be an advantage to drafting HSers. But if we look at the average HS vs the average Col pitcher, we can't tell them apart, suggesting that the level at which you are drafted is not very predictive of future success.  

by cdb on Dec 13, 2007 4:52 PM EST reply actions   0 recs

Comments For This Post Are Closed


User Tools

Welcome to the Internet's #1 St. Louis Cardinals blog.
Start posting about the Cardinals »

Join SB Nation and dive into communities focused on all your favorite teams.

FanPosts

Community blog posts and discussion.

Recommended FanPosts

Stl_ia_card_45_-_dark_small
Jeff Pearlman Thinks Of Hair Clumps When He Thinks Of The Thief McGwire
Black-spider-monkey_small
Losing my religion (w/ baseball)

Recent FanPosts

Flanders_small
Yadi2first
Small
40 Man Question..
Cathybachebay1_small
The current Busine$$ of Baseball...how long can it last?
Avatar_small
VEB CheBird T-Shirt for Sale - Red or Powder Blue, CLEARANCE
Stl_ia_card_45_-_dark_small
October Lore: One In A Million
St-louis-cardinals-script_small
A Team of Free Agents
St-louis-cardinals_small
Report: LaRussa Will Return
Small
Skip's Lament: The Curse of Too Many Decent Players

+ New FanPost All FanPosts >


Managers

Jack_benny_small DanUpBaby

Editors

Bender1_small azruavatar

Adam1_small chuckb

Trigun_001_small the red baron