Baseball data analysts agree: the sample size is too small
by Fred Hofstetter on May 29, 2018
The consensus is unanimous: the sample size is just too small to say one way or the other.
After intense deliberation, data analysts across Major League Baseball have come to a decision: we don’t know for sure; the sample size is just too small.
Data analysts across the league agree that more data would be instrumental in making the decision easier and more informed.
“You can’t throw around evaluations too rashly,” said one data scientist in the Kansas City Royals organization. “You’d really prefer to have 300-400 seasons of league data to provide a fair analysis of productivity. We’ve got to weed out the outliers in the dataset.”
Another statistician with the Baltimore Orioles explained it was too early, we’re comparing apples to oranges, no single stat tells the whole story, and the data collection methodology itself is imperfect.
“The raw numbers look great,” she explained, “but the rates are less clear and only provoke more questions. We really just need more data to work from.”