Disclaimer : I don't know anything about baseball, I do understand the basic theory of the game though.
What do you notice in the visualization?
1st story point
- All the heights are almost the same. That's because no one is less than (guessing) 48". There is a lot of debate whether you should include the 0 in this sort of graph, but it might make sense here to show the comparative heights of each player
- HR is Heart Rate, maybe Hit Rate ? Don't worry, it's just my not understanding the game terminology, but am I the target audience or not ?
- Handedness is easier to understand - Left, Right or Both but you have the space to use the whole word rather than a single letter
2nd story point
- The handedness order on the bar graph is R, L, B
- The handedness order on the scatter graph is B, R, L
- Shouldn't they both be L, B, R (is that the more natural order?)
I'll get back to this later, just wanted to post a few quick comments
Just some comments -
first off you say you are looking at players performance but you are only looking at HR and Batting Average - there are other measures of performance - like fielding percentage, OB percentage, strike outs, and many more - so it you are only going to look at HR and average - need to change the titles
second - you focus a lot on handedness - R/L/B - but the general population is about 10-12% left and 88-90% right handed - does that mean left handed people make better baseball players?
- you also look into total HR's as attribute of handedness - how does length of career affect a metric like total HR
- also are players who play in the NL where the pitcher doesn't hit and the ERA is about 1 run better than the AL have better performance at lower averages or HR's
the third chart I guess is looking at career top 5 performers - that data is by its nature sparse to draw conclusions
How were the top 5 or 10 players determined - don't see some name there that the casual fan would expect - did Henry Aaron or Babe Ruth not make the list?
Just a POV
just an additional comment or 2 - I understand the user can change the number of Top Players - the point is that if some players are not in the dat like Aaron or Ruth it leads to questions about the data - looks like they were excluded or the data set is not complete - and the results reflect a subset of all players _ I did download your book and found that the top N players are determined by the number of HRs hit in the filtered dataset - Not clear from the title that Total HR's is the criterion
I didn't bring up age I brought up length of career - when you present an absolute metric like total HR the results get skewed to players who had long careers -
One final comment - I did not see any attributes - if you pulled the datafile from a source there should be an attribute to the source
I will try to clarify a few things -
1. Choice of number of players means filtering, this does not change the underlying data. I attach the link to the data below if you want it.
Indeed the game of baseball is both a team sport and an individual sport. So there lots of factors that contribute to ones success, this is a small data set with only 2 KPI ( homerums and batting average).
Very simplified for a beginers class environment.
Updated - Tableau Public
Thanks for all your comments - will use this to complete my assignment.
Link to data below:
BTW - Chris thanks about the " - Shouldn't they both be L, B, R (is that the more natural order? "
This allowed me to sort and to look further into my analysis and come up with better conclusions.