00:00 Determining Ratios. Vertical and horizontal bar charts are often a good choice if you want to see the difference between your categories. If you’re interested in ratios, however, pie plots are the way to go.
There are several tiny slices with overlapping labels. To address the problem, the smaller categories can all be lumped together into a single group. This code merges all the categories with a total of under 100,000 into a category called
"Other", and then creates a pie plot.
01:45 Zooming in on Categories. Sometimes you also want to verify whether a certain categorization makes sense. Are the members of a category more similar to one another than they are to the rest of the dataset? Again, a distribution is a good tool to get a first overview.
02:01 Generally, we expect the distribution of a category to be similar to the normal distribution but have a smaller range. This code creates a histogram plot showing the distribution of the median earnings for the engineering majors.
Become a Member to join the conversation.