Exploring the University Towns Dataset
00:23 Your boss says it can’t be loaded directly into a DataFrame. They’ve tried, but it wasn’t successful. You’ve been asked to make a CSV file with the data, and your boss says not to clean it, just divide it into columns. Okay, so, first step is to take a look at the data that we’re dealing with.
However, it’s not immediately clear what columns you’re meant to put things in. Take this line by line, and the first one is Alabama. Okay, so that’s a state, and it has a marker attached to it. And then by highlighting that marker, I can see that there’s a bunch of other lines that have the same marker next to them, such as California. These all seem like states, and the lines after each one usually mention a university. So yes, almost all of the lines that don’t have the marker in them mention a university. So the columns that you’ll want to make are, say, one column for the state and one column for the town.
What that would look like is that the first rows would be
Alabama | Auburn,
Alabama | Florence,
Alabama | Jacksonville,
Alabama | Livingston, and so on until it gets to
Alaska, where it would be
Alaska | Fairbanks.
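To make that target layout concrete, here is a minimal sketch of the splitting idea, not the course's own solution. It assumes that state lines can be recognized by a literal `[edit]` marker and that every other non-blank line is a town belonging to the most recent state. The marker value, the `split_into_rows` helper, the `state` and `town` column names, and the sample lines are all hypothetical, so swap in whatever your file actually contains.

```python
import csv
import io

# Assumption: state lines carry this literal marker; adjust it for your file.
STATE_MARKER = "[edit]"


def split_into_rows(lines):
    """Yield (state, town) pairs from raw lines, without cleaning the text."""
    state = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if STATE_MARKER in line:
            # A state line: remember it for the town lines that follow.
            state = line
        else:
            # A town line: pair it with the most recent state.
            yield state, line


# Small illustrative sample in the assumed layout (not the real file).
sample = [
    "Alabama[edit]\n",
    "Auburn (Auburn University)\n",
    "Florence (University of North Alabama)\n",
    "Alaska[edit]\n",
    "Fairbanks (University of Alaska Fairbanks)\n",
]

rows = list(split_into_rows(sample))

# Write the pairs as CSV; io.StringIO stands in for a real output file.
buffer = io.StringIO()
writer = csv.writer(buffer)
writer.writerow(["state", "town"])  # hypothetical column names
writer.writerows(rows)
csv_text = buffer.getvalue()
```

Because the brief is to divide the data and not clean it, the sketch leaves the marker and the university names in place. With a real file, you would pass the open file object to `split_into_rows()` and hand an open output file to `csv.writer()` instead of the in-memory buffer.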
02:07 Now that you’re somewhat familiar with the university towns dataset and what needs to be done, in the next lesson, you’ll look at processing the data before loading it into a DataFrame, because as it is, this data cannot be loaded directly into a DataFrame.