#100DaysOfCode in Python Transcripts
Chapter: Days 37-39: Using CSV data
Lecture: Some amazing data sets
Login or
purchase this course
to watch this video and the rest of the course contents.
0:00
Let's talk about our data sets real quickly. You're probably familiar with CSV data, but just in case you're not, it looks like this.
0:08
It's a plain text file and there's a set of headers across the top that tell you what each column is. So here we can see we've got data,
0:16
an actual mean temperature, an actual minimum temperature, and an actual maximum temperature. And then we have a bunch of columns
0:23
that correspond to that data. Now I told you we're going to work with some interesting data sets and it's true,
0:29
I found a good collection here for us to play with. Now you may be familiar with a place called FiveThirtyEight.
0:37
It's like a data-driven journalistic news site where they gather up a bunch of data and they use it to write articles
0:46
and do investigative journalism type things. And it turns out every article they have, the data that they use to derive those conclusions
0:55
is available online on GitHub, so that's pretty awesome. So over at github.com/fivethirtyeight/data, that is where we're going to be working
1:06
for the next three days. All right, so let's jump over to my web browser here and we'll just have a quick look through all the data.
1:13
I told you there's a lot, look at the size of that scroll bar. There's a ton of options here. So let's skip down past all the folders
1:18
and just go to this section. So here you can see all of the articles written by FiveThirtyEight and then the corresponding data that goes with it.
1:29
So let's just grab one here, American Chess is Great Again, and if we come back, click on it and you can see
1:33
here's the actual article that they wrote about, and you can see here's the graphs that they drew based on the data and so on.
1:41
But here is the actual data, so you can come over here and it actually describes what it is and so on.
1:48
If you click on it, you can see here's all the data that they were using to make these conclusions.
1:54
So what we're going to do in this section of the course is we're going to take one set of CSV files and use that for our demos
2:01
and ask and answer interesting questions. And then for the next three days afterwards, you'll be turned loose to build
2:08
your own investigative journalism app. You'll choose one of these CSV files and come up with a set of questions, and it's going to be a ton of fun
2:15
so I hope you're ready to get started on that.