#100DaysOfCode in Python Transcripts
Chapter: Days 37-39: Using CSV data
Lecture: Some amazing data sets
0:00 Let's talk about our data sets real quickly.
0:02 You're probably familiar with CSV data,
0:04 but just in case you're not, it looks like this.
0:07 It's a plain text file and there's a set of headers
0:11 across the top that tell you what each column is.
0:13 So here we can see we've got data,
0:15 an actual mean temperature, an actual minimum temperature,
0:18 and an actual maximum temperature.
0:21 And then we have a bunch of columns
0:22 that correspond to that data.
0:25 Now I told you we're going to work with
0:26 some interesting data sets and it's true,
0:28 I found a good collection here for us to play with.
0:32 Now you may be familiar with a place called FiveThirtyEight.
0:36 It's like a data-driven journalistic news site
0:41 where they gather up a bunch of data
0:43 and they use it to write articles
0:45 and do investigative journalism type things.
0:48 And it turns out every article they have,
0:51 the data that they use to derive those conclusions
0:54 is available online on GitHub, so that's pretty awesome.
1:00 So over at github.com/fivethirtyeight/data,
1:03 that is where we're going to be working
1:05 for the next three days.
1:08 All right, so let's jump over to my web browser here
1:10 and we'll just have a quick look through all the data.
1:12 I told you there's a lot,
1:13 look at the size of that scroll bar.
1:14 There's a ton of options here.
1:15 So let's skip down past all the folders
1:17 and just go to this section.
1:19 So here you can see all of the articles
1:21 written by FiveThirtyEight and then
1:23 the corresponding data that goes with it.
1:28 So let's just grab one here, American Chess is Great Again,
1:31 and if we come back, click on it and you can see
1:32 here's the actual article that they wrote about,
1:35 and you can see here's the graphs
1:37 that they drew based on the data and so on.
1:40 But here is the actual data, so you can come over here
1:43 and it actually describes what it is and so on.
1:47 If you click on it, you can see here's all the data
1:50 that they were using to make these conclusions.
1:53 So what we're going to do in this section of the course
1:56 is we're going to take one set of CSV files
1:59 and use that for our demos
2:00 and ask and answer interesting questions.
2:03 And then for the next three days afterwards,
2:05 you'll be turned loose to build
2:07 your own investigative journalism app.
2:10 You'll choose one of these CSV files
2:12 and come up with a set of questions,
2:13 and it's going to be a ton of fun
2:14 so I hope you're ready to get started on that.