#100DaysOfCode in Python Transcripts
Chapter: Days 37-39: Using CSV data
Lecture: Some amazing data sets
Login or
purchase this course
to watch this video and the rest of the course contents.
0:00
Let's talk about our data sets real quickly.
0:02
You're probably familiar with CSV data,
0:04
but just in case you're not, it looks like this.
0:07
It's a plain text file and there's a set of headers
0:11
across the top that tell you what each column is.
0:13
So here we can see we've got data,
0:15
an actual mean temperature, an actual minimum temperature,
0:18
and an actual maximum temperature.
0:21
And then we have a bunch of columns
0:22
that correspond to that data.
0:25
Now I told you we're going to work with
0:26
some interesting data sets and it's true,
0:28
I found a good collection here for us to play with.
0:32
Now you may be familiar with a place called FiveThirtyEight.
0:36
It's like a data-driven journalistic news site
0:41
where they gather up a bunch of data
0:43
and they use it to write articles
0:45
and do investigative journalism type things.
0:48
And it turns out every article they have,
0:51
the data that they use to derive those conclusions
0:54
is available online on GitHub, so that's pretty awesome.
1:00
So over at github.com/fivethirtyeight/data,
1:03
that is where we're going to be working
1:05
for the next three days.
1:08
All right, so let's jump over to my web browser here
1:10
and we'll just have a quick look through all the data.
1:12
I told you there's a lot,
1:13
look at the size of that scroll bar.
1:14
There's a ton of options here.
1:15
So let's skip down past all the folders
1:17
and just go to this section.
1:19
So here you can see all of the articles
1:21
written by FiveThirtyEight and then
1:23
the corresponding data that goes with it.
1:28
So let's just grab one here, American Chess is Great Again,
1:31
and if we come back, click on it and you can see
1:32
here's the actual article that they wrote about,
1:35
and you can see here's the graphs
1:37
that they drew based on the data and so on.
1:40
But here is the actual data, so you can come over here
1:43
and it actually describes what it is and so on.
1:47
If you click on it, you can see here's all the data
1:50
that they were using to make these conclusions.
1:53
So what we're going to do in this section of the course
1:56
is we're going to take one set of CSV files
1:59
and use that for our demos
2:00
and ask and answer interesting questions.
2:03
And then for the next three days afterwards,
2:05
you'll be turned loose to build
2:07
your own investigative journalism app.
2:10
You'll choose one of these CSV files
2:12
and come up with a set of questions,
2:13
and it's going to be a ton of fun
2:14
so I hope you're ready to get started on that.