Move from Excel to Python with Pandas Transcripts
Chapter: Course conclusion and review
Lecture: File structure for Jupyter
Login or
purchase this course
to watch this video and the rest of the course contents.
0:00
in the next section of the course,
0:01
we talked about how important it was to have an organized file system.
0:06
Once you start to build out your Jupiter notebooks,
0:09
I recommend a structure with a base directory and then subdirectories underneath it that it include
0:15
all of your notebook files, keeping those separate from your Excel or CSP or input
0:20
files. Keeping all of your input files in a raw directory underneath data where they
0:26
can remain untouched so that you can repeat your analysis.
0:30
And then, as you create intermediate files,
0:32
you can store those in a process directory.
0:34
And then, finally, the Reports directory is where you store your final output.
0:39
We also walk through how to use cookie cutter to make it easier to set this
0:44
all up so that you wouldn't have to do it by hand.
0:46
After talking about the directory structure,
0:49
we walk through how to launch your Jupiter notebook,
0:52
and there were a couple of things we discussed about structuring your Jupiter notebooks.
0:55
Make sure that you have a good notebook name at the top so you don't end
0:59
up with a whole bunch of entitled files.
1:02
Use the markdown text fields to give yourself information so that you can understand the rationale
1:09
for your analysis and understand where the files came from.
1:13
Then you contract your changes over time as well.
1:16
I encourage you to keep all of your imports at the top of the notebook and
1:20
then define your input and output files in the beginning as well.
1:23
Finally, when you're all done,