Getting started with Dask Transcripts
Chapter: Using the Dask DataFrame
Lecture: Download the dataset
0:00
We can use Dask DataFrame to handle larger-than-memory data while relying on the familiar Pandas API. We don't need to change our code.
0:09
Dask DataFrames are made up of multiple, smaller Pandas DataFrames split along the index.
0:14
In fact, Dask DataFrame executes Pandas operations internally. Each of those smaller DataFrames is called a chunk,
0:22
or partition, and the index values that mark the boundaries between them are called divisions. Dask operations
0:28
are executed harmoniously on the smaller Pandas DataFrames.