Skip to content

Commit

Permalink
Updates
Browse files Browse the repository at this point in the history
  • Loading branch information
jrbourbeau committed Dec 15, 2023
1 parent c8989d9 commit 3a8bcdc
Showing 1 changed file with 10 additions and 10 deletions.
20 changes: 10 additions & 10 deletions docs/source/dataframe.rst
Original file line number Diff line number Diff line change
Expand Up @@ -19,15 +19,15 @@ Dask DataFrame
.. grid-item::
:columns: 12 12 8 8

Dask Dataframe helps you process large tabular data by parallelizing pandas,
Dask DataFrame helps you process large tabular data by parallelizing pandas,
either on your laptop for larger-than-memory computing, or on a distributed
cluster of computers.

- **Just pandas:** Dask dataframes are just many pandas dataframes.
- **Just pandas:** Dask DataFrames are a collection of many pandas DataFrames.

The API is the same. The execution is the same.
- **Large scale:** Works on 100 GiB on a laptop, or 100 TiB on a cluster
- **Easy to use:** Pure Python, easy to set up and debug
- **Large scale:** Works on 100 GiB on a laptop, or 100 TiB on a cluster.
- **Easy to use:** Pure Python, easy to set up and debug.

.. grid-item::
:columns: 12 12 4 4
Expand All @@ -39,7 +39,7 @@ Dask DataFrames coordinate many pandas DataFrames/Series arranged along the
index. A Dask DataFrame is partitioned *row-wise*, grouping rows by index value
for efficiency. These pandas objects may live on disk or on other machines.

From Pandas to Dask
From pandas to Dask
-------------------

Dask DataFrame copies pandas, and so should be familiar to most users
Expand All @@ -49,7 +49,7 @@ Dask DataFrame copies pandas, and so should be familiar to most users
.. tab-item:: Load Data

Pandas and Dask have the same API, and so switching from one to the other
is easy
is straightforward.

.. grid:: 1 1 2 2

Expand Down Expand Up @@ -116,7 +116,7 @@ Dask DataFrame copies pandas, and so should be familiar to most users
.. tab-item:: Machine Learning

Machine learning libraries often have Dask submodules that
expect Dask dataframes and operate in parallel
expect Dask DataFrames and operate in parallel.

.. grid:: 1 1 2 2

Expand Down Expand Up @@ -154,12 +154,12 @@ Dask DataFrame copies pandas, and so should be familiar to most users
As with all Dask collections, you trigger computation by calling the
``.compute()`` method or persist data in distributed memory with the
``.persist()`` method..
``.persist()`` method.

When not to use Dask Dataframes
When not to use Dask DataFrames
-------------------------------

Dask dataframes are often used either when ...
Dask DataFrames are often used either when ...

1. Your data is too big
2. Your computation is too slow and other techniques don't work
Expand Down

0 comments on commit 3a8bcdc

Please sign in to comment.