Colloquium 2023 Accelerated DataFrame with Dask-cuDF on multiple GPUs

From SHARCNETHelp
Revision as of 11:44, 20 January 2023 by Syam (talk | contribs) (Created page with "cuDF is a GPU DataFrame library in Python. It provides a Pandas-like API with accelerated performance for DataFrame operations on a single GPU. However, dealing with large dat...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

cuDF is a GPU DataFrame library in Python. It provides a Pandas-like API with accelerated performance for DataFrame operations on a single GPU. However, dealing with large datasets is limited by the memory available on a single GPU. Since Dask provides a framework for scalable computing, Dask-cuDF integrates cuDF with Dask to allow scaling a large DataFrame workload across multiple GPUs. This webinar introduces Dask-cuDF with demo examples on a multi-GPU node on the national clusters.