
Split 10min.ipynb into cuDF-only and Dask-cuDF notebooks #21669

Open
Anas0709 wants to merge 4 commits into rapidsai:main from Anas0709:fix/split-10min-notebook-21374

@Anas0709 Anas0709 commented Mar 5, 2026

Fixes

Splits the combined 10min notebook into:

  • 10min.ipynb: cuDF-only content for single-GPU workflows (no Dask)
  • 10min_dask_cudf.ipynb: Dask-cuDF content for distributed GPU workflows

The cuDF-only notebook can now be validated in CI, since it no longer uses Dask.

Changes:

  • Removed Dask references from 10min.ipynb
  • Added 10min_dask_cudf.ipynb with Dask-cuDF examples
  • Updated user guide toctree
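
The claim that the cuDF-only notebook no longer uses Dask could be spot-checked with a small script along these lines. This is a minimal sketch and not part of the PR: the helper names `find_dask_cells` and `load_notebook` and the import regex are illustrative assumptions. It relies only on the standard notebook JSON layout (`cells`, `cell_type`, `source`).

```python
import json
import re

# Hypothetical pattern: matches lines such as "import dask",
# "import dask.dataframe as dd", or "from dask_cudf import ...".
DASK_IMPORT = re.compile(r"^\s*(?:import|from)\s+dask(?:_cudf)?\b", re.MULTILINE)


def find_dask_cells(notebook: dict) -> list[int]:
    """Return indices of code cells whose source imports dask or dask_cudf."""
    hits = []
    for index, cell in enumerate(notebook.get("cells", [])):
        if cell.get("cell_type") != "code":
            continue  # markdown cells may mention Dask in prose
        source = cell.get("source", "")
        if isinstance(source, list):
            # Notebook JSON usually stores source as a list of lines.
            source = "".join(source)
        if DASK_IMPORT.search(source):
            hits.append(index)
    return hits


def load_notebook(path: str) -> dict:
    """Read a .ipynb file as plain JSON."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)
```

Calling `find_dask_cells(load_notebook("10min.ipynb"))` should return an empty list if the split removed every Dask import from the code cells. The check only looks at import statements, so it would not catch Dask objects reached some other way.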

copy-pr-bot Bot commented Mar 5, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@review-notebook-app

Check out this pull request on ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@mroeschke
Contributor

Thank you for the pull request. Could you run the pre-commit checks before we run the CI?

@Anas0709
Author

Anas0709 commented Mar 5, 2026

Hi @mroeschke, could you please add the doc and non-breaking labels to this PR so the Label Checker passes? Thank you!

"id": "f5e303df",
"metadata": {},
"outputs": [
Contributor

Could you ensure all these cells remain executed?
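
One way to check this before pushing is to scan the notebook JSON for code cells that have never been run. The following is a minimal sketch, not an existing cuDF tool: the function name `unexecuted_code_cells` is hypothetical, and it assumes that an unexecuted code cell is stored with `"execution_count": null`, which is how the notebook format records it.

```python
import json


def unexecuted_code_cells(notebook: dict) -> list[int]:
    """Return indices of non-empty code cells that have no execution count."""
    missing = []
    for index, cell in enumerate(notebook.get("cells", [])):
        if cell.get("cell_type") != "code":
            continue
        source = cell.get("source", "")
        if isinstance(source, list):
            source = "".join(source)
        if not source.strip():
            continue  # an empty cell has nothing to execute
        if cell.get("execution_count") is None:
            missing.append(index)
    return missing


def load_notebook(path: str) -> dict:
    """Read a .ipynb file as plain JSON."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)
```

An empty result from `unexecuted_code_cells(load_notebook("10min.ipynb"))` suggests every code cell was run. It does not confirm that the stored outputs are current, since a cell can keep an old execution count after its source is edited.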

@mroeschke mroeschke added the doc (Documentation) and non-breaking (Non-breaking change) labels Mar 5, 2026
Anas0709 and others added 4 commits March 5, 2026 17:12
…sai#21374)

- 10min.ipynb: cuDF-only content (no Dask) for CI validation
- 10min_dask_cudf.ipynb: Dask-cuDF content for distributed GPU workflows
- Added 10min_dask_cudf to user guide toctree

Made-with: Cursor
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@Anas0709 Anas0709 force-pushed the fix/split-10min-notebook-21374 branch from 72d92b5 to bc282ef Compare March 5, 2026 23:12
@Anas0709
Author

Anas0709 commented Mar 5, 2026

Addressed the missing agg_col1/agg_col2 setup cell. It was omitted during the split and is now added before the Grouping section.

I don’t have access to a RAPIDS GPU environment locally, so the cells still have empty outputs. I’m relying on RAPIDS CI to execute the notebook during the docs build. Let me know if you’d prefer a different approach.

@Anas0709 Anas0709 requested a review from mroeschke March 5, 2026 23:15
@mroeschke
Contributor

Sorry for the delay @Anas0709.

Yes, we would prefer to avoid using the CI to incrementally validate notebook changes since it's a shared, limited resource. I would recommend using Google Colab to validate changes "locally" before we run them in CI. You can follow and adapt the instructions at https://docs.rapids.ai/deployment/stable/platforms/colab/ to get set up with a GPU environment.

Labels

doc Documentation non-breaking Non-breaking change

2 participants