JOURNAL ARTICLE

Interactive distributed computing with Jupyter, Python and Dask on Jetstream2 for Science Gateways

Zonca, AndreaWeakley, Le MaiStandish, Matthew

Year: 2023 Journal:   Zenodo (CERN European Organization for Nuclear Research)   Publisher: European Organization for Nuclear Research

Abstract

Dask is an open-source Python library for parallel computing widely used by data scientists. Dask can scale Python code from multi-core local machines to large distributed clusters in the cloud. This tutorial will go over how to leverage Dask to provide distributed computing capabilities in Python to Jupyter Notebook users running on a JupyterHub instance deployed on top of Kubernetes on Jetstream2. We will explain how Dask works and how easy it is to process data in parallel using its high-level API, we will also rely on the Jestream2 object store system to save/load data in parallel using the cloud-native file format Zarr. During the tutorial we will also detail how the whole infrastructure is deployed and where each of the services is running within the infrastructure. Pointers to tutorials on how to deploy all the different components to Jetstream2 will also be provided.

Keywords:
Python (programming language) Leverage (statistics) MIT License Scripting language Distributed Computing Environment Data structure Object-oriented programming

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.27
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Computational Physics and Python Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Scientific Computing and Data Management
Social Sciences →  Decision Sciences →  Information Systems and Management
Data Analysis with R
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Interactive distributed computing with Jupyter, Python and Dask on Jetstream2 for Science Gateways

Zonca, AndreaWeakley, Le MaiStandish, Matthew

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2023
JOURNAL ARTICLE

Project Jupyter: interactive computing in the context of modern science gateways

P Santos Fernando

Journal:   OPAL (Open@LaTrobe) (La Trobe University) Year: 2016
JOURNAL ARTICLE

Interactive Classrooms with Jupyter and Python

Jacob Frias KoehlerSoomi Kim

Journal:   Mathematics Teacher Learning and Teaching PK-12 Year: 2018 Vol: 111 (4)Pages: 304-308
JOURNAL ARTICLE

Dask and Jupyter

Carsten Fortmann-Grote

Year: 2021
© 2026 ScienceGate Book Chapters — All rights reserved.