SciPy 2025

Deepak Cherian

Deepak Cherian is an Xarray maintainer and Forward Engineer at Earthmover. Previously he was an oceanographer at the National Center for Atmospheric Research. He helps build and maintain many parts of the scientific Python ecosystem, includinh Xarray, dask, zarr and related projects.

The speaker's profile picture

Sessions

07-08
13:30
240min
Hierarchical Data Analysis with Xarray DataTree & Zarr
Deepak Cherian, Negin Sobhani, Ian Hunt-Isaak, Eniola Awowale, Tom Nicholas, Joe Hamman, Justus Magin

Xarray provides data structures for multi-dimensional labeled arrays and a toolkit for scalable data analysis on large, complex datasets. Many real-world datasets often have hierarchical or heterogeneous structure, and are best organized through groups of related data arrays. Through xarray.DataTree, the xarray data model now supports opening datasets with a hierarchical structure of groups, such as HDF5 files and Zarr stores. This expanded data model is now general enough to manage data across different scientific disciplines, including geosciences and biosciences. This hands-on tutorial focuses on intermediate and advanced workflows using xarray to analyze real-world hierarchical data.

Tutorials
Room 315
07-10
16:30
30min
The brave new world of slicing and dicing Xarray objects.
Deepak Cherian, Justus Magin, BenoƮt Bovy

We illustrate the power and flexibility of a new extension point in Xarray's data model: "custom indexes" that allow Xarray users to neatly handle complex grids, and enables at least one new data model (vector data cubes). We present a whirlwind tour of specific examples to illustrate the power of this feature, and aim to stimulate experimentation during the sprints.

Earth, Ocean, Geo, Climate, and Atmospheric Science
Room 315
0min
Turbocharging Xarray GroupBy, oh my!
Deepak Cherian

This poster summarizes vast improvements made to Xarray's GroupBy functionality in the past three years for increased flexibility and performance, such as grouping by multiple arrays, and an extension point for domain-specific custom grouping logic.

Earth, Ocean, Geo, Climate, and Atmospheric Science