Tom's Blog
Cloud Native Geospatial Conference (2025)
High Performance Remote IO
Iowa's Proposed State Science Standards
Dask Geopandas Spatial Shuffle
My 2024 Year in Books
Serializing Dataclasses
stac-geoparquet
What's Next? (2024 edition)
My Real-World Match / Case
STAC Updates I'm Excited About
Gone Rafting
National Water Model on Azure
Jupyter, STAC, and Tool Building
py-spy in Azure Batch
Planetary Computer Release: January 2023
Cloud Optimized Vibes
Queues in the News
My 2022 Year in Books
Podcast: Revolutions
Rebooting
What's Next?
Maintaining Performance
Compatibility Code
Dask Workshop
pandas + binder
A Confluence of Extension
Tabular Data in Scikit-Learn and Dask-ML
Distributed Auto-ML with TPOT with Dask
Moral Philosophy for pandas or: What is `.values`?
Modern Pandas (Part 8): Scaling
dask-ml 0.4.1 Released
Extension Arrays for Pandas
Easy distributed training with Joblib and dask
Rewriting scikit-learn for big data, in under 9 hours.
dask-ml
Scalable Machine Learning (Part 3): Parallel
Scalable Machine Learning (Part 2): Partial Fit
Scalable Machine Learning (Part 1)
Dask Performace Trip
Introducing Stitch
Modern Pandas (Part 7): Timeseries
Modern Pandas (Part 6): Visualization
Modern Pandas (Part 5): Tidy Data
Modern Panadas (Part 3): Indexes
Modern Pandas (Part 4): Performance
Modern Pandas (Part 2): Method Chaining
Modern Pandas (Part 1)
dplyr and pandas
Practical Pandas Part 3 - Exploratory Data Analysis
Practical Pandas Part 2 - More Tidying, More Data, and Merging
Practical Pandas Part 1 - Reading the Data
Using Python to tackle the CPS (Part 4)
Using Python to tackle the CPS (Part 3)
Tidy Data in Action
Organizing Papers
Using Python to tackle the CPS (Part 2)
Using Python to tackle the CPS
About