RSS.Social

Tom's Blog

follow: @[email protected]

Posts

Cloud Native Geospatial Conference (2025)

High Performance Remote IO

Iowa's Proposed State Science Standards

Dask Geopandas Spatial Shuffle

My 2024 Year in Books

Serializing Dataclasses

stac-geoparquet

What's Next? (2024 edition)

My Real-World Match / Case

STAC Updates I'm Excited About

Gone Rafting

National Water Model on Azure

Jupyter, STAC, and Tool Building

py-spy in Azure Batch

Planetary Computer Release: January 2023

Cloud Optimized Vibes

Queues in the News

My 2022 Year in Books

Podcast: Revolutions

Rebooting

What's Next?

Maintaining Performance

Compatibility Code

Dask Workshop

pandas + binder

A Confluence of Extension

Tabular Data in Scikit-Learn and Dask-ML

Distributed Auto-ML with TPOT with Dask

Moral Philosophy for pandas or: What is `.values`?

Modern Pandas (Part 8): Scaling

dask-ml 0.4.1 Released

Extension Arrays for Pandas

Easy distributed training with Joblib and dask

Rewriting scikit-learn for big data, in under 9 hours.

dask-ml

Scalable Machine Learning (Part 3): Parallel

Scalable Machine Learning (Part 2): Partial Fit

Scalable Machine Learning (Part 1)

Dask Performance Trip

Introducing Stitch

Modern Pandas (Part 7): Timeseries

Modern Pandas (Part 6): Visualization

Modern Pandas (Part 5): Tidy Data

Modern Pandas (Part 3): Indexes

Modern Pandas (Part 4): Performance

Modern Pandas (Part 2): Method Chaining

Modern Pandas (Part 1)

dplyr and pandas

Practical Pandas Part 3 - Exploratory Data Analysis

Practical Pandas Part 2 - More Tidying, More Data, and Merging

Practical Pandas Part 1 - Reading the Data

Using Python to tackle the CPS (Part 4)

Using Python to tackle the CPS (Part 3)

Tidy Data in Action

Organizing Papers

Using Python to tackle the CPS (Part 2)

Using Python to tackle the CPS

About