Shane Caldwell

follow: @[email protected]

Posts

The Tests All Pass

Intro to GPUs For the Research Oriented

All Reduce Across the Atlantic: Bandwidth in Decentralized Training

Twenty Billion Tokens of What, Exactly?

Pretraining at home: 20B tokens from 222 hours to 12

Offsec Evals: Growing Up In The Dark Forest

DiLoCo: Data Parallelism for the Datacenter Poor

RL Needed LLMs Because Agency Requires Priors

GPT-5 is Good, Actually: The Agony and Ecstasy of Public Benchmarks

The Religious Devotion of Haskell

The Input Sanitization Perspective on Prompt Injection

Infosec's Data Problem

Deep Reinforcement Learning for Security: Toward an Autonomous Pentesting Agent

An ML Eng's Review of OSCP