RSS.Social

Simon Willison's Weblog

follow: @[email protected]

Posts

GPT-5.2

Useful patterns for building HTML tools

The Normalization of Deviance in AI

Dark mode

10 Years of Let's Encrypt

Devstral 2

Under the hood of Canada Spends with Brendan Samek

Agentic AI Foundation

mistralai/mistral-vibe

Quoting Claude

Prediction: AI will make formal verification go mainstream

Deprecations via warnings don’t work for Python libraries

Niche Museums: The Museum of Jurassic Technology

Quoting Cory Doctorow

Using LLMs at Oxide

Quoting David Crespo

The Unexpected Effectiveness of One-Shot Decompilation with Claude

Quoting Daniel Lemire

TIL: Subtests in pytest 9.0.0+

Thoughts on Go vs. Rust vs. Zig

The Resonant Computing Manifesto

Django 6.0 released

Text a community college librarian

Quoting Mitchell Hashimoto

TIL: Dependency groups and uv run

Anthropic acquires Bun

Introducing Mistral 3

Claude 4.5 Opus' Soul Document

DeepSeek-V3.2

I sent out my November sponsor newsletter

Quoting David Bauder, AP News

YouTube embeds fail with a 153 error

Quoting Felix Nolan

ChatGPT is three years old today

Quoting Rodrigo Arias Mallo

Context plumbing

Quoting Wikipedia content guideline

A ChatGPT prompt equals about 5.1 seconds of Netflix

Bluesky Thread Viewer thread by @simonwillison.net

Quoting Qwen3-VL Technical Report

deepseek-ai/DeepSeek-Math-V2

Highlights from my appearance on the Data Renegades podcast with CL Kao and Dori Wilson

Google Antigravity Exfiltrates Data

Constant-time support lands in LLVM: Protecting cryptographic code at the compiler level

llm-anthropic 0.23

LLM SVG Generation Benchmark

Quoting Claude Opus 4.5 system prompt

Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult

sqlite-utils 3.39

sqlite-utils 4.0a1 has several (minor) backwards incompatible changes

"Good engineering management" is a fad

Agent design is still hard

Olmo 3 is a fully open LLM

We should all be using dependency cooldowns

Nano Banana Pro aka gemini-3-pro-image-preview is the best available image generation model

Quoting Nicholas Carlini

Building more with GPT-5.1-Codex-Max

How I automate my Substack newsletter with content from my blog

Quoting Matthew Prince

llm-gemini 0.27

MacWhisper has Automatic Speaker Recognition now

Google Antigravity

Quoting Ethan Mollick

Trying out Gemini 3 Pro with audio transcription and a new pelican benchmark

The fate of “small” open source

Quoting Andrej Karpathy

llm-anthropic 0.22

parakeet-mlx

GPT-5.1 Instant and GPT-5.1 Thinking System Card Addendum

Introducing GPT-5.1 for developers

Datasette 1.0a22

Nano Banana can be prompt engineered for extremely nuanced AI image generation

Quoting Nov 12th letter from OpenAI to Judge Ona T. Wang

What happens if AI labs train for pelicans riding bicycles?

Quoting Steve Krouse

Fun-reliable side-channels for cross-container communication

Scaling HNSWs

Agentic Pelican on a Bicycle

Six coding agents at once

Quoting Netflix

Pelican on a Bike - Raytracer Edition

Reverse engineering Codex CLI to get GPT-5-Codex-Mini to draw me a pelican

Quoting Kenton Varda

Mastodon 4.5

Quoting Josh Cohenzadeh

Could LLMs encourage new programming languages?

Using Codex CLI with gpt-oss:120b on an NVIDIA DGX Spark via Tailscale

Game design is simple, actually

You should write an agent

Quoting Ben Stolovitz

Kimi K2 Thinking

Quoting Nathan Lambert

Video + notes on upgrading a Datasette plugin for the latest 1.0 alpha

Code research projects with async coding agents like Claude Code and Codex

Open redirect endpoint in Datasette prior to 0.65.2 and 1.0a21

Removing XSLT for a more secure browser

Quoting @belligerentbarbies

Code execution with MCP: Building more efficient agents

A new SQL-powered permissions system in Datasette 1.0a20

MCP Colors: Systematically deal with prompt injection risk

Quoting Steve Francia

The fetch()ening

Quoting Barry Warsaw

The case against pgvector

Quoting MiniMax

New prompt injection papers: Agents Rule of Two and The Attacker Moves Second

PyCon US 2026 call for proposals is now open

How I Use Every Claude Code Feature

Claude Code Can Debug Low-level Cryptography

October 2025 sponsors-only newsletter

Quoting Julian Andres Klode

Curiosity-driven blogging

CoreWeave adds Marimo to their 2025 acquisition spree

Quoting François Chollet

Introducing SWE-1.5: Our Fast Agent Model

MiniMax M2 & Agent: Ingenious in Simplicity

Composer: Building a fast frontier model with RL

Hacking the WiFi-enabled color screen GitHub Universe conference badge

Quoting Aaron Boodman

The PSF has withdrawn a $1.5 million proposal to US government grant program

GenAI Image Editing Showdown

Sora might have a 'pervert' problem on its hands

Setting up a codebase for working with coding agents

Quoting Claude Docs

Visual Features Across Modalities: SVG and ASCII Art Reveal Cross-Modal Understanding

claude_code_docs_map.md

Quoting Geoffrey Litt

OpenAI no longer has to preserve all of its ChatGPT data, with some exceptions

Quoting AWS

Video: Building a tool to copy-paste share terminal sessions using Claude Code for web

Dane Stuckey (OpenAI CISO) on prompt injection risks for ChatGPT Atlas

Living dangerously with Claude

SLOCCount in WebAssembly

Don't let Claude Code delete your session logs

Unseeable prompt injections in screenshots: more vulnerabilities in Comet and other AI browsers

Introducing ChatGPT Atlas

Quoting Phil Gyford

Quoting Bruce Schneier and Barath Raghavan

Claude Code for web - a new asynchronous coding agent from Anthropic

Getting DeepSeek-OCR working on an NVIDIA Spark via brute force using Claude Code

TIL: Exploring OpenAI's deep research API model o4-mini-deep-research

The AI water issue is fake

Andrej Karpathy — AGI is still a decade away

Quoting Alexander Fridriksson and Jay Miller

Should form labels be wrapped or separate?

Quoting Barry Zhang

Claude Skills are awesome, maybe a bigger deal than MCP

NVIDIA DGX Spark + Apple Mac Studio = 4x Faster LLM Inference with EXO 1.0

Quoting Riana Pfefferkorn

Coding without typing the code

Quoting Catherine Wu

Introducing Claude Haiku 4.5

Quoting Claude Haiku 4.5 System Card

A modern approach to preventing CSRF in Go

NVIDIA DGX Spark: great hardware, early days for the ecosystem

Just Talk To It - the no-bs Way of Agentic Engineering

nanochat

Quoting Slashdot

Claude Code sub-agents

Vibing a Non-Trivial Ghostty Feature

Note on 11th October 2025

An MVCC-like columnar table on S3 with constant-time deletes

simonw/claude-skills

Superpowers: How I'm using coding agents in October 2025

A Retrospective Survey of 2024/2025 Open Source Supply Chain Compromises

Video of GPT-OSS 20B running on a phone

Quoting Gergely Orosz

TIL: Testing different Python versions with uv with-editable and uv-test

Claude can write complete Datasette plugins now

Python 3.14 Is Here. How Fast Is It?

Quoting Simon Højberg

Why NetNewsWire Is Not a Web App

Python 3.14

Gemini 2.5 Computer Use can solve Google's own CAPTCHAs

Quoting Thomas Klausner

Vibe engineering

Deloitte to pay money back to Albanese government after using AI in $440,000 report

a system that can do work independently on behalf of the user

gpt-image-1-mini

GPT-5 pro

OpenAI DevDay 2025 live blog

Quoting Tim Berners-Lee

Bad bots

Embracing the parallel coding agent lifestyle

Let the LLM Write the Prompts: An Intro to DSPy in Compound Al Pipelines

Litestream v0.5.0 is Here

Sora 2 prompt injection

Daniel Stenberg's note on AI assisted curl bug reports

Quoting Nadia Eghbal

aavetis/PRarena

Two more Chinese pelicans

September monthly sponsors newsletter

Sora 2

Designing agentic loops

Claude Sonnet 4.5 is probably the "best coding model in the world" (at least for now)

Armin Ronacher: 90%

Quoting Scott Aaronson

Quoting Nick Turley

Video models are zero-shot learners and reasoners

Quoting Dan Abramov

ForcedLeak: AI Agent risks exposed in Salesforce AgentForce

How to stop AI’s “lethal trifecta”

GitHub Copilot CLI is now in public preview

Improved Gemini 2.5 Flash and Flash-Lite

Don't hide your best documentation

Quoting Stanford CS221 Autumn 2025

Cross-Agent Privilege Escalation: When Agents Free Each Other

GPT-5-Codex

Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action

Why AI systems might never be secure

Quoting Kate Niederhoffer, Gabriella Rosen Kellerman, Angela Lee, Alex Liebscher, Kristina Rapuano and Jeffrey T. Hancock

Four new releases from Qwen

CompileBench: Can AI Compile 22-year-old Code?

ChatGPT Is Blowing Up Marriages as Spouses Use AI to Attack Their Partners

Locally AI