George Ho
Data is Plural Podcast — Crossword Data
“So What Is Your Job, Exactly?”
Thoughts on Hanukkah of Data 2022
Use Your Computer Faster By Reading Less
Merriam-Webster and Unstructured Data Processing
Link Bulletin, August 2022
Datasets and Dictionaries for Crosswords
Link Bulletin, April 2022
How to Improve Your Static Site's Typography
Migrating to Hugo
Data Collection is Hard. You Should Try It.
Streaming Data with Tornado and WebSockets
Joining Flatiron Health
`cryptics.georgeho.org` — A Dataset of Cryptic Crossword Clues
How Many Cryptic Crossword Grids Are There?
Understanding NUTS and HMC
What I Wish Someone Had Told Me About Tensor Computation Libraries
Serif Fonts for Coding
`littlemcmc` — A Standalone HMC and NUTS Sampler in Python
Pollen and Digital Publishing (a.k.a. _The Book is a Program_)
Floating-Point Formats and Deep Learning
Transformers in Natural Language Processing — A Brief Survey
Adventures in Manipulating Python ASTs
Benchmarks for Mass Matrix Adaptation
Introducing `stan-vim`
Anatomy of a Probabilistic Programming Framework
Graduated Cooper Union, Joining Point72
Python Port of _Common Statistical Tests are Linear Models_
Decaying Evidence and Contextual Bandits — Bayesian Reinforcement Learning (Part 2)
Autoregressive Models in Deep Learning — A Brief Survey
Modern Computational Methods for Bayesian Inference — A Reading List
Probabilistic and Bayesian Matrix Factorizations for Text Clustering
Multi-Armed Bandits and Conjugate Models — Bayesian Reinforcement Learning (Part 1)
Cookbook — Bayesian Modelling with PyMC3
Understanding Hate Speech on Reddit through Text Clustering
Why Latent Dirichlet Allocation Sucks
~~Fruit~~ Loops and Learning - The LUPI Paradigm and SVM+
Linear Discriminant Analysis for Starters
Portfolio Risk Analytics and Performance Attribution with Pyfolio
Modelling Hate Speech on Reddit — A Three-Act Play (Slide Deck)
Hello World!