inFERENCe

follow: @[email protected]

Posts

Discrete Diffusion: Continuous-Time Markov Chains

We may finally crack Maths. But should we?

Mortal Komputation: On Hinton's argument for superhuman AI.

Autoregressive Models, OOD Prompts and the Interpolation Regime

We May be Surprised Again: Why I take LLMs seriously.

Implicit Bayesian Inference in Large Language Models

Eastern European Guide to Writing Reference Letters

Causal inference 4: Causal Diagrams, Markov Factorization, Structural Equation Models

On Information Theoretic Bounds for SGD

Notes on the Origin of Implicit Regularization in SGD

An information maximization view on the $\beta$-VAE objective

Some Intuition on the Neural Tangent Kernel

Notes on Causally Correct Partial Models

Meta-Learning Millions of Hyper-parameters using the Implicit Function Theorem

The secular Bayesian: Using belief distributions without really believing