Aman Madaan
The case for inference-time compute
LLMs Are Stochastic Compilers
Notes on Weight Initialization for Deep Neural Networks
The Curious Case of the Missing US 101 Exits
Training Char-RNNs for Transferring Name Styles
Everyone Can Write Bad Code