Learning from Failure to Tackle Extremely Hard Problems
Diffusion Beats Autoregressive in Data-Constrained Settings