Machine Learning Blog | ML@CMU | Carnegie Mellon University
Teaching Vision-Language Models to Speak Cinema
Introducing ARFBench: A time series question-answering benchmark based on real incidents
Carnegie Mellon at ICLR 2026
When Should AI Step Aside?: Teaching Agents When Humans Want to Intervene
LumberChunker: Long-Form Narrative Document Segmentation
Yes, AI, There is a Santa Claus
Validating LLM-as-a-Judge Systems under Rating Indeterminacy
Carnegie Mellon at NeurIPS 2025
How to Explore to Scale RL Training of LLMs on Hard Problems?
Carnegie Mellon University at EMNLP 2025
Learning from Failure to Tackle Extremely Hard Problems
Diffusion Beats Autoregressive in Data-Constrained Settings