RSS.Social

artgor

follow: @[email protected]

Posts

Paper Review: Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper Review: DINOv3

My experience of searching for a job in 2024 as an MLE

Paper Review: Group Sequence Policy Optimization

Paper Review: Subliminal Learning: Language models transmit behavioral traits via hidden signals in data

Paper Review: ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper Review: V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning

Paper Review: Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper Review: SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper Review: Visual Planning: Lets Think Only with Images

Paper Review: AlphaEvolve: A coding agent for scientific and algorithmic discovery