AI Papers Academy, Author at AI Papers Academy

Less Is More: Tiny Recursive Model (TRM) Paper Explained

In this post, we break down the TRM paper, a simpler version of the HRM, that beats HRM and top reasoning LLMs with a tiny 7M params model.

In this post we break down Meta AI’s DINOv3 research paper, which introduces a state-of-the-art Computer Vision foundation models family

In this post we break down the Hierarchical Reasoning Model (HRM), a new model that rivals top LLMs on reasoning benchmarks with only 27M params!

In this post we break down Microsoft’s Reinforcement Pre-Training, which scales up reinforcement learninng with next-token reasoning

In this post we explain the Darwin Gödel Machine, a novel method for self-improving AI agents by Sakana AI

Dive into Continuous Thought Machines, a novel architecture that strive to push AI closer to how the human brain works

Dive into Perception Language Models by Meta, a family of fully open SOTA vision-language models with detailed visual understanding

DeepSeekMath is the fundamental GRPO paper, the reinforcement learning method used in DeepSeek-R1. Dive in to understand how it works

Explore DAPO, an innovative open-source Reinforcement Learning paradigm for LLMs that rivals DeepSeek-R1 GRPO method.

Discover how OpenAI’s research reveals AI models cheating the system through reward hacking — and what happens when trying to stop them