Author name: aipapersacademy.com

Fast Inference of Mixture-of-Experts Language Models with Offloading

In this post, we dive into a new research paper titled “Fast Inference of Mixture-of-Experts Language Models with Offloading”. Motivation LLMs Are Getting Larger In recent years, large language models have driven remarkable advances in AI, with closed-source models such as GPT-3 and GPT-4, and with open-source models such…


TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

In this post, we dive into TinyGPT-V, a new multimodal large language model which was introduced in a research paper titled “TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones”. Before diving in, if you prefer a video format, then check out our video review for this paper: Motivation In recent years, we’ve seen a…


LLM in a flash: Efficient Large Language Model Inference with Limited Memory

In this post, we dive into a new research paper from Apple titled “LLM in a flash: Efficient Large Language Model Inference with Limited Memory”. Before diving in, if you prefer a video format, then check out our video review for this paper: Motivation In recent years, we’ve seen the tremendous success of large language…


CODEFUSION: A Pre-trained Diffusion Model for Code Generation

CODEFUSION is a new code generation model which was introduced in a research paper from Microsoft titled “CODEFUSION: A Pre-trained Diffusion Model for Code Generation”. Recently, we’ve observed significant progress in code generation using AI, which is mostly based on large language models (LLMs), so we refer to them as code LLMs. With a…


[Figure: ViT registers – adding tokens in addition to the image patches]

Vision Transformers Need Registers – Fixing a Bug in DINOv2?

In this post, we discuss vision transformer registers, a concept that was introduced in a research paper by Meta AI titled “Vision Transformers Need Registers”, written by authors who were part of the DINOv2 release, a successful foundational computer vision model by Meta AI which we covered before in the…

