NLP Papers

Looking for a specific paper or subject?


Large Diffusion Language Models

Large Language Diffusion Models: The Era Of Diffusion LLMs?

In this post we break down the Large Language Diffusion Models paper, introducing the first diffusion based LLM that rival a strong LLM at large scale. Introduction Large Language Models (LLMs) have become extremely powerful over the recent years, paving the way toward artificial general intelligence. These models are fundamentally autoregressive, meaning they predict the…
CoCoMix Teaser

CoCoMix by Meta AI – The Future of LLMs Pretraining?

Discover CoCoMix by Meta AI – a new approach for LLM pretraining using Continuous Concept Mixing, enriching word tokens with latent concepts!…
s1 teaser

s1: Simple Test-Time Scaling – Can 1k Samples Rival o1-Preview?

Discover s1: a simple yet powerful approach to test-time scaling for LLMs, rivaling o1-preivew with just 1k samples!…
DeepSeek-R1 teaser

DeepSeek-R1 Paper Explained – A New RL LLMs Era in AI?

Dive into the groundbreaking DeepSeek-R1 research paper, introduces open-source reasoning models that rivals the performance OpenAI’s o1!…
Titans teaser

Titans by Google: The Era of AI After Transformers?

Dive into Titans, a new AI architecture by Google, showing promising results comparing to Transformers! Paving the way for a new era in AI?…
rStar-Math Preview

rStar-Math by Microsoft: Can SLMs Beat OpenAI o1 in Math?

Discover how System 2 thinking through Monte Carlo Tree Search enables rStar-Math to rival OpenAI’s o1 in math, using Small Language Models!…
Large Concept Model High-level Architecture

Large Concept Models (LCMs) by Meta: The Era of AI After LLMs?

Explore Meta’s Large Concept Models (LCMs) - an AI model that processes concepts instead of tokens. Can it become the next LLM architecture?…
BLT architecture

Byte Latent Transformer (BLT) by Meta AI: A Tokenizer-free LLM Revolution

Explore Byte Latent Transformer (BLT) by Meta AI: A tokenizer-free LLM that scales better than tokenization-based LLMs…
Scroll to Top