Orca Research Paper Explained
In this post we dive into Orca’s paper, which shows how to do effective imitation tuning: Orca outperforms ChatGPT at about 7% of its size!
LongNet: Scaling Transformers to 1B Tokens with Dilated Attention
In this post we dive into the LongNet research paper, which introduces the Dilated Attention mechanism, and explain how it works.
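To give a flavor of the mechanism before you read the full post, here is a minimal, hypothetical single-head sketch in PyTorch. Attention is computed inside fixed-length segments, and within each segment only every `dilation`-th position participates, which is what drops the per-segment cost. The function name and defaults are illustrative, not LongNet's code; the actual model mixes several (segment length, dilation) pairs with different offsets so that every position is covered.

```python
import torch

def dilated_attention(q, k, v, segment_len=8, dilation=2):
    # Sketch of ONE (segment_len, dilation) pair of dilated attention.
    # Within each segment of length w, only every r-th position attends,
    # so per-segment cost drops from O(w^2) to O((w/r)^2).
    batch, n, d = q.shape
    out = torch.zeros_like(q)
    for start in range(0, n, segment_len):
        end = min(start + segment_len, n)
        idx = torch.arange(start, end, dilation)       # sparsified positions
        qs, ks, vs = q[:, idx], k[:, idx], v[:, idx]
        scores = qs @ ks.transpose(-2, -1) / d ** 0.5  # scaled dot-product
        out[:, idx] = scores.softmax(dim=-1) @ vs
    return out  # positions skipped here are handled by other offsets/pairs

# Toy usage: self-attention over a length-16 sequence.
x = torch.randn(1, 16, 8)
y = dilated_attention(x, x, x, segment_len=8, dilation=2)
```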
LIMA from Meta AI – Less Is More for Alignment of LLMs
In this post we explain LIMA, an LLM by Meta AI that was fine-tuned on only 1,000 samples, yet achieves results competitive with top LLMs.
Shepherd: A Critic for Language Model Generation
Dive into Shepherd, an LLM from Meta AI designed to critique responses from other LLMs, a step toward resolving LLM hallucinations.
Universal and Transferable Adversarial LLM Attacks
LLMs are aligned for safety to avoid generating harmful content. In this post we review a paper that successfully attacks aligned LLMs.
In this post we review Google DeepMind’s paper that introduces Soft Mixture of Experts, a fully differentiable sparse Transformer.
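As a taste of why Soft MoE is fully differentiable, here is a minimal PyTorch sketch of its routing, under my own simplifications. Unlike hard top-k routing, each slot is a softmax-weighted (convex) combination of all input tokens, and each token's output is a convex combination of the expert-processed slots, so no discrete assignment breaks the gradient. Names and shapes are illustrative, not the paper's code.

```python
import torch
from torch import nn

def soft_moe(x, phi, experts):
    # x:   (batch, n_tokens, d) input tokens
    # phi: (d, n_slots) learnable routing parameters (illustrative name)
    logits = x @ phi                      # (b, tokens, slots)
    dispatch = logits.softmax(dim=1)      # each slot = convex mix of tokens
    combine = logits.softmax(dim=2)       # each token = convex mix of slots
    slots = dispatch.transpose(1, 2) @ x  # (b, slots, d)
    chunks = slots.chunk(len(experts), dim=1)  # split slots among experts
    out_slots = torch.cat([e(c) for e, c in zip(experts, chunks)], dim=1)
    return combine @ out_slots            # (b, tokens, d)

# Toy usage: 8 slots shared evenly by 4 expert layers.
d, n_slots = 16, 8
experts = [nn.Linear(d, d) for _ in range(4)]
phi = torch.randn(d, n_slots)
y = soft_moe(torch.randn(2, 10, d), phi, experts)
```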