How Meta AI ‘s Human-Like V-JEPA Works?
Explore V-JEPA, which stands for Video Joint-Embedding Predicting Architecture. Another step in Meta AI’s journey for human-like AI
Explore V-JEPA, which stands for Video Joint-Embedding Predicting Architecture. Another step in Meta AI’s journey for human-like AI
In this post we dive into the era of 1-bit LLMs paper by Microsoft, which shows a promising direction for low cost large language models
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Read More »
In this post we dive into the Self-Rewarding Language Models paper by Meta AI, which can possibly be a step towards open-source AGI
Diving into a research paper introducing an innovative method to enhance LLM inference efficiency using memory offloading
Fast Inference of Mixture-of-Experts Language Models with Offloading Read More »
In this post we dive into TinyGPT-V, a small but mighty Multimodal LLM which brings Phi-2 success to vision-language tasks
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones Read More »
In this post we dive into LLM in a flash paper by Apple, that introduces a method to run LLMs on devices that have limited memory
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Read More »
In this post we go back to the important vision transformers paper, to understand how ViT adapted transformers to computer vision
Dive into Orca 2 research paper, the second version of the successful Orca small language model from Microsoft,
Orca 2: Teaching Small Language Models How to Reason Read More »
Following LCM-LoRA release, in this post we explore the evolution of diffusion models up to latent consistency models with LoRA
In this post we dive into Microsoft’s CODEFUSION, an approach to use diffusion models for code generation that achieves remarkable results
CODEFUSION: A Pre-trained Diffusion Model for Code Generation Read More »