Part 7

Retrieval-Augmented Generation (RAG): Architecture, Mechanisms, and Core Advantages

Retrieval-Augmented Generation (RAG) represents a paradigm shift in natural language processing (NLP), integrating large language models (LLMs) with dynamic information retrieval systems to produce responses that are both contextually enriched and factually grounded (Lewis et al. 2020). At its core, the RAG architecture couples a conventional generative model, which produces text autoregressively from its learned parameters, with a retrieval component that fetches relevant passages from an external corpus and supplies them to the generator as additional context at inference time.
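
To make the retrieve-then-generate flow concrete, here is a minimal Python sketch. The in-memory corpus, the word-overlap `retrieve` function, and the placeholder `generate` call are all illustrative stand-ins: a production system would use a vector store with dense embeddings and a real LLM endpoint.

```python
# Toy in-memory corpus standing in for a real document store.
CORPUS = [
    "RAG couples a retriever with a generative language model.",
    "The retriever fetches passages relevant to the user query.",
    "The generator conditions on both the query and the passages.",
]

def retrieve(query: str, k: int = 2) -> list[str]:
    """Rank passages by naive word overlap with the query
    (a stand-in for dense-vector similarity search)."""
    q_words = set(query.lower().split())
    scored = [(len(q_words & set(doc.lower().split())), doc) for doc in CORPUS]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for score, doc in scored[:k] if score > 0]

def generate(prompt: str) -> str:
    """Placeholder for the generative model; in practice this would
    call a local LLM or a hosted inference endpoint."""
    return f"[model output conditioned on]\n{prompt}"

def rag_answer(query: str) -> str:
    passages = retrieve(query)                       # 1. retrieve evidence
    context = "\n".join(f"- {p}" for p in passages)  # 2. build augmented prompt
    prompt = f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"
    return generate(prompt)                          # 3. generate grounded answer

print(rag_answer("How does the retriever help the generator?"))
```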

Comparing Leading Large Language Models: Architectures, Performance, and Specialised Capabilities

Most contemporary LLMs employ a decoder-only transformer architecture, which processes sequences in parallel via self-attention. However, in a dense transformer every parameter participates in every forward pass, so compute and serving cost grow roughly in proportion to model size. Mixture-of-experts (MoE) approaches address this by activating only a subset of parameters per token. In the Switch Transformer, MoE routing sends each token to a single expert feed-forward network chosen by a learned router, so the total parameter count can grow by orders of magnitude while per-token compute remains nearly constant (Fedus et al. 2022).
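
The toy NumPy sketch below illustrates that top-1 (Switch-style) routing. The random router weights and linear "experts" are placeholders for learned feed-forward blocks, and real implementations add load-balancing losses and expert capacity limits, which are omitted here.

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, n_experts, n_tokens = 8, 4, 5

# Router: a linear layer producing one logit per expert.
W_router = rng.normal(size=(d_model, n_experts))
# Each "expert" here is a plain linear map; real experts are FFN blocks.
experts = [rng.normal(size=(d_model, d_model)) for _ in range(n_experts)]

def switch_layer(x: np.ndarray) -> np.ndarray:
    """Top-1 routing: each token visits exactly one expert, so
    per-token compute stays flat as the expert count grows."""
    logits = x @ W_router                          # (n_tokens, n_experts)
    probs = np.exp(logits - logits.max(axis=1, keepdims=True))
    probs /= probs.sum(axis=1, keepdims=True)      # softmax over experts
    chosen = probs.argmax(axis=1)                  # top-1 expert per token
    out = np.empty_like(x)
    for i, e in enumerate(chosen):
        out[i] = probs[i, e] * (x[i] @ experts[e])  # gate-scaled expert output
    return out

tokens = rng.normal(size=(n_tokens, d_model))
print(switch_layer(tokens).shape)  # (5, 8)
```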

Small Language Models (SLMs) and Knowledge Distillation

Small Language Models (SLMs) are compact neural networks designed to perform natural language processing (NLP) tasks with significantly fewer parameters and lower computational requirements than their larger counterparts. SLMs aim to deliver robust performance in resource-constrained environments, such as mobile devices or edge computing systems, where efficiency is paramount. A central technique for building them is knowledge distillation, in which a small student model is trained to match the softened output distribution of a larger teacher, transferring much of the teacher's behaviour at a fraction of the parameter count (Hinton et al. 2015).
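
A minimal NumPy sketch of that soft-target loss follows. The temperature value and toy logits are illustrative, and in practice this term is combined with ordinary cross-entropy on ground-truth labels.

```python
import numpy as np

def softmax(z: np.ndarray, T: float = 1.0) -> np.ndarray:
    z = z / T
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, T: float = 2.0) -> float:
    """KL divergence between temperature-softened teacher and student
    distributions. A higher T exposes the teacher's relative confidence
    across non-top classes, not just its argmax prediction."""
    p_t = softmax(teacher_logits, T)
    p_s = softmax(student_logits, T)
    kl = np.sum(p_t * (np.log(p_t) - np.log(p_s)), axis=-1)
    return float((T ** 2) * kl.mean())  # T**2 keeps gradients comparable across T

# Toy logits for a 3-class task, batch of two examples.
teacher = np.array([[4.0, 1.0, 0.5], [0.2, 3.5, 0.3]])
student = np.array([[2.0, 1.5, 0.5], [0.5, 2.0, 1.0]])
print(distillation_loss(student, teacher))
```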

Why Size Matters: The Impact of Model Scale on Performance and Capabilities in Large Language Models

A defining characteristic of LLMs is their scale, measured by the number of parameters, which has grown exponentially in recent years. Models such as GPT-3, with 175 billion parameters, and its successors have demonstrated remarkable capabilities, raising questions about the relationship between model size and performance (Brown et al. 2020).
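
A back-of-envelope calculation shows one practical consequence of scale: memory for the weights alone grows linearly with parameter count. The helper below is illustrative; the bytes-per-parameter figure depends on the numeric format used.

```python
def weight_memory_gb(n_params: float, bytes_per_param: int = 2) -> float:
    """Memory for the weights alone (excludes activations, optimizer
    state, and KV cache). Bytes per parameter: 4 fp32, 2 fp16/bf16, 1 int8."""
    return n_params * bytes_per_param / 1e9

for name, n in [("GPT-3 (175B)", 175e9), ("7B model", 7e9)]:
    print(f"{name}: ~{weight_memory_gb(n):.0f} GB at fp16")
# GPT-3 (175B): ~350 GB at fp16
# 7B model: ~14 GB at fp16
```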