AI Recipes
Practical step-by-step guides for solving real tasks with AI. Focus on problem-solving logic, not just tools.
Humans have working, long-term, and procedural memory — an agent needs the same. In 2026 memory stopped being an optimization and became an architectural requirement: without it every session starts from scratch, and the user explains the same thing for the third time in a row.
Agents trust their first answer too much — and fail exactly where a human would reread and fix. Reflection forces the model to draft first, then critique, then revise — and quality jumps significantly.
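The draft → critique → revise loop described above can be sketched in a few lines. This is a minimal illustration, not the recipe's actual code: `call_model` is a hypothetical stand-in for whatever LLM client you use.

```python
def call_model(prompt: str) -> str:
    """Stub standing in for any LLM call; swap in your provider's client."""
    return f"response to: {prompt}"

def reflect(task: str, rounds: int = 1) -> str:
    # 1. Draft: the model answers the task directly.
    answer = call_model(f"Task: {task}\nAnswer:")
    for _ in range(rounds):
        # 2. Critique: the model rereads its own draft and lists concrete flaws.
        critique = call_model(f"Task: {task}\nDraft: {answer}\nList concrete flaws:")
        # 3. Revise: the model rewrites the draft to address the critique.
        answer = call_model(
            f"Task: {task}\nDraft: {answer}\nCritique: {critique}\nRevised answer:"
        )
    return answer
```

Each extra round costs two more model calls, so in practice one or two rounds is the usual ceiling.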
In 2026 only 22% of people trust autonomous agents — and the EU AI Act explicitly requires human oversight for high-risk systems. An agent that knows when to pause and ask for approval at critical points is not a bug — it is a mandatory part of production architecture.
Most teams run one premium LLM on everything — from classification to architectural decisions. A smart router classifies the request first, then dispatches to a cheap/mid/premium model — cutting cost 30-80% without a quality drop.
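The classify-then-dispatch idea fits in a dozen lines. A toy sketch under stated assumptions: the tier names, prices, and the keyword heuristic are all hypothetical — in production the classifier is typically a small LLM or a trained model, not string matching.

```python
# Hypothetical tiers: (model name, relative cost). Substitute your own.
TIERS = {
    "simple": ("cheap-model", 0.1),
    "moderate": ("mid-model", 1.0),
    "complex": ("premium-model", 10.0),
}

def classify(request: str) -> str:
    """Toy heuristic classifier; in practice a small LLM or trained classifier."""
    if len(request) < 40 and "?" not in request:
        return "simple"
    if any(w in request.lower() for w in ("architecture", "design", "trade-off")):
        return "complex"
    return "moderate"

def route(request: str) -> str:
    # Dispatch: look up the model for the classified tier.
    model, _cost = TIERS[classify(request)]
    return model
```

The savings come from the fact that most traffic is "simple": the premium model only ever sees the requests that actually need it.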
87% of top models remain vulnerable to jailbreaks, EU AI Act fines reach €35M, and layered guardrails catch 95% of incidents collectively. We break down how to build defense at every seam of the pipeline: a single guardrail is no guardrail — just a feeling of one.
Everyone builds agents, almost nobody measures them. We break down why classical tests don't work here, how to assemble your first eval dataset in an hour, the difference between end-to-end and trajectory evaluation, and how not to fool yourself with LLM-as-judge.
The cheapest LLM optimization isn't a smaller model or a smarter RAG — it's a properly structured prefix cache. We break down how caching actually works, where to place cache points, why segment order matters more than content, and how to measure hit-rate so you don't fool yourself.
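Why segment order matters can be seen without any LLM at all: a prefix cache reuses only the part of the prompt that is byte-identical to a previous request, so stable segments must come first. A minimal sketch with hypothetical helpers (`build_prompt`, `shared_prefix_len` are illustrative names, not a library API):

```python
def build_prompt(system: str, tools: str, history: list[str], query: str) -> str:
    # Stable segments first (system prompt, tool schemas), volatile ones last:
    # every request then shares the longest possible prefix with the previous one.
    return "\n".join([system, tools, *history, query])

def shared_prefix_len(a: str, b: str) -> int:
    """Length of the common prefix — the part a prefix cache can reuse."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n
```

With the stable-first ordering, two requests that differ only in the final query share everything up to that query; put the query first and the shared prefix collapses to almost nothing, and so does your hit-rate.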
Classic RAG isn't the only way to build a personal knowledge base. We unpack Andrej Karpathy's idea: an LLM acting as a librarian that maintains a topical wiki from your raw notes. When it beats RAG, how the structure works, and what to do when it breaks.
A voice agent isn't 'a text bot plus TTS'. It's an STT → LLM → TTS chain that has to respond before the human starts talking again. We break down the latency budget, turn-taking, interruption handling, and three typical streaming-pipeline breakdowns.
Vision agents clicking through websites are the hottest automation trend of 2026. We break down when you actually need one (not always), the difference between DOM and vision approaches, why 'click and wait 2 seconds' almost always breaks, and which boundaries are mandatory before you ship to production.
A workflow that automatically analyzes review sentiment and generates a personalized response. We break down the core principle: one complex prompt is a trap, a chain of simple ones is power. We build a workflow with branching for the cases where the AI needs to hand off to a human.
MCP (Model Context Protocol) is like a USB interface for AI: a standard connector through which Claude plugs into any tool. We break down how to design and run your own MCP server — from choosing what tools to expose to making the first call from Claude Desktop.
Classic RAG is a brainless conveyor: query, retrieve, answer. Agentic RAG is when the agent decides whether to search, what to search for, and whether the results are good enough. We break down how to turn a linear pipeline into a decision loop — using LangGraph and ChromaDB.
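The "decision loop" shape — retrieve, judge the evidence, either answer or reformulate and search again — can be shown framework-free before reaching for LangGraph. A minimal sketch with hypothetical callables (`retrieve`, `judge`, `answer` stand in for your retriever and model calls):

```python
def agentic_rag(question, retrieve, judge, answer, max_hops: int = 3):
    """Loop: retrieve, ask the agent whether the evidence suffices,
    and either stop or search again with a refined query."""
    query, evidence = question, []
    for _ in range(max_hops):
        evidence += retrieve(query)
        verdict = judge(question, evidence)  # "enough", or a refined query
        if verdict == "enough":
            break
        query = verdict  # the agent decides what to search for next
    return answer(question, evidence)
```

`max_hops` is the guardrail: without it, a judge that is never satisfied turns the loop into an infinite (and expensive) search.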
Code review is a perfect task for an AI agent: it requires reading context, finding related files, and applying different criteria to different parts of the code. We break down how to build an agent that reviews code like an experienced engineer — prioritizing findings, searching for context, and posting a structured report directly to GitHub.
Every company drowns in documents: invoices, contracts, requests, letters. AI can read them — but for this to work reliably, you need not one magic button, but a pipeline: first understand what the document is, then extract the needed data, then decide what to do with it. We break down how to build such a pipeline in n8n with the Claude API.
CrewAI is a framework where multiple AI agents work as a real team: each with a dedicated role, tools, and area of responsibility. We break down how to build a research crew with a coordinator and specialists — from role design to debugging your first real scenario.