Agentic AI 3 min

Agent Memory: In-Context, External, Procedural

How agents remember what they have done and what they know across multiple tasks.

Last updated July 3, 2026

Agents need memory to stay useful across steps. In-context memory is what sits in the current prompt. External memory is persisted data in a store, fetched only when relevant. Procedural memory is task policy and learned workflow structure, like system instructions and tool-use rules.

Use the Right Memory for the Job

Keep short-term reasoning in context. Store durable facts and user preferences externally. Keep operating rules in system instructions and tool policies.

Here is the failure mode in concrete terms: a standard chat completion API is stateless, so every single turn re-sends the entire conversation history. If you never summarize or externalize anything, the prompt you pay for at turn 20 contains all 20 turns, and you paid for the growing prefix on every one of those 20 calls, not just the last one. Try it below.

Interactive: Context Growth Simulator

Memory Strategy

Every chat-completion API call is stateless: the full conversation is re-sent on every turn. Drag the slider to see what that does to your prompt size and bill, with and without external memory.

Conversation length: 12 turns

All-in-context (no external memory)

Prompt at turn 12: 2,160 tokens

External memory + rolling summary

Prompt at turn 12: 580 tokens

$0.042

cumulative billed cost, all-in-context

$0.019

cumulative billed cost, external memory

The fix is not 'use a smaller model,' it is architectural: keep only the last N turns verbatim, and replace anything older with a periodically-updated summary of fixed size. That turns quadratic cost growth into linear cost growth, and is exactly what external memory buys you.

Brain, Notebook, Playbook

In-context memory is your short-term brain. External memory is your notebook. Procedural memory is your playbook. Strong agents use all three intentionally.

What's Next

Memory keeps one agent coherent. The next lesson looks at what happens when you split the work across several agents at once.

See it in actionThis concept is implemented in a real project

Agentic AIAdvanced

Autonomous Research Agent

A ReAct agent that takes a research question, searches the web, reads papers, cross-references sources, and produces a structured Markdown summary on its own.

View on GitHub

Browse all projects →