Memory
Manage conversation history and agent scratchpad state.
Flint ships three memory primitives — messages(), scratchpad(), and conversationMemory() — that cover the full range from simple REPL loops to long-running agents that need automatic history compression.
Importing
import { messages, scratchpad, conversationMemory } from 'flint/memory';
import type { Messages, Scratchpad, ConversationMemory, ConversationMemoryOpts } from 'flint/memory';

messages()
A lightweight, ordered store for Message objects. Use this when you control the conversation loop yourself and just need a reliable place to accumulate turns.
Type
type Messages = {
push(m: Message): void;
slice(from: number, to?: number): Message[];
replace(index: number, m: Message): void;
all(): Message[];
clear(): void;
};
function messages(): Messages;

Methods
| Method | Description |
|---|---|
| push(m) | Append a message to the store. |
| all() | Return a snapshot copy of all messages. |
| slice(from, to?) | Return a sub-range of messages (same semantics as Array.prototype.slice). |
| replace(index, m) | Overwrite a specific message by index; useful for tool-result injection. |
| clear() | Remove all messages. |
Example
import { messages } from 'flint/memory';
import { call } from 'flint';
import { anthropicAdapter } from '@flint/adapter-anthropic';
const adapter = anthropicAdapter({ apiKey: process.env.ANTHROPIC_API_KEY! });
const history = messages();
// First turn
history.push({ role: 'user', content: 'Hello!' });
const r1 = await call({ adapter, messages: history.all(), model: 'claude-haiku-4-5' });
if (r1.ok) history.push(r1.value.message);
// Second turn — full history is passed automatically
history.push({ role: 'user', content: 'What did I just say?' });
const r2 = await call({ adapter, messages: history.all(), model: 'claude-haiku-4-5' });
if (r2.ok) history.push(r2.value.message);

When to use: Multi-turn chat loops where you want explicit control over every message. The replace() method is especially useful when you need to retrofit a tool-call result into a prior turn rather than appending it.
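For example, a minimal sketch of retrofitting with replace() (the placeholder turn and the tool result here are hypothetical):

import { messages } from 'flint/memory';

const history = messages();
history.push({ role: 'user', content: 'Check the weather in Oslo.' });
// Hold the turn's position with a placeholder while the tool call runs.
history.push({ role: 'assistant', content: '[pending tool result]' });

// ... once the tool resolves, overwrite the placeholder in place:
history.replace(1, { role: 'assistant', content: 'It is 4°C and raining in Oslo.' });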
scratchpad()
An append-only note store for an agent's intermediate reasoning. Use this when your agent needs to accumulate observations, plans, or working notes across multiple steps before producing a final answer.
Type
type Scratchpad = {
note(text: string): void;
notes(): string[];
clear(): void;
};
function scratchpad(): Scratchpad;

Methods
| Method | Description |
|---|---|
| note(text) | Append a string note. |
| notes() | Return a snapshot copy of all notes. |
| clear() | Remove all notes. |
Example
import { scratchpad } from 'flint/memory';
import { call } from 'flint';
import { anthropicAdapter } from '@flint/adapter-anthropic';
const adapter = anthropicAdapter({ apiKey: process.env.ANTHROPIC_API_KEY! });
const pad = scratchpad();
// Agent accumulates observations across tool calls
pad.note('User is asking about flight prices to Tokyo.');
pad.note('Tool returned: cheapest fare is $780 on March 15.');
pad.note('User previously mentioned a $700 budget constraint.');
// Inject notes into the final prompt
const context = pad.notes().join('\n');
const result = await call({
adapter,
messages: [
{ role: 'system', content: `Working notes:\n${context}` },
{ role: 'user', content: 'What should I do?' },
],
model: 'claude-haiku-4-5',
});
if (result.ok) console.log(result.value.message.content);

When to use: Agentic loops where intermediate steps produce context that should influence the final response but shouldn't be part of the user-visible conversation history.
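To keep working notes out of the user-visible history, pair scratchpad() with messages() (a sketch; the tool call itself is elided):

import { messages, scratchpad } from 'flint/memory';

const history = messages(); // user-visible conversation
const pad = scratchpad();   // agent-internal working notes

history.push({ role: 'user', content: 'Find me a flight to Tokyo under $700.' });
pad.note('Budget constraint: $700.');
pad.note('Tool result: cheapest fare is $780 on March 15.');

// Only the final answer enters the visible history; the pad never does.
history.push({ role: 'assistant', content: 'The cheapest fare is $780, slightly over your budget.' });
pad.clear(); // reset the notes between tasks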
conversationMemory()
A rolling-window store with automatic summarization. When the message count reaches summarizeAt, it compresses the oldest messages into a summary and keeps only the most recent max messages in full.
Use this for long-running assistants where you cannot afford unbounded token growth but still need the model to have awareness of earlier parts of the conversation.
Type
type ConversationMemoryOpts = {
max: number;
summarizeAt: number;
summarizer: (messages: Message[]) => Promise<string>;
};
type ConversationMemory = {
append(m: Message): void;
messages(): Message[];
summary(): string | undefined;
clear(): void;
};
function conversationMemory(opts: ConversationMemoryOpts): ConversationMemory;

Options
| Option | Type | Description |
|---|---|---|
| max | number | Maximum number of messages to retain after a summarization pass. |
| summarizeAt | number | Trigger summarization when the store reaches this many messages. Must be greater than max. |
| summarizer | (messages: Message[]) => Promise<string> | Async function that receives the messages to compress and returns a summary string. |
Methods
| Method | Description |
|---|---|
| append(m) | Add a message; triggers summarization automatically if the threshold is met. |
| messages() | Return the current in-memory messages (may include a leading summary system message). |
| summary() | Return the most recent summary text, or undefined if none has been generated. |
| clear() | Remove all messages and reset the summary. |
Example
import { conversationMemory } from 'flint/memory';
import { call, agent, budget } from 'flint';
import type { Message } from 'flint'; // assumption: budget() and the Message type are exported from 'flint'
import { anthropicAdapter } from '@flint/adapter-anthropic';
const adapter = anthropicAdapter({ apiKey: process.env.ANTHROPIC_API_KEY! });
// Build a summarizer using any LLM call
async function summarize(msgs: Message[]): Promise<string> {
const transcript = msgs
.map((m) => `${m.role}: ${typeof m.content === 'string' ? m.content : '[content]'}`)
.join('\n');
const result = await call({
adapter,
messages: [
{
role: 'user',
content: `Summarize this conversation in 2-3 sentences:\n\n${transcript}`,
},
],
model: 'claude-haiku-4-5',
});
if (!result.ok) throw result.error;
return result.value.message.content as string;
}
const mem = conversationMemory({
max: 20, // keep at most 20 messages after compression
summarizeAt: 30, // compress when we hit 30 messages
summarizer: summarize,
});
// Pass mem.messages() into the agent each turn
const out = await agent({
adapter,
model: 'claude-opus-4-7',
messages: mem.messages(), // the rolling window, possibly with a leading summary
tools, // your tool definitions, declared elsewhere
budget: budget({ maxSteps: 10 }),
});

Fail-open behavior: If summarizer throws, the store is left unchanged and the conversation continues without compression. Check summary() to verify whether summarization has occurred.
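A per-turn loop over this store might look like the following sketch (mem and adapter are the values from the example above; the loop body is illustrative, not part of Flint):

// Append both sides of each turn, then check whether compression occurred.
async function turn(userText: string) {
  mem.append({ role: 'user', content: userText });
  const result = await call({ adapter, messages: mem.messages(), model: 'claude-haiku-4-5' });
  if (!result.ok) throw result.error;
  mem.append(result.value.message);
  // Once summary() is defined, mem.messages() starts with a compressed
  // system message and older turns are no longer present verbatim.
  if (mem.summary()) console.log('History has been compressed at least once.');
}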
When to use: Production chatbots or long-running sessions where you need to bound token costs without abruptly losing conversation context.
Which memory primitive to use
| Primitive | Use when |
|---|---|
| messages() | You want a simple array of messages with manual management |
| scratchpad() | You need free-form text scratch space for the agent's working notes |
| conversationMemory() | You want automatic summarization for long-running conversations |
Auto-summarization trigger
Summarization happens when append() is called and messages().length reaches summarizeAt. It:

- Takes the oldest messages beyond the most recent max
- Calls summarizer to compress them into a single summary string
- Replaces them with one system message containing that summary
- Retains the last max messages verbatim
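For concreteness, a sketch of the arithmetic using the option values from the example above, with a stub summarizer so it runs without an API key (because summarizer is async, compression may settle a tick after the triggering append):

import { conversationMemory } from 'flint/memory';

const mem = conversationMemory({
  max: 20,
  summarizeAt: 30,
  summarizer: async (msgs) => `(${msgs.length} earlier messages compressed)`,
});

for (let i = 1; i <= 30; i++) {
  mem.append({ role: 'user', content: `message ${i}` });
}

// The 30th append reaches summarizeAt: the oldest 30 - 20 = 10 messages are
// compressed into one leading system message, and the last 20 stay verbatim.
setTimeout(() => {
  console.log(mem.summary());         // "(10 earlier messages compressed)"
  console.log(mem.messages().length); // 21 (1 summary + 20 verbatim)
}, 0);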
Thread safety
conversationMemory() is not thread-safe. Don't call append() concurrently from multiple async paths.
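If several async code paths can produce messages, one simple mitigation is to funnel every append through a single promise chain so the calls never interleave (a sketch; safeAppend is a hypothetical helper, not part of Flint):

// `mem` and the Message type as in the examples above.
let queue: Promise<void> = Promise.resolve();

function safeAppend(m: Message): Promise<void> {
  queue = queue.then(() => mem.append(m));
  return queue;
}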
See Also
- Compress & Pipeline — token-level compression and message pipeline transforms
- RAG — retrieve relevant documents and inject them into the context window
- Recipes — end-to-end patterns combining memory with agents
- agent() — inject memory.messages() as agent messages
- FAQ: multi-turn conversation
