OpenClaw Cloud Plugin

Compatibility recommendation To avoid plugin loading or runtime errors, we recommend upgrading OpenClaw after MemOS has completed compatibility adaptation for the latest OpenClaw version.

OpenClaw's going viral lately. But if you've actually used it for a while, you'll find two issues you can hardly avoid:

Tokens burn way too quickly：OpenClaw can handle many long-tail tasks, but the cost is that each run consumes a huge number of tokens. When you have it monitoring your screen, running scheduled tasks, or handling complex workflows, the token consumption is painfully fast.
("u know token is money🫠")
Its memory function is rather poor：Many claim OpenClaw's memory outperforms ChatGPT. Yet in practice, you'll find it does retain some information—but often not what you need. Crucial preferences may be forgotten, while trivial chatter is remembered in vivid detail.
("can u please remember something really matter to me???")

NOT OpenClaw's fault, ALL AI agents suffering.

This tutorial guides you through using the MemOS OpenClaw plugin to figure out these 3 pain issues:

Significantly reduce token consumption — intelligently retrieve relevant memories without indiscriminately loading all history
Make memories genuinely useful — professional memory categorisation and management, remembering what should be retained and forgetting what should be discarded
Preserve OpenClaw's core strengths — cross-device control, proactive interaction, and human-like experience remain intact

Why is OpenClaw now a Token Killer🥷？

Issues with OpenClaw

1st convo: 500 tokens
2nd convo: 500 + 800 = 1,300 tokens
3rd convo: 1,300 + 600 = 1,900 tokens
10th convo: 10,000+ tokens

When you have OpenClaw monitoring your screen, performing executive tasks, and running on a schedule, this figure increases even more rapidly.

Three critical points in OpenClaw's native memory management

OpenClaw's memories reside in local .md files, categorised as global memories and daily memories. While this sounds promising, practical use reveals three unavoidable issues:

1. Global memories become booming

As global memories accumulate, context overload ensues. Moreover, these memories persistently interfere with current conversations. You might simply wish to ask a straightforward question, yet it dredges up every utterance from three months prior.

2. Daily memory recall proves difficult

Accumulating daily memories invariably makes retrieval cumbersome. To recall yesterday's activities, one must undergo an additional retrieval process. Maintaining cross-session memory becomes nearly impossible.

3. Memory relies on the model's proactive logging

OpenClaw's memory system relies on the model to log information itself, rather than automatic logging. This means it frequently misses details—you mention something, and it promptly forgets.

I've encountered this several times myself: I'd explicitly emphasised a particular project configuration, yet when restarting the conversation the next day, it had no recollection whatsoever, requiring me to explain it all over again.

OpenClaw vs OpenClaw + MemOS: Memory Solution Comparison

OpenClaw Native Memory Solution

Memory Storage Solution

Core Philosophy: File is Truth — Abandoning opaque vector databases in favor of Markdown files as the core carrier of memory.

Memory Storage Solution

Memory Retrieval Solution: Dual-Engine Drive

Engine	Technology	Features
Vector Search	Cosine Similarity	Captures semantic associations, excels at "concept matching", e.g., associating "login flow" with "authentication"
BM25 Search (Lexical Matching)	FTS5-based lexical matching	Handles "exact tokens", such as error codes, function names, or specific IDs

Retrieval Trigger: Triggered via Prompt, model decides automatically

Weighted Score Fusion: Score = (0.7 * VectorScore) + (0.3 * BM25Score)

Pain Points of Existing Solutions

Rudimentary Retrieval Algorithms: Unstable recall, weak relevance, Agent repeats trial and error, Token accumulates rapidly
Excessive Context Injection: Fixed reading of today + yesterday + long-term memory, high proportion of invalid context
Lack of Structure and Deduplication in Memory: Tool call long outputs are written directly and re-transmitted repeatedly, costs snowball

OpenClaw + MemOS Memory Solution

MemOS-OpenClaw

Three Core Effects

Effect 1: Controllable Token Costs 💰

From "Full Context Stuffing" to "Precise Recall per Task"

OpenClaw no longer stuffs today+yesterday+long-term memory every time. Instead, MemOS retrieves the most relevant few memories based on the current task (recall budget/count can be set), significantly reducing the proportion of invalid context and avoiding Token snowballing.

Effect 2: More Stable and Accurate Retrieval 🎯

Reduce repeated trial and error and re-asking, improve one-shot hit rate

MemOS provides stronger memory organization and retrieval capabilities (structured, hierarchical/multi-granular, semantic retrieval + rule filtering, etc.), making OpenClaw's recalled content more relevant and stable, reducing repeated reasoning and confirmation caused by "unstable recall".

Effect 3: Cleaner and More Usable Memory ✨

Structured + Deduplicated + High Compression, avoiding "Long Output Pollution"

Long outputs from tool calls (such as traversal results, config/schema, etc.) are not written back to the context verbatim repeatedly; MemOS can summarize/compress, deduplicate, and archive, making it "cleaner" over long-term operation, with memory quality improving rather than deteriorating over time.

After integrating the MemOS OpenClaw plugin👇🏻

✅ Retrieve only 3–5 relevant memories at a time
✅ Maintain context stability within 2,000–3,000 tokens
✅ Cost remains manageable regardless of dialogue length

MemOS plugins can enhance your OpenClaw

Feature	Description
Automatically remember all conversations	without relying on models to actively log, ensuring no critical information is missed
Precise recall	retrieve relevant memories based on current task intent, avoiding irrelevant historical data
Remember user preferences	categorise and store preference information specifically, remaining effective across sessions

MemOS OpenClaw has restructured the token consumption model, transforming costs from a ‘historical length function’ into a ‘task relevance function’. Your local OpenClaw costs become manageable, and the system operates more stably.

Quick Start

Three steps to boost your Agent with basic memory capabilities.

1. Install OpenClaw

Ensure that the OpenClaw environment is installed on your system:

# Install the newest version
npm install -g openclaw@latest

# Initialize and configure startup
openclaw onboard

2. Get and configure your API Key

2.1 Get your Key

2.2 Set Environment Variables

The plugin tries env files in order (openclaw → moltbot → clawdbot). For each key, the first file with a value wins. If none of these files exist (or the key is missing), it falls back to the process environment.

Where to configure

Files (priority order):
- ~/.openclaw/.env
- ~/.moltbot/.env
- ~/.clawdbot/.env
Each line is KEY=value

Quick setup (shell)

echo 'export MEMOS_API_KEY="mpg-..."' >> ~/.zshrc
source ~/.zshrc
# or

echo 'export MEMOS_API_KEY="mpg-..."' >> ~/.bashrc
source ~/.bashrc

Quick setup (Windows PowerShell)

[System.Environment]::SetEnvironmentVariable("MEMOS_API_KEY", "mpg-...", "User")

If MEMOS_API_KEY is missing, the plugin will warn with setup instructions and the API key URL.

Minimal config

MEMOS_API_KEY=YOUR_TOKEN

3. Install Plugins

Option A — NPM (Recommended)

openclaw plugins install @memtensor/memos-cloud-openclaw-plugin@latest
openclaw gateway restart

Note for Windows Users: If you encounter Error: spawn EINVAL, this is a known issue with OpenClaw's plugin installer on Windows. Please use Option B (Manual Install) below.

Make sure it’s enabled in ~/.openclaw/openclaw.json:

{
  "plugins": {
    "entries": {
      "memos-cloud-openclaw-plugin": { "enabled": true }
    }
  }
}

Option B — Manual Install (Workaround for Windows)

Download the latest .tgz from NPM.
Extract it to a local folder (e.g., C:\Users\YourName\.openclaw\extensions\memos-cloud-openclaw-plugin).
Configure ~/.openclaw/openclaw.json (or %USERPROFILE%\.openclaw\openclaw.json):

{
  "plugins": {
    "entries": {
      "memos-cloud-openclaw-plugin": { "enabled": true }
    },
    "load": {
      "paths": [
        "C:\\Users\\YourName\\.openclaw\\extensions\\memos-cloud-openclaw-plugin\\package"
      ]
    }
  }
}

Note: The extracted folder usually contains a package subfolder. Point to the folder containing package.json.

Restart the gateway after config changes.

4. Update Plugin

You can manually update the cloud plugin to the latest version using the following commands:

openclaw plugins update @memtensor/memos-cloud-openclaw-plugin@latest
openclaw gateway restart

Advanced Configuration for Open-Source Projects

If you wanna unlock further possibilities, you may explore and configure additional features via the MemOS GitHub project!

Visual Configuration UI (Config UI)

Starting from version v0.1.12, the Cloud Plugin features a built-in local visual configuration service, allowing you to manage and modify plugin settings more intuitively.

How to access:

Start your OpenClaw node or host gateway.
Once the plugin is successfully loaded and detects that the gateway is ready, it will automatically start the Config UI service in the background.
An access link will be printed in the terminal console logs (the default URL is typically http://127.0.0.1:38463).
Open this link in your browser to access the plugin's visual management backend.

Features:

Intuitive Editing: Supports form-based editing of all core configurations (such as Knowledge Base IDs, LLM retrieval parameters, multi-agent override rules, etc.).
Real-time Synchronization: Configuration changes saved via the interface take effect immediately during plugin runtime, without requiring a service restart.
Status Monitoring: The interface provides heartbeat detection with the host gateway to ensure the configuration synchronization link is healthy.

Multi-Agent Support & Isolation

The plugin provides powerful native support for multi-agent architectures (via the agent_id parameter), making it ideal for complex workflows or team agent scenarios.

1. Enable & Data Isolation

How to enable: Set "multiAgentMode": true in the config or configure the environment variable MEMOS_MULTI_AGENT_MODE=true.
Automatic Isolation: When enabled, the plugin automatically reads ctx.agentId from the context. This Agent identifier is attached to memory retrieval and writing, ensuring complete data isolation between different Agents under the same user (Note: the default "main" Agent is ignored to maintain legacy data compatibility).

2. Memory Switch per Agent (Whitelist Control) In Multi-Agent mode, if you do not want all Agents to consume memory, you can use allowedAgents to precisely control the whitelist:

{
  "plugins": {
    "entries": {
      "memos-cloud-openclaw-plugin": {
        "enabled": true,
        "config": {
          "multiAgentMode": true,
          "allowedAgents": ["research-agent", "coding-agent"]
        }
      }
    }
  }
}

(Tip: 1. If allowedAgents is not configured or is an empty array [], it means all Agents are allowed to use memory retrieval and writing. 2. If it is configured, Agents not in the configuration will be completely skipped, and only the configured Agents will be effective for memory retrieval and writing, thereby avoiding Token waste.)

3. Per-Agent Configuration (agentOverrides) Beyond simple toggles, you can use agentOverrides to configure different memory parameters for each Agent. For example, giving a research assistant a looser retrieval threshold, while restricting a coding assistant to read only a specific codebase knowledge base:

{
  "plugins": {
    "entries": {
      "memos-cloud-openclaw-plugin": {
        "enabled": true,
        "config": {
          "multiAgentMode": true,
          "allowedAgents": ["research-agent", "coding-agent"],
          "memoryLimitNumber": 6,
          "relativity": 0.45,

          "agentOverrides": {
            "research-agent": {
              "knowledgebaseIds": ["kb-research-papers"],
              "memoryLimitNumber": 12,
              "relativity": 0.3,
              "queryPrefix": "research context: "
            },
            "coding-agent": {
              "knowledgebaseIds": ["kb-codebase"],
              "memoryLimitNumber": 9,
              "addEnabled": false
            }
          }
        }
      }
    }
  }
}

(In the example above, memory writing is disabled for the coding-agent, and it can only retrieve the top 9 highly relevant memories from the kb-codebase knowledge base).

Deep customisation of environment variables

In addition to the required API Key, you may also adjust the plugin's behaviour via environment variables。

Further configuration details can be found in the MemTensor official plugin repo

Testing

Now, you can engage in multi-turn conversations with your Agent, for example:

First convo:

"My favourite programming language is Python"
"I'm developing an e-commerce project"

Second convo (new convo):

"Do you recall which programming language I prefer?"
"How is the project I mentioned previously progressing?"

Now, your OpenClaw will retrieve memories from MemOS Cloud and provide accurate responses ✅

Use with the CLI Tool

If you use OpenClaw alongside other Agents that can execute shell commands (e.g. Cursor, Codex, Claude Code), the MemOS CLI lets them all share the same memory space.

Install CLI and generate plugin-aware Skill

npm install -g @memtensor/memos-cloud-cli
memos init --agent openclaw --memos-plugin

--memos-plugin makes the generated Skill aware of the installed OpenClaw cloud plugin, avoiding duplicate memory writes.

Using OpenClaw as an example, in the LOCOMO evaluation, using MemOS CLI alone reduced token usage by about 65.5%; integrating MemOS Cloud + CLI improved accuracy from 66.60% to 77.27%.

Install Skill for other Agents

memos init --agent cursor
memos init --agent codex

Once installed, these Agents will also automatically search and write MemOS memories through the CLI, sharing the same memory space as OpenClaw.

Test whether the plugin and CLI work together

Return to OpenClaw and start a conversation that should produce long-term memory:

I am recently working on an e-commerce project, mainly using Python. Please remember this.

Then start a new conversation and ask:

What project am I working on recently? Which language am I mainly using?

If OpenClaw answers using the previous turn, the plugin has written and recalled memory successfully. The generated CLI Skill is also aware of the installed plugin, so it avoids duplicate memory writes.

See the MemOS CLI Guide for the full command reference and more usage details.

More Integration Methods

In addition to the OpenClaw plugin, MemOS also supports MCP (for Cursor, Claude Desktop, and other clients) and CLI (for any Agent framework that can execute shell commands). See Use in Agents for a full comparison.

Changelog

Local Plugin