Feb 12, 2026

v2.0.6

New Features

Retrieval Optimization:

Added keyword-based search in the search phase, while relativity continues to use cosine similarity of embeddings.
Added a relativity output field to the Chat API, allowing users to control the retrieval threshold during conversations using relativity.

Bug Fixes

Fixes:

Fixed an issue where an incorrect field was used in preference memory threshold filtering.
Fixed an issue that caused failures or hanging when calling the get_memory API with the filter parameter containing both field conditions and logical operators such as AND/OR/NOT.
Fixed issues related to passing extra body parameters to LLMs, ensuring compatibility with the enable thinking parameter input

Feb 10, 2026

v2.0.5

New Features

Knowledge Base Retrieval Optimization:

Dual-track retrieval: Supports coordinated storage and retrieval of "original text + memory", with customizable recall modes.
Context wake-up: Supports recalling surrounding context before/after chunks to improve semantic coherence in long texts.
Relevance filtering: The search API adds a relativity field (0~1) to filter low-quality recalls by threshold.

MindDock:

Cloud Skill: Added cloud skill support and a quick entry.
Real-time prompt injection: Dynamically inject Skill modules during chat to enable skill interoperability across models.

MCP Memory Deletion:

When deletion intent is detected, synchronously trigger deleteMemory and addFeedback.

Improvements

Scheduling & Retrieval Pipeline Refactor:

Pipeline refactor: Rebuilt the retrieval chain into four stages—Search -> Enhance -> Rerank -> Filter—scheduled by a unified API.
Scheduling optimization: Modularized the task processor and fixed Redis Streams serialization and user_context passing issues.

Skills Improvements:

Skill docs localization: Supports local storage of skill docs and generates dedicated URLs, downloadable directly via API.
Skill generation optimization: Can now generate richer and more complete skills from users' historical information.

Bug Fixes

Playground Improvements:

Fixed known UX issues and improved environment stability.

Jan 30, 2026

v2.0.4

New Features

Core Capabilities:

Added Skill Memory.
Added modules supporting memory versioning.
Added chat/completions interface, compatible with OAI protocol.

Improvements

Retrieval Optimization:

Reduced duplication between factual and preference memories during retrieval.

Bug Fixes

Bug Fixes:

Fixed bug for new users in Playground.
Fixed occasional latency when adding memories in cloud service.
Resolved graph database efficiency issues.

Jan 22, 2026

v2.0.3

New Features

Documentation:

Open-source project documentation: Fixed formatting issues, improved example descriptions, resolved Chinese-English inconsistencies, and added practical usage tips.

Dashboard:

Knowledge base now supports more file types: JSON, MD, XML.

GitHub Repository:

Adjusted the startup approach (introduced a new server router class) and removed the reference to src/memos/product_api.py.

Improvements

Open Source:

Optimized text and image memory addition by processing both types of information together as a single message.
Knowledge base search now supports the 'all' field.

Bug Fixes

Open Source:

Fixed an exception that occurred when the search input text was too long.

Jan 15, 2026

v2.0.2

New Features

Released "Knowledge Base Q&A Assistant" Construction Tutorial:

Official documentation updated with best practices for "Developing Q&A Assistants Based on Knowledge Bases", guiding you step-by-step to build a "MemOS Knowledge Base Assistant".

Improvements

Knowledge Base Document Retrieval:

Further improved the retrieval memory interface's ability to recall detailed knowledge base document content, optimizing answer quality for document-based queries.

Tool Memory Mechanism:

Added procedural experience for call trajectories to enhance the auxiliary effect of tool memory;
Compressed tool schema information to avoid duplicate addition of ToolSchemaMemory.

Fact Memory Merging and Archiving Mechanism:

Added merging and archiving mechanisms in fact memory processing to optimize issues of duplicate memory writing and recall, while ensuring extraction completeness.

Get Memory Interface:

Support returning tool memories;
Support returning memories under specific filtering conditions for debugging and display scenarios.

Bug Fixes

Get Memory Interface:

Fixed call error in get memory interface when parameter include_preference=False.

Jan 8, 2026

v2.0.1

New Features

Get Memory Interface Launch:

Get memory interface launched, supporting full retrieval of user memories.

Chat Function Launch on Cloud Service:

Chat function launched on cloud service, enabling one-click start of "memory-persistent" conversations.

Playground Update:

Playground memory management page supports memory deletion, allowing users to manually manage and delete expired memories they no longer need.

Improvements

Delete Memory Interface:

Optimized delete memory interface to support deletion of all types of memories, including user memories, knowledge base memories, etc.

Preference Memory:

feedback interface added feedback processing operation for preference memory

Memory Retrieval Optimization:

search interface supports optional deduplication parameter, supporting semantic similarity deduplication to improve retrieval result diversity.

Bug Fixes

kv cache Issue Fix:

Solved compatibility error of kv cache code in new version of memos

Scheduling Module Issue Fix:

Fixed error when scheduling enables local mode due to missing redis configuration

Dec 24, 2025

v2.0.0

New Features

Knowledge Base & Memory:

Added knowledge base support for long-term memory from documents and URLs

Feedback & Memory Management:

Added natural language feedback and correction for memories
Added memory deletion API by memory ID
Added MCP support for memory deletion and feedback

Conversation & Retrieval:

Added chat API with memory-aware retrieval
Added memory filtering with custom tags (Cloud & Open Source)

Multimodal & Tool Memory:

Added tool memory for tool usage history
Added image memory support for conversations and documents

Improvements

Data & Infrastructure:

Upgraded database for better stability and performance

Scheduler:

Rebuilt scheduler with Redis Streams and queue isolation
Added task priority, auto-recovery, and quota-based scheduling

Deployment & Engineering:

Added lightweight deployment with quick and full modes

Bug Fixes

Memory Scheduling & Updates:

Fixed legacy scheduling API to ensure correct memory isolation
Fixed memory update logging to show new memories correctly

Nov 5, 2025

v1.1.3

New Features

Memory Add & Search:

Added async mode (plaintext & preference)
Preference Memory now supported
Reranker strategy suite
BM25 for TreeTextMemory

Scheduler:

Modularized API scheduler
Redis ORM optimized for history sync & hybrid search

Data & Infra:

PolarDB graph backend connection pool/timeout & fixes
Unified graph factory (Neo4j/PolarDB/Nebula)
Milvus interface & item optimizations
Enhanced logging pipeline
Nacos-based dynamic configuration

Evaluation:

PrefEval field standardization
LoCoMo/LongMemEval/PrefEval/PersonaMem evaluation upgrades

Improvements

Plaintext Memory:

Standardized preference fields

Framework:

Updated API routes
More robust error handling & context tracing

Bug Fixes

Scheduler:

Fixed query scheduling edge case
Corrected message schema inconsistencies

Plaintext Memory:

Graph/DB：corrected PolarDB issues

Framework:

Fixed SQLite user listing bug

Sep 25, 2025

v1.1.1

New Features

Cloud Service 1.0 Beta Release:

🛠️ Ready-to-Use：Deploy directly via cloud API without complex setup
🔄 Cross-Session Memory：Automatically retrieve user profiles, preferences, and behaviors for continuous personalization
📖 Developer-Friendly：Comprehensive documentation and example code support
🎁 Free Quota：Every developer can receive basic quota to explore

Enterprise Group Q&A bot:

Supports Q&A based on cloud service knowledge base

Sep 10, 2025

v1.0.1

New Features

Group Q&A Bot:

Launched group Q&A bot based on MemOS Cube

KV-Cache Performance Optimization:

Updated comparative experimental data for KV-Cache on different GPU deployment schemes
Optimized test benchmarks and statistical methods

Plaintext Memory Enhancement:

Added Reranker sorting functionality for plaintext memory

Playground Updates:

Updated Playground version with all the above new features

Improvements

Plaintext Memory Optimization:

Optimized plaintext memory hallucination issues

Aug 7, 2025

v1.0.0

New Features

Playground:

Expanded Playground features and algorithm performance.

MemCube Construction:

Added a text game demo based on the MemCube novel.

Extended Evaluation Set:

Added LongMemEval evaluation results and scripts.

Improvements

Plaintext Memory:

Integrated internet search with Bocha.
Added support for Nebula database.
Added contextual understanding for the tree-structured plaintext memory search interface.

Bug Fixes

KV Cache Concatenation:

Fixed the concat_cache method.

Plaintext Memory:

Fixed Nebula search-related issues.

Jul 29, 2025

v0.2.2

New Features

Explicit Memory:

Implemented internet search integration with Nebula database support
Enhanced contextual understanding for memory extraction
Deployed internet search interface integrated with memreader for web message processing

KV Cache:

Conducted KV Cache evaluation including LMCache comprehensive research and performance testing
Completed stress testing for MemOS vllm (v0.2.1) in specific environments and models
Refined ttft test data for preloaded KV Cache on Qwen2.5-72B-Instruct model and updated report
Completed initial research and design for high-ROI inference service system using domestic hardware

Memory Operation Model:

Completed model training evaluation and release
Released 4b, 1.7b and 0.6b models supporting memory extraction and integration

Improvements

Documentation:

Added English version for first three chapters of Cookbook

Memory Scheduling:

Performed code refactoring and functionality improvements
Refactored functional modules including monitor, dispatcher and retriever
{ "Added major code categories": "schemas and utils" }
Enhanced network logging functionality
Improved scheduler robustness with new exception-catching decorators
Implemented locking for shared resources

Development Environment:

Updated Docker configuration
Updated dim environment configuration

API Enhancements:

Added playground context support
Enhanced product API functionality
Rewrote query module
Implemented chat history feature

Bug Fixes

Data Parsing:

Fixed date parsing error
Fixed memos_w_scheduler example code issues
Fixed fine-grained search bug in Nebula graph database
Fixed metadata filter retrieval logic

System Integration:

Aligned MOSProduct._build_system_prompt signature with MOSCore

Explicit Memory:

Fixed general text memory processing logic
Fixed memreader component issues

Jul 21, 2025

v0.2.1

New Features

MemCube Features:

Added plaintext memory + KV Cache functionality with reasoning and decoding performance report
Completed development of interactive cube feature with full workflow support and embedding model switching

MemOS System:

Released MemOS Neo lightweight version with simplified architecture and core API modules
Added MCP support research and preparation work, expanding MCP capabilities
Created embedded Agent flow Pipeline documentation with Coze framework integration

Deployment & Environment:

Added Docker deployment support
Enhanced API layer with multi-model support, compatible with OpenAI/Qwen/DeepSeek and other mainstream models
Improved embedding model support
Added neo4j Community Edition/Nebular database support
Implemented multi-tenant architecture support within single database

Evaluation & Testing:

Adapted and evaluated memos API interface format, updated evaluation code

Documentation & Examples:

Added new Cookbook content
Included Mud game example

Jul 11, 2025

v0.2.0

New Features

Website Usability Updates:

Added documentation search functionality
Implemented Chinese/English documentation switching
Enhanced footer navigation links and page editing features

Memory Operator Model:

Introduced MemReader-4B small model for memory extraction operations
Enabled fully local deployment for restricted network environments
Achieved lower cost and faster memory operations with performance exceeding GPT-4o-mini
Fine-tuned based on Qwen3-4b with human and model annotation data using supervised fine-tuning

Cross-Platform Framework Adaptation:

Added Windows platform deployment support
Added Mac platform deployment support
Completed adaptation for Linux, Windows, and macOS mainstream operating systems
Passed testing on Ubuntu 20.04+/CentOS, Windows 10+/11, macOS 14 Ventura+ platforms
Supports stable operation of core modules (memory lifecycle management, MIP protocol, memory cache scheduling)

Playground Progress:

Completed engineering, front-end, and algorithm end-to-end connection, and bug fixing and optimization in progress

Changelog