性能调优
向量嵌入优化
fast_embedder = {
"backend": "ollama",
"config": {
"model_name_or_path": "nomic-embed-text:latest"
}
}
slow_embedder = {
"backend": "sentence_transformer",
"config": {
"model_name_or_path": "nomic-ai/nomic-embed-text-v1.5"
}
}
推理速度优化
generation_config = {
"max_new_tokens": 256, # 限制响应长度
"temperature": 0.7,
"do_sample": True
}
系统资源优化
内存容量限制
scheduler_config = {
"memory_capacities": {
"working_memory_capacity": 20, # 工作记忆
"user_memory_capacity": 500, # 用户记忆
"long_term_memory_capacity": 2000, # 长时记忆
"transformed_act_memory_capacity": 50 # KV cache(激活记忆)
}
}
批处理操作
def batch_memory_operations(operations, batch_size=10):
for i in range(0, len(operations), batch_size):
batch = operations[i:i + batch_size]
yield batch # 批量处理