AI Memory
AI memory allows models and agents to recall past interactions, adapt over time, and reason more effectively. We examined the most popular LLMs' ability to store information in both long-term and short-term memory, along with their context window capabilities.
RELC-Bench: Retrieval on Long Context Benchmark
RELC-Bench (RELC-Bench: Retrieval on Long Context Benchmark) aims to measure a model’s ability to find and extract a specific numeric value from one or more documents within its context. It tests whether the model can remember and retrieve a specific fact it just saw in the input.
AI Apps with MCP Memory Benchmark & Tutorial
We tested four Model Context Protocol (MCP) memory servers to measure which ones actually retain and retrieve context across AI agent sessions. Using LangChain’s ReAct Agent, we connected each server, ran standardized multi-session conversations, and scored them on memory operation accuracy.