Solution architecture - Amazon ElastiCache
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Solution architecture

The following architecture implements persistent memory for agentic AI applications using ElastiCache for Valkey as the vector storage component.

Key components:

  • Amazon Bedrock AgentCore Runtime – Provides the hosting environment for deploying and running agents. It provides access to the LLM and embedding models required for the architecture.

  • Agent framework (for example, Strands Agents) – Manages LLM invocations, tool execution, and user conversations. Strands Agents supports multiple LLMs, including models from Amazon Bedrock, Anthropic, Google Gemini, and OpenAI.

  • Mem0 – The memory orchestration layer that sits between AI agents and storage systems. Mem0 manages the memory lifecycle, from extracting information from agent interactions to storing and retrieving it.

  • Amazon ElastiCache for Valkey – The managed in-memory data store that serves as the vector storage component. ElastiCache uses Valkey's vector similarity search capabilities to store high-dimensional vector embeddings, enabling semantic memory retrieval.