π§ ATOM Memory Guide
ATOM's memory engine gives your assistant real recall: it can remember who you are, what you've uploaded, and what you've asked before β all running locally.
β‘ New in April 2025 β Smarter Memory Pipeline
ATOM now includes an advanced memory pipeline for sharper, more relevant context injection and reasoning:
β Semantic Deduplication
Filters out near-duplicate memory chunks using cosine similarity to reduce repetition and noise.
β LLM-Powered Reranking
Uses your selected local model (like mistral
or cogito
) to re-rank vector memories for relevance before theyβre injected into the prompt.
β Thread Awareness
Tracks the last 5 user messages and includes them as [Recent Thread]
in the prompt to maintain coherent, flowing conversations.
β Cogito Prompt Format
When using the cogito
model, ATOM switches to a structured prompt format to support logical reasoning:
# System:
<persona>
# Memory:
<context>
# Question:
<your input>
# Thoughtful Answer:
These upgrades make ATOM far more consistent and reflective across multi-turn sessions.
πΎ Memory Types
ATOM supports multiple memory layers, each with a different lifespan and retrieval method:
πΉ Identity Memory
Stores things like your name, preferences, tone, and personality traits.
- Injected often
- Never expires unless cleared
πΉ Chat Memory
Captures past conversation history, prioritized by relevance and recency.
- Retrieved using vector search
- Pruned automatically over time
πΉ File Memory
Stores content from uploaded files, chunked into searchable segments.
- Searchable by question
- Context-aware during file Q&A
πΉ Summary Memory
Reflections and rollups that summarize what the assistant has learned.
- Triggered automatically every 10 turns
- Includes file summaries and self-generated thoughts
π₯ File Upload Pipeline
When you upload a file, ATOM:
- Parses and chunks the text
- Scores each chunk (skips junk)
- Runs summarization
- Stores content in vector memory
Supported formats:
.txt
,.pdf
,.docx
,.zip
- Scanned PDFs get OCR'd automatically
π Reflection Cycle
Every 10 messages, ATOM:
- Generates reflections for each memory type
- Purges low-score or expired chunks
- Reprioritizes recent and useful data
These reflections are visible in the Memory Dashboard.
π§ͺ Advanced Behaviors
- Score thresholds prevent noisy memory from being used
- Chat context is rebuilt fresh each turn
- Active file context is injected alongside vector matches
- Streaming is disabled automatically for Discord-based messages to prevent partial outputs
Oh yeah..... it has a built in discord bridge, add it as a legit bot to your server and talk to any of your ollama models ON DISCORD.
ATOMβs memory engine is built for local, smart, and modular reasoning. Most of the time. YMMV.
Want to see how tools work under the hood?
β Head to the Tools page next.