What it provides
- Keyword search via FTS5 full-text indexing (BM25 scoring).
- Vector search via embeddings from any supported provider.
- Hybrid search that merges keyword and vector results into a single ranking.
- CJK support via trigram tokenization for Chinese, Japanese, and Korean.
- sqlite-vec acceleration for in-database vector queries (optional).
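
As a rough illustration of what hybrid search involves, the sketch below blends a keyword score and a vector score per chunk. It is not FluffBuzz's actual ranking code: the function names, the min-max normalization, and the `vector_weight` value are all assumptions chosen for the example. It also assumes higher scores are better in both inputs; SQLite's FTS5 `bm25()` reports better matches as numerically lower values, so those would need to be negated first.

```python
from typing import Dict, List, Tuple


def normalize(scores: Dict[str, float]) -> Dict[str, float]:
    """Min-max scale scores to [0, 1]; a constant score set maps to 1.0."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {doc: 1.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def hybrid_rank(
    keyword: Dict[str, float],
    vector: Dict[str, float],
    vector_weight: float = 0.7,  # hypothetical weight, not a FluffBuzz default
) -> List[Tuple[str, float]]:
    """Blend keyword and vector scores; higher is better in both inputs."""
    kw, vec = normalize(keyword), normalize(vector)
    merged = {
        doc: (1.0 - vector_weight) * kw.get(doc, 0.0)
        + vector_weight * vec.get(doc, 0.0)
        for doc in set(kw) | set(vec)
    }
    # Sort by descending score, then by chunk ID for a stable order.
    return sorted(merged.items(), key=lambda item: (-item[1], item[0]))


# Made-up scores for three chunks; only one chunk appears in both result sets.
keyword_hits = {"notes.md#1": 4.2, "notes.md#2": 1.1}
vector_hits = {"notes.md#2": 0.91, "notes.md#3": 0.85}
ranked = hybrid_rank(keyword_hits, vector_hits)
```

A chunk found by only one retriever still gets ranked, since a missing score is treated as zero rather than excluding the chunk.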
Getting started
If you have an API key for OpenAI, Gemini, Voyage, or Mistral, the builtin engine auto-detects it and enables vector search. No config needed. To choose a provider explicitly, set memorySearch.provider.
Supported embedding providers
| Provider | ID | Auto-detected | Notes |
|---|---|---|---|
| OpenAI | openai | Yes | Default: text-embedding-3-small |
| Gemini | gemini | Yes | Supports multimodal (image + audio) |
| Voyage | voyage | Yes | |
| Mistral | mistral | Yes | |
| Ollama | ollama | No | Local, set explicitly |
| Local | local | Yes (first) | GGUF model, ~0.6 GB download |
Set memorySearch.provider to override auto-detection.
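
One way to read the table above is as a selection order. The sketch below is a guess at that logic, not the engine's real code: `pick_provider`, its parameters, and the order in which remote providers are tried are hypothetical, and "local model ready" is an assumed stand-in for whatever makes the local provider eligible.

```python
from typing import Optional, Set

# Provider IDs come from the table; the order among them is an assumption.
REMOTE_ORDER = ["openai", "gemini", "voyage", "mistral"]


def pick_provider(
    explicit: Optional[str],
    keys_found: Set[str],
    local_model_ready: bool,
) -> Optional[str]:
    """Return the provider ID to use, or None if no provider is detected."""
    if explicit:                   # an explicit memorySearch.provider wins
        return explicit
    if local_model_ready:          # the table marks local as "Yes (first)"
        return "local"
    for provider in REMOTE_ORDER:  # first remote provider with an API key
        if provider in keys_found:
            return provider
    return None                    # ollama is only used when set explicitly


choice = pick_provider(None, {"voyage", "mistral"}, local_model_ready=False)
```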
How indexing works
FluffBuzz indexes MEMORY.md and memory/*.md into chunks (~400 tokens with
80-token overlap) and stores them in a per-agent SQLite database.
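
To make the chunk sizes concrete, here is a minimal sliding-window sketch using the 400-token size and 80-token overlap mentioned above. The `chunk_tokens` helper is hypothetical, and the placeholder token list stands in for whatever tokenizer FluffBuzz actually uses.

```python
from typing import List


def chunk_tokens(
    tokens: List[str], size: int = 400, overlap: int = 80
) -> List[List[str]]:
    """Split tokens into windows of `size` that share `overlap` tokens."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must be non-negative and smaller than size")
    step = size - overlap  # each window starts 320 tokens after the previous
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + size])
        if start + size >= len(tokens):
            break  # this window already reaches the end of the input
    return chunks


# Placeholder tokens; a real tokenizer would produce these from Markdown text.
tokens = [f"t{i}" for i in range(1000)]
chunks = chunk_tokens(tokens)
```

With these numbers, a 1000-token file yields three chunks, and the last 80 tokens of each chunk repeat at the start of the next so that text near a boundary stays searchable in context.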
- Index location: ~/.fluffbuzz/memory/<agentId>.sqlite
- File watching: changes to memory files trigger a debounced reindex (1.5s).
- Auto-reindex: when the embedding provider, model, or chunking config changes, the entire index is rebuilt automatically.
- Reindex on demand: fluffbuzz memory index --force
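
The auto-reindex rule can be pictured as comparing a fingerprint of the settings that shape the index. The sketch below is one plausible approach, assuming the fingerprint is stored alongside the index; `index_fingerprint` and `needs_full_reindex` are hypothetical names, not part of FluffBuzz.

```python
import hashlib
import json


def index_fingerprint(
    provider: str, model: str, chunk_size: int, overlap: int
) -> str:
    """Hash the settings that affect stored vectors and chunk boundaries."""
    payload = json.dumps(
        {
            "provider": provider,
            "model": model,
            "chunk_size": chunk_size,
            "overlap": overlap,
        },
        sort_keys=True,  # stable key order gives a stable hash
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def needs_full_reindex(stored: str, current: str) -> bool:
    """A changed fingerprint means old chunks and vectors are not comparable."""
    return stored != current


before = index_fingerprint("openai", "text-embedding-3-small", 400, 80)
after = index_fingerprint("openai", "text-embedding-3-small", 512, 80)
rebuild = needs_full_reindex(before, after)
```

A full rebuild is the safe response here because vectors from different models, or chunks cut at different boundaries, cannot be compared meaningfully within one index.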
You can also index Markdown files outside the workspace with
memorySearch.extraPaths. See the configuration reference.
When to use
The builtin engine is the right choice for most users:
- Works out of the box with no extra dependencies.
- Handles keyword and vector search well.
- Supports all embedding providers.
- Hybrid search combines the best of both retrieval approaches.
Troubleshooting
Memory search disabled? Check fluffbuzz memory status. If no provider is
detected, set one explicitly or add an API key.
Stale results? Run fluffbuzz memory index --force to rebuild. The watcher
may miss changes in rare edge cases.
sqlite-vec not loading? FluffBuzz falls back to in-process cosine similarity
automatically. Check logs for the specific load error.
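
For a sense of what the in-process fallback computes, the sketch below ranks stored chunk vectors against a query by cosine similarity using a plain linear scan. It is an illustration only: `cosine` and `top_k` are hypothetical helpers, and the two-dimensional vectors are toy data rather than real embeddings.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 when either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k(
    query: Sequence[float], chunks: Dict[str, Sequence[float]], k: int = 3
) -> List[Tuple[str, float]]:
    """Score every chunk against the query and keep the k most similar."""
    scored = [(chunk_id, cosine(query, vec)) for chunk_id, vec in chunks.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)[:k]


# Toy vectors: "a" points the same way as the query, "c" is orthogonal to it.
chunks = {"a": [1.0, 0.0], "b": [0.6, 0.8], "c": [0.0, 1.0]}
best = top_k([1.0, 0.0], chunks, k=2)
```

A scan like this touches every stored vector on each query, so results stay correct without sqlite-vec but lookups get slower as the index grows.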