RAG Advisor
Designs retrieval-augmented generation pipelines — document ingestion, chunking, embedding, vector storage, retrieval, reranking, and generation prompt structure. Use when building a RAG system from scratch or evaluating whether RAG is the right approach. RAG architecture, vector search, semantic search, knowledge base.
Conversation Memory Designer
Designs conversation memory systems for chatbots and agents — sliding window, summarization, semantic recall, entity memory, and hybrid architectures with explicit token budgets and eviction policies. Use when building multi-turn or multi-session conversational AI that needs to remember context. Conversation memory, chat history, session persistence.
Embedding Strategy Advisor
Designs complete embedding pipelines — model selection, chunking strategy, vector index configuration, and query-time processing for search, RAG, and similarity matching. Use when choosing embedding models, configuring vector databases, or designing chunking for a new corpus. Embeddings, vector search, HNSW, chunking.
Retrieval Pipeline Optimizer
Diagnoses and fixes underperforming RAG retrieval through systematic failure analysis — recall failures, precision failures, ranking failures — with targeted optimizations and eval loops. Use when an existing RAG pipeline returns wrong, irrelevant, or poorly-ranked results. Retrieval quality, search optimization, reranking.