Local Inference and Retrieval
The assistant combines local model execution with RAG-style document access so responses can be fast, private, and tied to actual source material.
- local model execution
- retrieval over project documents
- context windows shaped around task relevance