I'd like a local, fully offline, open-source tool into which I can dump all our emails, Slack, Gdrive contents, code, and wiki, and then query it with free-form questions such as "with which customers did we discuss feature X?", getting back references to the original sources.
What are my options?
I want to avoid building my own or customising a lot. Ideally it would also recommend which models work well and have good defaults for those.
This is why I built the Nextcloud MCP server, so that you can talk with your own data. Obviously this is Nextcloud-specific, but if you're using it already then this is possible now.
The default MCP server deployment supports simple CRUD operations on your data, but if you enable vector search, the server will start embedding your docs, notes, etc. Ollama and OpenAI are currently supported as embedding providers.
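As a rough sketch of what enabling this might look like (the variable names here are illustrative, not taken from the project's docs — check the README for the real ones):

```shell
# Hypothetical configuration sketch for the Nextcloud MCP server.
# All names below are made up for illustration; consult the project
# README for the actual settings.
NEXTCLOUD_MCP_VECTOR_SEARCH=true
EMBEDDING_PROVIDER=ollama               # or: openai
OLLAMA_EMBEDDING_MODEL=nomic-embed-text # any local embedding model Ollama serves
```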
The MCP server then exposes tools you can use to search your docs via semantic search and/or BM25 (fused by Qdrant), as well as to generate responses using MCP sampling.
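To make the fusion part concrete, here is a toy illustration of reciprocal rank fusion (RRF), the kind of scheme Qdrant uses to merge a dense (semantic) ranking with a sparse (BM25) ranking — not the MCP server's actual code, and the document IDs are made up:

```python
# Toy reciprocal rank fusion (RRF): each ranker contributes
# 1 / (k + rank) per document, and documents that rank well in
# multiple lists rise to the top of the fused result.

def rrf_fuse(rankings, k=60):
    """Combine several ranked lists of doc IDs into one fused ranking."""
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

semantic_hits = ["doc_a", "doc_b", "doc_c"]  # dense-vector ranking
bm25_hits = ["doc_b", "doc_d", "doc_a"]      # keyword ranking

fused = rrf_fuse([semantic_hits, bm25_hits])
print(fused[0])  # "doc_b" ranks first: it appears near the top of both lists
```

The upshot is that neither retriever has to be perfect: keyword matches catch exact feature names while embeddings catch paraphrases, and the fusion step reconciles the two.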
Importantly, rather than generating responses itself, the server relies on MCP sampling, so you can use any LLM/MCP client. This MCP sampling/RAG pattern is extremely powerful, and it wouldn't surprise me if there's an open-source project that generalizes it across other data sources.