5.0 is the foundation. 5.1 takes four pieces of the platform and makes each one faster, smarter, or more resilient than it is today. This page is the customer-shaped read on those four: what changes for you, why it matters, what the trade-off looks like. Honest framing, no commit dates — we ship when the surface is right. For the briefer 5.1 card view see what's next.
Today most of Eldric's local-model traffic flows through Ollama. It works well, but it's a separate process to manage and a separate set of failure modes to debug. 5.1 wires the native Eldric inference daemon and the xLSTM workload daemon into the same cooperative path so the platform can serve language-model traffic, structured-ML workloads, and embeddings from one runtime — without an external sidecar.
What changes for you:
/v1/embeddings path. No separate embedding service to deploy.The trade-off: Ollama-format models still load (we don't break what works). But the recommended path moves to native GGUF for new installations — smaller surface area, fewer things to go wrong.
5.0 ships the routing layer with explicit configuration — the controller knows the peers because someone configured them. If a peer goes down mid-conversation, the controller catches the timeout and retries; but new requests still get sent the same way until you change the config. 5.1's "Path A peer-aware routing" makes the routing decision live: the controller continuously discovers the cluster's healthy nodes, ranks them by load and latency, and steers new requests to whichever node can actually serve them right now.
What changes for you:
The trade-off: a small new background task (peer health pings) runs on every controller. Negligible on real clusters; explicitly disable-able on single-node installs where it's noise.
The embedding model is the spine of RAG. If you change it — a better model becomes available, the chunking strategy gets revised, the vector store schema evolves — every chunk in every knowledge base needs to be re-embedded. Today that's a careful operator job: schedule a maintenance window, run an admin script per namespace, monitor progress, hope nothing surfaces a corner case.
5.1 turns the same operation into a single button:
The trade-off: re-embedding a large corpus takes time and burns embedding-model GPU cycles. The platform schedules the work to minimise user-facing latency, but a large institution with a million documents will still see a few hours of background work.
5.0 already ships content-aware chunking — the platform picks a strategy per content type based on a built-in table (scientific PDF → semantic, code → function-boundary, CSV → per-row, and so on). See chunking strategies for the full default set.
5.1 takes the same surface and adds two things:
The trade-off: the learning loop needs a few weeks of acceptance signal before the per-tenant defaults stabilise. Brand-new installations keep the built-in table defaults until the loop has enough signal.
None of the four pieces above has a release date attached. That's not because we don't have an internal plan — we do. It's because software dates slip; published dates create pressure to ship before the surface is right. Eldric 5.0 took the time it took. 5.1 will take the time it takes.
What you can rely on:
The 5.1 roadmap moves on customer signal. If something you need isn't here, write to office@eldric.ai. Paid-tier customers with a license ID: support@eldric.ai for prioritised handling — feature requests from active customers carry weight in scheduling.
For the briefer 5.1 cards view: what's next. For everything shipped right now: release notes. For 5.0-line bugs and preview-status caveats: known issues.