Eldric 5.0 GA — release notes

The patch train — through v5.0.51

What landed since GA.

The 5.0.x line ships continuously. Every patch is additive — sudo dnf update eldric-aios and you're current, no breaking changes and no migration. Here's the customer-visible sweep from the GA baseline through the current v5.0.51 release.

Production high availability — a multi-controller consensus cluster that forms, replicates and fails over under live test. Lose a controller and the platform keeps serving; the surviving nodes elect a new leader and clients ride through.
Multi-site federation — connect clusters across regions and network segments with mutual-TLS trust, cross-site routing, a disaster-recovery panel and a guided setup wizard. Run one logical platform over many locations.
The extension & plugin ecosystem — a marketplace of drop-in integrations you configure entirely in the admin GUI, no file editing: environmental sensors (the Bosch family), networking and infrastructure gear (Cisco, Fortinet, Aruba, Palo Alto, HPE, UniFi, FRITZ!Box), UPS and printer fleets over SNMP, and database connectors. Each device is named and multi-instance — add as many as you run. Customers write their own plugins for their own hardware.
Native structured-ML & world-models — the platform runs xLSTM-family workloads natively, in-process. A seismic world-model renders real earthquake wavefronts right in the chat from cataloged events — and answers honestly "not found" for anything not in the catalogue, never a fabricated result.
Knowledge storage that learns — associative matrix memory alongside exact vector storage, pinnable knowledge containers, per-tenant isolation and a compliance-grade delete. What the platform learns in one session it can recall in the next.
Native inference in the single binary — GGUF and xLSTM models load directly, with no external runtime to install or manage. Run inference on dedicated nodes when you have them; the cluster handles isolation and failover for you.
The science source registry — honest by construction — 140+ scientific data sources behind one dispatcher. A source that isn't wired returns a clean "not available" rather than inventing an answer. The same discipline runs cluster-wide: Eldric answers from real sources or says it couldn't find one.
Rolling in now — role-based access control (roles, permissions, and per-tenant enforcement) is landing through the current 5.0.x patches.

For where the line is headed next, see the patch-train roadmap. For the current downloads, see downloads.

What's new in 5.0

The major lifts.

xLSTM structured-ML worker

Eldric serves four structured-ML workload classes natively: policy execution (LRAM), forecasting (TiRex), encoding (ViL), and associative retrieval. Requests route by workload type; the IoT capability binds policy execution to real-time control loops with safety interlocks. Associative retrieval is native code with microsecond latency on CPU alone. License-gated per workload, structured error responses on missing capability.

IoT closed-loop bindings

Policy-driven control of OPC-UA, Modbus and MQTT-Sparkplug-B endpoints through the IoT worker. Safety interlocks ship by default: magnitude clamp on output, joint-limit checks, watchdog timeout, NaN reject, and an emergency-stop path. Bindings can be pinned to a specific xLSTM worker or set to failover automatically. Streaming via WebSocket transport for sub-tick latency; OPC-UA and MQTT-Sparkplug-B for fleet-wide subscriptions; watchdog-driven fallback (zero / freeze / last-known-good) when the policy worker misses its deadline. Details on xLSTM & IoT transports.

Admin bundle export / import

Pack the matrix memory, vector documents, knowledge-base sources, ENRN classifiers, skills, source-registry entries, dream artefacts and identity overlays of one Eldric installation into a single signed file. Ship it. Unpack on another cluster with a clean merge — same version or different. Per-tenant scope filters so an export never carries other tenants' data with it. Useful for tenant portability, project handoff, edge seeding, lightweight federation between regional clusters, and disaster recovery.

Edge runtime

Single-node Eldric install on Raspberry Pi 4 (4 GB+ RAM, 32 GB+ storage), Intel NUC and NVIDIA Jetson. Local knowledge-base, local matrix memory, local chat serving without a controller in sight. Buffers writes when the link to the central cluster is down and flushes when it returns. The cognitive layer of an autonomous platform, on the platform.

Memory subsystem

Multi-tier memory: matrix memory (associative updates), vector RAG (full-text + semantic search), ENRN classifiers, the Agent × Domain matrix (15 agents × 7 domains). Crash-safe persistence with integrity checksums. Sub-millisecond memory recall on hot paths.

RAG with citations

Retrieval-augmented generation ships on by default in 5.0. Upload documents (PDF, DOCX, Markdown, plain text, code, CSV, audio, video, sensor streams) and the platform runs them through content-aware chunking — different strategies per content type, with the intelligent-upload flow auto-suggesting parameters and letting you override before commit. The native GGUF embedding model (~80 MB, runs on CPU) embeds locally; the data worker stores chunks alongside vectors; queries cite the source passages back to the user. Custom classification (Pro+) lets you teach the router your own intent classes. The retention loop turns accepted answers into the next training corpus over time. See using RAG, RAG architecture, chunking strategies, RAG on demand, custom classification.

Science

The Source Registry groups scientific data sources into sixteen categories with twenty-eight seeded sources and a plugin entry point for adding your own. Eleven LLM-callable tools dispatch through one endpoint. Bioinformatics, pharma, CRISPR, LIMS, materials, climate, neuroscience, particle physics, gravitational waves, space agencies, and more.

Other workers

Communication worker (email, SMS, VoIP, WhatsApp, Signal, Teams, XMPP). Media worker (STT, TTS, video processing). Training worker (Unsloth, Axolotl, TRL, MLX, llama.cpp, xLSTM backends). Swarm controller, Agent worker, Data worker, Cloud worker, Native Inference daemon — all production-ready with 4.x feature parity.

Auto-update on macOS

The macOS GUI client carries a cryptographically-signed in-app updater. New releases are picked up automatically from the package server with a small notification in the app; the update applies on next launch. Update channels are stable by default and beta as an opt-in via the GUI's settings.

Faster, smaller memory (preview)

A research preview of EMM Compressed retrieval ships in 5.0 as an opt-in. Customers can convert a knowledge-base's matrix-memory layer to a compressed representation that's smaller on disk and faster at high concurrency, at a small accuracy trade-off on the kind of edge-case queries where exact retrieval matters most. The full-precision path stays available alongside it. Details on advanced retrieval.

Smart memory inference (preview)

Eldric's native inference now consults the matrix-memory layer directly at prompt boundaries. Model answers are grounded in your tenant's own data without a separate retrieval round-trip — useful for chat-style workloads where total latency matters and for customer-specific content where the model would otherwise produce a generic first pass. Sub-2 ms per-token overhead on CPU. Pro+ opt-in. Details on smart memory inference.

Eldric for iPad

The iPad client lands as part of the universal-app shape alongside iPhone. Tablet-aware split view (sidebar + chat + artifact), floating composer, Apple Pencil and Scribble, drag-drop ingest from Files / Photos / Mail, notification center for long-running agent tasks. Theme settings sync across all clients via the controller. Multi-window for Stage Manager + Split View ships as preview. TestFlight today, App Store next. Details on iPad.

Agent Builder admin UI

The "AI that builds AI" flow now has a chat-shell admin interface. Describe the agent you want; the platform generates the agent code, tool definitions, prompts and test cases; sandboxed test runs verify behaviour before deployment. The generator and the builder share the agent package format; either path produces a self-contained agents/<name>/ directory you version-control.

Distilled router classifier (preview)

An ENRN v18 router-intent classifier ships as an opt-in for Professional and Enterprise tiers. It replaces the small-LLM routing decision with a single-pass neural classifier trained on your cluster's accumulated routing patterns. Lower latency on the routing decision, less GPU time burned on classification; the LLM stays in the loop for genuinely ambiguous queries. See advanced retrieval for the customer-facing detail.

ARM64 minimal edge runtime — live

The minimal-runtime package now ships as a signed aarch64 RPM alongside x86-64. The same one-line installer detects the host architecture and installs the right build. Verified on Raspberry Pi 4, generic ARM64 servers and NVIDIA Jetson. See edge install for the deployment paths.

Industrial transports

The IoT worker's xLSTM policy bindings drive WebSocket, Modbus TCP and the publish half of MQTT-Sparkplug-B at GA (Pro+ tier). OPC-UA ships at preview in 5.0 GA — the C++ source is in the release, the open62541 library bundles into the cluster RPM at the next cut. The MQTT-subscribe half — pulling observations from a Sparkplug-B sensor fleet through the policy worker — stays preview for the 5.0 line and lands in an upcoming 5.0.x patch. Safety-fallback (zero / freeze / last-known-good) is unconditional across all transports. See xLSTM & IoT and known issues.

Model badges in chat

Every model in the chat picker shows a small coloured badge identifying the backend that serves it — Ollama, OpenAI, Eldric Inferenced, vLLM, llama.cpp, Anthropic, xAI, Groq and the rest. Green for models on your own cluster, brand-coloured for external APIs. The same badge appears under every assistant message so you always see which backend served it. No silent re-routing between local and external. Customer-facing detail on model providers.

Tool query info in chat

A new Show tool query details toggle in chat settings (default on) prints a one-line summary of what each tool actually queried — knowledge base + namespace + query + hit count for RAG, source + query for science look-ups, search engine + terms for web, recipient for comm. Turn it off for a denser transcript. Available across every chat client.

Agent Builder share, publish, edit

Agents you build inside the chat shell now have one-click share within your tenant, publish-to-marketplace for the cluster admin to review, and an in-place editor with versioning. Sandboxed test runs validate every change before deploy. The agent-package format stays the same — agents/<name>/ — so manual editing and the UI agree.

iPad Phase D — multi-window, keyboard shortcuts, drag-drop

Phase D adds Stage Manager + Split View multi-window, a comprehensive keyboard-shortcut map for chat actions, cross-window drag to fork a conversation, and a collision toast when two clients edit the same theme. macOS now also shows long-running tasks as native UNUserNotification entries, with APNs wired through for remote pushes from cluster events. See iPad page.

xLSTM workload control plane

Admin UI for the four xLSTM workload classes — policy, forecast, encode, retrieve — with per-tenant enable, per-class model selector, and live load chart. Routing requests to the xLSTM worker no longer needs an admin to read JSON.

EMM Compressed migration CLI and per-namespace toggle

Existing matrix-memory namespaces can be migrated to the EMM Compressed variant on a per-namespace basis with a single command. The chat shell shows migration progress; rollback is a single command. Lets customers adopt the new format on the namespaces that benefit without committing the whole installation at once.

Cross-distro packaging — coming shortly after GA

5.0 GA ships signed RPMs for the Fedora / RHEL / CentOS / Rocky / Alma family. Ubuntu 24.04 and Debian 12 DEB packages follow shortly after GA. Customers on Ubuntu / Debian today can run 5.0 via WSL2 or container.

Coming in 5.0.x

What's deferred.

Customer-named roles (RBAC). Tenant admins compose their own roles ("Auditor", "Lab tech", "Finance") from the underlying permission set. 5.0 ships with four built-in roles; an upcoming 5.0.x patch adds the composition layer.
Multi-region federation. Full real-time multi-region replication with the WLCG tier model. 5.0 ships lightweight federation via the bundle format; an upcoming 5.0.x patch adds the always-on replication path.
NOVA optional kernel. The experimental self-improvement module remains explicitly experimental and off by default in 5.0. Production-status (and customer-facing docs beyond the research framing) is later in 5.0.x.
Mac-arm64 native xLSTM (Level 1 JAX-XLA). The Level-1 native xLSTM backend compiles cleanly in 5.0 with the build flag disabled. Wiring it through as the default backend on Apple Silicon — with the native-backend link integration — lands in an upcoming 5.0.x patch.
EMM Compressed int8 quantization. EMM Compressed retrieval ships at full precision in 5.0. The int8-quantized variant — smaller .emm files on edge nodes — lands in an upcoming 5.0.x patch.
Custom theme palette. 5.0 ships five themes (Frost + four others) with cluster sync. Customer-customisable colour palettes inside the theme editor arrive in an upcoming 5.0.x patch.
Offline-mode automation. Automated drain + TTL + dead-letter queue + back-pressure on the buffer when a sustained outage threatens to overflow. 5.0 ships the buffering primitive; an upcoming 5.0.x patch adds the orchestration.
Unified brain orchestration. Cross-tier learning across the memory subsystem with deeper integration between the workers.
ENRN v8. Mixture-of-Experts container with dynamic expert routing. 5.0 ships ENRN v5–v7 plus the v18-GQA distilled router.
Intelligent dreaming full orchestration. 5.0 ships the dream engine; an upcoming 5.0.x patch adds the cluster-wide cadence orchestration.
Cluster-update rollback automation. 5.0 has the rolling-update orchestrator; an upcoming 5.0.x patch adds automated rollback on health-check failure.
Remote-destination backup / DR. 5.0 ships local-destination snapshots; an upcoming 5.0.x patch adds the offsite path.
Customer-facing event-bus catalog. 5.0 webhooks are subscribable; an upcoming 5.0.x patch publishes the event catalogue so plugin authors know what they can hook.

All deferrals are documented on the features catalogue with WIP markers, and the customer-facing list with status sits on known issues.

Known at GA

First-week follow-ups.

The full open-items list lives on known issues. The condensed first-week scope — items the GA close caught for the 5.0.1 cycle:

AI papers knowledge base — sparse content. The ai-papers-at knowledge base imported from 4.x is thinner than the 4.x corpus; Hochreiter-authored papers in particular are sparsely represented. Re-import in flight; 5.0.1 carries the rebuilt corpus.
Model picker statistics — first-run warm-up. The most-used / recently-used badges in the model picker need one warm-up pass to populate. The chat shell handles this automatically as customers use models; admins who need pre-populated stats on day one can request the warm-up routine via support@eldric.ai.
Vector search — namespace count display. The "namespace count" shown next to a knowledge base in the Admin Console can be stale immediately after upload. Document and chunk counts are correct; the namespace count refreshes on the next aggregate cycle. Fix lands in 5.0.1.
Demo databases — opt-in registration. The demo ClickHouse and Postgres connectors aren't registered with the platform by default. Customers who want them available run the setup scripts in the operations guide; 5.0.1 wires them as default-enabled with a one-click toggle in the Admin Console.
CLI binary — environment variables. Locally-built command-line clients need ELDRIC_EDGE_URL and ELDRIC_API_KEY in the environment to reach a cluster. RPM-installed CLI reads configuration from /etc/eldric/cli.conf automatically.
Edge tenant theme & branding — public reachability. The per-tenant theme and branding endpoints (/api/v1/tenants/{id}/theme, /api/v1/tenants/{id}/branding/logo) aren't yet reachable through the public Edge gateway. Pro+ customers configuring custom branding call them from inside the cluster network for the 5.0 line; the Edge allowlist update lands in 5.0.1.

Eldric 5.0.
Generally available.

What 5.0.1 fixes.

What landed since GA.

One sentence.

The major lifts.

xLSTM structured-ML worker

IoT closed-loop bindings

Admin bundle export / import

Edge runtime

Memory subsystem

RAG with citations

Science

Other workers

Auto-update on macOS

Faster, smaller memory (preview)

Smart memory inference (preview)

Eldric for iPad

Agent Builder admin UI

Distilled router classifier (preview)

ARM64 minimal edge runtime — live

Industrial transports

Model badges in chat

Tool query info in chat

Agent Builder share, publish, edit

iPad Phase D — multi-window, keyboard shortcuts, drag-drop

xLSTM workload control plane

EMM Compressed migration CLI and per-namespace toggle

Cross-distro packaging — coming shortly after GA

What's deferred.

First-week follow-ups.

From 5.0-alpha to 5.0 GA.

Tiers and features.

Next.

Eldric 5.0.Generally available.

What 5.0.1 fixes.

What landed since GA.

One sentence.

The major lifts.

xLSTM structured-ML worker

IoT closed-loop bindings

Admin bundle export / import

Edge runtime

Memory subsystem

RAG with citations

Science

Other workers

Auto-update on macOS

Faster, smaller memory (preview)

Smart memory inference (preview)

Eldric for iPad

Agent Builder admin UI

Distilled router classifier (preview)

ARM64 minimal edge runtime — live

Industrial transports

Model badges in chat

Tool query info in chat

Agent Builder share, publish, edit

iPad Phase D — multi-window, keyboard shortcuts, drag-drop

xLSTM workload control plane

EMM Compressed migration CLI and per-namespace toggle

Cross-distro packaging — coming shortly after GA

What's deferred.

First-week follow-ups.

From 5.0-alpha to 5.0 GA.

Tiers and features.

Next.

Eldric 5.0.
Generally available.