The xlstmd daemon hosts production xLSTM workloads — policy execution, time-series forecasting, vision encoding, associative retrieval — at predictable latency and memory cost.
Transformers dominate text generation, but their O(n²) attention cost becomes a bottleneck for long sequences, real-time control loops, and any workload where memory budget matters more than parameter count. The xLSTM family (matrix-memory mLSTM and scalar-memory sLSTM blocks) restores linear-time recurrence with modern training tricks, which makes it a fit for control policies, multi-horizon forecasts, and dense retrieval at scale.
The xlstmd daemon (port 8884) hosts these workloads behind a stable HTTP surface so customers can call them from the chat shell, agentic flows, or external integrations — without managing the Python sidecar lifecycle directly.
Run a trained xLSTM policy on streaming observations and emit actions. Suited to robotics, industrial control loops, and autonomous decision-making.
Endpoint: POST /api/v1/xlstm/policy/step
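As a rough illustration of calling the policy endpoint, the sketch below builds (but does not send) one step request using Python's standard library. Only the port and the path come from this page. The host name, the JSON field names `session_id` and `observation`, and the helper `build_policy_step_request` are assumptions made for the example and are not a documented request schema.

```python
import json
import urllib.request

# Assumption: the daemon is reachable on localhost. Only the port (8884)
# and the endpoint path are taken from the documentation.
BASE_URL = "http://localhost:8884"
POLICY_STEP_PATH = "/api/v1/xlstm/policy/step"


def build_policy_step_request(observation, session_id, base_url=BASE_URL):
    """Build (without sending) a POST request carrying one observation."""
    # Hypothetical body: "session_id" and "observation" are illustrative
    # field names, not a confirmed schema.
    body = json.dumps(
        {"session_id": session_id, "observation": list(observation)}
    ).encode("utf-8")
    return urllib.request.Request(
        url=base_url + POLICY_STEP_PATH,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_policy_step_request([0.12, -0.40, 1.05], session_id="arm-7")
print(request.get_method(), request.full_url)
```

In a control loop, the caller would build one such request per observation and send it with `urllib.request.urlopen(request, timeout=...)`, choosing a timeout that matches the loop's latency budget.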
Multi-horizon forecasts on telemetry streams — sensor data, financial series, OEE metrics. Returns mean + quantile bands per horizon.
Endpoint: POST /api/v1/xlstm/forecast
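To make "mean + quantile bands per horizon" concrete, here is a minimal sketch of how a client might parse and sanity-check such a response. The response layout (`horizons`, `horizon`, `mean`, `quantiles`), the `HorizonForecast` type, and the sample numbers are all hypothetical and chosen for illustration. The actual response format is not specified on this page.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class HorizonForecast:
    horizon: int                   # steps ahead
    mean: float                    # point forecast
    quantiles: Dict[float, float]  # quantile level -> forecast value


def parse_forecast(payload: dict) -> List[HorizonForecast]:
    """Parse a (hypothetical) forecast response and reject crossing bands."""
    forecasts = []
    for item in payload["horizons"]:
        # JSON object keys are strings, so convert quantile levels to floats.
        quantiles = {float(k): float(v) for k, v in item["quantiles"].items()}
        # Values must be non-decreasing as the quantile level increases.
        ordered = [quantiles[level] for level in sorted(quantiles)]
        if any(lo > hi for lo, hi in zip(ordered, ordered[1:])):
            raise ValueError(f"quantile bands cross at horizon {item['horizon']}")
        forecasts.append(
            HorizonForecast(int(item["horizon"]), float(item["mean"]), quantiles)
        )
    return forecasts


# Made-up sample data, for illustration only.
sample = {
    "horizons": [
        {"horizon": 1, "mean": 71.2, "quantiles": {"0.1": 69.8, "0.9": 72.9}},
        {"horizon": 6, "mean": 70.4, "quantiles": {"0.1": 66.1, "0.9": 74.0}},
    ]
}

for forecast in parse_forecast(sample):
    width = forecast.quantiles[0.9] - forecast.quantiles[0.1]
    print(f"h={forecast.horizon}: mean={forecast.mean}, band width={width:.1f}")
```

Checking that the bands do not cross is a cheap client-side guard: a band that narrows to a negative width usually signals a malformed response rather than a meaningful forecast.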
Frame-by-frame embedding of image / video streams. Pairs with the media worker (port 8894) for pipeline-encoded video and the data worker (port 8892) for embedded-frame retrieval.
Endpoint: POST /api/v1/xlstm/encode
High-recall key-value lookup against a learned associative memory. Complements the standard vector store with O(1) recall on exact keys.
Endpoint: POST /api/v1/xlstm/retrieve
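For intuition about why an associative memory can answer exact-key lookups at a cost that does not depend on how many pairs are stored, the sketch below implements a classical outer-product (linear) associative memory in plain Python. This is a textbook technique shown for illustration, not the daemon's actual implementation. The class name `OuterProductMemory` and its methods are hypothetical.

```python
from typing import List

Vector = List[float]


class OuterProductMemory:
    """Linear associative memory: the matrix holds the sum of value * key^T."""

    def __init__(self, key_dim: int, value_dim: int) -> None:
        self.key_dim = key_dim
        self.value_dim = value_dim
        # Fixed-size state: value_dim x key_dim, regardless of write count.
        self.matrix = [[0.0] * key_dim for _ in range(value_dim)]

    def write(self, key: Vector, value: Vector) -> None:
        """Add the outer product of value and key to the memory matrix."""
        if len(key) != self.key_dim or len(value) != self.value_dim:
            raise ValueError("key or value has the wrong dimension")
        for r in range(self.value_dim):
            for c in range(self.key_dim):
                self.matrix[r][c] += value[r] * key[c]

    def read(self, query: Vector) -> Vector:
        """Return matrix @ query; the cost depends only on the dimensions."""
        if len(query) != self.key_dim:
            raise ValueError("query has the wrong dimension")
        return [
            sum(row[c] * query[c] for c in range(self.key_dim))
            for row in self.matrix
        ]


memory = OuterProductMemory(key_dim=3, value_dim=2)
memory.write([1.0, 0.0, 0.0], [2.0, 5.0])   # one-hot keys are orthonormal
memory.write([0.0, 1.0, 0.0], [-1.0, 3.0])
print(memory.read([1.0, 0.0, 0.0]))  # [2.0, 5.0]
```

Recall is exact here only because the keys are orthonormal. With correlated keys, stored values interfere with one another, which is one reason practical systems learn the key projections rather than using raw inputs.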
eldric-xlstmd on port 8884. Registers with the controller at startup; routed to by the request router (port 8881) when intent classifies as xLSTM-shaped (forecast / control / encode / retrieve).
xlstm-models namespace. Pull or replace via the standard model-management API.
The four endpoints are live in 5.0. The Python sidecar ships in the standard eldric-aios RPM (and the CUDA-enabled variant); no separate install is required. License-tier limits and Pro-only advanced features (custom architecture training, multi-tenant policy isolation) finalize during GA.
Coming in 5.0.x: voice over IP integration (real-time bidirectional streams), inline CAD viewer for STEP / IGES files, additional model architectures in the policy / encoder slots.