A private AI that runs on your hardware. No cloud required, no data leaves your network. From a personal assistant on your laptop to a full AI platform for your organization.
Install on your laptop or desktop. Takes 5 minutes.
Like ChatGPT, but running entirely on your computer. Ask questions, brainstorm ideas, get help writing emails or code — without sending a single word to the cloud. Your conversations stay yours.
Ollama, vLLM, llama.cpp, or 25+ other backends. Supports streaming responses, conversation history, and session export. Choose from models like Llama 3, Mistral, Qwen, DeepSeek, and more.

Point Eldric at a folder of PDFs, Word docs, or text files and ask questions in plain English. It reads, understands, and finds the answers — even across hundreds of documents.
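To make the ingestion step concrete, here is a minimal sketch of fixed-size chunking with overlap. The character-based strategy and the sizes are illustrative only, not Eldric's defaults:

```python
def chunk_text(text, size=200, overlap=40):
    """Split text into fixed-size character chunks with overlap.
    A toy stand-in for a configurable chunking strategy; the
    defaults here are invented for illustration."""
    if overlap >= size:
        raise ValueError("overlap must be smaller than chunk size")
    chunks = []
    step = size - overlap
    for start in range(0, len(text), step):
        chunks.append(text[start:start + size])
        if start + size >= len(text):
            break
    return chunks

doc = "word " * 100          # pretend this came from a PDF
pieces = chunk_text(doc, size=120, overlap=20)
# Each piece would then be embedded and indexed locally.
```

Overlap keeps a sentence that straddles a chunk boundary retrievable from either side, which is why most chunking strategies include it.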
Local vector stores (SQLite, FAISS, or ChromaDB). Documents are chunked, embedded, and indexed locally. Semantic search with configurable chunking strategies and embedding models via Ollama or OpenAI.

Get help with blog posts, reports, translations, cover letters, or social media content. Eldric understands context, matches your tone, and helps you overcome writer's block — all privately.
Prompt templates for repeatable tasks, session persistence for long-form writing, and export to multiple formats.

Speak to Eldric and hear it respond — like a voice assistant, but private. Great for hands-free work, accessibility, or when you just prefer talking over typing.
Whisper.cpp for speech-to-text, any LLM for reasoning, Piper TTS for natural speech output. All local, GPU-accelerated, streaming via SSE/WebSocket.

Set up a shared AI for your office or department.
Set up one Eldric server and give everyone in your office their own private AI assistant — through a simple web browser. No cloud subscriptions, no per-seat fees. Your team's conversations and data never leave your building.
A /chat endpoint with TLS, API key auth, and rate limiting. Multi-tenant isolation via Agent Worker. Supports OpenWebUI integration, SSO, and role-based access. Deploy on a single GPU server or scale across multiple machines.

Feed Eldric your handbooks, SOPs, product docs, and meeting notes. Now anyone on the team can ask questions and get accurate answers from your company's own knowledge — instantly.
FAISS, ChromaDB, or built-in SQLite vectors. Knowledge bases are isolated per team or department with configurable access.

Connect your email, WhatsApp, Teams, or SMS. Eldric reads incoming messages, classifies them, suggests replies, and can auto-respond to routine questions — with your approval.
IMAP/SMTP, WhatsApp Business API, Signal, Twilio SMS, Microsoft Teams, and XMPP. AI auto-response with approval queue. All messages stored locally with semantic search via Data Worker.

Turn your home into a smart, AI-powered space.
Eldric connects to your smart home devices — lights, thermostats, cameras, sensors — and lets you control everything with natural language. No cloud service needed, so it works even when the internet is down.
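As a toy illustration of turning a phrase into a device action, the sketch below maps a command to an MQTT publish. The topic scheme, room names, and keyword matching are all invented; in Eldric the LLM does the interpretation:

```python
def command_to_mqtt(utterance):
    """Map a simple natural-language command to an MQTT publish.
    Topic layout and rooms are invented for illustration."""
    words = utterance.lower().rstrip(".!").split()
    action = "ON" if "on" in words else "OFF" if "off" in words else None
    rooms = {"kitchen", "bedroom", "office"}           # assumed room list
    room = next((w for w in words if w in rooms), None)
    if action is None or room is None:
        return None                                    # defer to the LLM
    return {"topic": f"home/{room}/lights/set", "payload": action}

msg = command_to_mqtt("Turn off the kitchen lights")
# → {'topic': 'home/kitchen/lights/set', 'payload': 'OFF'}
```

Because the broker and devices are local, this loop keeps working when the internet connection does not.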
MQTT, Zigbee, Z-Wave, Modbus, and OPC UA. Supports device discovery, automation rules, and AI-powered scene management. Works with Home Assistant, Philips Hue, and hundreds of IoT devices. Dashboard at /dashboard.

Enterprise-scale AI infrastructure you fully control.
Access NASA satellite data, CERN particle physics, genome databases, climate data, drug interactions, and more — all from one interface. Eldric connects to 140+ scientific data sources so researchers can focus on discovery, not data wrangling.
Bioinformatics (BLAST, NCBI, UniProt), pharma (AlphaFold, ADMET, molecular docking), CRISPR guide design, LIMS sample tracking, quantum computing simulation, and 15+ space agency APIs. GLP and 21 CFR Part 11 compliance modes.

Define a goal and let a team of AI agents figure out how to accomplish it. One agent researches, another writes code, a third reviews it, and a fourth deploys. They coordinate automatically.
Start with a single machine and grow to hundreds of GPU nodes across data centers. Eldric handles load balancing, failover, multi-tenant isolation, and AI-powered routing — so your teams get fast, reliable AI no matter the load.
Edge (TLS/auth), Router (AI-powered load balancing), Workers (inference), Controller (orchestration). Supports 25+ backends including Ollama, vLLM, TGI, NVIDIA Triton, OpenAI, Anthropic. Enterprise licensing with hardware binding and multi-region deployment.

Fine-tune AI models on your company's data — legal documents, medical records, engineering specs, customer interactions. The result is an AI that speaks your language and knows your domain better than any generic model.
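To give a feel for what such a fine-tuning job involves, here is a hypothetical job description using typical LoRA hyperparameters. The field names below are illustrative, not Eldric's actual training-chain schema:

```python
def lora_job(base_model, dataset, rank=16, alpha=32):
    """Sketch of a LoRA fine-tuning job description.
    Schema and defaults are assumptions for illustration."""
    return {
        "backend": "unsloth",            # one of the supported engines
        "method": "lora",
        "base_model": base_model,
        "dataset": dataset,
        "lora": {"r": rank, "alpha": alpha, "dropout": 0.05,
                 "target_modules": ["q_proj", "v_proj"]},
        "train": {"epochs": 3, "lr": 2e-4,
                  "gradient_checkpointing": True},
    }

job = lora_job("llama3-8b", "contracts.jsonl")
```

LoRA trains small low-rank adapter matrices instead of all model weights, which is what makes fine-tuning on a single GPU practical.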
Unsloth, Axolotl, TRL, DeepSpeed, MLX, and llama.cpp. Methods include LoRA, QLoRA, SFT, DPO, RLHF, PPO, and full fine-tuning. Training chains with visual node-based configuration. Multi-GPU and distributed training support.

Automatically transcribe meetings, extract action items, detect speakers, and make all your audio and video content searchable. Process thousands of hours without sending anything to the cloud.
Whisper.cpp STT, Piper TTS, FFmpeg video processing. Speaker diarization, scene detection, frame extraction. All media indexed in Data Worker for semantic search across recordings. Streaming via SSE/WebSocket.

Build AI-powered apps with a platform that gets out of your way.
Eldric exposes an OpenAI-compatible API. If your app already talks to OpenAI, just change the URL and you're running locally. No code changes. Works with LangChain, LlamaIndex, AutoGen, and every OpenAI SDK.
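As an illustration of the "just change the URL" claim, the sketch below builds a standard OpenAI-style request against a local endpoint. The host, port, and API key are placeholders, not Eldric defaults:

```python
import json
import urllib.request

BASE_URL = "http://localhost:8880"   # assumed host/port for a local Eldric edge

def chat_completion_request(model, prompt, stream=False):
    """Build a standard OpenAI-style chat completion request.
    Only the base URL differs from a stock OpenAI integration."""
    body = {"model": model,
            "messages": [{"role": "user", "content": prompt}],
            "stream": stream}
    return urllib.request.Request(
        f"{BASE_URL}/v1/chat/completions",
        data=json.dumps(body).encode(),
        headers={"Content-Type": "application/json",
                 "Authorization": "Bearer YOUR_API_KEY"},
    )

req = chat_completion_request("llama3", "Hello!")
# urllib.request.urlopen(req) would send it; any OpenAI SDK works the
# same way once its base URL points at Eldric.
```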
/v1/chat/completions and /v1/models API. Streaming via SSE. Native tool calling pass-through. Supports Ollama, vLLM, TGI, llama.cpp, MLX, NVIDIA Triton, OpenAI, Anthropic, Groq, xAI, Together, and more. Switch backends without changing client code.

Ingest documents, generate embeddings, store vectors, and query with semantic search — all through REST APIs. Build knowledge-base chatbots, document Q&A, or context-aware assistants without managing separate infrastructure.
Local vector stores (SQLite, FAISS, ChromaDB). Agent Worker adds agentic RAG with ReAct pattern, query decomposition, and multi-agent execution. REST APIs for /api/v1/vector/* and /api/v1/agent/*. Per-tenant namespace isolation.

Use the DevToolbox to step through agent tool calls, test prompts interactively, profile token usage and latency, and generate test cases automatically. Stop guessing why your AI does what it does.
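A sketch of what calls against the vector endpoints above might carry. The /api/v1/vector/ prefix is from the docs; the exact paths and field names are assumptions:

```python
def ingest_request(namespace, doc_id, text):
    """Hypothetical body for an ingest call under /api/v1/vector/*.
    Path suffix and fields are assumptions for illustration."""
    return {"path": "/api/v1/vector/ingest",
            "body": {"namespace": namespace, "id": doc_id, "text": text}}

def query_request(namespace, question, top_k=5):
    """Hypothetical body for a semantic-search query."""
    return {"path": "/api/v1/vector/query",
            "body": {"namespace": namespace, "query": question,
                     "top_k": top_k}}

ingest = ingest_request("support-docs", "faq-001",
                        "Password resets require admin rights.")
query = query_request("support-docs", "Who can reset passwords?")
```

Keeping the namespace in every call is what gives each tenant its own isolated index.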
Eldric speaks the Model Context Protocol. Connect to any MCP server — databases, file systems, APIs, custom tools — and your AI can use them natively. Build once, use with any MCP-compatible client.
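A hypothetical registration of two MCP servers, modeled on common MCP client configs. The schema shown is an assumption, not Eldric's documented format; the server commands and URLs are invented:

```python
import json

# Hypothetical entries for the MCP server config file: one
# stdio-transport server, one SSE-transport server.
config = {
    "mcpServers": {
        "postgres": {
            "transport": "stdio",
            "command": "mcp-server-postgres",        # invented command
            "args": ["--dsn", "postgresql://localhost/mydb"],
        },
        "tickets": {
            "transport": "sse",
            "url": "http://localhost:9000/sse",      # invented URL
        },
    }
}
print(json.dumps(config, indent=2))
```

Once registered, each server's tools would show up next to native tools in agent workflows.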
Servers are configured in ~/.config/eldric/mcp_servers.json. Supports stdio and SSE transports. Tools from MCP servers appear alongside native tools in agent workflows.

Design, build, test, and deploy AI agents — without writing infrastructure code.
Describe what you need in plain English and the Agent Builder designs the architecture, generates the code, selects the right tools, and creates test cases. It's an AI that builds AI agents for you.
/agent-builder deploy.

Need 50 agents for different departments? The Agent Generator uses domain templates to produce consistent, production-ready agents instantly. Pick a domain, set constraints, generate. Deterministic, repeatable, fast.
/api/v1/agent-generator/*.

Combine 13 built-in agent types into coordinated swarms. An Explorer finds files, a Coder writes the fix, a Runner executes tests, and a Planner tracks the overall goal. Define the mission, agents handle the rest.
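A toy sketch of how such a mission might be decomposed across those agent types. The task strings and status fields are invented; real coordination happens over the inter-agent protocol, this only shows the shape:

```python
def plan_mission(goal):
    """Decompose a goal into role-tagged steps for a swarm.
    Roles match the built-in agent types named above; everything
    else is illustrative."""
    steps = [
        ("Explorer", f"locate files relevant to: {goal}"),
        ("Coder",    f"draft a fix for: {goal}"),
        ("Runner",   "execute the test suite against the draft"),
        ("Planner",  "review results and mark the goal done or retry"),
    ]
    return [{"agent": a, "task": t, "status": "pending"} for a, t in steps]

mission = plan_mission("failing login test")
```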
Goal coordination on port 8885; Agent Worker (port 8893) handles execution. UAP (Universal Agent Protocol) for inter-agent communication.

Extend Eldric with server-side Python plugins (tools, filters, pipes) and client-side JavaScript widgets. Add custom tools the LLM can call, pre/post-process messages, or create entirely new virtual model backends.
Tool (server-side, callable by LLM), Filter (inlet/outlet message processing), Pipe (virtual model backends), Action/Widget (client-side JS). Plugins run in Python subprocesses via JSON-RPC. Manifest + valves configuration.

Ship AI products on infrastructure you own. No cloud bills, no vendor lock-in.
Use Eldric as the backbone for your AI product. Multi-tenant isolation, per-customer API keys, usage metering, rate limiting, and a white-label web UI — everything you need to sell AI services without building infrastructure from scratch.
/v1/chat/completions API with streaming.

Offer Llama, Mistral, Qwen, DeepSeek, or any open model — plus route to OpenAI or Anthropic when customers need it. AI-powered routing picks the best model and worker for each request automatically.
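For intuition, two of the simpler routing modes can be sketched as plain selection rules. The worker fields below are invented, and the AI autonomous mode would consult an LLM rather than a fixed rule:

```python
def pick_worker(workers, mode="least_connections"):
    """Illustrative worker selection for two routing modes.
    Field names are assumptions for this sketch."""
    if mode == "least_connections":
        return min(workers, key=lambda w: w["active_connections"])
    if mode == "latency_based":
        return min(workers, key=lambda w: w["avg_latency_ms"])
    raise ValueError(f"unknown mode: {mode}")

workers = [
    {"name": "gpu-1", "active_connections": 12, "avg_latency_ms": 140},
    {"name": "gpu-2", "active_connections": 3,  "avg_latency_ms": 220},
]
busiest_spared = pick_worker(workers)                     # → gpu-2
fastest = pick_worker(workers, mode="latency_based")      # → gpu-1
```

Note the two modes can disagree, which is why the mode is a deployment choice rather than a fixed default.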
round_robin, least_connections, load_based, latency_based, and AI autonomous routing modes. AI routing uses an LLM to select workers based on model capabilities, load, and request characteristics. Workers support Ollama, vLLM, TGI, llama.cpp, MLX, Triton, and all major cloud APIs.

Use the Agent Builder and Generator to create domain-specific AI agents, package them with knowledge bases and safety guardrails, and deploy them for customers. Version, A/B test, and iterate — all on your infrastructure.
/agent-builder deploy. Per-tenant isolation, knowledge base binding, safety constraint templates.

Let your customers fine-tune models on their own data — through your platform. Training pipelines with visual configuration, automatic dataset preparation, and deployment to inference workers when done.
Unsloth, Axolotl, TRL, DeepSpeed, MLX, llama.cpp. Training chains with visual node-based config. Latent reasoning techniques (COCONUT, Quiet-STaR, Pause Tokens). Auto dataset preparation from knowledge bases via Agent Worker. Multi-GPU and distributed training.

Add AI services to your portfolio. Eldric scales from one rack to entire data centers.
Deploy Eldric across your GPU fleet. Workers auto-register with the controller, routers balance load across them, and the Edge provides a single HTTPS endpoint for all your customers. Add or remove nodes without downtime.
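A hypothetical shape for the auto-registration call a worker might make to the Controller. The ports come from the component list below, but the path and field names are assumptions for illustration:

```python
def register_worker(host, backend, models):
    """Build a hypothetical worker-registration payload for the
    Controller. Only the ports are from the docs; the endpoint
    path and body schema are invented."""
    return {
        "url": "http://controller:8880/api/v1/workers/register",
        "body": {"host": host, "port": 8890, "backend": backend,
                 "models": models, "status": "ready"},
    }

reg = register_worker("gpu-node-7", "vllm", ["llama3-70b"])
# A router would then start including gpu-node-7 in its balancing pool.
```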
Controller (port 8880) manages the cluster, Router (port 8881) balances load, Workers (port 8890) run inference. SSH-based deployment via Controller API. Enterprise licensing with hardware binding, per-backend worker limits, and multi-region support. Edge farm mode for horizontal gateway scaling.

Eldric's built-in licensing system lets you gate features by tier, enforce node limits, bind to hardware, and track usage. Integrate with your billing system or use the license server directly.
License server at octopus.core.at with a REST API. Feature gating across all workers (science, training, media, comm, agent).

Purpose-built for scientific computing, research, and industrial applications. 140+ science APIs, distributed inference, xLSTM models.
Eldric is free for personal use. Download it now and have your own AI running in 5 minutes.