Deployment

Docker-based deployment, sizing, model configuration and operational practices.

Vera ships as a set of Docker images deployed in EU-region infrastructure — managed by Nex0 in our standard offering, or into the customer's own environment where data-sovereignty requirements demand it.

Services

Service	Role
`console`	The operator web console (Next.js).
`engine`	Reasoning API: language layer, solver orchestration, agent runtime.
`mcp`	The MCP server surface.
`neo4j`	Knowledge-graph memory.
`postgres`	Relational store: audit log, rule versions, users, configuration.

A reference docker-compose configuration covers single-site deployments; the services are stateless apart from the two stores and run on Kubernetes without modification for multi-site operators.

Sizing

No GPU is required at any tier.

CPU: 4–8 vCPU
Memory: 16–32 GB RAM
Storage: 50–100 GB (audit history dominates growth)
Inference: under 10,000 model calls per month for a typical site — the solver, not the model, does the heavy lifting

Model configuration

The language layer targets any OpenAI-compatible endpoint and is configured per deployment:

Hosted API model — fastest to start; only prompt text leaves the deployment.
Self-hosted open-weight model — Mistral-family or Teuken-class models via Ollama or vLLM, for full sovereignty. Requires its own inference hardware, separate from the sizing above.

Cutover between models is a configuration change gated on one condition: the candidate model must pass the extraction evaluation suite at parity or better. Decision integrity is unaffected by model choice — verification always happens in the solver.

Environments and the public demo

The public demonstration environment at vera.nex0.tech is the operator console running against a dedicated field-service demonstration workspace. It exists so evaluators can exercise the real pipeline — rule parsing, solver verdicts, unsat cores, audit traces — without touching customer workspaces (demo credentials are available on request). It is not a production deployment and carries no service levels.

Operational practices

Backups: nightly snapshots of both stores; audit log is append-only.
Upgrades: rolling, with rule-set and schema migrations applied transactionally — a failed migration rolls back, never half-applies.
Monitoring: structured logs from every service; solver latency and schema-validation failure rate are the two health signals worth alerting on.

For security architecture, data residency and the disclosure process, see Security.

← MCP server Security & data governance →