Skip to main content

Milestone 3.0 — Catalog-grounded AI architecture (decision record)

Status: Planned — applies to all EHX product surfaces, not a single technology (e.g. Kubernetes only).

Tracking: ehx-kb#12 · Epic: Phase 3 #6

Decision

The infrastructure assistant will not be a free-form “LLM writes infra from scratch” chat.

The target architecture is catalog-grounded AI orchestration:

LayerResponsibilitySource of truth
Conversation / reasoningUnderstand messy requests, ask clarifying questions, route to the right product pathLLM + prompt orchestration
RetrievalFind relevant packs, articles, suite playbooks, and variablesRAG over templates, modules, KB, suite narratives
GenerationProduce repeatable, reviewable artifactsDeterministic render engine (M2.3 / M3.2)
ValidationBlock or warn on unsafe output before exportSchema/policy checks (planned M3.6)
Product routingPricing, checkout, personas, plan entitlementsFirst-party routes + API (not hallucinated URLs)

Core differentiator (roadmap): AI plus deterministic templates — LLM intent with validated, repeatable outputs.

What is live today (preview only)

MilestoneShippedLimitation
M3.1Rule-based intent + suggestion linksNo LLM; ~8 keyword intents
M3.2Fixed multi-file bundles for 5 intentsStatic GENERATION_BUNDLES maps; starter v0.1 packs only
M2.3Single-file render + variablesManual file pick on /generate

These previews prove UX, API contracts, and analytics — they are not the final coverage model.

Product surfaces this architecture must cover

Generation and chat must eventually support every customer-facing catalog and suite, not only Kubernetes:

In-product catalogs (API-backed)

SurfaceCategories / sectionsNotes
Templates (/templates)terraform, kubernetes, helm, monitoring, edgeStarter packs v0.1.0
Modules (/modules)terraform, helm, monitoring, securityComposable packs v0.1.0
Knowledge base (/kb)best-practices, deployment, troubleshootingGuidance + checklists
Generate (/generate)templates + modules renderSingle-file export today

Suite offerings (marketing + evaluation narratives)

SuitePathDomain
EHX Launch/launchKubernetes deployment bundles
EHX Monitor/monitorObservability presets
EHX NodeOps/nodeopsWeb3 RPC, validators, chain ops
EHX Secure/securePosture, exposure, security presets

Commercial & onboarding flows

SurfacePathRole in chat
Get started / personas/get-startedRoute by role intent
Pricing/pricingPlan comparison, bundle mapping
Checkout/checkoutUpgrade path (stub today)
Plan & usage/account/planEntitlements after checkout

Phase 4 expansion (Web3 intelligence)

Ethereum, OP Stack, validator, and RPC templates must plug into the same orchestration model — not a separate chat product.

Planned Phase 3 milestones (after M3.2 preview)

MilestoneGoalKey deliverables
M3.3DashboardAuth, saved projects, export history
M3.4LLM orchestration + RAGModel provider wiring; retrieve from templates/modules/KB/suites; slot-filling (“environment?”, “ingress?”); llm_available: true
M3.5Composable catalog generationDynamic bundle assembly from catalog metadata (replace fixed intent maps); coverage matrix for all rows in this doc
M3.6Validation & safe exportTerraform/Helm/K8s schema checks; warnings before download; zip/multi-file export hardening

M3.4 answers what the user meant and what to retrieve.
M3.5 answers which files to compose across all services.
M3.6 answers is this safe to apply.

Anti-patterns (do not ship)

  • Raw LLM-generated Terraform/YAML with no catalog anchor
  • One-off hard-coded bundle per use case (does not scale to full catalog breadth)
  • Chat that invents pricing, checkout URLs, or entitlements
  • Kubernetes-only orchestration while NodeOps / Secure / Launch remain link-only

Acceptance criteria (Phase 3 complete)

  • User can describe a scenario in natural language for any catalog category or suite
  • Assistant retrieves relevant packs/articles (RAG) and asks for missing slots
  • Output is rendered from catalog files with applied variables
  • Bundle composition is metadata-driven, not a fixed Python dict per intent
  • Validation runs before export (minimum: syntax/schema warnings)
  • Suite flows (Launch, Monitor, NodeOps, Secure) have explicit generation or guided paths
  • GET /chat/capabilities reflects real coverage, not a static subset

Phase 4 preview (same orchestration model)

MilestoneShippedNotes
M4.1NodeOps packs v0.2.0 + web3_operations 8-file bundlemilestone-4-1-web3-templates.md
M4.2RPC health presets + web3_monitoring 8-file bundlemilestone-4-2-monitoring-intelligence.md
M4.3Security presets v0.2.0 + security_analysis 8-file bundlemilestone-4-3-security-analysis.md