I built a graph engine in CUE. Here's where it shines and where it breaks. #4288
quicue started this conversation in Show and tell
I've spent the last several months building a dependency graph engine
entirely in CUE. Not configuration management. Not Kubernetes. Typed
DAGs where unification does the analysis — critical path scheduling,
gap analysis, compliance validation, provenance tracing, access
policies, risk scoring, and about 70 other patterns.
It's published as a CUE module (`apercue.ca@v0`), has 5 worked
examples, a project scaffolder, and CI that validates everything. I
submitted use cases to a few W3C Community Groups — two merged
into KG-Construct so far. I wanted to share what I've learned pushing CUE into this
territory — what works beautifully, and where I hit real walls.
## Start here: a recipe as a dependency graph

Beef bourguignon. 17 steps — ingredients, prep, cooking — all typed
nodes with `depends_on` edges. Run it and you get:
Critical path: 205 minutes. The braise dominates (150 min). Prep steps
have up to 172 minutes of slack — so you can dice onions anytime in the
first 2.5 hours. All computed from the dependency structure at eval time.
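For concreteness, the node shape might look like this (a minimal sketch: `_steps` and `depends_on` come from the post; the step names, the `minutes` field, and the `@type` values are illustrative, not the full 17-step recipe):

```cue
package recipe

// Every step is a typed node with depends_on edges.
_steps: [Name=string]: {
	name:    Name
	minutes: int
	depends_on: [...string]
	"@type": [string]: true
}

_steps: {
	dice_onions: {minutes: 10, depends_on: [], "@type": {Prep: true}}
	sear_beef:   {minutes: 15, depends_on: [], "@type": {Cook: true}}
	braise: {
		minutes: 150
		depends_on: ["sear_beef", "dice_onions"]
		"@type": {Cook: true}
	}
	serve: {minutes: 5, depends_on: ["braise"], "@type": {Serve: true}}
}
```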
Same graph also answers: "are all ingredients present?" (gap analysis),
"do cook steps actually depend on something?" (compliance), "what's the
topology?" (layer grouping). One `_steps` struct, many projections.

## What the engine actually does
The core is a `#Graph` pattern. You give it typed resources with
`depends_on` edges, and it computes the topology: ancestor sets, depth,
layers. Then ~63 projection patterns consume that graph. Each takes
`Graph: #AnalyzableGraph` as input and produces a different analysis:

- `#CriticalPath`
- `#ComplianceCheck`
- `#GapAnalysis`
- `#ProvenanceTrace`
- `#ODRLPolicy`
- `#ValidationCredential`
- `#DCATCatalog`
- `#SinglePointsOfFailure`
- `#BlastRadius`
- `#ImpactQuery`
- `#GraphDiff`
- `#DriftReport`
- `#FederatedMerge`
- `#CycleDetector`
- `#MermaidDiagram`
- `#GraphvizDiagram`

Plus scheduling patterns, risk scoring, bootstrap planning, lifecycle
phases, type validation, schema alignment... 63 definitions across 13
files, about 4,000 lines of CUE total (core packages).
## Key CUE patterns

### Struct-as-set for types

Resources use structs for type membership instead of lists. Membership
is a field-presence check: `resource["@type"]["Seasoning"] != _|_`.
Unifying two type sets (`{Produce: true} & {Seasoning: true}`) just
works, and type binding is set intersection.
This is the foundation. Every pattern dispatches on `@type` field
presence. A resource with `{Dataset: true, Governed: true}` matches
a data catalog pattern (serves Dataset) AND a policy pattern (serves
Governed) simultaneously.
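A tiny self-contained illustration of the dispatch (resource names invented):

```cue
package types

resources: {
	carrot: "@type": {Produce: true, Seasoning: true}
	thyme:  "@type": {Seasoning: true}
	beef:   "@type": {Protein: true}
}

// Comprehension dispatching on field presence: a resource is picked
// up by every pattern whose type key it carries.
seasonings: {
	for name, r in resources if r["@type"]["Seasoning"] != _|_ {
		(name): true
	}
}
```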
### Transitive closure via recursive struct merge

This is the most CUE-specific thing in the project. Each node
accumulates its parents' ancestors through struct unification.
`[_]: true` constrains all values, duplicates unify cleanly
(`true & true = true`), and the result is that every node knows its
full transitive ancestry.
This is what makes impact analysis, critical path, and gap analysis
work — they're all cheap comprehensions over precomputed ancestor sets.
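Reconstructed from that description (a sketch, not the module's exact code), the diamond case looks like:

```cue
package graph

nodes: [string]: depends_on: [...string]
nodes: {
	d: depends_on: []
	a: depends_on: ["d"]
	b: depends_on: ["d"]
	c: depends_on: ["a", "b"]
}

// [_]: true constrains all values; duplicate entries unify cleanly.
_ancestors: [string]: [_]: true
_ancestors: {
	for name, n in nodes {
		(name): {
			for p in n.depends_on {
				(p): true
				// Recursive struct merge: pull in the parent's
				// ancestor set (recomputed per path, see below).
				_ancestors[p]
			}
		}
	}
}
```

Here `c` ends up with `{a: true, b: true, d: true}`, with `d` reached through both paths.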
### Comprehensions as projections

Every analysis pattern follows the same shape: comprehend over the
graph, emit a struct. Swap the body, get a different output. The
`#AnalyzableGraph` interface is what makes this composable — both
`#Graph` (full computation) and `#GraphLite` (with precomputed
topology) satisfy it, so all 75 patterns work with either.
### Charter system

A `#Charter` declares what a project needs to be complete.
`#GapAnalysis` unifies the charter against the actual graph and
reports which gates are satisfied, which resources are missing, and
which types aren't covered. If you `cue vet` a project that doesn't
satisfy its charter, it fails. Project completeness is a type check.
## Where CUE breaks
This is the part I think CUE contributors will care about most.
### No memoization on recursive struct references

The `_ancestors` computation? Beautiful on trees. Exponential on dense
diamond DAGs. If C depends on A and B, and both depend on D, then D's
ancestors get recomputed through both paths. Graph shape matters —
wide topologies (depth ~5-10) handle 60+ nodes natively, but dense
diamonds hit a wall around 35-40.
Workaround: precompute externally. A Python `toposort.py` does the
expensive parts and CUE consumes the result; `#GraphLite` skips
recursion entirely. It works, but it means the engine has a Python
dependency for full transitive closure on large dense graphs.
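The split, sketched with illustrative field names: `#GraphLite` accepts the closure as data instead of computing it:

```cue
package graph

// The expensive transitive closure is injected (e.g. as JSON emitted
// by toposort.py and unified in at eval time), not computed.
#GraphLite: {
	nodes: [string]: depends_on: [...string]
	ancestors: [string]: [_]: true
	...
}
```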
Most patterns — validation, depth, grouping, scheduling — scale to
1000+ nodes because they don't need transitive closure; only the full
`_ancestors` computation hits the shape-dependent boundary described
above. I run a real datacenter topology (69 nodes, wide tree) without
precomputation.
Question for CUE contributors: is there a path toward memoized
evaluation of recursive struct references? Even opt-in memoization
would eliminate the Python dependency entirely.
### Both `if` branches always evaluate

Both branches run regardless of which condition holds, so you can't
short-circuit the expensive path when precomputed data exists. I had
to create separate `#Graph` (with recursion) and `#GraphLite`
(without) to avoid paying for both.

### Comprehension-level vs body-level `if`

A comprehension-level `if` filters: non-matching elements are dropped
entirely. A body-level `if` doesn't remove the element; it yields an
empty struct for non-matches. The second form bit me many times. Once
you internalize this it's fine, but it's a real gotcha.
## The ecosystem

The graph engine (`apercue.ca`) is the generic layer. Other projects
import it as a CUE module.
Each downstream project declares domain-specific resources and types.
The patterns, projections, and analysis all come from the shared module.
## Try it

The 52-node governance example (`gc-llm-governance/`) is the stress
test — it uses precomputed topology and produces output across 8
different analysis dimensions.
### Import it

The module is published as `apercue.ca@v0`. The `scaffold.sh` tool
generates a starter project with the graph, charter, and compliance
patterns already wired up.
## External validation

The graph output happens to be valid JSON-LD — the engine maps CUE
field names to standard vocabulary terms via a `@context`. I submitted
use cases to a few W3C Community Groups working on knowledge graph
construction and data governance — one accepted so far.
I mention this not for the W3C angle but because it validates that
the CUE output is structurally conformant to external specifications.
The patterns produce real, standards-compliant data — not toy output.
github.com/quicue/apercue — Apache 2.0,
4,000+ lines of CUE, 75 pattern definitions across 20 files, 5 examples, CI, scaffolder.
Happy to dig into any of the patterns, performance workarounds, the
module publishing setup, or how the charter system works.