API
Write
Read
Typed surface for mutations, traversals, graph algorithms, and lookups.
Most graph databases are built to hold one large graph. The Context Lake workload is the opposite: millions of smaller graphs, mostly cold, all temporal, all governed. Different workload, different runtime.
The Engine is shaped for the Context Lake workload from the data model up. Millions of graphs. Sparse activity per graph. High aggregate throughput. Governance applied independently to each.
Every architectural decision — tiered storage, in-memory adjacency, native ABAC, bi-temporal edges — follows from the workload.
Write
Read
Typed surface for mutations, traversals, graph algorithms, and lookups.
Mutations durably appended to the write-ahead log before acknowledgment.
Graph algorithms, traversals, and pattern matching run over compact in-memory structures.
Native to the substrate, not a layer bolted on. Every read and write is policy-gated for access and provenance; retention runs across the data lifecycle.
Hot graphs serve at memory speed. Inactive graphs are evicted to NVMe and object-store, and rehydrated in milliseconds.
Snapshots and WAL persist on a multi-AZ, highly durable object and document store.
Retrieval latency holds near-constant as the graph count grows. Current production runs sustain thousands of mutations and queries per second across millions of graphs — p50 latency unchanged from a thousand graphs to a million.
Three tiers. Hot graphs live in RAM at microsecond latency. Warm graphs sit on local NVMe. Cold graphs rest on object storage and rehydrate in milliseconds.
Cost tracks active graphs, not total graphs. A deployment with one million graphs and one percent of them hot pays for one percent of the memory.
Vector similarity, BM25, graph traversal, and pattern matching run over the same data. One query, one ranked answer — no separate retrieval stack to stitch together.
Hot graphs are held as adjacency lists and CSR matrices. Cache-friendly, deterministic layout, ready for matrix operations.
BFS, PageRank, pattern matching, path analysis, and temporal weighting run at microsecond latencies over those structures.
ABAC, multi-tenant isolation, customer key encryption, retention policies, audit, and provenance are properties of the substrate, not a layer above it. Every read and write is policy-gated.
Every edge carries four timestamps. Point-in-time queries, automatic invalidation, and temporal weighting follow from the data model.
Mutations are appended to a replicated, ordered write-ahead log before acknowledgment. Snapshots, indexes, and content persist on multi-AZ durable storage. Point-in-time recovery across the lifecycle.