Hosted recall demo not yet public
app.griff.run/ does not yet front a public recall console. WEB-2 focus group identified this as the highest-leverage missing artifact. The current /demo route shows MASTER ATC custody theater, not memory recall.
Tracked in: WEB-2 synthesis + Sprint 7 design (S7-T002 hosted demo console with abuse controls)
/openapi.json behind CF Access
Public OpenAPI spec is currently gated by Cloudflare Access on memory.griff.run. Developer evaluators (Marcus, Bo, Priya in WEB-2) flagged this as a show-stopper.
Tracked in: WEB-2 Theme 5 — fold into Sprint 7 S7-T002 as a 30-minute add
Pliny red-team battery not yet measured end-to-end
3 of 14 fixtures authored (classes 1-3 wrapper-skip / TOCTOU / prompt-inject-judge). Broker enforcement venues mapped per S19b §2 — 8 broker-primary MUSTs, 2 delegated (class #4 keyring hardening, class #10 classifier mislabel via eval-harness). SHOULDs #11-14 are S22 / S18 surface, not broker. No measured run yet.
Tracked in: Sprint S19b (brain-S19b-mcp-host-broker-design.md) + N18 fixtures audit
Real Execution Kernel: design only, sandbox primitive proven but not wired
P0c spike PASSED all 3 attacks (Job Object + cleared env + deny-DACL + WFP firewall, 2.6 ms spawn). Production integration deferred per V-6 Sprint 7 drop list — BD12 sandbox primitive integration is 6-9h of internal hygiene that doesn't produce customer surface; parked to S8 with ADR-007 deferred. ADR-006 dispatcher stub is what shipped for RC1.
Tracked in: phase0-P0c-sandbox-spike.md + V-6 Sprint 7 review §2.5 drops
Section 889 / Section 508 compliance: scaffolded, federal-conditional
S6-T003 shipped a 480 LOC scanner + CI workflow + sample.json + 4 tests (commit 8e8266b fixture exclude fix). Per Plan v2 §4 A10, full federal-customer-conditional checks are deferred unless a federal warm-intro materializes. NDAA 889 attestation generator is design-only beyond the scaffold.
Tracked in: rc1-rush-R3-section-889-shipped.md + master plan §4 A10 deferral
Memory federation cross-machine: post-RC1
Brain Plan v2 broker is single-host (S19b §0 explicit non-goal). Fleet federation across JWGH02 / GRIFFIN / JWGH03 is post-RC1. Memory recall today works cross-session on a single host via memory.griff.run; multi-host coherence is the next layer.
Tracked in: brain-S19b-mcp-host-broker-design.md §0 non-goals
No published benchmarks vs mem0 / Letta / Anthropic built-in memory
Sam (journalist persona, WEB-2) flagged. Comparison harness not yet built; recall eval harness (A2) gated on P0d golden corpus lock.
Tracked in: WEB-2 divergent themes + Plan v2 A2 acceptance criterion
17 [d1]-marked tests failing on stale token (non-functional)
RC1 freeze snapshot: 275 passed / 17 failed / 4 skipped. All 17 failures are D1 [d1]-marked tests with stale D1_ATOMIC_BATCH_TOKEN env var vs the rotated worker secret. Non-functional — env-refresh closes them. Honest disclosure rather than test-suppression.
Tracked in: RC1-integrated-mvp-build-spec.md §'Locked RC1 P0 acceptance thresholds'