"Research index"

Research index

This page groups the research-oriented documentation in docs/src/architecture/ so it is easier to discover without mistaking it for the current shipped architecture.

Research classes

PatternTypical statusMeaning
*-research-2026.mdresearchinvestigation, evidence gathering, constraints, and trade-offs
*-findings-2026.mdresearchsynthesized results or conclusions from a research wave
*-implementation-plan-2026.mdroadmapordered implementation proposal
*-implementation-blueprint.mdroadmap or experimentalintended technical design for a future or in-progress path
planning-meta/*current process docs or roadmap planning docscontributor planning governance, not public product narrative

Pipeline and corpus SSOT (implementation)

Corpus lab, vision, and Qwen family (research, April 2026)

Suggested reading paths

Deep Research Clusters (April 2026)

LLM Hallucination & Type System Impact (Wave 1)

Continual Learning & Flywheel Risks (Wave 2)

GRPO Reward Shaping for Code LLMs (Wave 3)

AI Agent Context and Handoff Continuity (Wave 4)

Autonomous Research Localization & MENS Research Lane (Wave 6)

Scientia distribution, discovery, and publication surfaces

  • SCIENTIA multi-platform ranking, discovery, and anti-slop SSOT (research 2026) — Tiered citations for social and scholarly ranking surfaces; ingest vs syndicate posture; manifest-centered projection profiles; operator KPI sketches for signal vs noise. Complements external discovery and impact / readership.
  • Syndication Ecosystem & Multi-Platform Publishing Research 2026 — Analysis and adoption strategy for third-party Rust SDKs (atrium, megalodon, twapi-v2) to reduce maintenance burden and eliminate manual reqwest manipulation for social publishing channels.
  • Scientia Community Publishing Playbook 2026 — Operational playbook for multi-platform community management with minimal overhead. Covers Discord webhook setup, Reddit OAuth + anti-spam rules, GitHub Discussions GraphQL API, vox-publisher data model extension requirements, Clavis secret registration needs, and subreddit policy pack templates. Companion to the multi-platform ranking research above.
  • 🔬 Scientia Publication Endpoints — Ground-Truth Research & Implementation Policy (April 2026) — v2. Comprehensive code audit + web research across all 18 publication targets. Adds: ResearchGate full policy (no API exists; passive via DOI; do not implement), ORCID member API (highest-leverage new scholarly target), Figshare REST API (datasets/supplementary). Corrects v1 errors: Reddit User-Agent WAS correct; social_retry.rs has zero call sites (dead code); bluesky/mastodon/discord/linkedin are absent from switching.rs allowlist and retry infrastructure. Defines formal implementation policy: channel classification taxonomy (ActivePush/ScholarlyDeposit/ManualAssist/PassiveDiscovery/Deferred), gate requirements per class, 13-column hallucination inventory, and 8-wave task backlog with ~50 EP-NNN gap IDs. Last verified: 2026-04-13.

Multi-Repository Context Isolation (Wave 5)

  • Multi-repo context isolation: research findings 2026.voxignore SSOT policy, scope guard architecture, agent instruction file hierarchy, IDE workspace isolation, Git worktree patterns, security threats (IDPI, slopsquatting, scope escalation), context engineering guidelines, monorepo/polyrepo AI-readiness analysis, and vox repo init scaffold specification. Directly actionable: gaps table, implementation priorities, and cross-references to cross-repo-query-observability.md and context-management-research-findings-2026.md.

Independent Deep Research Tracks

  • Agent Trust Reliability Evaluation
  • AI Plan Adequacy Heuristics
  • AI-Augmented Testing & Hourglass Architecture Research
  • Compiler Testing Research
  • Multi-Agent Mesh Economics
  • Grammar-Constrained Decoding for Code LLMs
  • LLM Output Mediation and Programmatic Validator Generation — Proposes a unified LlmMediator<T> architecture connecting vox-constrained-gen (Tier 1), vox-jsonschema-util (Tier 2), Socrates confidence (Tier 3), and the trust layer into a single composable seam. Covers dynamic finite-response-set schema derivation, MCP reduction strategy, RLVR training alignment, and a four-wave implementation roadmap. Cross-references grammar-constrained decoding, trust reliability, HITL doubt loop, and capability registry.
  • Clavis as a one-stop secrets manager: research findings 2026 — Comprehensive gap analysis for evolving Vox Clavis into a full-lifecycle secrets management platform. Covers: complete env-var taxonomy across 9 secret classes, user-facing feature requirements, OWASP NHI Top 10 alignment, AI-agent credential isolation boundaries, MCP OAuth 2.1 target model, A2A credential delegation via RFC 8693 Token Exchange, runtime secret redaction pipeline, KEK/DEK envelope encryption model, competitive feature gap table vs. Doppler/Infisical/Pulumi ESC/Vault. Extends clavis-secrets-env-research-2026.md.
  • Clavis V2: Full Implementation Plan (2026) — Codebase-verified, code-grounded implementation plan for the full Clavis V2 platform. Anchored in the live codebase (spec.rs, vox_vault.rs, resolver.rs, clavis.rs CLI). Defines: single canonical data structure for all ~580 secrets (TaxonomyClass + LifecycleMeta + scope_description on SecretSpec, 3 new ResolutionStatus variants, 4 new SecretMaterialKind variants); 4 new VoxDB tables (version history, audit log, profile overrides, A2A delegations); updated write path with atomic multi-table transactions; 12 new/updated CLI subcommands (set-secret, rotate, rollback, history, list, diff, run, audit-log, delegate, revoke-delegation); runtime secret scrubber (redact.rs + aho-corasick); consumer wiring for all 8 platform crates; 8-wave execution plan with verification steps per wave; 5 new security invariants extending the V1 threat model.
  • Cryptography Research Findings 2026 — ZIG/AEGIS eradication and AES performance evaluation.

Documentation

Packaging and portability

Language and architecture direction

Hygiene and maintenance

  • Dependency Sprawl Audit and Resolution (2026) — Records the workspace-wide audit of sprawling Cargo dependencies, centralization into the root [workspace.dependencies], and implementation of TOESTUB CI-CD enforcement rules.

Agentic planning and orchestration

SCIENTIA novelty / publication ledger (contracts)

  • Finding-candidate and novelty-evidence v1 JSON Schemas live under contracts/scientia/ (finding-candidate.v1.schema.json, novelty-evidence-bundle.v1.schema.json); example fixtures under contracts/reports/scientia-*.example.v1.json. CI: vox ci scientia-novelty-ledger-contracts (also nested in vox ci ssot-drift). CLI spot-check: vox scientia finding-candidate-validate, vox scientia novelty-evidence-bundle-validate.
  • 🔴 PRIMARY IMPLEMENTATION SSOT (use this for all implementation work): scientia-pipeline-ssot-2026.md — unified inbound + outbound gap remediation specification. Code-verified against real sources. 28 implementation tasks (G1–G28) organized into 9 dependency-ordered execution groups. Includes canonical data model, DB schema changes, env var registry, Clavis secret registry, and LLM-executor verification ritual. Supersedes gap analysis and wave playbook for implementation decisions.
  • Impact / readership / citation-adjacent signals (research seed): scientia-impact-readership-research-2026.md and tunable weights in contracts/scientia/impact-readership-projection.seed.v1.yaml (orthogonal to novelty; no default publish gate).
  • Multi-platform ranking, discovery, and anti-slop SSOT (research 2026): scientia-multi-platform-ranking-discovery-research-2026.md — social and scholarly feed mechanics (tiered sources), ingest vs syndicate, projection profiles, anti-slop metrics; bridges outbound vox-publisher syndication and inbound external discovery.
  • Publication-worthiness + SSOT unification research plan: scientia-publication-worthiness-ssot-unification-research-2026.md (standards-to-signals matrix, canonical metadata graph proposal, detection calibration protocol, Codex research snapshot persistence blueprint, automation boundary ledger).
  • Implementation wave playbook (historical context): scientia-implementation-wave-playbook-2026.md (232-task execution map, wave outputs, first-30 lock order, and contract inventory).
  • Comprehensive gap analysis (historical context): scientia-gap-analysis-2026.md — 45 identified problems with solutions, severity ratings, and a 7-wave execution order.
  • Scientia Worthiness × Socrates Unification (research 2026): scientia-socrates-unification-research-2026.md — deep structural analysis of isomorphisms between the Worthiness publication gate and the Socrates real-time confidence protocol. 38+ integration ideas organized into 8 themes (shared numeric language, inbound pipeline, A2A communication, MENS training, etc.), explicit separation-of-concerns boundaries, risk map, and wave-gated implementation roadmap.
  • Scientia Publisher & Orchestrator Hardening Plan (roadmap 2026): scientia-publisher-hardening-implementation-plan-2026.md — ordered execution plan for de-factoring God Objects across vox-publisher, vox-orchestrator, and vox-cli to adhere to the 500-line TOESTUB policy.
  • 🔴 PRIMARY IMPLEMENTATION TASK LIST v2 (use this to execute work): scientia-publication-pipeline-implementation-plan-2026.md — 31 explicit tasks (T-001 to T-031) across 8 waves. v2 corrects 13 factual errors from v1 including: Bluesky XRPC URL had wrong method path AND wrong request field conflation; SyndicationResult already had bluesky/mastodon/linkedin/discord fields; social_retry was already wired (not dead code); Zenodo adapter is fully complete (564L, create+upload+publish+retry); Mastodon API accepts JSON body; Discord resolves its own Clavis webhook; LinkedIn REST endpoint is /rest/posts not /v2/posts; all four social Clavis SecretIds already exist. Includes exact Rust code patterns, per-task verification commands, wave-gated dependency ordering, and a permanent Do-Not-Implement registry.

Labeling rule

If a page is primarily research or a roadmap, say so in the title, frontmatter, or first paragraph. Do not rely on filenames alone.