Research index

This page groups the research-oriented documentation in `docs/src/architecture/` so that it is easier to discover and is not mistaken for the current shipped architecture.

Research classes

| Pattern | Typical status | Meaning |
| --- | --- | --- |
| `*-research-2026.md` | `research` | investigation, evidence gathering, constraints, and trade-offs |
| `*-findings-2026.md` | `research` | synthesized results or conclusions from a research wave |
| `*-implementation-plan-2026.md` | `roadmap` | ordered implementation proposal |
| `*-implementation-blueprint.md` | `roadmap` or `experimental` | intended technical design for a future or in-progress path |
| `planning-meta/*` | `current` process docs or `roadmap` planning docs | contributor planning governance, not public product narrative |
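
The pattern table above can be read as a lookup from a path to its typical status. The following is a minimal sketch of that lookup, not tooling that exists in the repository: the function name `typical_status`, the first-match-wins ordering, the shortened status strings, and the example file names are all assumptions made for illustration.

```python
from fnmatch import fnmatchcase
from typing import Optional

# Patterns and statuses copied from the table above.
# Assumption: the first matching pattern wins, with the directory rule checked first.
PATTERN_STATUS = [
    ("planning-meta/*", "current or roadmap"),
    ("*-research-2026.md", "research"),
    ("*-findings-2026.md", "research"),
    ("*-implementation-plan-2026.md", "roadmap"),
    ("*-implementation-blueprint.md", "roadmap or experimental"),
]


def typical_status(rel_path: str) -> Optional[str]:
    """Return the typical status for a path relative to docs/src/architecture/."""
    for pattern, status in PATTERN_STATUS:
        # fnmatchcase is case-sensitive on every platform.
        if fnmatchcase(rel_path, pattern):
            return status
    return None  # not a research-class page under these patterns


if __name__ == "__main__":
    # Hypothetical file names, used only to exercise the patterns.
    for name in ("example-research-2026.md", "planning-meta/example.md", "readme.md"):
        print(name, "->", typical_status(name))
```

The result is only the typical status suggested by the name; the page itself still has to state what it is.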

Pipeline and corpus SSOT (implementation)

Vox source → Mens pipeline SSOT — single map from .vox on disk to Mens training inputs (lexer vs HF tokenizer).
Populi data pipeline — disambiguates mesh runtime data from training JSONL.

Corpus lab, vision, and Qwen family (research, April 2026)

Vox corpus lab: mass examples, metrics, and eval harness (research 2026) — Tier A/B/C layout, compiler lanes vs golden parity, Syntax-K and WebIR aggregates, optional UI and vision rubrics, Mens validate-batch integration sketch.
Mens vision and multimodal inputs (research 2026) — TrainingPair limits, orchestrator hints vs attachments, screenshot-to-JSON pipeline, Candle text-only vs remote VLMs.
Mens Qwen family migration and native stack (research 2026) — Qwen2 vs Qwen3.5 retention tiers, operator runbook vs code removal, external QwenLM and Hugging Face references.
GUI, v0/islands, vision, and Mens Qwen — virtuous-cycle implementation plan (2026) — 50+ tracked ideas with repo anchors: WebIR, vox island, Playwright/MCP screenshots, orchestrator vision, Mens Qwen3.5 text vs optional VL rubric lane, execution waves W0–W5.
Orchestrator attachment_manifest RFC (2026) — MIME+hash task attachments and vision routing without substring-only hints (spec ahead of types).

Labeling rule

If a page is primarily research or a roadmap, say so in the title, frontmatter, or first paragraph. Do not rely on filenames alone.
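
A check for this rule could look like the sketch below. It is an illustration only: the function name `is_labeled`, the idea of passing frontmatter as a plain dictionary, and the choice to look for the literal words "research" or "roadmap" are assumptions, not an existing lint in the repository.

```python
import re

# Assumption: a page counts as labeled when "research" or "roadmap"
# appears as a whole word in one of the three places the rule names.
LABEL = re.compile(r"\b(research|roadmap)\b", re.IGNORECASE)


def is_labeled(title: str, frontmatter: dict, first_paragraph: str) -> bool:
    """Return True if the title, any frontmatter value, or the first paragraph names the label."""
    fields = [title, first_paragraph, *frontmatter.values()]
    return any(LABEL.search(str(field)) for field in fields)


if __name__ == "__main__":
    # Hypothetical pages, used only to exercise the check.
    print(is_labeled("Example topic (research 2026)", {}, ""))
    print(is_labeled("Example topic", {"status": "current"}, "Describes shipped behavior."))
```

The file name is deliberately not an input, which mirrors the rule that filenames alone are not enough.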
